MedIT Solutions


Recent Activity

meditsolutions's activity

mkurman 
posted an update 4 days ago
Blurred-Thoughts Supervised Fine-Tuning (BT-SFT) 🤖

Can we teach a model to think completely on its own without reinforcement learning? Actually, yes.

We can do straightforward supervised fine-tuning using a relatively simple trick: blurring part of the chain-of-thought (CoT). But why is this effective?

We observed that various models differ in their thinking processes, and fine-tuning one model on another model’s thoughts (CoT) can sometimes be inefficient—often resulting in the model simply memorizing reasoning rather than learning how to actually think.

I discovered that this process can still be efficient if we clearly mark where the model should start and stop thinking, reveal only part of the CoT along with the expected answer, and blur the remaining part of the CoT. This approach lets the model learn only a portion of the thought process while still arriving at the expected answer.
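For the curious, here is a minimal sketch of how such blurring could look in practice. The contiguous-chunk masking, the 50% blur ratio, and the -100 ignore index are my assumptions about one possible implementation, not the exact BT-SFT recipe:

```python
import random

IGNORE_INDEX = -100  # label value ignored by the cross-entropy loss in HF-style trainers

def blur_cot_labels(input_ids, cot_start, cot_end, blur_ratio=0.5):
    """Return labels where a random chunk of the CoT span is excluded from the loss.

    input_ids          : list[int] with prompt + thinking tokens + answer tokens
    cot_start, cot_end : token indices delimiting the CoT span
    blur_ratio         : fraction of CoT tokens to "blur" (hide from the loss)
    """
    labels = list(input_ids)
    cot_len = cot_end - cot_start
    n_blur = int(cot_len * blur_ratio)
    if n_blur > 0:
        # hide a random contiguous chunk of the thought process
        start = cot_start + random.randint(0, cot_len - n_blur)
        for i in range(start, start + n_blur):
            labels[i] = IGNORE_INDEX
    return labels
```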

To demonstrate this, you can check out my experimental BT-SFT of the meditsolutions/Llama-3.2-SUN-2.5B-chat model, which was fine-tuned on 151 million tokens from the Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B dataset.

Enjoy! 🚀

PS. If you were curious enough to read this, leave me a comment. It's always nice to chat with open-minded and intelligent ppl.
mkurman 
posted an update 5 days ago
Ok, my 14B DeepSeek R1 merge with Qwen2.5 1M is really hot right now—it's got 2.6k downloads! It's sitting pretty as the top trending model on the third page. 🔥

Check it out if you haven't already!
mkurman/Qwen2.5-14B-DeepSeek-R1-1M
mkurman 
posted an update 8 days ago
I’ve simplified things for the AI OS community!

Check out Qwen-2.5-14B-DeepSeek-R1-1M! This one's a cool blend of the latest Qwen 2.5 with 14 billion parameters and a massive 1 million token context window, merged with the DeepSeek R1 version of the Qwen 2.5 14B base model.

Enjoy! 🚀

mkurman/Qwen2.5-14B-DeepSeek-R1-1M
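If you'd like to try it locally, something along these lines should work with transformers (the prompt and sampling settings below are just illustrative assumptions):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mkurman/Qwen2.5-14B-DeepSeek-R1-1M"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

messages = [{"role": "user", "content": "Explain why the sky is blue, step by step."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# sampling settings are illustrative, not tuned recommendations
output = model.generate(inputs, max_new_tokens=512, do_sample=True, temperature=0.6)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```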
mkurman 
posted an update 17 days ago
mkurman 
posted an update 30 days ago
I kindly invite you to try my experimental Llama 3.2 3B with o1-like thinking.

It uses Thoughts only when needed, so don't be surprised if they don't appear every time. It also has a minor bug that requires further fine-tuning (sometimes it starts with <|python_tag|> instead of <Thought>).

Enjoy!

Give some likes and whatever to make me feel better and motivated to keep going 😂

mkurman/llama-3.2-MEDIT-3B-o1
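A small post-processing sketch for the model's output: it strips the stray <|python_tag|> quirk mentioned above and separates the thought block from the final answer. The closing </Thought> tag and the regex are my assumptions based on the post, not a documented output format:

```python
import re

def split_thought_and_answer(text: str):
    """Split the model output into its thought block and the final answer."""
    # strip the known <|python_tag|> quirk if it appears at the start
    text = text.replace("<|python_tag|>", "", 1).lstrip()
    match = re.search(r"<Thought>(.*?)</Thought>", text, flags=re.DOTALL)
    if match is None:
        return "", text.strip()          # the model skipped explicit thinking
    thought = match.group(1).strip()
    answer = text[match.end():].strip()  # whatever follows the thought block
    return thought, answer
```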
mkurman 
posted an update 2 months ago
How Do I Contribute (HDIC)

Exciting times to come? We are working on a layer self-esteem technique that scores each layer's contribution to the final prediction. For now, it already unlocks a lot of knowledge stored in the weights that we couldn't get the model to extract through further fine-tuning!
mkurman 
posted an update 2 months ago
What AI-enhanced research tools would you recommend for searching and analyzing scientific papers?
mkurman 
posted an update 2 months ago
We built a new small language model, SmolLM2-MedIT-Upscale-2B, based on SmolLM2-1.7B-Instruct from Hugging Face. The premise was simple: increasing the vector size in the attention layers should positively impact the model's capabilities.

What did we prove?
In total, not much really, since we don't have an original model trained under the same conditions as our upscaled version. However...

1. We scaled up the model without losing its quality
2. We confirmed that the method we devised works
3. After extremely short fine-tuning, the model achieved much better results on IFEval than the original (53.68 for the original vs. 64.29 for ours) and a higher overall average score on the Open LLM Leaderboard (14.75 vs. 15.17)

I consider this a big success 😇, since surpassing the original in metrics is often very time-consuming, generates high costs, and doesn't always work out.

Meanwhile, we're moving forward, training SmolLM2 400M Instruct as an upscale of 136M.

We're curious about how increasing the base and intermediate vectors will affect the model's quality. We'll compare it to the original and the 360M Instruct version released by Hugging Face.

License: Apache 2.0

meditsolutions/SmolLM2-MedIT-Upscale-2B
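For anyone curious what increasing the vector size in attention layers can look like mechanically, here is a rough sketch of widening a single projection layer. The zero-initialization of the new rows and the helper name are my assumptions for illustration, not the exact MedIT upscaling procedure:

```python
import torch
import torch.nn as nn

def widen_linear(old: nn.Linear, new_out: int) -> nn.Linear:
    """Widen a projection from old.out_features to new_out output features.

    Original weights are copied; the newly added rows start at zero so the
    widened layer initially behaves like the original one.
    """
    new = nn.Linear(old.in_features, new_out, bias=old.bias is not None)
    with torch.no_grad():
        new.weight.zero_()
        new.weight[: old.out_features] = old.weight
        if old.bias is not None:
            new.bias.zero_()
            new.bias[: old.out_features] = old.bias
    return new
```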
mkurman 
posted an update 3 months ago