view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14, 2024 • 65
huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2 Text Generation • Updated 10 days ago • 1.7k • 36
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 5 days ago • 83
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 5 days ago • 45
OnDeviceMedNotes/synthetic-medical-conversations-deepseek-v3 Viewer • Updated 6 days ago • 143k • 181 • 28
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 20 days ago • 132
Running on Zero 1.29k 🌍 Chat With Janus-Pro-7B A unified multimodal understanding and generation model.