Deepseek V3 (All Versions) Collection Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 3 days ago • 21
mrm8488/ModernBERT-base-ft-fineweb-edu-annotations Text Classification • Updated 18 days ago • 404 • 11
mrm8488/ModernBERT-large-ft-fineweb-edu-annotations Text Classification • Updated 10 days ago • 27 • 3
Smol but mighty Collection A collection of smol but mighty models • 10 items • Updated 24 days ago • 4
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 23 days ago • 122
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 25 days ago • 121
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 4 days ago • 78