Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 3 items β’ Updated 25 days ago β’ 356
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others β’ Jan 20 β’ 34
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 β’ 68
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw and 1 other β’ Jan 7 β’ 24
The Perfect Blend: Redefining RLHF with Mixture of Judges Paper β’ 2409.20370 β’ Published Sep 30, 2024 β’ 5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 134
Qwen2-VL Collection Vision-language model series based on Qwen2 β’ 16 items β’ Updated Dec 6, 2024 β’ 205
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 23 items β’ Updated Dec 13, 2024 β’ 141
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 127
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog β’ 9 items β’ Updated 10 days ago β’ 60
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 45 items β’ Updated Nov 28, 2024 β’ 525
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 16 items β’ Updated about 5 hours ago β’ 238
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw and 9 others β’ Oct 16, 2024 β’ 18