π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 12 items β’ Updated about 11 hours ago β’ 74
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 3 items β’ Updated 25 days ago β’ 356
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 29 days ago β’ 325
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 β’ 147
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other β’ 28 days ago β’ 62
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release β’ 12 items β’ Updated about 6 hours ago β’ 69
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! 29 days ago β’ 142
Enhancing Human-Like Responses in Large Language Models Paper β’ 2501.05032 β’ Published Jan 9 β’ 49
Cosmos Tokenizer Collection A suite of image and video tokenizers β’ 13 items β’ Updated Jan 17 β’ 39
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper β’ 2412.10302 β’ Published Dec 13, 2024 β’ 17