view article Article π#89: AI in Action: How AI Engineers, Self-Optimizing Models, and Humanoid Robots Are Reshaping 2025 By Kseniase β’ 4 days ago β’ 4
view article Article What is test-time compute and how to scale it? By Kseniase and 1 other β’ 22 days ago β’ 46
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published 24 days ago β’ 195
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 β’ 69
view article Article Topic 23: What is LLM Inference, it's challenges and solutions for it By Kseniase β’ Jan 17 β’ 5
Centurio Collection Artifacts of the paper "Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model" β’ 6 items β’ Updated 24 days ago β’ 4
view article Article Synthetic Data Generation with FastData and Hugging Face By asoria β’ Jan 7 β’ 14
Sa2VA Model Zoo Collection Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research β’ 4 items β’ Updated 20 days ago β’ 30
Deepseek V3 (All Versions) Collection Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. β’ 3 items β’ Updated 1 day ago β’ 33
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper β’ 2412.18619 β’ Published Dec 16, 2024 β’ 55
view article Article π¦Έπ»#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase β’ Dec 28, 2024 β’ 10