olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 1 day ago • 46
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated about 4 hours ago • 82
Goku: Flow Based Video Generative Foundation Models Paper • 2502.04896 • Published 21 days ago • 90
Fast Video Generation with Sliding Tile Attention Paper • 2502.04507 • Published 22 days ago • 47
Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 22 days ago • 50
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 24 days ago • 195
Fully Autonomous AI Agents Should Not be Developed Paper • 2502.02649 • Published 24 days ago • 24
Article #86: Four Freedoms of truly open AI By TuringPost and 1 other • 25 days ago • 5
Article From Hippocrates to AI: Reflections on the Evolution of Consent By giadap • 24 days ago • 8
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 25 days ago • 109