olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org β’ 3 items β’ Updated 2 days ago β’ 46
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. β’ 31 items β’ Updated 3 days ago β’ 25
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 10 days ago β’ 60
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 β’ 179
Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 82
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 46 items β’ Updated 2 days ago β’ 535
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper β’ 2501.17161 β’ Published Jan 28 β’ 108
OLMo 2 Preview Post-trained Models Collection These model's tokenizer did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in latest versions. β’ 6 items β’ Updated 18 days ago β’ 2
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths β’ 3 items β’ Updated 2 days ago β’ 102
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 8 items β’ Updated 5 days ago β’ 379
Deepseek V3 (All Versions) Collection Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. β’ 3 items β’ Updated 1 day ago β’ 33