Solving math word problems with process- and outcome-based feedback Paper β’ 2211.14275 β’ Published Nov 25, 2022 β’ 7
POINTS1.5: Building a Vision-Language Model towards Real World Applications Paper β’ 2412.08443 β’ Published 14 days ago β’ 38
NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. β’ 9 items β’ Updated 11 days ago β’ 9
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated 27 days ago β’ 289
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. β’ 32 items β’ Updated 27 days ago β’ 62
The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) β’ 13 items β’ Updated Nov 18 β’ 176
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 62 items β’ Updated about 2 hours ago β’ 483
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published Apr 22 β’ 126
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ Nov 13 β’ 98
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 β’ 9 items β’ Updated 28 days ago β’ 99
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data Paper β’ 2410.18558 β’ Published Oct 24 β’ 18