Helium-1 Collection Kyutai's Helium-1 2B Model, outperforming other state-of-the-art small models. • 4 items • Updated 6 days ago • 1
J.O.S.I.E. v6.0 Collection Trained on open-source and private custom DPO/ORPO datasets • 8 items • Updated 13 days ago • 2
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated 19 days ago • 57
Llama 3.2 Collection Meta goes small with Llama 3.2, both the text-only 1B and 3B models and the 11B Vision models. • 15 items • Updated Dec 17, 2024 • 11
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination Paper • 2411.03823 • Published Nov 6, 2024 • 44
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 65
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 205
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 77
Josiefied and Abliterated Collection Abliterated and further fine-tuned to be the most uncensored models available. • 16 items • Updated 12 days ago • 4
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 18 days ago • 293
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 23 items • Updated 4 days ago • 46