🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated 8 days ago • 70
OpenR1-Math Collection Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co./blog/open-r1/update-2 • 3 items • Updated 5 days ago • 6
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 2 days ago • 55
Nomic Embed Collection Open Source Long Context Text Embedders • 8 items • Updated Feb 14, 2024 • 20
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 12 days ago • 108
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 12 days ago • 48
Mistral-Small-24B-2501 (All Versions) Collection A collection of Mistral's new Small 2501 models including GGUF, 4-bit and more! • 9 items • Updated 15 days ago • 5
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 8
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 16 days ago • 37
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 17 days ago • 53
SFTvsRL Models & Data Collection This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training. • 5 items • Updated 14 days ago • 8
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 22 days ago • 106
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 23 days ago • 349
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 24 days ago • 100