rank1 Collection rank1 is the first test-time compute reasoning model in IR • 15 items • Updated about 22 hours ago • 3
OWLS: Scaling Laws for Speech Recognition and Translation Collection 🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. • 6 items • Updated 3 days ago • 3
Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models Paper • 2502.15964 • Published 7 days ago • 1
"Actionable Help" in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts Paper • 2502.16839 • Published 4 days ago • 1
Slam Collection All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 6 items • Updated 3 days ago • 12
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Paper • 2502.17387 • Published 4 days ago • 3
KB-Whisper Collection Whisper models trained on over 50,000 hours of Swedish speech data. • 5 items • Updated 14 days ago • 4
Open Image Preferences Collection Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9
Maths reasoning Collection Maths reasoning datasets found using https://huggingface.co./spaces/librarian-bots/huggingface-datasets-semantic-search • 14 items • Updated 14 days ago • 2
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 24 days ago • 195
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 17 days ago • 49