view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 1 day ago • 56
Gemma 2 Swahili Quantized Models Collection A collection of gemma 2 swahili quantized models made by the community • 6 items • Updated 9 days ago • 1
Gemma 2 Swahili Collection Gemma 2 Swahili is a family of lightweight, state-of-the-art Swahili variants of Gemma 2 models. • 4 items • Updated 26 days ago • 1
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 92
T5 release Collection The original T5 transformer release was done in two steps, the original T5 checkpoints and the improved T5v1 • 9 items • Updated Dec 13, 2024 • 12
Flan-T5 release Collection The Flan-T5 covers 4 checkpoints of different sizes each time. It also includes upgrades versions trained using Universal sampling • 7 items • Updated Dec 13, 2024 • 23
TimesFM Release Collection TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated Dec 24, 2024 • 13
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Dec 13, 2024 • 85
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 Paper • 2312.16171 • Published Dec 26, 2023 • 34
Automatic Speech Recognition 📝 Collection A collection of ASR models supported in 🤗 Transformers • 11 items • Updated Sep 16, 2023 • 8
HyenaDNA Models Collection HyenaDNA models usable directly with Hugging Face classes like AutoModel. • 8 items • Updated Nov 14, 2023 • 17