Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 1 day ago • 162
view article Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face By not-lain • 1 day ago • 9
Cosmos Tokenizer Collection A suite of image and video tokenizers • 10 items • Updated 6 days ago • 13
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 19 days ago • 25
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… 22 days ago • 58
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Paper • 2410.13863 • Published 26 days ago • 35
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated 30 days ago • 14
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 29
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 117
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention Aug 21 • 22