SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 11 days ago • 65
view article Article LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs By tegridydev • 2 days ago • 3
WildChat-50m Collection All model responses associated with the WildChat-50m paper. • 55 items • Updated 5 days ago • 6
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • 4 days ago • 22
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Paper • 2501.16937 • Published 6 days ago • 4
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas Paper • 2501.15427 • Published 8 days ago • 6
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper • 2501.16764 • Published 6 days ago • 18
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 5 days ago • 80
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation Paper • 2501.15907 • Published 7 days ago • 14
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 7 days ago • 22
view article Article Hunyuan video LoRA training study (Single image/style training) By neph1 • 6 days ago • 1
CLAIR and APO Collection Data and Models for the paper "Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment" • 8 items • Updated Aug 14, 2024 • 4
view article Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 10 days ago • 10