Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated about 5 hours ago • 82
view article Article From Files to Chunks: Improving Hugging Face Storage Efficiency Nov 20, 2024 • 51
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 17 days ago • 49
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 334
Phi-4 (All Versions) Collection Microsoft's new Phi-4 models including mini & multimodal in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 7 items • Updated 1 day ago • 42
2024 Interconnects Artifacts Collection Models & datasets mentioned in the bottom section of posts! • 280 items • Updated Jan 2 • 6
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 143
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 118