Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 6 days ago • 65
When an LLM is apprehensive about its answers -- and when its uncertainty is justified Paper • 2503.01688 • Published 6 days ago • 19
Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper • 2503.00808 • Published 8 days ago • 51
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published 10 days ago • 29