Autellix: An Efficient Serving Engine for LLM Agents as General Programs Paper • 2502.13965 • Published 9 days ago • 18
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published 12 days ago • 16
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models Paper • 2502.13533 • Published 10 days ago • 9
Intuitive physics understanding emerges from self-supervised pretraining on natural videos Paper • 2502.11831 • Published 11 days ago • 16
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 9 days ago • 80
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 25 days ago • 109
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Paper • 2502.04295 • Published 22 days ago • 12
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Paper • 2502.02508 • Published 24 days ago • 22
Sundial: A Family of Highly Capable Time Series Foundation Models Paper • 2502.00816 • Published 26 days ago • 3
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 24 days ago • 196
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Paper • 2501.13926 • Published Jan 23 • 37
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published Jan 22 • 56
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20 • 92
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning Paper • 2501.12570 • Published Jan 22 • 24