Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 3 days ago • 50
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Paper • 2501.01904 • Published 9 days ago • 29
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 13 days ago • 34
ProgCo: Program Helps Self-Correction of Large Language Models Paper • 2501.01264 • Published 10 days ago • 24
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published 25 days ago • 53
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 20 days ago • 42
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published 21 days ago • 45
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 24 days ago • 72
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Paper • 2305.11738 • Published May 19, 2023 • 8
UI Agent Collection: a collection of algorithmic agents for user interfaces/interactions and program synthesis • 237 items • Updated 3 days ago • 40
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published about 1 month ago • 86
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published 26 days ago • 41
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper • 2412.12606 • Published 27 days ago • 41
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 28 days ago • 27
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published 27 days ago • 33