Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 5 days ago • 46
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 8 days ago • 179
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 8 days ago • 136
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published 9 days ago • 48
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published 12 days ago • 38
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published 10 days ago • 40
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 13 days ago • 112
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 10 days ago • 132
On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices Paper • 2502.04363 • Published 16 days ago • 11
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published 17 days ago • 13
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 18 days ago • 111
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 20 days ago • 37
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 21 days ago • 81