Submitted by ClownRat 89 Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss · 9 authors 3
Submitted by ZetangForward 43 LOGO -- Long cOntext aliGnment via efficient preference Optimization · 5 authors 2
Submitted by dyyyyyyyy 42 Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch · 6 authors 3
Submitted by flavoredquark 37 Unbounded: A Generative Infinite Game of Character Life Simulation · 8 authors 2
Submitted by yuzhaouoe 20 Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering · 8 authors 3
Submitted by chiennv 18 Taipan: Efficient and Expressive State Space Language Models with Selective Attention · 11 authors 2
Submitted by EvanTHU 15 MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms · 5 authors 2
Submitted by ldwang 11 CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models · 10 authors 3
Submitted by aryopg 10 DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations · 8 authors 3
Submitted by Shilin-LU 10 Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances · 5 authors 2
Submitted by wangfuyun 10 Stable Consistency Tuning: Understanding and Improving Consistency Models · 3 authors 3
Submitted by Zhiwei840 9 ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning · 6 authors 2
Submitted by Zcchill 8 Value Residual Learning For Alleviating Attention Concentration In Transformers · 4 authors 2
Submitted by Fulu2024 7 The Nature of Mathematical Modeling and Probabilistic Optimization Engineering in Generative AI · 1 author 2
Submitted by mnoukhov 7 Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models · 6 authors 2
Submitted by Dominic789654 7 Should We Really Edit Language Models? On the Evaluation of Edited Language Models · 7 authors 2
Submitted by brando 6 ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment · 7 authors 2
Submitted by Yingdong-Hu 6 Data Scaling Laws in Imitation Learning for Robotic Manipulation · 6 authors 2
Submitted by brando 5 Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4 · 5 authors 2
Submitted by mamaj92 5 Multi-Draft Speculative Sampling: Canonical Architectures and Theoretical Limits · 6 authors 2