Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published 4 days ago • 11
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking Paper • 2502.02339 • Published 5 days ago • 16
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 5 days ago • 140
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper • 2502.04328 • Published 3 days ago • 19
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Paper • 2502.04320 • Published 3 days ago • 21
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models Paper • 2502.03032 • Published 4 days ago • 52
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Paper • 2502.01639 • Published 6 days ago • 24
Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations Paper • 2501.19066 • Published 9 days ago • 9
Current Pathology Foundation Models are unrobust to Medical Center Differences Paper • 2501.18055 • Published 11 days ago • 2
A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation Paper • 2502.00314 • Published 8 days ago • 3
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published 6 days ago • 32
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders Paper • 2501.18052 • Published 11 days ago • 6
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning Paper • 2411.04983 • Published Nov 7, 2024 • 11
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 11 days ago • 22