Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 3 days ago • 61
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published 11 days ago • 56
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 17 days ago • 128
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 21 days ago • 141
Latent Radiance Fields with 3D-aware 2D Representations Paper • 2502.09613 • Published 24 days ago • 6
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 30 days ago • 121
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published Dec 13, 2024 • 140
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association Paper • 2402.07814 • Published Feb 12, 2024 • 1
Learning Continuous Mesh Representation with Spherical Implicit Surface Paper • 2301.04695 • Published Jan 11, 2023
DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering Paper • 2406.02518 • Published Jun 4, 2024
6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering Paper • 2410.04974 • Published Oct 7, 2024
Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats Paper • 2410.12781 • Published Oct 16, 2024 • 6
MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation Paper • 2410.02458 • Published Oct 3, 2024 • 9
MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis Paper • 2410.02103 • Published Oct 2, 2024 • 8
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Paper • 2409.17145 • Published Sep 25, 2024 • 15
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 29
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt Paper • 2409.12892 • Published Sep 19, 2024 • 5