LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Paper • 2412.15214 • Published Dec 19, 2024 • 15
p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay Paper • 2412.04449 • Published Dec 5, 2024 • 7
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution Paper • 2410.22655 • Published Oct 30, 2024 • 1
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 25