Systran/faster-whisper-large-v3 Automatic Speech Recognition โข Updated Nov 23, 2023 โข 783k โข 318
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper โข 2403.15377 โข Published Mar 22, 2024 โข 23
DreamReward: Text-to-3D Generation with Human Preference Paper โข 2403.14613 โข Published Mar 21, 2024 โข 36
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance Paper โข 2403.14781 โข Published Mar 21, 2024 โข 15