TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper • 2502.06608 • Published 10 days ago • 31
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance Paper • 2502.06145 • Published 11 days ago • 16
Pippo: High-Resolution Multi-View Humans from a Single Image Paper • 2502.07785 • Published 9 days ago • 9
VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Paper • 2502.07531 • Published 9 days ago • 13
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 13 days ago • 22
Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 14 days ago • 48
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 18 days ago • 179
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 16 days ago • 55
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 27 days ago • 30
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text Paper • 2501.15654 • Published 25 days ago • 11
Histoires Morales: A French Dataset for Assessing Moral Alignment Paper • 2501.17117 • Published 23 days ago • 3
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 29 days ago • 67
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer Paper • 2406.16620 • Published Jun 24, 2024 • 2