DAMO-NLP-SG 's Collections

VideoLLaMA2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability