Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoLLaMA2.1-7B-AV
like
8
Follow
Language Technology Lab at Alibaba DAMO Academy
41
Visual Question Answering
Transformers
Safetensors
lmms-lab/ClothoAQA
Loie/VGGSound
English
videollama2_qwen2
text-generation
Audio-visual Question Answering
Audio Question Answering
multimodal large language model
Inference Endpoints
arxiv:
2406.07476
arxiv:
2306.02858
License:
apache-2.0
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
main
VideoLLaMA2.1-7B-AV
/
README.md
Commit History
Update README.md
d944d42
verified
YifeiXin
commited on
21 days ago
Update README.md
b9c58e1
verified
lixin4ever
commited on
23 days ago
Update README.md
fba52ca
verified
lixin4ever
commited on
23 days ago
Update README.md
4c84984
verified
lixin4ever
commited on
24 days ago
Update README.md
c7e14fd
verified
YifeiXin
commited on
24 days ago
initial commit
eacaf06
verified
YifeiXin
commited on
25 days ago