VITA-MLLM/VITA-1.5
Video-Text-to-Text
Safetensors
vita-Qwen2
arxiv: 2501.01957
refs/pr/2
VITA-1.5/audio-encoder-Qwen2-7B-1107-weight-base-11wh-tunning/final.pt
Commit History
add all
077821d
shenyunhang committed on Dec 18, 2024