Yuhao Dong

THUdyh

AI & ML interests

None yet

Recent Activity

Organizations

EgoLife's profile picture EgoLife-Team's profile picture

THUdyh's activity

New activity in THUdyh/Oryx-ViT 1 day ago
New activity in THUdyh/Ola-Image 3 days ago

Add pipeline tag

#1 opened 6 days ago by
nielsr
New activity in THUdyh/Ola-Video 3 days ago

Add pipeline tag

#1 opened 6 days ago by
nielsr
New activity in THUdyh/Ola_speech_encoders 3 days ago

Add model card

#1 opened 6 days ago by
nielsr
New activity in THUdyh/Ola-Data 5 days ago

Add dataset card

#2 opened 6 days ago by
nielsr
posted an update 7 days ago
view post
Post
1956
🔥🔥Introducing Ola! State-of-the-art omni-modal understanding model with advanced progressive modality alignment strategy!
Ola ranks #1 on OpenCompass Leaderboard (<10B)
.
📜Paper: https://arxiv.org/abs/2502.04328
🛠️Code: https://github.com/Ola-Omni/Ola

🛠️We have fully released our video&audio training data, intermediate image&video model at THUdyh/ola-67b8220eb93406ec87aeec37. Try to build your own powerful omni-modal model with our data and models!