🔥🔥 Introducing Ola! A state-of-the-art omni-modal understanding model with an advanced progressive modality alignment strategy! Ola ranks #1 on the OpenCompass Leaderboard (<10B).
📜 Paper: https://arxiv.org/abs/2502.04328
🛠️ Code: https://github.com/Ola-Omni/Ola
🛠️ We have fully released our video & audio training data and intermediate image & video models at THUdyh/ola-67b8220eb93406ec87aeec37. Try building your own powerful omni-modal model with our data and models!
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper • 2502.04328 • Published 22 days ago • 27
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published Jan 7 • 25
🚀🚀🚀 Introducing Insight-V! An early attempt at o1-like multi-modal reasoning. We offer a structured long-chain visual reasoning data generation pipeline and a multi-agent system to unleash the reasoning potential of MLLMs.
📜 Paper: https://arxiv.org/abs/2411.14432
🛠️ GitHub: https://github.com/dongyh20/Insight-V
💼 Model Weights: THUdyh/insight-v-673f5e1dd8ab5f2d8d332035
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published Nov 21, 2024 • 23