Orr Zohar's picture

Orr Zohar PRO

orrzohar

·

https://orrzohar.github.io

AI & ML interests

Large Multi-Modal Models, Foundation Models, Video Understanding

Organizations

orrzohar's activity

New activity in lmms-lab/LLaVA-OneVision-Data about 2 months ago

Missing/corrupted images in dataset

#9 opened about 2 months ago by

commented a paper 3 months ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19 • 51 •

New activity in ShareGPT4Video/ShareGPT4Video 3 months ago

obtaining original videos for video instruction tuning

#23 opened 3 months ago by

commented 3 papers 4 months ago

$VILA^2$: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24 • 38 •

$VILA^2$: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24 • 38 •

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8 • 24 •

New activity in HuggingFaceM4/idefics2-8b 6 months ago

Idefics2-pretraining

#54 opened 6 months ago by

New activity in meta-llama/Meta-Llama-3-8B-Instruct 7 months ago

The request to access the repo has been sent for several days, why hasn't it passed yet?

#70 opened 7 months ago by