Miquel Farré

mfarre

AI & ML interests

I like everything video

Recent Activity

updated a model 3 days ago

HuggingFaceTB/SmolVLM-Instruct

updated a model 3 days ago

HuggingFaceTB/SmolVLM-500M-Instruct

updated a model 3 days ago

HuggingFaceTB/SmolVLM-256M-Instruct

mfarre's activity

upvoted an article 17 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

18 days ago

• 197

upvoted a collection 17 days ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

Collection

11 items • Updated 12 days ago • 56

upvoted an article about 1 month ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 146

upvoted an article 2 months ago

Article

Announcing NVIDIA Cosmos World Foundation Models

and 1 other •

Jan 7

• 24

upvoted a paper 3 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 140

upvoted a paper 4 months ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 28

upvoted 3 articles 6 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

• 29

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

Article

Scaling robotics datasets with video encoding

Aug 27, 2024

• 38

upvoted a paper 7 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 126