Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Posts 2

Post
984
Hugging Face presents FineVideo 🎥! Unlocking the next generation of video understanding 🚀

🤯 3400 hours of annotated Creative Commons videos with rich character descriptions, scene splits, mood, and content descriptions per scene, as well as QA pairs.
🔥 @mfarre processed over 2M videos from YouTube-CC to make this incredibly powerful selection.

Very psyched to fine-tune idefics on this dataset. ⚡️
Explore the videos: HuggingFaceFV/FineVideo-Explorer
Post
1556
🚀 Introducing Hugging Face's Multilingual Speech-to-Speech! 🎤
💬 Our modular, cross-platform pipeline to run GPT4o-like experiences on device can now seamlessly switch languages mid-conversation with an imperceptible 100 ms delay.

🌟 Building on an amazing early reception with 2600 stars on GitHub 🌟
🚀 We are expanding the library to support multiple languages
🔥 Try it out with a flag: --language fr
🤯 Or don't set the flag and let the system detect the language

💡 What feature should we add next?