Aritra Roy Gosthipaty PRO

ariG23498

AI & ML interests

Deep Representation Learning

Recent Activity

upvoted a collection 3 days ago
Shot categorizer
updated a Space 3 days ago
ariG23498/shot-categorizer-demo
published a Space 3 days ago
ariG23498/shot-categorizer-demo

Organizations

Hugging Face, Google, Notebooks-explorers, PyTorch Image Models, Keras, Cohere For AI, Hugging Test Lab, Hugging Face Fellows, Probing ViTs, TrystAI, PyImageSearch, Keras Dreambooth Event, Hugging Face OSS Metrics, Blog-explorers, ZeroGPU Explorers, kotol, gg-hf, MLX Community, IBM Granite, Open Generative Fill, Social Post Explorers, Hugging Face Discord Community, nltpt, nltpt-q, qrias, Hugging Face Science, open/ acc, wut?, LLM from Scratch, s0225, gg-hf-g

ariG23498's activity

upvoted an article 5 days ago
Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

• 56
upvoted an article 9 days ago
upvoted 2 articles 10 days ago
Article

SigLIP 2: A better multilingual vision language encoder

• 126
Article

HuggingFace, IISc partner to supercharge model building on India's diverse languages

• 13
upvoted an article 13 days ago
Article

Remote VAEs for decoding with HF endpoints 🤗

• 34
upvoted an article 16 days ago
Article

SmolVLM2: Bringing Video Understanding to Every Device

• 196
upvoted an article 17 days ago
Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

• 62
upvoted 2 articles 18 days ago
Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By manu • 214
Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

• 93
upvoted an article 25 days ago
Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By Steveeeeeeen and 1 other • 26
upvoted an article 26 days ago
Article

The Open Arabic LLM Leaderboard 2

• 28