Aritra Roy Gosthipaty's picture

Aritra Roy Gosthipaty PRO

ariG23498

·

https://arig23498.github.io/

AI & ML interests

Deep Representation Learning

Recent Activity

upvoted a collection 3 days ago

Shot categorizer

updated a Space 3 days ago

ariG23498/shot-categorizer-demo

published a Space 3 days ago

ariG23498/shot-categorizer-demo

View all activity

Organizations

ariG23498's activity

upvoted a collection 3 days ago

Shot categorizer

Fine-tune of Florence-2 to generate shot categories, useful for data curation. Code: https://github.com/huggingface/movie-shot-categorizer. • 3 items • Updated 3 days ago • 2

updated a Space 3 days ago

Shot Categorizer Demo

Analyze image for color, lighting, and composition

published a Space 3 days ago

Shot Categorizer Demo

Analyze image for color, lighting, and composition

liked a Space 3 days ago

Phi4 Multimodal

Space demoing Phi4 MultiModal

updated a model 3 days ago

ariG23498/QwQ-32B-nf4

Text Generation • Updated 3 days ago • 10

published a model 3 days ago

ariG23498/QwQ-32B-nf4

Text Generation • Updated 3 days ago • 10

upvoted a collection 5 days ago

C4AI Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 5 days ago • 58

upvoted an article 5 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

5 days ago

• 56

liked a Space 5 days ago

Find a leaderboard

Explore and discover all leaderboards from the HF community

updated a dataset 5 days ago

huggingface/documentation-images

Viewer • Updated 2 days ago • 50 • 4.88M • 53

liked a Space 6 days ago

Tight Inversion

Transform images based on text prompts

upvoted an article 9 days ago

Article

Common AI Model Formats

By

•

9 days ago

• 27

upvoted 2 articles 10 days ago

Article

SigLIP 2: A better multilingual vision language encoder

16 days ago

• 126

Article

HuggingFace, IISc partner to supercharge model building on India's diverse languages

10 days ago

• 13

liked a Space 10 days ago

LLaDA

Large Language Diffusion Models

published a Space 10 days ago

Phi4 Multimodal

Space demoing Phi4 MultiModal

updated a Space 10 days ago

Phi4 Multimodal

Space demoing Phi4 MultiModal

upvoted a paper 10 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

upvoted a collection 10 days ago

Phi-4

Phi-4 family of small language and multi-modal models. • 7 items • Updated 5 days ago • 108

commented on Remote VAEs for decoding with HF endpoints 🤗 11 days ago

This is really nice!