Alexander Visheratin PRO

visheratin

AI & ML interests

None yet

Recent Activity

updated a model 2 months ago

visheratin/mexma-siglip

updated a model 2 months ago

visheratin/nllb-clip-large-siglip

Articles

Data exploration and filtering with Nomic Atlas

Breaking resolution curse of vision-language models

Organizations

Posts 5

Post

3294

Yesterday, xAI announced Grok-1.5 Vision - https://x.ai/blog/grok-1.5v. But more importantly, they also released a new VLM benchmark dataset - RealWorldQA. The only problem was that they released it as a ZIP archive. I fixed that! Now you can use it in your evaluations as a regular HF dataset: visheratin/realworldqa
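As a minimal sketch of using the dataset in an evaluation, assuming the Hugging Face `datasets` library is installed and the Hub is reachable, it can be loaded with `load_dataset`. The helper name `load_realworldqa` is hypothetical, and the post does not state the split or column names, so the sketch leaves them to be inspected after loading.

```python
REPO_ID = "visheratin/realworldqa"  # dataset ID given in the post


def load_realworldqa(repo_id: str = REPO_ID):
    """Download the dataset from the Hugging Face Hub (hypothetical helper)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Returns whatever splits the repository defines; none are assumed here.
    return load_dataset(repo_id)
```

Calling `print(load_realworldqa())` should show the available splits, column names, and row counts, which is a reasonable first check before wiring the dataset into an evaluation loop.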

Post

2034

Look at the beauty in the video: four different embeddings on the same map! In another community blog post, I explore how you can use Nomic Atlas to view and clean your dataset. You can check it out here - https://huggingface.co/blog/visheratin/nomic-data-cleaning

Papers 1

arxiv:2309.01859

Spaces 2

Running on Zero

Mc Llava 3b

Laion Nllb

Models 19

visheratin/mexma-siglip

Zero-Shot Image Classification • Updated Dec 4, 2024 • 165 • 3

visheratin/nllb-clip-large-siglip

Zero-Shot Image Classification • Updated Dec 4, 2024 • 731 • 4

visheratin/nllb-siglip-i18n

Zero-Shot Image Classification • Updated Jun 3, 2024 • 1 • 1

visheratin/nllb-clip-base-siglip

Zero-Shot Image Classification • Updated May 3, 2024 • 802 • 1

visheratin/mc-llava-3b-ft

Feature Extraction • Updated Mar 24, 2024 • 163

visheratin/nllb-siglip-mrl-large

Zero-Shot Image Classification • Updated Mar 10, 2024 • 846 • 13

visheratin/nllb-siglip-mrl-base

Zero-Shot Image Classification • Updated Mar 10, 2024 • 938 • 9

visheratin/MC-LLaVA-3b

Updated Feb 28, 2024 • 119 • 83

visheratin/nllb-clip-large-oc

Zero-Shot Image Classification • Updated Oct 24, 2023 • 3.48k • 2

visheratin/nllb-clip-base-oc

Zero-Shot Image Classification • Updated Oct 24, 2023 • 6.63k • 1

Datasets 11

visheratin/documentation-images

Viewer • Updated Apr 16, 2024 • 1 • 4.29k

visheratin/realworldqa

Viewer • Updated Apr 13, 2024 • 765 • 408 • 33

visheratin/laion-coco-nllb

Viewer • Updated Apr 11, 2024 • 894k • 1.73k • 41

visheratin/nllb-coco-long

Viewer • Updated Apr 9, 2024 • 45.7k • 75

visheratin/SVIT

Viewer • Updated Mar 31, 2024 • 108k • 36

visheratin/google_landmarks_photos

Viewer • Updated Mar 19, 2024 • 1.27M • 51 • 3

visheratin/object_questions

Viewer • Updated Mar 17, 2024 • 132k • 45

visheratin/uber_text_qa

Viewer • Updated Mar 16, 2024 • 9.98k • 92 • 2

visheratin/google_landmarks_places

Viewer • Updated Mar 16, 2024 • 35.1k • 54 • 2

visheratin/unsplash-caption-questions-init

Viewer • Updated Feb 28, 2024 • 24.9k • 43