Alexander Visheratin PRO

visheratin

Recent Activity

new activity 25 days ago

visheratin/mexma-siglip2: Benchmarks for COCO and Flickr?

new activity 25 days ago

visheratin/mexma-siglip2: How to Optimize Slow CPU Inference Speed

new activity about 2 months ago

visheratin/mexma-siglip2: Finetuning example

Posts 5

Post

3455

Yesterday, xAI announced Grok-1.5 Vision (https://x.ai/blog/grok-1.5v). More importantly, they also released a new VLM benchmark dataset, RealWorldQA. The only problem was that they released it as a ZIP archive. I fixed that! Now you can use it in your evaluations as a regular HF dataset: visheratin/realworldqa
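As a rough illustration of using the dataset in an evaluation, here is a minimal sketch. It assumes the `datasets` library is installed and that the repository exposes a `test` split; both the split name and the helper names (`normalize`, `exact_match_accuracy`, `load_realworldqa`) are assumptions for illustration and are not taken from the post. The scoring rule (case-insensitive exact match) is likewise just one simple choice.

```python
from typing import Iterable


def normalize(text: str) -> str:
    """Lowercase, trim whitespace, and drop trailing periods."""
    return text.strip().lower().rstrip(".")


def exact_match_accuracy(predictions: Iterable[str], references: Iterable[str]) -> float:
    """Fraction of predictions equal to their reference after normalization."""
    preds = list(predictions)
    refs = list(references)
    if len(preds) != len(refs):
        raise ValueError("predictions and references must have the same length")
    if not refs:
        return 0.0
    hits = sum(normalize(p) == normalize(r) for p, r in zip(preds, refs))
    return hits / len(refs)


def load_realworldqa(split: str = "test"):
    """Load the dataset from the Hugging Face Hub.

    Assumption: the repository has a split named "test"; check the
    dataset card if this raises an error.
    """
    from datasets import load_dataset  # requires `pip install datasets`

    return load_dataset("visheratin/realworldqa", split=split)
```

To score a model, one would call `load_realworldqa()`, generate one answer per row, and pass the answers together with the dataset's ground-truth column to `exact_match_accuracy`. The column names are not stated in the post, so inspect them on the loaded dataset (for example via its `column_names` attribute) before wiring this up.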

Articles 2

Article

5

Data exploration and filtering with Nomic Atlas

Papers 1

arxiv:2309.01859

Spaces 3

Mexma Siglip2

Classify images based on text queries

Running on Zero

Mc Llava 3b

Generate answers to questions about images

Laion Nllb

Models 20

visheratin/mexma-siglip2

Zero-Shot Image Classification • Updated Mar 6 • 254 • 4

visheratin/nllb-siglip-mrl-large

Zero-Shot Image Classification • Updated Mar 2 • 346 • 14

visheratin/nllb-siglip-mrl-base

Zero-Shot Image Classification • Updated Mar 2 • 370 • 9

visheratin/nllb-clip-base-siglip

Zero-Shot Image Classification • Updated Mar 2 • 537 • 1

visheratin/nllb-clip-large-siglip

Zero-Shot Image Classification • Updated Mar 2 • 504 • 5

visheratin/mexma-siglip

Zero-Shot Image Classification • Updated Mar 2 • 158 • 3

visheratin/nllb-siglip-i18n

Zero-Shot Image Classification • Updated Jun 3, 2024 • 3 • 1

visheratin/mc-llava-3b-ft

Feature Extraction • Updated Mar 24, 2024 • 1

visheratin/MC-LLaVA-3b

Updated Feb 28, 2024 • 663 • 84

visheratin/nllb-clip-large-oc

Zero-Shot Image Classification • Updated Oct 24, 2023 • 28 • 2

Datasets 11

visheratin/documentation-images

Viewer • Updated Apr 16, 2024 • 1 • 4.55k

visheratin/realworldqa

Viewer • Updated Apr 13, 2024 • 765 • 1.1k • 33

visheratin/laion-coco-nllb

Viewer • Updated Apr 11, 2024 • 894k • 986 • 41

visheratin/nllb-coco-long

Viewer • Updated Apr 9, 2024 • 45.7k • 23

visheratin/SVIT

Viewer • Updated Mar 31, 2024 • 108k • 36

visheratin/google_landmarks_photos

Viewer • Updated Mar 19, 2024 • 1.27M • 33 • 3

visheratin/object_questions

Viewer • Updated Mar 17, 2024 • 132k • 22

visheratin/uber_text_qa

Viewer • Updated Mar 16, 2024 • 9.98k • 34 • 2

visheratin/google_landmarks_places

Viewer • Updated Mar 16, 2024 • 35.1k • 29 • 2

visheratin/unsplash-caption-questions-init

Viewer • Updated Feb 28, 2024 • 24.9k • 20