merve's Collections
Video Classification Models 📺
Updated Sep 19, 2023
microsoft/xclip-base-patch32 • Video Classification • Updated Feb 4 • 368k • 70
facebook/timesformer-base-finetuned-k400 • Video Classification • Updated Jan 2, 2023 • 76.6k • 25
facebook/timesformer-base-finetuned-k600 • Video Classification • Updated Dec 12, 2022 • 7.28k • 11
google/vivit-b-16x2 • Video Classification • Updated Aug 3, 2023 • 1.07k • 7
google/vivit-b-16x2-kinetics400 • Video Classification • Updated Aug 3, 2023 • 413k • 21
MCG-NJU/videomae-base • Video Classification • Updated Mar 29 • 85.7k • 37
MCG-NJU/videomae-large • Video Classification • Updated Apr 1 • 226k • 18
facebook/timesformer-base-finetuned-ssv2 • Video Classification • Updated Dec 12, 2022 • 1.2k • 3