a collection of Video-LLaVA 1.0
linbin
LanguageBind
AI & ML interests
None yet
Organizations
None yet
Collections
4
a collection of LanguageBind based on VIDAL-10M
-
LanguageBind/LanguageBind_Video_merge
Zero-Shot Image Classification • Updated • 55k • 2 -
LanguageBind/LanguageBind_Video
Zero-Shot Image Classification • Updated • 1.78k • 1 -
LanguageBind/LanguageBind_Audio
Zero-Shot Image Classification • Updated • 285 • 2 -
LanguageBind/LanguageBind_Image
Zero-Shot Image Classification • Updated • 61.1k • 3
models
13
LanguageBind/Video-LLaVA-V1.5
Updated
•
3
LanguageBind/LanguageBind_Audio_V1.5
Updated
•
1
LanguageBind/LanguageBind_Video_V1.5
Updated
•
1
LanguageBind/LanguageBind_Audio_FT
Zero-Shot Image Classification
•
Updated
•
32
•
1
LanguageBind/LanguageBind_Video_FT
Zero-Shot Image Classification
•
Updated
•
450
•
1
LanguageBind/Video-LLaVA-7B
Text Generation
•
Updated
•
25.7k
•
27
LanguageBind/LanguageBind_Video_merge
Zero-Shot Image Classification
•
Updated
•
55k
•
2
LanguageBind/Video-LLaVA-Pretrain-7B
Text Generation
•
Updated
•
27
•
1
LanguageBind/LanguageBind_Audio
Zero-Shot Image Classification
•
Updated
•
285
•
2
LanguageBind/LanguageBind_Video
Zero-Shot Image Classification
•
Updated
•
1.78k
•
1