Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wangyueqian
/
MMDuet
like
4
Video-Text-to-Text
PEFT
Safetensors
wangyueqian/MMDuetIT
English
llava-onevision
llava
multimodal
online video understanding
video understanding
arxiv:
2411.17991
License:
mit
Model card
Files
Files and versions
Community
Use this model
main
MMDuet
1 contributor
History:
3 commits
wangyueqian
add paper and video demo to REAME.md
2366917
verified
28 days ago
.gitattributes
Safe
1.57 kB
Upload 9 files
29 days ago
README.md
Safe
1.42 kB
add paper and video demo to REAME.md
28 days ago
adapter_config.json
Safe
784 Bytes
Upload 9 files
29 days ago
adapter_model.safetensors
Safe
115 MB
LFS
Upload 9 files
29 days ago
added_tokens.json
Safe
101 Bytes
Upload 9 files
29 days ago
merges.txt
Safe
1.67 MB
Upload 9 files
29 days ago
special_tokens_map.json
Safe
387 Bytes
Upload 9 files
29 days ago
tokenizer.json
Safe
11.4 MB
LFS
Upload 9 files
29 days ago
tokenizer_config.json
Safe
2.17 kB
Upload 9 files
29 days ago
vocab.json
Safe
2.78 MB
Upload 9 files
29 days ago