Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoLLaMA2.1-7B-AV
like
13
Follow
Language Technology Lab at Alibaba DAMO Academy
49
Visual Question Answering
Transformers
Safetensors
lmms-lab/ClothoAQA
Loie/VGGSound
English
videollama2_qwen2
text-generation
Audio-visual Question Answering
Audio Question Answering
multimodal large language model
Inference Endpoints
arxiv:
2406.07476
arxiv:
2306.02858
License:
apache-2.0
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
main
VideoLLaMA2.1-7B-AV
Commit History
Update README.md
d944d42
verified
YifeiXin
commited on
Oct 25
Update README.md
b9c58e1
verified
lixin4ever
commited on
Oct 22
Update README.md
fba52ca
verified
lixin4ever
commited on
Oct 22
Update README.md
4c84984
verified
lixin4ever
commited on
Oct 22
Update README.md
c7e14fd
verified
YifeiXin
commited on
Oct 22
add VideoLLaMA2.1-AV model
acd1625
阔毅
commited on
Oct 21
initial commit
eacaf06
verified
YifeiXin
commited on
Oct 21