Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
xinyiW915
/
ReLaX-VQA
like
1
Visual Question Answering
5 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv:
2407.11496
License:
apache-2.0
Model card
Files
Files and versions
Community
main
ReLaX-VQA
Ctrl+K
Ctrl+K
1 contributor
History:
15 commits
Xinyi Wang
update README
045b2f8
12 days ago
metadata
first commit
13 days ago
model
Upload model
13 days ago
src
first commit
13 days ago
ugc_original_videos
first commit
13 days ago
.gitattributes
Safe
1.6 kB
first commit
13 days ago
.gitignore
78 Bytes
Update
13 days ago
Framework.png
18.9 MB
LFS
first commit
13 days ago
README.md
9.52 kB
update README
12 days ago
reported_result.ipynb
66.8 kB
first commit
13 days ago
requirements.txt
2.57 kB
first commit
13 days ago