xinyiW915/ReLaX-VQA
Visual Question Answering
5 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv: 2407.11496
License: apache-2.0
main
ReLaX-VQA/metadata
1 contributor
History: 1 commit
Xinyi Wang, first commit, 211b431, 7 days ago
Name                              Size        Last commit     Updated
greyscale_report                  -           first commit    7 days ago
mos_files                         -           first commit    7 days ago
CVD_2014_metadata.csv             21 kB       first commit    7 days ago
KONVID_1K_metadata.csv            67.7 kB     first commit    7 days ago
LIVE_QUALCOMM_metadata.csv        22 kB       first commit    7 days ago
LIVE_VQC_metadata.csv             33.4 kB     first commit    7 days ago
LSVQ_TEST_1080P_metadata.csv      288 kB      first commit    7 days ago
LSVQ_TEST_metadata.csv            655 kB      first commit    7 days ago
LSVQ_TRAIN_metadata.csv           2.55 MB     first commit    7 days ago
YOUTUBE_UGC_1080P_metadata.csv    40.5 kB     first commit    7 days ago
YOUTUBE_UGC_2160P_metadata.csv    14.6 kB     first commit    7 days ago
YOUTUBE_UGC_360P_metadata.csv     30.2 kB     first commit    7 days ago
YOUTUBE_UGC_480P_metadata.csv     36.2 kB     first commit    7 days ago
YOUTUBE_UGC_720P_metadata.csv     34.7 kB     first commit    7 days ago
YOUTUBE_UGC_metadata.csv          147 kB      first commit    7 days ago
test_videos.csv                   117 Bytes   first commit    7 days ago
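The files listed here can be fetched by URL. The following is a minimal sketch, assuming the Hub's usual `resolve` URL layout for model repositories. The repository id, branch and folder are taken from this page. The helper names `metadata_url` and `read_header` are hypothetical. The column layout of the CSV files is not shown on this page, so the sketch only reads the header row rather than assuming any column names.

```python
from urllib.parse import quote

REPO_ID = "xinyiW915/ReLaX-VQA"  # repository shown on this page
REVISION = "main"                # branch shown in the listing
FOLDER = "metadata"              # directory being listed


def metadata_url(filename: str) -> str:
    """Build the Hub 'resolve' URL for one file in the metadata folder."""
    path = f"{FOLDER}/{filename}"
    return f"https://huggingface.co/{REPO_ID}/resolve/{REVISION}/{quote(path)}"


def read_header(filename: str) -> list[str]:
    """Download a CSV and return its header row (requires network access)."""
    import csv
    import io
    import urllib.request

    with urllib.request.urlopen(metadata_url(filename)) as resp:
        text = resp.read().decode("utf-8")
    # Column names are not documented in the listing, so only report them.
    return next(csv.reader(io.StringIO(text)))


if __name__ == "__main__":
    print(metadata_url("KONVID_1K_metadata.csv"))
```

Calling `read_header("KONVID_1K_metadata.csv")` should then show which columns a given metadata file provides before any further processing is written against it.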