Update README.md
Browse files
README.md
CHANGED
@@ -2,4 +2,22 @@
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
# UMT-msrvtt-7k Model Card
|
5 |
-
UMT-msrvtt-7k is the model used to compute UMTScore for the [FETV](https://github.com/llyx97/FETV) benchmark. It is initialized from the [UMT model](https://github.com/OpenGVLab/unmasked_teacher/blob/main/multi_modality/MODEL_ZOO.md) (UMT-L/16, 25M) and is fined-tuned on the 7k training split of MSR-VTT for video-text retrieval.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
# UMT-msrvtt-7k Model Card
|
5 |
+
UMT-msrvtt-7k is the model used to compute UMTScore for the [FETV](https://github.com/llyx97/FETV) benchmark. It is initialized from the [UMT model](https://github.com/OpenGVLab/unmasked_teacher/blob/main/multi_modality/MODEL_ZOO.md) (UMT-L/16, 25M) and is fined-tuned on the 7k training split of MSR-VTT for video-text retrieval.
|
6 |
+
|
7 |
+
# Citation
|
8 |
+
```bibtex
|
9 |
+
@article{liu2023fetv,
|
10 |
+
title = {FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation},
|
11 |
+
author = {Yuanxin Liu and Lei Li and Shuhuai Ren and Rundong Gao and Shicheng Li and Sishuo Chen and Xu Sun and Lu Hou},
|
12 |
+
year = {2023},
|
13 |
+
journal = {arXiv preprint arXiv: 2311.01813}
|
14 |
+
}
|
15 |
+
```
|
16 |
+
```
|
17 |
+
@misc{li2023unmasked,
|
18 |
+
title={Unmasked Teacher: Towards Training-Efficient Video Foundation Models},
|
19 |
+
author={Kunchang Li and Yali Wang and Yizhuo Li and Yi Wang and Yinan He and Limin Wang and Yu Qiao},
|
20 |
+
year = {2023}
|
21 |
+
journal = {arXiv preprint arXiv: 2303.16058}
|
22 |
+
}
|
23 |
+
```
|