Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,25 @@ language:
|
|
8 |
|
9 |
* [2024.05.28] π Our RLAIF-V paper is accesible at [arxiv](https://arxiv.org/abs/2405.17220) now!
|
10 |
* [2024.05.20] π We introduce [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), our new alignment framework that utilize open-source models for feedback generation and reach **super GPT-4V trustworthiness**. You can download the corresponding [dataset](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset) and models ([7B](https://huggingface.co/openbmb/RLAIF-V-7B), [12B](https://huggingface.co/openbmb/RLAIF-V-12B)) now!
|
11 |
-
* [2024.04.11] π₯ Our data is used in [MiniCPM-V 2.0](https://huggingface.co/openbmb/MiniCPM-V-2), an **end-side** multimodal large language model that exhibits **comparable trustworthiness with GPT-4V**!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
8 |
|
9 |
* [2024.05.28] π Our RLAIF-V paper is accesible at [arxiv](https://arxiv.org/abs/2405.17220) now!
|
10 |
* [2024.05.20] π We introduce [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), our new alignment framework that utilize open-source models for feedback generation and reach **super GPT-4V trustworthiness**. You can download the corresponding [dataset](https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset) and models ([7B](https://huggingface.co/openbmb/RLAIF-V-7B), [12B](https://huggingface.co/openbmb/RLAIF-V-12B)) now!
|
11 |
+
* [2024.04.11] π₯ Our data is used in [MiniCPM-V 2.0](https://huggingface.co/openbmb/MiniCPM-V-2), an **end-side** multimodal large language model that exhibits **comparable trustworthiness with GPT-4V**!
|
12 |
+
|
13 |
+
|
14 |
+
## Citation
|
15 |
+
|
16 |
+
If you find this dataset helpful, please consider cite our papers π:
|
17 |
+
|
18 |
+
```
|
19 |
+
@article{yu2023rlhf,
|
20 |
+
title={Rlhf-v: Towards trustworthy mllms via behavior alignment from fine-grained correctional human feedback},
|
21 |
+
author={Yu, Tianyu and Yao, Yuan and Zhang, Haoye and He, Taiwen and Han, Yifeng and Cui, Ganqu and Hu, Jinyi and Liu, Zhiyuan and Zheng, Hai-Tao and Sun, Maosong and others},
|
22 |
+
journal={arXiv preprint arXiv:2312.00849},
|
23 |
+
year={2023}
|
24 |
+
}
|
25 |
+
|
26 |
+
@article{yu2024rlaifv,
|
27 |
+
title={RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness},
|
28 |
+
author={Tianyu Yu and Haoye Zhang and Yuan Yao and Yunkai Dang and Da Chen and Xiaoman Lu and Ganqu Cui and Taiwen He and Zhiyuan Liu and Tat-Seng Chua and Maosong Sun},
|
29 |
+
journal={arXiv preprint arXiv:2405.17220},
|
30 |
+
year={2024},
|
31 |
+
}
|
32 |
+
```
|