Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ inference: true
|
|
9 |
|
10 |
# Aligned Diffusion Model via DDPO
|
11 |
|
12 |
-
Diffusion
|
13 |
```
|
14 |
close-sourced vlm: claude3-opus gpt-4o gpt-4v
|
15 |
```
|
@@ -38,12 +38,12 @@ image = pipeline(prompt=prompt, generator=generator, guidance_scale=5).images[0]
|
|
38 |
## Citation
|
39 |
```
|
40 |
@misc{chen2024mjbenchmultimodalrewardmodel,
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
|
47 |
-
|
48 |
}
|
49 |
```
|
|
|
9 |
|
10 |
# Aligned Diffusion Model via DDPO
|
11 |
|
12 |
+
Diffusion model aligned with the following reward models and Denoising Diffusion Policy Optimization (DDPO) algorithm
|
13 |
```
|
14 |
close-sourced vlm: claude3-opus gpt-4o gpt-4v
|
15 |
```
|
|
|
38 |
## Citation
|
39 |
```
|
40 |
@misc{chen2024mjbenchmultimodalrewardmodel,
|
41 |
+
title={MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?},
|
42 |
+
author={Zhaorun Chen and Yichao Du and Zichen Wen and Yiyang Zhou and Chenhang Cui and Zhenzhen Weng and Haoqin Tu and Chaoqi Wang and Zhengwei Tong and Qinglan Huang and Canyu Chen and Qinghao Ye and Zhihong Zhu and Yuqing Zhang and Jiawei Zhou and Zhuokai Zhao and Rafael Rafailov and Chelsea Finn and Huaxiu Yao},
|
43 |
+
year={2024},
|
44 |
+
eprint={2407.04842},
|
45 |
+
archivePrefix={arXiv},
|
46 |
+
primaryClass={cs.CV},
|
47 |
+
url={https://arxiv.org/abs/2407.04842},
|
48 |
}
|
49 |
```
|