Update README.md
README.md
CHANGED
@@ -16,10 +16,6 @@ pipeline_tag: image-to-image
 
 This repo contains the checkpoint from [OneReward](https://huggingface.co/bytedance-research/OneReward) processed into a single model suitable for ComfyUI use.
 
-<p align="center">
-<img src="assets/show.jpg" alt="assert" width="800">
-</p>
-
 **OneReward** is a novel RLHF methodology for the visual domain by employing Qwen2.5-VL as a generative reward model to enhance multitask reinforcement learning, significantly improving the policy model’s generation ability across multiple subtask. Building on OneReward, **FLUX.1-Fill-dev-OneReward** - based on FLUX Fill [dev], outperforms closed-source FLUX Fill [Pro] in inpainting and outpainting tasks, serving as a powerful new baseline for future research in unified image editing.
 
 