File size: 750 Bytes
8d5b29a 640f8d4 8d5b29a 640f8d4 8d5b29a 640f8d4 8d5b29a 640f8d4 8d5b29a 640f8d4 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
---
library_name: diffusers
---
# MGIE
This repository contains the UNet and LLaVA model checkpoints from [Guiding Instruction-based Image Editing via Multimodal Large Language Models](https://arxiv.org/abs/2309.17102).
For a detailed example of usage, refer to [this notebook](https://github.com/apple/ml-mgie/blob/main/demo.ipynb) and the [official repository](https://github.com/apple/ml-mgie).
## Citation
```
@inproceedings{fu2024mgie,
author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang, and Zhe Gan},
title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}},
booktitle = {International Conference on Learning Representations (ICLR)},
year = {2024}
}
``` |