|
--- |
|
library_name: diffusers |
|
--- |
|
|
|
# MGIE |
|
|
|
This repository contains the UNet and LLaVA model checkpoints from [Guiding Instruction-based Image Editing via Multimodal Large Language Models](https://arxiv.org/abs/2309.17102). |
|
|
|
For a detailed example of usage, refer to [this notebook](https://github.com/apple/ml-mgie/blob/main/demo.ipynb) and the [official repository](https://github.com/apple/ml-mgie). |
|
|
|
## Citation |
|
|
|
``` |
|
@inproceedings{fu2024mgie, |
|
author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang, and Zhe Gan}, |
|
β title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}}, |
|
β booktitle = {International Conference on Learning Representations (ICLR)}, |
|
β year = {2024} |
|
} |
|
``` |