---
license: other
license_name: ml-mgie
license_link: https://github.com/apple/ml-mgie/blob/main/LICENSE.txt
---

# [ICLR'24] Guiding Instruction-based Image Editing via Multimodal Large Language Models
This repo contains [LLaVA-7B](https://huggingface.co/liuhaotian/LLaVA-Lightning-7B-delta-v1-1) and the [pre-trained MGIE checkpoint](https://docs-assets.developer.apple.com/ml-research/models/mgie/mgie_7b.tar.gz) (trained on IPr2Pr + MagicBrush) for [MGIE](https://huggingface.co/spaces/tsujuifu/ml-mgie).
<img src="https://raw.githubusercontent.com/apple/ml-mgie/main/mgie.png" width="60%" />

Please follow the [official repo](https://github.com/apple/ml-mgie) and the [ipynb](https://github.com/apple/ml-mgie/blob/main/demo.ipynb) to use it.
<img src="https://raw.githubusercontent.com/apple/ml-mgie/main/demo.png" width="60%" />
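
The MGIE checkpoint linked above is a `.tar.gz` archive, so it needs to be downloaded and unpacked before the notebook can load it. The following is a minimal sketch using only the Python standard library. The `_ckpt` destination directory and the helper names `download` and `extract` are assumptions made for illustration; check the official repo for the layout it actually expects.

```python
import tarfile
import urllib.request
from pathlib import Path

# URL copied from this model card.
MGIE_CKPT_URL = "https://docs-assets.developer.apple.com/ml-research/models/mgie/mgie_7b.tar.gz"


def download(url: str, dest: Path) -> Path:
    """Download url to dest, skipping the transfer if dest already exists."""
    dest.parent.mkdir(parents=True, exist_ok=True)
    if not dest.exists():
        urllib.request.urlretrieve(url, str(dest))
    return dest


def extract(archive: Path, out_dir: Path) -> list[str]:
    """Unpack a .tar.gz archive into out_dir and return the member names."""
    out_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tar:
        names = tar.getnames()
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions: refuse unsafe members (e.g. absolute paths).
            tar.extractall(out_dir, filter="data")
        else:
            tar.extractall(out_dir)
    return names


if __name__ == "__main__":
    # "_ckpt" is an assumed destination, not confirmed by this model card.
    archive = download(MGIE_CKPT_URL, Path("_ckpt") / "mgie_7b.tar.gz")
    for name in extract(archive, Path("_ckpt")):
        print(name)
```

Because `download` skips files that already exist, rerunning the script after an interrupted extraction does not fetch the archive again; delete a partially downloaded file manually before retrying.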

```
@inproceedings{fu2024mgie,
  author = {Tsu-Jui Fu and Wenze Hu and Xianzhi Du and William Yang Wang and Yinfei Yang and Zhe Gan},
  title = {{Guiding Instruction-based Image Editing via Multimodal Large Language Models}}, 
  booktitle = {International Conference on Learning Representations (ICLR)}, 
  year = {2024} 
}
```