ACE-0.6B-512px / README.md
nielsr's picture
nielsr HF staff
Add pipeline tag
6bc3ff8 verified
|
raw
history blame
1.33 kB
metadata
license: apache-2.0
language:
  - en
pipeline_tag: image-to-image
tags:
  - Diffusion Transformer
  - Image Editing
  - Scepter
  - ACE

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer

Tongyi Lab, Alibaba Group

ACE is a unified foundational model framework that supports a wide range of visual generation tasks. By defining CU for unifying multi-modal inputs across different tasks and incorporating long-context CU, we introduce historical contextual information into visual generation tasks, paving the way for ChatGPT-like dialog systems in visual generation.

BibTeX

@article{han2024ace,
  title={ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer},
  author={Han, Zhen and Jiang, Zeyinzi and Pan, Yulin and Zhang, Jingfeng and Mao, Chaojie and Xie, Chenwei and Liu, Yu and Zhou, Jingren},
  journal={arXiv preprint arXiv:2410.00086},
  year={2024}
}