Commit: replace image

Files changed:
- README.md (+22, -9)
- example/result/2.png
- example/result/3.png
- flux_inpaint_garment.png
- flux_inpaint_tryon.png
- tryon_inference.py (+4, -1)
README.md CHANGED

@@ -1,17 +1,19 @@
 # catvton-flux
 
 An advanced virtual try-on solution that combines the power of [CATVTON](https://arxiv.org/abs/2407.15886) (Contrastive Appearance and Topology Virtual Try-On) with the Flux fill inpainting model for realistic and accurate clothing transfer.
+Also inspired by [In-Context LoRA](https://arxiv.org/abs/2410.23775) for prompt engineering.
 
 ## Showcase
-| Original | Result |
-|----------|--------|
-| ![Original](example/person/1.jpg) | ![Result](example/result/1.png) |
-| ![Original](example/person/
-| ![Original](example/person/00008_00.jpg) | ![Result](example/result/3.png) |
+| Original | Garment | Result |
+|----------|---------|---------|
+| ![Original](example/person/1.jpg) | ![Garment](example/garment/1.jpg) | ![Result](example/result/1.png) |
+| ![Original](example/person/1.jpg) | ![Garment](example/garment/04564_00.jpg) | ![Result](example/result/2.png) |
+| ![Original](example/person/00008_00.jpg) | ![Garment](example/garment/00034_00.jpg) | ![Result](example/result/3.png) |
 
 ## Model Weights
+Hugging Face: 🤗 [catvton-flux-alpha](https://huggingface.co/xiaozaa/catvton-flux-alpha)
+
 The model weights are trained on the [VITON-HD](https://github.com/shadow2496/VITON-HD) dataset.
-🤗 [catvton-flux-alpha](https://huggingface.co/xiaozaa/catvton-flux-alpha)
 
 ## Prerequisites
 ```bash

@@ -39,9 +41,20 @@ python tryon_inference.py \
 ## Citation
 
 ```bibtex
-@misc{
-  title={
-  author={
+@misc{chong2024catvtonconcatenationneedvirtual,
+  title={CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models},
+  author={Zheng Chong and Xiao Dong and Haoxiang Li and Shiyue Zhang and Wenqing Zhang and Xujie Zhang and Hanqing Zhao and Xiaodan Liang},
+  year={2024},
+  eprint={2407.15886},
+  archivePrefix={arXiv},
+  primaryClass={cs.CV},
+  url={https://arxiv.org/abs/2407.15886},
+}
+@article{lhhuang2024iclora,
+  title={In-Context LoRA for Diffusion Transformers},
+  author={Huang, Lianghua and Wang, Wei and Wu, Zhi-Fan and Shi, Yupeng and Dou, Huanzhang and Liang, Chen and Feng, Yutong and Liu, Yu and Zhou, Jingren},
+  journal={arXiv preprint arxiv:2410.23775},
+  year={2024}
 }
 ```
example/result/2.png CHANGED
example/result/3.png CHANGED
flux_inpaint_garment.png ADDED
flux_inpaint_tryon.png ADDED
tryon_inference.py CHANGED

@@ -97,6 +97,8 @@ def main():
     parser.add_argument('--steps', type=int, default=50, help='Number of inference steps')
     parser.add_argument('--guidance-scale', type=float, default=30, help='Guidance scale')
     parser.add_argument('--seed', type=int, default=0, help='Random seed')
+    parser.add_argument('--width', type=int, default=768, help='Width')
+    parser.add_argument('--height', type=int, default=576, help='Height')
 
     args = parser.parse_args()
 
@@ -110,7 +112,8 @@ def main():
         output_tryon_path=args.output_tryon,
         num_steps=args.steps,
         guidance_scale=args.guidance_scale,
-        seed=args.seed
+        seed=args.seed,
+        size=(args.width, args.height)
     )
     print("Successfully saved garment and try-on images")
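Read together, the two hunks expose the output resolution as CLI flags and forward it, along with the seed, into the inference call as a `size` tuple. As a minimal sketch, an invocation using the new flags might look like the one below; the values simply restate the diff's defaults, and the script's input/output arguments (person image, garment image, output paths) are omitted because this diff does not show them.

```bash
# Sketch of a run with the new resolution flags (values are the defaults
# from the diff). The required input/output flags exist elsewhere in the
# script and are not shown in this commit, so they are left out here.
python tryon_inference.py \
    --steps 50 \
    --guidance-scale 30 \
    --seed 0 \
    --width 768 \
    --height 576
```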