IDEA-CCNL
/

Taiyi-Stable-Diffusion-XL-3.5B

StableDiffusionXLPipeline

stable-diffusion

stable-diffusion-diffusers

Model card Files Files and versions Community

wuxiaojun commited on Jan 29

Commit

d223bb9

•

1 Parent(s): c97da87

init readme

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -42,7 +42,7 @@ The training of the Taiyi-Diffusion-XL text-to-image model encompasses three mai
 Our machine evaluation involved a comprehensive comparison of various models. The evaluation metrics included CLIP Similarity (CLIP Sim), Inception Score (IS), and Fréchet Inception Distance (FID), providing a robust assessment of each model's performance in terms of image quality, diversity, and alignment with textual descriptions. In the English dataset (COCO), Taiyi-XL demonstrated superior performance across all metrics, achieving the highest scores in CLIP Sim, IS, and FID. This indicates Taiyi-XL's effectiveness in generating images closely aligned with English text prompts while maintaining high image quality and diversity. Similarly, in the Chinese dataset (COCO-CN), Taiyi-XL outperformed other models, showcasing its robust bilingual capabilities.
-#### Table: Zero-shot image-text retrieval results
 | Model | CLIP Sim($\uparrow$) | FID($\downarrow$) | IS($\uparrow$) |
 |-------|----------------------|-------------------|----------------|

 Our machine evaluation involved a comprehensive comparison of various models. The evaluation metrics included CLIP Similarity (CLIP Sim), Inception Score (IS), and Fréchet Inception Distance (FID), providing a robust assessment of each model's performance in terms of image quality, diversity, and alignment with textual descriptions. In the English dataset (COCO), Taiyi-XL demonstrated superior performance across all metrics, achieving the highest scores in CLIP Sim, IS, and FID. This indicates Taiyi-XL's effectiveness in generating images closely aligned with English text prompts while maintaining high image quality and diversity. Similarly, in the Chinese dataset (COCO-CN), Taiyi-XL outperformed other models, showcasing its robust bilingual capabilities.
+#### Table: Comparison of different models based on CLIP Sim, IS, and FID across English (COCO) and Chinese (COCO-CN) datasets
 | Model | CLIP Sim($\uparrow$) | FID($\downarrow$) | IS($\uparrow$) |
 |-------|----------------------|-------------------|----------------|