rain1011/LaVIT-7B-v2

rain1011 committed on Nov 18, 2023

Commit 2cc9aca • 1 Parent(s): 38f9920

Update README.md

Files changed (1)

README.md +1 -1

@@ -4,7 +4,7 @@ pipeline_tag: text-to-image
 ---
 # LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
-This is the latest version (LaVITv2) for the multi-modal large language model: **LaVIT**.
+This is the latest version (LaVITv2) for the multi-modal large language model: **LaVIT**. The inference code of LaVIT can be found in [here](https://github.com/jy0205/LaVIT).
 In this version, We further improve LaVIT's image generation capability. In the updated version, the **aesthetic** and **prompt-alignment** of generated images has been improved. The **probability of watermark** is also greatly reduced. The improvements are summarized as follows:
   * Using LaVIT to generate better synthetic captions for the noisy Laion-Aesthetic (Like DALL-E 3).