Update README.md
Browse files
README.md
CHANGED
@@ -4,7 +4,7 @@ pipeline_tag: text-to-image
|
|
4 |
---
|
5 |
|
6 |
# LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
|
7 |
-
This is the latest version (LaVITv2) for the multi-modal large language model: **LaVIT**.
|
8 |
|
9 |
In this version, We further improve LaVIT's image generation capability. In the updated version, the **aesthetic** and **prompt-alignment** of generated images has been improved. The **probability of watermark** is also greatly reduced. The improvements are summarized as follows:
|
10 |
* Using LaVIT to generate better synthetic captions for the noisy Laion-Aesthetic (Like DALL-E 3).
|
|
|
4 |
---
|
5 |
|
6 |
# LaVIT: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
|
7 |
+
This is the latest version (LaVITv2) for the multi-modal large language model: **LaVIT**. The inference code of LaVIT can be found in [here](https://github.com/jy0205/LaVIT).
|
8 |
|
9 |
In this version, We further improve LaVIT's image generation capability. In the updated version, the **aesthetic** and **prompt-alignment** of generated images has been improved. The **probability of watermark** is also greatly reduced. The improvements are summarized as follows:
|
10 |
* Using LaVIT to generate better synthetic captions for the noisy Laion-Aesthetic (Like DALL-E 3).
|