Update README.md
README.md
CHANGED
@@ -4,7 +4,11 @@ license: openrail++

 This is a finetuned version of [stabilityai/stable-diffusion-2-base](https://huggingface.co/stabilityai/stable-diffusion-2-base), optimised for outputting English text.

-The model is finetuned for
+The model is finetuned for 20 epochs on [pbevan11/GPT4V-captions-from-LVIS-typography](https://huggingface.co/datasets/pbevan11/GPT4V-captions-from-LVIS-typography), a curated dataset of image-caption pairs from LVIS with detailed transcriptions of the text present in the image. We trained with a learning rate of 5e-6.
+
+---
+
+![Image generation model spelling comparison](model_comparison.png)


 # Citation
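
The settings named in the added README line (base model, dataset, 20 epochs, learning rate 5e-6) could be captured in a small config object. The following is a minimal sketch, not the author's training script: the `FinetuneConfig` class and its `total_steps` helper are hypothetical, and the dataset size and batch size in the usage line are illustrative values only, since the README states neither.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hypothetical container for the settings named in the README."""

    base_model: str = "stabilityai/stable-diffusion-2-base"
    dataset: str = "pbevan11/GPT4V-captions-from-LVIS-typography"
    epochs: int = 20             # stated in the README
    learning_rate: float = 5e-6  # stated in the README

    def total_steps(self, num_examples: int, batch_size: int) -> int:
        """Optimizer steps over all epochs, counting a final partial batch."""
        if num_examples <= 0 or batch_size <= 0:
            raise ValueError("num_examples and batch_size must be positive")
        return self.epochs * math.ceil(num_examples / batch_size)


config = FinetuneConfig()
# Assumed numbers for illustration; the real dataset size and batch size
# are not given in the README.
print(config.total_steps(num_examples=1000, batch_size=16))
```

Keeping the epoch count and learning rate in one frozen object makes it harder for a training run to drift from what the README documents.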