pbevan11
/

stable-diffusion-2-typography

StableDiffusionPipeline

Inference Endpoints

Model card Files Files and versions Community

pbevan11 commited on Mar 26, 2024

Commit

52579e2

·

verified ·

1 Parent(s): 30a9831

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -4,7 +4,11 @@ license: openrail++
 This is a finetuned version of [stabilityai/stable-diffusion-2-base](https://huggingface.co/stabilityai/stable-diffusion-2-base), optimised for outputting English text.
-The model is finetuned for 70 epochs on [pbevan11/GPT4V-captions-from-LVIS-typography](https://huggingface.co/datasets/pbevan11/GPT4V-captions-from-LVIS-typography), a curated dataset of image-caption pairs from LVIS with detailed transcriptions of the text present in the image.
 # Citation

 This is a finetuned version of [stabilityai/stable-diffusion-2-base](https://huggingface.co/stabilityai/stable-diffusion-2-base), optimised for outputting English text.
+The model is finetuned for 20 epochs on [pbevan11/GPT4V-captions-from-LVIS-typography](https://huggingface.co/datasets/pbevan11/GPT4V-captions-from-LVIS-typography), a curated dataset of image-caption pairs from LVIS with detailed transcriptions of the text present in the image. We trained with a learning rate of 5e-6.
+---
+![Image generation model spelling comparison](model_comparison.png)
 # Citation