PseudoTerminal X commited on
Commit
68ebd3f
1 Parent(s): 8a81cbe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -6
README.md CHANGED
@@ -2,7 +2,7 @@
2
  license: openrail++
3
  ---
4
 
5
- # Terminus XL Gamma
6
 
7
  ## Model Details
8
 
@@ -10,13 +10,14 @@ license: openrail++
10
 
11
  Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses zero-terminal SNR noise schedule and velocity prediction objective at training and inference time.
12
 
13
- Terminus is based on the same architecture as SDXL, and has the same layout. It has been trained on fewer steps with very high quality data captions via COCO and Midjourney.
14
 
15
  This model will not be capable of as many concepts as SDXL, and some subjects will simply look very bad.
16
 
17
- The objective of this model was to use min-SNR gamma loss to efficiently train a full model on a single A100-80G.
18
 
19
 
 
20
  - **Developed by:** pseudoterminal X (@bghira)
21
  - **Funded by:** pseudoterminal X (@bghira)
22
  - **Model type:** Latent Diffusion
@@ -64,11 +65,13 @@ This model's success largely depended on a somewhat small collection of very hig
64
 
65
  #### Preprocessing
66
 
67
- Followed SDXL's pretraining procedure using crop conditional inputs and centre-cropped images with their full size as the input.
68
 
69
- Trained on 512x512, followed by 768x768, and finally, ~1 megapixel multi-aspect training for the rest of the training time.
70
 
71
- Images were downsampled while maintaining aspect ratio and cropped on 64 pixel increments. Many aspect ratios were trained, but only a few are likely to work fully.
 
 
72
 
73
  #### Training Hyperparameters
74
 
 
2
  license: openrail++
3
  ---
4
 
5
+ # Terminus XL Gamma (v2 preview)
6
 
7
  ## Model Details
8
 
 
10
 
11
  Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses zero-terminal SNR noise schedule and velocity prediction objective at training and inference time.
12
 
13
+ Terminus is based on a similar architecture to SDXL, and has the same layout. It has been trained on fewer steps with very high quality data captions via COCO and Midjourney.
14
 
15
  This model will not be capable of as many concepts as SDXL, and some subjects will simply look very bad.
16
 
17
+ The objective of this model was to use v-prediction and min-SNR gamma loss to efficiently train a full zero-terminal SNR model on a single A100-80G.
18
 
19
 
20
+ - **Fine-tuned from:** ptx0/terminus-xl-gamma-v1
21
  - **Developed by:** pseudoterminal X (@bghira)
22
  - **Funded by:** pseudoterminal X (@bghira)
23
  - **Model type:** Latent Diffusion
 
65
 
66
  #### Preprocessing
67
 
68
+ Most of the existing process for terminus-xl-gamma-v1 was followed, with the exception of training extensively on cropped images using SDXL's crop coordinates to improve fine details.
69
 
70
+ No images were upsampled during this training session. Images were downsampled using LANCZOS instead of BICUBIC filters to attain higher image fidelity and maintain more image context for the model to learn from.
71
 
72
+ Only high-quality photos were used in this training session, greatly improving the realism qualities.
73
+
74
+ ~770,000 images were used for this training run.
75
 
76
  #### Training Hyperparameters
77