Trained for 1 epochs and 30000 steps.

Trained with datasets ['text-embeds-pixart-filter', 'photo-concept-bucket', 'moviecollection', 'experimental', 'ethnic', 'sports', 'architecture', 'shutterstock', 'cinemamix-1mp', 'nsfw-1024', 'anatomy', 'bg20k-1024', 'yoga', 'photo-aesthetics', 'text-1mp', 'movieposters', 'normalnudes', 'pixel-art', 'signs', 'midjourney-v6-520k-raw', 'sfwbooru', 'nijijourney-v6-520k-raw', 'dalle3']
Learning rate 1e-06, batch size 24, and 1 gradient accumulation steps.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'linspace' timestep spacing.
Base model: ptx0/pixart-900m-1024-ft-large
VAE: madebyollin/sdxl-vae-fp16-fix

Files changed (14) hide show

README.md +6 -6
optimizer.bin +1 -1
random_states_0.pkl +1 -1
scheduler.bin +1 -1
training_state-anatomy.json +0 -0
training_state-bg20k-1024.json +0 -0
training_state-dalle3.json +2 -2
training_state-midjourney-v6-520k-raw.json +2 -2
training_state-nijijourney-v6-520k-raw.json +2 -2
training_state-photo-concept-bucket.json +2 -2
training_state-sfwbooru.json +0 -0
training_state-text-1mp.json +0 -0
training_state.json +1 -1
transformer/diffusion_pytorch_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -62,7 +62,7 @@ You may reuse the base model text encoder for inference.
 ## Training settings
 - Training epochs: 1
-- Training steps: 29500
 - Learning rate: 1e-06
 - Effective batch size: 192
   - Micro-batch size: 24
@@ -80,7 +80,7 @@ You may reuse the base model text encoder for inference.
 ### photo-concept-bucket
 - Repeats: 0
 - Total number of images: ~564672
-- Total number of aspect buckets: 13
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -224,7 +224,7 @@ You may reuse the base model text encoder for inference.
 ### midjourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~513792
-- Total number of aspect buckets: 15
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -232,7 +232,7 @@ You may reuse the base model text encoder for inference.
 ### sfwbooru
 - Repeats: 0
 - Total number of images: ~271488
-- Total number of aspect buckets: 35
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -240,7 +240,7 @@ You may reuse the base model text encoder for inference.
 ### nijijourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~516288
-- Total number of aspect buckets: 11
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
@@ -248,7 +248,7 @@ You may reuse the base model text encoder for inference.
 ### dalle3
 - Repeats: 0
 - Total number of images: ~1119168
-- Total number of aspect buckets: 3
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None

 ## Training settings
 - Training epochs: 1
+- Training steps: 30000
 - Learning rate: 1e-06
 - Effective batch size: 192
   - Micro-batch size: 24
 ### photo-concept-bucket
 - Repeats: 0
 - Total number of images: ~564672
+- Total number of aspect buckets: 10
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
 ### midjourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~513792
+- Total number of aspect buckets: 12
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
 ### sfwbooru
 - Repeats: 0
 - Total number of images: ~271488
+- Total number of aspect buckets: 24
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
 ### nijijourney-v6-520k-raw
 - Repeats: 0
 - Total number of images: ~516288
+- Total number of aspect buckets: 9
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None
 ### dalle3
 - Repeats: 0
 - Total number of images: ~1119168
+- Total number of aspect buckets: 2
 - Resolution: 1.0 megapixels
 - Cropped: False
 - Crop style: None

optimizer.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ae0eb6b041a0028b3a24d48874d631286d19e8b33e968ed0643a9b4659d2dd51
 size 5451415117

 version https://git-lfs.github.com/spec/v1
+oid sha256:6c5c3524172c5c7fdc86997bff9691007c354427e86a0fb6f21ac27156d3c7c0
 size 5451415117

random_states_0.pkl CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:360f68542b7351a488cde608cb684f61ba4d5fe6a08f52fa2651ae63d26f2604
 size 16100

 version https://git-lfs.github.com/spec/v1
+oid sha256:1e7465fd065b49dd9616b627da943c04583757ac9db86da61e68736c4e9921d2
 size 16100

scheduler.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b1a9b5b20185dcbb5340513578a6ce11f5f1d73ebc0940e7fb0827717d4e77b4
 size 1000

 version https://git-lfs.github.com/spec/v1
+oid sha256:f99930a90f196bd6f4e7e8666e4021fe8eb7bc320303852641ddb77d29e24739
 size 1000

training_state-anatomy.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_state-bg20k-1024.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_state-dalle3.json CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:73319ac6942a41bd35538d940e32414caf4afcc958bc6755ab2c031db7e2b1f4
-size 9073645

 version https://git-lfs.github.com/spec/v1
+oid sha256:5a4d148e6371f4ccac84b02848b7bc90fc9b7c8ca4a032c1a172ef4ccbb8f4ce
+size 9189686

training_state-midjourney-v6-520k-raw.json CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:82e5d11c71fa63ce38b568d6f1c2056f3ba4236e9547b631c35576169e7eb1f7
-size 6623991

 version https://git-lfs.github.com/spec/v1
+oid sha256:763440b8d650fd9a6706c21b9222b91f9140b53fdd78ae43fac5ad1904af6e55
+size 6784671

training_state-nijijourney-v6-520k-raw.json CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:66ee4d1c0ce8716bc3e907d2072ff1ad8a33017cf3625622343f92242a24e88a
-size 7119331

 version https://git-lfs.github.com/spec/v1
+oid sha256:165d3127ae2f24e4dc30c7c829b623ebd9e5ef067af45b706a6750df0ae572eb
+size 7269811

training_state-photo-concept-bucket.json CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b1878fde59c5028a4d28030c43b7c04a4f7d1ba9d099779a8c408f47afc80932
-size 5541572

 version https://git-lfs.github.com/spec/v1
+oid sha256:f4b0eb763811d8bbf45bed1944acf6bea69884b0e3aecccfa4fe1daca2d2636c
+size 5688858

training_state-sfwbooru.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_state-text-1mp.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_state.json CHANGED Viewed

@@ -1 +1 @@

- {"global_step": ~~29500~~, "epoch_step": 1, "epoch": 2, "exhausted_backends": ["pixel-art", "signs", "sports", "ethnic", "experimental", "movieposters", "normalnudes", "yoga", "cinemamix-1mp", "architecture", "moviecollection", "shutterstock", "nsfw-1024", "photo-aesthetics"], "repeats": {"bookcovers": 0, "signs": 0, "normalnudes": 0, "nijijourney": 0, "movieposters": 0, "celebrities": 0, "pixel-art": 0, "propagandaposters": 0, "sports": 0, "moviecollection": 0, "gay": 0, "experimental": 0, "yoga": 0, "ethnic": 0, "cinemamix-1mp": 0, "architecture": 0, "mj-60": 0, "text-1mp": 2, "shutterstock": 0, "nsfw-1024": 0, "photo-aesthetics": 0, "anatomy": 1, "bg20k-1024": 0, "sfwbooru": 0, "midjourney-v6-520k-raw": 0, "nijijourney-v6-520k-raw": 0, "photo-concept-bucket": 0, "dalle3": 0}}

+ {"global_step": 30000, "epoch_step": 1, "epoch": 2, "exhausted_backends": ["pixel-art", "signs", "sports", "ethnic", "experimental", "movieposters", "normalnudes", "yoga", "cinemamix-1mp", "architecture", "moviecollection", "shutterstock", "nsfw-1024", "photo-aesthetics"], "repeats": {"bookcovers": 0, "signs": 0, "normalnudes": 0, "nijijourney": 0, "movieposters": 0, "celebrities": 0, "pixel-art": 0, "propagandaposters": 0, "sports": 0, "moviecollection": 0, "gay": 0, "experimental": 0, "yoga": 0, "ethnic": 0, "cinemamix-1mp": 0, "architecture": 0, "mj-60": 0, "text-1mp": 3, "shutterstock": 0, "nsfw-1024": 0, "photo-aesthetics": 0, "anatomy": 1, "bg20k-1024": 0, "sfwbooru": 0, "midjourney-v6-520k-raw": 0, "nijijourney-v6-520k-raw": 0, "photo-concept-bucket": 0, "dalle3": 0}}

transformer/diffusion_pytorch_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d7d059800121590ddd35a4824a1a6734045ebd62667a773eef9ae94a8f0e6b0a
 size 1816969728

 version https://git-lfs.github.com/spec/v1
+oid sha256:32fb7ec01f6eb0b8e757796720ba5206c9deae585f3b699d73a98028c811f1bc
 size 1816969728