PseudoTerminal X
commited on
Commit
•
a409528
1
Parent(s):
b372e71
Trained for 0 epochs and 10000 steps.
Browse filesTrained with datasets ['text-embeds-pixart-filter', 'photo-concept-bucket', 'text-1mp', 'ideogram', 'midjourney-v6-520k-raw', 'sfwbooru', 'nijijourney-v6-520k-raw', 'dalle3']
Learning rate 1e-06, batch size 24, and 1 gradient accumulation steps.
Used DDPM noise scheduler for training with epsilon prediction type and rescaled_betas_zero_snr=False
Using 'trailing' timestep spacing.
Base model: terminusresearch/pixart-900m-1024-ft-v0.6
VAE: madebyollin/sdxl-vae-fp16-fix
- README.md +9 -1
- optimizer.bin +1 -1
- random_states_0.pkl +2 -2
- scheduler.bin +1 -1
- training_state-dalle3.json +2 -2
- training_state-ideogram.json +0 -0
- training_state-midjourney-v6-520k-raw.json +0 -0
- training_state-nijijourney-v6-520k-raw.json +0 -0
- training_state-photo-concept-bucket.json +0 -0
- training_state-sfwbooru.json +0 -0
- training_state-text-1mp.json +0 -0
- training_state.json +1 -1
- transformer/config.json +1 -1
- transformer/diffusion_pytorch_model.safetensors +1 -1
README.md
CHANGED
@@ -1562,7 +1562,7 @@ You may reuse the base model text encoder for inference.
|
|
1562 |
## Training settings
|
1563 |
|
1564 |
- Training epochs: 0
|
1565 |
-
- Training steps:
|
1566 |
- Learning rate: 1e-06
|
1567 |
- Effective batch size: 192
|
1568 |
- Micro-batch size: 24
|
@@ -1585,6 +1585,14 @@ You may reuse the base model text encoder for inference.
|
|
1585 |
- Cropped: True
|
1586 |
- Crop style: random
|
1587 |
- Crop aspect: square
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1588 |
### ideogram
|
1589 |
- Repeats: 15
|
1590 |
- Total number of images: ~36096
|
|
|
1562 |
## Training settings
|
1563 |
|
1564 |
- Training epochs: 0
|
1565 |
+
- Training steps: 10000
|
1566 |
- Learning rate: 1e-06
|
1567 |
- Effective batch size: 192
|
1568 |
- Micro-batch size: 24
|
|
|
1585 |
- Cropped: True
|
1586 |
- Crop style: random
|
1587 |
- Crop aspect: square
|
1588 |
+
### text-1mp
|
1589 |
+
- Repeats: 15
|
1590 |
+
- Total number of images: ~13056
|
1591 |
+
- Total number of aspect buckets: 1
|
1592 |
+
- Resolution: 1.0 megapixels
|
1593 |
+
- Cropped: True
|
1594 |
+
- Crop style: random
|
1595 |
+
- Crop aspect: square
|
1596 |
### ideogram
|
1597 |
- Repeats: 15
|
1598 |
- Total number of images: ~36096
|
optimizer.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5451415117
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:02b0d581baaedf9a359931ce85b0bc1ff20b03ee0974e26817b4d4ea6c24d943
|
3 |
size 5451415117
|
random_states_0.pkl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:241516161f28420a23ee30cdbe589d5577647a66468942582715acd0fb9aacd3
|
3 |
+
size 16100
|
scheduler.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1000
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ec3386c1a1e3165e95e087cc5fed219f9a936cfdaf927dc9d05a2255eb24e135
|
3 |
size 1000
|
training_state-dalle3.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:da0b6577c4e6655c3f3c18f4c249ac901435373d3ab7c2d002c8a75ffba40abb
|
3 |
+
size 11809752
|
training_state-ideogram.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-midjourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-nijijourney-v6-520k-raw.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-photo-concept-bucket.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-sfwbooru.json
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
training_state-text-1mp.json
ADDED
The diff for this file is too large to render.
See raw diff
|
|
training_state.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"global_step":
|
|
|
1 |
+
{"global_step": 10000, "epoch_step": 10000, "epoch": 1, "exhausted_backends": ["sfwbooru"], "repeats": {"ideogram": 8, "sfwbooru": 0, "text-1mp": 2}}
|
transformer/config.json
CHANGED
@@ -1,7 +1,7 @@
|
|
1 |
{
|
2 |
"_class_name": "PixArtTransformer2DModel",
|
3 |
"_diffusers_version": "0.30.0.dev0",
|
4 |
-
"_name_or_path": "/home/ubuntu/training/models/checkpoint-
|
5 |
"activation_fn": "gelu-approximate",
|
6 |
"attention_bias": true,
|
7 |
"attention_head_dim": 72,
|
|
|
1 |
{
|
2 |
"_class_name": "PixArtTransformer2DModel",
|
3 |
"_diffusers_version": "0.30.0.dev0",
|
4 |
+
"_name_or_path": "/home/ubuntu/training/models/checkpoint-9000",
|
5 |
"activation_fn": "gelu-approximate",
|
6 |
"attention_bias": true,
|
7 |
"attention_head_dim": 72,
|
transformer/diffusion_pytorch_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1816969728
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d513286e10914ce5d1b124ae3ac44d58dd1f0a0acfb5c770c5bcb069699e8a79
|
3 |
size 1816969728
|