Bagheera Bghira committed on Mar 14

Commit

6a094b1

•

1 Parent(s): f365259

24150 steps: mj-60, text-1mp, anatomy, cinemamix, photo-aesthetics, shutterstock, sports, yoga, n1024


Files changed (43)

README.md +323 -1
assets/.keep +0 -0
assets/banner.png +3 -0
assets/dark-base.png +3 -0
assets/dark-flex.png +3 -0
assets/dark-realism.png +3 -0
assets/ellen-base.png +3 -0
assets/ellen-flex.png +3 -0
assets/ellen-realism.png +3 -0
assets/fam-base.png +3 -0
assets/fam-flex.png +3 -0
assets/fam-realism.png +3 -0
assets/woman-base.png +3 -0
assets/woman-flex.png +3 -0
assets/woman-realism.png +3 -0
ema_unet/config.json +3 -0
ema_unet/diffusion_pytorch_model.safetensors +3 -0
feature_extractor/preprocessor_config.json +3 -0
model_index.json +3 -0
optimizer.bin +3 -0
random_states_0.pkl +3 -0
scheduler.bin +3 -0
scheduler/scheduler_config.json +3 -0
text_encoder/config.json +3 -0
text_encoder/model.safetensors +3 -0
tokenizer/merges.txt +0 -0
tokenizer/special_tokens_map.json +3 -0
tokenizer/tokenizer_config.json +3 -0
tokenizer/vocab.json +3 -0
training_state-anatomy.json +3 -0
training_state-cinemamix-1mp.json +3 -0
training_state-mj-60.json +3 -0
training_state-nsfw-1024.json +3 -0
training_state-photo-aesthetics.json +3 -0
training_state-shutterstock.json +3 -0
training_state-sports.json +3 -0
training_state-text-1mp.json +3 -0
training_state-yoga.json +3 -0
training_state.json +3 -0
unet/config.json +3 -0
unet/diffusion_pytorch_model.safetensors +3 -0
vae/config.json +3 -0
vae/diffusion_pytorch_model.safetensors +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,325 @@
 ---
-license: openrail
 ---

 ---
+license: creativeml-openrail-m
+tags:
+- stable-diffusion
+- stable-diffusion-2-1
+- text-to-image
+pinned: true
+library_name: diffusers
 ---
+# Model Card for pseudo-flex-base (1024x1024 base resolution)
+![img](assets/banner.png)
+<!-- Provide a quick summary of what the model is/does. [Optional] -->
+`stabilityai/stable-diffusion-2-1` fine-tuned on multiple aspect ratios, building on the photography model `ptx0/pseudo-real-beta`.
+## Sample images
+- **Seed**: 2695929547
+- **Steps**: 25
+- **Sampler**: DDIM, default model config settings
+- **Version**: PyTorch 2.0.1, Diffusers 0.17.1
+- **Guidance**: 9.2
+- **Guidance rescale**: 0.0
+
+| resolution      | model   |   stable diffusion             |   pseudo-flex                   |   realism-engine                 |
+|:---------------:|:-------:|:------------------------------:|:-------------------------------:|:--------------------------------:|
+| 753x1004  (3:4) | v2-1    | ![img](assets/fam-base.png)    | ![img](assets/fam-flex.png)     | ![img](assets/fam-realism.png)   |
+| 1280x720 (16:9) | v2-1    | ![img](assets/ellen-base.png)  | ![img](assets/ellen-flex.png)   | ![img](assets/ellen-realism.png) |
+| 1024x1024 (1:1) | v2-1    | ![img](assets/woman-base.png)  | ![img](assets/woman-flex.png)   | ![img](assets/woman-realism.png) |
+| 1024x1024 (1:1) | v2-1    | ![img](assets/dark-base.png)   | ![img](assets/dark-flex.png)    | ![img](assets/dark-realism.png)  |
+## Background
+The `ptx0/pseudo-real-beta` pretrained checkpoint had its unet trained for 4,200 steps and its text encoder trained for 15,600 steps at a batch size of 15 with 10 gradient accumulations, on a diverse dataset:
+* cushman (8000 kodachrome slides from 1939 to 1969)
+* midjourney v5.1-filtered (about 22,000 upscaled v5.1 images)
+* national geographic (about 3,000-4,000 images larger than 1024x768 of animals, wildlife, landscapes, history)
+* a small dataset of stock images of people vaping / smoking
+It is capable of a diverse range of photorealistic and adventurous imagery, with strong prompt coherence. However, it lacks multi-aspect capability.
+The code used to train `pseudo-real-beta` did not have aspect bucketing support. I discovered `pseudo-flex-base` by @ttj, which supported theories I had.
+## Training code
+I added thorough aspect bucketing support to my training loop's dataloader: it throws away any image under 1024x1024 and resizes every remaining image so that its smaller side is 1024. The image's aspect ratio determines the length of the other side, e.g. it is used as a multiplier for landscape images or a divisor for portrait images.
+All images in a batch have the same resolution. Different resolutions at the same aspect ratio are all resized to 1024x... or ...x1024. A 1920x1080 image becomes approximately 1820x1024.
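The resizing rule described above can be sketched in a few lines. This is a minimal illustration and not the actual SimpleTuner code: the function names `bucket_size` and `group_by_bucket` are hypothetical, and treating "under 1024x1024" as "either side shorter than 1024" is an assumption.

```python
from collections import defaultdict


def bucket_size(width, height, base=1024):
    """Return (new_width, new_height) with the smaller side equal to `base`.

    Returns None for images that are too small (assumed here to mean that
    either side is shorter than `base`).
    """
    if min(width, height) < base:
        return None
    aspect = width / height
    if aspect >= 1.0:
        # Landscape or square: the aspect ratio acts as a multiplier.
        return round(base * aspect), base
    # Portrait: the aspect ratio acts as a divisor.
    return base, round(base / aspect)


def group_by_bucket(image_sizes, base=1024):
    """Group image names by target resolution so each batch has one size."""
    buckets = defaultdict(list)
    for name, (width, height) in image_sizes.items():
        size = bucket_size(width, height, base)
        if size is not None:
            buckets[size].append(name)
    return dict(buckets)


if __name__ == "__main__":
    print(bucket_size(1920, 1080))  # the 1920x1080 example from the text
    print(group_by_bucket({"a.jpg": (1920, 1080), "b.jpg": (3840, 2160), "c.jpg": (640, 480)}))
```

Grouping by the resized dimensions, rather than by the original dimensions, is what lets a 1920x1080 and a 3840x2160 image share a batch.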
+## Starting checkpoint
+This model, `pseudo-flex-base`, was created by fine-tuning the base `stabilityai/stable-diffusion-2-1` 768 model with its text encoder frozen, for 1000 steps on 148,000 images from LAION HD, using the TEXT field as the caption.
+The effective batch size was again 150: a batch size of 15 with 10 gradient accumulation steps. This is very slow at very high resolutions; an aspect ratio of 1.5-1.7 makes each iteration take about 700 seconds on an A100 80G.
+This training took two days.
+## Text encoder swap
+At 1000 steps, the text encoder from `ptx0/pseudo-real-beta` was used experimentally with this model's unet in an attempt to resolve some residual image noise, e.g. pixelation. That worked!
+The training was restarted from ckpt 1000 with this text encoder.
+## The beginnings of wide / portrait aspect appearing
+Validation prompts began to "pull together" from 1300 to 2950 steps. Some checkpoints show regression, but these usually resolve in about 100 steps. Improvements were always present, despite regressions.
+## Degradation and dataset swap
+Once training had run for over 3000 steps on 148,000 images at a batch size of 150, images began to degrade. This is presumably due to having completed 3 repeats on all images in the set, and that's IF all images in the set had been used. Considering some of the image filters discarded about 50,000 images, we landed at 9 repeats per image on our super low learning rate.
+This caused three issues:
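The repeat counts can be checked with simple arithmetic: repeats per image equal optimizer steps times effective batch size, divided by the number of images actually used. The sketch below is only an illustration. The function name is hypothetical, and the two filtered dataset sizes are assumptions: 50,000 images remaining reproduces the figure of 9 repeats, while removing 50,000 from 148,000 (leaving 98,000) gives a lower number.

```python
def repeats_per_image(steps, effective_batch_size, num_images):
    """Average number of times each image is seen during training."""
    return steps * effective_batch_size / num_images


# Values from the text: 3000 steps at an effective batch size of 150.
full_set = repeats_per_image(3000, 150, 148_000)

# Assumption: about 50,000 images remain after filtering.
if_50k_remain = repeats_per_image(3000, 150, 50_000)

# Assumption: about 50,000 images are removed, leaving 98,000.
if_50k_removed = repeats_per_image(3000, 150, 98_000)

print(f"full set:        {full_set:.2f} repeats")
print(f"50,000 remain:   {if_50k_remain:.2f} repeats")
print(f"50,000 removed:  {if_50k_removed:.2f} repeats")
```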
+* The images were beginning to show static noise.
+* The training was taking a very long time, and each checkpoint showed little improvement.
+* Overfitting to prompt vocabulary, and a lack of generalization.
+Ergo, at 1300 steps, the decision was made to cease training on the original LAION HD dataset, and instead, train on a *new* freshly-retrieved subset of high-resolution Midjourney v5.1 data.
+This consisted of 17,800 images at a base resolution of 1024x1024, with about 700 samples in portrait and 700 samples in landscape.
+## Contrast issues
+As the checkpoint 3275 was tested, a common observation was that darker images were washed out, and brighter images seemed "meh".
+Various CFG rescale and guidance levels were tested, with the best dark images occurring around `guidance_scale=9.2` and `guidance_rescale=0.0` but they remained "washed out".
+## Dataset change number two
+A new LAION subset was prepared with unique images and no square images - just a limited collection of aspect ratios:
+* 16:9
+* 9:16
+* 2:3
+* 3:2
+This was intended to speed up the understanding of the model, and prevent overfitting on captions.
+This LAION subset contained 17,800 images, evenly distributed through aspect ratios.
+The images were then captioned using T5 Flan with BLIP2, to obtain highly accurate results.
+## Contrast fix: offset noise / SNR gamma to the rescue?
+Offset noise and SNR gamma were applied experimentally to the checkpoint **4250**:
+* `snr_gamma=5.0`
+* `noise_offset=0.2`
+* `noise_pertubation=0.1`
+Within 25 steps of training, the contrast was back, and the prompt `a solid black square` once again produced a reasonable result.
+At 50 steps of offset noise, things really seemed to "click" and `a solid black square` had the fewest deformities I've seen.
+The step 75 checkpoint was broken. The SNR gamma math resulted in numeric instability, so it was disabled. The offset noise parameters were left untouched.
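For readers unfamiliar with the two techniques, here is a minimal pure-Python sketch of their commonly published forms. It is not the training script's code: the function names are hypothetical, the defaults are taken from the parameter list above, and the weight shown is the epsilon-prediction variant of min-SNR weighting. Offset noise adds one random shift per channel on top of the usual per-pixel Gaussian noise, and min-SNR weighting scales the loss by `min(snr, gamma) / snr`.

```python
import random


def offset_noise(channels, height, width, noise_offset=0.2, rng=None):
    """Gaussian noise plus a per-channel constant shift scaled by `noise_offset`."""
    rng = rng or random.Random()
    noise = []
    for _ in range(channels):
        # One shift shared by every pixel in this channel.
        shift = noise_offset * rng.gauss(0.0, 1.0)
        noise.append(
            [[rng.gauss(0.0, 1.0) + shift for _ in range(width)] for _ in range(height)]
        )
    return noise


def min_snr_weight(snr, snr_gamma=5.0):
    """Loss weight min(snr, gamma) / snr for one timestep (epsilon-prediction form)."""
    return min(snr, snr_gamma) / snr


if __name__ == "__main__":
    sample = offset_noise(channels=4, height=2, width=2, rng=random.Random(0))
    print(len(sample), len(sample[0]), len(sample[0][0]))
    print(min_snr_weight(10.0), min_snr_weight(2.0))
```

Because the weight divides by the SNR, it is undefined when a timestep's SNR is zero and can become very large in low precision when the SNR is tiny. Whether that is the instability seen at step 75 is not stated in the text.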
+## Success! Improvement in quality and contrast.
+Similar to the text encoder swap, the images showed a marked improvement over the next several checkpoints.
+It was left to its own devices, and at step 4475, enough improvement was observed that another revision in this repository was created.
+# Status: Test release
+This model has been packaged up in a test form so that it can be thoroughly assessed by users.
+For usage, see [How to Get Started with the Model](#how-to-get-started-with-the-model).
+### It aims to solve the following issues:
+1. Generated images look like they are cropped from a larger image.
+2. Generating non-square images creates weird results, due to the model being trained on square images.
+### Limitations:
+1. It's trained on a small dataset, so its improvements may be limited.
+2. The model architecture of SD 2.1 is older than SDXL, and will not generate comparably good results.
+For the 1:1 aspect ratio, it is fine-tuned at 1024x1024, although `ptx0/pseudo-real-beta`, which it is based on, was last fine-tuned at 768x768.
+### Potential improvements:
+1. Train on a captioned dataset. This model used the TEXT field from LAION for convenience, though COCO-generated captions would be superior.
+2. Train the text encoder on large images.
+3. Enforce periodic caption dropout to help condition the model for classifier-free guidance.
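Caption dropout, the third item above, is usually implemented by replacing a caption with the empty string for a fraction of training samples, so the model also learns the unconditional prediction that classifier-free guidance relies on. The sketch below is a minimal illustration. The function name and the 10% default are assumptions, not values from this model's training.

```python
import random


def maybe_drop_caption(caption, dropout_probability=0.1, rng=random):
    """Return an empty caption with probability `dropout_probability`."""
    if rng.random() < dropout_probability:
        return ""
    return caption


if __name__ == "__main__":
    rng = random.Random(0)
    captions = ["a leopard gecko stalking a cricket"] * 1000
    dropped = sum(1 for c in captions if maybe_drop_caption(c, 0.1, rng) == "")
    print(f"dropped {dropped} of {len(captions)} captions")
```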
+#  Table of Contents
+- [Model Card for pseudo-flex-base](#model-card-for-pseudo-flex-base-1024x1024-base-resolution)
+- [Table of Contents](#table-of-contents)
+- [Model Details](#model-details)
+  - [Model Description](#model-description)
+- [Uses](#uses)
+  - [Direct Use](#direct-use)
+  - [Downstream Use [Optional]](#downstream-use-optional)
+  - [Out-of-Scope Use](#out-of-scope-use)
+- [Bias, Risks, and Limitations](#bias-risks-and-limitations)
+  - [Recommendations](#recommendations)
+- [Training Details](#training-details)
+  - [Training Data](#training-data)
+  - [Training Procedure](#training-procedure)
+    - [Preprocessing](#preprocessing)
+    - [Speeds, Sizes, Times](#speeds-sizes-times)
+- [Evaluation](#evaluation)
+  - [Testing Data, Factors & Metrics](#testing-data-factors--metrics)
+    - [Testing Data](#testing-data)
+    - [Factors](#factors)
+    - [Metrics](#metrics)
+  - [Results](#results)
+- [Model Examination](#model-examination)
+- [Environmental Impact](#environmental-impact)
+- [Technical Specifications [optional]](#technical-specifications-optional)
+  - [Model Architecture and Objective](#model-architecture-and-objective)
+  - [Compute Infrastructure](#compute-infrastructure)
+    - [Hardware](#hardware)
+    - [Software](#software)
+- [Citation](#citation)
+- [Glossary [optional]](#glossary-optional)
+- [More Information [optional]](#more-information-optional)
+- [Model Card Authors [optional]](#model-card-authors-optional)
+- [Model Card Contact](#model-card-contact)
+- [How to Get Started with the Model](#how-to-get-started-with-the-model)
+# Model Details
+## Model Description
+<!-- Provide a longer summary of what this model is/does. -->
+stable-diffusion-2-1 (stabilityai/stable-diffusion-2-1 and ptx0/pseudo-real-beta) finetuned for dynamic aspect ratios.
+finetuned resolutions:
+|    |   width |   height | aspect ratio  | images |
+|---:|--------:|---------:|:--------------|-------:|
+|  0 |    1024 |     1024 | 1:1           |  90561 |
+|  1 |    1536 |     1024 | 3:2           |   8716 |
+|  2 |    1365 |     1024 | 4:3           |   6933 |
+|  3 |    1468 |     1024 | ~3:2          |    113 |
+|  4 |    1778 |     1024 | ~5:3          |   6315 |
+|  5 |    1200 |     1024 | ~5:4          |   6376 |
+|  6 |    1333 |     1024 | ~4:3          |   2814 |
+|  7 |    1281 |     1024 | ~5:4          |     52 |
+|  8 |    1504 |     1024 | ~3:2          |    139 |
+|  9 |    1479 |     1024 | ~3:2          |     25 |
+| 10 |    1384 |     1024 | ~4:3          |   1676 |
+| 11 |    1370 |     1024 | ~4:3          |     63 |
+| 12 |    1499 |     1024 | ~3:2          |    436 |
+| 13 |    1376 |     1024 | ~4:3          |     68 |
+Other aspects were in smaller buckets. It could have been done more succinctly or carefully, but careless handling of the data was a part of the experiment parameters.
+- **Developed by:** pseudoterminal
+- **Model type:** Diffusion-based text-to-image generation model
+- **Language(s)**: English
+- **License:** creativeml-openrail-m
+- **Parent Model:** https://huggingface.co/ptx0/pseudo-real-beta
+- **Resources for more information:** More information needed
+# Uses
+- see https://huggingface.co/stabilityai/stable-diffusion-2-1
+# Training Details
+## Training Data
+- LAION HD dataset subsets
+  - https://huggingface.co/datasets/laion/laion-high-resolution
+We only used a small portion of that; see [Preprocessing](#preprocessing).
+### Preprocessing
+All pre-processing is done via the scripts in `bghira/SimpleTuner` on GitHub.
+### Speeds, Sizes, Times
+- Dataset size: 100k image-caption pairs, after filtering.
+- Hardware: 1 A100 80G GPU
+- Optimizer: 8bit Adam
+- Batch size: 150
+  - actual batch size: 15
+  - gradient_accumulation_steps: 10
+  - effective batch size: 150
+- Learning rate: Constant 4e-8 which was adjusted by reducing batch size over time.
+- Training steps: WIP (ongoing)
+- Training time: approximately 4 days (so far)
+## Results
+More information needed
+# Model Card Authors
+pseudoterminal
+# How to Get Started with the Model
+Use the code below to get started with the model.
+```python
+# Use PyTorch 2!
+import torch
+from diffusers import DiffusionPipeline
+# Any model currently on the Hugging Face Hub.
+model_id = 'ptx0/pseudo-flex-base'
+# Loads every component, including the scheduler, from the model's default config.
+pipeline = DiffusionPipeline.from_pretrained(model_id)
+pipeline.to('cuda')
+# Optimize! Remove this if you get an error.
+pipeline.unet = torch.compile(pipeline.unet)
+# Remove this if you get an error.
+torch.set_float32_matmul_precision('high')
+prompts = {
+    "woman": "a woman, hanging out on the beach",
+    "man": "a man playing guitar in a park",
+    "lion": "Explore the ++majestic beauty++ of untamed ++lion prides++ as they roam the African plains --captivating expressions-- in the wildest national geographic adventure",
+    "child": "a child flying a kite on a sunny day",
+    "bear": "best quality ((bear)) in the swiss alps cinematic 8k highly detailed sharp focus intricate fur",
+    "alien": "an alien exploring the Mars surface",
+    "robot": "a robot serving coffee in a cafe",
+    "knight": "a knight protecting a castle",
+    "menn": "a group of smiling and happy men",
+    "bicycle": "a bicycle, on a mountainside, on a sunny day",
+    "cosmic": "cosmic entity, sitting in an impossible position, quantum reality, colours",
+    "wizard": "a mage wizard, bearded and gray hair, blue  star hat with wand and mystical haze",
+    "wizarddd": "digital art, fantasy, portrait of an old wizard, detailed",
+    "macro": "a dramatic city-scape at sunset or sunrise",
+    "micro": "RNA and other molecular machinery of life",
+    "gecko": "a leopard gecko stalking a cricket"
+}
+import os
+# Create the output directory if it does not exist yet.
+os.makedirs('test', exist_ok=True)
+for shortname, prompt in prompts.items():
+    # 25 inference steps, matching the sample image settings above.
+    image = pipeline(prompt=prompt,
+        negative_prompt='malformed, disgusting, overexposed, washed-out',
+        num_inference_steps=25, generator=torch.Generator(device='cuda').manual_seed(1641421826),
+        width=1368, height=720, guidance_scale=7.5, guidance_rescale=0.3).images[0]
+    image.save(f'test/{shortname}_nobetas.png', format="PNG")
+```

assets/.keep ADDED Viewed

File without changes

assets/banner.png ADDED Viewed

Git LFS Details

SHA256: 7da5ede9eab30bf5490fd2961166bc7662127e09b68b6c23039d74e885a7eed1
Pointer size: 132 Bytes
Size of remote file: 1.11 MB

assets/dark-base.png ADDED Viewed

Git LFS Details

SHA256: bf47584aa3bfe74f709b9ee2f709ea0119bf6a7c4ca826f68c5fe6483fb27445
Pointer size: 132 Bytes
Size of remote file: 1.66 MB

assets/dark-flex.png ADDED Viewed

Git LFS Details

SHA256: 761193fdb256ff8d2848b2983720006a240f6d392085287088b69e6affed4fdd
Pointer size: 131 Bytes
Size of remote file: 601 kB

assets/dark-realism.png ADDED Viewed

Git LFS Details

SHA256: fefb8f4ee2eea2041c19b449997d14633f77c1e87bd43e0afa1e2815beb8f2c4
Pointer size: 132 Bytes
Size of remote file: 1.25 MB

assets/ellen-base.png ADDED Viewed

Git LFS Details

SHA256: 113b029ffd65f34f9da140507b91e0eabc9595160ee352975d36c70625412314
Pointer size: 132 Bytes
Size of remote file: 1.47 MB

assets/ellen-flex.png ADDED Viewed

Git LFS Details

SHA256: 2b3b4ab9d1341e618113a7f744371023f4c099b9edd62424a0e883d25dba1b3b
Pointer size: 132 Bytes
Size of remote file: 1.15 MB

assets/ellen-realism.png ADDED Viewed

Git LFS Details

SHA256: abb71d4f3338c0c78ba5132132b90e8b6c5268291d2ab487fe4175ae9dc14753
Pointer size: 132 Bytes
Size of remote file: 1.39 MB

assets/fam-base.png ADDED Viewed

Git LFS Details

SHA256: 62f3a8a9336d1c896cdb0a866280cedce2091e0b37e9c9cf321537f0eb8eac0c
Pointer size: 132 Bytes
Size of remote file: 2.12 MB

assets/fam-flex.png ADDED Viewed

Git LFS Details

SHA256: bdc6cbc43a64cada596b5f629e7ea5d0eadc7aa9e30b82bc86527e467e658369
Pointer size: 132 Bytes
Size of remote file: 1.01 MB

assets/fam-realism.png ADDED Viewed

Git LFS Details

SHA256: 3fe33b99a22cf16db508c25bcee11d0f866e9e1b5d7e5be42600a107eb5b893a
Pointer size: 132 Bytes
Size of remote file: 1.13 MB

assets/woman-base.png ADDED Viewed

Git LFS Details

SHA256: 706c673fb1c361da7adc10e03bdd2714003ccef42b120caee1a34726fcd73b16
Pointer size: 132 Bytes
Size of remote file: 1.34 MB

assets/woman-flex.png ADDED Viewed

Git LFS Details

SHA256: 5e9ed6e2935de3605c99e79df3fd12b96a6143be0e03a242eed72f2c1f6721f2
Pointer size: 131 Bytes
Size of remote file: 529 kB

assets/woman-realism.png ADDED Viewed

Git LFS Details

SHA256: 3b27ae3ecaa955e3eecc968fc736c95c36b031dffbdad9edaa7009c6cce6bdeb
Pointer size: 132 Bytes
Size of remote file: 1.2 MB

ema_unet/config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4468b6b73594408a7f51643a8f67c29d29438b3ee625f2bbd34d76332f4e97ca
+size 2029

ema_unet/diffusion_pytorch_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fc09380d13cd243be98c642b70afe8dd0e888943ab7bc54a076edfdedaa05483
+size 3463726504

feature_extractor/preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4db495644e3e5bd8fcac52f70e7fc0b413c911086021acf73ac30e5911166e95
+size 518

model_index.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:45b32dee83ba23fcf7d836f5970f8f982ede609892bd4002c89ddf61bc2469a5
+size 543

optimizer.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d0e2d2ea0c77abd46362e23dcea4f54fc550ff68efe96a84314f9b5f918a6a4a
+size 6927867604

random_states_0.pkl ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:352a0b48037d2149964afd7f9885e50ac94d559f8dcce75f1f47aca9d31a243a
+size 14344

scheduler.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:da71f151a50994e7de5cbae45eafb922ef01bd084c5a3c45f65f9dc023e4ce1d
+size 1000

scheduler/scheduler_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f86909bc657068f979a38b6533d35dee417a4b08f2ae1b085bb991ed7685cb18
+size 346

text_encoder/config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ab32fb24abe9e07afbb4d30cb6143c2ebde483dd209d59740d164a5d28f6c376
+size 701

text_encoder/model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:012bcb2fc34c59837cf9f13d28da60f45ddebb1ee0f0455e08d04a4735016045
+size 1361597016

tokenizer/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f118ab3a983206e4f32583448de6bd6aae4ee21869135cef1f5848a753cdaab6
+size 460

tokenizer/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:19d7b034cb0cc3ce9766c2231373ab8aa8991fc72e2c8f76558bfaae3de0d563
+size 737

tokenizer/vocab.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e089ad92ba36837a0d31433e555c8f45fe601ab5c221d4f607ded32d9f7a4349
+size 1059962

training_state-anatomy.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e148b0c288a31437a99f227900ac4ad7a7eced86c3a440ad3748e49d7bf971a0
+size 2082031

training_state-cinemamix-1mp.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:57757f3c983bd49a59369a585140f35d24388a107559b4b7a9de99194f0e20ae
+size 1055585

training_state-mj-60.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4fa7baf925f95f9519321101dd763bff0906548754bb1cfa52b719317b996914
+size 45936912

training_state-nsfw-1024.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d6976e3789a9eee0127e140e5970b0ac5be9628d04333ce715fe1fc1672054a2
+size 782526

training_state-photo-aesthetics.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f998606ecaec6243a0b982d3dbba63829874d44f564a32658bcd86711c3fbe90
+size 6948163

training_state-shutterstock.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ba6652798531e37f909500ca9643eab46b8608d82801eebc62046d158e16b772
+size 3263336

training_state-sports.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:010b9de55c47e7ce80edc785f0e331e9d72b8a0e47e7dc09d4c0a9e2e72b2841
+size 117672

training_state-text-1mp.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7dc37cdcb2512c8d486e72188196e77a722ea49f283ea6795b7ea3819b0a3e2c
+size 1536598

training_state-yoga.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:98e2e545c6e8fc215e04d8815a0a06e8eeabe3e03c3d172ccd26b3b8e5d64968
+size 538833

training_state.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e394183cbd5c365fb4e0c51db3c8ff939f379c7df3d26eb4dc8a80f59212458
+size 310

unet/config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3197c08397f77e403b54b8bc16707b7e06d0a9c1d88e13419c5abeaa982e22b8
+size 1856

unet/diffusion_pytorch_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7be172ec2bcbf3d279034b6520b85f89a6108944526b2fde382341c57b035cdd
+size 3463726504

vae/config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:95d23ed9665de3ea1094f518e43ab731728721dc8ae2773d5f9b56118cca8e1d
+size 719

vae/diffusion_pytorch_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2aa1f43011b553a4cba7f37456465cdbd48aab7b54b9348b890e8058ea7683ec
+size 334643268