File size: 2,279 Bytes
07a3660
 
 
 
 
 
 
 
ee528be
 
 
 
 
07a3660
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
---
license: creativeml-openrail-m
library_name: diffusers
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- diffusers-training
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- diffusers-training
base_model: runwayml/stable-diffusion-v1-5
inference: true
---

<!-- This model card has been generated automatically according to the information the training script had access to. You
should probably proofread and complete it, then remove this comment. -->


# Text-to-image finetuning - rcannizzaro/sd-dsprites-counterfactual-with-vae-loss-vae-unfrozen

This pipeline was finetuned from **runwayml/stable-diffusion-v1-5** on the **osazuwa/dsprite-counterfactual** dataset. Below are some example images generated with the finetuned pipeline using the following prompts: ['A very large square, with no rotation, at the center horizontally and vertically.', 'A very large ellipse, with no rotation, at the center horizontally and vertically.', 'A very large heart shape, with no rotation, at the center horizontally and vertically.']: 

![val_imgs_grid](./val_imgs_grid.png)


## Pipeline usage

You can use the pipeline like so:

```python
from diffusers import DiffusionPipeline
import torch

pipeline = DiffusionPipeline.from_pretrained("rcannizzaro/sd-dsprites-counterfactual-with-vae-loss-vae-unfrozen", torch_dtype=torch.float16)
prompt = "A very large square, with no rotation, at the center horizontally and vertically."
image = pipeline(prompt).images[0]
image.save("my_image.png")
```

## Training info

These are the key hyperparameters used during training:

* Epochs: 3
* Learning rate: 1e-05
* Batch size: 100
* Gradient accumulation steps: 4
* Image resolution: 64
* Mixed-precision: fp16


More information on all the CLI arguments and the environment are available on your [`wandb` run page](https://wandb.ai/ricardocannizzaro/sd-dsprites-counterfactual-with-vae-loss-vae-unfrozen/runs/78d8xaoa).


## Intended uses & limitations

#### How to use

```python
# TODO: add an example code snippet for running this diffusion pipeline
```

#### Limitations and bias

[TODO: provide examples of latent issues and potential remediations]

## Training details

[TODO: describe the data used to train the model]