Mitsua commited on
Commit
4c0fb85
1 Parent(s): c31664d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -2
README.md CHANGED
@@ -1,6 +1,50 @@
1
  ---
2
- license: openrail++
3
  datasets:
4
  - Mitsua/vroid-image-dataset-lite
5
  pipeline_tag: text-to-image
6
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: other
3
  datasets:
4
  - Mitsua/vroid-image-dataset-lite
5
  pipeline_tag: text-to-image
6
+ ---
7
+ # Model Card for VRoid Diffusion
8
+
9
+ <!-- Provide a quick summary of what the model is/does. -->
10
+
11
+ This is a latent text-to-image diffusion model to demonstrate how U-Net training affects the generated images.
12
+
13
+ Text Encoder is from [OpenCLIP ViT-H/14](https://github.com/mlfoundations/open_clip), MIT License, Training Data : LAION-2B
14
+ VAE is from [Mitsua Diffusion One](https://huggingface.co/Mitsua/mitsua-diffusion-one), Mitsua Open RAIL-M License, Training Data: Public Domain/CC0 + Licensed
15
+
16
+ U-Net is trained from scratch using full version of [VRoid Image Dataset Lite](https://huggingface.co/datasets/Mitsua/vroid-image-dataset-lite) with some modifications.
17
+
18
+ - VRoid is a trademark or registered trademark of Pixiv inc. in Japan and other regions.
19
+
20
+ ## Model Details
21
+
22
+ - `vroid_diffusion_test.safetensors`
23
+ - base variant.
24
+ - `vroid_diffusion_test_invert_red_blue.safetensors`
25
+ - `red` and `blue` in the caption is swapped.
26
+ - `pink` and `skyblue` in the caption is swapped.
27
+ - `vroid_diffusion_test_monochrome.safetensors`
28
+ - all training images are converted to grayscale.
29
+
30
+
31
+ ### Model Description
32
+
33
+ - **Developed by:** Abstract Engine.
34
+ - **License:** Mitsua Open RAIL-M License.
35
+
36
+ ## Uses
37
+
38
+ ### Direct Use
39
+
40
+ Text-to-Image generation for research purposes.
41
+
42
+ ### Out-of-Scope Use
43
+
44
+ Any deployed use case of the model.
45
+
46
+ ## Training Details
47
+
48
+ ### Training Data
49
+
50
+ We use full version of [VRoid Image Dataset Lite](https://huggingface.co/datasets/Mitsua/vroid-image-dataset-lite) with some modifications.