Krishan3168 commited on
Commit
22df4ac
·
1 Parent(s): 66b3a5f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -3
README.md CHANGED
@@ -1,3 +1,29 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Digit-Spatial-Diffusion's Hugging Face Models
2
+
3
+ This repository contains several machine learning models developed as part of my project, "Spatial Knowledge Injection into Diffusion-based Image Generation".
4
+
5
+ ## Project Overview
6
+
7
+ The project focused on the integration of spatial relations knowledge into diffusion-based image generation models. The main objective was to enhance the realism and context-awareness of the generated images. I experimented with four different methods to control the image generation process: baseline, handcrafted approach, typed control, and control dots.
8
+
9
+ ## Models
10
+
11
+ This repository includes the following models:
12
+
13
+ 1. **Baseline** - Stable Diffusion model trained with digits on our custom dataset
14
+ 2. **Handcrafted** - ControlNet with Handcrafted tesnor.
15
+ 3. **Typed** - ControlNet with typed digits as control signals.
16
+ 3. **Typed dot** - ControlNet with dots as control signals.
17
+
18
+ ## How to Use
19
+
20
+ Here's a basic example of how to use these models:
21
+
22
+ ```python
23
+ from transformers import AutoTokenizer, AutoModel
24
+
25
+ tokenizer = AutoTokenizer.from_pretrained("Krishan3168/Digit-Spatial-Diffusion")
26
+ model = AutoModel.from_pretrained("Krishan3168/Digit-Spatial-Diffusion")
27
+
28
+ inputs = tokenizer("Example input", return_tensors="pt")
29
+ outputs = model(**inputs)