sayakpaul
/

flux-lora-resizing

Diffusers

English

Model card Files Files and versions Community

sayakpaul HF staff commited on Sep 25, 2024

Commit

f4a6582

•

1 Parent(s): 68307ce

Update README.md

Browse files

Files changed (1) hide show

README.md +46 -26

README.md CHANGED Viewed

@@ -14,6 +14,11 @@ So, what if we could take an existing LoRA checkpoint with a high rank and reduc
 - Reduce the memory requirements
 - Enable use cases like `torch.compile()` (which require all the LoRAs to be of the same rank to avoid re-compilation)
 ## Random projections
 Basic idea:
@@ -34,6 +39,9 @@ Basic idea:
 Tried on this LoRA: [https://huggingface.co/glif/how2draw](https://huggingface.co/glif/how2draw). Unless explicitly specified, a rank of 4 was used for all experiments. Here’s a side-by-side comparison of the original and the reduced LoRAs (on the same seed).
 ```python
 from diffusers import DiffusionPipeline
 import torch
@@ -54,23 +62,35 @@ images = pipe(
 ).images
 ```
-![Yorkshire Terrier with smile, How2Draw](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image.png)
-Yorkshire Terrier with smile, How2Draw
-![a dolphin, How2Draw](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%201.png)
-a dolphin, How2Draw
-![an owl, How3Draw](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%202.png)
-an owl, How3Draw
-![A silhouette of a girl performing a ballet pose, with elegant lines to suggest grace and movement. The background can include simple outlines of ballet shoes and a music note. The image should convey elegance and poise in a minimalistic style, How2Draw](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%203.png)
-A silhouette of a girl performing a ballet pose, with elegant lines to suggest grace and movement. The background can include simple outlines of ballet shoes and a music note. The image should convey elegance and poise in a minimalistic style, How2Draw
-Code:  [https://gist.github.com/sayakpaul/9bae12402eddd53a79ee1f64b659b07b#file-low_rank_lora-py](https://gist.github.com/sayakpaul/9bae12402eddd53a79ee1f64b659b07b#file-low_rank_lora-py)
 ### Notes
@@ -81,27 +101,27 @@ Code:  [https://gist.github.com/sayakpaul/9bae12402eddd53a79ee1f64b659b07b#file-
 ### Results
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%204.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%205.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%206.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%207.png)
 ### Randomized SVD
 Full SVD can be time-consuming. Truncated SVD is useful very large sparse matrices. We can use randomized SVD for none-to-negligible loss in quality but significantly faster speed.
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%208.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%209.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%2010.png)
-![image.png](Make%20a%20high-rank%20LoRA%20low-rank%2010c1384ebcac80ca895dcc006a297900/image%2011.png)
-Code: [https://gist.github.com/sayakpaul/9bae12402eddd53a79ee1f64b659b07b#file-svd_low_rank_lora-py](https://gist.github.com/sayakpaul/9bae12402eddd53a79ee1f64b659b07b#file-svd_low_rank_lora-py)
 ### Tune the knobs in SVD

 - Reduce the memory requirements
 - Enable use cases like `torch.compile()` (which require all the LoRAs to be of the same rank to avoid re-compilation)
+This project explores two options to reduce the original LoRA checkpoint into an even smaller one:
+* Random projections
+* SVD
 ## Random projections
 Basic idea:
 Tried on this LoRA: [https://huggingface.co/glif/how2draw](https://huggingface.co/glif/how2draw). Unless explicitly specified, a rank of 4 was used for all experiments. Here’s a side-by-side comparison of the original and the reduced LoRAs (on the same seed).
+<details>
+<summary>Inference code</summary>
 ```python
 from diffusers import DiffusionPipeline
 import torch
 ).images
 ```
+</details>
+<table>
+  <tbody>
+    <tr>
+      <td align="center"><img src="https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/collage_0.png" alt="Image 1"></td>
+      <td align="center">Yorkshire Terrier with smile, How2Draw</td>
+    </tr>
+    <tr>
+      <td align="center"><img src="https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/collage_1.png" alt="Image 2"></td>
+      <td align="center">a dolphin, How2Draw</td>
+    </tr>
+    <tr>
+      <td align="center"><img src="https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/collage_2.png" alt="Image 3"></td>
+      <td align="center">an owl, How3Draw</td>
+    </tr>
+    <tr>
+      <td align="center"><img src="https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/collage_3.png" alt="Image 4"></td>
+      <td align="center">
+        A silhouette of a girl performing a ballet pose, with elegant lines to suggest grace and movement.
+        The background can include simple outlines of ballet shoes and a music note.
+        The image should convey elegance and poise in a minimalistic style, How2Draw
+      </td>
+    </tr>
+  </tbody>
+</table>
+Code:  [`low_rank_lora.py`](https://huggingface.co/sayakpaul/lower-rank-flux-lora/blob/main/low_rank_lora.py)
 ### Notes
 ### Results
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_svd_collage_0.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_svd_collage_1.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_svd_collage_2.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_svd_collage_3.png)
 ### Randomized SVD
 Full SVD can be time-consuming. Truncated SVD is useful very large sparse matrices. We can use randomized SVD for none-to-negligible loss in quality but significantly faster speed.
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_rand_svd_collage_0.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_rand_svd_collage_1.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_rand_svd_collage_2.png)
+![image.png](https://huggingface.co/sayakpaul/lower-rank-flux-lora/resolve/main/images/How2Draw-V2_000002800_rand_svd_collage_3.png)
+Code: [`svd_low_rank_lora.py`](https://huggingface.co/sayakpaul/lower-rank-flux-lora/blob/main/svd_low_rank_lora.py)
 ### Tune the knobs in SVD