Update README.md
README.md CHANGED

@@ -1,8 +1,6 @@
 ---
 language:
 - en
-tags:
-- not-for-all-audiences
 ---
 # Summary
 The name is self-explanatory. This LoRA was trained on 50MB of text taken from Archive Of Our Own (AO3). In total, 1441 stories were selected from the Furry fandom category. I don't remember what filters I used.
@@ -17,11 +15,17 @@ The name is self-explanatory. This LoRA was trained on 50MB of text taken from A
 - Targeted modules: Q, K, V, O, Gate, Up, Down
 - NEFTune alpha: 10 (to try to reduce overfitting)
 - Learning rate: 1e-4
+- Dropout: 0 (unsloth doesn't support LoRA dropout)
 
 # Model Settings
 - Base model: Mistral 7B
 - Data Type: BF16, 4 bit quantization (thanks BitsandBytes)
 
+# Misc Settings
+- Batch size: 2
+- Gradient Accumulation steps: 16
+- LR Scheduler: Linear
+
 # Software and Hardware
 - Unsloth was used to speed up training.
 - Training was done on 1x RTX 3090 (with 24 GB of VRAM) and took 11 hours.
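To make the settings listed in the card concrete, here is a minimal sketch of how they could be wired together with Unsloth and TRL's SFTTrainer. It is not the script that was actually used: the base-model repo id, sequence length, LoRA rank and alpha, epoch count, and dataset file name are not given in the card and are marked as assumptions in the comments, and the exact SFTTrainer keyword set varies between trl versions.

```python
# A minimal sketch of how the card's settings could be combined with Unsloth + TRL.
# Anything marked "assumed" is NOT stated in the model card.
from unsloth import FastLanguageModel
from trl import SFTTrainer
from transformers import TrainingArguments
from datasets import load_dataset

# Base model: "Mistral 7B" per the card; the exact repo id/revision is assumed.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="mistralai/Mistral-7B-v0.1",  # assumed repo id
    max_seq_length=2048,                     # assumed; not stated in the card
    dtype=None,                              # auto-selects BF16 on Ampere GPUs like the RTX 3090
    load_in_4bit=True,                       # 4-bit quantization via bitsandbytes
)

# LoRA on Q, K, V, O, Gate, Up, Down; dropout pinned to 0 as the card notes.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,                                    # assumed rank; not stated in the card
    lora_alpha=16,                           # assumed; not stated in the card
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
    lora_dropout=0,                          # 0, per the card
    bias="none",
    use_gradient_checkpointing=True,
)

# Hypothetical file name for the ~50MB of AO3 text; the card doesn't name the file.
dataset = load_dataset("text", data_files="ao3_furry_stories.txt", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=2048,
    neftune_noise_alpha=10,                  # NEFTune alpha: 10
    args=TrainingArguments(
        per_device_train_batch_size=2,       # Batch size: 2
        gradient_accumulation_steps=16,      # Gradient accumulation steps: 16
        learning_rate=1e-4,                  # Learning rate: 1e-4
        lr_scheduler_type="linear",          # LR scheduler: Linear
        bf16=True,
        num_train_epochs=1,                  # assumed; epoch count not stated
        logging_steps=10,
        output_dir="outputs",
    ),
)
trainer.train()
```

The per-device batch size of 2 keeps activation memory small enough for a single 24 GB RTX 3090, while the 16 gradient-accumulation steps give an effective batch size of 2 × 16 = 32 sequences per optimizer step.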