fishytorts
commited on
Commit
•
0867139
1
Parent(s):
7a114a3
added data set and loraconfig details
Browse files
README.md
CHANGED
@@ -1,8 +1,25 @@
|
|
1 |
---
|
2 |
library_name: peft
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
## Training procedure
|
5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
### Framework versions
|
7 |
|
8 |
|
|
|
1 |
---
|
2 |
library_name: peft
|
3 |
---
|
4 |
+
## Dataset procedure
|
5 |
+
- Dataset used: /tweets_hate_speech_detection
|
6 |
+
- size: 3196 (only 10% of dataset used)
|
7 |
+
- batch_size = 32
|
8 |
+
- num_epochs = 20
|
9 |
+
- learning_rate = 3e-4
|
10 |
+
- num_warmup_steps = 0.06 * (3196 * num_epochs)
|
11 |
+
- num_training_steps = (3196 * num_epochs)
|
12 |
+
|
13 |
## Training procedure
|
14 |
|
15 |
+
|
16 |
+
## LoraConfig procedure
|
17 |
+
r=8, #attention heads
|
18 |
+
lora_alpha=16, #alpha scaling
|
19 |
+
lora_dropout=0.1,
|
20 |
+
bias="none",
|
21 |
+
task_type="SEQ_CLS" # set this for CLM or Seq2Seq
|
22 |
+
|
23 |
### Framework versions
|
24 |
|
25 |
|