gvij committed on
Commit
a7580a4
1 Parent(s): c9ed098

Update README.md

  1. README.md +35 -1
README.md CHANGED
@@ -1,3 +1,37 @@
  ---
- license: apache-2.0
+ datasets:
+ - ewof/code-alpaca-instruct-unfiltered
+ library_name: peft
+ tags:
+ - gpt-j
+ - gpt-j-6b
+ - code
+ - instruct
+ - instruct-code
+ - code-alpaca
+ - alpaca-instruct
+ - alpaca
+ - llama7b
+ - gpt2
+ ---
+
+ We finetuned GPT-J 6B on the Code-Alpaca-Instruct dataset (ewof/code-alpaca-instruct-unfiltered) for 5 epochs (~25,000 steps) using the [MonsterAPI](https://monsterapi.ai) no-code [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm).
+
+ This dataset is an unfiltered version of HuggingFaceH4/CodeAlpaca_20K, with 36 instances of blatant alignment removed.
+
+ The finetuning run completed in 206 minutes and cost us only `$8` in total!
+
+ #### Hyperparameters & Run details:
+ - Model Path: EleutherAI/gpt-j-6b
+ - Dataset: ewof/code-alpaca-instruct-unfiltered
+ - Learning rate: 0.0003
+ - Number of epochs: 5
+ - Data split: Training: 90% / Validation: 10%
+ - Gradient accumulation steps: 1
+
+ Loss metrics:
+ ![training loss](train-loss.png "Training loss")
+
  ---
+ license: apache-2.0
+ ---
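
Review note: the run figures quoted in the added README text can be sanity-checked with a little arithmetic. The sketch below is not part of the commit. It assumes the dataset holds roughly 20,000 examples (inferred only from the "20K" in the upstream dataset name, not stated in the README), and the function names are hypothetical. Under that assumption, the reported ~25,000 steps imply a small per-step batch, and the quoted cost works out to a little over two dollars per hour.

```python
# Figures quoted in the README text added by this commit.
EPOCHS = 5
REPORTED_STEPS = 25_000      # "~25,000 steps" (approximate)
TRAIN_FRACTION = 0.90        # 90% training / 10% validation
GRAD_ACCUM_STEPS = 1
RUN_MINUTES = 206
RUN_COST_USD = 8.0

# ASSUMPTION: about 20,000 examples, guessed from "CodeAlpaca_20K".
# The README does not state the exact dataset size.
ASSUMED_DATASET_SIZE = 20_000


def implied_examples_per_step(dataset_size, train_fraction, epochs, steps, grad_accum):
    """Average number of training examples consumed per optimizer step."""
    train_examples = round(dataset_size * train_fraction)
    examples_seen = train_examples * epochs
    return examples_seen / (steps * grad_accum)


def hourly_cost(total_cost_usd, minutes):
    """Average cost per hour of the finetuning run."""
    return total_cost_usd / (minutes / 60)


if __name__ == "__main__":
    per_step = implied_examples_per_step(
        ASSUMED_DATASET_SIZE, TRAIN_FRACTION, EPOCHS, REPORTED_STEPS, GRAD_ACCUM_STEPS
    )
    print(f"Implied examples per step: {per_step:.1f}")   # 3.6 under the assumed size
    print(f"Cost per hour: ${hourly_cost(RUN_COST_USD, RUN_MINUTES):.2f}")  # $2.33
```

Because both the step count and the dataset size are approximate, the 3.6 figure only suggests a per-step batch size of around 3 to 4; the README itself does not report the batch size.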