Zangs3011 committed
Commit a48b295 (1 parent: 1ed170d)

Update README.md

Files changed (1): README.md (+29, -13)
README.md CHANGED
@@ -1,20 +1,36 @@
 ---
+datasets:
+- ewof/code-alpaca-instruct-unfiltered
 library_name: peft
+tags:
+- llama2-7b
+- code
+- instruct
+- instruct-code
+- code-alpaca
+- alpaca-instruct
+- alpaca
+- llama7b
+- gpt2
 ---
-## Training procedure
-
-
-The following `bitsandbytes` quantization config was used during training:
-- load_in_8bit: True
-- load_in_4bit: False
-- llm_int8_threshold: 6.0
-- llm_int8_skip_modules: None
-- llm_int8_enable_fp32_cpu_offload: False
-- llm_int8_has_fp16_weight: False
-- bnb_4bit_quant_type: fp4
-- bnb_4bit_use_double_quant: False
-- bnb_4bit_compute_dtype: float32
-### Framework versions
-
-
-- PEFT 0.4.0
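
For reference, the quantization settings removed above map one-to-one onto a `transformers` `BitsAndBytesConfig`. A minimal sketch, assuming `transformers` and `bitsandbytes` are installed; the `-hf` hub id is our assumption (the card lists the model path as meta-llama/Llama-2-7b):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Mirrors the removed card section: 8-bit loading with default LLM.int8 settings.
# The 4-bit fields (fp4, no double quant, float32 compute) are library defaults
# and stay inactive while load_in_8bit=True.
bnb_config = BitsAndBytesConfig(
    load_in_8bit=True,
    llm_int8_threshold=6.0,
    llm_int8_enable_fp32_cpu_offload=False,
    llm_int8_has_fp16_weight=False,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # hub id assumed
    quantization_config=bnb_config,
    device_map="auto",
)
```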
+
+We finetuned Llama2-7B on the Code-Alpaca-Instruct dataset (ewof/code-alpaca-instruct-unfiltered) for 5 epochs, ~25,000 steps, using [MonsterAPI](https://monsterapi.ai)'s no-code [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm).
+
+This dataset is HuggingFaceH4/CodeAlpaca_20K unfiltered, with 36 instances of blatant alignment removed.
+
+The finetuning session completed in 4 hours and cost us only `$16` for the entire run!
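
The dataset itself is easy to inspect outside the no-code flow. A minimal sketch, assuming the `datasets` library; the split seed is our assumption, since the card only states the 90/10 ratio listed below:

```python
from datasets import load_dataset

# Load the unfiltered Code-Alpaca instruct set used for this run.
ds = load_dataset("ewof/code-alpaca-instruct-unfiltered", split="train")

# Reproduce the card's 90/10 train/validation split; the seed is assumed.
splits = ds.train_test_split(test_size=0.1, seed=42)
print(splits["train"].num_rows, splits["test"].num_rows)
print(splits["train"][0])  # one instruction-style record
```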
+
+#### Hyperparameters & Run details:
+- Model Path: meta-llama/Llama-2-7b
+- Dataset: ewof/code-alpaca-instruct-unfiltered
+- Learning rate: 0.0003
+- Number of epochs: 5
+- Data split: Training 90% / Validation 10%
+- Gradient accumulation steps: 1
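
For readers reproducing the run outside MonsterAPI, a rough sketch with `peft` and `transformers`. Only the learning rate, epoch count, and gradient accumulation come from the card; the LoRA rank, alpha, and target modules are assumptions, since the no-code finetuner does not expose them here:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

# Base model and an assumed LoRA setup (r, alpha, target modules not in the card).
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
lora = LoraConfig(
    task_type="CAUSAL_LM",
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
)
model = get_peft_model(base, lora)

# Values marked "card" match the hyperparameter list above.
args = TrainingArguments(
    output_dir="llama2-7b-code-alpaca",
    learning_rate=3e-4,             # card: 0.0003
    num_train_epochs=5,             # card: 5 epochs (~25,000 steps)
    gradient_accumulation_steps=1,  # card: 1
)
```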
+
+Loss metrics:
+![training loss](train-loss.png "Training loss")
+
+---
+license: apache-2.0
+---
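
For completeness, a hedged sketch of using the resulting adapter for inference with `peft`; the adapter repo id is a placeholder, and the prompt format is an assumption since the card does not document one:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Attach this adapter to the base model. ADAPTER_REPO is hypothetical;
# substitute this repository's actual Hub id.
ADAPTER_REPO = "your-username/your-adapter"
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf", device_map="auto")
model = PeftModel.from_pretrained(base, ADAPTER_REPO)
tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

prompt = "Write a Python function that checks whether a string is a palindrome."
inputs = tok(prompt, return_tensors="pt").to(base.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tok.decode(out[0], skip_special_tokens=True))
```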