fbjr commited on
Commit
2533bc2
1 Parent(s): da433e0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -2
README.md CHANGED
@@ -1,7 +1,42 @@
1
  ---
2
  license: openrail
 
 
 
 
 
 
 
 
 
 
 
3
  ---
4
- for testing purposes only. qlora trained using peft on codellama/CodeLlama-7b-hf as base model. trained on mrm8488/unnatural-instructions, config 'core' dataset.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
 
6
  training data transformed to the following structure for testing purposes:
7
  ```Example 1:
@@ -39,4 +74,4 @@ You are given a paragraph from an article. Your task is to replace all the third
39
  His team's plans for the day were quickly ruined when their bus got a flat tire on the way to their first event of the day.
40
  The tournament organizers were not very happy with his team when they showed up late to their match. [/INST]
41
  Output: Our team's plans for the day were quickly ruined when our bus got a flat tire on the way to our first event of the day.
42
- The tournament organizers were not very happy with us when we showed up late to our match.```
 
1
  ---
2
  license: openrail
3
+ datasets:
4
+ - mrm8488/unnatural-instructions
5
+ language:
6
+ - en
7
+ library_name: peft
8
+ pipeline_tag: text-generation
9
+ tags:
10
+ - codellama
11
+ - llama2
12
+ - llama
13
+ - instruct
14
  ---
15
+ for testing purposes only. qlora trained using peft on codellama/CodeLlama-7b-hf as base model. trained on mrm8488/unnatural-instructions, config 'core' dataset.
16
+
17
+ trained at 1000 steps with checkpoint every 50. training/validation loss below:
18
+
19
+ ```Step Training Loss Validation Loss
20
+ 50 1.480500 0.935647
21
+ 100 0.894800 0.867328
22
+ 150 0.835700 0.841386
23
+ 200 0.846100 0.823671
24
+ 250 0.804600 0.791546
25
+ 300 0.744000 0.799941
26
+ 350 0.721900 0.707534
27
+ 400 0.702700 0.697420
28
+ 450 0.698200 0.691702
29
+ 500 0.674600 0.687037
30
+ 550 0.666700 0.683634
31
+ 600 0.687200 0.680872
32
+ 650 0.679300 0.677384
33
+ 700 0.698900 0.675221
34
+ 750 0.652500 0.673152
35
+ 800 0.672200 0.671620
36
+ 850 0.668700 0.669980
37
+ 900 0.638100 0.669189
38
+ 950 0.663200 0.668443
39
+ 1000 0.668300 0.668069```
40
 
41
  training data transformed to the following structure for testing purposes:
42
  ```Example 1:
 
74
  His team's plans for the day were quickly ruined when their bus got a flat tire on the way to their first event of the day.
75
  The tournament organizers were not very happy with his team when they showed up late to their match. [/INST]
76
  Output: Our team's plans for the day were quickly ruined when our bus got a flat tire on the way to our first event of the day.
77
+ The tournament organizers were not very happy with us when we showed up late to our match.```