fzzhang commited on
Commit
dfabf30
1 Parent(s): b78605f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -2
README.md CHANGED
@@ -5,7 +5,43 @@ datasets:
5
  language:
6
  - en
7
  ---
8
- Pretrained Model: https://huggingface.co/Toten5/Marcoroni-neural-chat-7B-v2
9
 
 
10
 
11
- Fine-tuned on GSM8K training set
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  language:
6
  - en
7
  ---
8
+ # Marcoroni-neural-chat-7B-v2_gsm8k
9
 
10
+ This model is a fine-tuned version of [Toten5/Marcoroni-neural-chat-7B-v2](https://huggingface.co/Toten5/Marcoroni-neural-chat-7B-v2) on the GSM8K dataset.
11
 
12
+ ## Model description
13
+
14
+ More information needed
15
+
16
+ ## Intended uses & limitations
17
+
18
+ More information needed
19
+
20
+ ## Training and evaluation data
21
+
22
+ More information needed
23
+
24
+ ## Training procedure
25
+
26
+ ### Training hyperparameters
27
+
28
+ The following hyperparameters were used during training:
29
+ - learning_rate: 1e-05
30
+ - train_batch_size: 4
31
+ - eval_batch_size: 8
32
+ - seed: 0
33
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
34
+ - lr_scheduler_type: linear
35
+ - num_epochs: 5
36
+
37
+ ### Training results
38
+
39
+
40
+
41
+ ### Framework versions
42
+
43
+ - PEFT 0.7.2.dev0
44
+ - Transformers 4.36.2
45
+ - Pytorch 2.1.2
46
+ - Datasets 2.16.1
47
+ - Tokenizers 0.15.1