sanjay920 committed on
Commit 659aebf · verified · 1 Parent(s): 2f37ae8

Update README.md

Files changed (1)
  1. README.md +58 -30
README.md CHANGED
@@ -1,33 +1,61 @@
- ---
- language:
- - en
- ---
  ---
+ license: apache-2.0
+ model-index:
+ - name: Rubra-Meta-Llama-3-8B-Instruct
+   results:
+   - task:
+       type: text-generation
+     dataset:
+       type: MMLU
+       name: MMLU
+     metrics:
+     - type: 5-shot
+       value: 68.88
+       verified: false
+   - task:
+       type: text-generation
+     dataset:
+       type: GPQA
+       name: GPQA
+     metrics:
+     - type: 0-shot
+       value: 30.36
+       verified: false
+   - task:
+       type: text-generation
+     dataset:
+       type: GSM-8K
+       name: GSM-8K
+     metrics:
+     - type: 8-shot, CoT
+       value: 75.82
+       verified: false
+   - task:
+       type: text-generation
+     dataset:
+       type: MATH
+       name: MATH
+     metrics:
+     - type: 4-shot, CoT
+       value: 28.72
+       verified: false
+   - task:
+       type: text-generation
+     dataset:
+       type: MT-bench
+       name: MT-bench
+     metrics:
+     - type: GPT-4 as Judge
+       value: 8.08
+       verified: false
+ tags:
+ - function-calling
+ - tool-calling
+ - agentic
+ - rubra
+ language:
+ - en
+ - zh
  ---

- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # run1_short_20000
-
- This model is a fine-tuned version of [Qwen/Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) on the rubra_train_v1_qwen_short dataset.
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 2e-05
- - train_batch_size: 1
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 20
- - total_train_batch_size: 20
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: cosine
- - num_epochs: 1.0
-
- ### Framework versions
-
- - Transformers 4.41.2
- - Pytorch 2.3.1+cu121
- - Datasets 2.19.2
- - Tokenizers 0.19.1
+ # Qwen2 7B Instruct