kyujinpy commited on
Commit
c49f354
1 Parent(s): 3c3f1f8

Upload 2 files

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +55 -0
  3. sakura.png +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ sakura.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,58 @@
1
  ---
 
 
 
 
 
 
 
2
  license: cc-by-nc-sa-4.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - en
4
+ datasets:
5
+ - Intel/orca_dpo_pairs
6
+ - argilla/distilabel-math-preference-dpo
7
+ - kyujinpy/orca_math_dpo
8
+ pipeline_tag: text-generation
9
  license: cc-by-nc-sa-4.0
10
  ---
11
+
12
+ # **Sakura-SOLRCA-Math-Instruct-DPO**
13
+ <img src='./sakura.png' width=512>
14
+
15
+ ## Model Details
16
+
17
+ **Model Developers** Kyujin Han (kyujinpy)
18
+
19
+ **Method**
20
+ Using DPO method.
21
+ With [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs) and [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo).
22
+
23
+ I shared the merge version [kyujinpy/orca_math_dpo](https://huggingface.co/datasets/kyujinpy/orca_math_dpo).
24
+
25
+ I will share the information about my model. (training and code)
26
+ Please see: ⭐[Sakura-SOLAR(will update)]().
27
+
28
+ # **Model Benchmark**
29
+
30
+ ## Open leaderboard
31
+ - Follow up as [link](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
32
+
33
+ | Model | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
34
+ | --- | --- | --- | --- | --- | --- | --- | --- |
35
+ | Sakura-SOLRCA-Math-Instruct-DPO | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
36
+ | Sakura-SOLRCA-Instruct-DPO | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
37
+ | Sakura-SOLAR-Instruct-DPO-v2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
38
+ | Sakura-SOLAR-Instruct-DPO-v1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
39
+ | [kyujinpy/Sakura-SOLAR-Instruct](https://huggingface.co/kyujinpy/Sakura-SOLAR-Instruct) | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
40
+
41
+
42
+ # Implementation Code
43
+ ```python
44
+ ### KO-Platypus
45
+ from transformers import AutoModelForCausalLM, AutoTokenizer
46
+ import torch
47
+
48
+ repo = "kyujinpy/Sakura-SOLRCA-Math-Instruct-DPO"
49
+ OpenOrca = AutoModelForCausalLM.from_pretrained(
50
+ repo,
51
+ return_dict=True,
52
+ torch_dtype=torch.float16,
53
+ device_map='auto'
54
+ )
55
+ OpenOrca_tokenizer = AutoTokenizer.from_pretrained(repo)
56
+ ```
57
+
58
+ ---
sakura.png ADDED

Git LFS Details

  • SHA256: 5a066cccad8d305c4ea9bc00d720c7c487853b2eb4d50c9f879742ecfdb56396
  • Pointer size: 132 Bytes
  • Size of remote file: 1.35 MB