Commit 7aeb93d by maywell • 1 Parent(s): fa1aa4b

Update README.md

Files changed (1)
  1. README.md +69 -1
README.md CHANGED
@@ -1,3 +1,71 @@
  ---
- license: cc-by-nc-4.0
+ license: cc-by-sa-4.0
  ---
+
+ # **Synatra-7B-v0.3-dpo🐧**
+ ![Synatra-7B-v0.3-dpo](./Synatra.png)
+
+ ## Support Me
+ Synatra is a personal project, developed with the resources of a single person. If you like the model, how about contributing a little toward research costs?
+ [<img src="https://cdn.buymeacoffee.com/buttons/default-orange.png" alt="Buy me a Coffee" width="217" height="50">](https://www.buymeacoffee.com/mwell)
+
+ Wanna be a sponsor? (Please) Contact me on Telegram **AlzarTakkarsen**
+
+ # **License**
+
+ This model is strictly for [*non-commercial*](https://creativecommons.org/licenses/by-sa/4.0/) (**cc-by-sa-4.0**) use, under **5K MAU**.
+ The "Model" (i.e. base model, derivatives, merges/mixes) is completely free to use for non-commercial purposes as long as the included **cc-by-sa-4.0** license in any parent repository and the non-commercial use statute remain in place, regardless of other models' licenses.
+ If your service has over **5K MAU**, contact me for license approval.
+
+ # **Model Details**
+ **Base Model**
+ [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1)
+
+ **Trained On**
+ A100 80GB * 1
+
+ **Instruction format**
+
+ It follows the [ChatML](https://github.com/openai/openai-python/blob/main/chatml.md) format and the **Alpaca (No-Input)** format.
+
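For illustration, the two prompt layouts named above can be sketched as plain string builders. This is a minimal sketch of the generic ChatML and Alpaca (no-input) conventions, not necessarily the exact strings this model was trained on; the tokenizer's `chat_template` remains the authoritative source, and the helper names `to_chatml` and `to_alpaca_no_input` are hypothetical.

```python
# Generic prompt builders. Helper names and the Alpaca preamble wording are
# illustrative assumptions, not taken from this repository.

def to_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts in ChatML layout."""
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    ]
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


ALPACA_NO_INPUT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)


def to_alpaca_no_input(instruction):
    """Render a single instruction in the Alpaca no-input layout."""
    return ALPACA_NO_INPUT.format(instruction=instruction)


if __name__ == "__main__":
    print(to_chatml([{"role": "user", "content": "Hello"}]))
    print(to_alpaca_no_input("Hello"))
```

In practice `tokenizer.apply_chat_template` should be preferred, since it applies whatever template ships with the model.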
+ # **Model Benchmark**
+
+ ## KOBEST_BOOLQ, SENTINEG, WIC - ZERO_SHOT
+ BoolQ, SentiNeg, and WiC were measured using [EleutherAI/lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness/tree/polyglot).
+
+ | Model | COPA | HellaSwag | BoolQ | SentiNeg |
+ | --- | --- | --- | --- | --- |
+ | EleutherAI/polyglot-ko-12.8b | 0.7937 | 0.5954 | 0.4818 | 0.9117 |
+ | Synatra-7B-v0.3-base | 0.6344 | 0.5140 | 0.5226 | NaN |
+ | **Synatra-7B-v0.3-dpo** | **0.6380** | **0.4780** | **0.8058** | **0.8942** |
+
+ ## Ko-LLM-Leaderboard
+
+ Benchmarking in progress...
+
+ # **Implementation Code**
+
+ Since the chat_template already contains the instruction format above, you can use the code below.
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ device = "cuda"  # the device to load the model onto
+
+ model = AutoModelForCausalLM.from_pretrained("maywell/Synatra-7B-v0.3-dpo")
+ tokenizer = AutoTokenizer.from_pretrained("maywell/Synatra-7B-v0.3-dpo")
+
+ # Prompt: "Are bananas originally white?"
+ messages = [
+     {"role": "user", "content": "바나나는 원래 하얀색이야?"},
+ ]
+
+ # Apply the tokenizer's built-in chat template and tokenize the result
+ encodeds = tokenizer.apply_chat_template(messages, return_tensors="pt")
+
+ # Move both the inputs and the model to the target device
+ model_inputs = encodeds.to(device)
+ model.to(device)
+
+ # Sample up to 1000 new tokens, then decode prompt and reply back to text
+ generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
+ decoded = tokenizer.batch_decode(generated_ids)
+ print(decoded[0])
+ ```
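The decoded string printed by the snippet above contains the prompt as well as the reply. A minimal sketch for pulling out only the last assistant turn follows, assuming the output uses ChatML markers (`<|im_start|>` / `<|im_end|>`); the helper name `last_assistant_reply` is hypothetical and not part of transformers.

```python
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def last_assistant_reply(decoded: str) -> str:
    """Return the text of the final assistant turn in a ChatML transcript.

    Falls back to the whole (stripped) string when no assistant marker is found.
    """
    marker = IM_START + "assistant"
    start = decoded.rfind(marker)
    if start == -1:
        return decoded.strip()
    reply = decoded[start + len(marker):]
    # Cut at the end-of-turn marker if generation produced one.
    end = reply.find(IM_END)
    if end != -1:
        reply = reply[:end]
    return reply.strip()


if __name__ == "__main__":
    sample = (
        "<|im_start|>user\nHi<|im_end|>\n"
        "<|im_start|>assistant\nHello there<|im_end|>"
    )
    print(last_assistant_reply(sample))
```

With the snippet above, this would be called as `last_assistant_reply(decoded[0])`.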