RichardErkhov committed on
Commit
1146aa8
·
verified ·
1 Parent(s): 9e6332b

uploaded readme

Files changed (1)
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
+ Quantization made by Richard Erkhov.
+
+ [GitHub](https://github.com/RichardErkhov)
+
+ [Discord](https://discord.gg/pvy7H8DZMG)
+
+ [Request more models](https://github.com/RichardErkhov/quant_request)
+
+
+ Llama-3-Smaug-8B - bnb 4bits
+ - Model creator: https://huggingface.co/abacusai/
+ - Original model: https://huggingface.co/abacusai/Llama-3-Smaug-8B/
+
+
+
+
+ Original model description:
+ ---
+ library_name: transformers
+ license: llama2
+ datasets:
+ - aqua_rat
+ - microsoft/orca-math-word-problems-200k
+ - m-a-p/CodeFeedback-Filtered-Instruction
+ - anon8231489123/ShareGPT_Vicuna_unfiltered
+ ---
+
+ # Llama-3-Smaug-8B
+
+ ### Built with Meta Llama 3
+
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64c14f95cac5f9ba52bbcd7f/OrcJyTaUtD2HxJOPPwNva.png)
+
+ This model was built by applying the Smaug recipe, which targets better performance on real-world multi-turn conversations, to
+ [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
+
+ ### Model Description
+
+ - **Developed by:** [Abacus.AI](https://abacus.ai)
+ - **License:** https://llama.meta.com/llama3/license/
+ - **Finetuned from model:** [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
+
+
+ ## Evaluation
+
+ ### MT-Bench
+
+ ```
+ ########## First turn ##########
+                                score
+ model                    turn
+ Llama-3-Smaug-8B         1     8.77500
+ Meta-Llama-3-8B-Instruct 1     8.31250
+ ########## Second turn ##########
+                                score
+ model                    turn
+ Meta-Llama-3-8B-Instruct 2     7.8875
+ Llama-3-Smaug-8B         2     7.8875
+ ########## Average ##########
+                             score
+ model
+ Llama-3-Smaug-8B          8.331250
+ Meta-Llama-3-8B-Instruct  8.10
+ ```
+
+ | Model | First turn | Second turn | Average |
+ | :---- | ---------: | ----------: | ------: |
+ | Llama-3-Smaug-8B | 8.78 | 7.89 | 8.33 |
+ | Meta-Llama-3-8B-Instruct | 8.31 | 7.89 | 8.10 |
+
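The averages reported in the MT-Bench results can be checked by hand. The sketch below assumes the average is the unweighted mean of the first-turn and second-turn scores, which is consistent with the reported figures but is not stated in the README. The helper name `average_score` and the `scores` dictionary are illustrative and not part of any MT-Bench tooling.

```python
# Per-turn MT-Bench scores copied from the evaluation output in this README.
scores = {
    "Llama-3-Smaug-8B": (8.77500, 7.8875),
    "Meta-Llama-3-8B-Instruct": (8.31250, 7.8875),
}


def average_score(first_turn: float, second_turn: float) -> float:
    """Unweighted mean of the two turn scores (an assumption, see lead-in)."""
    return (first_turn + second_turn) / 2


# Average per model, keyed by model name.
averages = {model: average_score(*turns) for model, turns in scores.items()}

# Difference between the fine-tuned model and its base model.
delta = averages["Llama-3-Smaug-8B"] - averages["Meta-Llama-3-8B-Instruct"]

for model, avg in averages.items():
    print(f"{model}: average = {avg:.5f}")
print(f"Smaug minus base: {delta:.5f}")
```

Under this assumption the entire gain in the average comes from the first turn, since both models score the same on the second turn.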
+ This version of Smaug uses new techniques and new data compared to [Smaug-72B](https://huggingface.co/abacusai/Smaug-72B-v0.1). More information will be released later; for now, see the previous Smaug paper: https://arxiv.org/abs/2402.13228.
+