sanjay920 commited on
Commit
dae8c63
1 Parent(s): 154de4a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +71 -0
README.md ADDED
@@ -0,0 +1,71 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ model-index:
4
+ - name: Rubra-Meta-Llama-3-8B-Instruct
5
+ results:
6
+ - task:
7
+ type: text-generation
8
+ dataset:
9
+ type: MMLU
10
+ name: MMLU
11
+ metrics:
12
+ - type: 5-shot
13
+ value: 64.39
14
+ verified: false
15
+ - task:
16
+ type: text-generation
17
+ dataset:
18
+ type: GPQA
19
+ name: GPQA
20
+ metrics:
21
+ - type: 0-shot
22
+ value: 31.7
23
+ verified: false
24
+ - task:
25
+ type: text-generation
26
+ dataset:
27
+ type: GSM-8K
28
+ name: GSM-8K
29
+ metrics:
30
+ - type: 8-shot, CoT
31
+ value: 68.99
32
+ verified: false
33
+ - task:
34
+ type: text-generation
35
+ dataset:
36
+ type: MATH
37
+ name: MATH
38
+ metrics:
39
+ - type: 4-shot, CoT
40
+ value: 23.76
41
+ verified: false
42
+ - task:
43
+ type: text-generation
44
+ dataset:
45
+ type: MT-bench
46
+ name: MT-bench
47
+ metrics:
48
+ - type: GPT-4 as Judge
49
+ value: 8.03
50
+ verified: false
51
+ tags:
52
+ - function-calling
53
+ - tool-calling
54
+ - agentic
55
+ - rubra
56
+ - gguf
57
+ - llamacpp
58
+ language:
59
+ - en
60
+ ---
61
+
62
+ # Rubra Llama-3 8B
63
+
64
+ ## Model description
65
+ The model is the result of further post-training [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct). This model is designed for high performance in various instruction-following tasks and complex interactions, including multi-turn function calling and detailed conversations.
66
+
67
+ ## Training Data
68
+ The model underwent additional training on a proprietary dataset encompassing diverse instruction-following, chat, and function calling data. This post-training process enhances the model's ability to integrate tools and manage complex interaction scenarios effectively.
69
+
70
+ ## How to use
71
+ Refer to https://docs.rubra.ai/inference/llamacpp for usage. Feel free to ask/open issues up in our Github repo: https://github.com/rubra-ai/rubra