Jan commited on
Commit
7c754c0
1 Parent(s): b698126

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +69 -0
README.md ADDED
@@ -0,0 +1,69 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ ---
6
+ <!-- header start -->
7
+ <!-- 200823 -->
8
+ <div style="width: auto; margin-left: auto; margin-right: auto">
9
+ <img src="https://github.com/janhq/jan/assets/89722390/35daac7d-b895-487c-a6ac-6663daaad78e" alt="Jan banner" style="width: 100%; min-width: 400px; display: block; margin: auto;">
10
+ </div>
11
+
12
+ <p align="center">
13
+ <a href="https://jan.ai/">Jan</a>
14
+ - <a href="https://discord.gg/AsJ8krTT3N">Discord</a>
15
+ </p>
16
+ <!-- header end -->
17
+
18
+ # Model Description
19
+ This model uses the `DARE_TIES` merge method.
20
+
21
+ ```yaml
22
+ base_model: mistralai/Mistral-7B-v0.1
23
+ dtype: bfloat16
24
+ merge_method: dare_ties
25
+ models:
26
+ - model: mistralai/Mistral-7B-v0.1
27
+ - model: Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp
28
+ parameters:
29
+ density: 0.8
30
+ weight: 0.4
31
+ - model: Q-bert/MetaMath-Cybertron-Starling
32
+ parameters:
33
+ density: 0.8
34
+ weight: 0.3
35
+ - model: AIDC-ai-business/Marcoroni-7B-v3
36
+ parameters:
37
+ density: 0.8
38
+ weight: 0.3
39
+ parameters:
40
+ int8_mask: true
41
+
42
+ ```
43
+
44
+ # About Jan
45
+ Jan believes in the need for an open-source AI ecosystem and is building the infra and tooling to allow open-source AIs to compete on a level playing field with proprietary ones.
46
+
47
+ Jan's long-term vision is to build a cognitive framework for future robots, who are practical, useful assistants for humans and businesses in everyday life.
48
+
49
+ # Jan Model Merger
50
+ This is a test project for merging models.
51
+
52
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
53
+
54
+ Detailed results can be found here.
55
+
56
+ | Metric | Value |
57
+ |-----------------------|---------------------------|
58
+ | Avg. | ?|
59
+ | ARC (25-shot) | ? |
60
+ | HellaSwag (10-shot) | ? |
61
+ | MMLU (5-shot) | ?|
62
+ | TruthfulQA (0-shot) | ? |
63
+ | Winogrande (5-shot) | ? |
64
+ | GSM8K (5-shot) | ? |
65
+
66
+ # Acknowlegement
67
+ - [mergekit](https://github.com/cg123/mergekit)
68
+ - [DARE](https://github.com/yule-BUAA/MergeLM/blob/main/README.md)
69
+ - [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness)