xxx777xxxASD
commited on
Commit
•
508417c
1
Parent(s):
107fd9c
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,77 @@
|
|
1 |
-
---
|
2 |
-
license: llama3
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: llama3
|
3 |
+
tags:
|
4 |
+
- moe
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
---
|
8 |
+
|
9 |
+
<style>
|
10 |
+
.image-container {
|
11 |
+
position: relative;
|
12 |
+
display: inline-block;
|
13 |
+
}
|
14 |
+
|
15 |
+
.image-container img {
|
16 |
+
display: block;
|
17 |
+
border-radius: 10px;
|
18 |
+
box-shadow: 0 0 1px rgba(0, 0, 0, 0.3);
|
19 |
+
}
|
20 |
+
|
21 |
+
.image-container::before {
|
22 |
+
content: "";
|
23 |
+
position: absolute;
|
24 |
+
top: 0px;
|
25 |
+
left: 20px;
|
26 |
+
width: calc(100% - 40px);
|
27 |
+
height: calc(100%);
|
28 |
+
background-image: url("https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/aM8nIpwbzIhtUVZPQUGYV.png");
|
29 |
+
background-size: cover;
|
30 |
+
filter: blur(10px);
|
31 |
+
z-index: -1;
|
32 |
+
}
|
33 |
+
</style>
|
34 |
+
<br>
|
35 |
+
|
36 |
+
|
37 |
+
<div class="image-container">
|
38 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/aM8nIpwbzIhtUVZPQUGYV.png" style="width: 96%; margin: auto;" >
|
39 |
+
</div>
|
40 |
+
|
41 |
+
Experimental RP-oriented MoE, the idea was to get a model that would be equal to or better than Mixtral 8x7B and it's finetunes in RP/ERP tasks.
|
42 |
+
|
43 |
+
### Llama 3 SnowStorm v1.15B 4x8B
|
44 |
+
```
|
45 |
+
base_model: Sao10K_L3-8B-Stheno-v3.1
|
46 |
+
gate_mode: random
|
47 |
+
dtype: bfloat16
|
48 |
+
experts_per_token: 2
|
49 |
+
experts:
|
50 |
+
- source_model: Nitral-AI_Poppy_Porpoise-0.85-L3-8B
|
51 |
+
- source_model: NeverSleep_Llama-3-Lumimaid-8B-v0.1-OAS
|
52 |
+
- source_model: openlynn_Llama-3-Soliloquy-8B-v2
|
53 |
+
- source_model: Sao10K_L3-8B-Stheno-v3.1
|
54 |
+
```
|
55 |
+
|
56 |
+
|
57 |
+
## Models used
|
58 |
+
|
59 |
+
- [ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B](https://huggingface.co/ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B)
|
60 |
+
- [NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS)
|
61 |
+
- [openlynn/Llama-3-Soliloquy-8B-v2](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v2)
|
62 |
+
- [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)
|
63 |
+
|
64 |
+
|
65 |
+
## Difference(from SnowStorm v1.0)
|
66 |
+
|
67 |
+
- Update from [ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B](https://huggingface.co/ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B) to [Nitral-AI/Poppy_Porpoise-0.85-L3-8B](https://huggingface.co/Nitral-AI/Poppy_Porpoise-0.85-L3-8B)
|
68 |
+
- Change base model from [NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS) to [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)
|
69 |
+
|
70 |
+
## Vision
|
71 |
+
|
72 |
+
[llama3_mmproj](https://huggingface.co/ChaoticNeutrals/LLaVA-Llama-3-8B-mmproj-Updated)
|
73 |
+
|
74 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/yv4C6NalqORLjvY3KKZk8.png)
|
75 |
+
|
76 |
+
|
77 |
+
## Prompt format: Llama 3
|