xxx777xxxASD
/

L3_SnowStorm_4x8B

Text Generation

Mixture of Experts

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

xxx777xxxASD commited on May 20

Commit

cb13d04

•

1 Parent(s): 24b4d0d

Update README.md

Files changed (1) hide show

README.md +40 -3

README.md CHANGED Viewed

@@ -1,3 +1,40 @@
----
-license: llama3
----

+---
+license: llama3
+tags:
+- moe
+language:
+- en
+---
+Experimental RP-oriented MoE, the idea was to get a model that would be equal to or better than Mixtral 8x7B and it's finetunes in RP/ERP tasks.
+### Llama 3 SnowStorm 4x8B
+```
+base_model: NeverSleep_Llama-3-Lumimaid-8B-v0.1-OAS
+gate_mode: random
+dtype: bfloat16
+experts_per_token: 2
+experts:
+  - source_model: ChaoticNeutrals_Poppy_Porpoise-v0.7-L3-8B
+  - source_model: NeverSleep_Llama-3-Lumimaid-8B-v0.1-OAS
+  - source_model: openlynn_Llama-3-Soliloquy-8B-v2
+  - source_model: Sao10K_L3-8B-Stheno-v3.1
+```
+## Models used
+- [ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B](https://huggingface.co/ChaoticNeutrals/Poppy_Porpoise-v0.7-L3-8B)
+- [NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS](https://huggingface.co/NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS)
+- [openlynn/Llama-3-Soliloquy-8B-v2](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v2)
+- [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)
+## Vision
+[llama3_mmproj](https://huggingface.co/ChaoticNeutrals/LLaVA-Llama-3-8B-mmproj-Updated)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/yv4C6NalqORLjvY3KKZk8.png)
+## Prompt format: Llama 3