sandmanbuzz
/

Air-Striker-Mixtral-8x7B-ZLoss-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sandmanbuzz commited on Feb 22, 2024

Commit

b0cc528

·

verified ·

1 Parent(s): 115e05d

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -8,6 +8,8 @@ tags:
 ---
 # Air-Striker-Mixtral-8x7B-ZLoss-Instruct
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
@@ -19,7 +21,7 @@ This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge
 The following models were included in the merge:
 * [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)
-* ./Air-Striker-Mixtral-8x7B-ZLoss-noninstruct/merged
 ### Configuration
@@ -30,7 +32,7 @@ models:
   - model: mistralai/Mixtral-8x7B-Instruct-v0.1
     parameters:
       weight: 0.5
-  - model: ./Air-Striker-Mixtral-8x7B-ZLoss-noninstruct/merged
     parameters:
       weight: 0.5
 merge_method: linear

 ---
 # Air-Striker-Mixtral-8x7B-ZLoss-Instruct
+So, bro literally posts nothing but exl2s of this model, and it's like, am I some kind of caveman? What am I gonna do with an exl2? Take a picture of it and send it to my mom? So we re-made it from the ground up using the [lora](https://huggingface.co/LoneStriker/Air-Striker-Mixtral-8x7B-ZLoss-LoRA), since at least dude's got that posted.
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
 The following models were included in the merge:
 * [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)
+* [sandmanbuzz/Air-Striker-Mixtral-8x7B-ZLoss](https://huggingface.co/sandmanbuzz/Air-Striker-Mixtral-8x7B-ZLoss)
 ### Configuration
   - model: mistralai/Mixtral-8x7B-Instruct-v0.1
     parameters:
       weight: 0.5
+  - model: sandmanbuzz/Air-Striker-Mixtral-8x7B-ZLoss
     parameters:
       weight: 0.5
 merge_method: linear