cognitivecomputations/dolphin-2.9.1-mixtral-1x22b

Crystalcareai committed on May 22, 2024

Commit fbd34cc • 1 Parent(s): 03e64d4

Update README.md

Files changed (1)

README.md +2 -3

@@ -38,14 +38,13 @@ It took 27 hours on 8xH100 provided by Crusoe Cloud.
 This model was fully fine-tuned, targeting all layers.
-The model is an extracted expert using SLERP and a custom script that we've open-sourced (I'll provide the link to the GitHub). It extracts a single expert which is the combined SLERP of all 8 experts from a Mixtral architecture. We decided to not fully convert to a dense model, for the sake of trying to keep as much of the original model's performance as possible, as this process is already quite surgical and there are a lot of variables to take into account.
+The model is an extracted expert using SLERP and a custom script that we've open-sourced. It extracts a single expert which is the combined SLERP of all 8 experts from a Mixtral architecture. We decided to not fully convert to a dense model, for the sake of trying to keep as much of the original model's performance as possible, as this process is already quite surgical and there are a lot of variables to take into account.
 Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling.
 Dolphin is uncensored. We have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant with any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
-Dolphin is licensed Apache 2.0. I grant permission for any use, including commercial, that falls within accordance with Apache-2.0 license. Dolphin was trained on data generated from GPT4, among other models.
+Dolphin is licensed under Apache 2.0. We grant permission for any use, including commercial, as long as it complies with the Apache-2.0 license. Dolphin was trained using data generated from GPT-4, among other models. For more details on the extraction process of the expert model, visit our GitHub repository: https://github.com/cognitivecomputations/extract-expert/tree/main
 ## Evals
 ![image/png](https://i.ibb.co/yNmCv76/file-nkvf-Q9-Mg-X57-GB7-Ayrl-YA2-Zsp.png)
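
For readers unfamiliar with the SLERP step mentioned in the README text above, here is a minimal sketch of spherical linear interpolation between weight tensors. It is not the code from the linked extract-expert repository. The function names `slerp` and `merge_experts`, and the choice to fold the experts pairwise with a running weight of `1/(k+1)`, are assumptions made only for illustration; the actual script may combine the 8 experts differently.

```python
import numpy as np


def slerp(a, b, t, eps=1e-8):
    """Spherical linear interpolation between two equally shaped weight tensors."""
    a_flat = a.ravel().astype(np.float64)
    b_flat = b.ravel().astype(np.float64)
    # Angle between the tensors, measured on their normalised directions.
    a_dir = a_flat / (np.linalg.norm(a_flat) + eps)
    b_dir = b_flat / (np.linalg.norm(b_flat) + eps)
    omega = np.arccos(np.clip(np.dot(a_dir, b_dir), -1.0, 1.0))
    sin_omega = np.sin(omega)
    if sin_omega < eps:
        # Nearly parallel tensors: fall back to plain linear interpolation.
        out = (1.0 - t) * a_flat + t * b_flat
    else:
        out = (np.sin((1.0 - t) * omega) * a_flat
               + np.sin(t * omega) * b_flat) / sin_omega
    return out.reshape(a.shape)


def merge_experts(experts):
    """Hypothetical fold (an assumption, not the repository's method):
    SLERP each expert into a running result with weight 1/(k+1)."""
    merged = experts[0]
    for k, expert in enumerate(experts[1:], start=1):
        merged = slerp(merged, expert, 1.0 / (k + 1))
    return merged
```

In this sketch, `merge_experts` would be called once per weight matrix with a list of the corresponding tensors from each expert, for example `merge_experts([w_expert_0, ..., w_expert_7])`, where the `w_expert_*` names are placeholders.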