Text Generation
Transformers
Safetensors
English
mixtral
Generated from Trainer
axolotl
conversational
Inference Endpoints
text-generation-inference
Crystalcareai commited on
Commit
fbd34cc
1 Parent(s): 03e64d4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -3
README.md CHANGED
@@ -38,14 +38,13 @@ It took 27 hours on 8xH100 provided by Crusoe Cloud.
38
 
39
  This model was fully fine-tuned, targeting all layers.
40
 
41
- The model is an extracted expert using SLERP and a custom script that we've open-sourced (I'll provide the link to the GitHub). It extracts a single expert which is the combined SLERP of all 8 experts from a Mixtral architecture. We decided to not fully convert to a dense model, for the sake of trying to keep as much of the original model's performance as possible, as this process is already quite surgical and there are a lot of variables to take into account.
42
 
43
  Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling.
44
 
45
  Dolphin is uncensored. We have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant with any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
46
 
47
- Dolphin is licensed Apache 2.0. I grant permission for any use, including commercial, that falls within accordance with Apache-2.0 license. Dolphin was trained on data generated from GPT4, among other models.
48
-
49
  ## Evals
50
 
51
  ![image/png](https://i.ibb.co/yNmCv76/file-nkvf-Q9-Mg-X57-GB7-Ayrl-YA2-Zsp.png)
 
38
 
39
  This model was fully fine-tuned, targeting all layers.
40
 
41
+ The model is an extracted expert using SLERP and a custom script that we've open-sourced. It extracts a single expert which is the combined SLERP of all 8 experts from a Mixtral architecture. We decided to not fully convert to a dense model, for the sake of trying to keep as much of the original model's performance as possible, as this process is already quite surgical and there are a lot of variables to take into account.
42
 
43
  Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling.
44
 
45
  Dolphin is uncensored. We have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant with any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
46
 
47
+ Dolphin is licensed under Apache 2.0. We grant permission for any use, including commercial, as long as it complies with the Apache-2.0 license. Dolphin was trained using data generated from GPT-4, among other models. For more details on the extraction process of the expert model, visit our GitHub repository: https://github.com/cognitivecomputations/extract-expert/tree/main
 
48
  ## Evals
49
 
50
  ![image/png](https://i.ibb.co/yNmCv76/file-nkvf-Q9-Mg-X57-GB7-Ayrl-YA2-Zsp.png)