Suparious commited on
Commit
23edc40
1 Parent(s): 3995440

Updating model summary

Browse files
Files changed (1) hide show
  1. README.md +26 -4
README.md CHANGED
@@ -3,6 +3,7 @@ base_model: alpindale/Mistral-7B-v0.2-hf
3
  library_name: transformers
4
  language:
5
  - en
 
6
  tags:
7
  - generated_from_trainer
8
  - quantized
@@ -12,6 +13,14 @@ tags:
12
  - endpoints_compatible
13
  - text-generation-inference
14
  - chatml
 
 
 
 
 
 
 
 
15
  model-index:
16
  - name: workspace/dolphin-2.8-mistral-7b
17
  results: []
@@ -33,16 +42,29 @@ prompt_template: '<|im_start|>system
33
 
34
  '
35
  ---
36
- # cognitivecomputations/dolphin-2.8-mistral-7b-v02 AWQ
37
 
38
  - Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
39
  - Original model: [dolphin-2.8-mistral-7b-v02](https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02)
40
 
 
 
41
  ## Model Summary
42
 
43
- This model is a fine-tuned version of [alpindale/Mistral-7B-v0.2-hf](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) on the None dataset.
44
- It achieves the following results on the evaluation set:
45
- - Loss: 0.4828
 
 
 
 
 
 
 
 
 
 
 
46
 
47
  ## How to use
48
 
 
3
  library_name: transformers
4
  language:
5
  - en
6
+ license: apache-2.0
7
  tags:
8
  - generated_from_trainer
9
  - quantized
 
13
  - endpoints_compatible
14
  - text-generation-inference
15
  - chatml
16
+ datasets:
17
+ - cognitivecomputations/dolphin
18
+ - cognitivecomputations/dolphin-coder
19
+ - cognitivecomputations/samantha-data
20
+ - jondurbin/airoboros-2.2.1
21
+ - teknium/openhermes-2.5
22
+ - m-a-p/Code-Feedback
23
+ - m-a-p/CodeFeedback-Filtered-Instruction
24
  model-index:
25
  - name: workspace/dolphin-2.8-mistral-7b
26
  results: []
 
42
 
43
  '
44
  ---
45
+ # cognitivecomputations/dolphin-2.8-mistral-7b-v02 🐬 AWQ
46
 
47
  - Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
48
  - Original model: [dolphin-2.8-mistral-7b-v02](https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02)
49
 
50
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png" width="600" />
51
+
52
  ## Model Summary
53
 
54
+ My appreciation for the sponsors of Dolphin 2.8:
55
+ - [Crusoe Cloud](https://crusoe.ai/) - provided excellent on-demand 10xL40S node
56
+ - [Winston Sou](https://twitter.com/WinsonDabbles) - Along with a generous anonymous sponsor, donated a massive personally owned compute resource!
57
+ - [Abacus AI](https://abacus.ai/) - my employer and partner in many things.
58
+
59
+ This model is based on [Mistral-7b-v0.2](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) a new base model released by MistralAI on March 23, 2024 but they have not yet published on HuggingFace. Thanks to @alpindale for converting / publishing.
60
+
61
+ The base model has 32k context, and the full-weights fine-tune was with 16k sequence lengths.
62
+
63
+ It took 3 days on 10x L40S provided by [Crusoe Cloud](https://crusoe.ai/)
64
+
65
+ Dolphin-2.8 has a variety of instruction, conversational, and coding skills.
66
+
67
+ This model is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant to any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
68
 
69
  ## How to use
70