ehartford committed on
Commit
dcac0c1
1 Parent(s): 895467e

Update README.md

  1. README.md +64 -0
README.md CHANGED
@@ -1,3 +1,67 @@
  ---
+ datasets:
+ - cognitivecomputations/dolphin
+ - jondurbin/airoboros-2.2.1
+ - cognitivecomputations/dolphin-coder
+ - teknium/openhermes
+ - ise-uiuc/Magicoder-OSS-Instruct-75K
+ - ise-uiuc/Magicoder-Evol-Instruct-110K
+ - m-a-p/Code-Feedback
+ - m-a-p/CodeFeedback-Filtered-Instruction
+ language:
+ - en
  license: bigcode-openrail-m
  ---
+
+ DolphinCoder StarCoder2 15b 🐬
+
+ Sponsored by [latitude.sh](https://www.latitude.sh/).
+
+ Join our Discord! https://discord.gg/cognitivecomputations
+
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png" width="600" />
+
+ This model is based on StarCoder2-15b and is subject to
+
+ This Dolphin is *really good* at coding; I trained it with a lot of coding data.
+
+ This model is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant with any request, even unethical ones. Please read my blog post about uncensored models: https://erichartford.com/uncensored-models
+ You are responsible for any content you create using this model. Enjoy responsibly.
+
+ ## Training
+ It took 3 days to train 3 epochs on 8x H100s using QLoRA and Axolotl.
+
+ Prompt format:
+ This model uses the ChatML prompt format.
+ ```
+ <|im_start|>system
+ You are DolphinCoder, a helpful AI programming assistant.<|im_end|>
+ <|im_start|>user
+ {prompt}<|im_end|>
+ <|im_start|>assistant
+
+ ```
+
+ Example:
+ ```
+ <|im_start|>system
+ You are DolphinCoder, a master at software engineering and coding in any programming language.<|im_end|>
+ <|im_start|>user
+ Please write me a program in golang that parses all the lines in a file, and reverses them character-wise, and saves it to a new file.<|im_end|>
+ <|im_start|>assistant
+ ```
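
For readers who want to build this prompt in code, here is a minimal Python sketch of the ChatML layout shown in the README. The helper name `build_chatml_prompt` and its parameters are hypothetical, chosen only for illustration; they are not part of any library or of the model's own tooling. The default system message is copied from the README's template.

```python
# Hypothetical helper (not from any library): assembles a ChatML prompt
# following the template shown in the README.
DEFAULT_SYSTEM = "You are DolphinCoder, a helpful AI programming assistant."


def build_chatml_prompt(user_prompt: str, system_prompt: str = DEFAULT_SYSTEM) -> str:
    """Return a ChatML string that ends with an open assistant turn."""
    return (
        f"<|im_start|>system\n{system_prompt}<|im_end|>\n"
        f"<|im_start|>user\n{user_prompt}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


if __name__ == "__main__":
    # Print the prompt for a short sample request.
    print(build_chatml_prompt("Reverse every line of a file and save the result."))
```

The string deliberately ends right after the `<|im_start|>assistant` line so that the model's generation continues as the assistant turn.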
+
+ ## Gratitude
+ - This model was made possible by the generous sponsorship of [latitude.sh](https://www.latitude.sh/).
+ - Huge thank you to [BigCode](https://www.bigcode-project.org/) for training and publishing the weights of StarCoder2.
+ - HUGE thank you to the dataset authors: @ise-uiuc, @teknium, @m-a-p.
+ - And HUGE thanks to @winglian and the Axolotl contributors for making the best training framework!
+ - [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
+ - Thank you to all the other people in the Open Source AI community who have taught me and helped me along the way.
+
+ ## Example Output
+
+
+ [If you would like to financially support my efforts](https://ko-fi.com/erichartford)
+
+ [swag](https://fa7113.myshopify.com/)