deepseek-ai
/

deepseek-coder-6.7b-instruct

Text Generation

text-generation-inference

Model card Files Files and versions

guoday commited on Nov 1, 2023

Commit

505e2a1

·

1 Parent(s): c050327

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ Deepseek Coder comprises a series of code language models trained on both 87% co
 ### 2. Model Summary
-deepseek-coder-5.7b-instruct is a 5.7B parameter model initialized from deepseek-coder-5.7b-base and fine-tuned on 2B tokens of instruction data.
 - **Home Page:** [DeepSeek](https://deepseek.com/)
 - **Repository:** [deepseek-ai/deepseek-coder](https://github.com/deepseek-ai/deepseek-coder)
 - **Chat With DeepSeek Coder:** [DeepSeek-Coder](https://coder.deepseek.com/)
@@ -32,8 +32,8 @@ Here give some examples of how to use our model.
 #### Chat Model Inference
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM
-tokenizer = AutoTokenizer.from_pretrained("deepseek-coder-5.7b-instruct", trust_remote_code=True)
-model = AutoModelForCausalLM.from_pretrained("deepseek-coder-5.7b-instruct", trust_remote_code=True).cuda()
 system_prompt = "You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.\n"
 messages=[
     { 'role': 'user', 'content': "write a quick sort algorithm in python."}

 ### 2. Model Summary
+deepseek-coder-6.7b-instruct is a 6.7B parameter model initialized from deepseek-coder-6.7b-base and fine-tuned on 2B tokens of instruction data.
 - **Home Page:** [DeepSeek](https://deepseek.com/)
 - **Repository:** [deepseek-ai/deepseek-coder](https://github.com/deepseek-ai/deepseek-coder)
 - **Chat With DeepSeek Coder:** [DeepSeek-Coder](https://coder.deepseek.com/)
 #### Chat Model Inference
 ```python
 from transformers import AutoTokenizer, AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("deepseek-coder-6.7b-instruct", trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained("deepseek-coder-6.7b-instruct", trust_remote_code=True).cuda()
 system_prompt = "You are an AI programming assistant, utilizing the Deepseek Coder model, developed by Deepseek Company, and you only answer questions related to computer science. For politically sensitive questions, security and privacy issues, and other non-computer science questions, you will refuse to answer.\n"
 messages=[
     { 'role': 'user', 'content': "write a quick sort algorithm in python."}