deepseek-coder-6.7B-chat
It was created by starting with the deepseek-coder-6.7B and training it on the open assistant dataset. We have attached the wandb report in pdf form to view the training run at a glance.
Reson
This model was fine tned to allow it to follow direction and is a steeping stone to further training, but still would be good for asking qestions about code.
How to use
You will need the transformers>=4.31
from transformers import AutoTokenizer
import transformers
import torch
model = "AIGym/deepseek-coder-6.7b-chat"
tokenizer = AutoTokenizer.from_pretrained(model)
pipeline = transformers.pipeline(
"text-generation",
model=model,
torch_dtype=torch.float16,
device_map="auto",
)
prompt = "What are the values in open source projects?"
formatted_prompt = (
f"### Human: {prompt}### Assistant:"
)
sequences = pipeline(
formatted_prompt,
do_sample=True,
top_k=50,
top_p = 0.7,
num_return_sequences=1,
repetition_penalty=1.1,
max_new_tokens=500,
)
for seq in sequences:
print(f"Result: {seq['generated_text']}")
Referrals
Run Pod - This is who I use to train th emodels on huggingface. If you use it we both get free crdits. - Visit Runpod's Website!
Paypal - If you want to leave a tip, it is appecaheted. - Visit My Paypal!
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 40.90 |
AI2 Reasoning Challenge (25-Shot) | 36.01 |
HellaSwag (10-Shot) | 53.74 |
MMLU (5-Shot) | 38.22 |
TruthfulQA (0-shot) | 42.94 |
Winogrande (5-shot) | 57.54 |
GSM8k (5-shot) | 16.98 |
- Downloads last month
- 502
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Evaluation results
- normalized accuracy on AI2 Reasoning Challenge (25-Shot)test set Open LLM Leaderboard36.010
- normalized accuracy on HellaSwag (10-Shot)validation set Open LLM Leaderboard53.740
- accuracy on MMLU (5-Shot)test set Open LLM Leaderboard38.220
- mc2 on TruthfulQA (0-shot)validation set Open LLM Leaderboard42.940
- accuracy on Winogrande (5-shot)validation set Open LLM Leaderboard57.540
- accuracy on GSM8k (5-shot)test set Open LLM Leaderboard16.980