
DeepSeek-Coder-Instruct-8x1.3b

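DeepSeek-Coder-Instruct-8x1.3b is a Mixture of Experts (MoE) model built with mergekit from eight copies of deepseek-ai/deepseek-coder-1.3b-instruct. All eight experts start from identical weights, and the router gates are randomly initialized (gate_mode: random).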

🧩 Configuration

base_model: deepseek-ai/deepseek-coder-1.3b-instruct
gate_mode: random
dtype: bfloat16
experts:
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
  - source_model: deepseek-ai/deepseek-coder-1.3b-instruct
    positive_prompts: [""]
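The name suggests 8 x 1.3B parameters, but the reported model size is 7.03B. A likely explanation is that only the feed-forward (MLP) blocks are replicated per expert, while the embeddings, attention layers, and norms are shared. The sketch below checks that arithmetic. The architecture numbers (hidden size 2048, MLP size 5504, 24 layers, vocabulary 32256, no biases, untied embeddings) are assumptions recalled from the base model's config, not values stated on this card, and `ModelShape`, `dense_params`, and `moe_params` are illustrative names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelShape:
    # Assumed values for deepseek-coder-1.3b-instruct (not stated on this card).
    hidden: int = 2048
    intermediate: int = 5504
    layers: int = 24
    vocab: int = 32256


def shared_params(s: ModelShape) -> int:
    """Parameters that exist once regardless of the number of experts."""
    embeddings = 2 * s.vocab * s.hidden        # input embedding + untied LM head
    attention = 4 * s.hidden * s.hidden        # q, k, v, o projections (no biases)
    norms = 2 * s.hidden                       # two RMSNorm weights per layer
    return embeddings + s.layers * (attention + norms) + s.hidden  # + final norm


def mlp_params(s: ModelShape) -> int:
    """One gated MLP block: gate, up, and down projections."""
    return 3 * s.hidden * s.intermediate


def dense_params(s: ModelShape) -> int:
    """Parameter count of the original dense model."""
    return shared_params(s) + s.layers * mlp_params(s)


def moe_params(s: ModelShape, experts: int) -> int:
    """Parameter count when each layer holds `experts` MLPs plus a linear router."""
    router = s.hidden * experts
    return shared_params(s) + s.layers * (experts * mlp_params(s) + router)


shape = ModelShape()
print(f"dense: {dense_params(shape) / 1e9:.2f}B")    # dense: 1.35B
print(f"8x MoE: {moe_params(shape, 8) / 1e9:.2f}B")  # 8x MoE: 7.03B
```

Under these assumptions the count matches the reported 7.03B. Because every expert begins as an identical copy and the gates are random, the merged model should initially behave much like the base model until it is fine-tuned.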
Format: Safetensors · Model size: 7.03B params · Tensor type: BF16