# llama-stampede-64x101m

llama-stampede-64x101m is a Mixture-of-Experts (MoE) merge created with mergekit. It combines 64 identical copies of [BEE-spoke-data/smol_llama-101M-GQA](https://huggingface.co/BEE-spoke-data/smol_llama-101M-GQA) as experts with randomly initialized routing (`gate_mode: random`), yielding a model of roughly 2.78B parameters in bfloat16. The full merge configuration is shown below.

## 🧩 Configuration

```yaml
base_model: BEE-spoke-data/smol_llama-101M-GQA
gate_mode: random
dtype: bfloat16
experts:
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
  - source_model: BEE-spoke-data/smol_llama-101M-GQA
    positive_prompts: [""]
```
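
## 💻 Usage

A configuration of this shape is what mergekit's MoE merge script (`mergekit-moe`) consumes; once the merged weights are exported, the model loads like any other causal LM checkpoint with 🤗 Transformers. The snippet below is a minimal, untested sketch: the model id (a local path or Hub repo for this merge) and the generation settings are illustrative assumptions, not part of the original card.

```python
# Minimal sketch: load the merged MoE and generate a short continuation.
# Assumes the merged model directory / repo id is "llama-stampede-64x101m".
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "llama-stampede-64x101m"  # placeholder: local path or Hub repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches `dtype: bfloat16` in the merge config
    device_map="auto",
)

prompt = "Once upon a time"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Since the gate was initialized randomly (`gate_mode: random`) and all experts are the same base model, outputs should closely resemble those of the base 101M model unless the router is subsequently trained.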