# Llama-3-8B-Instruct-MoE-3 / mergekit_moe_config.yml
base_model: meta-llama/Meta-Llama-3-8B-Instruct
gate_mode: random
dtype: bfloat16
experts_per_token: 2
experts:
- source_model: meta-llama/Meta-Llama-3-8B-Instruct
  positive_prompts: []
- source_model: meta-llama/Meta-Llama-3-8B-Instruct
  positive_prompts: []
- source_model: meta-llama/Meta-Llama-3-8B-Instruct
  positive_prompts: []
- source_model: meta-llama/Meta-Llama-3-8B-Instruct
  positive_prompts: []
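# Usage sketch (an assumption, not part of the uploaded file). mergekit's
# MoE entry point takes a config path and an output directory, so this
# config would typically be applied with something like:
#
#   mergekit-moe mergekit_moe_config.yml ./output-model-dir
#
# ./output-model-dir is a hypothetical path. Because gate_mode is "random",
# the router gates are randomly initialized, which is why the
# positive_prompts lists above can stay empty. The "hidden" and
# "cheap_embed" gate modes derive the gates from positive_prompts and
# would need non-empty lists.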