Mathmate-7B-MoE / mergekit_moe_config.yml
base_model: AI-MO/NuminaMath-7B-TIR
gate_mode: hidden
dtype: bfloat16
experts:
  - source_model: AI-MO/NuminaMath-7B-TIR
    positive_prompts:
      - "This model is good at solving high-school-level math questions and generating Python code for them"
  # - source_model: Qwen/Qwen2-Math-7B-Instruct
  #   positive_prompts:
  #     - "This model is really good at solving college-level to olympiad-level math questions"
  - source_model: deepseek-ai/DeepSeek-Prover-V1.5-RL
    positive_prompts:
      - "This model is good at formal theorem proving for math problems"
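# Usage sketch (an assumption, not part of the original upload): with mergekit
# installed, a config like this is typically passed to the mergekit-moe CLI
# along with an output directory. The output path below is hypothetical.
#
#   mergekit-moe mergekit_moe_config.yml ./Mathmate-7B-MoE
#
# With gate_mode: hidden, the router gates are initialized from hidden-state
# representations of each expert's positive_prompts, so the wording of those
# prompts influences which expert a given input is routed to.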