Haleshot/Mathmate-7B-MoE

Mathmate-7B-MoE / mergekit_moe_config.yml


# Base model that supplies the shared (non-expert) weights of the merge.
base_model: AI-MO/NuminaMath-7B-TIR
# Initialize the router gates from hidden-state representations of the prompts below.
gate_mode: hidden
dtype: bfloat16
experts:
  - source_model: AI-MO/NuminaMath-7B-TIR
    positive_prompts:
      - "This model is good at solving math questions at high school level and generating python code for the same"
  # Disabled expert, kept for reference:
  # - source_model: Qwen/Qwen2-Math-7B-Instruct
  #   positive_prompts:
  #     - "This model is really good at solving college level math to olympiad level questions"
  - source_model: deepseek-ai/DeepSeek-Prover-V1.5-RL
    positive_prompts:
      - "This model is good at formal theorem proving for math problems"
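
Before a long merge it can help to sanity-check the shape of a config like the one above. The following is a minimal sketch, not part of mergekit: `validate_moe_config`, `KNOWN_GATE_MODES`, and the rules it enforces are hypothetical and reflect only my reading of the mergekit MoE docs (three gate modes, and prompt-based gate modes needing at least one positive prompt per expert). The config is mirrored as a plain dict so the example needs no YAML parser, and the prompt strings are shortened placeholders.

```python
from typing import Any

# Assumption: the gate modes mergekit-moe accepts, per its documentation.
KNOWN_GATE_MODES = {"hidden", "cheap_embed", "random"}

# Plain-dict mirror of the active entries in mergekit_moe_config.yml.
# Prompt strings are shortened placeholders, not the exact prompts.
CONFIG: dict[str, Any] = {
    "base_model": "AI-MO/NuminaMath-7B-TIR",
    "gate_mode": "hidden",
    "dtype": "bfloat16",
    "experts": [
        {
            "source_model": "AI-MO/NuminaMath-7B-TIR",
            "positive_prompts": ["high school math and Python code"],
        },
        {
            "source_model": "deepseek-ai/DeepSeek-Prover-V1.5-RL",
            "positive_prompts": ["formal theorem proving"],
        },
    ],
}


def validate_moe_config(config: dict[str, Any]) -> list[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems: list[str] = []

    gate_mode = config.get("gate_mode")
    if gate_mode not in KNOWN_GATE_MODES:
        problems.append(f"unknown gate_mode: {gate_mode!r}")

    if not config.get("base_model"):
        problems.append("missing base_model")

    experts = config.get("experts") or []
    if not experts:
        problems.append("no experts listed")

    for i, expert in enumerate(experts):
        if not expert.get("source_model"):
            problems.append(f"expert {i}: missing source_model")
        # Prompt-based gate modes need something to derive the router from.
        if gate_mode != "random" and not expert.get("positive_prompts"):
            problems.append(f"expert {i}: no positive_prompts for gate_mode {gate_mode!r}")

    return problems


if __name__ == "__main__":
    for problem in validate_moe_config(CONFIG):
        print(problem)
```

Run against the dict above, the check reports no problems. Removing an expert's `positive_prompts` while `gate_mode` is `hidden` produces one message for that expert.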