RDson
/

Llama-3-Magenta-Instruct-4x8B-MoE

Text Generation

Mixture of Experts

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Llama-3-Magenta-Instruct-4x8B-MoE / README.md

RDson's picture

Update README.md

85e3ee8 verified about 1 month ago

|

raw history blame contribute delete

No virus

2.48 kB

	---
	tags:
	- moe
	- llama
	- '3'
	- llama 3
	- 4x8b
	---

	# Llama-3-Magenta-Instruct-4x8B-MoE
	<img src="https://i.imgur.com/c1Mv8cy.png" width="640"/>


	## You should also check out the updated [Llama-3-Peach-Instruct-4x8B-MoE](https://huggingface.co/RDson/Llama-3-Peach-Instruct-4x8B-MoE)!

	GGUF files are available here: [Llama-3-Magenta-Instruct-4x8B-MoE-GGUF](https://huggingface.co/RDson/Llama-3-Magenta-Instruct-4x8B-MoE-GGUF).

	This is a experimental MoE using Mergekit, created from
	* [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
	* [nvidia/Llama3-ChatQA-1.5-8B](https://huggingface.co/nvidia/Llama3-ChatQA-1.5-8B)
	* [Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R](https://huggingface.co/Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R)
	* [Muhammad2003/Llama3-8B-OpenHermes-DPO](https://huggingface.co/Muhammad2003/Llama3-8B-OpenHermes-DPO)

	Mergekit yaml file:
	```
	base_model: Meta-Llama-3-8B-Instruct
	experts:
	- source_model: Meta-Llama-3-8B-Instruct
	positive_prompts:
	- "explain"
	- "chat"
	- "assistant"
	- "think"
	- "roleplay"
	- "versatile"
	- "helpful"
	- "factual"
	- "integrated"
	- "adaptive"
	- "comprehensive"
	- "balanced"
	negative_prompts:
	- "specialized"
	- "narrow"
	- "focused"
	- "limited"
	- "specific"
	- source_model: ChatQA-1.5-8B
	positive_prompts:
	- "python"
	- "math"
	- "solve"
	- "code"
	- "programming"
	negative_prompts:
	- "sorry"
	- "cannot"
	- "factual"
	- "concise"
	- "straightforward"
	- "objective"
	- "dry"
	- source_model: SFR-Iterative-DPO-LLaMA-3-8B-R
	positive_prompts:
	- "chat"
	- "assistant"
	- "AI"
	- "instructive"
	- "clear"
	- "directive"
	- "helpful"
	- "informative"
	- source_model: Llama3-8B-OpenHermes-DPO
	positive_prompts:
	- "analytical"
	- "accurate"
	- "logical"
	- "knowledgeable"
	- "precise"
	- "calculate"
	- "compute"
	- "solve"
	- "work"
	- "python"
	- "code"
	- "javascript"
	- "programming"
	- "algorithm"
	- "tell me"
	- "assistant"
	negative_prompts:
	- "creative"
	- "abstract"
	- "imaginative"
	- "artistic"
	- "emotional"
	- "mistake"
	- "inaccurate"
	gate_mode: hidden
	dtype: float16
	```
	Some inspiration for the Mergekit yaml file is from [LoneStriker/Umbra-MoE-4x10.7-2.4bpw-h6-exl2](https://huggingface.co/LoneStriker/Umbra-MoE-4x10.7-2.4bpw-h6-exl2).