FusionNet_34Bx2_MoE

Fine-tuned model on English language using MoE method.

Model description

The FusionNet_34Bx2_MoE is a model to experiment with the MoE method, which could significantly increase the performance of the original model. The FusionNet_34Bx2_MoE has 60.8B parameters, and this model is fine-tuned. Enjoy!

Usage

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("TomGrc/FusionNet_34Bx2_MoE")
model = AutoModelForCausalLM.from_pretrained("TomGrc/FusionNet_34Bx2_MoE")

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	77.07
AI2 Reasoning Challenge (25-Shot)	72.95
HellaSwag (10-Shot)	86.22
MMLU (5-Shot)	77.05
TruthfulQA (0-shot)	71.31
Winogrande (5-shot)	83.98
GSM8k (5-shot)	70.89

Model tree for TomGrc/FusionNet_34Bx2_MoE

Quantizations

6 models

TomGrc
/

FusionNet_34Bx2_MoE

FusionNet_34Bx2_MoE

Model description

Usage

Open LLM Leaderboard Evaluation Results

Model tree for TomGrc/FusionNet_34Bx2_MoE

Evaluation results