From your work, I found a new way to do model ensembling (1) · #14 opened 9 months ago by xxx1
Adding Evaluation Results · #12 opened 10 months ago by leaderboard-pr-bot
The function_calling and translation abilities are weaker than Mixtral 8x7B (1) · #11 opened 11 months ago by bingw5
Add mixture of experts tag · #10 opened 12 months ago by davanstrien
How does this model work? Can you share your idea or training process? Thanks · #9 opened 12 months ago by zachzhou
Add merge tag (2) · #8 opened 12 months ago by osanseviero
VRAM (2) · #7 opened 12 months ago by DKRacingFan
Source code and paper? (8) · #6 opened 12 months ago by josephykwang
How does the MoE work? (3) · #5 opened 12 months ago by PacmanIncarnate
Quantized version, please? (6) · #4 opened 12 months ago by Yhyu13
What is your config? (1) · #3 opened 12 months ago by Weyaxi
Should not be called Mixtral; the models used to build the MoE are Yi-based (9) · #2 opened 12 months ago by teknium
Add merge tags · #1 opened 12 months ago by JusticeDike