Can you do Mistral-Nemo-12B-Instruct-2407, LLaMa-3.1-8B-Instruct, and, Mistral-Large-2407, Please πŸ™

#1
by Joseph717171 - opened

@kalomaze Can you please make a version of Mistral-Nemo-12B-Instruct-2407, LLaMa-3.1-8B-Instruct, and, Mistral-Large-2407 like this please. πŸ™

Joseph717171 changed discussion title from Can you do Mistral-Nemo-12B-Instruct and LLaMa-3.1-8B-Instruct to Can you do Mistral-Nemo-12B-Instruct and LLaMa-3.1-8B-Instruct, Please πŸ™
Joseph717171 changed discussion title from Can you do Mistral-Nemo-12B-Instruct and LLaMa-3.1-8B-Instruct, Please πŸ™ to Can you do Mistral-Nemo-12B-Instruct-2407 and LLaMa-3.1-8B-Instruct, Please πŸ™
Joseph717171 changed discussion title from Can you do Mistral-Nemo-12B-Instruct-2407 and LLaMa-3.1-8B-Instruct, Please πŸ™ to Can you do Mistral-Nemo-12B-Instruct-2407, LLaMa-3.1-8B-Instruct, and, Mistral-Large-2407, Please πŸ™

I understand that this was an experiment. But, I am super excited at the prospect of running a dense model as an MoE. πŸ˜‹

Sign up or log in to comment