Can you do Mistral-Nemo-12B-Instruct-2407, LLaMa-3.1-8B-Instruct, and Mistral-Large-2407, please?
#1
by
Joseph717171
- opened
@kalomaze Could you please make versions of Mistral-Nemo-12B-Instruct-2407, LLaMa-3.1-8B-Instruct, and Mistral-Large-2407 like this one?
I understand that this was an experiment, but I am super excited at the prospect of running a dense model as an MoE.