GGUF
English
Mixture of Experts
olmo
olmoe

error loading model

#1
by LaferriereJC - opened

llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'olmoe'

If you're using AI inference app that uses llama.cpp backend, you need to wait until the app is updated with recent release of llama.cpp that has this merge that enables olmoe support: https://github.com/ggerganov/llama.cpp/pull/9462 or you could download latest release of llama.cpp and run this olmoe model directly from command line.

Sign up or log in to comment