Base Models Beat Aligned Models at Randomness and Creativity
Paper • 2505.00047 • Published • 3
This model is a bit mroe capable of being used as a true "assistant" or "agent".
According to this paper: Base Models Beat Aligned Models at Randomness and Creativity; and to avoid any possible "GPT-isms", I decided to train on a base model. Think of it as more mallable clay vs re-shaping something that was already formed to be something else.
This is what led to the behavior observed in this model, where the model just legitimately doesn't understand being an "assistant" outside of being a character that is an assistant. SO while the model is probably not useful outside of RP, it is also not intended to be.
Base model
mistralai/Mistral-Nemo-Base-2407