Aleksey Korshuk

AlekseyKorshuk

AI & ML interests

LLM, ChatBots, RLHF, AI Alignment

Organizations

Posts 1

view post
Post
If you have to choose one small base language model <=3B for ChatML Code Assistant (SFT+DPO) to validate the approach on the dataset and tune hyperparams, so later retrain with a larger base model like Mistral/Mixtral, what model would you pick?
๐Ÿงต