What are the training hyperparameters?

#4
by tongyx361 - opened

Are they the same as LLaMA-2? What about Llemma?

Sign up or log in to comment