how did you train it?

#7
by Wei-Wu - opened

The result seems very promising. Did you train it from scratch and rely on the scaling law or did you prune from a much larger model.

Sign up or log in to comment