Am I correct in saying that this model just fine-tunes more efficiently than the standard distilled version?