Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
conceptofmind 
posted an update Jan 28
Post
A 1b dense causal language model begins to "saturate" in terms of accuracy around 5 epochs on 1.2T tokens.
In this post