tr3m-1B3-pile-checkpoints / global_step81000

Commit History

gelu_fast is the correct activation_function
4e6809d

bigscience-bot committed on