tr3m-1B3-pile-checkpoints / global_step18000

Commit History

gelu_fast is the correct activation_function
5be9f19

bigscience-bot commited on