tr3m-1B3-pile-checkpoints / global_step63000

Commit History

gelu_fast is the correct activation_function
1560b90

bigscience-bot committed on