Gradient Checkpointing

#5
by amadalincostea2 - opened

Does the model support gradient checkpointing?

Allen Institute for AI org

The OLMo codebase supports activation checkpointing.

But since you're asking here on Hugging Face rather than on GitHub, you probably want to know whether the Hugging Face version of OLMo supports it?

Same person, different account. Yes, I meant the Hugging Face version.

Allen Institute for AI org

@akshitab, do we have to do anything special to make activation checkpointing work?
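
For anyone who wants to check this on their own copy, here is a minimal sketch of how one might probe a loaded model. It assumes the model object follows the usual Transformers `PreTrainedModel` interface (the `supports_gradient_checkpointing` flag and the `gradient_checkpointing_enable()` method). Whether the custom-code OLMo class actually sets that flag is not confirmed in this thread, and the helper name `try_enable_gradient_checkpointing` is hypothetical.

```python
def try_enable_gradient_checkpointing(model):
    """Enable gradient checkpointing if the model advertises support.

    Returns True if checkpointing was enabled, False otherwise.
    """
    # Transformers models expose this class-level flag; a model loaded
    # through custom code may not set it, so default to False.
    if not getattr(model, "supports_gradient_checkpointing", False):
        return False
    model.gradient_checkpointing_enable()
    return True


# Hypothetical usage (model_id is a placeholder, not a confirmed repo name):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
# print(try_enable_gradient_checkpointing(model))
```

If the helper returns False, the model class probably needs explicit support added before checkpointing can be switched on from the Transformers side.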
