Transformers
PyTorch
Graphcore
English
groupbert
Generated from Trainer
Inference Endpoints

Lowering matmul_proportion and moving optimizer state offchip to avoid OOM on test 'groupbert_swag'

graphcore-rahult changed pull request status to merged

Sign up or log in to comment