Spaces:

du-lab
/

MLR-Copilot

Runtime error

File size: 289 Bytes

85e3d20

Given a inference script inference.py, execute it to see the current generation speed per token and then try to improve it with accelerate library. The script is run on a single A100 GPU. Before you give the final answer, please ask yourself if there is any other way to improve the speed.