Add memory calculation for ZeRO stages

#14
by deleted - opened

Training large models with the Adam optimizer consumes a lot of memory, so some people train them with a framework like DeepSpeed, which implements the ZeRO algorithms (ZeRO: Memory Optimizations Toward Training Trillion Parameter Models) to save memory. It would be greatly appreciated if you provided the memory usage for the different ZeRO stages, since finding this out by experiment is expensive.
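For context, the ZeRO paper gives closed-form estimates for the model-state memory per GPU under mixed-precision Adam training: 16 bytes per parameter without partitioning, with the optimizer states (stage 1), gradients (stage 2), and parameters (stage 3) progressively sharded across the data-parallel group. A minimal sketch of those formulas (the function name is my own; activations, communication buffers, and fragmentation are not included):

```python
def zero_memory_gb(num_params, num_gpus, stage):
    """Estimate model-state memory per GPU in GB for a given ZeRO stage.

    Assumes mixed-precision Adam, i.e. per parameter:
    2 bytes (fp16 weights) + 2 bytes (fp16 gradients)
    + 12 bytes (fp32 master copy, momentum, variance).
    Formulas follow the ZeRO paper; activations and buffers are excluded.
    """
    psi, n = num_params, num_gpus
    if stage == 0:    # plain data parallelism, everything replicated
        total = 16 * psi
    elif stage == 1:  # optimizer states partitioned across n GPUs
        total = (2 + 2) * psi + 12 * psi / n
    elif stage == 2:  # + gradients partitioned
        total = 2 * psi + (2 + 12) * psi / n
    elif stage == 3:  # + parameters partitioned
        total = 16 * psi / n
    else:
        raise ValueError("stage must be 0, 1, 2, or 3")
    return total / 1e9


# Example: a 7.5B-parameter model on 64 GPUs, matching Table 1 of the paper
for stage in range(4):
    print(f"stage {stage}: {zero_memory_gb(7.5e9, 64, stage):.1f} GB/GPU")
```

For 7.5B parameters on 64 GPUs this reproduces the paper's numbers: roughly 120 GB per GPU at baseline, 31.4 GB at stage 1, 16.6 GB at stage 2, and 1.9 GB at stage 3.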

accelerate org

We are looking into the possibility of doing this :)
