qwerrwe / docs /faq.md
winglian's picture
add to docs (#703)
a21935f unverified
|
raw
history blame
343 Bytes

Axolotl FAQ's

The trainer stopped and hasn't progressed in several minutes.

Usually an issue with the GPU's communicating with each other. See the NCCL doc

Exitcode -9

This usually happens when you run out of system RAM.

Exitcode -7 while using deepspeed

Try upgrading deepspeed w: pip install -U deepspeed