mpt-7b-storywriter-sharded / modeling_mpt.py

Commit History

✨ gradient checkpointing
ae54cae

pszemraj committed on

add MPTBlock to _no_split_modules
0688e28

pszemraj committed on

format
76b1322

pszemraj committed on

initial support for device_map=auto
304970e

pszemraj committed on

add sharded checkpoint
7ab236e

peter szemraj committed on