neuralink's picture
add the 70k checkpoint
4a65de7
{"type": "OptimizerFromGradientAccumulator", "parallelism": {"tp_size": "8", "dp_size": "8", "pp_size": "1"}, "configs": {}}