llama3-8b-infini-attention / optimizer /optimizer_config.json
neuralink's picture
neuralink HF staff
add ckp
0610800
{"type": "NamedOptimizer", "parallelism": {"tp_size": "4", "dp_size": "6", "pp_size": "1", "expert_parallel_size": "1"}, "configs": {}}