utahnlp/llama2_7b_sparsegpt_0.5_tulu2_sft_16gpu_bs128_sumloss_sparse_deepspeed Text Generation • Updated 9 days ago • 1
utahnlp/llama2_7b_magnitude_0.5_tulu2_sft_16gpu_bs128_sumloss_sparse_deepspeed Text Generation • Updated 9 days ago • 1