cygu/llama-2-7b-logit-watermark-distill-kgw-k1-gamma0.25-delta1 Text Generation • Updated May 1 • 19 • 1
thrunlab/sparse_sparse_80_percent_pretraining_warmup_20K_steps_5k Text Generation • Updated Feb 8 • 10
thrunlab/sparse_sparse_80_percent_pretraining_warmup_20K_0_2_steps_5k Text Generation • Updated Feb 9 • 10