cygu committed
Commit 3b8879e
1 Parent(s): 08b6482

Update README.md

Files changed (1): README.md (+4 −26)
README.md CHANGED
@@ -1,31 +1,13 @@
  ---
  tags:
  - generated_from_trainer
- model-index:
- - name: pythia-1.4b-sampling-watermark-distill-kgw-k2-gamma0.25-delta2
-   results: []
+ - pythia
+ license: apache-2.0
  ---

- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
- # pythia-1.4b-sampling-watermark-distill-kgw-k2-gamma0.25-delta2
-
- This model is a fine-tuned version of [/scr-ssd/cygu/weights/pythia-1.4b/](https://huggingface.co//scr-ssd/cygu/weights/pythia-1.4b/) on an unknown dataset.
-
  ## Model description

- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
+ Sampling-based watermark distilled [Pythia 1.4B](https://huggingface.co/EleutherAI/pythia-1.4b) using the KGW \\(k=2, \gamma=0.25, \delta=2\\) watermarking strategy in the paper [On the Learnability of Watermarks for Language Models](https://arxiv.org/abs/2312.04469).

  ### Training hyperparameters

@@ -39,11 +21,7 @@ The following hyperparameters were used during training:
  - lr_scheduler_type: cosine
  - lr_scheduler_warmup_steps: 500
  - num_epochs: 1.0
-
- ### Training results
-
-
-
+ -
  ### Framework versions

  - Transformers 4.29.2
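
For context on the new model description: the KGW strategy biases generation toward a pseudorandom "green list" of tokens, reseeded from the preceding \\(k\\) tokens, covering a \\(\gamma\\) fraction of the vocabulary, with \\(\delta\\) added to green-token logits. A minimal illustrative sketch under those parameters — not the paper's reference implementation; the function names and hashing choice here are assumptions:

```python
import hashlib
import random

def green_list(prev_tokens, vocab_size, gamma=0.25, k=2):
    """Pseudorandomly pick a gamma-fraction "green list" of token ids,
    seeded by the last k context tokens (k=2 in this model's setup)."""
    seed = hashlib.sha256(str(tuple(prev_tokens[-k:])).encode("utf-8")).hexdigest()
    rng = random.Random(seed)
    ids = list(range(vocab_size))
    rng.shuffle(ids)
    return set(ids[: int(gamma * vocab_size)])

def watermark_logits(logits, prev_tokens, gamma=0.25, delta=2.0, k=2):
    """Add a bias of delta to every green-token logit before sampling."""
    greens = green_list(prev_tokens, len(logits), gamma=gamma, k=k)
    return [x + delta if i in greens else x for i, x in enumerate(logits)]
```

Distillation then trains the student to match samples drawn from the teacher under this biased decoder, so the student emits watermarked text without any decoding-time modification.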
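
The scheduler kept in the hyperparameters (cosine with 500 warmup steps) ramps the learning rate linearly to its peak, then decays it along a half-cosine. A minimal sketch of that shape — the helper name and `base_lr` value are illustrative, not the Trainer's actual implementation:

```python
import math

def cosine_with_warmup(step, total_steps, warmup_steps=500, base_lr=1e-5):
    """Linear warmup to base_lr over warmup_steps, then cosine decay
    toward zero over the remaining steps."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```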