MiniLLM
/

MiniLLM-gpt2-760M

Text Generation

Model card Files Files and versions Community

t1101675 commited on Sep 26, 2024

Commit

1eb1bba

·

verified ·

1 Parent(s): 79e7dcb

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ pipeline_tag: text-generation
 ---
-# MiniLLM/MiniLLM-gpt2-760M
 [paper](https://arxiv.org/abs/2306.08543) | [code](https://github.com/microsoft/LMOps/tree/main/minillm)
@@ -22,7 +22,7 @@ pipeline_tag: text-generation
     <img src="https://cdn-uploads.huggingface.co/production/uploads/624ac662102fcdff87be51b9/7hBWGZzYMJihCRQ70XoiQ.png" width="1000">
 </p>
-**Note**: MiniLLM requires a [SFT model]() for initilization to perform the PPO optimization.
 ## Evaluation
@@ -33,9 +33,9 @@ We ask GPT-4 to give scores for the generated responses of MiniLLM. The prompts
 </p>
 ## Baseline Models
-+ [SFT w/o KD]()
-+ [KD]()
-+ [SeqKD]()
 ## Citation
 ```

 ---
+# MiniLLM-gpt2-760M
 [paper](https://arxiv.org/abs/2306.08543) | [code](https://github.com/microsoft/LMOps/tree/main/minillm)
     <img src="https://cdn-uploads.huggingface.co/production/uploads/624ac662102fcdff87be51b9/7hBWGZzYMJihCRQ70XoiQ.png" width="1000">
 </p>
+**Note**: MiniLLM requires a [SFT model](https://huggingface.co/MiniLLM/init-gpt2-760M) for initilization to perform the PPO optimization.
 ## Evaluation
 </p>
 ## Baseline Models
++ [SFT w/o KD](https://huggingface.co/MiniLLM/SFT-gpt2-760M)
++ [KD](https://huggingface.co/MiniLLM/KD-gpt2-760M)
++ [SeqKD](https://huggingface.co/MiniLLM/SeqKD-gpt2-760M)
 ## Citation
 ```