IDEA-CCNL
/

Wenzhong-GPT2-3.5B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Zimix commited on Nov 25, 2021

Commit

6a0aad9

•

1 Parent(s): 19d6f53

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -9,6 +9,17 @@ widget:
 As we all know, the single direction language model based on decoder structure has strong generation ability, such as GPT model. **The 3.5 billion parameter Wenzhong-3.5B large model, using 100G chinese common data, 32 A100 training for 28 hours,** is the largest open source **GPT2 large model of chinese**. **Our model performs well in Chinese continuation generation.**
 ## Usage
 ```python
 from transformers import pipeline, set_seed

 As we all know, the single direction language model based on decoder structure has strong generation ability, such as GPT model. **The 3.5 billion parameter Wenzhong-3.5B large model, using 100G chinese common data, 32 A100 training for 28 hours,** is the largest open source **GPT2 large model of chinese**. **Our model performs well in Chinese continuation generation.**
 ## Usage
+### load model
+```python
+from transformers import GPT2Tokenizer, GPT2Model
+tokenizer = GPT2Tokenizer.from_pretrained('IDEA-CCNL/Wenzhong-3.5B')
+model = GPT2Model.from_pretrained('IDEA-CCNL/Wenzhong-3.5B')
+text = "Replace me by any text you'd like."
+encoded_input = tokenizer(text, return_tensors='pt')
+output = model(**encoded_input)
+```
+### generation
 ```python
 from transformers import pipeline, set_seed