uer/pegasus-large-chinese-cluecorpussmall

Text2Text Generation


uer committed on Oct 25, 2023

Commit 31cdee4 · 1 Parent(s): 26f74e6

Update README.md

Files changed (1)

README.md +3 -3

@@ -43,7 +43,7 @@ The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tence
 Taking the case of PEGASUS-Base
 ```
-python3 preprocess.py --corpus_path corpora/cluecorpussmall.txt \
                       --vocab_path models/google_zh_vocab.txt \
                       --dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
                       --processes_num 32 --seq_length 512 \
@@ -63,8 +63,8 @@ python3 pretrain.py --dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
 Finally, we convert the pre-trained model into Huggingface's format:
 ```
-python3 scripts/convert_pegasus_from_uer_to_huggingface.py --input_model_path models/cluecorpussmall_pegasus_base_seq512_model.bin-1000000 \
-                                                           --output_model_path pytorch_model.bin \
                                                            --layers_num 12
 ```

 Taking the case of PEGASUS-Base
 ```
+python3 preprocess.py --corpus_path corpora/cluecorpussmall_bert.txt \
                       --vocab_path models/google_zh_vocab.txt \
                       --dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
                       --processes_num 32 --seq_length 512 \
 Finally, we convert the pre-trained model into Huggingface's format:
 ```
+python3 scripts/convert_pegasus_from_uer_to_huggingface.py --input_model_path models/cluecorpussmall_pegasus_base_seq512_model.bin-1000000 \
+                                                           --output_model_path pytorch_model.bin \
                                                            --layers_num 12
 ```
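
The first hunk changes the corpus file passed to `preprocess.py` from `corpora/cluecorpussmall.txt` to `corpora/cluecorpussmall_bert.txt`, so a local checkout that still has only the old file name would fail at the preprocessing step. A minimal sketch of a pre-flight check, assuming a POSIX shell; `corpus_arg` is a hypothetical helper written for illustration and is not part of UER-py:

```shell
# Hypothetical helper (not part of UER-py): print the --corpus_path
# argument for preprocess.py, or fail early if the file is missing.
corpus_arg() {
    path="$1"
    if [ ! -f "$path" ]; then
        # Report the missing file on stderr and signal failure.
        echo "corpus not found: $path" >&2
        return 1
    fi
    printf '%s %s' '--corpus_path' "$path"
}
```

Calling `corpus_arg corpora/cluecorpussmall_bert.txt` before launching the preprocessing command would surface a stale path immediately instead of partway through the run.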