Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ The model is pre-trained by [UER-py](https://github.com/dbiir/UER-py/) on [Tence
|
|
43 |
Taking the case of PEGASUS-Base
|
44 |
|
45 |
```
|
46 |
-
python3 preprocess.py --corpus_path corpora/
|
47 |
--vocab_path models/google_zh_vocab.txt \
|
48 |
--dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
|
49 |
--processes_num 32 --seq_length 512 \
|
@@ -63,8 +63,8 @@ python3 pretrain.py --dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
|
|
63 |
Finally, we convert the pre-trained model into Huggingface's format:
|
64 |
|
65 |
```
|
66 |
-
python3 scripts/convert_pegasus_from_uer_to_huggingface.py --input_model_path models/cluecorpussmall_pegasus_base_seq512_model.bin-1000000 \
|
67 |
-
--output_model_path pytorch_model.bin \
|
68 |
--layers_num 12
|
69 |
```
|
70 |
|
|
|
43 |
Taking the case of PEGASUS-Base
|
44 |
|
45 |
```
|
46 |
+
python3 preprocess.py --corpus_path corpora/cluecorpussmall_bert.txt \
|
47 |
--vocab_path models/google_zh_vocab.txt \
|
48 |
--dataset_path cluecorpussmall_pegasus_seq512_dataset.pt \
|
49 |
--processes_num 32 --seq_length 512 \
|
|
|
63 |
Finally, we convert the pre-trained model into Huggingface's format:
|
64 |
|
65 |
```
|
66 |
+
python3 scripts/convert_pegasus_from_uer_to_huggingface.py --input_model_path models/cluecorpussmall_pegasus_base_seq512_model.bin-1000000 \
|
67 |
+
--output_model_path pytorch_model.bin \
|
68 |
--layers_num 12
|
69 |
```
|
70 |
|