shibing624 committed
Commit de09899 · 1 Parent(s): 158150b

Update README.md

README.md CHANGED
@@ -23,7 +23,11 @@ license: "apache-2.0"
 ## Usage
 
 This model is open-sourced as part of the Chinese text correction project [pycorrector](https://github.com/shibing624/pycorrector), which supports T5 models. It can be called as follows:
+```
+pip install -U pycorrector
+```
 
+run:
 ```python
 from pycorrector.t5.t5_corrector import T5Corrector
 nlp = T5Corrector("shibing624/mengzi-t5-base-chinese-correction").batch_t5_correct
@@ -48,6 +52,8 @@ mengzi-t5-base-chinese-correction
 `-- tokenizer.json
 ```
 
+To train t5-correction, see [https://github.com/shibing624/pycorrector/tree/master/pycorrector/t5](https://github.com/shibing624/pycorrector/tree/master/pycorrector/t5)
+
 ### Training dataset
 #### SIGHAN+Wang271K Chinese text correction dataset
 
@@ -74,16 +80,7 @@ SIGHAN+Wang271K Chinese text correction dataset, data format:
 ]
 ```
 
-```shell
-macbert4csc
-├── config.json
-├── pytorch_model.bin
-├── special_tokens_map.json
-├── tokenizer_config.json
-└── vocab.txt
-```
 
-To train t5-correction, see [https://github.com/shibing624/pycorrector/tree/master/pycorrector/t5](https://github.com/shibing624/pycorrector/tree/master/pycorrector/t5)
 
 
 ## Citation
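The Usage snippet in this change is cut off at the hunk boundary, right after `batch_t5_correct` is bound to `nlp`. As a rough sketch of how that callable might be used, the helper below feeds sentences to it in fixed-size chunks. It assumes `batch_t5_correct` accepts a list of strings and returns one result per input sentence; the diff does not confirm this, and it says nothing about the shape of each result. `load_corrector`, `correct_in_chunks`, and `chunk_size` are hypothetical names introduced here for illustration, not part of pycorrector.

```python
from typing import Any, Callable, List, Sequence

MODEL_NAME = "shibing624/mengzi-t5-base-chinese-correction"


def load_corrector(model_name: str = MODEL_NAME) -> Callable[[List[str]], List[Any]]:
    """Build the batch-correction callable the same way the README snippet does."""
    # Imported lazily so that correct_in_chunks can be used (and tested)
    # on a machine where pycorrector is not installed.
    from pycorrector.t5.t5_corrector import T5Corrector

    return T5Corrector(model_name).batch_t5_correct


def correct_in_chunks(
    sentences: Sequence[str],
    corrector: Callable[[List[str]], List[Any]],
    chunk_size: int = 32,
) -> List[Any]:
    """Run `corrector` over `sentences` in chunks of at most `chunk_size`.

    Assumption (not confirmed by the README): `corrector` takes a list of
    strings and returns a list with one entry per input sentence.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    results: List[Any] = []
    for start in range(0, len(sentences), chunk_size):
        chunk = list(sentences[start:start + chunk_size])
        results.extend(corrector(chunk))
    return results
```

With pycorrector installed, `correct_in_chunks(sentences, load_corrector())` would run the model from the README over a list of sentences. Passing the corrector as a parameter keeps the chunking logic independent of the model, so it can be exercised with a stand-in function.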