qinluo committed
Commit 7ee236e
1 Parent(s): 9e1c1f2

Update README.md

Files changed (1)
  1. README.md +32 -1
README.md CHANGED
@@ -17,7 +17,7 @@ The pytorch model is available at: https://github.com/JunnYu/WoBERT_pytorch
pip install git+https://github.com/JunnYu/WoBERT_pytorch.git
```

- ## Usage
+ ## TF Example
```python
from transformers import TFBertForMaskedLM as WoBertForMaskedLM
from wobert import WoBertTokenizer
@@ -46,6 +46,37 @@ print(outputs_sentence)
# 今天[天气|阳光|天|心情|空气]很好,我[想|要|打算|准备|就]去公园玩。

```
+
+ ## Pytorch Example
+ ```python
+ from transformers import BertForMaskedLM as WoBertForMaskedLM
+ from wobert import WoBertTokenizer
+
+
+ pretrained_model_or_path = 'qinluo/wobert-chinese-plus'
+
+ tokenizer = WoBertTokenizer.from_pretrained(pretrained_model_or_path)
+ model = WoBertForMaskedLM.from_pretrained(pretrained_model_or_path)
+
+ text = '今天[MASK]很好,我[MASK]去公园玩。'
+ inputs = tokenizer(text, return_tensors='pt')
+ outputs = model(**inputs).logits[0]
+
+ outputs_sentence = ''
+ for i, id in enumerate(tokenizer.encode(text)):
+     if id == tokenizer.mask_token_id:
+         tokens = tokenizer.convert_ids_to_tokens(outputs[i].topk(k=5)[1])
+         outputs_sentence += '[' + '|'.join(tokens) + ']'
+     else:
+         outputs_sentence += ''.join(tokenizer.convert_ids_to_tokens([id], skip_special_tokens=True))
+
+ print(outputs_sentence)
+
+ # 今天[天气|阳光|天|心情|空气]很好,我[想|要|打算|准备|就]去公园玩。
+
+ ```
+
+
## Citation
Bibtex:
```tex