IDEA-CCNL/Erlangshen-Roberta-330M-NLI


suolyer committed on Apr 19, 2022

Commit 6232cd0 • 1 Parent(s): 784f182

Update README.md

Files changed (1)

README.md +45 -0

@@ -1,3 +1,48 @@
 ---
+language:
+  - zh
 license: apache-2.0
+tags:
+- bert
+- NLU
+- NLI
+inference: false
 ---
+# Erlangshen-Roberta-330M-NLI model (Chinese), one model of [Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM).
+We collected 4 Chinese NLI (Natural Language Inference) datasets for fine-tuning, with a total of 1,014,787 samples. Our model is mainly based on [roberta](https://huggingface.co/hfl/chinese-roberta-wwm-ext).
+## Usage
+```python
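+import torch
+from transformers import BertForSequenceClassification, BertTokenizer
+
+tokenizer = BertTokenizer.from_pretrained('IDEA-CCNL/Erlangshen-Roberta-330M-NLI')
+model = BertForSequenceClassification.from_pretrained('IDEA-CCNL/Erlangshen-Roberta-330M-NLI')
+
+texta = '今天的饭不好吃'  # "Today's meal doesn't taste good"
+textb = '今天心情不好'  # "I'm in a bad mood today"
+
+# Encode the sentence pair as a single sequence and add a batch dimension.
+input_ids = torch.tensor([tokenizer.encode(texta, textb)])
+with torch.no_grad():  # inference only, no gradients needed
+    output = model(input_ids)
+# Softmax over the last dimension turns the logits into class probabilities.
+print(torch.nn.functional.softmax(output.logits, dim=-1))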
+```
+## Scores on downstream Chinese tasks (without any data augmentation)
+| Model | cmnli | ocnli | snli |
+| :--------: | :-----: | :----: | :-----: |
+| Erlangshen-Roberta-110M-NLI | 80.83 | 78.56 | 88.01 |
+| Erlangshen-Roberta-330M-NLI | 82.25 | 79.82 | 88 |
+## Citation
+If you find this resource useful, please cite the following website in your paper.
+```
+@misc{Fengshenbang-LM,
+  title={Fengshenbang-LM},
+  author={IDEA-CCNL},
+  year={2021},
+  howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
+}
+```
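
Note on the Usage snippet in the diff above: its last line applies a softmax to the model's output logits and prints the resulting probabilities. The following is a minimal, standard-library sketch of that step, plus picking the most likely class. The logit values are made up for illustration and are not model output; the sketch also assumes three output classes, which is typical for NLI but is not stated in the README. The helper names `softmax` and `top_class` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)  # subtract the max so exp() cannot overflow
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_class(logits):
    """Return (index, probability) of the most likely class."""
    probs = softmax(logits)
    idx = max(range(len(probs)), key=probs.__getitem__)
    return idx, probs[idx]


# Hypothetical logits for one sentence pair (illustrative values only).
example_logits = [0.2, 2.1, -1.3]
probs = softmax(example_logits)
idx, p = top_class(example_logits)
print(probs, idx, p)
```

The README does not say which index corresponds to which NLI label, so the index returned here should be mapped to a label name using the model's own configuration rather than an assumed order.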