nghuyong commited on
Commit
a471ead
1 Parent(s): c695af3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ERNIE-2.0-large
2
+
3
+ ## Introduction
4
+ ERNIE-health is a Chinese biomedical language model pre-trained from in-domain text of de-identified online doctor-patient dialogues, electronic medical records, and textbooks.
5
+
6
+ More detail:
7
+ https://github.com/PaddlePaddle/Research/tree/master/KG/eHealth
8
+ https://github.com/PaddlePaddle/PaddleNLP/tree/develop/model_zoo/ernie-health
9
+ https://arxiv.org/pdf/2110.07244.pdf
10
+
11
+ ## Released Model Info
12
+
13
+ |Model Name|Language|Model Structure|
14
+ |:---:|:---:|:---:|
15
+ |ernie-health-zh| Chinese |Layer:12, Hidden:768, Heads:12|
16
+
17
+ This released pytorch model is converted from the officially released PaddlePaddle ERNIE model and
18
+ a series of experiments have been conducted to check the accuracy of the conversion.
19
+
20
+ - Official PaddlePaddle ERNIE repo:https://github.com/PaddlePaddle/Research/tree/master/KG/eHealth
21
+ - Pytorch Conversion repo: https://github.com/nghuyong/ERNIE-Pytorch
22
+
23
+ ## How to use
24
+ ```Python
25
+ from transformers import AutoTokenizer, AutoModel
26
+ tokenizer = AutoTokenizer.from_pretrained("nghuyong/ernie-health-zh")
27
+ model = AutoModel.from_pretrained("nghuyong/ernie-health-zh")
28
+ ```
29
+
30
+ ## Citation
31
+
32
+ ```bibtex
33
+ @article{wang2021building,
34
+ title={Building Chinese Biomedical Language Models via Multi-Level Text Discrimination},
35
+ author={Wang, Quan and Dai, Songtai and Xu, Benfeng and Lyu, Yajuan and Zhu, Yong and Wu, Hua and Wang, Haifeng},
36
+ journal={arXiv preprint arXiv:2110.07244},
37
+ year={2021}
38
+ }
39
+ ```