Randolphzeng commited on
Commit
4039771
1 Parent(s): 4a98b2b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Abstract
2
+ This is a Chinese transformer-xl model trained on [Wudao dataset](https://resource.wudaoai.cn/home?ind&name=WuDaoCorpora%202.0&id=1394901288847716352)
3
+ and finetuned on a denoise dataset constructed by our team. The denoise task is to reconstruct a fluent and clean text from a noisy input which includes random insertion/swap/deletion/replacement/sentence reordering.
4
+
5
+ ## Usage
6
+
7
+ ### load model
8
+ ```python
9
+ from fengshen.models.transfo_xl_denoise.tokenization_transfo_xl_denoise import TransfoXLDenoiseTokenizer
10
+ from fengshen.models.transfo_xl_denoise.modeling_transfo_xl_denoise import TransfoXLDenoiseModel
11
+
12
+ model = TransfoXLDenoiseModel.from_pretrained("model_path or model name")
13
+ tokenizer = TransfoXLDenoiseTokenizer("model_path or model name")
14
+ ```
15
+
16
+ ### generation
17
+ to be updated later
18
+
19
+
20
+
21
+ ## Citation
22
+ If you find the resource is useful, please cite the following website in your paper.
23
+ ```
24
+ @misc{Fengshenbang-LM,
25
+ title={Fengshenbang-LM},
26
+ author={IDEA-CCNL},
27
+ year={2022},
28
+ howpublished={\url{https://github.com/IDEA-CCNL/Fengshenbang-LM}},
29
+ }
30
+ ```