---
language:
- zh
pipeline_tag: fill-mask
---
This is a masked language model (MLM) built on the RWKV6 architecture. The architecture looks like:
![MLM](network.png "Network")
So far it has been trained on only 9% of the CCIA2 corpus, which biases it toward web-crawled Chinese data.
To try it, run the test script at https://github.com/yynil/rwkv_lm_ext_runner/blob/main/tests/test_mlm.py
The result looks like:
```text
texts = ['法国的首都在[MASK]。',
'[MASK]首都在北京。',
'生活的真谛是[MASK]。',
'在二战中,阿道夫·希特勒是[MASK]。',
'1949年十月一号,发生了一件大事,那就是中华人民共和国[MASK]。',
'原子核的行星模型,现在普遍认为是[MASK]。',
'根据量子场论,质量来自[MASK]的作用。',
'雨后,彩虹出现在天边,小美陶醉地说:"真[MASK]啊!"',]
法国的首都在巴黎。 <|endoftext|> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> prob is 0.6197544932365417 cum_prob is 0.6197544932365417
中国首都在北京。 <|endoftext|> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> prob is 0.08540000021457672 cum_prob is 0.08540000021457672
生活的真谛是快乐。 <|endoftext|> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> <eop> prob is 0.12731894850730896 cum_prob is 0.12731894850730896
在二战中,阿道夫·希特勒是领导者。 <|endoftext|> <eop> <eop> <eop> <eop> prob is 0.07036320120096207 cum_prob is 0.07036320120096207
1949年十月一号,发生了一件大事,那就是中华人民共和国成立。 <|endoftext|> <eop> <eop> prob is 0.7657460570335388 cum_prob is 0.7657460570335388
原子核的行星模型,现在普遍认为是行星。 <|endoftext|> <eop> <eop> <eop> <eop> <eop> <eop> prob is 0.043744392693042755 cum_prob is 0.043744392693042755
根据量子场论,质量来自质量的作用。 <|endoftext|> <eop> <eop> <eop> <eop> <eop> <eop> <eop> prob is 0.19818857312202454 cum_prob is 0.19818857312202454
雨后,彩虹出现在天边,小美陶醉地说:"真美啊!" <|endoftext|> prob is 0.5407435297966003 cum_prob is 0.5407435297966003
```
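
Each output line above carries the filled sentence, a run of special tokens, and two probabilities. The following is a minimal sketch for post-processing such lines. It assumes every line ends in `prob is <float> cum_prob is <float>` and that `<|endoftext|>` and `<eop>` are terminator/padding tokens that can be discarded. Both assumptions are inferred from the sample output, not from the repository's documentation, and `parse_result_line` is a hypothetical helper, not part of `rwkv_lm_ext_runner`.

```python
import re

# Special tokens seen in the sample output. Treating them as
# discardable padding/terminators is an assumption.
SPECIAL_TOKENS = ("<|endoftext|>", "<eop>")

# Captures the sentence, then the trailing "prob is X cum_prob is Y" suffix.
RESULT_RE = re.compile(
    r"^(?P<text>.*?)\s*prob is (?P<prob>\S+)\s+cum_prob is (?P<cum_prob>\S+)\s*$"
)


def parse_result_line(line):
    """Return (filled_text, prob, cum_prob) for one output line."""
    match = RESULT_RE.match(line)
    if match is None:
        raise ValueError("line does not match the expected output format")
    text = match.group("text")
    # Drop the special tokens, then trim the whitespace they leave behind.
    for token in SPECIAL_TOKENS:
        text = text.replace(token, "")
    return text.strip(), float(match.group("prob")), float(match.group("cum_prob"))


if __name__ == "__main__":
    sample = (
        "法国的首都在巴黎。 <|endoftext|> <eop> <eop> "
        "prob is 0.6197544932365417 cum_prob is 0.6197544932365417"
    )
    text, prob, cum_prob = parse_result_line(sample)
    print(text, prob, cum_prob)
```

In the sample output every prompt has a single `[MASK]`, so `prob` and `cum_prob` are equal on each line. How the two differ for prompts with several masks is not shown here.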