Yura Kuratov committed on
Commit 5415204
Parent: f8d1042

upload model with MLM and NSP heads

Files changed (3)
  1. README.md +1 -0
  2. flax_model.msgpack +2 -2
  3. pytorch_model.bin +2 -2
README.md CHANGED
@@ -6,6 +6,7 @@ language: en
 
  Conversational BERT (English, cased, 12-layer, 768-hidden, 12-heads, 110M parameters) was trained on the English part of Twitter, Reddit, DailyDialogues[1], OpenSubtitles[2], Debates[3], Blogs[4], Facebook News Comments. We used this training data to build the vocabulary of English subtokens and took English cased version of BERT-base as an initialization for English Conversational BERT.
 
+ 08.11.2021: upload model with MLM and NSP heads
 
  [1]: Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset. IJCNLP 2017.
 
flax_model.msgpack CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:43b49cc4b3a75f2ca3f7b0f9389ea17cd386306731f363786fabac6dece4e601
- size 433248237
+ oid sha256:e7ff9a607bca83c520df04906ea08b6f280584382d6d3fbdcdf3762eeaf45fc0
+ size 435739144
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8288f7f19536773513294ff913cbff8c924a141b1a9ccbb212248aaedf419dd4
- size 433284118
+ oid sha256:58656f54d29a94eb09ac30516af19135aa930b31d6cd64b2481172396758e838
+ size 435820470
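
Both weight files are stored as Git LFS pointers, so the diff shows only a new hash and a new size for each. As a rough check of what changed, the sketch below parses the old and new pointers from this commit and computes how much each file grew. The helper names `parse_lfs_pointer` and `size_delta` are hypothetical and written only for this illustration. The growth is in line with the commit message about added MLM and NSP heads, but the pointers alone do not show which tensors were added.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def size_delta(old_pointer: str, new_pointer: str) -> int:
    """Return the change in file size (bytes) between two LFS pointers."""
    old_size = int(parse_lfs_pointer(old_pointer)["size"])
    new_size = int(parse_lfs_pointer(new_pointer)["size"])
    return new_size - old_size


# Pointer contents copied from the diffs in this commit.
OLD_FLAX = """version https://git-lfs.github.com/spec/v1
oid sha256:43b49cc4b3a75f2ca3f7b0f9389ea17cd386306731f363786fabac6dece4e601
size 433248237
"""
NEW_FLAX = """version https://git-lfs.github.com/spec/v1
oid sha256:e7ff9a607bca83c520df04906ea08b6f280584382d6d3fbdcdf3762eeaf45fc0
size 435739144
"""
OLD_PYTORCH = """version https://git-lfs.github.com/spec/v1
oid sha256:8288f7f19536773513294ff913cbff8c924a141b1a9ccbb212248aaedf419dd4
size 433284118
"""
NEW_PYTORCH = """version https://git-lfs.github.com/spec/v1
oid sha256:58656f54d29a94eb09ac30516af19135aa930b31d6cd64b2481172396758e838
size 435820470
"""

if __name__ == "__main__":
    print("flax_model.msgpack grew by", size_delta(OLD_FLAX, NEW_FLAX), "bytes")
    print("pytorch_model.bin grew by", size_delta(OLD_PYTORCH, NEW_PYTORCH), "bytes")
```

Each file grows by roughly 2.5 MB, a small change relative to the checkpoint sizes of about 433 MB.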