nobu-g committed
Commit 855548c
Parent: 8298794

Update README.md

Files changed (1): README.md (+1, -1)
README.md CHANGED
@@ -61,7 +61,7 @@ Also note that Japanese Wikipedia was duplicated 10 times to make the total size
 ## Training procedure
 
 We first segmented texts in the corpora into words using [Juman++ 2.0.0-rc3](https://github.com/ku-nlp/jumanpp/releases/tag/v2.0.0-rc3) for whole word masking.
-Then, we built a sentencepiece model with 32000 tokens including all characters that appear in the training corpus.
+Then, we built a sentencepiece model with 22,012 tokens including all characters that appear in the training corpus.
 
 We tokenized raw corpora into character-level subwords using the sentencepiece model and trained the Japanese DeBERTa model using [transformers](https://github.com/huggingface/transformers) library.
 The training took 20 days using 8 NVIDIA A100-SXM4-40GB GPUs.
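
For context, here is a minimal sketch of the word-segmentation step described in the diff, assuming the `jumanpp` binary (v2.0.0-rc3) is installed and on PATH; the function name and the simplified output parsing are ours, not the authors' pipeline.

```python
# Hedged sketch: word segmentation with Juman++ for whole word masking.
# Assumes the `jumanpp` CLI is installed; output parsing is simplified.
import subprocess

def segment_into_words(text: str) -> list[str]:
    out = subprocess.run(
        ["jumanpp"], input=text, capture_output=True, text=True, check=True
    ).stdout
    # Each analysis line starts with the surface form; lines starting
    # with "@" are alternative analyses and "EOS" ends a sentence.
    return [
        line.split(" ")[0]
        for line in out.splitlines()
        if line and not line.startswith(("@", "EOS"))
    ]
```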
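Likewise, a sketch of the sentencepiece training step: the input path and model prefix are hypothetical, and only the vocabulary size and the inclusion of all corpus characters come from the README.

```python
# Hedged sketch of building the sentencepiece model; paths are
# assumptions, vocab_size is the figure stated in the updated README.
import sentencepiece as spm

spm.SentencePieceTrainer.train(
    input="corpus.txt",       # hypothetical path to the raw corpus
    model_prefix="spm_char",  # hypothetical output prefix
    vocab_size=22012,         # vocabulary size stated in the README
    character_coverage=1.0,   # keep all characters seen in the corpus
)
```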
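Finally, a sketch of tokenizing raw text into character-level subwords with the trained model; the model path follows the hypothetical prefix above.

```python
# Hedged sketch: character-level subword tokenization with the
# trained sentencepiece model (path is hypothetical).
import sentencepiece as spm

sp = spm.SentencePieceProcessor(model_file="spm_char.model")
pieces = sp.encode("こんにちは、世界。", out_type=str)
print(pieces)  # e.g. ['▁こ', 'ん', 'に', 'ち', 'は', '、', ...]
```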