Fill-Mask
Transformers
TensorBoard
Safetensors
12 languages
bert
Inference Endpoints
w11wo commited on
Commit
e52beec
1 Parent(s): b57f4a0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -25,7 +25,7 @@ tags:
25
 
26
  # NusaBERT Base
27
 
28
- NusaBERT Base is a multilingual encoder-based language model based on the [BERT](https://arxiv.org/abs/1810.04805) architecture. We conducted continued pre-training on open-source corpora of [sabilmakbar/indo_wiki](https://huggingface.co/datasets/sabilmakbar/indo_wiki), [acul3/KoPI-NLLB](https://huggingface.co/datasets/acul3/KoPI-NLLB), and [uonlp/CulturaX](https://huggingface.co/datasets/uonlp/CulturaX). On a held-out subset of the corpus, our model achieved:
29
 
30
  - `eval_accuracy`: 0.6866
31
  - `eval_loss`: 1.4876
 
25
 
26
  # NusaBERT Base
27
 
28
+ [NusaBERT](https://arxiv.org/abs/2403.01817) Base is a multilingual encoder-based language model based on the [BERT](https://arxiv.org/abs/1810.04805) architecture. We conducted continued pre-training on open-source corpora of [sabilmakbar/indo_wiki](https://huggingface.co/datasets/sabilmakbar/indo_wiki), [acul3/KoPI-NLLB](https://huggingface.co/datasets/acul3/KoPI-NLLB), and [uonlp/CulturaX](https://huggingface.co/datasets/uonlp/CulturaX). On a held-out subset of the corpus, our model achieved:
29
 
30
  - `eval_accuracy`: 0.6866
31
  - `eval_loss`: 1.4876