recobo/agriculture-bert-uncased

@@ -11,7 +11,7 @@ widget:
 A BERT-based language model further pre-trained from the checkpoint of [SciBERT](https://huggingface.co/allenai/scibert_scivocab_uncased).
 The dataset gathered is a balance between scientific and general works in agriculture domain and encompassing knowledge from different areas of agriculture research and practical knowledge.
-The corpus contains 1.3 million paragraphs from National Agricultural Library (NAL) from the US Gov. and 4.2 million paragraphs from books and common literature from the **Agriculture Domain**.
+The corpus contains 1.2 million paragraphs from National Agricultural Library (NAL) from the US Gov. and 5.3 million paragraphs from books and common literature from the **Agriculture Domain**.
 The self-supervised learning approach of MLM was used to train the model.
 - Masked language modeling (MLM): taking a sentence, the model randomly masks 15% of the words in the input then run
@@ -23,8 +23,8 @@ The self-supervised learning approach of MLM was used to train the model.
 from transformers import pipeline
 fill_mask = pipeline(
     "fill-mask",
-    model="recobo/chemical-bert-uncased",
-    tokenizer="recobo/chemical-bert-uncased"
+    model="recobo/agriculture-bert-uncased",
+    tokenizer="recobo/agriculture-bert-uncased"
 )
-fill_mask("we create [MASK]")
+fill_mask("[MASK] is the practice of cultivating plants and livestock.")
 ```
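The MLM description in the model card (randomly masking 15% of the input words) can be illustrated with a minimal sketch. This is not the training code used for this model: the helper name `mask_tokens`, its parameters, and the whitespace tokenization are assumptions made for illustration. Standard BERT pre-training works on WordPiece tokens and replaces a selected token with `[MASK]` only 80% of the time (10% random token, 10% unchanged); the sketch below simplifies that to always using `[MASK]`.

```python
import random


def mask_tokens(tokens, mask_prob=0.15, mask_token="[MASK]", seed=None):
    """Hypothetical helper: mask each token independently with probability mask_prob.

    Returns (masked, labels), where labels[i] holds the original token at
    masked positions and None elsewhere, i.e. the targets an MLM objective
    would ask the model to predict.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(mask_token)  # hide the token from the model
            labels.append(tok)         # remember what should be predicted
        else:
            masked.append(tok)         # token left visible
            labels.append(None)        # no prediction target here
    return masked, labels


# Whitespace splitting stands in for a real tokenizer (an assumption).
tokens = "agriculture is the practice of cultivating plants and livestock .".split()
masked, labels = mask_tokens(tokens, seed=0)
print(masked)
print(labels)
```

Because each token is masked independently, the fraction actually masked in a short sentence can differ noticeably from 15%; the rate only holds on average over a large corpus.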