JeswinMS4
/

bert-base-intent

Text Classification

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

JeswinMS4 commited on Apr 14, 2023

Commit

8902020

•

1 Parent(s): 79ecd2a

Update README.md

Files changed (1) hide show

README.md +38 -1

README.md CHANGED Viewed

@@ -5,4 +5,41 @@ datasets:
 - hate_speech_offensive
 tags:
 - finance
----

 - hate_speech_offensive
 tags:
 - finance
+language:
+- en
+---
+#  BERT base model (uncased)
+Pretrained model on English language using a masked language modeling (MLM) objective. It was introduced in
+[this paper](https://arxiv.org/abs/1810.04805) and first released in
+[this repository](https://github.com/google-research/bert). This model is uncased: it does not make a difference
+between english and English.
+## Model Details
+### Model Description
+BERT is a transformers model pretrained on a large corpus of English data in a self-supervised fashion. This means it
+was pretrained on the raw texts only, with no humans labeling them in any way (which is why it can use lots of
+publicly available data) with an automatic process to generate inputs and labels from those texts. More precisely, it
+was pretrained with two objectives:
+- Masked language modeling (MLM): taking a sentence, the model randomly masks 15% of the words in the input then run
+  the entire masked sentence through the model and has to predict the masked words. This is different from traditional
+  recurrent neural networks (RNNs) that usually see the words one after the other, or from autoregressive models like
+  GPT which internally masks the future tokens. It allows the model to learn a bidirectional representation of the
+  sentence.
+- Next sentence prediction (NSP): the models concatenates two masked sentences as inputs during pretraining. Sometimes
+  they correspond to sentences that were next to each other in the original text, sometimes not. The model then has to
+  predict if the two sentences were following each other or not.
+This way, the model learns an inner representation of the English language that can then be used to extract features
+useful for downstream tasks: if you have a dataset of labeled sentences, for instance, you can train a standard
+classifier using the features produced by the BERT model as inputs.
+- **Developed by:** [Jeswin MS, Venkatesh R, Kushal S Ballari]
+- **Model type:** [Intent Classification]
+- **Language(s) (NLP):** [English]
+- **License:** []
+- **Finetuned from model [optional]:** [Bert-base-uncased]