l3cube-pune's picture
Update README.md
a1174bc
metadata
language: mr
tags:
  - bert
license: cc-by-4.0
datasets:
  - L3Cube-MahaSent-MD
widget:
  - text: I like you. </s></s> I love you.

MahaSent-GT

MahaSent-GT is a MahaBERT(l3cube-pune/marathi-bert-v2) model fine-tuned on L3Cube-MahaSent-GT Corpus, a generic tweets domain, Marathi sentiment analysis dataset.
This dataset is a part of L3Cube-MahaSent-MD, a multi-domain Marathi sentiment analysis dataset.
The MahaSent-MD dataset contains domains like movie reviews, generic tweets, subtitles, and political tweets. This model is trained specifically on the generic small tweets domain.

The recommended multi-domain version of this model covering all domains is shared here: marathi-sentiment-md
[dataset link] (https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our [paper] (https://arxiv.org/abs/2306.13888)
Citing:

@article{pingle2023l3cube,
  title={L3Cube-MahaSent-MD: A Multi-domain Marathi Sentiment Analysis Dataset and Transformer Models},
  author={Pingle, Aabha and Vyawahare, Aditya and Joshi, Isha and Tangsali, Rahul and Joshi, Raviraj},
  journal={arXiv preprint arXiv:2306.13888},
  year={2023}
}
@article{joshi2022l3cube,
  title={L3cube-mahanlp: Marathi natural language processing datasets, models, and library},
  author={Joshi, Raviraj},
  journal={arXiv preprint arXiv:2205.14728},
  year={2022}
}

Other Marathi Sentiment models from the MahaSent family are shared here:

MahaSent-MD (multi domain)
MahaSent-GT (generic tweets)
MahaSent-MR (movie reviews)
MahaSent-PT (political tweets)
MahaSent-ST (TV subtitles)
MahaSent v1 (political tweets)