File size: 1,811 Bytes
5c94f14
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
74ea6ad
 
 
 
 
 
 
5c94f14
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
 ---
language: mr
tags:
- bert
license: cc-by-4.0
datasets:
- L3Cube-MahaNews-SHC
widget:
- text: "IND vs IRE : आयर्लंडच्या दौऱ्यासाठी कसा आहे भारतीय संघ, जाणून घ्या कोणाला मिळाली संधी..."
---


## MahaNews-SHC-BERT

MahaNews-SHC-BERT is a MahaBERT(<a href="https://huggingface.co/l3cube-pune/marathi-bert-v2">l3cube-pune/marathi-bert-v2</a>) model fine-tuned on full L3Cube-MahaNews-SHC Corpus, a Marathi short text / news headlines classification dataset. <br>
It is a topic identification cum short text classification model with 12 output categories <br>
[dataset link] (https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our [paper] (coming soon)
<br>
Citing:

```
@inproceedings{mittal2023l3cube,
  title={L3Cube-MahaNews: News-Based Short Text and Long Document Classification Datasets in Marathi},
  author={Mittal, Saloni and Magdum, Vidula and Hiwarkhedkar, Sharayu and Dhekane, Omkar and Joshi, Raviraj},
  booktitle={International Conference on Speech and Language Technologies for Low-resource Languages},
  pages={52--63},
  year={2023},
  organization={Springer}
}
```

Other Marathi Sentiment models from MahaNews family are shared here:<br>

<a href="https://huggingface.co/l3cube-pune/marathi-topic-long-doc"> MahaNews-LDC-BERT (long documents) </a> <br>
<a href="https://huggingface.co/l3cube-pune/marathi-topic-short-doc"> MahaNews-SHC-BERT (short text) </a> <br>
<a href="https://huggingface.co/l3cube-pune/marathi-topic-medium-doc"> MahaNews-LPC-BERT (medium paragraphs) </a> <br>
<a href="https://huggingface.co/l3cube-pune/marathi-topic-all-doc"> MahaNews-All-BERT (all document lengths) </a> <br>