---
license: cc-by-4.0
language: mr
datasets:
- L3Cube-MahaCorpus
---

## MahaBERT-Small
MahaBERT-Small is a smaller version of the Marathi BERT model (MahaBERT), with 6 transformer layers. It is a base-BERT model trained from scratch on L3Cube-MahaCorpus and other publicly available Marathi monolingual datasets.
[dataset link](https://github.com/l3cube-pune/MarathiNLP)

More details on the dataset, models, and baseline results can be found in our [paper](https://arxiv.org/abs/2202.01159).

The best version of this model is available [here](https://huggingface.co/l3cube-pune/marathi-bert-v2).
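
A minimal usage sketch follows; it is not taken from the authors' documentation. It assumes the model is published on the Hugging Face Hub under the repository name `l3cube-pune/marathi-bert-small`, that the checkpoint includes a masked-language-modeling head, and that its tokenizer uses the standard BERT `[MASK]` token. The helper names (`mask_word`, `predict_masked`, `count_layers`) and the sample sentence are hypothetical, chosen only for illustration.

```python
# Assumption: the Hub repository name of this model card.
MODEL_ID = "l3cube-pune/marathi-bert-small"


def mask_word(sentence: str, word: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `word` in `sentence` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return sentence.replace(word, mask_token, 1)


def predict_masked(sentence: str, top_k: int = 5):
    """Return the top_k fill-mask predictions for a sentence containing [MASK].

    Requires the `transformers` package and network access to download the
    model on first use. Assumes the checkpoint ships a masked-LM head.
    """
    # Imported lazily so mask_word() works without transformers installed.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model=MODEL_ID)
    return fill_mask(sentence, top_k=top_k)


def count_layers() -> int:
    """Read the number of transformer layers from the model configuration."""
    from transformers import AutoConfig

    return AutoConfig.from_pretrained(MODEL_ID).num_hidden_layers
```

For example, `predict_masked(mask_word("मी पुण्यात राहतो", "पुण्यात"))` would return a list of dictionaries, each with `token_str` and `score` entries for a candidate replacement. If the model card's description holds, `count_layers()` should report 6.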

```
@InProceedings{joshi:2022:WILDRE6,
author = {Joshi, Raviraj},
title = {L3Cube-MahaCorpus and MahaBERT: Marathi Monolingual Corpus, Marathi BERT Language Models, and Resources},
booktitle = {Proceedings of The WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference},
month = {June},
year = {2022},
address = {Marseille, France},
publisher = {European Language Resources Association},
pages = {97--101}
}
```