en-hi-codemixed
This is a masked language model, based on the CamemBERT model architecture. en-hi-codemixed model was trained from scratch on English, Hindi, and codemixed English-Hindi corpora for 40 epochs. The corpora used consists of primarily web crawled data, including codemixed tweets, and focuses on conversational language and covid-19 pandemic.
- Downloads last month
- 27
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.