MaskedLM / README.md
SRDdev's picture
Update README.md
7ab934c
|
raw
history blame
915 Bytes
metadata
license: afl-3.0
datasets:
  - WillHeld/hinglish_top
language:
  - en
  - hi
metrics:
  - accuracy
library_name: transformers
pipeline_tag: fill-mask

SRDberta

This is a BERT model trained for Masked Language Modeling for Higlish Data.

Hinglish is a term used to describe the hybrid language spoken in India, which combines elements of Hindi and English. It is commonly used in informal conversations and in media such as Bollywood films

Inference

from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline

tokenizer = AutoTokenizer.from_pretrained("SRDdev/SRDBerta")

model = AutoModelForMaskedLM.from_pretrained("SRDdev/SRDBerta")

fill = pipeline('fill-mask', model='SRDberta', tokenizer='SRDberta')
fill_mask = fill.tokenizer.mask_token
fill(f'Aap {fill_mask} ho?')

Citation

Author: @SRDdev

framework : Pytorch
Year: Jan 2023