Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Telugu Question-Answering model trained on Tydiqa dataset from Google

How to use

Use the below script from your python terminal as the web interface for inference has few encoding issues for Telugu

from transformers.pipelines import pipeline, AutoModelForQuestionAnswering, AutoTokenizer
model = AutoModelForQuestionAnswering.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained("kuppuluri/telugu_bertu_tydiqa",
                                          clean_text=False,
                                          handle_chinese_chars=False,
                                          strip_accents=False,
                                          wordpieces_prefix='##')
nlp = pipeline('question-answering', model=model, tokenizer=tokenizer)
result = nlp({'question': question, 'context': context})

Training data

I used Tydiqa Telugu data from Google https://github.com/google-research-datasets/tydiqa

PS: If you find my model useful, I would appreciate a note from you as it would encourage me to continue improving it and also add new models.

Downloads last month
12
Safetensors
Model size
110M params
Tensor type
I64
F32
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Space using kuppuluri/telugu_bertu_tydiqa 1