urt5-base-finetuned / README.md
mbshr's picture
Update README.md
458ed99
---
datasets:
- mbshr/XSUMUrdu-DW_BBC
language:
- ur
metrics:
- rouge
- bertscore
pipeline_tag: summarization
---
### Summarization Model (Type:T5)
Summarization: Extractive and Abstractive
- urT5 adapted from mT5 having monolingual vocabulary only; 40k tokens of Urdu.
- Fine-tuned on https://huggingface.co/mbshr/XSUMUrdu-DW_BBC, ref to https://doi.org/10.48550/arXiv.2310.02790 for details.
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Model type:** urT5 adapted version of mT5
- **Language(s) (NLP):** Urdu
- **Finetuned from model:** google/mt5-base
### Model Sources
<!-- Provide the basic links for the model. -->
- **Repository:** [More Information Needed]
- **Paper:** https://doi.org/10.48550/arXiv.2310.02790
## Uses
Summarization
## Evaluation & Results
<!-- This section describes the evaluation protocols and provides the results. -->
Evaluated on https://huggingface.co/mbshr/XSUMUrdu-DW_BBC
- ROUGE-1 F Score: 40.03 combined, 46.35 BBC Urdu datapoints only and 36.91 DW Urdu datapoints only)
- BERTScore: 75.1 combined, 77.0 BBC Urdu datapoints only and 74.16 DW Urdu datapoints only
## Citation [optional]
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
https://doi.org/10.48550/arXiv.2310.02790
## Contact
- mubashir.munaaf@gmail.com