julien-c HF staff committed on
Commit
ec83de8
1 Parent(s): 0d3d677

Migrate model card from transformers-repo

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/urduhack/roberta-urdu-small/README.md

Files changed (1)
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
+ ---
+ language: ur
+ thumbnail: https://raw.githubusercontent.com/urduhack/urduhack/master/docs/_static/urduhack.png
+ tags:
+ - roberta-urdu-small
+ - urdu
+ - transformers
+ license: mit
+ ---
+ ## roberta-urdu-small
+
+ [![License: MIT](https://img.shields.io/badge/license-MIT-blue.svg)](https://github.com/urduhack/urduhack/blob/master/LICENSE)
+ ### Overview
+ **Language model:** roberta-urdu-small
+ **Model size:** 125M
+ **Language:** Urdu
+ **Training data:** News data from Urdu news sources in Pakistan
+ ### About roberta-urdu-small
+ roberta-urdu-small is a language model for the Urdu language.
+ ```
+ from transformers import pipeline
+ fill_mask = pipeline("fill-mask", model="urduhack/roberta-urdu-small", tokenizer="urduhack/roberta-urdu-small")
+ ```
+ ## Training procedure
+ roberta-urdu-small was trained on an Urdu news corpus. The training data was normalized using the normalization module from
+ Urduhack to eliminate characters from other languages, such as Arabic.
+
+ ### About Urduhack
+ Urduhack is a Natural Language Processing (NLP) library for the Urdu language.
+ GitHub: https://github.com/urduhack/urduhack
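
The README added in this commit builds the `fill_mask` pipeline but stops before calling it. The sketch below is not part of the commit; it shows one way the pipeline might be queried and its output summarized. The helper names `top_predictions` and `run_demo` and the example sentence are hypothetical. The `score` and `token_str` keys are the ones the transformers fill-mask pipeline returns for each candidate. Calling `run_demo()` downloads the model from the Hub, so it needs network access.

```python
from typing import Dict, List, Tuple


def top_predictions(results: List[Dict], k: int = 3) -> List[Tuple[str, float]]:
    """Return the k highest-scoring (token_str, score) pairs from fill-mask output."""
    ranked = sorted(results, key=lambda r: r["score"], reverse=True)
    return [(r["token_str"].strip(), round(float(r["score"]), 4)) for r in ranked[:k]]


def run_demo() -> None:
    """Query the model once (hypothetical helper; downloads the model on first use)."""
    # Imported here so top_predictions can be used without transformers installed.
    from transformers import pipeline

    fill_mask = pipeline(
        "fill-mask",
        model="urduhack/roberta-urdu-small",
        tokenizer="urduhack/roberta-urdu-small",
    )
    # Read the mask token from the tokenizer instead of hard-coding it.
    mask = fill_mask.tokenizer.mask_token
    # Illustrative sentence (an assumption): "The capital of Pakistan is <mask>."
    sentence = f"پاکستان کا دارالحکومت {mask} ہے۔"
    for token, score in top_predictions(fill_mask(sentence)):
        print(token, score)
```

To try it, call `run_demo()` from a Python session with `transformers` and a backend such as PyTorch installed. No expected output is shown here because the predictions depend on the downloaded weights.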