iceman2434
/

xlm-roberta-base-ft-udpos213-top7lang

Token Classification

Model card Files Files and versions Community

xlm-roberta-base-ft-udpos213-top7lang / README.md

iceman2434's picture

Update README.md

5a76ef4 verified 21 days ago

|

history blame contribute delete

No virus

716 Bytes

	---
	datasets:
	- universal_dependencies
	language:
	- tl
	metrics:
	- f1
	pipeline_tag: token-classification
	---

	## Model Specification
	- Model: XLM-RoBERTa (base-sized model)
	- Training Data:
	- Combined Afrikaans, Hebrew, Bulgarian, Vietnamese, Norwegian, Urdu, & Czech corpora (Top 7 Languages)
	- Training Details:
	- Base configurations with learning rate 5e-5
	## Evaluation
	- Evaluation Dataset: Universal Dependencies Tagalog Ugnayan (Testing Set)
	- Tested in a zero-shot cross-lingual scenario on a Universal Dependencies Tagalog Ugnayan testing dataset (with 78.81\% Accuracy)
	## POS Tags
	- ADJ – ADP – ADV – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB