--- datasets: - universal_dependencies language: - tl metrics: - f1 pipeline_tag: token-classification --- ## Model Specification - Model: XLM-RoBERTa (base-sized model) - Training Data: - Combined Afrikaans, Hebrew, Bulgarian, Vietnamese, Norwegian, Urdu, Czech, Persian, Faroese, & English corpora (Top 10 Languages) - Training Details: - Base configurations with a minor adjustment in learning rate (4.5e-5) ## Evaluation - Evaluation Dataset: Universal Dependencies Tagalog Ugnayan (Testing Set) - Tested in a zero-shot cross-lingual scenario on a Universal Dependencies Tagalog Ugnayan testing dataset (with 77.90\% Accuracy) ## POS Tags - ADJ – ADP – ADV – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB