Edit model card

roberta-classical-chinese-large-upos

Model Description

This is a RoBERTa model pre-trained on Classical Chinese texts for POS-tagging and dependency-parsing, derived from roberta-classical-chinese-large-char. Every word is tagged by UPOS (Universal Part-Of-Speech) and FEATS.

How to Use

from transformers import AutoTokenizer,AutoModelForTokenClassification
tokenizer=AutoTokenizer.from_pretrained("KoichiYasuoka/roberta-classical-chinese-large-upos")
model=AutoModelForTokenClassification.from_pretrained("KoichiYasuoka/roberta-classical-chinese-large-upos")

or

import esupar
nlp=esupar.load("KoichiYasuoka/roberta-classical-chinese-large-upos")

Reference

Koichi Yasuoka: Universal Dependencies Treebank of the Four Books in Classical Chinese, DADH2019: 10th International Conference of Digital Archives and Digital Humanities (December 2019), pp.20-28.

See Also

esupar: Tokenizer POS-tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models

Downloads last month
13
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train KoichiYasuoka/roberta-classical-chinese-large-upos