xx_fro_sigtyp_trf / README.md
ljvmiranda921's picture
Update README.md
ac6814f verified
---
tags:
- spacy
- token-classification
language:
- multilingual
model-index:
- name: xx_fro_sigtyp_trf
results:
- task:
name: TAG
type: token-classification
metrics:
- name: TAG (XPOS) Accuracy
type: accuracy
value: 0.8910235177
- task:
name: POS
type: token-classification
metrics:
- name: POS (UPOS) Accuracy
type: accuracy
value: 0.890459364
- task:
name: MORPH
type: token-classification
metrics:
- name: Morph (UFeats) Accuracy
type: accuracy
value: 0.9118816254
- task:
name: LEMMA
type: token-classification
metrics:
- name: Lemma Accuracy
type: accuracy
value: 0.8443364981
- task:
name: UNLABELED_DEPENDENCIES
type: token-classification
metrics:
- name: Unlabeled Attachment Score (UAS)
type: f_score
value: 0.7518266566
- task:
name: LABELED_DEPENDENCIES
type: token-classification
metrics:
- name: Labeled Attachment Score (LAS)
type: f_score
value: 0.6812799194
- task:
name: SENTS
type: token-classification
metrics:
- name: Sentences F-Score
type: f_score
value: 0.9002493766
---
| Feature | Description |
| --- | --- |
| **Name** | `xx_fro_sigtyp_trf` |
| **Version** | `0.1.0` |
| **spaCy** | `>=3.6.1,<3.7.0` |
| **Default Pipeline** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` |
| **Components** | `transformer`, `parser`, `trainable_lemmatizer`, `tagger`, `morphologizer` |
| **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
| **Sources** | n/a |
| **License** | n/a |
| **Author** | [n/a]() |
### Label Scheme
<details>
<summary>View label scheme (190 labels for 3 components)</summary>
| Component | Labels |
| --- | --- |
| **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `case:det`, `cc`, `cc:nc`, `ccomp`, `conj`, `cop`, `csubj`, `dep`, `det`, `expl`, `flat`, `iobj`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `obl:advmod`, `parataxis`, `punct`, `vocative`, `xcomp` |
| **`tagger`** | `ADJcar__NumType=Card`, `ADJind__PronType=Ind`, `ADJord`, `ADJpos__Poss=Yes`, `ADJqua`, `ADJqua__Tense=Past\|VerbForm=Part`, `ADJqua__Tense=Pres\|VerbForm=Part`, `ADVgen`, `ADVgen.PROper`, `ADVgen__PronType=Ind`, `ADVgen__PronType=Prs,Rel`, `ADVgen__PronType=Rel`, `ADVint__PronType=Int`, `ADVneg`, `ADVneg.PROper__Polarity=Neg\|PronType=Prs`, `ADVneg__Polarity=Neg`, `ADVsub`, `CONcoo`, `CONcoo__PronType=Prs,Rel`, `CONsub`, `CONsub.PROper`, `CONsub__PronType=Rel`, `DETcar__NumType=Card`, `DETdef__Definite=Def`, `DETdef__Definite=Def\|PronType=Art`, `DETdef__PronType=Ind`, `DETdef__PronType=Prs`, `DETdem__PronType=Dem`, `DETdem__PronType=Prs`, `DETind__PronType=Ind`, `DETint__PronType=Int`, `DETndf__Definite=Ind`, `DETndf__Definite=Ind\|PronType=Art`, `DETord`, `DETord__NumType=Ord`, `DETpos__Poss=Yes`, `DETrel__PronType=Rel`, `INJ`, `NOMcom`, `NOMcom__Morph=VFin`, `NOMcom__VerbForm=Inf`, `NOMpro`, `PONfbl`, `PONfrt`, `PONpdr`, `PONpga`, `PONpxx`, `PRE`, `PRE.DETdef__Definite=Def\|PronType=Art`, `PRE.PROper`, `PRE.PROper__Definite=Def\|PronType=Art`, `PRE__Morph=VFin`, `PRE__PronType=Dem`, `PREdetdef__PronType=Prs,Rel`, `PROadv`, `PROadv__PronType=Dem`, `PROcar`, `PROcar__NumType=Card`, `PROdem__PronType=Dem`, `PROdem__PronType=Prs,Rel`, `PROimp`, `PROimp__PronType=Prs`, `PROind`, `PROind__PronType=Ind`, `PROind__PronType=Rel`, `PROint__PronType=Int`, `PROord__NumType=Ord`, `PROper`, `PROper.PROper__PronType=Prs`, `PROper__Poss=Yes`, `PROper__PronType=Prs`, `PROpos__Poss=Yes`, `PROpos__Poss=Yes\|PronType=Prs`, `PROrel`, `PROrel__PronType=Prs,Rel`, `PROrel__PronType=Rel`, `RED`, `VERcjg`, `VERcjg__VerbForm=Fin`, `VERcjg__VerbForm=Inf`, `VERinf__VerbForm=Inf`, `VERppa__Tense=Pres\|VerbForm=Part`, `VERppe`, `VERppe__Tense=Past`, `VERppe__Tense=Past\|VerbForm=Part`, `devenir__Tense=Past\|VerbForm=Part`, `devenir__VerbForm=Fin`, `laisser__VerbForm=Fin`, `remanoir__VerbForm=Fin`, `ressembler__VerbForm=Fin`, `sembler__VerbForm=Fin` |
| **`morphologizer`** | `POS=ADV`, `POS=PRON\|PronType=Prs`, `POS=ADV\|PronType=Dem`, `POS=VERB\|VerbForm=Fin`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=PUNCT`, `POS=CCONJ`, `Definite=Def\|POS=DET\|PronType=Art`, `POS=NOUN`, `POS=DET\|PronType=Ind`, `POS=SCONJ`, `Definite=Def\|POS=ADP\|PronType=Art`, `NumType=Card\|POS=PRON`, `POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `POS=DET\|PronType=Rel`, `POS=PRON\|PronType=Prs,Rel`, `POS=ADP`, `POS=ADJ`, `POS=PROPN`, `POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Ind`, `POS=ADV\|Polarity=Neg`, `NumType=Card\|POS=NUM`, `POS=AUX\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `POS=ADV\|PronType=Ind`, `POS=ADJ\|PronType=Ind`, `POS=DET\|PronType=Dem`, `POS=INTJ`, `POS=ADJ\|Poss=Yes`, `POS=ADV\|PronType=Int`, `POS=PRON`, `NumType=Ord\|POS=PRON`, `POS=VERB`, `POS=ADJ\|Tense=Past\|VerbForm=Part`, `POS=PRON\|PronType=Int`, `POS=SCONJ\|PronType=Prs,Rel`, `POS=PRON\|Polarity=Neg\|PronType=Prs`, `POS=SCONJ\|PronType=Rel`, `POS=PRON\|Poss=Yes\|PronType=Prs`, `NumType=Card\|POS=DET`, `POS=NUM`, `POS=DET\|PronType=Prs`, `NumType=Card\|POS=ADJ`, `NumType=Ord\|POS=DET`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=CCONJ\|PronType=Prs,Rel`, `Morph=VFin\|POS=ADP`, `POS=DET\|PronType=Int`, `POS=ADJ\|Tense=Pres\|VerbForm=Part`, `Morph=VFin\|POS=NOUN`, `POS=PRON\|Poss=Yes`, `POS=AUX`, `POS=ADV\|PronType=Rel`, `POS=PRON\|PronType=Rel`, `POS=SCONJ\|PronType=Prs`, `POS=ADP\|PronType=Prs,Rel`, `POS=NOUN\|VerbForm=Inf`, `Definite=Def\|POS=DET`, `POS=VERB\|Tense=Past`, `Definite=Ind\|POS=DET`, `POS=ADP\|PronType=Dem`, `POS=ADV\|PronType=Prs,Rel` |
</details>
### Accuracy
| Type | Score |
| --- | --- |
| `DEP_UAS` | 75.18 |
| `DEP_LAS` | 68.13 |
| `SENTS_P` | 87.41 |
| `SENTS_R` | 92.80 |
| `SENTS_F` | 90.02 |
| `LEMMA_ACC` | 84.43 |
| `TAG_ACC` | 89.10 |
| `POS_ACC` | 89.05 |
| `MORPH_ACC` | 91.19 |
| `TRANSFORMER_LOSS` | 130913.68 |
| `PARSER_LOSS` | 16324.89 |
| `TRAINABLE_LEMMATIZER_LOSS` | 904.27 |
| `TAGGER_LOSS` | 4331.12 |
| `MORPHOLOGIZER_LOSS` | 4719.16 |
### Citation
If you're using this model, please cite:
```
@inproceedings{miranda-2024-allen,
title = "{A}llen Institute for {AI} @ {SIGTYP} 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages",
author = "Miranda, Lester James",
booktitle = "Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP",
month = mar,
year = "2024",
address = "St. Julian's, Malta",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.sigtyp-1.18",
pages = "151--159",
}
```