Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Middle Assyrian model for BabyLemmatizer

Total data set size ca. 300k words (including lacunae). Consists of all Oracc texts labeled as Middle Assyrian. The model is augmented with the Neo-Assyrian data to improve the performance.

Evaluation results

Neural Net Evaluation
COMPONENT       AVG     CI       MODEL0
POS-tagger      96.76   ±0.00    96.76
Lemmatizer      94.43   ±0.00    94.43
Combined        92.77   ±0.00    92.77
POS-tagger OOV  89.26   ±0.00    89.26
Lemmatizer OOV  68.13   ±0.00    68.13
Combined   OOV  66.40   ±0.00    66.40
-----------------------------------------------
OOV input rate  10.21            10.21

Post-correct Evaluation
COMPONENT       AVG     CI       MODEL0
POS-tagger      96.76   ±0.00    96.76
Lemmatizer      94.46   ±0.00    94.46
Combined        92.79   ±0.00    92.79
POS-tagger OOV  89.26   ±0.00    89.26
Lemmatizer OOV  68.13   ±0.00    68.13
Combined   OOV  66.40   ±0.00    66.40
-----------------------------------------------
OOV input rate  10.21            10.21
Downloads last month
0
Unable to determine this model's library. Check the docs .