Aleksi Sahala commited on
Commit
17882b8
1 Parent(s): c573a22

update model

Browse files
Files changed (2) hide show
  1. README.md +30 -3
  2. neo-assyrian.tar.gz +3 -0
README.md CHANGED
@@ -1,3 +1,30 @@
1
- ---
2
- license: cc-by-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Neo-Assyrian model for [BabyLemmatizer](https://github.com/asahala/BabyLemmatizer)
2
+ Total data set size ca. 330k words (including lacunae). Consists of all Oracc texts labeled as Neo-Assyrian. Based on Oracc.
3
+
4
+ ## Evaluation results
5
+
6
+ ```
7
+ Neural Net Evaluation
8
+ COMPONENT AVG CI MODEL0
9
+ POS-tagger 97.49 ±0.00 97.49
10
+ Lemmatizer 95.38 ±0.00 95.38
11
+ Combined 94.28 ±0.00 94.28
12
+ POS-tagger OOV 90.45 ±0.00 90.45
13
+ Lemmatizer OOV 71.21 ±0.00 71.21
14
+ Combined OOV 69.64 ±0.00 69.64
15
+ -----------------------------------------------
16
+ OOV input rate 9.51 9.51
17
+
18
+
19
+
20
+ Post-correct Evaluation
21
+ COMPONENT AVG CI MODEL0
22
+ POS-tagger 97.49 ±0.00 97.49
23
+ Lemmatizer 95.44 ±0.00 95.44
24
+ Combined 94.34 ±0.00 94.34
25
+ POS-tagger OOV 90.45 ±0.00 90.45
26
+ Lemmatizer OOV 71.21 ±0.00 71.21
27
+ Combined OOV 69.64 ±0.00 69.64
28
+ -----------------------------------------------
29
+ OOV input rate 9.51 9.51
30
+ ```
neo-assyrian.tar.gz ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:292b7a5f97394fae753e1cf9952bca4abe892dc338e4194915e90bfc43cc7e05
3
+ size 219826812