Token Classification
spaCy
Tagalog
Eval Results
ljvmiranda921 commited on
Commit
fc62422
·
verified ·
1 Parent(s): 92d3cd8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -58,7 +58,15 @@ model-index:
58
  type: f_score
59
  value: 0.9945634794
60
  ---
61
- Parsers for UD-NewsCrawl
 
 
 
 
 
 
 
 
62
 
63
  | Feature | Description |
64
  | --- | --- |
@@ -68,7 +76,7 @@ Parsers for UD-NewsCrawl
68
  | **Default Pipeline** | `tok2vec`, `trainable_lemmatizer`, `morphologizer`, `tagger`, `parser` |
69
  | **Components** | `tok2vec`, `trainable_lemmatizer`, `morphologizer`, `tagger`, `parser` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
- | **Sources** | [UD_Tagalog-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl) (Angelina A. Aquino and Lester James V. Mirand and Elsie Marie T. Or) |
72
  | **License** | `MIT` |
73
  | **Author** | [Lester James V. Miranda](https://github.com/ljvmiranda921/UD_Tagalog-NewsCrawl-Dev) |
74
 
 
58
  type: f_score
59
  value: 0.9945634794
60
  ---
61
+
62
+ <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
63
+
64
+ # UD Parser (spaCy baseline)
65
+
66
+ This is the spaCy baseline trained on UD-NewsCrawl.
67
+ It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
68
+ The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
69
+ More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
70
 
71
  | Feature | Description |
72
  | --- | --- |
 
76
  | **Default Pipeline** | `tok2vec`, `trainable_lemmatizer`, `morphologizer`, `tagger`, `parser` |
77
  | **Components** | `tok2vec`, `trainable_lemmatizer`, `morphologizer`, `tagger`, `parser` |
78
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
79
+ | **Sources** | [UD_Tagalog-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl) (Angelina A. Aquino and Lester James V. Miranda and Elsie Marie T. Or) |
80
  | **License** | `MIT` |
81
  | **Author** | [Lester James V. Miranda](https://github.com/ljvmiranda921/UD_Tagalog-NewsCrawl-Dev) |
82