syke9p3's picture
Update README.md
e324b85
|
raw
history blame
7.9 kB
---
language:
- tl
library_name: transformers
pipeline_tag: token-classification
tags:
- language
- grammar
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
This is a BERT Tagalog Base Uncased Part of Speech tagger fine-tuned model of [Jiang et. al.'s pre-trained bert-tagalog-base-uncased model](https://huggingface.co/GKLMIP/bert-tagalog-base-uncased).
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** syke9p3, mnemoria, xenoxia, riakm
- **Shared by:** syke9p3
- **Model type:** BERT Tagalog Base Uncased
- **Languages (NLP):** Tagalog, Filipino
- **Dataset:** Sagum et. al.'s annotated Tagalog Corpora based on MGNN Tagset convention. This model was trained in 800 sentences and evaluated with 200 sentences.
- **Finetuned from model**: [Jiang et. al.'s pre-trained bert-tagalog-base-uncased model](https://huggingface.co/GKLMIP/bert-tagalog-base-uncased)
### Tags
| Part of Speech | Tags |
|-----------------------------------------------|------|
| **Noun** | NNC |
| Common Noun | NNC |
| Proper Noun | NNP |
| Proper Noun Abbreviation | NNPA |
| Common Noun Abbreviation | NNCA |
| **Pronoun** | PR |
| as Subject (Palagyo)/Personal Pronouns Singular | PRS |
| Personal Pronouns | PRP |
| Possessive Subject (Paari) | PRSP |
| Pointing to an Object Demonstrative/(Paturol/Pamatlig) | PRO |
| Question/Interrogative (Pananong)/Singular | PRQ |
| Question/Interrogative Plural | PRQP |
| Location (Panlunan) | PRL |
| Comparison (Panulad) | PRC |
| Found (Pahimaton) | PRF |
| Indefinite | PRI |
| **Determiner** | DT |
| Determiner (Pantukoy) for Common Noun Plural | DTC |
| Determiner (Pantukoy) for Proper Noun | DTP |
| Determiner (Pantukoy) for Proper Noun Plural | DTPP |
| Lexical Marker | LM |
| Conjunctions (Pang-ugnay) | CC, CCT, CCR, CCB, CCA |
| Ligatures (Pang-angkop) | CCP |
| Preposition (Pang-ukol) | CCU |
| **Verb (Pandiwa)** | VB |
| Neutral/Infinitive | VBW |
| Auxiliary, Modal/Pseudo-verbs | VBS |
| Existential | VBH |
| Non-existential | VBN |
| Time Past (Perfective) | VBTS |
| Time Present (Imperfective) | VBTR |
| Time Future (Contemplative) | VBTF |
| Recent past | VBTP |
| Actor Focus | VBAF |
| Object/Goal Focus | VBOF |
| Benefactive Focus | VBOB |
| Locative Focus | VBOL |
| Instrumental Focus | VBOI |
| Referential/Measurement Focus | VBRF |
| **Adjective (Pang-uri)** | JJ |
| Describing (Panlarawan) | JJD |
| Used for Comparison (same level) (Pahambing Magkatulad) | JJC |
| Comparison Comparative (more) (Palamang) | JJCC |
| Comparison Superlative (most) (Pasukdol) | JJCS |
| Comparison Negation (not quite) (Di-Magkatulad) | JJCN |
| Describing Number (Pamilang) | JJN |
| **Adverb (Pang-Abay)** | RB |
| Describing “How” (Pamaraan) | RBD |
| Number (Panggaano/Panukat) | RBN |
| Conditional (Kondisyunal) | RBK |
| Causative (Pananhi) | RBP |
| Benefactive (Benepaktibo) | RBB |
| Referential (Pangkaukulan) | RBR |
| Question (Pananong) | RBQ |
| Agree (Panang-ayon) | RBT |
| Disagree (Pananggi) | RBF |
| Frequency (Pamanahon) | RBW |
| Possibility (Pang-agam) | RBM |
| Place (Panlunan) | RBL |
| Enclitics (Paningit) | RBI |
| Interjections (Sambitla) | RBJ |
| Social Formula (Pormularyong Panlipunan) | RBS |
|**Cardinal Number (Bilang)** | CD |
| Digit, Rank, Count | CDB |
| **Topicless (Walang Paksa)** | TS |
| Foreign Words | FW |
| **Punctuation (Pananda)** | PM |
| Period | PMP |
| Exclamation Point | PME |
| Question Mark | PMQ |
| Comma | PMC |
| Semi-colon | PMSC |
| **Symbols** | PMS |
| Compound Tags | \<tag1>\<tag2>...\<tagN> | | | |
|------------|---------------|----------|------------|------------|
| CCB_CCP | JJCS_VBRF_CCP | PRI_CCT | RBL_JJD | VBS_CCP |
| CCR_CCA | JJCS_VBTR | PRI_LM | RBL_JJD_CCP| VBTF_CCP |
| CCR_CCB | JJCS_VBTR_VBOF| PRL_CCP | RBL_NNC | VBTF_JJD |
| CCR_CCP | JJCS_VBTR_VBRF| PRL_LM | RBL_NNP | VBTF_VBAF |
| CCR_LM | JJCS_VBTS | PRO_CCB | RBL_NNPA | VBTF_VBOB |
| CCT_CCA | JJCS_VBW | PRO_CCP | RBL_NNP_NNP| VBTF_VBOF |
| CCT_CCP | JJC_CCB | PRO_LM | RBL_PRL | VBTF_VBOF_CCP |
| CCT_LM | JJC_CCP | PRP_CCB | RBM_CCP | VBTR_CCP |
| CCU_DTP | JJC_JJD | PRP_CCP | RBM_LM | VBTR_VBAF |
| CDB_CCA | JJC_PRL | PRP_LM | RBN_CCP | VBTR_VBAF_CCP |
| CDB_CCP | JJD_CCA | PRQ_CCP | RBP_CCP | VBTR_VBOB |
| CDB_LM | JJD_CCB | PRQ_LM | RBQ_CCB | VBTR_VBOF |
| CDB_NNC | JJD_CCP | PRSP_CCP | RBQ_CCP | VBTR_VBOF_CCP |
| CDB_NNC_CCP| JJD_CCT | PRS_CCB | RBQ_LM | VBTR_VBRF |
| JJCC_CCP | JJD_NNC | PRS_CCP | RBR_DTP | VBTR_VBRF_CCP |
| JJCC_JJD | JJD_NNP | PRS_LM | RBS_CCP | VBTS_CCA |
| JJCN_CCP | JJN_CCA | RBD_CCB | RBT_CCB | VBTS_CCP |
| JJCN_LM | JJN_CCB | RBD_CCP | RBT_CCP | VBTS_JJD |
| JJCS_CCB | JJN_CCP | RBD_LM | RBT_CCT | VBTS_LM |
| JJCS_CCP | JJN_NNC | RBF_CCP | RBT_LM | VBTS_VBAF |
| JJCS_JJC | JJN_NNC_CCP | RBF_JJD | RBW_CCA | VBTS_VBOB |
| JJCS_JJC_CCP| JJN_NNP | RBF_JJD_CCP | RBW_CCB | VBTS_VBOF |
| JJCS_JJD | NNC_CCA | RBF_LM | RBW_CCP | VBTS_VBOF_CCP |
| JJCS_JJD_CCB| NNC_CCB | RBF_RBW | RBW_DTP | VBTS_VBOL |
| JJCS_JJD_CCP| NNC_CCP | RBF_VBTR | RBW_LM | VBTS_VBRF |
| JJCS_JJD_NNC| NNC_LM | RBF_VBW_CCP| RBW_RBI | VBW_CCB |
| JJCS_JJN | NNC_PMC | RBI_CCA | VBAF_CCP | VBW_CCP |
| JJCS_JJN_CCP| NNP_CCA | RBI_CCP | VBH_CCB | VBW_CDB |
| JJCS_RBF | NNP_CCP | RBI_LM | VBH_CCP | VBW_LM |
| JJCS_VBAF | PRC_CCB | RBJ_CCP | VBN_CCP | |
| JJCS_VBAF_CCP| PRC_CCP | RBK_LM | VBOB_CCP | |
| JJCS_VBN_CCP| PRI_CCB | RBL_CCP
## Bias, Risks, and Limitations
This model has not been fully tested so please use with caution.