akhtet commited on
Commit
956faec
1 Parent(s): 8f6c4ec

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -39
README.md CHANGED
@@ -51,7 +51,7 @@ Thus it is useful for Natural Language Inference and related tasks such as Zero-
51
 
52
  ## Bias, Risks, and Limitations
53
 
54
- Please consult the papers for original foundation model DeBERTaV3 [https://arxiv.org/abs/2111.09543].
55
  <!-- Any limitations with myXNLI ? -->
56
 
57
  ## How to Get Started with the Model
@@ -62,53 +62,24 @@ Use the code below to get started with the model.
62
 
63
  ## Training Details
64
 
65
- The model is fine-tuned on myXNLI dataset https://huggingface.co/datasets/akhtet/myXNLI
66
 
67
  From this dataset, 4 different copies training data from myXNLI were concatenated, each with sentence pairs in en-en, en-my, my-en and my-my combinations.
68
 
69
  Training on cross-matched language data as above improved the NLI accuracy over training separately in each language.
70
- This was inspired by the approach from another model [https://huggingface.co/joeddav/xlm-roberta-large-xnli]
71
 
72
- ### Training Procedure
73
-
74
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
75
-
76
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
77
 
78
  ## Evaluation
79
 
80
- <!-- This section describes the evaluation protocols and provides the results. -->
81
-
82
- ### Testing Data, Factors & Metrics
83
-
84
- #### Testing Data
85
-
86
- <!-- This should link to a Data Card if possible. -->
87
-
88
- [More Information Needed]
89
-
90
- #### Factors
91
-
92
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
93
-
94
- [More Information Needed]
95
-
96
- #### Metrics
97
-
98
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
99
-
100
- [More Information Needed]
101
-
102
- ### Results
103
-
104
- [More Information Needed]
105
-
106
- #### Summary
107
 
108
- https://www.bbc.com/burmese/articles/cmj15enzk2go
109
- https://my.wikipedia.org/wiki/%E1%80%A1%E1%80%B1%E1%80%AC%E1%80%84%E1%80%BA%E1%80%9C%E1%80%A1%E1%80%94%E1%80%BA%E1%80%86%E1%80%94%E1%80%BA%E1%80%B8
 
110
 
111
- ## Citation [optional]
112
 
113
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
114
 
@@ -118,4 +89,4 @@ https://my.wikipedia.org/wiki/%E1%80%A1%E1%80%B1%E1%80%AC%E1%80%84%E1%80%BA%E1%8
118
 
119
  ## Model Card Contact
120
 
121
- [More Information Needed]
 
51
 
52
  ## Bias, Risks, and Limitations
53
 
54
+ Please refer to the papers for original foundation model: DeBERTa [https://arxiv.org/abs/2006.03654] and DeBERTaV3 [https://arxiv.org/abs/2111.09543].
55
  <!-- Any limitations with myXNLI ? -->
56
 
57
  ## How to Get Started with the Model
 
62
 
63
  ## Training Details
64
 
65
+ The model is fine-tuned on myXNLI dataset [https://huggingface.co/datasets/akhtet/myXNLI]. The English portion of myXNLI is from XNLI dataset.
66
 
67
  From this dataset, 4 different copies training data from myXNLI were concatenated, each with sentence pairs in en-en, en-my, my-en and my-my combinations.
68
 
69
  Training on cross-matched language data as above improved the NLI accuracy over training separately in each language.
70
+ This approach was inspired by another model [https://huggingface.co/joeddav/xlm-roberta-large-xnli]
71
 
72
+ The model was fine-tuned using this combined dataset for a single epoch.
 
 
 
 
73
 
74
  ## Evaluation
75
 
76
+ This model has been evaluted on myXNLI testset for Myanmar accuracy. We also provide the accuracy for English using XNLI testset.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
77
 
78
+ | Model | Myanmar accuracy | English accuracy |
79
+ | ----- | ----- | ----- |
80
+ | mDeBERTa-v3-base-myXNLI | 88.02 | 80.99 |
81
 
82
+ ## Citation
83
 
84
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
85
 
 
89
 
90
  ## Model Card Contact
91
 
92
+ Aung Kyaw Htet