model update
README.md
CHANGED
@@ -46,23 +46,38 @@ model-index:
  - name: MoverScore (Question Generation)
    type: moverscore_question_generation
    value: 62.49
- - name:
-   type:
+ - name: BLEU4 (Question & Answer Generation (with Gold Answer))
+   type: bleu4_question_answer_generation_with_gold_answer
+   value: 18.61
+ - name: ROUGE-L (Question & Answer Generation (with Gold Answer))
+   type: rouge_l_question_answer_generation_with_gold_answer
+   value: 51.04
+ - name: METEOR (Question & Answer Generation (with Gold Answer))
+   type: meteor_question_answer_generation_with_gold_answer
+   value: 43.1
+ - name: BERTScore (Question & Answer Generation (with Gold Answer))
+   type: bertscore_question_answer_generation_with_gold_answer
+   value: 90.03
+ - name: MoverScore (Question & Answer Generation (with Gold Answer))
+   type: moverscore_question_answer_generation_with_gold_answer
+   value: 67.82
+ - name: QAAlignedF1Score-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_f1_score_bertscore_question_answer_generation_with_gold_answer_gold_answer
    value: 90.17
- - name: QAAlignedRecall-BERTScore (Question & Answer Generation) [Gold Answer]
-   type:
+ - name: QAAlignedRecall-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_recall_bertscore_question_answer_generation_with_gold_answer_gold_answer
    value: 90.16
- - name: QAAlignedPrecision-BERTScore (Question & Answer Generation) [Gold Answer]
-   type:
+ - name: QAAlignedPrecision-BERTScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_precision_bertscore_question_answer_generation_with_gold_answer_gold_answer
    value: 90.17
- - name: QAAlignedF1Score-MoverScore (Question & Answer Generation) [Gold Answer]
-   type:
+ - name: QAAlignedF1Score-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_f1_score_moverscore_question_answer_generation_with_gold_answer_gold_answer
    value: 68.22
- - name: QAAlignedRecall-MoverScore (Question & Answer Generation) [Gold Answer]
-   type:
+ - name: QAAlignedRecall-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_recall_moverscore_question_answer_generation_with_gold_answer_gold_answer
    value: 68.21
- - name: QAAlignedPrecision-MoverScore (Question & Answer Generation) [Gold Answer]
-   type:
+ - name: QAAlignedPrecision-MoverScore (Question & Answer Generation (with Gold Answer)) [Gold Answer]
+   type: qa_aligned_precision_moverscore_question_answer_generation_with_gold_answer_gold_answer
    value: 68.23
  ---

@@ -117,16 +132,24 @@ output = pipe("Нелишним будет отметить, что, разви
  | ROUGE_L | 31.39 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |


- - ***Metric (Question & Answer Generation)***:
+ - ***Metric (Question & Answer Generation, Reference Answer)***: Each question is generated from *the gold answer*. [raw metric file](https://huggingface.co/lmqg/mt5-small-ruquad-qg/raw/main/eval/metric.first.answer.paragraph.questions_answers.lmqg_qg_ruquad.default.json)

  | | Score | Type | Dataset |
  |:--------------------------------|--------:|:--------|:-----------------------------------------------------------------|
+ | BERTScore | 90.03 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | Bleu_1 | 45.81 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | Bleu_2 | 34.13 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | Bleu_3 | 25.81 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | Bleu_4 | 18.61 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | METEOR | 43.1 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | MoverScore | 67.82 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedF1Score (BERTScore) | 90.17 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedF1Score (MoverScore) | 68.22 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedPrecision (BERTScore) | 90.17 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedPrecision (MoverScore) | 68.23 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedRecall (BERTScore) | 90.16 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
  | QAAlignedRecall (MoverScore) | 68.21 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |
+ | ROUGE_L | 51.04 | default | [lmqg/qg_ruquad](https://huggingface.co/datasets/lmqg/qg_ruquad) |