fix grammar error
app.py
CHANGED
@@ -21,7 +21,7 @@ The evaluation is conducted on 4 tasks across 8 datasets:
 - MIRACL (Thai only), contains 733 test samples, https://huggingface.co/datasets/miracl/miracl
 - TyDiQA (Thai only), contains 763 test samples, https://huggingface.co/datasets/chompk/tydiqa-goldp-th
 ## Metrics
-The evaluation
+The evaluation metric for each task is as follows:
 1. STS: Spearman’s Rank Correlation
 2. Text Classification: F1 Score
 3. Pair Classification: Average Precision
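
For readers unfamiliar with the three metrics named in this hunk, the following is a minimal pure-Python sketch of how each one can be computed. It is not the code from app.py, which this diff does not show. The function names (`spearman`, `f1_binary`, `average_precision`) are hypothetical, and the choice of binary F1 is an assumption: the text does not say whether the leaderboard uses binary, macro, or micro averaging.

```python
from typing import List, Sequence


def _ranks(values: Sequence[float]) -> List[float]:
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    rx, ry = _ranks(x), _ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5


def f1_binary(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 for the positive class (label 1). Binary averaging is an assumption."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def average_precision(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Mean of precision@k over the ranks k at which a positive pair appears."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits, total = 0, 0.0
    for k, i in enumerate(order, start=1):
        if y_true[i] == 1:
            hits += 1
            total += hits / k
    return total / hits if hits else 0.0
```

For STS, `x` would be the gold similarity scores and `y` the model's similarity scores; for pair classification, `scores` would be the model's similarity for each pair. Library implementations may treat tied scores differently from this sketch, so small discrepancies are possible on data with ties.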