---
language: "vi"
tags:
- vi
- xlm-roberta
license: "MIT"
metrics:
- f1
- em

---


# XLM-RoBERTa large for QA in Vietnamese (also supports other languages)

## Overview

- Language model: xlm-roberta-large
- Fine-tuned from: [deepset/xlm-roberta-large-squad2](https://huggingface.co/deepset/xlm-roberta-large-squad2)
- Language: Vietnamese
- Downstream task: Extractive QA
- Dataset: [mailong25/bert-vietnamese-question-answering](https://github.com/mailong25/bert-vietnamese-question-answering/tree/master/dataset)
- Training data: train-v2.0.json (SQuAD 2.0 format)
- Eval data: dev-v2.0.json (SQuAD 2.0 format)
- Infrastructure: 1x Tesla P100 (Google Colab)
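
Since both data files use the SQuAD 2.0 format, a minimal sketch of flattening such a file into one record per question may help when preparing or inspecting the data. The `SAMPLE` object and the helper names `iter_examples` and `load_examples` are illustrative assumptions, not part of the dataset or the training code.

```python
import json

# Hypothetical miniature object in SQuAD 2.0 layout.
# It is made up for illustration and is not taken from the real dataset.
SAMPLE = {
    "version": "v2.0",
    "data": [
        {
            "title": "example",
            "paragraphs": [
                {
                    "context": "Hà Nội là thủ đô của Việt Nam.",
                    "qas": [
                        {
                            "id": "q1",
                            "question": "Thủ đô của Việt Nam là gì?",
                            "is_impossible": False,
                            "answers": [{"text": "Hà Nội", "answer_start": 0}],
                        },
                        {
                            # SQuAD 2.0 also contains unanswerable questions.
                            "id": "q2",
                            "question": "Dân số của Hà Nội là bao nhiêu?",
                            "is_impossible": True,
                            "answers": [],
                        },
                    ],
                }
            ],
        }
    ],
}


def iter_examples(squad):
    """Yield one flat dict per question from a SQuAD 2.0 style object."""
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                yield {
                    "id": qa["id"],
                    "question": qa["question"],
                    "context": context,
                    "answers": [a["text"] for a in qa.get("answers", [])],
                    "is_impossible": qa.get("is_impossible", False),
                }


def load_examples(path):
    """Read a SQuAD 2.0 JSON file (e.g. dev-v2.0.json) and flatten it."""
    with open(path, encoding="utf-8") as f:
        return list(iter_examples(json.load(f)))


examples = list(iter_examples(SAMPLE))
for ex in examples:
    print(ex["id"], ex["is_impossible"], ex["answers"])
```

Calling `load_examples("dev-v2.0.json")` on a local copy of the eval file would return the same flat structure, assuming the file follows the standard SQuAD 2.0 layout.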

## Performance

Evaluated on dev-v2.0.json:
```
  exact: 136 / 141
  f1: 0.9692671394799054
```
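
For readers unfamiliar with these metrics, the following is a simplified sketch of SQuAD-style exact match and token-level F1. The card does not say which evaluation script produced the numbers above, so the normalization here (lowercasing, dropping ASCII punctuation, collapsing whitespace) is an assumption and may differ from what was actually used.

```python
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    return " ".join(text.split())


def exact_match(prediction, truth):
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(truth))


def f1_score(prediction, truth):
    """Token-level F1 between a predicted and a reference answer."""
    pred_tokens = normalize(prediction).split()
    true_tokens = normalize(truth).split()
    if not pred_tokens or not true_tokens:
        # Unanswerable case: score 1.0 only if both sides are empty.
        return float(pred_tokens == true_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(true_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(true_tokens)
    return 2 * precision * recall / (precision + recall)


# The exact-match count reported above, expressed as a ratio.
em_ratio = 136 / 141
print(round(em_ratio, 4))
```

Dataset-level scores are the average of these per-question values, which is why exact match can be read as a count of fully correct answers out of 141.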

## Author

An Pham: ancs21.ps [at] gmail.com

## License

MIT