yulongl commited on
Commit
0be77bb
1 Parent(s): 2210b7c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Basic Information
2
+
3
+ This is the Dr. Decr model used in XOR-TyDi leaderboard task 1 whitebox submission.
4
+
5
+ https://nlp.cs.washington.edu/xorqa/
6
+
7
+
8
+ The detailed implementation of the model can be found in:
9
+
10
+ https://arxiv.org/pdf/2112.08185.pdf
11
+
12
+ Source code to train the model can be found in:
13
+ ```
14
+ To be updated.
15
+ ```
16
+
17
+ It is a Neural IR model therefore not directly compatible with Huggingface API. The inference result on XOR Dev dataset is:
18
+ ```
19
+ R@2kt R@5kt
20
+ te 79.41 83.19
21
+ bn 77.96 82.89
22
+ fi 65.92 72.61
23
+ ja 63.07 67.63
24
+ ko 60.35 68.07
25
+ ru 60.76 68.35
26
+ ar 65.70 73.14
27
+ Avg 67.60 73.70
28
+ ```
29
+
30
+ # Limitations
31
+
32
+ This model used pre-trained XLMR model and fine tuned on 7 languages in XOR-TyDi leaderboard. The performance of other languages was not tested.
33
+
34
+
35
+ # Citation
36
+ ```
37
+ @article{Li2021_DrDecr,
38
+ doi = {10.48550/ARXIV.2112.08185},
39
+ url = {https://arxiv.org/abs/2112.08185},
40
+ author = {Li, Yulong and Franz, Martin and Sultan, Md Arafat and Iyer, Bhavani and Lee, Young-Suk and Sil, Avirup},
41
+ keywords = {Computation and Language (cs.CL), Artificial Intelligence (cs.AI), FOS: Computer and information sciences, FOS: Computer and information sciences},
42
+ title = {Learning Cross-Lingual IR from an English Retriever},
43
+ publisher = {arXiv},
44
+ year = {2021}
45
+ }
46
+ ```