Update README.md
Browse files
README.md
CHANGED
@@ -40,5 +40,5 @@ paragraph2 = '2020년은 미국의 지식재산 정책에 있어서 중요한
|
|
40 |
# Training details
|
41 |
Datasets:
|
42 |
- KorQuAD2.1 and AIHub Goverment Documents cleaned from HTML (randomly sampled 20k samples for each)
|
43 |
-
- Positive samples: randomly break into a pair with '\n'
|
44 |
-
- Negative samples: randomly break into a pair (separate in the middle of a sentence)
|
|
|
40 |
# Training details
|
41 |
Datasets:
|
42 |
- KorQuAD2.1 and AIHub Goverment Documents cleaned from HTML (randomly sampled 20k samples for each)
|
43 |
+
- Positive samples: context is randomly break into a pair with '\n' (line breakers between list items, paragraphs, header and paragraphs, ...)
|
44 |
+
- Negative samples: context is randomly break into a pair (separate in the middle of a sentence)
|