es00jung commited on
Commit
dd00ada
1 Parent(s): 39e5ba3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -40,5 +40,5 @@ paragraph2 = '2020년은 미국의 지식재산 정책에 있어서 중요한
40
  # Training details
41
  Datasets:
42
  - KorQuAD2.1 and AIHub Goverment Documents cleaned from HTML (randomly sampled 20k samples for each)
43
- - Positive samples: randomly break into a pair with '\n'
44
- - Negative samples: randomly break into a pair (separate in the middle of a sentence)
 
40
  # Training details
41
  Datasets:
42
  - KorQuAD2.1 and AIHub Goverment Documents cleaned from HTML (randomly sampled 20k samples for each)
43
+ - Positive samples: context is randomly break into a pair with '\n' (line breakers between list items, paragraphs, header and paragraphs, ...)
44
+ - Negative samples: context is randomly break into a pair (separate in the middle of a sentence)