Update README.md
Browse files
README.md
CHANGED
@@ -206,14 +206,14 @@ a) On data without Punctuation and Capitalization
|
|
206 |
|
207 |
| Version | Tokenizer | Vocabulary Size | MCV 12.0 Dev | MCV 12.0 Test | MLS Dev | MLS Test | VoxPopuli Dev | VoxPopuli Test |
|
208 |
|---------|-----------------------|-----------------|--------------|---------------|---------|----------|---------------|----------------|
|
209 |
-
| 1.18.0 | SentencePiece Unigram | 1024 | 5.14
|
210 |
|
211 |
|
212 |
b) On data with Punctuation and Capitalization
|
213 |
|
214 |
| Version | Tokenizer | Vocabulary Size | MCV 12.0 Dev | MCV 12.0 Test | MLS Dev | MLS Test | VoxPopuli Dev | VoxPopuli Test |
|
215 |
|---------|-----------------------|-----------------|--------------|---------------|---------|----------|---------------|----------------|
|
216 |
-
| 1.18.0 | SentencePiece Unigram | 1024 | 7.75
|
217 |
|
218 |
|
219 |
## Limitations
|
|
|
206 |
|
207 |
| Version | Tokenizer | Vocabulary Size | MCV 12.0 Dev | MCV 12.0 Test | MLS Dev | MLS Test | VoxPopuli Dev | VoxPopuli Test |
|
208 |
|---------|-----------------------|-----------------|--------------|---------------|---------|----------|---------------|----------------|
|
209 |
+
| 1.18.0 | SentencePiece Unigram | 1024 | 5.14% | 5.68% | 13.83% | 11.71% | 12.80% | 15.72% |
|
210 |
|
211 |
|
212 |
b) On data with Punctuation and Capitalization
|
213 |
|
214 |
| Version | Tokenizer | Vocabulary Size | MCV 12.0 Dev | MCV 12.0 Test | MLS Dev | MLS Test | VoxPopuli Dev | VoxPopuli Test |
|
215 |
|---------|-----------------------|-----------------|--------------|---------------|---------|----------|---------------|----------------|
|
216 |
+
| 1.18.0 | SentencePiece Unigram | 1024 | 7.75% | 8.17% | 26.37% | 22.48% | 16.78% | 19.55% |
|
217 |
|
218 |
|
219 |
## Limitations
|