Update README.md
Browse files
README.md
CHANGED
@@ -9,8 +9,7 @@ metrics:
|
|
9 |
- cross_entropy
|
10 |
---
|
11 |
|
12 |
-
**(!) _Don't forget to preprocess unknown_tokens and substitute them with <|endoftext
|
13 |
-
|
14 |
**Dependence of the cross entropy loss on the length of the context for prediction**
|
15 |
|
16 |
- x-axis*128 = context length
|
|
|
9 |
- cross_entropy
|
10 |
---
|
11 |
|
12 |
+
**(!) _Don't forget to preprocess unknown_tokens and substitute them with <|endoftext|>. Otherwise the <unk> tokens in dataset will be split into the '<', 'unk' and '>' tokens_**
|
|
|
13 |
**Dependence of the cross entropy loss on the length of the context for prediction**
|
14 |
|
15 |
- x-axis*128 = context length
|