Update README.md
Browse files
README.md
CHANGED
@@ -34,7 +34,7 @@ Details:
|
|
34 |
- uses Claude3 tokenizer (as hf GPT2 tokenizer)
|
35 |
- hidden size 1024, 12 layers, 8 experts
|
36 |
|
37 |
-
achieves the following results on the evaluation set (
|
38 |
- Loss: 3.0366
|
39 |
- Accuracy: 0.4514
|
40 |
- Num Input Tokens Seen: 1975517184
|
|
|
34 |
- uses Claude3 tokenizer (as hf GPT2 tokenizer)
|
35 |
- hidden size 1024, 12 layers, 8 experts
|
36 |
|
37 |
+
achieves the following results on the evaluation set (_most recent dataset_):
|
38 |
- Loss: 3.0366
|
39 |
- Accuracy: 0.4514
|
40 |
- Num Input Tokens Seen: 1975517184
|