Update README.md
Browse files
README.md
CHANGED
@@ -73,14 +73,14 @@ print(tokenizer.decode(output))
|
|
73 |
```
|
74 |
|
75 |
|
76 |
-
## Model Details
|
77 |
|
78 |
- **Model type:** Transformer-based Language Model
|
79 |
-
- **Total seen tokens:**
|
80 |
|
81 |
|Model|Params|Layers|Hidden size|Heads|Context length|
|
82 |
|:---:|:---:|:---:|:---:|:---:|:---:|
|
83 |
-
|13b model|13b|40|5120|40|
|
84 |
|
85 |
|
86 |
## Training
|
|
|
73 |
```
|
74 |
|
75 |
|
76 |
+
## Model Details
|
77 |
|
78 |
- **Model type:** Transformer-based Language Model
|
79 |
+
- **Total seen tokens:** 256B
|
80 |
|
81 |
|Model|Params|Layers|Hidden size|Heads|Context length|
|
82 |
|:---:|:---:|:---:|:---:|:---:|:---:|
|
83 |
+
|13b model|13b|40|5120|40|4096|
|
84 |
|
85 |
|
86 |
## Training
|