Update README.md
README.md
CHANGED
@@ -63,6 +63,8 @@ datasets:
 
 # smol_llama-81M-tied
 
+<img src="smol-llama-banner.png" alt="banner" style="max-width:80%; height:auto;">
+
 A small 81M param (total) decoder model, enabled through tying the input/output embeddings. This is the first version of the model.
 
 - 768 hidden size, 6 layers
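
The README text in this diff says the 81M total is "enabled through tying the input/output embeddings". As a rough sketch of why tying reduces the count: a tied model stores one `vocab_size x hidden_size` matrix and reuses it for the output projection, instead of storing two. The hidden size of 768 comes from the README. The vocabulary size below is an assumption for illustration only (the diff does not state it), as is the absence of a bias on the output projection.

```python
# Hypothetical vocabulary size: NOT stated in the diff, chosen only for illustration.
ASSUMED_VOCAB_SIZE = 32_000
HIDDEN_SIZE = 768  # from the README text in the diff


def embedding_params(vocab_size: int, hidden_size: int) -> int:
    """Number of weights in one vocab_size x hidden_size embedding matrix."""
    return vocab_size * hidden_size


def params_saved_by_tying(vocab_size: int, hidden_size: int) -> int:
    """Weights saved when the output projection reuses the input embedding.

    Assumes the output projection has no bias term, so an untied model
    stores exactly two matrices of the same shape and a tied model stores one.
    """
    untied = 2 * embedding_params(vocab_size, hidden_size)
    tied = embedding_params(vocab_size, hidden_size)
    return untied - tied


saved = params_saved_by_tying(ASSUMED_VOCAB_SIZE, HIDDEN_SIZE)
print(f"{saved:,} parameters saved under the assumed vocabulary size")
```

With a different vocabulary size the saving scales linearly, so the figure printed here should not be read as a property of this particular model.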