Commit 3f75b60 by deliciouscat
1 Parent(s): 054bc60
Update README.md
README.md
CHANGED
@@ -12,6 +12,8 @@ language:
 
 - Decoder: `deliciouscat/deberta-v3-base-decoder-v0.1` (6 transformer layers, 8 attention heads)
 
+-> 297511524(298M) params
+
 ## Data used
 
 `HuggingFaceFW/fineweb` -> sampled 124800
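
The commit adds a parameter count (297511524, rounded to 298M) but does not say how it was obtained. As a minimal sketch, assuming a PyTorch-style model object whose `parameters()` method yields tensors exposing `numel()` and `requires_grad`, a figure like this could be computed and rounded as follows. The helper names `count_parameters` and `human_readable` are hypothetical and are not taken from the repository.

```python
def count_parameters(model, trainable_only=False):
    """Sum the element counts of every tensor yielded by model.parameters().

    Assumes a PyTorch-style interface: each parameter has .numel() and
    .requires_grad. This is an illustrative helper, not the author's code.
    """
    return sum(
        p.numel()
        for p in model.parameters()
        if not trainable_only or p.requires_grad
    )


def human_readable(n):
    """Round a raw parameter count to the nearest million, e.g. for a README."""
    return f"{round(n / 1e6)}M"
```

Applied to the raw count from this commit, `human_readable(297511524)` gives `298M`, which matches the rounded value in the added README line.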