leafspark/wikichat


leafspark committed on Apr 15

Commit b8fb65b (1 parent: b37641c)

Update README.md

Files changed (1)

README.md +3 -3


@@ -20,7 +20,7 @@ The GGUFs uploaded are full FP16 precision.
 - 40M parameters
 - 8 attention heads
 - 28 layers
-- 1536 context
+- 4096 context (upgraded from 1536, please expect a temporary performance drop)
 ## Prompt Format:
 ```
@@ -38,8 +38,8 @@ Ensure clarity and practicality, allowing readers to easily follow and apply the
 ## Training Details:
 - 1x RTX 3070 8GB
 - 1x Ryzen 3 3700x
-- 4810 iterations
-- Approx 50k tokens (>0.01 epoches)
+- 7660 iterations
+- Approx 100 million tokens/120k samples (>0.01 epoches)
 - Training data = 1 billion tokens
 ## Notes:
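
The updated training figures in this commit imply a few rough ratios. The following is a minimal sketch, assuming the README's approximate numbers can be treated as exact and that one epoch means one pass over the stated 1 billion tokens. The function name `training_ratios` and the constant names are hypothetical and do not come from the repository.

```python
# Figures copied from the updated README; treating them as exact is an assumption,
# since the README itself marks the token count as approximate.
TOKENS_SEEN = 100_000_000       # "Approx 100 million tokens"
SAMPLES_SEEN = 120_000          # "120k samples"
ITERATIONS = 7_660              # "7660 iterations"
DATASET_TOKENS = 1_000_000_000  # "Training data = 1 billion tokens"


def training_ratios(tokens_seen, samples_seen, iterations, dataset_tokens):
    """Derive rough per-step and per-sample ratios from headline training figures."""
    return {
        # Share of the dataset processed so far (1.0 would be one full epoch).
        "epoch_fraction": tokens_seen / dataset_tokens,
        # Average number of tokens consumed per optimizer iteration.
        "tokens_per_iteration": tokens_seen / iterations,
        # Average length of a training sample, in tokens.
        "tokens_per_sample": tokens_seen / samples_seen,
    }


ratios = training_ratios(TOKENS_SEEN, SAMPLES_SEEN, ITERATIONS, DATASET_TOKENS)
for name, value in ratios.items():
    print(f"{name}: {value:,.2f}")
```

Under these assumptions the model has seen about a tenth of the dataset, which is consistent with the README's ">0.01" epoch figure, and the average sample is well under the new 4096-token context.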