BlinkDL commited on
Commit
69d0ce8
1 Parent(s): 7ef50b9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -24,9 +24,10 @@ ctx_len = 1024
24
  n_layer = 24
25
  n_embd = 2048
26
 
27
- New checkpoint: RWKV-4-Pile-1B5-20220929-ctx4096.pth : Fine-tuned to ctx_len = 4096. Use it only when your ctxlen is long. Might be slightly weaker for short ctxlens.
 
28
 
29
- Final checkpoint: RWKV-4-Pile-1B5-20220903-8040.pth : Trained on the Pile for 332B tokens.
30
  * Pile loss 2.0415
31
  * LAMBADA ppl 7.04, acc 56.43%
32
  * PIQA acc 72.36%
 
24
  n_layer = 24
25
  n_embd = 2048
26
 
27
+ RWKV-4-Pile-1B5-20220929-ctx4096.pth : Fine-tuned to ctx_len 4096.
28
+ * Likely better when ctxlen > 100. Please test.
29
 
30
+ RWKV-4-Pile-1B5-20220903-8040.pth : Trained on the Pile for 332B tokens.
31
  * Pile loss 2.0415
32
  * LAMBADA ppl 7.04, acc 56.43%
33
  * PIQA acc 72.36%