BlinkDL committed on
Commit
1a3be9c
1 Parent(s): 89b3b19

Update README.md

Files changed (1)
  1. README.md +3 -0
README.md CHANGED
@@ -24,6 +24,9 @@ ctx_len = 1024
 n_layer = 40
 n_embd = 5120
 
+RWKV-4-Pile-14B-2023xxxx-ctx4096-testxxx.pth : Fine-tuned to ctx_len 4096.
+* Likely better. Please test.
+
 Final checkpoint: RWKV-4-Pile-14B-20230213-8019.pth : Trained on the Pile for 331B tokens.
 * Pile loss 1.7579
 * LAMBADA ppl 3.81, acc 71.05%