rwkv-4-world #3, opened by manh-linh
- README.md +4 -6
- RWKV-4-World-ARAtuned-7B-v1-20230803-ctx4096.pth → RWKV-4-World-CHNtuned-7B-v1-OnlyForTest_49%_trained-20230703-ctx4096.pth +1 -1
- RWKV-4-World-JPNtuned-7B-v1-20230718-ctx4096.pth +0 -3
- RWKV-4-World-CHNtuned-7B-v1-20230709-ctx4096.pth → RWKV-4-World-JPNtuned-7B-v1-OnlyForTest_10%_trained-20230704-ctx4096.pth +2 -2
README.md
CHANGED
@@ -29,14 +29,12 @@ datasets:
 
 RWKV-4 trained on 100+ world languages (70% English, 15% multilang, 15% code).
 
-
-
-XXXtuned = finetune of World on MC4, OSCAR, wiki, etc.
+Some_Pile + Some_RedPajama + Some_OSCAR + All_Wikipedia + All_ChatGPT_Data_I_can_find
 
 How to use:
-* use
-* use
-*
+* use latest rwkv pip package (0.7.4+)
+* use https://github.com/BlinkDL/ChatRWKV/blob/main/v2/benchmark_world.py to test it
+* larger models are stronger even though not fully trained yet
 
 The differences between World & Raven:
 * set pipeline = PIPELINE(model, "rwkv_vocab_v20230424") instead of 20B_tokenizer.json (EXACTLY AS WRITTEN HERE. "rwkv_vocab_v20230424" is included in rwkv 0.7.4+)
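The usage notes in the README hunk above (rwkv 0.7.4+, and the "rwkv_vocab_v20230424" tokenizer name passed exactly as written) can be sketched in Python. This is a minimal sketch, not the author's script: it assumes the `rwkv` pip package exposes `RWKV` in `rwkv.model` and `PIPELINE` in `rwkv.utils`, as in the ChatRWKV examples. The helper names `version_tuple`, `meets_minimum`, and `load_world_pipeline`, and the default `"cpu fp32"` strategy, are illustrative choices that do not come from the README.

```python
import re
from importlib import metadata

MIN_RWKV = "0.7.4"                    # the README says the World vocab ships with 0.7.4+
WORLD_VOCAB = "rwkv_vocab_v20230424"  # must be passed exactly as written


def version_tuple(version):
    """Turn '0.7.4' into (0, 7, 4); only the first three numeric parts are kept."""
    return tuple(int(part) for part in re.findall(r"\d+", version)[:3])


def meets_minimum(installed, minimum=MIN_RWKV):
    """Return True if the installed version is at least the required minimum."""
    return version_tuple(installed) >= version_tuple(minimum)


def load_world_pipeline(model_path, strategy="cpu fp32"):
    """Hypothetical helper: load a World checkpoint with the World tokenizer.

    Assumes the rwkv package is installed; the imports are deferred so the
    version check can fail with a clear message first.
    """
    installed = metadata.version("rwkv")
    if not meets_minimum(installed):
        raise RuntimeError(f"rwkv {installed} is older than {MIN_RWKV}")

    from rwkv.model import RWKV
    from rwkv.utils import PIPELINE

    model = RWKV(model=model_path, strategy=strategy)
    # World models use the vocab name, not 20B_tokenizer.json (that one is for Raven).
    return PIPELINE(model, WORLD_VOCAB)
```

With a checkpoint downloaded locally, `load_world_pipeline("path/to/checkpoint")` would return a pipeline object on which `generate` can be called. The path here is a placeholder, and the strategy string should be adjusted to the available hardware.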
RWKV-4-World-ARAtuned-7B-v1-20230803-ctx4096.pth → RWKV-4-World-CHNtuned-7B-v1-OnlyForTest_49%_trained-20230703-ctx4096.pth
RENAMED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:1050d59453c8292bff2a046c43bd20893b8cdb91031fb656c113c6c02131001b
 size 15035393448
RWKV-4-World-JPNtuned-7B-v1-20230718-ctx4096.pth
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:3e4c7664ce893ac1f6bb59cd76664fb5c872cb076bb82dbd534db0555b6e9fa5
-size 15035393458
RWKV-4-World-CHNtuned-7B-v1-20230709-ctx4096.pth → RWKV-4-World-JPNtuned-7B-v1-OnlyForTest_10%_trained-20230704-ctx4096.pth
RENAMED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:4b1debccdb74bec1620764164077709dd5f8514d399e25e0f6824c127dfbdac3
+size 15035393443
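The `.pth` entries in this diff are Git LFS pointer files (a `version` line, an `oid sha256:` line, and a `size` line), not the weights themselves. As a rough illustration of how such a pointer could be used to check a downloaded checkpoint, here is a small sketch using only the Python standard library. The function names `parse_lfs_pointer` and `matches_pointer` are made up for this example and are not part of Git LFS or the Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        if line.strip():
            key, _, value = line.partition(" ")
            fields[key] = value.strip()
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,            # e.g. "sha256"
        "oid": digest,           # hex digest of the real file
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a ~15 GB file needlessly
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so large checkpoints are never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["oid"]
```

Comparing the size before hashing is a deliberate choice: a truncated download is rejected immediately, and the full SHA-256 pass only runs when the byte count already agrees.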