Update README.md
Browse files
README.md
CHANGED
@@ -36,12 +36,6 @@ How to use:
|
|
36 |
* use latest rwkv pip package (0.8.0+)
|
37 |
* use https://github.com/BlinkDL/ChatRWKV/blob/main/v2/benchmark_world.py and https://github.com/BlinkDL/ChatRWKV/blob/main/API_DEMO_WORLD.py to test it
|
38 |
|
39 |
-
The differences between World & Raven:
|
40 |
-
* set pipeline = PIPELINE(model, "rwkv_vocab_v20230424") instead of 20B_tokenizer.json (EXACTLY AS WRITTEN HERE. "rwkv_vocab_v20230424" is included in rwkv 0.7.4+)
|
41 |
-
* use Question/Answer or User/AI or Human/Bot for chat. **DO NOT USE Bob/Alice or Q/A**
|
42 |
-
|
43 |
-
For 0.1/0.4/1.5B models, use **fp32** for first layer (will overflow in fp16 at this moment - fixable in future), or bf16 if you have 30xx/40xx GPUs. Example strategy: cuda fp32 *1 -> cuda fp16
|
44 |
-
|
45 |
NOTE: the new greedy tokenizer (https://github.com/BlinkDL/ChatRWKV/blob/main/tokenizer/rwkv_tokenizer.py) will tokenize '\n\n' as one single token instead of ['\n','\n']
|
46 |
|
47 |
QA prompt (replace \n\n in xxx to \n):
|
|
|
36 |
* use latest rwkv pip package (0.8.0+)
|
37 |
* use https://github.com/BlinkDL/ChatRWKV/blob/main/v2/benchmark_world.py and https://github.com/BlinkDL/ChatRWKV/blob/main/API_DEMO_WORLD.py to test it
|
38 |
|
|
|
|
|
|
|
|
|
|
|
|
|
39 |
NOTE: the new greedy tokenizer (https://github.com/BlinkDL/ChatRWKV/blob/main/tokenizer/rwkv_tokenizer.py) will tokenize '\n\n' as one single token instead of ['\n','\n']
|
40 |
|
41 |
QA prompt (replace \n\n in xxx to \n):
|