|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- BAAI/COIG-PC |
|
- ehartford/dolphin |
|
- emozilla/booksum-summary-analysis_llama-8192 |
|
- OpenLeecher/GPT4-10k |
|
- 0x70DA/stackoverflow-chat-data |
|
- togethercomputer/Long-Data-Collections |
|
--- |
|
|
|
## RWKV 7B world focus on reading comprehension |
|
|
|
This is a experimental model based on RWKV 7B world. |
|
|
|
why this model is special? ===> |
|
remove eod, add special token, change vocabs. |
|
|
|
this model is used to QA in large texts, do some in context learning with knowledge indexed database. |
|
|
|
|
|
## trainning details |
|
|
|
train with this kind of new format, |
|
|
|
```<s>User: <sys>xxxx\n\n</sys>xxxxx</s><s>Assistant: xxxxx\n\n</s><s>User: xxxx\n\n</s><s>Assistant: \n\n</s>``` |
|
|
|
so ,use User Assistant as your prefix names. |
|
and when inference in RWKV runner, just use the following format is fine. |
|
|
|
User: xxxx\n\nAssistant: xxxx\n\n,in which are the test cases used. |
|
|
|
|
|
-------------------------------------------- |
|
to use this model with RWKV runner,some effort needed, copy back-python folder to a new one ,which is in the same folder with rwkv-runner.exe(or the file to run) , then pastin rwkv_vocab_v20230424.txt into rwkv_pip folder to replace the vocabs file |
|
|
|
../py310/python main.py in this new folder, then use RWKV runner setting API to 127.0.0.0.1:8000, and go to 127.0.0.1:8000/docs to switch model using this one |
|
|
|
try different temp and topp , 1.2 0.5 may works. |
|
|
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/K5k9xaIjqm96buZ5czzfE.png) |
|
|
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/NmnLnzJq9FYSTd8w-uX4g.png) |
|
|
|
|
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/yfWGk3n_G-5tDfQKNzkaV.png) |
|
|
|
|
|
temp 1.2 topp 0.6 |
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6176b32847ee6431f632981e/eb5_sEfPt8SHarLXJP1ig.png) |
|
|