Crataco/RWKV-4-PilePlus-Series-GGML

Text Generation


Merry committed on May 24, 2023

Commit b1fb133 · 1 Parent(s): 0ec7445

Update README.md

Files changed (1)

README.md +34 -0

@@ -1,3 +1,37 @@
 ---
+language:
+- en
+tags:
+- ggml
+- text-generation
+- causal-lm
+- rwkv
 license: apache-2.0
+datasets:
+- EleutherAI/pile
+- togethercomputer/RedPajama-Data-1T
 ---
+**Last updated:** 2023-05-23
+This is [BlinkDL/rwkv-4-pileplus](https://huggingface.co/BlinkDL/rwkv-4-pileplus) converted to GGML for use with rwkv.cpp and KoboldCpp. [rwkv.cpp's conversion instructions](https://github.com/saharNooby/rwkv.cpp#option-32-convert-and-quantize-pytorch-model) were followed.
+Original model card is below.
+* * *
+# RWKV-4 PilePlus
+## Model Description
+RWKV-4-pile models finetuned on [RedPajama + some of Pile v2 = 1.7T tokens]. Updated with 2020+2021+2022 data, and better at all European languages.
+Although some of these are intermediate checkpoints (XXXGtokens means finetuned for XXXG tokens), you can already use them because I am finetuning from Pile models (instead of retraining).
+Note: these models are not instruct-tuned yet, and they are recommended as replacements for the vanilla Pile models.
+7B and 14B coming soon.
+See https://github.com/BlinkDL/RWKV-LM for details.
+Use https://github.com/BlinkDL/ChatRWKV to run it.