Update README.md
README.md
CHANGED
@@ -13,6 +13,7 @@ In comparison with the previously released Mobius, the improvements include:
* Significant performance improvement;
* Multilingual support;
* Stable support of 128K context length.
+* Base model: [Mobius-mega-12B-128k-base](https://huggingface.co/TimeMobius/Moibus-mega-12B-128k-base)

## Usage

@@ -21,7 +22,7 @@ We encourage you to use few-shot prompting with this model; despite directly using User: xxx
## More details
Mobius 12B 128k is based on the RWKV v5.2 architecture, a leading state-based RNN+CNN+Transformer mixed large language model focused on the open-source community.
* 10~100x training/inference cost reduction;
-* state based, which means good at
+* state based, selective memory, which means good at grokking;
* community support.

## requirements

@@ -33,4 +34,4 @@ Mobius 12B 128k is based on the RWKV v5.2 architecture, a leading state-based RN
## future plan
If you need a HF version let us know

-[Mobius-Chat-12B-128k](https://huggingface.co/TimeMobius/Mobius-Chat-12B-128k)
+[Mobius-Chat-12B-128k](https://huggingface.co/TimeMobius/Mobius-Chat-12B-128k)
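The few-shot usage the README encourages can be sketched by assembling exemplar turns in the `User:`/`Assistant:` chat format before the real question. This is a minimal sketch under assumptions: the exemplar questions and the `build_prompt` helper are illustrative, not part of the model card, and the exact turn separators your runtime expects may differ.

```python
# Minimal sketch of few-shot prompt assembly in the User:/Assistant: chat
# format. The exemplar pairs below are illustrative placeholders; swap in
# examples that match your actual task.
FEW_SHOTS = [
    ("What is the capital of France?", "Paris."),
    ("What is 7 * 6?", "42."),
]

def build_prompt(question: str, shots=FEW_SHOTS) -> str:
    """Concatenate few-shot exemplars, then append the real question."""
    parts = []
    for q, a in shots:
        parts.append(f"User: {q}\n\nAssistant: {a}\n\n")
    parts.append(f"User: {question}\n\nAssistant:")
    return "".join(parts)

prompt = build_prompt("What is the capital of Japan?")
print(prompt)
```

The prompt ends with a bare `Assistant:` so the model completes the final answer; the preceding exemplars steer its format and style.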