TimeMobius
/

Mobius-RWKV-12B-base-m1

Model card Files Files and versions Community

TimeMobius commited on Dec 31, 2023

Commit

79168c1

·

1 Parent(s): 077d34a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -11,7 +11,7 @@ inference: false
 ---
 # Model Card for Mobius-12B-base-m1
 The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
-We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of SFT and DPO. The process took approximately 10 hours, employing 4 * a800.
 ## Warning

 ---
 # Model Card for Mobius-12B-base-m1
 The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
+We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of [DPO and SFT](https://github.com/BBuf/trl/pull/1). The process took approximately 10 hours, employing 4 * a800.
 ## Warning