xiaol commited on
Commit
6921c8d
·
1 Parent(s): cbd7077

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -13,7 +13,7 @@ Copy from this [model card](https://huggingface.co/TimeMobius/Mobius-12B-base-m1
13
 
14
  # Model Card for Mobius-12B-base-m1
15
  The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
16
- We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of SFT and DPO. The process took approximately 10 hours, employing 4 * a800.
17
 
18
 
19
  ## Warning
 
13
 
14
  # Model Card for Mobius-12B-base-m1
15
  The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
16
+ We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of [DPO and SFT](https://github.com/BBuf/trl/pull/1). The process took approximately 10 hours, employing 4 * a800.
17
 
18
 
19
  ## Warning