Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ Copy from this [model card](https://huggingface.co/TimeMobius/Mobius-12B-base-m1
 
 # Model Card for Mobius-12B-base-m1
 The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
-We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of
+We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of [DPO and SFT](https://github.com/BBuf/trl/pull/1). The process took approximately 10 hours, employing 4 * a800.
 
 
 ## Warning