Commit
·
79168c1
1
Parent(s):
077d34a
Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ inference: false
|
|
11 |
---
|
12 |
# Model Card for Mobius-12B-base-m1
|
13 |
The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
|
14 |
-
We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of
|
15 |
|
16 |
|
17 |
## Warning
|
|
|
11 |
---
|
12 |
# Model Card for Mobius-12B-base-m1
|
13 |
The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch.
|
14 |
+
We utilized 0.01 billion tokens to conduct post-training on this model for alignment benchmarks, excluding the utilization of [DPO and SFT](https://github.com/BBuf/trl/pull/1). The process took approximately 10 hours, employing 4 * a800.
|
15 |
|
16 |
|
17 |
## Warning
|