Update README.md
Browse files
README.md
CHANGED
@@ -6,11 +6,9 @@ datasets:
|
|
6 |
|
7 |
# INTELLECT-2
|
8 |
|
9 |
-
INTELLECT-2 is a 32 billion parameter language model trained through
|
10 |
|
11 |
-
The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability.
|
12 |
-
|
13 |
-
For detailed information, see our [technical report](link).
|
14 |
|
15 |

|
16 |
|
|
|
6 |
|
7 |
# INTELLECT-2
|
8 |
|
9 |
+
INTELLECT-2 is a 32 billion parameter language model trained through a reinforcement learning run leveraging globally distributed, permissionless GPU resources contributed by the community.
|
10 |
|
11 |
+
The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability. For detailed information on our infrastructure and training recipe, see our [technical report](link).
|
|
|
|
|
12 |
|
13 |

|
14 |
|