Commit d652eda (verified), committed by justus27
1 parent: 407f1d7

Update README.md

Files changed (1): README.md (+2 -4)
@@ -6,11 +6,9 @@ datasets:
 
 # INTELLECT-2
 
-INTELLECT-2 is a 32 billion parameter language model trained through globally distributed reinforcement learning (RL) run on permissionless, community-contributed GPU resources.
+INTELLECT-2 is a 32 billion parameter language model trained through a reinforcement learning run leveraging globally distributed, permissionless GPU resources contributed by the community.
 
-The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability.
-
-For detailed information, see our [technical report](link).
+The model was trained using [prime-rl], a framework designed for distributed asynchronous RL, using GRPO over verifiable rewards along with modifications for improved training stability. For detailed information on our infrastructure and training recipe, see our [technical report](link).
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)
 