Update README.md
Browse files
README.md
CHANGED
|
@@ -25,7 +25,7 @@ OctoThinker-3B-Short-Zero is trained using the R1-Zero-style reinforcement learn
|
|
| 25 |
### Training Recipe for OctoThinker-3B-Short-Base
|
| 26 |
|
| 27 |
<div style="display: flex; justify-content: left; gap: 20px;">
|
| 28 |
-
<img src="https://cdn-uploads.huggingface.co/production/uploads/62cbeb2d72dfd24b86bdf977/
|
| 29 |
</div>
|
| 30 |
|
| 31 |
|
|
|
|
| 25 |
### Training Recipe for OctoThinker-3B-Short-Base
|
| 26 |
|
| 27 |
<div style="display: flex; justify-content: left; gap: 20px;">
|
| 28 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/62cbeb2d72dfd24b86bdf977/2sFzePngjjopTs0SeCS9R.png" alt="Data Pipeline" style="width:90%;">
|
| 29 |
</div>
|
| 30 |
|
| 31 |
|