bradhiltonendercorp committed on
Commit 8e3301d · verified · 1 Parent(s): 7a868a7

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ Deductive Reasoning Qwen 14B is a reinforcement fine-tune of [Qwen 2.5 14B Instr
 
 Here are some additional resources to check out:
 
-- [Blog Post](https://openpipe.ai/blog)
+- [Blog Post](https://openpipe.ai/blog/using-grpo-to-beat-o1-o3-mini-and-r1-on-temporal-clue)
 - [Training Recipe](https://github.com/openpipe/deductive-reasoning)
 - [RL Experiments](https://github.com/openpipe/rl-experiments)
 - [Deductive Reasoning Qwen 32B](https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-32B)