bradhiltonendercorp commited on
Commit
7755dcc
·
verified ·
1 Parent(s): ea3ccdd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -3
README.md CHANGED
@@ -1,3 +1,26 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ license_link: https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-14B/blob/main/LICENSE
4
+ language:
5
+ - en
6
+ pipeline_tag: text-generation
7
+ base_model:
8
+ - Qwen/Qwen2.5-1.5B-Instruct
9
+ tags:
10
+ - chat
11
+ library_name: transformers
12
+ ---
13
+
14
+ # Deductive-Reasoning-Qwen-14B
15
+
16
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/674a1d102c0f27a385772cfe/JauBmEQM0FpOdShBMSfst.png)
17
+
18
+ Deductive Reasoning Qwen 14B is a reinforcement fine-tune of [Qwen 2.5 14B Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) to solve challenging deduction problems from the [Temporal Clue](https://github.com/bradhilton/temporal-clue) dataset, trained by [OpenPipe](https://openpipe.ai)!
19
+
20
+ Here are some additional resources to check out:
21
+
22
+ - Blog Post
23
+ - [Training Recipe](https://github.com/openpipe/deductive-reasoning)
24
+ - [RL Experiments](https://github.com/openpipe/rl-experiments)
25
+ - [Deductive Reasoning Qwen 32B](https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-32B)
26
+