teknium commited on
Commit
9fb010d
·
verified ·
1 Parent(s): f09488e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +96 -0
README.md ADDED
@@ -0,0 +1,96 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: llama3
5
+ tags:
6
+ - Llama-3
7
+ - RL
8
+ - Atropos
9
+ - Tool Calling
10
+ - Nous Research
11
+ - instruct
12
+ - finetune
13
+ - reasoning
14
+ - function calling
15
+ - transformers
16
+ - reinforcement-learning
17
+ - json mode
18
+ - chatml
19
+ base_model: meta-llama/Meta-Llama-3.1-8B
20
+ library_name: transformers
21
+
22
+ ---
23
+
24
+ # DeepHermes Tool Calling Specialist - Atropos RL
25
+
26
+ ## Model Overview
27
+
28
+ The **DeepHermes Tool Calling Specialist - Atropos RL** model is an experimental artifact fine-tuned by Nous Research using their innovative open-source reinforcement learning framework—Atropos. This variant specifically improves the tool calling performance of the **DeepHermes 3 Llama-3.1 8B** model during its reasoning mode.
29
+
30
+ **Note**: This model is intended as an experimental artifact and is not designed for broad, general-purpose use.
31
+
32
+ ## Atropos Open Source Framework
33
+
34
+ Atropos is Nous Research’s open-source Reinforcement Learning environment stack, designed to enhance various aspects of LLM functionalities through structured RL methodologies. We encourage contributions and exploration:
35
+
36
+ 🔗 [Atropos GitHub Repository](https://github.com/NousResearch/Atropos)
37
+
38
+ ## Benchmark Results
39
+
40
+ Evaluations on the Berkeley Function Calling benchmark demonstrate significant improvements in tool calling accuracy during reasoning mode, compared to its base model:
41
+
42
+ | Benchmark | Base Accuracy | Atropos RL Accuracy | Improvement |
43
+ | --------- | ------------- | ------------------- | ----------- |
44
+ | Parallel | 0.10 | 0.46 | **4.6x** |
45
+ | Simple | 0.21 | 0.5175 | **2.5x** |
46
+
47
+ These enhancements are due to RL fine-tuning specifically targeted at improving reasoning-based tool calling capabilities.
48
+
49
+ ## Key Features
50
+
51
+ - **Improved Tool Calling in Reasoning Mode**: Reinforcement learning significantly boosts tool usage during complex reasoning tasks.
52
+ - **Open-Source RL Framework**: Utilizes the fully open-source Atropos RL Environments.
53
+ - **Active Open Source Community**: Contributions welcomed on the Atropos GitHub.
54
+ - **Upcoming SOTA RL Trainer**: A state-of-the-art open-source reinforcement learning trainer by Nous Research is coming soon.
55
+
56
+ ## Usage
57
+
58
+ This model supports multiple inference modes including:
59
+
60
+ - **Reasoning (Deep Thinking Mode)**
61
+ - **Standard Chat/Instruction Mode**
62
+ - **Structured JSON Outputs**
63
+ - **Function Calling**
64
+
65
+ Detailed documentation and example inference code are available:
66
+
67
+ \*Note: You must first place DeepHermes' reasoning system prompt, and then append  your function calling system prompt after for it to do reasoning and tool calling simultaneously.\* 
68
+
69
+ 🔗 [Hermes Function Calling GitHub](https://github.com/NousResearch/Hermes-Function-Calling)
70
+
71
+ ## Quantized Versions
72
+
73
+ Quantized GGUF versions available here:
74
+
75
+ 🔗 [DeepHermes 3 Quantized GGUF](https://huggingface.co/NousResearch/DeepHermes-3-Llama-3-8B-Preview-GGUF)
76
+
77
+ ## How to Cite
78
+
79
+ ```bibtex
80
+ @misc{
81
+ title={DeepHermes Tool Calling Specialist - Atropos RL},
82
+ author={Teknium and Roger Jin and Chen Guang and Jai Suphavadeeprasit and Jeffrey Quesnelle},
83
+ year={2025},
84
+ url={https://huggingface.co/NousResearch/DeepHermes-Tool-Calling-Specialist-Atropos-RL}
85
+ }
86
+ ```
87
+
88
+ ## Community and Support
89
+
90
+ For questions, issues, or findings, please open issues or discussions in the respective GitHub repositories:
91
+
92
+ - [Atropos Framework Issues](https://github.com/NousResearch/Atropos/issues)
93
+ - [DeepHermes Models Issues](https://github.com/NousResearch/Hermes-Function-Calling/issues)
94
+
95
+ Nous Research encourages active community engagement and open-source contributions to continuously improve model performance and capabilities.
96
+