Update README.md
Browse files
README.md
CHANGED
@@ -1,22 +1,43 @@
|
|
1 |
---
|
2 |
-
base_model:
|
|
|
|
|
|
|
|
|
|
|
3 |
tags:
|
4 |
-
-
|
5 |
-
-
|
6 |
-
|
7 |
-
- mistral
|
8 |
-
- trl
|
9 |
-
license: apache-2.0
|
10 |
-
language:
|
11 |
-
- en
|
12 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
|
14 |
-
|
|
|
|
|
|
|
|
|
15 |
|
16 |
-
- **Developed by:** LeroyDyer
|
17 |
-
- **License:** apache-2.0
|
18 |
-
- **Finetuned from model :** LeroyDyer/_Spydaz_Web_AI_Reasoner_BaseModel
|
19 |
|
20 |
-
This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
21 |
|
22 |
-
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
1 |
---
|
2 |
+
base_model:
|
3 |
+
- LeroyDyer/SpydazWeb_AI_HumanAGI_002
|
4 |
+
- LeroyDyer/LCARS_TOP_SCORE
|
5 |
+
- LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged
|
6 |
+
- LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged
|
7 |
+
library_name: transformers
|
8 |
tags:
|
9 |
+
- mergekit
|
10 |
+
- merge
|
11 |
+
|
|
|
|
|
|
|
|
|
|
|
12 |
---
|
13 |
+
# BASE MODEL - REASONER
|
14 |
+
|
15 |
+
The base model has been created as a new staarting point : It has been fully primed with various types of chains of thoughts and step by step solutions : enabling for reward training to take place . this model has been trained with various languges ( not intensivly ), enabling for cross languge understanding ;
|
16 |
+
Here we create a valid start point for agent based modelling , As we find that some training actually affects existing knowledge , hence agents become a thing ! or if you prefr, distillations ....
|
17 |
+
These agents can be medical , technical , roleplayers etc .
|
18 |
+
|
19 |
+
## Rewards and modelling reasoning capablitys
|
20 |
+
|
21 |
+
Modelling reasoning begins with mathmatics , here we focus where the mdel should have been inesivly pretrained but was not , SO we focus on basic mathmatical tasks , then programming , diagnosis etc :
|
22 |
+
This scheme can be used also with other tasks , such as planning providing structured outputs for the task being performed. as well explanationsif required :
|
23 |
+
|
24 |
+
Advance reasoning does not come from chain of thoughts !!! or distilation !!! ... It comes from the ability for the model to create a explanation for exisrting problems , and finding alturnative solutions , then optimising the best solutions whilst learning each route taken to get to the answer :
|
25 |
+
Previously it has been simulating a answer using patern recognition . or recall of a verbatum problem .. SO now we would like it to find the inner part of the task... Ie calculate .. this calccualtion process enables thinking !
|
26 |
+
We can also use it for emotive responses , and interview techniques . so it ill explain why it asked that particular question or gave that type of response , ie if it was empathic or had sentimental value etc , such as determoining the sentiment of the use and the intent and using this also as a reflective point on the response given and why could it have been different to acheive the same goals !
|
27 |
+
|
28 |
+
|
29 |
+
|
30 |
+
### Merge Method ( past Checkpoints and Pretraining)
|
31 |
+
|
32 |
+
This model was merged using the [Linear](https://arxiv.org/abs/2203.05482) merge method.
|
33 |
+
|
34 |
+
### Models Merged
|
35 |
|
36 |
+
The following models were included in the merge:
|
37 |
+
* [LeroyDyer/SpydazWeb_AI_HumanAGI_002](https://huggingface.co/LeroyDyer/SpydazWeb_AI_HumanAGI_002)
|
38 |
+
* [LeroyDyer/LCARS_TOP_SCORE](https://huggingface.co/LeroyDyer/LCARS_TOP_SCORE)
|
39 |
+
* [LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged](https://huggingface.co/LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged)
|
40 |
+
* [LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged](https://huggingface.co/LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged)
|
41 |
|
|
|
|
|
|
|
42 |
|
|
|
43 |
|
|