LeroyDyer commited on
Commit
21a46e6
·
verified ·
1 Parent(s): a1bf072

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -15
README.md CHANGED
@@ -1,22 +1,43 @@
1
  ---
2
- base_model: LeroyDyer/_Spydaz_Web_AI_Reasoner_BaseModel
 
 
 
 
 
3
  tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
- - mistral
8
- - trl
9
- license: apache-2.0
10
- language:
11
- - en
12
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
- # Uploaded model - Basic allignment - Bench Testing
 
 
 
 
15
 
16
- - **Developed by:** LeroyDyer
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** LeroyDyer/_Spydaz_Web_AI_Reasoner_BaseModel
19
 
20
- This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
1
  ---
2
+ base_model:
3
+ - LeroyDyer/SpydazWeb_AI_HumanAGI_002
4
+ - LeroyDyer/LCARS_TOP_SCORE
5
+ - LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged
6
+ - LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged
7
+ library_name: transformers
8
  tags:
9
+ - mergekit
10
+ - merge
11
+
 
 
 
 
 
12
  ---
13
+ # BASE MODEL - REASONER
14
+
15
+ The base model has been created as a new staarting point : It has been fully primed with various types of chains of thoughts and step by step solutions : enabling for reward training to take place . this model has been trained with various languges ( not intensivly ), enabling for cross languge understanding ;
16
+ Here we create a valid start point for agent based modelling , As we find that some training actually affects existing knowledge , hence agents become a thing ! or if you prefr, distillations ....
17
+ These agents can be medical , technical , roleplayers etc .
18
+
19
+ ## Rewards and modelling reasoning capablitys
20
+
21
+ Modelling reasoning begins with mathmatics , here we focus where the mdel should have been inesivly pretrained but was not , SO we focus on basic mathmatical tasks , then programming , diagnosis etc :
22
+ This scheme can be used also with other tasks , such as planning providing structured outputs for the task being performed. as well explanationsif required :
23
+
24
+ Advance reasoning does not come from chain of thoughts !!! or distilation !!! ... It comes from the ability for the model to create a explanation for exisrting problems , and finding alturnative solutions , then optimising the best solutions whilst learning each route taken to get to the answer :
25
+ Previously it has been simulating a answer using patern recognition . or recall of a verbatum problem .. SO now we would like it to find the inner part of the task... Ie calculate .. this calccualtion process enables thinking !
26
+ We can also use it for emotive responses , and interview techniques . so it ill explain why it asked that particular question or gave that type of response , ie if it was empathic or had sentimental value etc , such as determoining the sentiment of the use and the intent and using this also as a reflective point on the response given and why could it have been different to acheive the same goals !
27
+
28
+
29
+
30
+ ### Merge Method ( past Checkpoints and Pretraining)
31
+
32
+ This model was merged using the [Linear](https://arxiv.org/abs/2203.05482) merge method.
33
+
34
+ ### Models Merged
35
 
36
+ The following models were included in the merge:
37
+ * [LeroyDyer/SpydazWeb_AI_HumanAGI_002](https://huggingface.co/LeroyDyer/SpydazWeb_AI_HumanAGI_002)
38
+ * [LeroyDyer/LCARS_TOP_SCORE](https://huggingface.co/LeroyDyer/LCARS_TOP_SCORE)
39
+ * [LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged](https://huggingface.co/LeroyDyer/_Spydaz_Web_AI_CheckPointsMerged)
40
+ * [LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged](https://huggingface.co/LeroyDyer/_Spydaz_Web_AI_ReasoningCheckPointsMerged)
41
 
 
 
 
42
 
 
43