Lambent commited on
Commit
31a24cf
·
verified ·
1 Parent(s): b2e640f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -2
README.md CHANGED
@@ -11,12 +11,18 @@ model-index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
 
 
14
  This version has been tuned from the fascinating arcee-ai/SuperNova-Medius as root model.
15
 
16
- (Methodology: A bit of custom fine-tuning, with the plurality from the 'filtered' subset of argilla/ifeval-like-data experimentally trained
 
 
 
 
17
  with 'input/output' roles rather than 'user/assistant'
18
  (other instruction sampling stayed chatml-style, some continued pretraining added with a bias to older public domain styles);
19
- ties merged at full saturation with the original over base Qwen, then this DPO.)
20
 
21
  [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
22
  <details><summary>See axolotl config</summary>
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ <img src="https://cdn.midjourney.com/2cf1309c-bcde-41e1-bd58-957feccb3ed8/0_1.jpeg"></img>
15
+
16
  This version has been tuned from the fascinating arcee-ai/SuperNova-Medius as root model.
17
 
18
+ Censorship remains notable on this one, just including the Not For All Audiences tag due to dataset.
19
+
20
+ EQ-Bench is about 1 point lower than its ancestor, but fixed a syntax issue. May indicate a bit of expected intelligence loss.
21
+
22
+ Methodology: A bit of custom fine-tuning, with the plurality from the 'filtered' subset of argilla/ifeval-like-data experimentally trained
23
  with 'input/output' roles rather than 'user/assistant'
24
  (other instruction sampling stayed chatml-style, some continued pretraining added with a bias to older public domain styles);
25
+ ties merged at full saturation with the original over base Qwen, then this DPO.
26
 
27
  [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
28
  <details><summary>See axolotl config</summary>