inflatebot
commited on
Commit
•
170cea8
1
Parent(s):
898050f
typo
Browse files
README.md
CHANGED
@@ -40,7 +40,7 @@ The base model for Mag Mell is [Mistral-Nemo-Base-2407-chatml](https://huggingfa
|
|
40 |
Early testing versions had a tendency to leak tokens, but this should be more or less hammered out. It recently (12-18-2024) came to attention that Cache Quantization may either cause or exacerbate this issue.
|
41 |
|
42 |
## Merge Details
|
43 |
-
Mag
|
44 |
Intended to be a general purpose "Best of Nemo" model for any fictional, creative use case.
|
45 |
|
46 |
6 models were chosen based on 3 categories; they were then paired up and merged via layer-weighted SLERP to create intermediate "specialists" which are then evaluated in their domain.
|
|
|
40 |
Early testing versions had a tendency to leak tokens, but this should be more or less hammered out. It recently (12-18-2024) came to attention that Cache Quantization may either cause or exacerbate this issue.
|
41 |
|
42 |
## Merge Details
|
43 |
+
Mag Mell is a multi-stage merge, Inspired by hyper-merges like [Tiefighter](https://huggingface.co/KoboldAI/LLaMA2-13B-Tiefighter) and [Umbral Mind.](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v2.0-8B)
|
44 |
Intended to be a general purpose "Best of Nemo" model for any fictional, creative use case.
|
45 |
|
46 |
6 models were chosen based on 3 categories; they were then paired up and merged via layer-weighted SLERP to create intermediate "specialists" which are then evaluated in their domain.
|