inflatebot
/

MN-12B-Mag-Mell-R1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

inflatebot commited on 9 days ago

Commit

170cea8

•

1 Parent(s): 898050f

typo

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -40,7 +40,7 @@ The base model for Mag Mell is [Mistral-Nemo-Base-2407-chatml](https://huggingfa
 Early testing versions had a tendency to leak tokens, but this should be more or less hammered out. It recently (12-18-2024) came to attention that Cache Quantization may either cause or exacerbate this issue.
 ## Merge Details
-Mag Mel is a multi-stage merge, Inspired by hyper-merges like [Tiefighter](https://huggingface.co/KoboldAI/LLaMA2-13B-Tiefighter) and [Umbral Mind.](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v2.0-8B)
 Intended to be a general purpose "Best of Nemo" model for any fictional, creative use case.
 6 models were chosen based on 3 categories; they were then paired up and merged via layer-weighted SLERP to create intermediate "specialists" which are then evaluated in their domain.

 Early testing versions had a tendency to leak tokens, but this should be more or less hammered out. It recently (12-18-2024) came to attention that Cache Quantization may either cause or exacerbate this issue.
 ## Merge Details
+Mag Mell is a multi-stage merge, Inspired by hyper-merges like [Tiefighter](https://huggingface.co/KoboldAI/LLaMA2-13B-Tiefighter) and [Umbral Mind.](https://huggingface.co/Casual-Autopsy/L3-Umbral-Mind-RP-v2.0-8B)
 Intended to be a general purpose "Best of Nemo" model for any fictional, creative use case.
 6 models were chosen based on 3 categories; they were then paired up and merged via layer-weighted SLERP to create intermediate "specialists" which are then evaluated in their domain.