
UPD: this model series has been succeeded by EVA.
Made public again, to keep it around for historical reasons.
There's not much point in these merges: Celeste 70B 0.1 pretty much melded Celeste's and Magnum's datasets anyway.
To be continued, but on a different base, under a different name, and actually trained this time, without shortcuts.

MN-12B-Starcannon-v2

This is a merge of pre-trained language models created using mergekit. It turned out a bit more Magnum-esque, but it is still very creative, and the writing style is pretty nice, even if some slop words appear from time to time. It might be a good fit for people who want more variety than Magnum offers and more verbose prose than Celeste v1.9 produces.

Dynamic FP8
Static GGUF (by Mradermacher)
EXL2 (by kingbri of RoyalLab)

Merge Details

Merge Method

This model was merged with the TIES merge method, using nothingiisreal/MN-12B-Celeste-V1.9 as the base.
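
For readers unfamiliar with TIES, the idea can be sketched in a few lines of plain Python. This is a toy illustration on flat lists of floats, not mergekit's implementation: the function name `ties_merge`, the `(params, density, weight)` tuple layout, and the tie-break toward a positive sign are all assumptions made for the example. Each model's difference from the base is trimmed to its largest-magnitude entries (`density`), a sign is elected per parameter from the weighted sum, and only the values that agree with that sign are combined and added back to the base.

```python
def ties_merge(base, models, normalize=True):
    """Toy TIES merge over flat lists of floats (illustrative only).

    base:   list of base-model parameters
    models: list of (params, density, weight) tuples (hypothetical layout)
    """
    n = len(base)
    trimmed = []
    for params, density, weight in models:
        # Task vector: how far this model moved away from the base.
        delta = [p - b for p, b in zip(params, base)]
        # Trim: keep only the top `density` fraction by magnitude.
        k = int(density * n)
        order = sorted(range(n), key=lambda i: abs(delta[i]), reverse=True)
        keep = set(order[:k])
        trimmed.append(
            [weight * d if i in keep else 0.0 for i, d in enumerate(delta)]
        )

    merged = []
    for i in range(n):
        column = [t[i] for t in trimmed]
        # Elect a sign from the weighted sum (ties go to +, an assumption).
        sign = 1.0 if sum(column) >= 0 else -1.0
        # Disjoint merge: only values agreeing with the elected sign count.
        agree = [(v, m[2]) for v, m in zip(column, models) if v * sign > 0]
        value = sum(v for v, _ in agree)
        if normalize and agree:
            # Rescale by the total weight of the models that contributed.
            value /= sum(w for _, w in agree)
        merged.append(base[i] + value)
    return merged


if __name__ == "__main__":
    base = [0.0] * 10
    tuned = [1.0, -2.0, 0.1, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 3.0]
    # Second entry is the base itself, so its task vector is all zeros.
    print(ties_merge(base, [(tuned, 0.3, 0.5), (base, 0.7, 0.5)]))
```

In this sketch, a model identical to the base contributes an all-zero difference, so it never joins the sign vote. Whether mergekit handles that case the same way is not something the sketch establishes.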

Merge fodder

The following models were included in the merge:

intervitens/mini-magnum-12b-v1.1
nothingiisreal/MN-12B-Celeste-V1.9

Configuration

The following YAML configuration was used to produce this model:

models:
    - model: intervitens/mini-magnum-12b-v1.1
      parameters:
        density: 0.3
        weight: 0.5
    - model: nothingiisreal/MN-12B-Celeste-V1.9
      parameters:
        density: 0.7
        weight: 0.5

merge_method: ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
    normalize: true
    int8_mask: true
dtype: bfloat16
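
For anyone who wants to try reproducing the merge, here is a minimal sketch that writes the configuration above to a file and builds the matching `mergekit-yaml` command. It assumes mergekit is installed and provides the `mergekit-yaml <config> <output_dir>` entry point. The file name `starcannon-v2.yml`, the default output directory, and the helper name `prepare_merge` are hypothetical and chosen only for illustration. The script prints the command instead of running it.

```python
from pathlib import Path
import tempfile

# The merge configuration, copied verbatim from this card.
CONFIG = """\
models:
    - model: intervitens/mini-magnum-12b-v1.1
      parameters:
        density: 0.3
        weight: 0.5
    - model: nothingiisreal/MN-12B-Celeste-V1.9
      parameters:
        density: 0.7
        weight: 0.5

merge_method: ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
    normalize: true
    int8_mask: true
dtype: bfloat16
"""


def prepare_merge(workdir, output_dir="./MN-12B-Starcannon-v2"):
    """Write the config into workdir and return the command as an argv list.

    File name and output directory are hypothetical placeholders.
    """
    config_path = Path(workdir) / "starcannon-v2.yml"
    config_path.write_text(CONFIG, encoding="utf-8")
    return ["mergekit-yaml", str(config_path), str(output_dir)]


if __name__ == "__main__":
    # Only prints the command; pass the list to subprocess.run to execute it.
    print(" ".join(prepare_merge(tempfile.mkdtemp())))
```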

Model size: 12.2B params (Safetensors, BF16 tensors)

Model tree for yejingfu/MN-12B-Starcannon-v2