You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

nntoan_prexpert_80-20

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Linear merge method using /root/sn120/nntoan209_affine-2ef1e-5F23B as a base.

Models Merged

The following models were included in the merge:

  • /root/sn120/prexpert_affine-155-5Fj8b

Configuration

The following YAML configuration was used to produce this model:

# Linear merge: prexpert (80%) + nntoan209 (20%)
merge_method: linear
dtype: bfloat16

base_model: /root/sn120/nntoan209_affine-2ef1e-5F23B

parameters:
  normalize: true

models:
  - model: /root/sn120/prexpert_affine-155-5Fj8b
    parameters:
      weight: 0.10
  - model: /root/sn120/nntoan209_affine-2ef1e-5F23B
    parameters:
      weight: 0.90
Downloads last month
22
Safetensors
Model size
33B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for lenitokore/temp-croc-live-g-1