Natkituwu
/

Kunokukulemonchini-7b-6.5bpw-exl2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Kunokukulemonchini-7b-6.5bpw-exl2

This is an 6.5 bpw exl2 quant of a merger icefog72/Kunokukulemonchini-7b.

With a 4060 8GB. i am able to run this at 16K context at 8bit cache. and 24K context with 4bit cache.

Merge Details

Slightly edited kukulemon-7B config.json before merge to get at least ~32k context window.

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:


slices:
  - sources:
      - model: grimjim/kukulemon-7B
        layer_range: [0, 32]
      - model: Nitral-AI/Kunocchini-7b-128k-test
        layer_range: [0, 32]
merge_method: slerp
base_model: Nitral-AI/Kunocchini-7b-128k-test
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5
dtype: float16

Downloads last month: 11

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

Model tree for Natkituwu/Kunokukulemonchini-7b-6.5bpw-exl2

Nitral-AI/Kunocchini-7b-128k-test

grimjim/kukulemon-7B

Merge model

this model