Edit model card

Llama-3.2-Kapusta-JapanChibi-3B-v1

やめてください、私は小さくて役に立ちます

I love this model, but I don't understand Japanese, although it is also good in other languages.

Kapusta-JapanChibi-Logo256.png

This is an interesting merge of 3 cool models, created using mergekit. Enjoy exploring :)

Merge Details

Method

This model was merged using the model_stock method.

Models

The following models were included in the merge:

Configuration

The following YAML configurations was used to produce this model:

# Llama-3.2-Kapusta-JapanChibi-3B-v1
models:
  - model: AELLM/Llama-3.2-Chibi-3B
  - model: AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE
merge_method: model_stock
base_model: Khetterman/Llama-3.2-Kapusta-3B-v8
dtype: bfloat16

My thanks to the authors of the original models, your work is incredible. Have a good time 🖤

Downloads last month
13
Safetensors
Model size
3.61B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Khetterman/Llama-3.2-Kapusta-JapanChibi-3B-v1