Edit model card

4season/final_model_test_v2

Introduction

Supervised fine-tuning refers to a machine learning technique where a pre-trained model is further trained on a specific task or dataset with labeled examples (supervised learning). The process involves taking a model that has been pre-trained on a large general dataset and then adapting it to a more focused task by continuing the training using task-specific data.

This model is test version, alignment-tuned model.

We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).

Downloads last month
7
Safetensors
Model size
21.4B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for 4season/final_model_test_v2

Quantizations
1 model