RichardErkhov
/

Changgil_-_K2S3-SOLAR-11b-v4.0-4bits

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

Edit model card

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Quantization made by Richard Erkhov.

Request more models

K2S3-SOLAR-11b-v4.0 - bnb 4bits

Model creator: https://huggingface.co/Changgil/
Original model: https://huggingface.co/Changgil/K2S3-SOLAR-11b-v4.0/

Original model description:

license: cc-by-nc-4.0 language: - ko

Developed by :

K2S3

Model Number:

K2S3-SOLAR-11b-v4.0

Base Model :

upstage/SOLAR-10.7B-v1.0

Training Data

The training data for this model includes the Standard Korean Dictionary, training data from KULLM at Korea University, abstracts of master's and doctoral theses, Korean language samples from AI Hub, alpaca-gpt4-data, and samples from The OpenOrca Dataset.
이 모델의 훈련 데이터에는 표준국어대사전, 고려대학교 KULLM에서 제공한 훈련 데이터, 석사 및 박사학위 논문의 초록, AI Hub에서 제공한 한국어 데이터 샘플, alpaca-gpt4-data, 그리고 OpenOrca Dataset에서 제공한 샘플들이 포함됩니다.

Training Method

This model was fine-tuned on the "upstage/SOLAR-10.7B-v1.0" base model using a full parameter tuning method with SFT (Supervised Fine-Tuning).
이 모델은 "upstage/SOLAR-10.7B-v1.0" 기반 모델을 SFT를 사용하여 전체 파라미터 조정 방법으로 미세조정되었습니다.

Hardware

Hardware: Utilized two A100 (80G*2EA) GPUs for training.
Training Factors: This model was fine-tuned with SFT, using the HuggingFace SFTtrainer and applied fsdp.
이 모델은 SFT를 사용하여 HuggingFace SFTtrainer와 fsdp를 적용하여 미세조정되었습니다.

Downloads last month: 1

Safetensors

Model size

5.66B params

Tensor type

F32

·

FP16

·

U8

·

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.