Changgil
/

K2S3-SOLAR-11b-v3.0

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

Edit model card

Developed by :

K2S3

Model Number:

K2S3-SOLAR-11b-v3.0

Base Model :

upstage/SOLAR-10.7B-v1.0

Training Data

The training data for this model includes the Standard Korean Dictionary, training data from KULLM at Korea University, abstracts of master's and doctoral theses, Korean language samples from AI Hub, alpaca-gpt4-data, and samples from The OpenOrca Dataset.
이 모델의 훈련 데이터에는 표준국어대사전, 고려대학교 KULLM에서 제공한 훈련 데이터, 석사 및 박사학위 논문의 초록, AI Hub에서 제공한 한국어 데이터 샘플, alpaca-gpt4-data, 그리고 OpenOrca Dataset에서 제공한 샘플들이 포함됩니다.

Training Method

This model was fine-tuned on the "upstage/SOLAR-10.7B-v1.0" base model using a full parameter tuning method with SFT (Supervised Fine-Tuning).
이 모델은 "upstage/SOLAR-10.7B-v1.0" 기반 모델을 SFT를 사용하여 전체 파라미터 조정 방법으로 미세조정되었습니다.

Hardware

Hardware: Utilized two A100 (80G*2EA) GPUs for training.
Training Factors: This model was fine-tuned with SFT, using the HuggingFace SFTtrainer and applied fsdp.
이 모델은 SFT를 사용하여 HuggingFace SFTtrainer와 fsdp를 적용하여 미세조정되었습니다.

Downloads last month: 4,020

Safetensors

Model size

10.7B params

Tensor type

FP16

·