Developed by:

  • K2S3

Model Number:

  • K2S3-SOLAR-11b-v3.0

Base Model:

  • upstage/SOLAR-10.7B-v1.0

Training Data

  • The training data for this model includes the Standard Korean Dictionary, training data from KULLM at Korea University, abstracts of master's and doctoral theses, Korean language samples from AI Hub, alpaca-gpt4-data, and samples from the OpenOrca dataset.

Training Method

  • This model was fine-tuned from the "upstage/SOLAR-10.7B-v1.0" base model with full-parameter supervised fine-tuning (SFT).
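
Full-parameter tuning means every weight carries a gradient and optimizer state, so training memory is several times the size of the weights alone. The sketch below is a rough accounting of that state, not a description of the actual training run. The parameter count is read off the base model's name, and both byte-per-parameter layouts are assumptions, since the card does not state the precision or optimizer used.

```python
# Rough memory accounting for full-parameter fine-tuning.
# Every number here is an illustrative assumption, not a value from the model card.

GB = 1e9  # decimal gigabytes


def training_state_gb(n_params: float, bytes_per_param: float) -> float:
    """Memory for weights, gradients, and optimizer state (activations excluded)."""
    return n_params * bytes_per_param / GB


def per_gpu_gb(total_gb: float, n_gpus: int) -> float:
    """Per-GPU share if the state is split evenly across GPUs."""
    return total_gb / n_gpus


# Assumed from the base model name "SOLAR-10.7B".
N_PARAMS = 10.7e9

# Assumed layout 1: mixed-precision Adam.
# 2 (16-bit weights) + 2 (16-bit grads) + 4 (fp32 master copy) + 4 + 4 (Adam moments)
MIXED_PRECISION_ADAM = 2 + 2 + 4 + 4 + 4

# Assumed layout 2: everything kept in 16-bit.
# 2 (weights) + 2 (grads) + 2 + 2 (Adam moments)
PURE_16BIT_ADAM = 2 + 2 + 2 + 2

if __name__ == "__main__":
    layouts = [
        ("mixed-precision Adam", MIXED_PRECISION_ADAM),
        ("pure 16-bit Adam", PURE_16BIT_ADAM),
    ]
    for label, bytes_per_param in layouts:
        total = training_state_gb(N_PARAMS, bytes_per_param)
        share = per_gpu_gb(total, 2)
        print(f"{label}: {total:.1f} GB total, {share:.1f} GB per GPU on 2 GPUs")
```

Under these assumed layouts, the estimate ranges from roughly 43 GB to 86 GB per GPU on two GPUs before activations are counted, so the precision and optimizer choices matter a great deal for whether a run fits.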

Hardware

  • Hardware: Two NVIDIA A100 80 GB GPUs were used for training.
  • Training Factors: The model was fine-tuned with SFT using the Hugging Face SFTTrainer, with FSDP (Fully Sharded Data Parallel) applied.
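
As a hedged illustration of how FSDP is commonly switched on for a Hugging Face trainer, the sketch below collects keyword arguments for `transformers.TrainingArguments`. The `fsdp` and `fsdp_config` options exist in that class, but every value shown (output directory, batch size, accumulation steps, and the wrapped layer class) is a hypothetical choice, not the configuration used for this model. `LlamaDecoderLayer` is assumed because SOLAR checkpoints use the Llama architecture.

```python
# Hypothetical keyword arguments for transformers.TrainingArguments.
# None of these values come from the model card; they only show the shape
# of an FSDP-enabled configuration.
FSDP_TRAINING_KWARGS = {
    "output_dir": "./sft-output",          # hypothetical path
    "per_device_train_batch_size": 1,      # assumed value
    "gradient_accumulation_steps": 8,      # assumed value
    # Shard parameters, gradients, and optimizer state across GPUs,
    # wrapping the model automatically layer by layer.
    "fsdp": "full_shard auto_wrap",
    "fsdp_config": {
        # Assumed decoder block class for a Llama-architecture model.
        "transformer_layer_cls_to_wrap": ["LlamaDecoderLayer"],
    },
}


def build_training_arguments(**overrides):
    """Build TrainingArguments from the sketch above, with optional overrides."""
    # Imported lazily so the dictionary can be inspected without transformers installed.
    from transformers import TrainingArguments

    return TrainingArguments(**{**FSDP_TRAINING_KWARGS, **overrides})
```

A script using these arguments would typically be launched with one process per GPU, for example with `torchrun --nproc_per_node 2 train.py`, where `train.py` is a hypothetical script name.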

Model Size

  • 11B params, F16 tensors (Safetensors)