internvl3_5-8b-korean-fullft

OpenGVLab/InternVL3_5-8B-Instruct ๋ฅผ ํ•œ๊ตญ์–ด ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐ์ดํ„ฐ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ InternVL3.5 specialist.

ํ•ญ๋ชฉ ๊ฐ’
Base model OpenGVLab/InternVL3_5-8B-Instruct
Method Full FT
Domain ํ•œ๊ตญ์–ด ์ข…ํ•ฉ

Hyperparameters

๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต โ€” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋กœ๊ทธ ๋ฏธ๋™๋ด‰.

Training Loss

ํ•™์Šต loss ๋กœ๊ทธ ๋ฏธ๋™๋ด‰ (๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต).

Training Data

๊ตฌ์„ฑ: 18๊ฐœ ์„œ๋ธŒ์…‹ (ํ•œ๊ตญ์–ด specialist SFT)

subset repeat
aihub_visual_ShortQA_30k 1
hf_korLlava_Caption_20k 1
llava_ko_recap_30k 1
out_kor_llava_20k 1
chartRqa1_30k 1
chartRqa2_20k 1
tableVqa_Reason_20k 1
tableVqa_Caption_20k 1
aihub_subjectTxt_OCR_20k 1
aihub_visual_OCR_15k 1
kisti_arxiv_OCR_15k 1
kisti_hanbat_Reason_30k 1
kisti_documen_Reason_10k 1
aihub_mathMultiple_kor_M0 1
aihub_mathSubjective_kor_M0 1
kisti_hanbat_Vqa_25k 1
hf_latexUpdate_15k 1
aihub_subjectImg_Parse_10k 1

Usage

from transformers import AutoModel, AutoTokenizer
import torch
m = AutoModel.from_pretrained("yujuyeon/internvl3_5-8b-korean-fullft", torch_dtype=torch.bfloat16,
                              trust_remote_code=True).eval().cuda()
tok = AutoTokenizer.from_pretrained("yujuyeon/internvl3_5-8b-korean-fullft", trust_remote_code=True, use_fast=False)
Downloads last month
-
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for yujuyeon/internvl3_5-8b-korean-fullft

Finetuned
(3)
this model