internvl3_5-8b-kor-eng-lora

OpenGVLab/InternVL3_5-8B ๋ฅผ ํ•œ๊ตญ์–ด ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐ์ดํ„ฐ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ InternVL3.5 specialist.

ํ•ญ๋ชฉ ๊ฐ’
Base model OpenGVLab/InternVL3_5-8B
Method LoRA r64 โ†’ merged
Domain ํ•œ+์˜ ์ฐจํŠธ/ํ‘œ/๋ฌธ์„œ/OCR

Hyperparameters

๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต โ€” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋กœ๊ทธ ๋ฏธ๋™๋ด‰.

Training Loss

ํ•™์Šต loss ๋กœ๊ทธ ๋ฏธ๋™๋ด‰ (๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต).

Training Data

๊ตฌ์„ฑ: 25๊ฐœ ์„œ๋ธŒ์…‹ (ํ•œ๊ตญ์–ด specialist SFT)

subset repeat
chartRqa1_30k 1.0
chartRqa2_20k 1.0
tableVqa_Reason_20k 1.0
tableVqa_Caption_20k 1.0
aihub_subjectTxt_OCR_20k 1.0
aihub_visual_OCR_15k 1.0
kisti_arxiv_OCR_15k 1.0
kisti_hanbat_Reason_30k 1.0
kisti_documen_Reason_10k 1.0
kisti_hanbat_Vqa_25k 1.0
aihub_subjectImg_Parse_10k 1.0
hf_latexUpdate_15k 1.0
kopub_vdr_OCR_40k 1.0
aihub_visual_ShortQA_30k 1.0
hf_korLlava_Caption_20k 1.0
llava_ko_recap_30k 1.0
out_kor_llava_20k 1.0
aihub_mathMultiple_kor 1.0
aihub_mathSubjective_kor 1.0
cauldron_en_replay 1.0
chartmoe_chartqa_unified_30k 1.0
chartmoe_chartgemma_163k 1.0
chartmoe_chart2table_200k 1.0
chartmoe_chart2code_100k 1.0
dvqa_train_200k 1.0

Usage

from transformers import AutoModel, AutoTokenizer
import torch
m = AutoModel.from_pretrained("yujuyeon/internvl3_5-8b-kor-eng-lora", torch_dtype=torch.bfloat16,
                              trust_remote_code=True).eval().cuda()
tok = AutoTokenizer.from_pretrained("yujuyeon/internvl3_5-8b-kor-eng-lora", trust_remote_code=True, use_fast=False)
Downloads last month
17
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for yujuyeon/internvl3_5-8b-kor-eng-lora