internvl3_5-8b-kth-fullft

OpenGVLab/InternVL3_5-8B-Pretrained ๋ฅผ ํ•œ๊ตญ์–ด ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ฐ์ดํ„ฐ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ InternVL3.5 specialist.

ํ•ญ๋ชฉ ๊ฐ’
Base model OpenGVLab/InternVL3_5-8B-Pretrained
Method Full FT
Domain ๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต โ€” ํ•™์Šต ๋กœ๊ทธ ๋ฏธ๋™๋ด‰. (Korean multimodal, stage1)

Hyperparameters

๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต โ€” ํ•˜์ดํผํŒŒ๋ผ๋ฏธํ„ฐ ๋กœ๊ทธ ๋ฏธ๋™๋ด‰.

Training Loss

ํ•™์Šต loss ๋กœ๊ทธ ๋ฏธ๋™๋ด‰ (๋ณ„๋„ ์„œ๋ฒ„ ํ•™์Šต).

Training Data

Korean multimodal SFT (stage1). ๋ฐ์ดํ„ฐ ๊ตฌ์„ฑ ๋น„๊ณต๊ฐœ.

Usage

from transformers import AutoModel, AutoTokenizer
import torch
m = AutoModel.from_pretrained("yujuyeon/internvl3_5-8b-kth-fullft", torch_dtype=torch.bfloat16,
                              trust_remote_code=True).eval().cuda()
tok = AutoTokenizer.from_pretrained("yujuyeon/internvl3_5-8b-kth-fullft", trust_remote_code=True, use_fast=False)
Downloads last month
24
Safetensors
Model size
11B params
Tensor type
F32
ยท
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for yujuyeon/internvl3_5-8b-kth-fullft

Finetuned
(5)
this model