You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Qwen3-VL-8B-Instruct-Vision-R1-LoRA

该模型是基于 Qwen/Qwen3-VL-8B-Instruct 使用 LLaMA Factory 进行 QLoRA 微调得到的多模态视觉语言模型。

模型描述

  • 基础模型: Qwen3-VL-8B-Instruct
  • 微调方法: QLoRA (4-bit 量化 + LoRA)
  • 训练数据: vision_r1_mulberry_sft_full
  • LoRA 秩: 8
  • LoRA 目标模块: all
  • 参数量: ~8.7B

训练设置

参数
学习率 1.0e-4
Batch Size 1 (梯度累积: 8)
优化器 AdamW
学习率调度 Cosine
训练轮数 3 epochs
训练步数 375 steps
最终损失 0.638
训练时间 ~45 分钟 (单卡 24GB 3090)

使用方法

使用 LLaMA Factory

CUDA_VISIBLE_DEVICES=0 llamafactory-cli chat \
    --model_name_or_path Qwen/Qwen3-VL-8B-Instruct \
    --adapter_name_or_path lumos2548/Qwen3-VL-8B-Instruct-Vision-R1-LoRA \
    --template qwen3_vl_nothink \
    --finetuning_type lora

使用 Transformers

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-VL-8B-Instruct",
    device_map="auto"
)

model = PeftModel.from_pretrained(
    model,
    "lumos2548/Qwen3-VL-8B-Instruct-Vision-R1-LoRA"
)

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-VL-8B-Instruct")

框架版本

  • PEFT 0.18.1
  • Transformers 5.2.0
  • Pytorch 2.6.0+cu124
  • LLaMA Factory 0.13.0
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lumos2548/Qwen3-VL-8B-Instruct-Vision-R1-LoRA

Adapter
(125)
this model