You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

OralGPT-CMF-v1

OralGPT-CMF-v1 is a Qwen3-VL-4B-Instruct based multimodal model fine-tuned on the OralGPT/CMF dataset for oral and maxillofacial case understanding.

Base model

  • Base: Qwen/Qwen3-VL-4B-Instruct
  • Fine-tuning framework: LLaMA-Factory
  • Fine-tuning method: LoRA SFT, merged into full model weights for this release
  • Template: qwen3_vl_nothink
  • Vision tower during training: frozen
  • Training epochs: 3
  • cutoff_len: 16384
  • image_max_pixels: 131072

Usage note

This repository contains the merged full model weights, not only the LoRA adapter.

The model is intended for research and testing. It should not be used as a standalone clinical decision-making system.

Dataset

  • OralGPT/CMF
Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for OralGPT/OralGPT-CMF-v1

Finetuned
(307)
this model