omni-reasoner

omni-reasoner is a multimodal instruction model for image/video-text reasoning.

Usage

import torch
from transformers import AutoProcessor, Qwen3VLForConditionalGeneration

model_id = "YOUR_ORG/omni-reasoner"

processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = Qwen3VLForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
).eval()

Inference

Use deterministic decoding for evaluation.

outputs = model.generate(**inputs, max_new_tokens=2048, do_sample=False)
Downloads last month
2
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support