Cosmos3 Nano Reasoner BNB8 VLM

This is an unofficial, locally repacked derivative of nvidia/Cosmos3-Nano for the VLM/reasoner path only.

It keeps the understanding tower used by the local vLLM loader:

  • model.language_model.*
  • lm_head.*
  • model.visual.*

It removes the Cosmos3 generator-side tensors, including diffusion *_moe_gen, added cross-attention projections, video/audio/action projection modules, VAE/sound-tokenizer assets, and Diffusers pipeline metadata.

The tensors are stored with stock Qwen3-VL key names and the config advertises Qwen3VLForConditionalGeneration, so consumers should not need the local vllm_cosmos3 plugin just to load the VLM path.

Original model materials are governed by the NVIDIA Open Model Dataset and Weights License 1.1. Retain upstream notices and review the original model card before redistribution or deployment.

Downloads last month
-
Safetensors
Model size
9B params
Tensor type
F32
BF16
I8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for ThePyProgrammer/Cosmos3-Nano-reasoner-bnb8-vllm-und-only

Quantized
(9)
this model