Llama-3.1-8B-Vision

A vision-enhanced version of the Llama-3.1-8B language model, capable of understanding and describing images while maintaining the base model's language capabilities.

Model Details

  • Base Model: Llama-3.1-8B
  • Model Type: Vision-Language Model
  • Last Updated: December ?, 2024
  • Model Architecture: Llama architecture with SigLIP vision encoder
  • Weights Format: Safetensors
  • Model Size: 8.03B params
  • Tensor Type: BF16
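
As a rough guide to hardware requirements, the weight footprint can be estimated from the figures listed here: 8.03B parameters stored as BF16, which uses 2 bytes per parameter. The sketch below is only that arithmetic. The helper name `weight_memory_gib` is hypothetical, and the estimate covers the weights alone, not activations, the KV cache, or framework overhead.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Estimate the memory needed to hold model weights, in GiB.

    bytes_per_param defaults to 2 because BF16 stores each value in 16 bits.
    """
    return num_params * bytes_per_param / 1024**3


if __name__ == "__main__":
    # 8.03B parameters is the size reported for this model.
    params = 8.03e9
    print(f"Approximate BF16 weight memory: {weight_memory_gib(params):.1f} GiB")
```

By this estimate the weights alone take roughly 15 GiB, so actual memory use during inference will be higher.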
