Fourier-LLaVA-v1.5-7B-64

Official checkpoints for Fourier Compressor: Frequency-Domain Visual Token Compression for Vision-Language Models.

Model Details

Model Base Model Visual Tokens Compression Weights
Fourier-LLaVA-v1.5-7B-256 LLaVA-v1.5-7B 256 55.6% πŸ€— HF
Fourier-LLaVA-v1.5-7B-144 LLaVA-v1.5-7B 144 75.0% πŸ€— HF
Fourier-LLaVA-v1.5-7B-64 LLaVA-v1.5-7B 64 88.9% πŸ€— HF
Fourier-LLaVA-v1.5-7B-36 LLaVA-v1.5-7B 36 93.8% πŸ€— HF
Fourier-LLaVA-v1.5-13B-144 LLaVA-v1.5-13B 144 75.0% πŸ€— HF
Fourier-Qwen2-VL-2B-0.67 Qwen2-VL-2B-Instruct Dynamic 55.6% πŸ€— HF
Fourier-Qwen2.5-VL-3B-0.67 Qwen2.5-VL-3B-Instruct Dynamic 55.6% πŸ€— HF

Links

Downloads last month
16
Safetensors
Model size
7B params
Tensor type
F16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for whyisverysmart/Fourier-LLaVA-v1.5-7B-64

Finetuned
(32)
this model

Collection including whyisverysmart/Fourier-LLaVA-v1.5-7B-64

Paper for whyisverysmart/Fourier-LLaVA-v1.5-7B-64