Spaces

·

The AI App Directory

New Space What is Spaces?

Qwen2.5 VL 72B Instruct

Interact with Qwen2.5-VL-Chat model using text and files

Running on CPU Upgrade

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Running on Zero

Chat with DeepSeek-VL2-small

Generate responses using images and text input

Running on Zero

Qwen2 VL Localization

Detect objects in images and get bounding boxes

Running on Zero

Qwen2.5 VL Instruct Demo

Space for Qwen2.5-VL-3B and 7B image + text demo.

Running on Zero

VLM R1 Referral Expression

Highlight described objects in images

Qwen2-VL-72B

Engage in multi-modal conversations with images and videos

Running on Zero

Vlm Comparer

Compare any two VLMs, side-by-side.

Running on Zero

Grpo Vlm Decoder

A VLM-based message decoder that is trained via GRPO

Qwen-VL-Plus

Chat with images and text using Qwen-VL-Plus

Qwen-VL-Max

Interact with images and texts using Qwen-VL-Max

Gradio Lite

Convert images to grayscale

Running on Zero

Lumina Next T2I

Generate high-resolution images from text prompts

Running on Zero

TraVisionLM - Turkish Visual Language Model

Analyze images and answer questions about them

Running on Zero

Qwen2 7B VL Demo

Generate responses from text and images

Running on Zero

Qwen2-VL-2B

Generate text from images and videos

Running on Zero

Qwen2-VL-7B

Generate text by combining an image and a question

Running on Zero

Qwen2-VL-7B

Generate text based on an image or video

Vlms

Analyze images and describe their contents using AI models

Audio SR

Fixed fork of the original audio sr!

Ertugrul Qwen2 VL 7B Captioner Relaxed

Generate captions for images

OCR Using Qwen2 VL

Qwen2-VL is a vision-language model that performs OCR

Running on Zero

DeepSeek VL 1.3B Chat

Describe an image based on a question

Running on Zero

Florence Llama

Generate text responses based on images and input text