Ideogram 4
Ideogram 4 state of the art open weights
Generate 3D models from a single image
Chat with a multimodal AI using text and images, audio, or video
8B CSM-style text-to-speech with voice cloning
Generate natural speech from your written text
Generate speech from text with optional voice cloning
Play notes to steer real-time music (PyTorch)
PaddleOCR-VL-1.6_Online_Demo
Generate 3D models from a single image
generate a video from an image with a text prompt
Detect and label objects in images and videos
State-of-the-art image generation, in your browser.
Demo of the Collection of Qwen Image Edit LoRAs
Image edit, text to image, image upscale, remove watermark
Generate vivid images from text prompts in seconds
text to video, image to video, video extend
FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
VoxCPM2 Nano-vLLM Demo
High-quality voice cloning TTS for 600+ languages
generate a video from an image with a text prompt
Audio-driven talking-head video generation (Meituan LongCat)
Generate videos from text, images, audio, or video clips
Generate full HTML web apps from text prompts
Dense video captions and timestamp search
High-fidelity 3D Generation from images
NVIDIA Cosmos3-Nano: text/image to video + audio
Run Bonsai-Image-4B models on GPU
10eros ltx 2.3 image-to-video with native audio
Chat with an AI assistant using the LFM2.5 model
generate a video from an image with a text prompt
Pixel Diffusion Decoder