Wan 2.2 First Last Frame
Generate a video by interpolating between two images with a prompt
Generate podcast audio from a script and voice samples
Inpaint with Qwen Image Edit for super precise edits
High-fidelity 3D Geometry Generation from single view image
Generate a claymation-style avatar to do your podcast
Mood Palette Generator
A Lightweight and Plug-and-Play Identity Control for Video G
ChatGPT with real-time web search & URL reading capability
Generate any application with DeepSeek
Generate a video from an image with a text prompt
A new open-source dataset for training VLMs
Convert audio to text with context and language options
Generate a video by interpolating between two images with a prompt
Generate images by combining styles and subjects
Chatterbox TTS supporting 23 languages
Mood Palette Generator
Generate high-quality images from text prompts
High-fidelity 3D Geometry Generation from single view image
Embedding Leaderboard
Visualize embeddings in 3D space, powered by EmbeddingGemma
Real-time video captioning powered by FastVLM
Image-to-3D Generation
Fast 8-step inference of Qwen Image Edit
Nano Banana for Hugging Face PRO users
Generate web application code from descriptions
Generate a multi-speaker podcast from a script
Generate Gradio app code from user requests
Generate video from audio and image
Expressive Zero-shot TTS
Edit images based on user instructions
Inpaint with Qwen Image Edit for super precise edits
The ultimate guide to training LLMs on large GPU clusters