Whisper
Transcribe audio files or YouTube videos into text
Transcribe audio files or YouTube videos into text
Generate video from audio and image
Create quantized models from Hugging Face
Browse apps made with DeepSite
Kontext image editing on FLUX[dev]
Long-form multi-speaker dialogue generation
Frontier Japanese Speech synthesize Network
Upgraded to v1.0!
Generate images from text prompts
Convert images to 3D models with depth and normal maps
Uncensored General Intelligence Leaderboard
Generate images from text prompts
Generate captions for images in various styles
Generate images from text prompts
Chat with Xiaomi MiMo-Audio using voice
Easily expand image boundaries
Text-to-3D and Image-to-3D Generation
Generate a video from an image with a prompt
Next-Gen High-Resolution 3D Model Generation
Generate high-quality videos from text prompts and images
image generation
Chatterbox TTS supporting 23 languages
Image and video tasks with moondream3.
Use NVIDIA H100 GPU