An open-world instance segmentation model
A general and high-performance DETR-like model
Generate customized images using text and multiple images
Open-Domain 4D Avatarization
Generate images from text and image descriptions
Enhance image captions with detailed descriptions
FlexTok flexible sequence length autoencoding demo
New Ghibli EasyControl model is now released!!
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Convert images of humans to biomechanically accurate 3D skeletons
Generate 3D texture from image
Generate 3D texture from texts
Generate 3D models from images
A Chain-of-LoRA Agent for Long Video Reasoning
Generate any application with DeepSeek
High-fidelity 3D Geometry Generation from images
Demo for CFG-Zero*
Generate animated portraits from images and audio
Audio-driven Talking Portrait