Sesame CSM
Conversational speech generation
Conversational speech generation
An Agentic Framework with Tools for Complex Reasoning
A leaderboard for LLMs powering smolagents
Image to Compositional 3D Scene Generation
Fast image relighting using Latent Bridge Matching
Enhance image quality by enlarging it without losing details
A demo for exploring and analyzing large-scale model repos
Generate edited images with prompts
Conversational speech generation
Fast image relighting using Latent Bridge Matching
Enhance image quality by enlarging it without losing details
Wan: Open and Advanced Large-Scale Video Generative Models
Generate virtual camera views from input images
Scalable and Versatile 3D Generation from images
Try on virtual garments on your uploaded images
Generate edited images with prompts
Convert images and text to document formats
Gemini 2.0 native image generation co-doodling
MultiImages-to-3D Generation
Send text and get detailed responses
Execute user-defined code
Text-to-3D and Image-to-3D Generation
Image to Compositional 3D Scene Generation
Embedding Leaderboard
The ultimate guide to training LLM on large GPU Clusters
Blazingly Fast and Embarrassingly Simple Song Generation
Generate animated videos from images and prompts
FLUX Multilingual Text-Driven Image Generation and Editing
Generate images from text prompts
Edit and enhance images with custom color and edge modifications
VGGT (CVPR 2025)
Unleashing a limitless torrent of ingenious ideas