Sesame CSM
Conversational speech generation
Conversational speech generation
An Agentic Framework with Tools for Complex Reasoning
A leaderboard for LLMs powering smolagents
Image to Compositional 3D Scene Generation
Fast image relighting using Latent Bridge Matching
Enhance image quality by enlarging it without losing details
A demo for exploring and analyzing large-scale model repos
Generate edited images with prompts
Conversational speech generation
Generate virtual camera views from input images
Gemini 2.0 native image generation co-doodling
Enhance image quality by enlarging it without losing details
Generate edited images with prompts
Fast image relighting using Latent Bridge Matching
Convert images and text to document formats
MultiImages-to-3D Generation
Scalable and Versatile 3D Generation from images
Try on virtual garments on your uploaded images
Text-to-3D and Image-to-3D Generation
Wan: Open and Advanced Large-Scale Video Generative Models
Execute environment-specified commands
FLUX Multilingual Text-Driven Image Generation and Editing
Embedding Leaderboard
Image to Compositional 3D Scene Generation
Generate images from text prompts
The ultimate guide to training LLM on large GPU Clusters
Transform flat-lay shots into on-model photos
Edit and enhance images with custom color and edge modifications
VGGT (CVPR 2025)
Send text and get detailed responses
Flexible Photo Recrafting While Preserving Your Identity
An Agentic Framework with Tools for Complex Reasoning