TripoSG - High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.

Model Description

Model Architecture

TripoSG utilizes a novel architecture combining:

  • Rectified Flow (RF) based Transformer for stable, linear trajectory modeling
  • Advanced VAE with SDF-based representation and hybrid geometric supervision
  • Cross-attention mechanism for image feature condition
  • 1.5B parameters operating on 2048 latent tokens

Intended Uses

This model is designed for:

  • Converting single images to high-quality 3D meshes
  • Creative and design applications
  • Gaming and VFX asset creation
  • Prototyping and visualization

Requirements

  • CUDA-capable GPU (>8GB VRAM)

Usage

For detailed usage instructions, please visit our GitHub repository.

About

TripoSG is developed by Tripo, VAST AI Research, pushing the boundaries of 3D Generative AI. For more information:

Downloads last month
2,424
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ 1 Ask for provider support

Spaces using VAST-AI/TripoSG 3