Image-to-3D
3d
Real3D / README.md
hwjiang's picture
Update README.md
a992dbb verified
metadata
license: mit
tags:
  - 3d
pipeline_tag: image-to-3d

Real3D

Model Details:

Model Description:

We use the model architecture provided by TripoSR, which is a Transformer model for 2D-to-3D mapping built on LRM.

We scale it further on in-the-wild image collections by enabling unsupervised self-training and automatric data curation.

  • Developed by: Hanwen Jiang
  • License: MIT
  • Hardware: We train Real3D on 1 node (8GPU) with equivalent batch size of 80 for 5-6 days.

Model Sources:

Training Data: Real3D is jointly trained on synthetic data (Objaverse) and in-the-wild image collections. The former prevents training divergence, the latter introduces new knowldege from a broader distribution of real images. We use Objaverse renderings from Zero-1-to-3 and GObjaverse. The in the wild images are from ImageNet, OpenImages, etc.

Misuse, Malicious Use, and Out-of-Scope Use: The model should not be used to intentionally create or disseminate 3D models that people would foreseeably find disturbing, distressing, or offensive; or content that propagates historical or current stereotypes.