DIRECT

This repository contains the model weights for Direct 3D-Aware Object Insertion via Decomposed Visual Proxies.

DIRECT performs pose-controllable object insertion by decomposing the insertion condition into visual proxies, including a reference object image, a geometry proxy rendered from a reconstructed 3D object, and a scene context image.

Project page: https://gong1130.github.io/DIRECT/

Code: https://github.com/Gong1130/DIRECT

Usage

Please refer to the official code repository for installation instructions and interactive demo usage.

Model Details

This repository contains only the DIRECT-specific weights and their configuration file:

  • lora.safetensors
  • condition_embedder.safetensors
  • x_embedder.safetensors
  • time_text_embed.safetensors
  • pooled_image_projector.safetensors
  • image_projector.safetensors
  • config.json
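
As a quick sanity check after downloading, the file list above can be compared against a local folder. The following is a minimal sketch, not part of the official code: the helper name `missing_weight_files` and the assumption that all files sit flat in one directory are illustrative only.

```python
from pathlib import Path
from typing import List

# File names copied from the list above.
EXPECTED_FILES = [
    "lora.safetensors",
    "condition_embedder.safetensors",
    "x_embedder.safetensors",
    "time_text_embed.safetensors",
    "pooled_image_projector.safetensors",
    "image_projector.safetensors",
    "config.json",
]


def missing_weight_files(weights_dir: str) -> List[str]:
    """Return the expected file names that are absent from weights_dir.

    Hypothetical helper: assumes the files are stored directly in
    weights_dir, with no subfolders.
    """
    root = Path(weights_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

Calling `missing_weight_files("path/to/DIRECT")` returns an empty list when every file is present; otherwise it names the files that still need to be downloaded.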

The model requires the following external models:

  • black-forest-labs/FLUX.1-Fill-dev
  • google/siglip2-so400m-patch14-384
  • microsoft/TRELLIS-image-large
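
One possible way to pre-fetch these models together with the DIRECT weights is `snapshot_download` from the `huggingface_hub` package. The sketch below rests on several assumptions: the repository ID `superGong/DIRECT` is taken from the model hosting page, the helper name `fetch_all` is hypothetical, and the official code may load or locate these models differently. Some of the external repositories may also require accepting license terms and logging in before they can be downloaded.

```python
from typing import Callable, Dict, Optional

# Assumption: repository ID as shown on the model hosting page.
DIRECT_REPO = "superGong/DIRECT"

# External models listed above.
EXTERNAL_REPOS = [
    "black-forest-labs/FLUX.1-Fill-dev",
    "google/siglip2-so400m-patch14-384",
    "microsoft/TRELLIS-image-large",
]


def fetch_all(download: Optional[Callable[[str], str]] = None) -> Dict[str, str]:
    """Download every required repository and map repo ID to local path.

    `download` is any callable that takes a repo ID and returns a local
    path. When omitted, huggingface_hub.snapshot_download is used, which
    stores files in the default Hugging Face cache.
    """
    if download is None:
        # Imported lazily so the module loads without huggingface_hub.
        from huggingface_hub import snapshot_download

        download = snapshot_download
    return {repo: download(repo) for repo in [DIRECT_REPO, *EXTERNAL_REPOS]}
```

Running `fetch_all()` with no arguments performs the actual downloads; passing a custom callable makes it possible to dry-run the list or redirect storage without changing the function.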