DIRECT

This repository contains the model weights for Direct 3D-Aware Object Insertion via Decomposed Visual Proxies.

DIRECT performs pose-controllable object insertion by decomposing the insertion condition into visual proxies, including a reference object image, a geometry proxy rendered from a reconstructed 3D object, and a scene context image.

Project page: https://gong1130.github.io/DIRECT/

Code: https://github.com/Gong1130/DIRECT

Usage

Please refer to the official code repository for installation instructions and interactive demo usage.

Model Details

This repository contains DIRECT-specific weights only:

lora.safetensors
condition_embedder.safetensors
x_embedder.safetensors
time_text_embed.safetensors
pooled_image_projector.safetensors
image_projector.safetensors
config.json

The model requires the following external models:

black-forest-labs/FLUX.1-Fill-dev
google/siglip2-so400m-patch14-384
microsoft/TRELLIS-image-large

Downloads last month: 16

Model tree for superGong/DIRECT

Base model

black-forest-labs/FLUX.1-Fill-dev

Finetuned

(31)

this model