Upload folder using huggingface_hub
Browse files- README.md +59 -3
- ckpts/.gitkeep +1 -0
- ckpts/iscene_denoiser.pt +3 -0
- ckpts/iscene_image_conditioner.pt +3 -0
- config.yml +34 -0
- iscene_config.json +7 -0
README.md
CHANGED
|
@@ -1,3 +1,59 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
--
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
base_model: microsoft/TRELLIS-image-large
|
| 4 |
+
pipeline_tag: image-to-3d
|
| 5 |
+
tags:
|
| 6 |
+
- image-to-3d
|
| 7 |
+
- 3d-generation
|
| 8 |
+
- scene-generation
|
| 9 |
+
- trellis
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# IScene-v1
|
| 13 |
+
|
| 14 |
+
IScene-v1 is the first public-release checkpoint package for IScene inference.
|
| 15 |
+
|
| 16 |
+
This repository is intended to be used with the IScene code release. It contains the IScene-specific checkpoint files and minimal configuration needed to run single-image, segmentation-conditioned 3D scene generation.
|
| 17 |
+
|
| 18 |
+
## Contents
|
| 19 |
+
|
| 20 |
+
- `iscene_config.json`: public release metadata and checkpoint layout.
|
| 21 |
+
- `config.yml`: inference-only architecture configuration.
|
| 22 |
+
- `ckpts/iscene_denoiser.pt`: IScene denoiser checkpoint.
|
| 23 |
+
- `ckpts/iscene_image_conditioner.pt`: IScene image-conditioner checkpoint.
|
| 24 |
+
|
| 25 |
+
## Usage
|
| 26 |
+
|
| 27 |
+
The public IScene code should load this package with:
|
| 28 |
+
|
| 29 |
+
```python
|
| 30 |
+
from iscene.inference.inferencer import ISceneInferencer
|
| 31 |
+
|
| 32 |
+
inferencer = ISceneInferencer.from_pretrained("LuLing/IScene")
|
| 33 |
+
```
|
| 34 |
+
|
| 35 |
+
For local testing before uploading to Hugging Face:
|
| 36 |
+
|
| 37 |
+
```python
|
| 38 |
+
inferencer = ISceneInferencer.from_pretrained("release_hf/IScene-v1")
|
| 39 |
+
```
|
| 40 |
+
|
| 41 |
+
## Notes
|
| 42 |
+
|
| 43 |
+
- This package contains only the IScene release checkpoint files, not historical training logs or experimental checkpoints.
|
| 44 |
+
- The IScene code is expected to load the TRELLIS base model from `microsoft/TRELLIS-image-large` or an equivalent local mirror.
|
| 45 |
+
|
| 46 |
+
## Attribution
|
| 47 |
+
|
| 48 |
+
IScene-v1 builds on the TRELLIS image-conditioned 3D generation backbone. The public loader uses TRELLIS base components from `microsoft/TRELLIS-image-large`.
|
| 49 |
+
|
| 50 |
+
Please also cite and respect the license terms of TRELLIS:
|
| 51 |
+
|
| 52 |
+
- Project: https://trellis3d.github.io/
|
| 53 |
+
- Code: https://github.com/microsoft/TRELLIS
|
| 54 |
+
- Model: https://huggingface.co/microsoft/TRELLIS-image-large
|
| 55 |
+
- Paper: Structured 3D Latents for Scalable and Versatile 3D Generation
|
| 56 |
+
|
| 57 |
+
## License
|
| 58 |
+
|
| 59 |
+
This model package is prepared for release under the MIT License. Third-party TRELLIS attribution is included above and should be preserved.
|
ckpts/.gitkeep
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
|
ckpts/iscene_denoiser.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8e4642c719615ac035e23b976220b26af22992d3744c361624a17dbdc10b96c9
|
| 3 |
+
size 2239036066
|
ckpts/iscene_image_conditioner.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2a5d6d5d527edd6c5e95ecbc28e21ffb9bfdec08a725eaf52e5df8141c7a35dd
|
| 3 |
+
size 1217639882
|
config.yml
ADDED
|
@@ -0,0 +1,34 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Inference-only architecture config for IScene-v1.
|
| 2 |
+
# This file intentionally excludes training logs, data paths, cluster settings,
|
| 3 |
+
# experiment names, and checkpoint metadata.
|
| 4 |
+
|
| 5 |
+
models:
|
| 6 |
+
denoiser:
|
| 7 |
+
name: SparseStructureSceneContextFlowModel
|
| 8 |
+
args:
|
| 9 |
+
resolution: 16
|
| 10 |
+
in_channels: 8
|
| 11 |
+
out_channels: 8
|
| 12 |
+
model_channels: 1024
|
| 13 |
+
cond_channels: 1024
|
| 14 |
+
num_blocks: 24
|
| 15 |
+
num_heads: 16
|
| 16 |
+
mlp_ratio: 4
|
| 17 |
+
patch_size: 1
|
| 18 |
+
pe_mode: ape
|
| 19 |
+
qk_rms_norm: true
|
| 20 |
+
use_fp16: true
|
| 21 |
+
scene_context_attn_num: 5
|
| 22 |
+
learning_pattern: full-finetune
|
| 23 |
+
exp_setting: global
|
| 24 |
+
|
| 25 |
+
img_conditioner:
|
| 26 |
+
name: ImageConditioner
|
| 27 |
+
args:
|
| 28 |
+
image_cond_model: dinov2_vitl14_reg
|
| 29 |
+
cond_in_channels: 3
|
| 30 |
+
use_fp16: false
|
| 31 |
+
|
| 32 |
+
dataset:
|
| 33 |
+
args:
|
| 34 |
+
exp_setting: global
|
iscene_config.json
ADDED
|
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_name": "IScene-v1",
|
| 3 |
+
"base_model_id": "microsoft/TRELLIS-image-large",
|
| 4 |
+
"config_file": "config.yml",
|
| 5 |
+
"denoiser_checkpoint": "ckpts/iscene_denoiser.pt",
|
| 6 |
+
"image_conditioner_checkpoint": "ckpts/iscene_image_conditioner.pt"
|
| 7 |
+
}
|