LuLing committed on
Commit 04794eb · verified · 1 Parent(s): e154556

Upload folder using huggingface_hub

README.md CHANGED
@@ -1,3 +1,59 @@
- ---
- license: mit
- ---
+ ---
+ license: mit
+ base_model: microsoft/TRELLIS-image-large
+ pipeline_tag: image-to-3d
+ tags:
+ - image-to-3d
+ - 3d-generation
+ - scene-generation
+ - trellis
+ ---
+
+ # IScene-v1
+
+ IScene-v1 is the first public-release checkpoint package for IScene inference.
+
+ This repository is intended to be used with the IScene code release. It contains the IScene-specific checkpoint files and minimal configuration needed to run single-image, segmentation-conditioned 3D scene generation.
+
+ ## Contents
+
+ - `iscene_config.json`: public release metadata and checkpoint layout.
+ - `config.yml`: inference-only architecture configuration.
+ - `ckpts/iscene_denoiser.pt`: IScene denoiser checkpoint.
+ - `ckpts/iscene_image_conditioner.pt`: IScene image-conditioner checkpoint.
+
+ ## Usage
+
+ The public IScene code should load this package with:
+
+ ```python
+ from iscene.inference.inferencer import ISceneInferencer
+
+ inferencer = ISceneInferencer.from_pretrained("LuLing/IScene")
+ ```
+
+ For local testing before uploading to Hugging Face:
+
+ ```python
+ inferencer = ISceneInferencer.from_pretrained("release_hf/IScene-v1")
+ ```
+
+ ## Notes
+
+ - This package contains only the IScene release checkpoint files, not historical training logs or experimental checkpoints.
+ - The IScene code is expected to load the TRELLIS base model from `microsoft/TRELLIS-image-large` or an equivalent local mirror.
+
+ ## Attribution
+
+ IScene-v1 builds on the TRELLIS image-conditioned 3D generation backbone. The public loader uses TRELLIS base components from `microsoft/TRELLIS-image-large`.
+
+ Please also cite and respect the license terms of TRELLIS:
+
+ - Project: https://trellis3d.github.io/
+ - Code: https://github.com/microsoft/TRELLIS
+ - Model: https://huggingface.co/microsoft/TRELLIS-image-large
+ - Paper: Structured 3D Latents for Scalable and Versatile 3D Generation
+
+ ## License
+
+ This model package is prepared for release under the MIT License. Third-party TRELLIS attribution is included above and should be preserved.
ckpts/.gitkeep ADDED
@@ -0,0 +1 @@
+
ckpts/iscene_denoiser.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8e4642c719615ac035e23b976220b26af22992d3744c361624a17dbdc10b96c9
+ size 2239036066
ckpts/iscene_image_conditioner.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2a5d6d5d527edd6c5e95ecbc28e21ffb9bfdec08a725eaf52e5df8141c7a35dd
+ size 1217639882
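
The two checkpoint entries above are Git LFS pointer files: each records a SHA-256 digest and a byte size rather than the weights themselves. A minimal sketch of how a downloaded checkpoint could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of the IScene code release; only the pointer text is taken from this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for ckpts/iscene_denoiser.pt, copied from the diff above.
DENOISER_POINTER = parse_lfs_pointer(
    """\
version https://git-lfs.github.com/spec/v1
oid sha256:8e4642c719615ac035e23b976220b26af22992d3744c361624a17dbdc10b96c9
size 2239036066
"""
)

# Hypothetical usage, assuming the checkpoint was downloaded to this path:
# ok = matches_pointer(Path("ckpts/iscene_denoiser.pt"), DENOISER_POINTER)
```

Comparing the size before hashing keeps the common failure case (a truncated download) fast, since the denoiser checkpoint is roughly 2.2 GB.
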
config.yml ADDED
@@ -0,0 +1,34 @@
+ # Inference-only architecture config for IScene-v1.
+ # This file intentionally excludes training logs, data paths, cluster settings,
+ # experiment names, and checkpoint metadata.
+
+ models:
+   denoiser:
+     name: SparseStructureSceneContextFlowModel
+     args:
+       resolution: 16
+       in_channels: 8
+       out_channels: 8
+       model_channels: 1024
+       cond_channels: 1024
+       num_blocks: 24
+       num_heads: 16
+       mlp_ratio: 4
+       patch_size: 1
+       pe_mode: ape
+       qk_rms_norm: true
+       use_fp16: true
+       scene_context_attn_num: 5
+       learning_pattern: full-finetune
+       exp_setting: global
+
+   img_conditioner:
+     name: ImageConditioner
+     args:
+       image_cond_model: dinov2_vitl14_reg
+       cond_in_channels: 3
+       use_fp16: false
+
+ dataset:
+   args:
+     exp_setting: global
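
To make the denoiser settings above more concrete, the sketch below derives two quantities from them: the per-head attention width and the number of latent tokens. It assumes the denoiser follows the usual transformer conventions (heads split `model_channels` evenly, and a cubic latent grid of side `resolution` is divided into patches of side `patch_size`). Those interpretations are assumptions and are not confirmed by this commit; the function name `derived_shapes` is hypothetical.

```python
# Denoiser arguments copied from config.yml in this commit.
DENOISER_ARGS = {
    "resolution": 16,
    "in_channels": 8,
    "out_channels": 8,
    "model_channels": 1024,
    "cond_channels": 1024,
    "num_blocks": 24,
    "num_heads": 16,
    "mlp_ratio": 4,
    "patch_size": 1,
}


def derived_shapes(args: dict) -> dict:
    """Compute head width and token count under the assumptions stated above."""
    if args["model_channels"] % args["num_heads"] != 0:
        raise ValueError("model_channels must be divisible by num_heads")
    if args["resolution"] % args["patch_size"] != 0:
        raise ValueError("resolution must be divisible by patch_size")
    # Assumption: each attention head gets an equal slice of the model width.
    head_dim = args["model_channels"] // args["num_heads"]
    # Assumption: the latent is a cube, so tokens = (patches per side) ** 3.
    patches_per_side = args["resolution"] // args["patch_size"]
    return {
        "head_dim": head_dim,
        "num_tokens": patches_per_side ** 3,
        "mlp_hidden": args["model_channels"] * args["mlp_ratio"],
    }


shapes = derived_shapes(DENOISER_ARGS)
print(shapes)  # {'head_dim': 64, 'num_tokens': 4096, 'mlp_hidden': 4096}
```
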
iscene_config.json ADDED
@@ -0,0 +1,7 @@
+ {
+   "model_name": "IScene-v1",
+   "base_model_id": "microsoft/TRELLIS-image-large",
+   "config_file": "config.yml",
+   "denoiser_checkpoint": "ckpts/iscene_denoiser.pt",
+   "image_conditioner_checkpoint": "ckpts/iscene_image_conditioner.pt"
+ }
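
Since `iscene_config.json` describes the package layout, a loader can resolve every file from that one manifest. The following is a minimal sketch of that idea, assuming all paths in the manifest are relative to the package directory. It is not the actual `ISceneInferencer.from_pretrained` implementation, and `resolve_package_files` is a hypothetical helper name.

```python
import json
from pathlib import Path

# Manifest keys whose values are file paths, as listed in iscene_config.json.
FILE_KEYS = (
    "config_file",
    "denoiser_checkpoint",
    "image_conditioner_checkpoint",
)


def resolve_package_files(package_dir):
    """Read iscene_config.json and return (metadata, resolved file paths).

    Raises FileNotFoundError if the manifest or any referenced file is missing.
    """
    root = Path(package_dir)
    manifest_path = root / "iscene_config.json"
    metadata = json.loads(manifest_path.read_text(encoding="utf-8"))

    resolved = {}
    for key in FILE_KEYS:
        # Assumption: manifest paths are relative to the package directory.
        path = root / metadata[key]
        if not path.is_file():
            raise FileNotFoundError(f"{key} points to a missing file: {path}")
        resolved[key] = path
    return metadata, resolved


# Hypothetical usage against a local copy of the package:
# metadata, files = resolve_package_files("release_hf/IScene-v1")
# print(metadata["base_model_id"], files["denoiser_checkpoint"])
```

Failing early on a missing file gives a clearer error than a later failure inside checkpoint loading.
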