Spaces:
Running
on
L40S
title: StableSpann3R
app_file: app.py
sdk: gradio
sdk_version: 4.42.0
3D Reconstruction with Spatial Memory
Paper | Project Page | Video
3D Reconstruction with Spatial Memory
Hengyi Wang, Lourdes Agapito
arXiv 2024
Installation
Clone Spann3R
git clone https://github.com/HengyiWang/spann3r.git cd spann3r
Create conda environment
conda create -n spann3r python=3.9 cmake=3.14.0 conda install pytorch==2.3.0 torchvision==0.18.0 torchaudio==2.3.0 pytorch-cuda=11.8 -c pytorch -c nvidia # use the correct version of cuda for your system pip install -r requirements.txt # Open3D has a bug from 0.16.0, please use dev version pip install -U -f https://www.open3d.org/docs/latest/getting_started.html open3d
Compile cuda kernels for RoPE
cd croco/models/curope/ python setup.py build_ext --inplace cd ../../../
Download the DUSt3R checkpoint
mkdir checkpoints cd checkpoints # Download DUSt3R checkpoints wget https://download.europe.naverlabs.com/ComputerVision/DUSt3R/DUSt3R_ViTLarge_BaseDecoder_512_dpt.pth
Download our checkpoint and place it under
./checkpoints
Demo
Download the example data (2 scenes from map-free-reloc) and unzip it as
./examples
Run demo:
python demo.py --demo_path ./examples/s00567 --kf_every 10 --vis
For visualization
--vis
, it will give you a window to adjust the rendering view. Once you find the view to render, please clickspace key
and close the window. The code will then do the rendering of the incremental reconstruction.
Training and Evaluation
Datasets
We use Habitat, ScanNet++, ScanNet, ArkitScenes, Co3D, and BlendedMVS to train our model. Please refer to data_preprocess.md.
Train
Please use the following command to train our model:
torchrun --nproc_per_node 8 train.py --batch_size 4
Eval
Please use the following command to evaluate our model:
python eval.py
Acknowledgement
Our code, data preprocessing pipeline, and evaluation scripts are based on several awesome repositories:
We thank the authors for releasing their code!
The research presented here has been supported by a sponsored research award from Cisco Research and the UCL Centre for Doctoral Training in Foundational AI under UKRI grant number EP/S021566/1. This project made use of time on Tier 2 HPC facility JADE2, funded by EPSRC (EP/T022205/1).
Citation
If you find our code or paper useful for your research, please consider citing:
@article{wang20243d,
title={3D Reconstruction with Spatial Memory},
author={Wang, Hengyi and Agapito, Lourdes},
journal={arXiv preprint arXiv:2408.16061},
year={2024}
}