Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator
Abstract
Monocular depth estimation (MDE) aims to predict scene depth from a single RGB image and plays a crucial role in 3D scene understanding. Recent advances in zero-shot MDE leverage normalized depth representations and distillation-based learning to improve generalization across diverse scenes. However, current depth normalization methods for distillation, relying on global normalization, can amplify noisy pseudo-labels, reducing distillation effectiveness. In this paper, we systematically analyze the impact of different depth normalization strategies on pseudo-label distillation. Based on our findings, we propose Cross-Context Distillation, which integrates global and local depth cues to enhance pseudo-label quality. Additionally, we introduce a multi-teacher distillation framework that leverages complementary strengths of different depth estimation models, leading to more robust and accurate depth predictions. Extensive experiments on benchmark datasets demonstrate that our approach significantly outperforms state-of-the-art methods, both quantitatively and qualitatively.
Community
Project page: https://distill-any-depth-official.github.io/
Code, models and demos are available now!
start
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining (2025)
- MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images (2025)
- DepthMaster: Taming Diffusion Models for Monocular Depth Estimation (2025)
- PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments (2025)
- Relative Pose Estimation through Affine Corrections of Monocular Depth Priors (2025)
- DEFOM-Stereo: Depth Foundation Model Based Stereo Matching (2025)
- Zero-Shot Monocular Scene Flow Estimation in the Wild (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper