DPT 3.0 release - a nielsr Collection

nielsr 's Collections

Image-to-text models

DPT 3.0 release

DPT 3.1 release

Depth Anything release

DPT 3.0 release

updated Jan 25

DPT 3.0 (MiDaS) models, leveraging ViT and ViT-hybrid backbones

Vision Transformers for Dense Prediction

Paper • 2103.13413 • Published Mar 24, 2021 • 1
Intel/dpt-large

Depth Estimation • Updated Feb 24 • 157k • 175

Note This model leverages a Vision Transformer (ViT) backbone for monocular depth estimation.
Intel/dpt-hybrid-midas

Depth Estimation • Updated Feb 9 • 464k • 84

Note This model leverages a hybrid Vision Transformer (ViT-hybrid) backbone for monocular depth estimation.