Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

Paper: arXiv:1907.01341
DPT 3.1 (MiDaS) models leverage state-of-the-art vision backbones such as BEiT and SwinV2.
Note: This model gives the highest quality, but is also the heaviest computationally, as mentioned in the paper.
Note: This model has moderately lower quality, but offers a better speed-quality trade-off.
Note: This model is recommended for deployment on embedded devices.
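These checkpoints are typically consumed through the Hugging Face `transformers` depth-estimation pipeline. The sketch below shows one common pattern, under a few stated assumptions: the checkpoint id `Intel/dpt-beit-large-512` and the sample image URL are illustrative placeholders (substitute the variant matching your quality/speed needs), and because MiDaS-style models predict *relative inverse depth*, a min-max normalization step is included to make the raw prediction viewable as an 8-bit image.

```python
import numpy as np

# Flip to True to run the full pipeline demo below
# (requires `pip install transformers torch pillow requests` and downloads the checkpoint).
RUN_DEMO = False


def normalize_depth(depth: np.ndarray) -> np.ndarray:
    """Min-max normalize a relative (inverse) depth map to uint8 [0, 255] for visualization."""
    d_min, d_max = float(depth.min()), float(depth.max())
    scaled = (depth - d_min) / max(d_max - d_min, 1e-8)  # guard against a constant map
    return (scaled * 255.0).astype(np.uint8)


if RUN_DEMO:
    import requests
    from PIL import Image
    from transformers import pipeline

    # Illustrative checkpoint id -- any DPT 3.1 variant (BEiT, SwinV2, ...) works here.
    pipe = pipeline("depth-estimation", model="Intel/dpt-beit-large-512")

    url = "http://images.cocodataset.org/val2017/000000039769.jpg"  # placeholder image
    image = Image.open(requests.get(url, stream=True).raw)

    result = pipe(image)  # dict containing a "predicted_depth" tensor
    depth = result["predicted_depth"].squeeze().cpu().numpy()
    Image.fromarray(normalize_depth(depth)).save("depth.png")
```

Note that the normalized output encodes only relative ordering of depths, not metric distances; comparing values across different images is not meaningful without further alignment.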