keras-io
/

monocular-depth-estimation

Model card Files Files and versions Metrics Training metrics Community

monocular-depth-estimation / README.md

vumichien's picture

Update README.md

5dd7e8a over 2 years ago

|

1.83 kB

	---
	tags:
	- image-segmentation
	library_name: keras
	---
	## Model description
	Full credits go to: [Vu Minh Chien](https://www.linkedin.com/in/vumichien/)

	Depth estimation is a crucial step towards inferring scene geometry from 2D images. The goal in monocular depth estimation is to predict the depth value of each pixel or inferring depth information, given only a single RGB image as input.

	## Dataset
	[NYU Depth Dataset V2](https://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html) is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect.

	## Training procedure

	### Training hyperparameters
	Model architecture:
	- UNet with a pretrained DenseNet 201 backbone.

	The following hyperparameters were used during training:
	- learning_rate: 1e-04
	- train_batch_size: 16
	- seed: 42
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: ReduceLROnPlateau
	- num_epochs: 10

	### Training results

	\| Epoch \| Training loss \| Validation Loss \| Learning rate \|
	\|:------:\|:-------------:\|:---------------:\|:-------------:\|
	\| 1 \| 0.1333 \| 0.1315 \| 1e-04 \|
	\| 2 \| 0.0948 \| 0.1232 \| 1e-04 \|
	\| 3 \| 0.0834 \| 0.1220 \| 1e-04 \|
	\| 4 \| 0.0775 \| 0.1213 \| 1e-04 \|
	\| 5 \| 0.0736 \| 0.1196 \| 1e-04 \|
	\| 6 \| 0.0707 \| 0.1205 \| 1e-04 \|
	\| 7 \| 0.0687 \| 0.1190 \| 1e-04 \|
	\| 8 \| 0.0667 \| 0.1177 \| 1e-04 \|
	\| 9 \| 0.0654 \| 0.1177 \| 1e-04 \|
	\| 10 \| 0.0635 \| 0.1182 \| 9e-05 \|
	### View Model Demo
	![Model Demo](./demo.png)

	### View Model Plot
	![Model Image](./model.png)