Paper The following article is Open access

Perceptual losses for self-supervised depth estimation

, and

Published under licence by IOP Publishing Ltd
, , Citation Xiaoyu Liu et al 2021 J. Phys.: Conf. Ser. 1952 022040 DOI 10.1088/1742-6596/1952/2/022040

1742-6596/1952/2/022040

Abstract

Convolution neural network has shown excellent results in stereo and monocular disparity estimation, while most of the existing methods convert the image depth prediction problem into the image reconstruction problem, and calculate the depth of each pixel through the disparity between the generated left and right images. However, in the reconstruction task, the loss is still calculated at the pixel level when comparing the reconstructed picture with the original picture, which will greatly affect the estimation of picture depth due to the problems of illumination and occlusion. Therefore, when calculating the loss of image reconstruction, it is very important to compare the higher-level features extracted from the reconstructed image with the original image. In this paper, based on the existing methods, we have innovated the loss function and introduced perceptual loss, i.e., we use feedforward neural network to extract features to further evaluate the reconstructed image, to make the reconstruction loss of baseline more accurate and improve the accuracy and robustness of the depth prediction model. To compare the improved effect, we performed extensive experiments on KITTI driving data by the improved model set, and the experimental index obtains better performance than the original baseline model.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/1952/2/022040