Paper ID | 3D-3.6 | ||
Paper Title | ATTENTION-BASED SELF-SUPERVISED LEARNING MONOCULAR DEPTH ESTIMATION WITH EDGE REFINEMENT | ||
Authors | Chenweinan Jiang, Haichun Liu, Key Laboratory of Navigation and Location Based Services, Shanghai Jiao Tong University, China; Lanzhen Li, Shanghai West Hongqiao Navigation Technology CO.,LTD., China; Changchun Pan, Key Laboratory of Navigation and Location Based Services, Shanghai Jiao Tong University, China | ||
Session | 3D-3: Stereoscopic and multiview processing | ||
Location | Area J | ||
Session Time: | Wednesday, 22 September, 14:30 - 16:00 | ||
Presentation Time: | Wednesday, 22 September, 14:30 - 16:00 | ||
Presentation | Poster | ||
Topic | Three-Dimensional Image and Video Processing: Stereoscopic and multiview processing and display | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | Learning depth from a single image extracted from unlabeled videos has been attracting significant attention in the past few years. In this work, we propose a new depth estimation neural network with edge refinement to predict depth. First, we introduce a dual attention module into depth prediction module to integrate global information into local features and improve local features’ capability of representation. Second, to increase the details between objects in scenes, we propose a subnetwork to predict edges in four directions and combine the predicted depth and edges to increase the details by propagation operation. Besides, we integrate the gradients of the image into the photometric reprojection loss to handle the confusion caused by changing brightness. We conduct experiments on KITTI datasets and show that our network achieves the state-of-art result. |