Depth Estimation and Dense Reconstruction with the Monocular Camera (Tutor: Tianjun Zhang, wechat: z619850002)

Introduction

3D reconstruction based on the monocular vision is a classic task in the field of computer vision. In this project, you need to mainly focus on three tasks, including pose recovery, depth estimation and dense map construction.

https://img-blog.csdnimg.cn/20200920003021642.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L2h4eGp4dw==,size_16,color_FFFFFF,t_70

                                                    Fig. 1: Depth Estimation

 

https://www.pianshen.com/images/534/98df7c946aaca511341da07a5b6d2c1e.png

                                             Fig. 2: Dense Scene Reconstruction

 

   Fig. 3: Scene structure reconstruction incrementally using a single agent

Requirements

1)  Collecting at least one video sequence of images and recovering corresponding camera poses of these images based on any SFM or SLAM methods. After that, depth maps of key frames should be recovered by any proper algorithms. Finally, the dense map of the scene can be constructed incrementally.

2) When the video stream and corresponding poses are input, your system should be able to construct a dense map in real-time or quasi real-time, rather than offline. GPU is allowed.

Reference Materials

  1. A SFM pipeline, Colmap: link: https://github.com/colmap/colmap

  2. A SLAM pipeline, ORB-SLAM2: link: https://github.com/raulmur/ORB_SLAM2

  3. Traditional depth estimation method, SGM: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4359315

  4. Deep learning based method, DPT: https://www.sciencedirect.com/science/article/abs/pii/S0950705122007821

  5. Demo pipeline of SGM: https://github.com/z619850002/DepthEstimation-SGM

  6. Official implementation of DPT: https://github.com/isl-org/DPT

Created on: Nov. 09, 2023