Deep learning-based depth estimation methods from monocular image and videos: a comprehensive survey
Document Type
Article
Publication Date
10-3-2024
Journal Title
ACM Computing Surveys
ISSN
ISSN: 0360-0300 eISSN: 1557-7341
Keywords
3D estimation, taxonomy, 3D reconstruction
Disciplines
Computer Sciences
Abstract
Estimating depth from single RGB images and videos is of widespread interest due to its applications in many areas, including autonomous driving, 3D reconstruction, digital entertainment, and robotics. More than 500 deep learning-based papers have been published in the past 10 years, which indicates the growing interest in the task. This paper presents a comprehensive survey of the existing deep learning-based methods, the challenges they address, and how they have evolved in their architecture and supervision methods. It provides a taxonomy for classifying the current work based on their input and output modalities, network architectures, and learning methods. It also discusses the major milestones in the history of monocular depth estimation, and different pipelines, datasets, and evaluation metrics used in existing methods.
Recommended Citation
Rajapaksha, U,
Sohel, F,
Laga, H,
Diepeveen, D A,
and
Bennamoun, M.
(2024), Deep learning-based depth estimation methods from monocular image and videos: a comprehensive survey. ACM Computing Surveys, 56 (12), 1-51.
https://library.dpird.wa.gov.au/hc_researchart/13
This file is 5.1 MB. Files over 3MB may be slow to open. For best results, right-click and select "save as..."