SPADnet: deep RGB-SPAD sensor fusion assisted by monocular depth estimation

Zhanghao Sun; David B Lindell; Olav Solgaard; Gordon Wetzstein

doi:10.1364/OE.392386

SPADnet: deep RGB-SPAD sensor fusion assisted by monocular depth estimation

Opt Express. 2020 May 11;28(10):14948-14962. doi: 10.1364/OE.392386.

Authors

Zhanghao Sun, David B Lindell, Olav Solgaard, Gordon Wetzstein

PMID: 32403527
DOI: 10.1364/OE.392386

Abstract

Single-photon light detection and ranging (LiDAR) techniques use emerging single-photon detectors (SPADs) to push 3D imaging capabilities to unprecedented ranges. However, it remains challenging to robustly estimate scene depth from the noisy and otherwise corrupted measurements recorded by a SPAD. Here, we propose a deep sensor fusion strategy that combines corrupted SPAD data and a conventional 2D image to estimate the depth of a scene. Our primary contribution is a neural network architecture-SPADnet-that uses a monocular depth estimation algorithm together with a SPAD denoising and sensor fusion strategy. This architecture, together with several techniques in network training, achieves state-of-the-art results for RGB-SPAD fusion with simulated and captured data. Moreover, SPADnet is more computationally efficient than previous RGB-SPAD fusion networks.