Improving the Efficiency of 3D Monocular Object Detection and Tracking for Road and Railway Smart Mobility

Sensors (Basel). 2023 Mar 16;23(6):3197. doi: 10.3390/s23063197.

Abstract

Three-dimensional (3D) real-time object detection and tracking is an important task in the case of autonomous vehicles and road and railway smart mobility, in order to allow them to analyze their environment for navigation and obstacle avoidance purposes. In this paper, we improve the efficiency of 3D monocular object detection by using dataset combination and knowledge distillation, and by creating a lightweight model. Firstly, we combine real and synthetic datasets to increase the diversity and richness of the training data. Then, we use knowledge distillation to transfer the knowledge from a large, pre-trained model to a smaller, lightweight model. Finally, we create a lightweight model by selecting the combinations of width, depth & resolution in order to reach a target complexity and computation time. Our experiments showed that using each method improves either the accuracy or the efficiency of our model with no significant drawbacks. Using all these approaches is especially useful for resource-constrained environments, such as self-driving cars and railway systems.

Keywords: 3D bounding boxes estimation; 3D multi-object detection; dataset combination; deep learning; distance estimation; knowledge distillation; monocular 3D object detection; object localization; smart mobility.

Grants and funding

This work was funded under SEGULA Technologies’ collaboration with IRSEEM as part of their efforts in the field of railways’ smart mobility for future commercial applications through the Ph.D. Thesis of Antoin Mauri and Alexandre Evain. In addition, the ANRT (Association Nationale de la Recherche et de la Technologie)’s CIFRE program contributed to the financing of both Ph.D. Thesis.