UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

SeMLaPS: Real-Time Semantic Mapping With Latent Prior Networks and Quasi-Planar Segmentation

Wang, J; Tarrio, J; Agapito, L; Alcantarilla, PF; Vakhitov, A; (2023) SeMLaPS: Real-Time Semantic Mapping With Latent Prior Networks and Quasi-Planar Segmentation. IEEE Robotics and Automation Letters , 8 (12) pp. 7954-7961. 10.1109/LRA.2023.3322647. Green open access

[thumbnail of semlaps-arxiv-2306.16585.pdf]
Preview
Text
semlaps-arxiv-2306.16585.pdf - Accepted Version

Download (8MB) | Preview

Abstract

The availability of real-time semantics greatly improves the core geometric functionality of SLAM systems, enabling numerous robotic and AR/VR applications. We present a new methodology for real-time semantic mapping from RGB-D sequences that combines a 2D neural network and a 3D network based on a SLAM system with 3D occupancy mapping. When segmenting a new frame we perform latent feature re-projection from previous frames based on differentiable rendering. Fusing re-projected feature maps from previous frames with current-frame features greatly improves image segmentation quality, compared to a baseline that processes images independently. For 3D map processing, we propose a novel geometric quasi-planar over-segmentation method that groups 3D map elements likely to belong to the same semantic classes, relying on surface normals. We also describe a novel neural network design for lightweight semantic map post-processing. Our system achieves state-of-the-art semantic mapping quality within 2D-3D networks-based systems and matches the performance of 3D convolutional networks on three real indoor datasets, while working in real-time. Moreover, it shows better cross-sensor generalization abilities compared to 3D CNNs, enabling training and inference with different depth sensors.

Type: Article
Title: SeMLaPS: Real-Time Semantic Mapping With Latent Prior Networks and Quasi-Planar Segmentation
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/LRA.2023.3322647
Publisher version: https://doi.org/10.1109/LRA.2023.3322647
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: AI-enabled robotics, RGB-D perception, mapping
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10184267
Downloads since deposit
168Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item