UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

FETNet: Feature exchange transformer network for RGB-D object detection

Xiao, Z; Xue, J; Xie, P; Wang, G; (2021) FETNet: Feature exchange transformer network for RGB-D object detection. In: Proceedings of the the British Machine Vision Conference (BMVC) 2021. (pp. pp. 1-12). The BMVC Green open access

[thumbnail of ZhibinXiao-BMVC2021-final.pdf]
Preview
Text
ZhibinXiao-BMVC2021-final.pdf - Accepted Version

Download (1MB) | Preview

Abstract

In RGB-D object detection, due to the inherent difference between the RGB and Depth modalities, it remains challenging to simultaneously leverage sensed photometric and depth information. In this paper, to address this issue, we propose a Feature Exchange Transformer Network (FETNet), which consists of two well-designed components: the Feature Exchange Module (FEM), and the Multi-modal Vision Transformer (MViT). Specially, we propose the FEM to exchange part of the channels between RGB and depth features at each backbone stage, which facilitates the information flow, and bridges the gap, between the two modalities. Inspired by the success of Vision Transformer (ViT), we develop the variant MViT to effectively fuse multi-modal features and exploit the attention between the RGB and depth features. Different from previous methods developing from specified RGB detection algorithm, our proposal is generic. Extensive experiments prove that, when the proposed modules are integrated into mainstream RGB object detection methods, their RGB-D counterparts can obtain significant performance gains. Moreover, our FETNet surpasses state-of-the-art RGB-D detectors by 7.0% mAP on SUN RGB-D and 1.7% mAP on NYU Depth v2, which also well demonstrates the effectiveness of the proposed method.

Type: Proceedings paper
Title: FETNet: Feature exchange transformer network for RGB-D object detection
Event: The British Machine Vision Conference (BMVC) 2021
Location: Virtual Meeting
Dates: 22nd-25th November 2021
Open access status: An open access version is available from UCL Discovery
Publisher version: https://britishmachinevisionassociation.github.io/...
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10139902
Downloads since deposit
10,412Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item