UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis

Wang, An; Islam, Mobarakol; Xu, Mengya; Ren, Hongliang; (2023) Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis. In: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). (pp. pp. 4608-4614). IEEE: Detroit, MI, USA. Green open access

[thumbnail of 2306.16285.pdf]
Preview
Text
2306.16285.pdf - Accepted Version

Download (948kB) | Preview

Abstract

Despite their impressive performance in various surgical scene understanding tasks, deep learning-based methods are frequently hindered from deploying to real-world surgical applications for various causes. Particularly, data collection, annotation, and domain shift in-between sites and patients are the most common obstacles. In this work, we mitigate data-related issues by efficiently leveraging minimal source images to generate synthetic surgical instrument segmentation datasets and achieve outstanding generalization performance on unseen real domains. Specifically, in our framework, only one background tissue image and at most three images of each foreground instrument are taken as the seed images. These source images are extensively transformed and employed to build up the foreground and background image pools, from which randomly sampled tissue and instrument images are composed with multiple blending techniques to generate new surgical scene images. Besides, we introduce hybrid training-time augmentations to diversify the training data further. Extensive evaluation on three real-world datasets, i.e., Endo2017, Endo2018, and RoboTool, demonstrates that our one-to-many synthetic surgical instruments datasets generation and segmentation framework can achieve encouraging performance compared with training with real data. Notably, on the RoboTool dataset, where a more significant domain gap exists, our framework shows its superiority of generalization by a considerable margin. We expect that our inspiring results will attract research attention to improving model generalization with data synthesizing.

Type: Proceedings paper
Title: Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis
Event: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Dates: 1 Oct 2023 - 5 Oct 2023
ISBN-13: 978-1-6654-9190-7
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/IROS55552.2023.10341609
Publisher version: http://dx.doi.org/10.1109/iros55552.2023.10341609
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Training; Learning systems; Image segmentation; Annotations; Instruments; Training data; Data collection
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Med Phys and Biomedical Eng
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10186370
Downloads since deposit
12Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item