UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Robustness Stress Testing in Medical Image Classification

Islam, M; Li, Z; Glocker, B; (2023) Robustness Stress Testing in Medical Image Classification. In: Sudre, CH and Baumgartner, CF and Dalca, A and Mehta, R and Qin, C and Wells, WM, (eds.) Uncertainty for Safe Utilization of Machine Learning in Medical Imaging: 5th International Workshop, UNSURE 2023, Held in Conjunction with MICCAI 2023, Vancouver, BC, Canada, October 12, 2023, Proceedings. (pp. pp. 167-176). Springer: Cham, Switzerland. Green open access

[thumbnail of 2308.06889.pdf]
Preview
Text
2308.06889.pdf - Accepted Version

Download (2MB) | Preview

Abstract

Deep neural networks have shown impressive performance for image-based disease detection. Performance is commonly evaluated through clinical validation on independent test sets to demonstrate clinically acceptable accuracy. Reporting good performance metrics on test sets, however, is not always a sufficient indication of the generalizability and robustness of an algorithm. In particular, when the test data is drawn from the same distribution as the training data, the iid test set performance can be an unreliable estimate of the accuracy on new data. In this paper, we employ stress testing to assess model robustness and subgroup performance disparities in disease detection models. We design progressive stress testing using five different bidirectional and unidirectional image perturbations with six different severity levels. As a use case, we apply stress tests to measure the robustness of disease detection models for chest X-ray and skin lesion images, and demonstrate the importance of studying class and domain-specific model behaviour. Our experiments indicate that some models may yield more robust and equitable performance than others. We also find that pretraining characteristics play an important role in downstream robustness. We conclude that progressive stress testing is a viable and important tool and should become standard practice in the clinical validation of image-based disease detection models.

Type: Proceedings paper
Title: Robustness Stress Testing in Medical Image Classification
Event: 5th International Workshop on Uncertainty for Safe Utilization of Machine Learning in Medical Imaging (UNSURE)
Location: CANADA, Vancouver
Dates: 12 Oct 2023
ISBN-13: 978-3-031-44335-0
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-031-44336-7_17
Publisher version: https://doi.org/10.1007/978-3-031-44336-7_17
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Med Phys and Biomedical Eng
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10187095
Downloads since deposit
36Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item