UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

A Quantitative Comparison Between Human and Artificial Intelligence in the Detection of Focal Cortical Dysplasia

Walger, L; Bauer, T; Kügler, D; Schmitz, MH; Schuch, F; Arendt, C; Baumgartner, T; ... Rüber, T; + view all (2024) A Quantitative Comparison Between Human and Artificial Intelligence in the Detection of Focal Cortical Dysplasia. Investigative Radiology 10.1097/RLI.0000000000001125. (In press).

[thumbnail of Walger-et-al_manuscript-clean-version.R1.pdf] Text
Walger-et-al_manuscript-clean-version.R1.pdf - Accepted Version
Access restricted to UCL open access staff until 13 November 2025.

Download (576kB)

Abstract

Objectives: Artificial intelligence (AI) is thought to improve lesion detection. However, a lack of knowledge about human performance prevents a comparative evaluation of AI and an accurate assessment of its impact on clinical decision-making. The objective of this work is to quantitatively evaluate the ability of humans to detect focal cortical dysplasia (FCD), compare it to state-of-the-art AI, and determine how it may aid diagnostics. Materials and Methods: We prospectively recorded the performance of readers in detecting FCDs using single points and 3-dimensional bounding boxes. We acquired predictions of 3 AI models for the same dataset and compared these to readers. Finally, we analyzed pairwise combinations of readers and models. Results: Twenty-eight readers, including 20 nonexpert and 5 expert physicians, reviewed 180 cases: 146 subjects with FCD (median age: 25, interquartile range: 18) and 34 healthy control subjects (median age: 43, interquartile range: 19). Nonexpert readers detected 47% (95% confidence interval [CI]: 46, 49) of FCDs, whereas experts detected 68% (95% CI: 65, 71). The 3 AI models detected 32%, 51%, and 72% of FCDs, respectively. The latter, however, also predicted more than 13 false-positive clusters per subject on average. Human performance was improved in the presence of a transmantle sign (P < 0.001) and cortical thickening (P < 0.001). In contrast, AI models were sensitive to abnormal gyration (P < 0.01) or gray-white matter blurring (P < 0.01). Compared with single experts, expert-expert pairs detected 13% (95% CI: 9, 18) more FCDs (P < 0.001). All AI models increased expert detection rates by up to 19% (95% CI: 15, 24) (P < 0.001). Nonexpert+AI pairs could still outperform single experts by up to 13% (95% CI: 10, 17). Conclusions: This study pioneers the comparative evaluation of humans and AI for FCD lesion detection. It shows that AI and human predictions differ, especially for certain MRI features of FCD, and, thus, how AI may complement the diagnostic workup.

Type: Article
Title: A Quantitative Comparison Between Human and Artificial Intelligence in the Detection of Focal Cortical Dysplasia
Location: United States
DOI: 10.1097/RLI.0000000000001125
Publisher version: http://dx.doi.org/10.1097/rli.0000000000001125
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > UCL Queen Square Institute of Neurology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > UCL Queen Square Institute of Neurology > Clinical and Experimental Epilepsy
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10200461
Downloads since deposit
3Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item