Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets

Thomas, Hannah Mary T; Wang, Helen YC; Varghese, Amal Joseph; Donovan, Ellen M; South, Chris P; Saxby, Helen; Nisbet, Andrew; ... Evans, Philip M; + view all (2023) Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets. Applied Sciences , 13 (12) , Article 7291. 10.3390/app13127291. Green open access

Preview

Text
applsci-13-07291-v2.pdf - Published Version
Download (2MB) | Preview

Abstract

Radiomics involves the extraction of information from medical images that are not visible to the human eye. There is evidence that these features can be used for treatment stratification and outcome prediction. However, there is much discussion about the reproducibility of results between different studies. This paper studies the reproducibility of CT texture features used in radiomics, comparing two feature extraction implementations, namely the MATLAB toolkit and Pyradiomics, when applied to independent datasets of CT scans of patients: (i) the open access RIDER dataset containing a set of repeat CT scans taken 15 min apart for 31 patients (RIDER Scan 1 and Scan 2, respectively) treated for lung cancer; and (ii) the open access HN1 dataset containing 137 patients treated for head and neck cancer. Gross tumor volume (GTV), manually outlined by an experienced observer available on both datasets, was used. The 43 common radiomics features available in MATLAB and Pyradiomics were calculated using two intensity-level quantization methods with and without an intensity threshold. Cases were ranked for each feature for all combinations of quantization parameters, and the Spearman’s rank coefficient, rs, calculated. Reproducibility was defined when a highly correlated feature in the RIDER dataset also correlated highly in the HN1 dataset, and vice versa. A total of 29 out of the 43 reported stable features were found to be highly reproducible between MATLAB and Pyradiomics implementations, having a consistently high correlation in rank ordering for RIDER Scan 1 and RIDER Scan 2 (rs > 0.8). 18/43 reported features were common in the RIDER and HN1 datasets, suggesting they may be agnostic to disease site. Useful radiomics features should be selected based on reproducibility. This study identified a set of features that meet this requirement and validated the methodology for evaluating reproducibility between datasets.

Type:	Article
Title:	Reproducibility in Radiomics: A Comparison of Feature Extraction Methods and Two Independent Datasets
Open access status:	An open access version is available from UCL Discovery
DOI:	10.3390/app13127291
Publisher version:	https://doi.org/10.3390/app13127291
Language:	English
Additional information:	Copyright: © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).
Keywords:	radiomics; reproducibility; repeatability; validation; lung cancer; head and neck cancer; CT imaging
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Med Phys and Biomedical Eng
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/10172643

Downloads since deposit

2,312Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item