UCL Discovery Stage

Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets

Climaco Pinto, R; Karaman, I; Lewis, MR; Hällqvist, J; Kaluarachchi, M; Graça, G; Chekmeneva, E; ... Ebbels, T (2021) Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets. Analytical Chemistry, 94 (14) pp. 5493-5503. 10.1021/acs.analchem.1c03592. Green open access

acs.analchem.1c03592.pdf - Published Version

Abstract

Integration of multiple datasets can greatly enhance bioanalytical studies, for example, by increasing power to discover and validate biomarkers. In liquid chromatography-mass spectrometry (LC-MS) metabolomics, it is especially hard to combine untargeted datasets since the majority of metabolomic features are not annotated and thus cannot be matched by chemical identity. Typically, the information available for each feature is retention time (RT), mass-to-charge ratio (m/z), and feature intensity (FI). Pairs of features from the same metabolite in separate datasets can exhibit small but significant differences, making matching very challenging. Current methods to address this issue are too simple or rely on assumptions that cannot be met in all cases. We present a method to find feature correspondence between two similar LC-MS metabolomics experiments or batches using only the features' RT, m/z, and FI. We demonstrate the method on both real and synthetic datasets, using six orthogonal validation strategies to gauge the matching quality. In our main example, 4953 features were uniquely matched, and 585 (96.8%) of 604 manually annotated features were matched correctly. In a second example, 2324 features could be uniquely matched, with 79 (90.8%) of 87 annotated features correctly matched. Most of the missed annotated matches are between features that behave very differently from the modeled inter-dataset shifts of RT, m/z, and FI. In a third example, using simulated data with 4755 features per dataset, 99.6% of the matches were correct. Finally, the results of matching three other dataset pairs using our method are compared with a published alternative method, metabCombiner, showing the advantages of our approach. The method can be applied using M2S (Match 2 Sets), a free, open-source MATLAB toolbox, available at https://github.com/rjdossan/M2S.
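
To make the matching problem concrete, the following is a minimal Python sketch of the simplest kind of approach the abstract alludes to: pairing features whose RT, m/z, and FI fall within fixed tolerance windows, then keeping only unambiguous pairs. It is not the M2S algorithm, which models inter-dataset shifts rather than using fixed windows. The names `Feature`, `candidate_matches`, and `unique_matches`, the tolerance values, and the example data are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    rt: float  # retention time (minutes assumed)
    mz: float  # mass-to-charge ratio
    fi: float  # feature intensity (arbitrary units, must be > 0)


def candidate_matches(ref, target, rt_tol=0.1, mz_tol=0.005, log_fi_tol=1.0):
    """Return (i, j) index pairs whose RT, m/z and log10(FI) all differ
    by no more than the given tolerances (illustrative values only)."""
    pairs = []
    for i, a in enumerate(ref):
        for j, b in enumerate(target):
            close_rt = abs(a.rt - b.rt) <= rt_tol
            close_mz = abs(a.mz - b.mz) <= mz_tol
            close_fi = abs(math.log10(a.fi) - math.log10(b.fi)) <= log_fi_tol
            if close_rt and close_mz and close_fi:
                pairs.append((i, j))
    return pairs


def unique_matches(pairs):
    """Keep only pairs in which both features occur in exactly one candidate pair."""
    ref_counts = Counter(i for i, _ in pairs)
    tgt_counts = Counter(j for _, j in pairs)
    return [(i, j) for i, j in pairs if ref_counts[i] == 1 and tgt_counts[j] == 1]


# Made-up example data: two small feature lists from hypothetical datasets.
ref = [
    Feature(1.00, 100.000, 5e4),
    Feature(2.50, 200.100, 1e5),
    Feature(4.00, 300.200, 2e4),
]
target = [
    Feature(1.05, 100.002, 4.5e4),
    Feature(2.52, 200.101, 9e4),
    Feature(2.55, 200.103, 8e4),  # second plausible partner for ref[1]
    Feature(6.00, 500.000, 1e4),
]

pairs = candidate_matches(ref, target)
print(pairs)                  # all candidate pairs within the windows
print(unique_matches(pairs))  # only the unambiguous ones
```

In this toy data, the second reference feature has two plausible partners and is therefore dropped from the unique matches, which illustrates why fixed windows alone can be too simple when datasets show systematic shifts between them.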

Type: Article
Title: Finding Correspondence between Metabolomic Features in Untargeted Liquid Chromatography-Mass Spectrometry Metabolomics Datasets
Location: United States
Open access status: An open access version is available from UCL Discovery
DOI: 10.1021/acs.analchem.1c03592
Publisher version: https://doi.org/10.1021/acs.analchem.1c03592
Language: English
Additional information: This version is the version of record available under the Creative Commons Attribution 4.0 International (CC BY 4.0) licence.
Keywords: Biomarkers, Chromatography, Liquid, Mass Spectrometry, Metabolomics
UCL classification: UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > UCL Queen Square Institute of Neurology > Neurodegenerative Diseases
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > UCL Queen Square Institute of Neurology
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10147605