UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

M3C: Monte Carlo reference-based consensus clustering

John, C; Watson, D; Russ, D; Goldmann, K; Ehrenstein, M; Pitzalis, C; Lewis, M; (2019) M3C: Monte Carlo reference-based consensus clustering. BioRxiv: Cold Spring Harbor, NY, USA. Green open access

[thumbnail of Watson_377002.full.pdf]
Preview
Text
Watson_377002.full.pdf

Download (5MB) | Preview

Abstract

Genome-wide data is used to stratify patients into classes for precision medicine using clustering algorithms. A common problem in this area is selection of the number of clusters (K). The Monti consensus clustering algorithm is a widely used method which uses stability selection to estimate K. However, the method has bias towards higher values of K and yields high numbers of false positives. As a solution, we developed Monte Carlo reference-based consensus clustering (M3C), which is based on this algorithm. M3C simulates null distributions of stability scores for a range of K values thus enabling a comparison with real data to remove bias and statistically test for the presence of structure. M3C corrects the inherent bias of consensus clustering as demonstrated on simulated and real expression data from The Cancer Genome Atlas (TCGA). For testing M3C, we developed clusterlab, a new method for simulating multivariate Gaussian clusters.

Type: Working / discussion paper
Title: M3C: Monte Carlo reference-based consensus clustering
Open access status: An open access version is available from UCL Discovery
DOI: 10.1101/377002
Publisher version: https://doi.org/10.1101/377002
Language: English
Additional information: The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10118990
Downloads since deposit
1,122Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item