UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Lehmann, Brieuc; Mackintosh, Maxine; McVean, Gil; Holmes, Chris; (2023) Optimal strategies for learning multi-ancestry polygenic scores vary across traits. Nature Communications , 14 , Article 4023. 10.1038/s41467-023-38930-7. Green open access

[thumbnail of s41467-023-38930-7.pdf]
Preview
Text
s41467-023-38930-7.pdf - Published Version

Download (1MB) | Preview

Abstract

Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry individuals. Although there has been recent progress in combining multiple PGS trained on distinct populations, the problem of how to maximize performance given a multiple-ancestry cohort is largely unexplored. Here, we investigate the effect of sample size and ancestry composition on PGS performance for fifteen traits in UK Biobank. For some traits, PGS estimated using a relatively small African-ancestry training set outperformed, on an African-ancestry test set, PGS estimated using a much larger European-ancestry only training set. We observe similar, but not identical, results when considering other minority-ancestry groups within UK Biobank. Our results emphasise the importance of targeted data collection from underrepresented groups in order to address existing disparities in PGS performance.

Type: Article
Title: Optimal strategies for learning multi-ancestry polygenic scores vary across traits
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s41467-023-38930-7
Publisher version: https://doi.org/10.1038/s41467-023-38930-7
Language: English
Additional information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Keywords: Humans, Black People, Data Collection, Genetic Predisposition to Disease, Genome-Wide Association Study, Minority Groups, Multifactorial Inheritance, Genetics, Population
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10173260
Downloads since deposit
988Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item