Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic


Yebra, G; Hodcroft, EB; Ragonnet-Cronin, ML; Kozlakidis, Z; Pillay, D; Nastouli, E; Hayward, A; (2016) Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic. Scientific Reports, 6, Article 39489. 10.1038/srep39489. Green open access

Text (Published article): Yebra_Using nearly full-genome HIV sequence data improves phylogeny reconstruction.pdf - Published Version (393kB)
Text (Supplementary information): Yebra_Using nearly full-genome HIV sequence data improves phylogeny reconstruction Supplementary info.pdf (448kB)

Abstract

HIV molecular epidemiology studies analyse viral pol gene sequences because of their availability, but whole genome sequencing allows the use of other genes. We aimed to determine which gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to that of the corresponding true tree using CompareTree. The accuracy of the trees increased significantly with the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.
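The sub-sampling design described in the abstract (four sampling depths, 100 replicates each) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `subsample_replicates`, the placeholder sequence IDs, the fixed seed, and the use of uniform random sampling without replacement are all assumptions made for illustration.

```python
import random


def subsample_replicates(seq_ids, depths=(1.0, 0.6, 0.2, 0.05),
                         n_replicates=100, seed=0):
    """Draw replicate subsamples of sequence IDs at each sampling depth.

    Hypothetical helper: assumes uniform sampling without replacement,
    which the abstract does not specify.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = {}
    for depth in depths:
        k = round(len(seq_ids) * depth)  # number of sequences kept
        replicates[depth] = [
            sorted(rng.sample(seq_ids, k)) for _ in range(n_replicates)
        ]
    return replicates


# Placeholder IDs standing in for the 4662 simulated sequences.
ids = [f"seq{i}" for i in range(4662)]
reps = subsample_replicates(ids)
```

Under these assumptions, every replicate at the 100% depth contains the full dataset, so the replicates only differ from one another at the lower depths. Each subsample would then be used to extract the chosen gene region(s) and build a tree, steps that this sketch does not cover.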

Type:	Article
Title:	Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1038/srep39489
Publisher version:	http://dx.doi.org/10.1038/srep39489
Language:	English
Additional information:	This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
UCL classification:	UCL
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Div of Infection and Immunity
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Epidemiology and Health
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Epidemiology and Health > Epidemiology and Public Health
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health
	UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Infection, Immunity and Inflammation Dept
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/1534519
