A simulation study to examine the information content in phylogenomic datasets under the multispecies coalescent model

Huang, J; Flouris, T; Yang, Z (2020) A simulation study to examine the information content in phylogenomic datasets under the multispecies coalescent model. Molecular Biology and Evolution, 37 (11) pp. 3211-3224. 10.1093/molbev/msaa166. Green open access

Text: 2020HuangbppInformationMC.pdf - Accepted Version (830kB)

Abstract

We use computer simulation to examine the information content in multilocus data sets for inference under the multispecies coalescent model. Inference problems considered include estimation of evolutionary parameters (such as species divergence times, population sizes, and cross-species introgression probabilities), species tree estimation, and species delimitation based on Bayesian comparison of delimitation models. We found that the number of loci is the most influential factor for almost all inference problems examined. Although the number of sequences per species does not appear to be important to species tree estimation, it is very influential to species delimitation. Increasing the number of sites and the per-site mutation rate both increase the mutation rate for the whole locus and these have the same effect on estimation of parameters, but the sequence length has a greater effect than the per-site mutation rate for species tree estimation. We discuss the computational costs when the data size increases and provide guidelines concerning the subsampling of genomic data to enable the application of full-likelihood methods of inference.
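
The abstract's remark that sequence length and per-site mutation rate both raise the mutation rate for the whole locus can be illustrated with a small arithmetic sketch. It assumes the locus-wide rate is simply the product of the number of sites and the per-site rate; the function name and the numeric values are hypothetical and are not taken from the paper.

```python
import math


def locus_mutation_rate(num_sites: int, per_site_rate: float) -> float:
    """Locus-wide mutation rate, assumed here to be sites x per-site rate."""
    return num_sites * per_site_rate


# Hypothetical illustrative values, not the settings used in the study.
base = locus_mutation_rate(500, 1e-9)
longer = locus_mutation_rate(1000, 1e-9)  # double the number of sites
faster = locus_mutation_rate(500, 2e-9)   # double the per-site rate

# Both changes double the locus-wide rate by the same amount.
print(math.isclose(longer, faster), math.isclose(longer, 2 * base))
```

Under this simplification the two changes are interchangeable, which matches the reported equivalence for parameter estimation. It does not capture the reported difference for species tree estimation, where sequence length had the greater effect.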

Type: Article
Title: A simulation study to examine the information content in phylogenomic datasets under the multispecies coalescent model
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/molbev/msaa166
Publisher version: https://doi.org/10.1093/molbev/msaa166
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
Keywords: Bayesian inference, BPP, information content, multispecies coalescent, MSC, MSC with introgression, MSci
UCL classification: UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10102929
Downloads since deposit: 4,028