UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Effects of sampling close relatives on some elementary population genetics analyses

Wang, J; (2018) Effects of sampling close relatives on some elementary population genetics analyses. Molecular Ecology Resources , 18 (1) pp. 41-54. 10.1111/1755-0998.12708. Green open access

[thumbnail of Figures_V2.pdf]
Preview
Text
Figures_V2.pdf - Accepted Version

Download (833kB) | Preview
[thumbnail of Wang_Relatives_V4.pdf]
Preview
Text
Wang_Relatives_V4.pdf - Accepted Version

Download (508kB) | Preview

Abstract

Many molecular ecology analyses assume the genotyped individuals are sampled at random from a population and thus are representative of the population. Realistically, however, a sample may contain excessive close relatives (ECR) because, for example, localized juveniles are drawn from fecund species. Our knowledge is limited about how ECR affect the routinely conducted elementary genetics analyses, and how ECR are best dealt with to yield unbiased and accurate parameter estimates. This study quantifies the effects of ECR on some popular population genetics analyses of marker data, including the estimation of allele frequencies, F-statistics, expected heterozygosity (He), effective and observed numbers of alleles, and the tests of Hardy–Weinberg equilibrium (HWE) and linkage equilibrium (LE). It also investigates several strategies for handling ECR to mitigate their impact and to yield accurate parameter estimates. My analytical work, assisted by simulations, shows that ECR have large and global effects on all of the above marker analyses. The naïve approach of simply ignoring ECR could yield low-precision and often biased parameter estimates, and could cause too many false rejections of HWE and LE. The bold approach, which simply identifies and removes ECR, and the cautious approach, which estimates target parameters (e.g., He) by accounting for ECR and using naïve allele frequency estimates, eliminate the bias and the false HWE and LE rejections, but could reduce estimation precision substantially. The likelihood approach, which accounts for ECR in estimating allele frequencies and thus target parameters relying on allele frequencies, usually yields unbiased and the most accurate parameter estimates. Which of the four approaches is the most effective and efficient may depend on the particular marker analysis to be conducted. The results are discussed in the context of using marker data for understanding population properties and marker properties.

Type: Article
Title: Effects of sampling close relatives on some elementary population genetics analyses
Open access status: An open access version is available from UCL Discovery
DOI: 10.1111/1755-0998.12708
Publisher version: http://dx.doi.org/10.1111/1755-0998.12708
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Allele Frequency; F-statistics; Genetic Variation; Hardy–weinberg Equilibrium; Linkage Equilibrium
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: https://discovery-pp.ucl.ac.uk/id/eprint/1567910
Downloads since deposit
41,724Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item