Modifying the Chi-square and the CMH test for population genetic inference: adapting to over-dispersion

02/21/2019
by   Kerstin Spitzer, et al.

Evolve and resequence studies provide a popular approach to simulate evolution in the lab and explore its genetic basis. In this context, the chi-square test, Fisher's exact test, and the Cochran-Mantel-Haenszel test are commonly used to infer genomic positions affected by selection from temporal changes in allele frequency. However, the null model associated with these tests does not match the null hypothesis of actual interest. Indeed, due to genetic drift and possibly additional noise components such as pool sequencing, the null variance in the data can be substantially larger than accounted for by these common test statistics. This leads to p-values that are systematically too small and therefore to a large number of false positive results. Even if the ranking rather than the actual p-values is of interest, a naive application of the mentioned tests will give misleading results, as the amount of over-dispersion varies from locus to locus. We therefore propose adjusted statistics that take the over-dispersion into account while keeping the formulas simple. This is particularly useful in genome-wide applications, where millions of SNPs can be handled with little computational effort. We then apply the adapted test statistics to real data from Drosophila, and investigate how information from intermediate generations can be included when available. The obtained formulas may also be useful in other situations, provided that the null variance either is known or can be estimated.
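The inflation of false positives described above can be illustrated with a small simulation. The following is a minimal sketch, not the adjusted statistics proposed in the paper: it simulates neutral Wright-Fisher drift followed by binomial read sampling, applies the unadjusted Pearson chi-square test to the start and end allele counts, and reports how often a neutral locus is declared significant. All function names and parameter values (population size 100, 10 generations, coverage 100, starting frequency 0.5) are assumptions chosen for illustration only.

```python
import math
import random


def binomial(n, p, rng):
    """Draw one Binomial(n, p) variate by summing Bernoulli trials."""
    return sum(1 for _ in range(n) if rng.random() < p)


def chi2_pvalue_2x2(a, b, c, d):
    """Pearson chi-square p-value (1 df, no continuity correction)
    for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if min(row1, row2, col1, col2) == 0:
        return 1.0  # degenerate table: no evidence of a frequency change
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom, P(X > stat) = erfc(sqrt(stat / 2)).
    return math.erfc(math.sqrt(stat / 2.0))


def false_positive_rate(n_loci=400, pop_size=100, generations=10,
                        coverage=100, p0=0.5, alpha=0.05, seed=1):
    """Fraction of neutral loci with p < alpha under the unadjusted test.

    Illustrative assumptions: diploid population (2 * pop_size chromosomes),
    no selection, independent loci, equal coverage at both time points.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_loci):
        # Neutral drift: resample all chromosomes each generation.
        p = p0
        for _ in range(generations):
            p = binomial(2 * pop_size, p, rng) / (2 * pop_size)
        # Sequencing noise: binomial sampling of reads at both time points.
        start = binomial(coverage, p0, rng)
        end = binomial(coverage, p, rng)
        pval = chi2_pvalue_2x2(start, coverage - start, end, coverage - end)
        if pval < alpha:
            hits += 1
    return hits / n_loci


if __name__ == "__main__":
    print("no drift  :", false_positive_rate(generations=0))
    print("with drift:", false_positive_rate(generations=10))
```

With `generations=0` the only noise is read sampling, which the chi-square null model accounts for, so the rejection rate should stay near the nominal level. With drift included, the extra variance is not part of the test's null model and the rejection rate rises well above `alpha`.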
