Multiple Testing in Genome-Wide Association Studies via Hierarchical Hidden Markov Models

12/20/2022
by   Pengfei Wang, et al.
0

The problems of large-scale multiple testing are often encountered in modern scientific researches. Conventional multiple testing procedures usually suffer considerable loss of testing efficiency due to the lack of consideration of correlations among tests. In fact, the appropriate use of correlation information not only enhances the efficacy of multiple testing but also improves the interpretability of the results. Since the disease- or trait-related single nucleotide polymorphisms (SNPs) often tend to be clustered and exhibit serial correlations, the hidden Markov model (HMM) based multiple testing procedure has been successfully applied in genome-wide association studies (GWAS). It is important to note that modeling the entire chromosome using one HMM is somewhat rough. To overcome this issue, this paper employs the hierarchical hidden Markov model (HHMM) to describe local correlations among tests and develops a multiple testing procedure that can not only automatically divide different class of chromosome regions, but also takes into account local correlations among tests. Theoretically, it is shown that the proposed multiple testing procedure is valid and optimal in some sense. Then a data-driven procedure is developed to mimic the oracle version. Extensive simulations and the real data analysis show that the novel multiple testing procedure outperforms its competitors.

READ FULL TEXT

page 15

page 16

page 18

page 19

research
04/04/2014

Multiple Testing for Neuroimaging via Hidden Markov Random Field

Traditional voxel-level multiple testing procedures in neuroimaging, mos...
research
03/13/2019

Rejoinder: "Gene Hunting with Hidden Markov Model Knockoffs"

In this paper we deepen and enlarge the reflection on the possible advan...
research
06/21/2018

Bayesian hierarchical models for SNP discovery from genome-wide association studies, a semi-supervised machine learning approach

Genome-wide association studies (GWASs) aim to detect genetic risk facto...
research
01/11/2021

Multiple Testing in Nonparametric Hidden Markov Models: An Empirical Bayes Approach

Given a nonparametric Hidden Markov Model (HMM) with two states, the que...
research
12/21/2022

kalis: A Modern Implementation of the Li Stephens Model for Local Ancestry Inference in R

Approximating the recent phylogeny of N phased haplotypes at a set of va...
research
11/24/2019

A change-point approach to identify hierarchical organization of topologically associated domains in chromatin interaction

The identification of spatial and temporal three-dimensional (3D) genome...
research
10/01/2022

Federated Generalized Linear Mixed Models for Collaborative Genome-wide Association Studies

As the sequencing costs are decreasing, there is great incentive to perf...

Please sign up or login with your details

Forgot password? Click here to reset