Statistical Methods for Genetic Variants Detection with Epigenomic Information

Statistical Methods for Genetic Variants Detection with Epigenomic Information
Author: Maria Constanza Rojo
Publisher:
Total Pages: 158
Release: 2019
Genre:
ISBN:


Download Statistical Methods for Genetic Variants Detection with Epigenomic Information Book in PDF, Epub and Kindle

Genome-wide association studies (GWAS) have successfully identified thousands of genetic variants contributing to disease and other phenotypes. However, significant obstacles hamper our ability to elucidate causal variants, identify genes affected by causal variants, and characterize the mechanisms by which genotypes influence phenotypes. The increasing availability of genome-wide functional annotation data provides unique opportunities to incorporate prior information into the analysis of GWAS to better understand the impact of variants on disease etiology. Regulatory genomic information has been recognized as a potential source that can improve the detection and biological interpretation of single-nucleotide polymorphisms (SNPs) in GWAS. Although there have been many advances in incorporating prior information into the prioritization of trait-associated variants in GWAS, functional annotation data has played a secondary role in the joint analysis of GWAS and molecular (i.e., expression) quantitative trait loci (eQTL) data in assessing evidence of association. Moreover, current methodologies that aim to integrate such annotation information focus mainly on fine-mapping and overlook the importance of its usage in earlier stages of GWAS analysis. Equally important, there is a lack of development in proper statistical frameworks that can perform selection of annotations and SNPs jointly. To address these shortcomings, we develop two statistical models: iFunMed and GRAD. iFunMed is a novel mediation framework to integrate GWAS and eQTL data with the utilization of publicly available functional annotation data. iFunMed extends the scope of standard mediation analysis by incorporating information from multiple genetic variants at a time and leveraging variant-level summary statistics. GRAD integrates high-dimensional auxiliary information into high-dimensional regression. This method allows annotation information to assist the detection of important genetic variants while identifying relevant annotation simultaneously. We provide an upper bound for the estimation error of the SNP effect sizes to gain insights on what factors affect estimation accuracy. For iFunMed, data-driven computational experiments convey how informative annotations improve SNP selection performance while emphasizing the robustness of the model to non-informative annotations. Applications to the Framingham Heart Study data indicate that iFunMed is able to boost the detection of SNPs with mediation effects that can be attributed to regulatory mechanisms. Simulation experiments indicate that GRAD can improve the identification of genetic variants by increasing the average area under the precision-recall curve by up to 60\%. Real data applications to the Framingham Heart Study show that GRAD can select relevant genetic variants while detecting several transcription factors involved in specific phenotypical changes.


Statistical Methods for Genetic Variants Detection with Epigenomic Information
Language: en
Pages: 158
Authors: Maria Constanza Rojo
Categories:
Type: BOOK - Published: 2019 - Publisher:

GET EBOOK

Genome-wide association studies (GWAS) have successfully identified thousands of genetic variants contributing to disease and other phenotypes. However, signifi
Statistical Methods in Genetic Epidemiology
Language: en
Pages: 458
Authors: Duncan C. Thomas
Categories: Medical
Type: BOOK - Published: 2004-01-29 - Publisher: Oxford University Press

GET EBOOK

This well-organized and clearly written text has a unique focus on methods of identifying the joint effects of genes and environment on disease patterns. It fol
Computational and Statistical Epigenomics
Language: en
Pages: 218
Authors: Andrew E. Teschendorff
Categories: Science
Type: BOOK - Published: 2015-05-12 - Publisher: Springer

GET EBOOK

This book introduces the reader to modern computational and statistical tools for translational epigenomics research. Over the last decade, epigenomics has emer
Assessing Rare Variation in Complex Traits
Language: en
Pages: 262
Authors: Eleftheria Zeggini
Categories: Medical
Type: BOOK - Published: 2015-08-13 - Publisher: Springer

GET EBOOK

This book is unique in covering a wide range of design and analysis issues in genetic studies of rare variants, taking advantage of collaboration of the editors
Statistical Methods for the Analysis of Genomic Data
Language: en
Pages: 136
Authors: Hui Jiang
Categories: Science
Type: BOOK - Published: 2020-12-29 - Publisher: MDPI

GET EBOOK

In recent years, technological breakthroughs have greatly enhanced our ability to understand the complex world of molecular biology. Rapid developments in genom