Biostatistics: From Local Ancestry to Rare Variant Analysis: Some Current Issues in Genetic Association Studies

Current genetic association studies face many challenges as they move into the “post-GWAS” era. These include the impact of local ancestry, the design of sequencing studies, and the analysis of rare variants. In this talk, I first discuss the mechanisms by which admixture may lead to confounding by population substructure and to heterogeneity due to differential patterns of linkage disequilibrium (LD). Understanding these mechanisms has important implications for the performance of association tests, and I demonstrate how the use of local ancestry can help to highlight novel findings in the University of Southern California’s Children’s Health Study investigating asthma risk. Next, I discuss issues involving the design and analysis of association studies using next-generation sequencing. Although the ability to identify individual-level data is lost (without bar-coding), sequencing pooled samples can have many design advantages for both discovery and association testing. For pooled data, I present a hierarchical Bayesian modeling approach that estimates the association of each variant using pools of cases and controls while accounting for sequencing error and the variation in read depth across pools. I discuss how the optimal design and performance are influenced by the number of pools, the number of individuals within each pool, and the average coverage per pool. This approach is then contrasted with a novel approach to rare variant analysis of individual-level data. Here, a Bayesian model uncertainty approach is used to average over the inclusion and direction of effect for each variant in a risk index. The approach allows for inference at both the group and variant-specific levels and shows increased power over alternative rare variant analysis methods. Future design and analytic approaches in “post-GWAS” genetic association studies must be cognizant of the interplay between many underlying mechanisms. I end the talk by discussing methodological extensions for accounting for local ancestry in rare variant analysis and for incorporating prior biological information.

Event Information

Date & Time(s)

Wednesday, May 22, 2013, 11:00 AM – 12:00 PM

Speaker(s)

David Conti
Division of Biostatistics, Department of Preventive Medicine
Zilkha Neurogenetic Institute, Keck School of Medicine
University of Southern California

Audience

This program is for the research community.

Location Information

Memorial Sloan Kettering Cancer Center
307 E 63rd Street, 3rd Floor conference room