Primary tabs

Analytical and Statistical Tools for Sequence Analysis for Alzheimer's Disease

Researchers: 
Funding year(s): 
2015 to 2016
Funding to date: 
$400,000

The availability of next generation sequencing data in large scale association studies for Alzheimer’s disease provides a unique research opportunity. The data contains the information that is required to identify causal disease susceptibility loci (DSL) for Alzheimer’s disease and many other mental health phenotypes and psychiatric diseases. In order to translate the wealth of information into DSL discovery for Alzheimer’s disease, powerful statistical methodology is required. So far, a large number of rare variant association tests have been proposed. However, they do not incorporate all the important information about the variants. So far, none of the existing approaches takes the physical location of the variant into account. Under the assumption that deleterious DSLs and protective DSLs cluster in different genomic regions, we will develop a general association analysis framework for Alzheimer’s disease that is built on spatial clustering approaches. The framework will be able to handle complex phenotypes, e.g. binary, quantitative, etc., and be applicable to different study designs, i.e. family-based studies and designs of unrelated subjects. If the DSLs cluster indeed, the increase of statistical power of the approach will be of practical relevance, enabling the discovery of DSLs. In the absence of DSL clustering, our approach will achieve similar power levels as existing methodology. Furthermore, in order to test larger genomic regions for association, we will develop network-based association methodology. The network-based approach will have sufficient power for larger genomic regions than existing approaches, and, at the same time, provide an intuitive understanding of the complex relationships between the variants that drive the association, fostering new biological insights. The approach can incorporate complex phenotypes and different design types. All the proposed methodology will be implemented in user-friendly software packages with existing user-communities, i.e. PBAT, NPBAT and R. We will test, validate and compare the proposed approaches with the existing methodology, using large scale simulation studies and by applications to the whole genome sequencing family study for Alzheimer’s disease from the Tanzi lab.