The goal of this proposal is to identify genes that affect susceptibility to stimulant and cannabis dependence using whole genome sequencing with genotype imputation. The advent of increasingly economical whole genome sequencing provides new opportunities to identify trait-associated sequence variations not detected by linkage or association methods. It is not certain, however, what the most effective analytic method for identifying trait-related loci will be. Thus, we propose to study four distinct analytic approaches to identify sequence variants that affect stimulant and cannabis dependence in three cohorts using a state-of-the-art informatics infrastructure. The three study cohorts were ascertained as part of the Mission Indian Study (PI Cindy Ehlers), the combined Yale-University of Connecticut Addiction Study samples (PI Joel Gelernter) and the San Francisco Family Study (PI Kirk Wilhelmsen). Because we are studying populations with three different continental origins, we will ascertain whether the major genetic risk factors for the traits of interest are shared or population-specific. Two of the three populations that make up our sample (i.e., Native Americans and African Americans) are understudied. Inclusion of these cohorts will allow for analyses within and across populations that differ in ascertainment strategies and ethnic composition, permitting strong tests of replication across populations and enabling direct comparisons of ascertainment and analytic approaches as they apply to genetic studies of addiction. We expect that the approach used for this study will evolve. Currently we plan to use four complementary analytic strategies: 1) a conventional SNP analysis of genotypes detected using >5X sequence coverage for polymorphisms with minor allele frequency (MAF) greater than 0.1%;2) a gene-based approach to determine whether more cases than controls have rare sequence variants (MAF<1%) likely to affect the function of a given gene;3) use of a simplified form of affected-only linkage analysis of known and distantly related individuals to identify long chromosomal segments that are likely to be shared identical by descent;and 4) extended analysis of higher-level structural variation. The technology and economics of DNA sequence analysis is rapidly evolving. Based on current costs we plan to complete >5X whole genome sequencing of at least 3000 subjects. Critical to our decision to pursue low-pass genomic sequencing was the development of multipoint imputation methods. Simulation analysis indicates that for the same cost more subjects can undergo >5X whole genome sequencing with imputation than exomic sequencing, thus providing more exomic data as well as rich data for the rest of the genome. The findings from this study may provide insight into the genetics of other substances of abuse which can be confirmed by data sharing with other projects responding to this RFA and to other projects in the NIDA Genetics Consortium.

Public Health Relevance

Cannabis and stimulants (methamphetamine and cocaine) are the most commonly used illicit drugs worldwide, and thus pose significant public health issues. Because cannabis and stimulant dependence have a substantial shared genetic etiology and exhibit significant genetic correlations with other drug dependencies, identifying sequence variants that affect cannabis and stimulant dependence should provide insights into the pathophysiology of dependence and inform prevention and treatment strategies.

National Institute of Health (NIH)
National Institute on Drug Abuse (NIDA)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZDA1-MXL-F (12))
Program Officer
Caulder, Mark
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of North Carolina Chapel Hill
Schools of Medicine
Chapel Hill
United States
Zip Code
Edwards, Alexis C; Deak, Joseph D; Gizer, Ian R et al. (2018) Meta-Analysis of Genetic Influences on Initial Alcohol Sensitivity. Alcohol Clin Exp Res 42:2349-2359
Sanghvi, Rashesh V; Buhay, Christian J; Powell, Bradford C et al. (2018) Characterizing reduced coverage regions through comparison of exome and genome sequencing data across 10 centers. Genet Med 20:855-866
Gizer, Ian R; Bizon, Chris; Gilder, David A et al. (2018) Whole genome sequence study of cannabis dependence in two independent cohorts. Addict Biol 23:461-473
Ding, Kuan-Fu; Finlay, Darren; Yin, Hongwei et al. (2018) Network Rewiring in Cancer: Applications to Melanoma Cell Lines and the Cancer Genome Atlas Patients. Front Genet 9:228
Ding, Kuan-Fu; Petricoin, Emanuel F; Finlay, Darren et al. (2018) Nonlinear mixed effects dose response modeling in high throughput drug screens: application to melanoma cell line analysis. Oncotarget 9:5044-5057
Yin, Xianyong; Bizon, Chris; Tilson, Jeffrey et al. (2017) Genome-wide meta-analysis identifies a novel susceptibility signal at CACNA2D3 for nicotine dependence. Am J Med Genet B Neuropsychiatr Genet :
Wang, Qian; Polimanti, Renato; Kranzler, Henry R et al. (2017) Genetic factor common to schizophrenia and HIV infection is associated with risky sexual behavior: antagonistic vs. synergistic pleiotropic SNPs enriched for distinctly different biological functions. Hum Genet 136:75-83
Jensen, K P; Smith, A H; Herman, A I et al. (2017) A protocadherin gene cluster regulatory variant is associated with nicotine withdrawal and the urge to smoke. Mol Psychiatry 22:242-249
Ding, Kuan-Fu; Finlay, Darren; Yin, Hongwei et al. (2017) Analysis of variability in high throughput screening data: applications to melanoma cell lines and drug responses. Oncotarget 8:27786-27799
Melroy-Greif, Whitney E; Wilhelmsen, Kirk C; Yehuda, Rachel et al. (2017) Genome-Wide Association Study of Post-Traumatic Stress Disorder in Two High-Risk Populations. Twin Res Hum Genet 20:197-207

Showing the most recent 10 out of 79 publications