
Genetic determinants of cannabis use: a systematic review protocol



With the legalization of cannabis in Canada, cannabis use is trending upward. Cannabis is known to have several health implications, one of which is the development of cannabis use disorder (CUD). CUD is more common in males than in females, as well as in certain ethnic groups such as Native Americans. Additionally, both environmental and genetic risk factors have been found for cannabis use. The objective of this systematic review will be to summarize the genetic variants associated with cannabis use that have reached borderline genome-wide significance.


This systematic review will include articles that have performed a genome-wide association study (GWAS) investigating cannabis use. MEDLINE, Web of Science, EMBASE, GWAS Catalog, GWAS Central, and NIH Database of Genotype and Phenotype will be searched using a comprehensive search strategy. The quality of genetic association studies (Q-Genie) tool will be used to assess the quality of the included studies. All screening and data extraction will be performed independently by two authors. If feasible, a random-effects meta-analysis will be conducted on pooled odds ratios of single nucleotide polymorphisms reaching borderline genome-wide significance.


This systematic review will synthesize available GWAS on cannabis use. Results from this review will inform and direct further investigation of genetic variants associated with cannabis use.

Systematic review registration

PROSPERO CRD42020176016



On October 17, 2018, the Cannabis Act came into effect in Canada, allowing the legal growth of cannabis plants as well as the recreational possession and consumption of cannabis by those who are 18 years or older [1]. In response to the Cannabis Act, Statistics Canada introduced the National Cannabis Survey (NCS), which has been conducted every 3 months since February 2018. The NCS showed that nearly 17% of Canadians aged 15 years and older reported using cannabis within a 3-month period between mid-August and mid-September of 2019, a rate consistent with that of the year prior, when cannabis was an illicit substance. However, in the fourth quarter of 2019, cannabis use increased compared with the fourth quarter of 2018. Additionally, regardless of the year of study, cannabis consumption rates continue to be higher among males than females [2].

Cannabis use disorder (CUD) is defined as a problematic pattern of cannabis use leading to clinically significant impairment or distress. In 2013, the Diagnostic and Statistical Manual reported a CUD prevalence of 3.4% among youth aged 12 to 17 years and 1.5% among adults aged 18 years or older. Rates of CUD also differ by sex and ethnicity: they are higher in males than in females and higher in Native Americans and Alaska Natives than in other ethnic groups [3]. A meta-analysis of twin studies estimated the heritability of cannabis use initiation to be 40–48% and that of problematic cannabis use to be 51–59%, suggesting a genetic component to cannabis use and CUD [4]. A genome-wide association study (GWAS) combining five cohorts identified several genes and single nucleotide polymorphisms (SNPs) associated with cannabis use and dependence [5]. A cluster of correlated SNPs in a novel region of chromosome 10 was identified at genome-wide significant levels in participants of European descent [5]. However, of three meta-analyses conducted on cannabis use in the literature, only one identified a significant association [6,7,8]. One region on chromosome 16 was significantly associated with age of first cannabis use, with the strongest association for the intronic variant rs1574587 [7].

Interestingly, one study that investigated the genetic and environmental risk factors for cannabis availability reported variation in cannabis initiation and symptoms of cannabis use disorder. Cannabis availability and initiation had a correlation of 0.48, and cannabis availability and symptoms of cannabis use disorder had a correlation of 0.23. Additionally, much of the variation associated with problematic use can be explained by shared environmental risk in cannabis availability leading to initiation and by the genetic and non-shared environmental risks for cannabis initiation [9]. These findings are of specific interest to Canada and other countries where legalization of cannabis is already in effect or being considered, as cannabis has become increasingly available since legalization.

With cannabis availability increasing, and known heritability of CUD, it is important to understand the genetic risk factors associated with cannabis use. While meta-analyses of GWASs provide regions of interest, no known systematic review exists that summarizes identified genes and/or SNPs that have reached genome-wide significance for cannabis use. It is important to provide a summary of the literature which includes recent GWASs in the context of cannabis legalization. Further, understanding the genetic basis of cannabis use will assist health care workers in making science-informed decisions regarding the recommendation of recreational use and prescription of cannabis.


The main goal of this systematic review is to identify genetic variants from genome-wide association studies (GWASs) associated with cannabis use. Though genetic variants most commonly reported by GWASs are SNPs, this review will be inclusive of any other genetic markers reported in GWASs. We will summarize the results of GWASs which meet our inclusion criteria, and if possible, we will meta-analyze genetic variants that are reported in more than one primary study.

Primary objectives of this systematic review include the following:

  1. Identify genetic variants associated with current cannabis use. Current cannabis use is defined by either self-report or positive urine drug screens within 1 month of the study being conducted.

  2. Identify genetic variants associated with lifetime cannabis use. Lifetime cannabis use is defined by any self-reported or positive urine drug screens of cannabis use within one’s lifetime.

  3. Identify genetic variants associated with CUD. CUD is defined by any diagnostic and classification systems used to diagnose CUD or questionnaires validated to assess CUD.

Secondary objectives of this systematic review include the following:

  1. Identify genetic variants associated with the adverse outcomes of cannabis use including psychiatric (cognitive impairment, psychotic symptoms, depression, anxiety, suicidal behavior) and non-psychiatric (chronic bronchitis, lung infections, chronic cough, increased risk of motor vehicle accidents) [10,11,12].

  2. When feasible, perform subgroup summaries by sex or ethnic differences.

Methods and analysis

This protocol is reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocols (PRISMA-P) statement [13] (see PRISMA-P checklist in Additional file 1). This protocol was registered within the International Prospective Register of Systematic Reviews (PROSPERO) (registration number: CRD42020176016).

Eligibility criteria

GWASs presenting original data on associations between cannabis use and genetic polymorphisms using any study design (e.g., case-control or cohort) will be included in this systematic review. All other types of studies will be excluded. Studies in any setting will be included, and no restriction will be placed on age, sex, ethnic background, or language. Articles that do not present sufficient data to calculate the odds ratio (OR) with a 95% confidence interval will be excluded from quantitative analyses if the data cannot be obtained after contacting the studies’ authors and the calculations cannot be made with the available published information. However, we will include these studies in the qualitative description of the review findings.
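
As an illustration of what "sufficient data to calculate the OR with a 95% confidence interval" can mean in practice, the following minimal sketch derives an allelic odds ratio and a Wald (Woolf) interval from four allele counts. The protocol does not state which interval method will be used, so the Woolf method, the function name, and the example counts are assumptions made only for illustration.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Return (OR, lower, upper) from a 2x2 table of counts.

    a: effect-allele count in cases      b: other-allele count in cases
    c: effect-allele count in controls   d: other-allele count in controls
    The interval is the Wald (Woolf) interval on the log-odds scale;
    this choice is an assumption, not something the protocol specifies.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Woolf interval")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * se_log_or)
    upper = math.exp(log_or + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from any study.
or_, lo, hi = odds_ratio_ci(20, 80, 10, 90)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f} to {hi:.2f}")
```

A study that reports only a p value, without counts or an effect estimate and its standard error, would not supply the inputs this calculation needs.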

We will include studies investigating cannabis use disorder as defined by the Diagnostic and Statistical Manual-5 (DSM-5), by other diagnostic and classification systems such as the International Statistical Classification of Diseases and Related Health Problems-10 (ICD-10), or by specific diagnostic scales designed to screen for and diagnose dependence or use disorder of cannabis. We will also include any studies measuring any use of cannabis. We define cannabis use based on the included studies’ definitions and accept the following definitions: current cannabis use is defined as either self-report or positive urine drug screens within 1 month of the study being conducted, and lifetime cannabis use is defined as any self-reported or positive urine drug screens of cannabis use within one’s lifetime [14]. Clinical diagnoses and questionnaires validated to assess CUD will also be accepted. All studies not investigating current cannabis use, lifetime cannabis use, or CUD will be excluded. Where polymorphisms are reported in duplicate publications from the same study population, the most recent article will be included.
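
The duplicate-publication rule can be sketched as a small filter that keeps, for each study population and variant, only the most recent report. The record fields, the population label, and the variant identifier below are hypothetical placeholders; the protocol does not prescribe a data structure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    study_population: str  # hypothetical cohort label
    variant: str           # hypothetical variant identifier
    year: int              # publication year
    article: str           # hypothetical article label


def keep_most_recent(reports):
    """Keep one report per (study population, variant): the latest by year."""
    latest = {}
    for report in reports:
        key = (report.study_population, report.variant)
        if key not in latest or report.year > latest[key].year:
            latest[key] = report
    return list(latest.values())


reports = [
    Report("cohort-A", "rs0000001", 2016, "article-1"),
    Report("cohort-A", "rs0000001", 2018, "article-2"),  # supersedes article-1
    Report("cohort-B", "rs0000001", 2017, "article-3"),
]
for kept in keep_most_recent(reports):
    print(kept.study_population, kept.variant, kept.article)
```

Ties in publication year are resolved here by keeping the first report seen, which is an arbitrary choice; in the review itself such cases would presumably be settled by the reviewers.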

Information sources

A Health Science Librarian was consulted to develop a comprehensive search strategy. No language restriction will be placed on the search strategy, though studies will be limited to human studies. MEDLINE, Web of Science, EMBASE, GWAS Catalog, GWAS Central, and NIH Database of Genotype and Phenotype will be searched using the agreed-upon strategy, modified for each database. The search strategy will include all terms relevant to cannabis and genome-wide association studies. Databases will be searched from inception onwards. Sources of gray literature including dissertations and theses, clinical guidelines, and reports from regulatory agencies will be searched. Reference lists of relevant systematic reviews and all included studies will be checked to identify additional articles.

Search strategy

Draft search strategies for multiple electronic databases are provided in Additional file 2.

Study records

Data management

All of the references will be managed and organized through Zotero [15]. Covidence will be used for the management of this systematic review at the title and abstract, full text, and data extraction stages [16]. Prior to the formal screening process, a calibration will take place to pilot and refine the screening process. Training will be given to all team members on using Covidence prior to starting the review.

Selection process

Two independent reviewers will screen titles and abstracts against the inclusion criteria. Full-text review will also be completed independently by two reviewers. Disagreements between reviewers will be resolved by consensus or by involving a third reviewer. We will record the reason for excluding studies at the full-text review stage.

Data collection process

Data extraction will take place independently and in duplicate for each eligible study. Standardized full-text data extraction forms will be constructed. The data extraction form will be pilot tested by two independent reviewers to determine the feasibility of this review and ensure all details are captured. In the event of missing data, we will contact study authors to obtain missing information where possible. All contact with the authors will be documented.

Data items

We will extract the following information: author, year of study, country, cohort population used, number of participants (separated by those included in the cannabis use group and non-cannabis use group), control population, the ethnicity of participants, mean age, sex ratio, the measure of cannabis use disorder or cannabis use or definition of cannabis use, inclusion and exclusion criteria, how cannabis use was reported (e.g., self-report, urine drug screens), frequency of cannabis use, and finally any genetic variants that reached the significance threshold of p ≤ 10⁻⁷. Genome-wide significance is generally considered to apply to any SNP with a p value less than 5 × 10⁻⁸; however, SNPs reaching borderline significance, p < 10⁻⁷, will also be extracted, as borderline significance has been found to be generally replicable [17].
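
The two thresholds can be expressed as a small classifier, shown here as a sketch rather than as the review's actual extraction tooling. The function name and category labels are assumptions. The text gives the borderline cut-off both as "≤" and as "<"; the sketch uses the inclusive form and flags this in a comment.

```python
GENOME_WIDE_THRESHOLD = 5e-8  # conventional genome-wide significance
BORDERLINE_THRESHOLD = 1e-7   # extraction threshold named in the protocol


def classify_p_value(p):
    """Label a reported p value against the two thresholds."""
    if not 0 <= p <= 1:
        raise ValueError("p value must lie in [0, 1]")
    if p < GENOME_WIDE_THRESHOLD:
        return "genome-wide significant"
    # Inclusive comparison is an assumption; the protocol states both
    # "p <= 1e-7" and "p < 1e-7" for the borderline cut-off.
    if p <= BORDERLINE_THRESHOLD:
        return "borderline"
    return "not extracted"


for p in (3e-9, 7e-8, 2e-6):
    print(p, classify_p_value(p))
```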

Outcomes and prioritization

The main aim of the systematic review will be to assess variants reaching the given threshold associated with cannabis use outcomes from the primary studies included in this review.

The primary outcomes are as follows:

  1. Current cannabis use is defined as either self-reported cannabis use or positive cannabis urine drug screens within 1 month of the study being conducted.

  2. Lifetime cannabis use is defined as self-reported use of cannabis at any point during the individual’s lifetime.

  3. CUD is defined by a diagnosis from the DSM-5 or other diagnostic and classification system such as the ICD-10 or specific diagnostic scales designed to screen and diagnose dependence or use disorder of cannabis.

For each of the outcomes above, we will collect information on each outcome as reported in the primary studies meeting the eligibility criteria, including dichotomous use of cannabis, percent positive urine screens, questionnaires, and diagnostic classification.

The secondary outcomes are as follows:

  1. Adverse outcomes of cannabis use including psychiatric and non-psychiatric outcomes. We will collect data as reported in the included primary studies, such as comorbid diagnoses and additional medical conditions.

  2. We will collect information from the included primary studies on sex and ethnic groups within the study. We will provide a qualitative summary and, if feasible, conduct a subgroup meta-analysis of genetic variants within specific ethnic groups.

Risk of bias in individual studies

Quality assessment will be completed in duplicate for each included study using the quality of genetic association studies (Q-Genie) tool (version 1.1) [18]. Disagreements in quality assessments will be resolved through discussion. If a consensus is not reached through discussion, a third author will be consulted to resolve the disagreement.

Data synthesis

Studies included in this systematic review will undergo qualitative synthesis. Summary tables will be used which will include the sample size, size of cannabis group and non-cannabis group, sex distribution, mean age, study design, ethnic population, and outcome (current cannabis use, lifetime cannabis use, or CUD). A separate table will be used to display any variants reaching borderline genome-wide significance, the corresponding study it was reported in, the corresponding chromosome and position, minor allele, gene/locus, population size, outcome associated, measure, measure of association value, measure of variability, ethnicity, and p value reported.

Heterogeneity between the studies will be assessed through the I² statistic with a 95% confidence interval. We will also report summary tables including the study design, population, and cannabis use measure/definition to describe heterogeneity qualitatively. If appropriate, a random-effects meta-analysis will be conducted on pooled odds ratios for the main outcomes previously mentioned. If appropriate, a random-effects meta-analysis will also be conducted on pooled odds ratios for the secondary outcomes previously mentioned, along with subgroup analyses by participants’ sex and ethnicity. Subgroup analyses by sex will account for differences in cannabis use between sexes that have been previously reported in the literature [19,20,21]. Additionally, because of genetic differences between ethnicities, genetic associations may be more prominent in certain ethnic groups than in others; a subgroup analysis by ethnicity will therefore be conducted, if feasible [22]. Studies excluded from the quantitative analysis will be listed with the reason for exclusion.
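
To make the pooling step concrete, the sketch below combines per-study log odds ratios for a single variant with a random-effects model and reports I². The protocol does not name an estimator for the between-study variance; the DerSimonian-Laird estimator used here is an assumption, as are the function name and the example inputs. The sketch also omits the confidence interval for I² that the protocol mentions.

```python
import math


def random_effects_pool(log_ors, standard_errors, z=1.96):
    """Pool log odds ratios with a DerSimonian-Laird random-effects model.

    Returns the pooled OR, its 95% CI, the between-study variance (tau2),
    and the I-squared statistic as a percentage.
    """
    k = len(log_ors)
    if k < 2 or k != len(standard_errors):
        raise ValueError("need at least two studies with matching inputs")

    # Fixed-effect (inverse-variance) weights and pooled estimate.
    w = [1 / se ** 2 for se in standard_errors]
    fixed = sum(wi * yi for wi, yi in zip(w, log_ors)) / sum(w)

    # Cochran's Q and the DerSimonian-Laird estimate of tau^2.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, log_ors))
    df = k - 1
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c)
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0

    # Random-effects weights add tau^2 to each within-study variance.
    w_re = [1 / (se ** 2 + tau2) for se in standard_errors]
    pooled = sum(wi * yi for wi, yi in zip(w_re, log_ors)) / sum(w_re)
    se_pooled = math.sqrt(1 / sum(w_re))

    return {
        "or": math.exp(pooled),
        "ci": (math.exp(pooled - z * se_pooled), math.exp(pooled + z * se_pooled)),
        "tau2": tau2,
        "i2": i2,
    }


# Hypothetical log odds ratios and standard errors for one variant.
result = random_effects_pool([0.10, 0.25, 0.18], [0.05, 0.08, 0.06])
print(result)
```

Each subgroup analysis (for example, by sex or ethnicity) would apply the same calculation to the subset of studies or strata in that subgroup.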

If quantitative analysis is not feasible for either the primary or secondary outcomes, because of high heterogeneity found by the I² statistic or qualitative synthesis, or because no two studies report the same genetic variant, only qualitative synthesis results will be reported. We will not conduct a meta-analysis of individual participant data.
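
The condition that a variant must be reported by at least two studies before pooling can be checked mechanically. The sketch below groups extracted records by variant and outcome; the record layout, identifiers, and function name are hypothetical.

```python
from collections import defaultdict


def variants_eligible_for_pooling(records, min_studies=2):
    """Map (variant, outcome) to the studies reporting it, keeping only
    pairs reported by at least `min_studies` distinct studies.

    `records` is an iterable of (study_id, variant, outcome) tuples.
    """
    studies_by_key = defaultdict(set)
    for study_id, variant, outcome in records:
        studies_by_key[(variant, outcome)].add(study_id)
    return {
        key: sorted(studies)
        for key, studies in studies_by_key.items()
        if len(studies) >= min_studies
    }


# Hypothetical extraction records.
records = [
    ("study-1", "rs0000001", "lifetime use"),
    ("study-2", "rs0000001", "lifetime use"),
    ("study-2", "rs0000002", "CUD"),
]
print(variants_eligible_for_pooling(records))
```

An empty result would correspond to the situation in which only qualitative synthesis is reported.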


To help mitigate publication bias, conference abstracts will be included, manual searches of reference lists will be conducted, and the Cochrane Clinical Trial Protocols Registry and databases will be searched for relevant clinical trial protocols. Additionally, the GWAS Catalog will be manually searched for borderline significant variants associated with current cannabis use, lifetime cannabis use, or CUD to ensure all variants are captured within this review. Authors of conference abstracts will be contacted to determine the stage of the research project, and all correspondence will be documented. If published work was not captured by the search strategy but is deemed eligible by two independent reviewers, it will be included. Two independent reviewers will search the reference lists of all included studies. Any identified references deemed eligible by two independent reviewers will be included.

Confidence in cumulative estimate

The Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach will be used to assess the strength of evidence. GRADE rates evidence according to the risk of bias, publication bias, consistency, directness, and precision. A rating of high-, moderate-, low-, or very low-quality evidence will be assigned and summarized in a table [23].

Presenting and reporting of results

The full review will follow the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines with special consideration to the Human Genome Epidemiology Network (HuGENet) guidelines [24]. Although HuGENet reviews typically focus on a single gene, we will present information on each reported genetic variant-phenotype association, including the study details, population, findings, and source of data.


A lack of consistent evidence exists in the current literature for genetic variants associated with cannabis use. In addition, this is the first known systematic review to synthesize the available evidence on genetic variants associated with cannabis use. The proposed systematic review aims to identify all genetic variants that have reached borderline genome-wide significance associated with cannabis use and CUD. The proposed systematic review will provide an overview of the current literature on the genetics of cannabis, aiding in the genetic understanding of cannabis use. Understanding the genetic contribution to cannabis use and its effects such as cannabis use disorder has the potential to aid medical practitioners in making decisions related to cannabis use for medical reasons and the associated potential risks. Additionally, variants reaching borderline genome-wide significance will be examined in the context of their known or biologically plausible relevance to further our understanding.

Anticipated limitations of this review exist at both the study and review level. Limitations at the study level may include a lack of reporting of quality control steps, reporting of variants within linkage disequilibrium, small sample size, and a lack of reporting of variants that failed to reach genome-wide significance (p < 5 × 10⁻⁸) but may have reached borderline significance levels (p < 10⁻⁷). At the review level, limitations include the expected high heterogeneity, differing outcomes for cannabis use reported in the literature, and the exclusion of meta-analyses and candidate gene studies.

On completion of the systematic review, we will publish it in a peer-reviewed academic journal to reach both clinical and academic experts in the field. This systematic review will then inform and direct the further investigation of genetic variants associated with cannabis through candidate gene studies.

Availability of data and materials

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.



Abbreviations

CUD: Cannabis use disorder
DSM-5: Diagnostic and Statistical Manual 5th edition
GRADE: Grading of Recommendations Assessment, Development and Evaluation
GWAS: Genome-wide association study
GWASs: Genome-wide association studies
HuGENet: The Human Genome Epidemiology Network
ICD-10: International Statistical Classification of Diseases and Related Health Problems-10
PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses
PRISMA-P: Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocols
Q-Genie: The quality of genetic association studies


References

  1. The Government of Canada. Cannabis legalization and regulation. 2018.

  2. Statistics Canada. National Cannabis Survey, third quarter 2019. 2019.

  3. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5th ed. Arlington: American Psychiatric Publishing; 2013.

  4. Verweij KJH, Zietsch BP, Lynskey MT, Medland SE, Neale MC, Martin NG, et al. Genetic and environmental influences on cannabis use initiation and problematic use: a meta-analysis of twin studies. Addiction. 2010;105(3):417–30.

  5. Agrawal A, Chou YL, Carey CE, Baranger DAA, Zhang B, Sherva R, et al. Genome-wide association study identifies a novel locus for cannabis dependence. Mol Psychiatry. 2018;23(5):1293–302.

  6. Verweij K, Vinkhuyzen A, Benyamin B, Lynskey M, Quaye L, Agrawal A, et al. The genetic etiology of cannabis use initiation: a meta-analysis of genome-wide association studies and a SNP-based heritability estimation. Addict Biol. 2013;18(5):846–50.

  7. Minică CC, Verweij KJH, Most PJ, Mbarek H, Bernard M, Eijk KR, et al. Genome-wide association meta-analysis of age at first cannabis use. Addiction. 2018;113(11):2073–86.

  8. Stringer S, Minică CC, Verweij KJH, Mbarek H, Bernard M, Derringer J, et al. Genome-wide association study of lifetime cannabis use based on a large meta-analytic sample of 32 330 subjects from the International Cannabis Consortium. Transl Psychiatry. 2016;6(3):e769.

  9. Gillespie NA, Neale MC, Kendler KS. Pathways to cannabis abuse: a multi-stage model from cannabis availability, cannabis initiation and progression to abuse. Addiction. 2009;104(3):430–8.

  10. The Government of Canada. Cannabis health effects. 2019.

  11. Hall W. What has research over the past two decades revealed about the adverse health effects of recreational cannabis use? Addiction. 2015;110(1):19–35.

  12. Gobbi G, Atkin T, Zytynski T, Wang S, Askari S, Boruff J, et al. Association of cannabis use in adolescence and risk of depression, anxiety, and suicidality in young adulthood: a systematic review and meta-analysis. JAMA Psychiatry. 2019;76(4):426–34.

  13. Shamseer L, Moher D, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;349:g7647.

  14. National Academies of Sciences Engineering and Medicine. The health effects of cannabis and cannabinoids: the current state of evidence and recommendations for research. Washington, DC: National Academies Press; 2017.

  15. Center for History New Media. Zotero: The next-generation research tool: George Mason University; 2009.

  16. Veritas Health Innovation. Covidence systematic review software. Melbourne.

  17. Dudbridge F, Gusnanto A. Estimation of significance thresholds for genomewide association scans. Genet Epidemiol Off Publ Int Genet Epidemiol Soc. 2008;32(3):227–34.

  18. Sohani ZN, Meyre D, de Souza RJ, Joseph PG, Gandhi M, Dennis BB, et al. Assessing the quality of published genetic association studies in meta-analyses: the quality of genetic studies (Q-Genie) tool. BMC Genet. 2015;16:50.

  19. Cooper ZD, Haney M. Investigation of sex-dependent effects of cannabis in daily cannabis smokers. Drug Alcohol Depend. 2014;136:85–91.

  20. Lev-Ran S, Imtiaz S, Taylor BJ, Shield KD, Rehm J, Le Foll B. Gender differences in health-related quality of life among cannabis users: results from the National Epidemiologic Survey on Alcohol and Related Conditions. Drug Alcohol Depend. 2012;123(1–3):190–200.

  21. Van Gastel WA, MacCabe JH, Schubart CD, Van Otterdijk E, Kahn RS, Boks MPM. Cannabis use is a better indicator of poor mental health in women than in men: a cross-sectional study in young adults from the general population. Community Ment Health J. 2014;50(7):823–30.

  22. Marees AT, de Kluiver H, Stringer S, Vorspan F, Curis E, Marie-Claire C, et al. A tutorial on conducting genome-wide association studies: quality control and statistical analysis. Int J Methods Psychiatr Res. 2018;27(2):e1608.

  23. Guyatt GH, Oxman AD, Schünemann H, Tugwell P, Knottnerus A. GRADE guidelines: a new series of articles in the Journal of Clinical Epidemiology. J Clin Epidemiol. 2011;64(4):380–2.

  24. Little J, Higgins JPT, Bray M, Ioannidis J, Khoury M, Manolio T, et al. The HuGENet™ HuGE review handbook, version 1.0. Ottawa, Ontario: Canada HuGENet Canada Coord Cent; 2006.


If amendments to this protocol are made, they will be documented and communicated to the journal. The date of the amendment, a description, and the rationale will accompany each amendment.


This study is supported in part by CIHR (grant number PJT-156306). The funding source has no role in the study design, analysis, reporting, or the decision to publish the study.

Author information

Authors and Affiliations



ZS is the guarantor. AH and CC drafted the manuscript. AH, CC, and ZS contributed to the development of the selection criteria, the risk of bias assessment strategy, and data extraction criteria. SS provided expertise in developing the search strategy. All authors read, provided feedback, and approved the final manuscript.

Corresponding author

Correspondence to Zainab Samaan.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1.

PRISMA-P 2015 checklist.

Additional file 2.

Search strategy.

Rights and permissions

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.


About this article


Cite this article

Hillmer, A., Chawar, C., Sanger, S. et al. Genetic determinants of cannabis use: a systematic review protocol. Syst Rev 9, 190 (2020).
