Comparative effectiveness and safety of pharmacological and non-pharmacological interventions for insomnia: an overview of reviews
Systematic Reviews volume 8, Article number: 281 (2019)
This review aimed to assess the existing evidence regarding the clinical effectiveness and safety of pharmacological and non-pharmacological interventions in adults with insomnia and identify where research or policy development is needed.
MEDLINE, Embase, PsycINFO, The Cochrane Library, and PubMed were searched from inception until June 14, 2017, along with relevant gray literature sites. Two reviewers independently screened titles/abstracts and full-text articles, and a single reviewer with an independent verifier completed charting, data abstraction, and quality appraisal.
A total of 64 systematic reviews (35 with meta-analysis) were included after screening 5024 titles and abstracts and 525 full-text articles. Eight of the included reviews were rated as high quality using the Assessment of Multiple Systematic Reviews 2 (AMSTAR2) tool, and over half of the included articles (n = 40) were rated as low or critically low quality. Consistent evidence of effectiveness across multiple outcomes based on more than one high- or moderate quality review with meta-analysis was found for zolpidem, suvorexant, doxepin, melatonin, and cognitive behavioral therapy (CBT), and evidence of effectiveness across multiple outcomes based on one high-quality review with meta-analysis was found for temazepam, triazolam, zopiclone, trazodone, and behavioral interventions. These interventions were mostly evaluated in the short term (< 16 weeks), and there was very little harms data available for the pharmacological interventions making it difficult to evaluate their risk-benefit ratio.
Assuming non-pharmacological interventions are preferable from a safety perspective, CBT can be considered an effective first-line therapy for adults with insomnia, followed by other behavioral interventions. Short courses of pharmacological interventions can be supplements to CBT or behavioral therapy; however, no evidence regarding the appropriate duration of pharmacological therapy is available from these reviews.
Systematic review registration
PROSPERO CRD42017072527
Insomnia is a common disorder in the general population. While precise estimates vary, multiple population-based studies in different countries have consistently found that approximately one third of adults (> 18 years of age) reported dissatisfaction with their sleep and at least one symptom of insomnia [1, 2], and 6–10% of the adult population met stricter criteria for a diagnosis of insomnia such as those of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) or the International Classification of Sleep Disorders (ICSD). Insomnia can contribute to significant functional impairments at work or at home and is linked to reduced quality of life, problems with attention and memory, mood disturbances, and reduced ability to carry out normal daily activities. Furthermore, studies have indicated that insomnia may be an important risk factor for the onset of mental health disorders such as depression, anxiety, and substance abuse.
Clinical practice guidelines published in the USA, Canada, and Europe unanimously recommend that non-pharmacological approaches, especially cognitive behavioral therapies, should be the first-line treatment for chronic insomnia (symptoms for > 3 months) and that pharmacological treatment should only be used in acute cases (< 3 months) or as a short-term supplement to non-pharmacological approaches [6,7,8]. Evidence for over-the-counter (e.g., diphenhydramine) or natural remedies (melatonin, valerian) is considered weak or inconclusive, and these approaches are not recommended for acute or chronic insomnia [6,7,8]. Despite this, the rate of prescription sleep aid use, particularly non-benzodiazepines and off-label use of antidepressants, has risen significantly over the last 20 years [9,10,11], in some cases outpacing the diagnosis of sleep disorders among the general population. Furthermore, a large prospective study of former and current insomnia sufferers found that 70% of patients using a prescription sleep aid continued to do so at 1-year follow-up but did not demonstrate significant improvements in sleep compared to non-users. The use of non-prescription sleep aids is also common alongside prescription drugs; up to 60% of sleep aids used by adults with insomnia are non-prescription [12, 13].
Evidence is needed to support the development of guidelines that encourage the appropriate use of pharmacological interventions to treat insomnia and increase access to and uptake of non-pharmacological approaches. The objective of this overview of systematic reviews was to assess what has been established regarding the clinical effectiveness and safety of pharmacological and non-pharmacological interventions in adults with insomnia and identify areas where further research or policy development is needed.
This overview was commissioned by the Canadian Agency for Drugs and Technologies in Health (CADTH) as part of an assessment of the management of insomnia in adults in Canada. In accordance with guidance from the Cochrane Handbook, a protocol for the overview of systematic reviews was written a priori by the research team in consultation with the project owner and other stakeholders. The protocol was registered with the PROSPERO database (CRD42017072527), and the full version can be found in Additional file 1. Results are reported using the Preferred Reporting Items for Overviews of Systematic Reviews Including Harms (PRIO-harms) checklist (Additional file 2: Appendix A). As the methods have been reported fully in our report that was produced for CADTH, they are outlined briefly here.
Eligibility criteria for the overview were established using the Population, Intervention, Comparator, Outcome, and Study design (PICOS) framework to include the following:
Patients: adults > 18 years of age diagnosed with acute (< 3 months) or chronic (> 3 months) insomnia disorder according to the DSM diagnostic criteria, International Classification of Sleep Disorders, or Research Diagnostic Criteria for insomnia.
Interventions: prescription or non-prescription pharmacological interventions used to treat insomnia approved for use or under review for approval in Canada; non-pharmacological interventions included cognitive behavioral therapy, sleep restriction, relaxation, meditation, etc.; or a combination of pharmacological and non-pharmacological interventions. Herbal remedies or complementary and alternative medicine (CAM) were ineligible; exceptions were made for melatonin and mindfulness-based therapies as they were of special interest to stakeholders.
Comparator: inactive controls (e.g., placebo, wait-list control, self-monitoring) or active controls (e.g., another eligible intervention).
Outcomes (effectiveness): sleep onset latency (SOL), total sleep time (TST), wake after sleep onset (WASO), sleep quality (SQ), sleep satisfaction (SS), sleep efficiency (SE), Insomnia Severity Index (ISI) scores, fatigue severity, and health-related quality of life (HRQoL).
Outcomes (harms): hangover/morning sedation, accidental injuries, additional healthcare use related to harms of the intervention, delirium related to the intervention, sleep disordered breathing related to the intervention, addiction, dependence, or diversion of medications (A/D/D), and all-cause mortality related to the intervention.
Study design: systematic knowledge syntheses including primary studies of any design with or without a meta-analysis, using the Cochrane Collaboration definition. Reviews were required to report that a literature search was carried out in at least one database in order to be eligible; articles identified as rapid reviews, literature reviews, narrative reviews, or other non-systematic knowledge syntheses were excluded from the overview.
Other: Published or unpublished systematic reviews were eligible for inclusion, as well as publications in any language.
Published literature was identified by searching MEDLINE, Embase, PsycINFO, The Cochrane Library, and PubMed from inception until June 14, 2017. The search strategy contained both controlled vocabulary (MeSH terms) and relevant keywords (e.g., insomnia, sleep initiation disorder), and a methodological filter was applied to limit the search to systematic reviews and meta-analyses. No date or language restrictions were applied. The search strategy was developed by an experienced librarian (BS) and peer-reviewed by another librarian (SJ) using the PRESS Checklist; searches were carried out by an experienced information specialist (AE); the full search strategy is available in Additional file 2: Appendix B. Unpublished (or gray) literature was identified by searching sites based on the Gray Matters checklist; the full list is available in Additional file 2: Appendix B. The literature search was supplemented by reviewing the bibliographies of the included reviews and other key papers, as well as contacting the authors of relevant conference abstracts and review protocols for manuscripts or unpublished data.
Study selection and data abstraction
Calibration exercises were completed with the review team prior to level 1 (title/abstract) and level 2 (full-text) screening, the charting exercise, and data abstraction to ensure reliability of the processes and revise forms as needed. Only one round of calibration using 25 citations was required prior to level 1 screening (> 75% agreement), charting (5 articles), and data abstraction (6 articles), while two rounds of calibration (> 75% agreement) were required prior to level 2 screening (15 and 25 articles, respectively). Level 1 and 2 screening was completed in duplicate by pairs of reviewers working independently, with any discrepancies resolved by a third reviewer; charting and data abstraction were completed by a single reviewer and verified by a second. Screening was completed using synthesiSR, proprietary online software developed by the Knowledge Translation Program of St. Michael’s Hospital.
A charting exercise was completed prior to data abstraction to collect information on review characteristics, particularly how outcomes were reported and which outcome measures were used in the included reviews. Data abstraction items included review characteristics (e.g., year of conduct/literature search, type of included study designs), patient characteristics (e.g., type and number of patients, mean age, and standard deviation), interventions examined (e.g., type of intervention, dose/frequency), and outcomes examined (e.g., name of outcome, outcome measure/definition). A list of the primary studies included in all of the systematic reviews with meta-analysis (SR + MAs) was compiled and cross-referenced with the primary studies included in the SRs. Any SRs that completely overlapped with the primary studies included in the abstracted SR + MAs (i.e., did not contribute any new evidence) were excluded from the overview.
Quality appraisal and assessment of evidence
Quality appraisal was completed concurrently with data abstraction using the Assessing the Methodological Quality of Systematic Reviews tool version 2 (AMSTAR2). The tool was tested in the same calibration exercises as the data abstraction form and assessments were completed by one reviewer and verified by a second. Additionally, a GRADE algorithm developed for Cochrane overviews of reviews was used to ascertain the strength of evidence of the reviews included in each treatment comparison for all outcomes. In this algorithm, each review starts with a ranking of high certainty and is downgraded 1 level for serious methodological concerns (sample size between 100 and 199 participants; high risk of bias in randomization and blinding for > 75% included studies; high heterogeneity (I² > 75%); and “No” on one of these AMSTAR2 items: a priori research design, comprehensive literature search, duplicate study selection, or duplicate study abstraction) or 2 levels for very serious concerns (sample size < 100 participants and “No” on two or more of these AMSTAR2 items: a priori research design, comprehensive literature search, duplicate study selection, or duplicate study abstraction).
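As a rough illustration of the downgrading rules described above, the following Python sketch assigns a strength-of-evidence level to a single review. The class, field, and function names are hypothetical. The sketch also assumes that downgrades from different concerns accumulate and that the two "very serious" criteria (sample size and AMSTAR2 items) each trigger a 2-level downgrade on their own; the text does not confirm either point, so this should be read as one possible interpretation and not as the algorithm actually used.

```python
from dataclasses import dataclass

# Levels ordered from best to worst, using the labels that appear in this overview.
LEVELS = ["high", "medium", "low", "very low"]


@dataclass
class ReviewEvidence:
    """Hypothetical container for the inputs the downgrading rules refer to."""
    sample_size: int                # total participants in the review
    frac_high_risk_of_bias: float   # share of studies at high risk of bias (0 to 1)
    i_squared: float                # heterogeneity, in percent
    amstar2_no_count: int           # "No" answers among the four named AMSTAR2 items


def grade_strength(review: ReviewEvidence) -> str:
    """Start at 'high' and downgrade for each concern (assumed cumulative)."""
    downgrades = 0

    # Sample size: < 100 is very serious, 100 to 199 is serious.
    if review.sample_size < 100:
        downgrades += 2
    elif review.sample_size < 200:
        downgrades += 1

    # High risk of bias in randomization and blinding for > 75% of studies.
    if review.frac_high_risk_of_bias > 0.75:
        downgrades += 1

    # High heterogeneity (I-squared > 75%).
    if review.i_squared > 75:
        downgrades += 1

    # AMSTAR2 items: one "No" is serious, two or more is very serious.
    if review.amstar2_no_count >= 2:
        downgrades += 2
    elif review.amstar2_no_count == 1:
        downgrades += 1

    # The rating cannot fall below the lowest level.
    return LEVELS[min(downgrades, len(LEVELS) - 1)]


if __name__ == "__main__":
    example = ReviewEvidence(sample_size=150, frac_high_risk_of_bias=0.2,
                             i_squared=40.0, amstar2_no_count=0)
    print(grade_strength(example))
```

Under these assumptions, a review with 150 participants and no other concerns would be downgraded once, from high to medium.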
No formal statistical analysis was planned for this overview as substantial clinical and methodological heterogeneity was expected across the included reviews and pooling the data or conducting an indirect comparison would not be appropriate in this situation. Lists of the primary studies in each included review were collated and cross-referenced in a matrix of evidence tables to ascertain the degree of overlap between reviews for each treatment comparison and outcome to provide context for the results. Additionally, a matrix of evidence for the entire overview was prepared and used to calculate the “corrected covered area” (CCA) to quantify the degree of overlap between all of the reviews included in this work.
Patient and public involvement
Patients and/or public were not involved in the development, design, or conduct of this research.
The literature search resulted in 5024 titles and abstracts to be screened after de-duplication, 4499 of which were excluded after level 1 screening for not meeting eligibility criteria (Fig. 1). A total of 525 full-text articles were retrieved for screening at level 2, where a further 312 articles were excluded, leaving 213 articles eligible for data abstraction (the list of excluded studies is available upon request). After completion of the charting exercise and data abstraction, a total of 64 articles, 34 published SR + MAs [25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58] and one unpublished SR + MA (Dr. Hae Sun Suh, unpublished data 2018) and 29 SRs [59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87], were included in this overview. A total of 358 index publications (primary studies) were cited 612 times across the 64 SR + MAs and SRs included in this overview, resulting in a CCA of 0.011, indicating little to no overlap across the included reviews.
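The CCA reported above can be reproduced from the three counts given in the text. The sketch below assumes the standard definition of the corrected covered area, CCA = (N − r) / (r × c − r), where N is the total number of citations of primary studies across reviews (counting repeats), r is the number of index publications, and c is the number of reviews. The function and parameter names are illustrative and are not taken from the overview.

```python
def corrected_covered_area(total_citations: int,
                           index_publications: int,
                           reviews: int) -> float:
    """Corrected covered area: CCA = (N - r) / (r * c - r).

    N = total_citations    (primary studies counted once per review citing them)
    r = index_publications (unique primary studies, the rows of the matrix)
    c = reviews            (the columns of the matrix)
    """
    repeated_citations = total_citations - index_publications
    possible_repeats = index_publications * reviews - index_publications
    return repeated_citations / possible_repeats


if __name__ == "__main__":
    # Counts reported in this overview: 358 index publications cited
    # 612 times across 64 reviews.
    cca = corrected_covered_area(total_citations=612,
                                 index_publications=358,
                                 reviews=64)
    print(f"CCA = {cca:.3f}")  # prints "CCA = 0.011"
```

With these counts, 254 of a possible 22,554 repeated citations occurred, which corresponds to the reported value of 0.011 (about 1%).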
The included reviews were conducted between 1997 and 2017 with the majority (75%) published after 2010 (Table 1; Additional file 2: Appendix C). Literature search dates for the included reviews ranged from 1996 to 2016 with more than half (62%) being conducted after 2010 (Table 1; Additional file 2: Appendix C). Only 11 (17%) of the included reviews searched databases from inception, and a further 5 (7%) reviews ran searches going back more than 50 years. The first authors of the SR + MAs were predominantly based in Asia (43%), specifically China (7/35), while the majority of SR authors were based in North America (65%), predominantly in the US (17/29). An average of 27 primary studies (range 3–139) were included in the SR + MAs, and an average of 8 primary studies (range 2–22) were included in the SRs. Randomized controlled trials were the most commonly included primary study design, appearing in 33 SR + MAs (94%) and 23 SRs (79%). Non-randomized controlled trials (NRCTs) were the next most common (7 SRs, 24%), followed by quasi-experimental study designs (1 SR + MA, 3%; 3 SRs, 10%) and observational studies (4 SRs, 14%). Two SR + MAs and four SRs did not report the specific study designs included for review.
Study and patient characteristics
The overall sample size was reported in 24/35 SR + MAs and 21/29 SRs, averaging 1861 patients (range 171–6303) and 566 patients (range 34–1794), respectively. Other population characteristics such as mean age and the proportion of female participants appeared in only 7 SR + MAs and 1 SR. The majority of included reviews included patients with insomnia and another co-morbid condition (20 SR + MAs, 57%; 18 SRs, 62%); 12 SR + MAs (34%) and 6 SRs (21%) included patients with insomnia alone; and 3 SR + MAs (9%) and 5 SRs (17%) did not report on the presence or absence of co-morbidities in the patient population (Table 1; Additional file 2: Appendix C).
Interventions and outcomes
The included SR + MAs and SRs examined a total of 32 different treatment comparisons across 11 different classes of interventions. All of the reported interventions were compared with at least one kind of inactive control (e.g., placebo/sham intervention, wait-list, symptom monitoring), and 8 of the reported interventions were compared with an active control (e.g., another eligible intervention; Table 2; Additional file 2: Appendix C).
Relevant SR + MAs or SRs that examined at least one eligible intervention could be identified for all of the effectiveness outcomes, but relevant SR + MAs or SRs could only be identified for three of the harms outcomes: hangover or morning sedation, accidental injuries, and addiction, dependence, or diversion related to an intervention.
Quality appraisal and strength of evidence results
Only six SR + MAs (17%) and two SRs (7%) were rated as high quality using the AMSTAR2 tool; the remainder were rated as moderate quality (11 SR + MAs, 31%; 5 SRs, 17%), low quality (8 SR + MAs, 23%; 5 SRs, 17%), or critically low quality (10 SR + MAs, 29%; 17 SRs, 59%; Fig. 2). The full AMSTAR2 results are available in Additional file 2: Appendix D.
Out of the 11 classes of interventions included in this review, only two comparisons (melatonin compared to inactive controls and CBT compared to inactive controls) included reviews rated with a high strength of evidence based on GRADE, and nine comparisons (benzodiazepines, non-benzodiazepines, suvorexant, antidepressants, melatonin, CBT, behavioral interventions, and mindfulness-based interventions all compared to inactive controls; and CBT compared to active controls) included reviews rated with a medium strength of evidence (Table 3). Five comparisons included in this overview (antipsychotics, diphenhydramine, and combination therapies all compared to inactive controls; non-benzodiazepines and antidepressants compared to active controls) only included reviews rated as having a low or very low strength of evidence based on GRADE (Table 3).
All of our results have been transparently reported in our report for CADTH that is available on their website, as well as in Additional file 2: Tables E1-E11 Appendix E. To focus our results for this publication, only the statistically significant results from SR + MAs are included in the text. For outcomes where no evidence from SR + MAs could be identified, positive results from individual studies included in relevant SRs are reported. Tables with the overlap in the primary studies included in the SRs and SR + MAs can be found in Tables F1-F11 Additional file 2: Appendix F and in Additional file 3.
One high-quality SR + MA compared flurazepam to placebo and found improvements in SOL (10 RCTs, 532 patients; Table 3; Table E1 Additional file 2: Appendix E). One high-quality and one critically low-quality SR + MA compared temazepam to placebo and found statistically significant improvements in SOL (2 RCTs, 72 patients), TST (2 RCTs, 72 patients), WASO (2 RCTs, 77 patients), and SQ (2 RCTs, 78 patients; Table 3; Table E1 Appendix E, Table F1 Additional file 2: Appendix F). One high-quality and one critically low-quality SR + MA compared triazolam to placebo and found significant improvements in SOL (8 RCTs, 539 patients and 28 RCTs, sample size not reported [NR]), TST (12 RCTs, sample size NR), and WASO (2 RCTs, 57 patients; Table 3).
Non-benzodiazepine receptor agonists
Two high-quality [26, 41] and two critically low-quality [45, 48] SR + MAs compared zolpidem to placebo and found improvements in SOL (5 to 29 RCTs, 355 to 1805 patients), TST (2 to 23 RCTs, 112 to 890 patients), WASO (8 RCTs, 896 patients), SQ (3 RCTs, 557 patients and 6 RCTs, 638 patients), and SE (4 RCTs, 226 patients; Table 3; Table E2 Appendix E, Table F2 Additional file 2: Appendix F). Also, one critically low-quality SR compared nightly zolpidem doses to zolpidem “as needed” and found an increase in HRQoL for both groups (1 study, 789 patients; Table 3; Table E2 Additional file 2: Appendix E). One critically low-quality SR compared zolpidem to triazolam and found improvements in TST (1 study, 16 patients), WASO (3 studies, 102 patients), and SE (2 studies, 86 patients; Table 3; Table E2 Additional file 2: Appendix E). One high-quality and one critically low-quality SR + MA compared zopiclone to placebo and found improvements in SOL (5 RCTs, 356 patients and 15 RCTs, sample size NR) and TST (13 RCTs, sample size NR). One critically low-quality SR compared zolpidem, zopiclone, triazolam, temazepam, and placebo and found slightly increased risks of dependency or withdrawal symptoms in patients taking zopiclone compared to the other medications (7 studies, 450 patients; Table 3; Table E2 Additional file 2: Appendix E).
One high-quality and two moderate quality [36, 38] SR + MAs compared suvorexant to placebo and found improvements in SOL, TST, WASO, SQ, and ISI scores as well as increased risks of hangover or morning sedation effects, accidental injury, and addiction or dependence (Table 3; Table E3 Appendix E, Table F3 Additional file 2: Appendix F).
Two high-quality [26, 41], one low-quality, and two critically low-quality [39, 45] SR + MAs compared doxepin to placebo and found improvements in SOL (2 to 3 RCTs, 60 to 415 patients), TST (2 to 7 RCTs, 60 to 1476 patients), WASO (2 to 4 RCTs, 60 to 558 patients), SQ (2 RCTs, 291 patients and 2 RCTs, 404 patients), SE (2 to 3 RCTs, 60 to 425 patients), and ISI scores (2 RCTs, 494 patients; Table 3; Additional file 2: Appendix E, Table E4). One high-quality SR + MA and four critically low-quality SRs [74, 75, 77, 82] compared trazodone to placebo and found improvements in SOL (2 RCTs, 208 patients), TST (1 to 5 studies, 39 to 323 patients), WASO (1 to 2 studies, 15 to 306 patients), SQ (1 to 5 studies, 9 to 767 patients), and SE (2 to 3 studies, 20 to 56 patients; Table 3; Additional file 2: Appendix E, Table E4). Three critically low-quality SRs [75, 77, 82] all reported on the same RCT that compared trazodone and zolpidem to placebo (306 patients) and only found greater improvements in SOL for patients in the zolpidem group (Table 3; Table E4, Additional file 2: Appendix E and Table F4, Appendix F).
Four critically low-quality SRs [59, 67, 74, 86] compared quetiapine to placebo and found improvements in SOL (2 studies, 52 patients and 2 studies, 32 patients), TST (1 study, 18 patients), SQ (1 to 3 studies, 18 to 84 patients), SE (1 study, 18 patients and 1 study, 27 patients), and ISI scores (1 study, 6 patients) as well as increased risk of hangover or morning sedation effects compared to placebo (2 studies, sample size NR; Table 3; Additional file 2: Table E5 Appendix E, Table F5, Appendix F).
Three high-quality [26, 27, 40], one moderate quality, three published critically low-quality [29, 45, 58], and one unpublished critically low-quality (Dr. Hae Sun Suh, unpublished data 2018) SR + MAs compared melatonin to placebo and found improvements in SOL (8 to 12 RCTs, 206 to 346 patients), TST (8 RCTs, 497 patients and 11 RCTs, sample size NR), and SQ (14 RCTs, sample size NR; Table 3; Additional file 2: Table E6 Appendix E, Table F6, Appendix F). Additionally, one critically low-quality SR compared melatonin to placebo and found improvements in SS (1 study, 112 patients) and HRQoL (1 study, 42 patients).
Two critically low-quality SRs [69, 82] compared diphenhydramine to placebo and found improvements in SOL (3 studies, 226 patients and 4 studies, 332 patients), SE (1 study, 204 patients), and ISI scores (1 study, 184 patients; Table 3; Table E7, Additional file 2: Appendix E, Table F7, Appendix F).
Cognitive behavioral therapy
Four high-quality [25, 26, 41, 42], seven moderate quality [35, 43, 49,50,51, 55, 57], five low-quality [28, 31, 32, 47, 52], and three critically low-quality [34, 37, 44] SR + MAs compared CBT to inactive controls (e.g., wait-list control, symptom monitoring) and found improvements in SOL (2 to 108 RCTs, 122 to 2010 patients), TST (2 to 91 RCTs, 59 to 2009 patients), WASO (2 to 71 RCTs, 59 to 1655 patients), SQ (2 to 40 RCTs, 580 to 965 patients), SE (2 to 79 RCTs, 59 to 2009 patients), ISI scores (2 to 38 RCTs, 131 to 1655 patients), and fatigue symptoms (6 to 7 RCTs, 398 to 1098 patients; Table 3; Additional file 2: Table E8 Appendix E, Table F8 Appendix F). Additionally, one moderate quality and one low-quality SR compared CBT to inactive controls and found improvements in HRQoL (1 study, 81 patients and 4 studies, 706 patients; Table 3; Additional file 2: Table E8 Appendix E). One moderate quality SR + MA compared two different delivery methods of CBT and found greater improvements in SOL for self-help CBT compared to in-person CBT (3 RCTs, sample size NR), one moderate quality SR compared CBT to relaxation techniques and found improvements in WASO (1 study, 46 patients), one low-quality SR compared individual CBT to group CBT and found improvements in HRQoL for both groups (1 study, 58 patients), and one critically low-quality SR compared CBT alone to CBT plus temazepam and found improvements in WASO for both groups and improvements in SE for the CBT plus temazepam group only (1 study, 78 patients; Table 3; Additional file 2: Table E8 Appendix E, Table F8 Appendix F). Finally, one high-quality and one moderate quality SR + MA compared CBT plus relaxation techniques to inactive controls and found improvements for SOL (4 RCTs, 101 patients and 1 RCT, 26 patients) and SQ (3 RCTs, 184 patients; Table 3; Additional file 2: Table E8 Appendix E, Table F8 Appendix F).
One high-quality and one critically low-quality SR + MA compared behavioral therapy or brief behavioral interventions to inactive controls (unspecified) and found improvements in SOL (3 RCTs, 146 patients), WASO (3 studies, 146 patients), and SQ (5 studies, sample size NR; Table 3; Additional file 2: Table E9 Appendix E, Table F9 Appendix F). Additionally, one critically low-quality SR compared sleep restriction to inactive controls and found improvements in SE (2 studies, 129 patients; Table 3; Additional file 2: Table E9 Appendix E).
One low-quality SR + MA and one critically low-quality SR compared mindfulness-based interventions (stress reduction, meditation) to inactive controls (wait-list, symptom monitoring, sleep hygiene education) and found improvements in SOL (2 studies, 83 patients), SQ (2 studies, 83 patients), and SE (3 studies, 205 patients; Table 3; Additional file 2: Table E10 Appendix E, Table F10 Appendix F).
One low-quality SR examined mindfulness-based cognitive therapy plus pharmacotherapy (unspecified) and found improvements in TST (mindfulness + pharmacotherapy; 2 studies, 30 patients) and SQ (1 study, 14 patients) compared with baseline values (Table 3; Additional file 2: Table E11 Appendix E, Table F11 Appendix F).
This comprehensive overview of reviews included 64 systematic reviews representing 358 unique primary studies and found consistent evidence of effectiveness for both pharmacological and non-pharmacological interventions based on data from moderate to high quality SR + MAs. There was evidence of effectiveness across multiple outcomes reported in more than one high- or moderate quality SR + MA for zolpidem, suvorexant, doxepin, and melatonin, and evidence of effectiveness across multiple outcomes reported in one high-quality SR + MA for temazepam, triazolam, zopiclone, and trazodone. Additionally, the evidence for these interventions included reviews rated as having a high (melatonin) or medium (temazepam, triazolam, zolpidem, zopiclone, suvorexant, doxepin, and trazodone) strength of evidence based on GRADE. However, there was very little harms data available for these interventions. There was little to no evidence of effectiveness, or no high- or moderate quality evidence available, for flurazepam, quetiapine, or diphenhydramine. Moreover, most interventions were studied in the short term (< 16 weeks) and the primary studies included in the reviews tended to have small sample sizes. The lack of harms data and small study sizes are concerning given that a large proportion of the general population are on these medications. Likewise, there was evidence of effectiveness across multiple outcomes reported in multiple high- or moderate quality SR + MAs for CBT and reported in one high-quality SR + MA for behavioral therapy (BT); there were no high-quality SR + MAs that examined mindfulness-based or combination therapies. The evidence for these interventions also included reviews rated as a high (CBT) or medium (CBT and behavioral therapy) strength of evidence based on GRADE.
The studies that examined CBT and BT were often conducted in the short term, and only one SR + MA examined the effect of online versus in-person CBT, which is an important question for future research given the cost of and difficulties accessing in-person CBT.
This overview of reviews identified several evidence gaps in the field of insomnia research, particularly the lack of harms data for pharmacological interventions, the effects of different doses, the effectiveness of sequencing or combining drug and non-drug interventions, and a dearth of head-to-head studies directly comparing pharmacological or non-pharmacological interventions. Additionally, the clinical significance of symptomatic changes in insomnia is poorly understood and standards that allow researchers to interpret whether a statistically significant change translates to a clinically significant one are needed (e.g., the minimal clinically important difference).
There are limitations of the included systematic reviews worth noting, particularly the low quality of the included evidence with more than 50% of the included reviews receiving a low- or critically low-quality score on the AMSTAR2 tool. This suggests that substantial improvements in the methods used to synthesize knowledge in this field are needed and that current results should be interpreted with caution. Systematic reviews in this field could be improved by increasing the use of a priori protocols, providing a rationale for including or excluding certain study designs, providing a list of excluded studies with reasons for exclusion, and transparently reporting the funding sources of primary studies included in the review.
There are also some limitations to the conduct of this overview that should be taken into consideration. Due to time and resource constraints, targeted searches for primary studies reporting harms outcomes could not be conducted, which is a deviation from our original protocol. Although the literature search attempted to find unpublished research and reviews in multiple languages, only one unpublished review and two reviews in languages other than English were identified, suggesting that these results may not be generalizable beyond systematic reviews published in English. Additionally, the definition of inactive controls used in this overview included standard care interventions such as sleep hygiene and patient education, which may have resulted in underestimation of the effectiveness of some of the non-drug interventions as they were largely compared with these types of controls rather than true control conditions such as placebo or sham interventions. Also, the behavioral, mindfulness, and cognitive behavioral interventions included in this review were categorized as reported by review authors. In the interest of capturing a comprehensive evidence base, we did not put any limitations on the eligibility of these interventions, leading to a high degree of variability across the reviews. Finally, as stated previously, due to a lack of clinical standards for interpretation, none of the changes in outcomes reported here could be evaluated in terms of their clinical or symptomatic relevance.
There are several strengths of this overview that are worth noting, particularly the use of the Cochrane handbook and an a priori protocol to guide the conduct of the overview, as well as the use of the AMSTAR2 tool for quality appraisal. The literature search was comprehensive and included both published and unpublished sources of information and had no restrictions on publication date or language of publication. The final list of eligible interventions and outcomes was developed in consultation with project stakeholders and clinical experts who were consulted throughout the overview process. Finally, the 64 included systematic reviews were closely examined for overlaps in the primary evidence, which was found to be extensive and which we clearly highlighted throughout the “Results” section.
Based on the results of this overview, clinicians and patients with insomnia can consider CBT as a first-line intervention because of its consistent evidence of effectiveness and high strength of evidence across multiple outcomes, and because it is likely associated with few or no serious harms, although there is insufficient evidence to properly evaluate the benefit-to-harm ratio for this intervention. If CBT is not effective, then other behavioral interventions can be considered, or short courses of melatonin, zolpidem, suvorexant, or doxepin can be added to non-pharmacological therapy. However, these agents have only been tested in short-term studies, and there is little evidence for their effectiveness or safety beyond 16 weeks of treatment.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Abbreviations
AMSTAR: A Measurement Tool to Assess Systematic Reviews
CBT/CBT-I: Cognitive Behavioral Therapy/Cognitive Behavioral Therapy for Insomnia
ISI: Insomnia Severity Index
PSQI: Pittsburgh Sleep Quality Index
SOL/SL: Sleep onset latency/sleep latency
TST: Total sleep time
WASO: Wake after sleep onset
We would like to thank Becky Skidmore (BS) for drafting the database and gray literature searches, Alissa Epworth (AE) for running the searches and retrieving potentially relevant full-text articles, Sarah Jones (SJ) from CADTH for completing the PRESS checklist, Krystle Amog (KA) for help with formatting tables and the manuscript, Ananya Nair (AN) for help formatting tables and abstracting data from some articles, and Katrina Chiu (KC) for help with formatting the manuscript.
This work was supported by the Canadian Institutes of Health Research through the Drug Safety and Effectiveness Network; the funder had no participation in the design, conduct, or publication of this study. SES is funded by a Tier 1 Canada Research Chair in Knowledge Translation and the Mary Trimmer Chair in Geriatric Medicine; ACT is funded by a Tier 2 Canada Research Chair in Knowledge Synthesis and an Ontario Ministry of Research, Innovation, and Science Early Researcher Award.
Ethics approval and consent to participate
Consent for publication
Dr. Charles Morin is on the advisory board for Phillips, Merck, and Cereve; holds speaking honorariums from Merck, Eisai, and Abbott; provided expert testimony for Cereve; and receives book royalties from Elsevier and Sogides. Dr. Judith Leech has part ownership of two private sleep labs, Somnocor in Gatineau, Quebec, and West Ottawa Sleep Centre in Ottawa, Ontario. All other authors have no known conflicts of interest to declare. Dr. Tricco is an Associate Editor for BMC Systematic Reviews but will not be involved with any decisions related to this paper.
PROSPERO Registration. The additional file includes the PROSPERO registration for the study.
The appendices include all supplemental data and information. Appendix A: PRIO-harms Checklist. Appendix B: Database Search Strategy and List of Gray Literature search sites. Appendix C: Review, participant, and intervention characteristics. Appendix D: AMSTAR Results. Appendix E: Detailed Tables of Results. Appendix F: Tables of primary studies by treatment comparison for outcomes with more than one included SR or SR + MA.
Matrix of Evidence. The matrix of evidence of primary studies across all included reviews.
Cite this article
Rios, P., Cardoso, R., Morra, D. et al. Comparative effectiveness and safety of pharmacological and non-pharmacological interventions for insomnia: an overview of reviews. Syst Rev 8, 281 (2019). https://doi.org/10.1186/s13643-019-1163-9