Beneficial and harmful effects of antidepressants versus placebo, ‘active placebo’, or no intervention for adults with major depressive disorder: a protocol for a systematic review of published and unpublished data with meta-analyses and trial sequential analyses
Systematic Reviews volume 10, Article number: 154 (2021)
Major depressive disorder is one of the most common, burdensome, and costly psychiatric disorders worldwide. Antidepressants are frequently used to treat major depressive disorder. It has been shown repeatedly that antidepressants seem to reduce depressive symptoms with a statistically significant effect, but the clinical importance of the effect sizes seems questionable. Both beneficial and harmful effects of antidepressants have not previously been sufficiently assessed. The main objective of this review will be to evaluate the beneficial and harmful effects of antidepressants versus placebo, ‘active placebo’, or no intervention for adults with major depressive disorder.
A systematic review with meta-analysis will be reported as recommended by Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA), bias will be assessed with the Cochrane Risk of Bias tool-version 2 (ROB2), our eight-step procedure will be used to assess if the thresholds for clinical significance are crossed, Trial Sequential Analysis will be conducted to control for random errors, and the certainty of the evidence will be assessed with the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach. To identify relevant trials, we will search both for published and unpublished trials in major medical databases from their inception to the present. Clinical study reports will be obtained from regulatory authorities and pharmaceutical companies. Two review authors will independently screen the results of the literature searches, extract data, and perform risk of bias assessment. We will include any published or unpublished randomised clinical trial comparing one or more antidepressants with placebo, ‘active placebo’, or no intervention for adults with major depressive disorder. The following active agents will be included: agomelatine, amineptine, amitriptyline, bupropion, butriptyline, cianopramine, citalopram, clomipramine, dapoxetine, demexiptiline, desipramine, desvenlafaxine, dibenzepin, dosulepin, dothiepin, doxepin, duloxetine, escitalopram, fluoxetine, fluvoxamine, imipramine, iprindole, levomilnacipran, lofepramine, maprotiline, melitracen, metapramine, milnacipran, mirtazapine, nefazodone, nortriptyline, noxiptiline, opipramol, paroxetine, protriptyline, quinupramine, reboxetine, sertraline, trazodone, tianeptine, trimipramine, venlafaxine, vilazodone, and vortioxetine. Primary outcomes will be depressive symptoms, serious adverse events, and quality of life. Secondary outcomes will be suicide or suicide attempt, suicidal ideation, and non-serious adverse events.
As antidepressants are commonly used to treat major depressive disorder in adults, a systematic review evaluating their beneficial and harmful effects is urgently needed. This review will inform best practice in treatment and clinical research of this highly prevalent and burdensome disorder.
Systematic review registration
Description of participants
Major depressive disorder is estimated by the World Health Organization (WHO) to affect more than 264 million people globally, making the disorder one of the leading causes of disability worldwide [1, 2]. The estimated lifetime prevalence of major depressive disorder is between 10 and 20% [3, 4]. In 2010, the annual economic burden in the USA alone was estimated to exceed 210 billion US dollars including both direct medical costs and indirect costs related to work ability and comorbidities . Major depressive disorder is characterised by depressed mood and loss of interest or pleasure resulting in significant psychological distress and functional impairment [6, 7]. Furthermore, risks of suicides and suicide attempts significantly increases during major depressive episodes [8, 9]. Together, these findings emphasise the need for efficacious and cost-effective treatments for this burdensome and highly prevalent psychiatric disorder, especially treatments where benefits outweigh harms.
Description of interventions
Pharmacotherapy is widely used in the treatment of major depressive disorder, particularly in the Western world, but also in several other countries [10, 11]. Data published in 2017 from the National Health and Nutrition Examination Survey showed that during 2011–2014, about one in eight people aged 12 and above in the USA reported taking antidepressants during the previous month . The use of antidepressants has increased nearly 65% over a 15-year time frame  and more than 60% of people in the USA taking antidepressants have been taking them for more than two years . Today, antidepressants for major depressive disorder, either alone or in combination with psychotherapy, are recommended by the UK National Institute for Health and Care Excellence (NICE) and the American Psychiatric Association, as well as different national clinical guidelines [13,14,15,16,17,18,19].
Several different antidepressants exist. Before the late 1980s, pharmacological treatment was limited to tricyclic antidepressants (TCAs) and monoamine oxidase inhibitors (MAOIs). TCAs and MAOIs are now commonly referred to as first-generation antidepressants. Now, second-generation antidepressants comprise most antidepressant prescriptions. These drugs include selective serotonin reuptake inhibitors (SSRIs), serotonin and norepinephrine reuptake inhibitors (SNRIs), and other drugs with related mechanisms of action that selectively target neurotransmitters. For an overview of the different types of antidepressants, please see Table 1 [10, 20, 21].
How the interventions might work
Antidepressants aim to increase the availability of specific neurotransmitters that are sought to play a role in the development of major depressive disorder, most commonly serotonin, noradrenaline, and dopamine. The various antidepressants target different neurotransmitters . For example, selective serotonin reuptake inhibitors (e.g., citalopram and fluoxetine) specifically block the reuptake of serotonin, while selective noradrenaline reuptake inhibitors (e.g. reboxetine) specifically block the reuptake of noradrenaline. Some antidepressants simultaneously block the reuptake of both serotonin and noradrenaline (e.g. duloxetine and venlaflaxine) and are commonly referred to as ‘dual-action’ drugs. However, it remains unclear exactly how antidepressants work in patients with major depressive disorder [23, 24]. The ‘monoamine hypothesis’ proposes that diminished activity of serotonergic, noradrenergic, and dopaminergic pathways plays a causal role in the pathophysiology of depression [25,26,27], but the role of serotonin in the pathophysiology and treatment of major depressive disorder is still unclear due to unreliable clinical biochemical findings and the difficulty of relating changes in serotonin activity to mood state .
Why is it important to do this review?
It has been repeatedly shown that antidepressants seem to reduce depressive symptoms with a statistically significant effect, but the effect sizes are small or minimal and without importance to patients [10, 29, 30]. A recent review of both within patient and between patient anchor-based approaches suggested that the minimal clinically important difference on the Hamilton Depression Rating Scale 17 (HDRS-17) is likely to be in the range from 3 to 5 points . Furthermore, there is inconsistent evidence concerning individual variability in who benefits from antidepressants [32, 33]. Considering this inconsistency, one must still assume that the average effect of antidepressants applies also to the individual patient [33, 34]. In addition, quality of life has previously been selectively reported in placebo-controlled trials of antidepressants . Therefore, the beneficial effects of antidepressants are currently unclear.
When establishing evidence for any intervention, the beneficial and harmful effects must be carefully assessed . If benefits are small or unimportant, society has less tolerability of risks of adverse events. Harmful effects are often insufficiently reported in journal articles compared to trial registries, causing significant under-reporting of harms associated with antidepressants [37, 38]. This might be the cause for conflicting evidence on whether antidepressants may trigger harmful effects in adults with major depressive disorder .
A meta-analysis was published in BMJ in 2009  assessing the risk of suicidality in randomised clinical trials of antidepressants based on proprietary data submitted to US Food and Drug Administration (FDA). The meta-analysis included major depressive disorder, other depression, other psychiatric disorders, and non-psychiatric disorders. The authors concluded that risk of suicidality associated with use of antidepressants is strongly age dependent . For suicidal behaviour or ideation and for suicidal behaviour only, the respective odds ratios were 1.62 (95% confidence interval [CI] 0.97 to 2.71) and 2.30 (95% CI 1.04 to 5.09) for participants aged < 25 years, 0.79 (95% CI 0.64 to 0.98) and 0.87 (95% CI 0.58 to 1.29) for those aged 25 to 64 years, and 0.37 (95% CI 0.18 to 0.76) and 0.06 (95% CI 0.01 to 0.58) for those aged ≥ 65 years. However, these age group subgroup analyses were not predefined in a registered or published protocol and should therefore be interpreted with caution.
In a study by Khan et al. , the Integrated Safety Summary data from approval packets for 14 investigational antidepressant programmes (1991–2013, 40,857 patients, 10,890 exposure years) were used to calculate suicides and suicide attempts per 100,000 patient exposure years for antidepressant and placebo treatment groups separately in patients with major depressive disorder. The study concluded that deaths by suicide and suicide attempts had decreased significantly in clinical trials assessing the effect of antidepressants following the year 2000 compared to the decade before 2000, and assessments of drug-placebo differences in suicide and suicide attempt rates revealed no significant differences . However, a reanalysis of the data from this study found different results . According to the reanalysis, there were 37 suicides (0.116%) and 206 suicide attempts (0.713%) in the antidepressant group versus 4 suicides (0.040%) and 28 suicide attempts (0.300%) in the placebo group. Thus, the suicide rate was significantly higher in the antidepressant group than in the placebo group (odds ratio [OR] 2.83; 95% CI 1.13 to 9.67, p = 0.02).
A large network meta-analysis was published in The Lancet in 2019 . The authors included placebo-controlled and head-to-head trials of 21 commonly used antidepressants . The authors recorded all outcomes as close to eight weeks as possible, that is, only short-term results were assessed. In this study, neither serious nor non-serious adverse events were assessed. Instead, the authors assessed ‘acceptability’ (treatment discontinuation measured by the proportion of participants who withdrew for any reason) and the proportion of participants who dropped out early because of adverse effects. But these outcomes are difficult to interpret clinically; participants might, for example, continue taking antidepressants even if they experience serious adverse effects.
We previously published a systematic review assessing the effects of the most commonly used antidepressants, SSRIs . This review assessed both beneficial and harmful effects of SSRIs. The results showed that there was a significant effect of SSRIs on depressive symptoms, but the effect was of questionable clinical relevance and comparable to that of the network meta-analysis . Moreover, we found almost no data on suicidal behaviour, and SSRIs significantly increased the risk of both serious and non-serious adverse events .
No former review has systematically assessed the beneficial and harmful effects of antidepressants including all types of antidepressants including both published trials and unpublished data from clinical study reports. Therefore, there is an urgent need for such a review. The present systematic review aims at forming the basis for evidence-based guideline recommendations for the use of antidepressants for major depressive disorder taking bias risks (systematic errors), play of chance (random errors), and certainty of the findings into consideration.
The present protocol has been registered in the PROSPERO database (CRD42020220279) and is reported in accordance with the reporting guidance provided in the Preferred Reporting Items for Systematic Reviews and Meta-Analysis Protocols (PRISMA-P) statement [43, 44] (see checklist in Additional file 1).
Criteria for considering studies for this review
Types of studies
We will include randomised clinical trials irrespective of setting, publication status, publication year, and language. We will not include quasi-randomised trials, cluster-randomised trials, or observational studies.
Types of participants
Adults (as defined by trialists) with a primary diagnosis of major depressive disorder as defined by standardised diagnostic criteria from either DSM-5 , ICD-11 , or earlier versions of these diagnostic manuals. Participants will be included irrespective of sex and comorbidities.
Types of interventions
As experimental intervention, we will accept the following: agomelatine, amineptine, amitriptyline, bupropion, butriptyline, cianopramine, citalopram, clomipramine, dapoxetine, demexiptiline, desipramine, desvenlafaxine, dibenzepin, dosulepin, dothiepin, doxepin, duloxetine, escitalopram, fluoxetine, fluvoxamine, imipramine, iprindole, levomilnacipran, lofepramine, maprotiline, melitracen, metapramine, milnacipran, mirtazapine, nefazodone, nortriptyline, noxiptiline, opipramol, paroxetine, protriptyline, quinupramine, reboxetine, sertraline, trazodone, tianeptine, trimipramine, venlafaxine, vilazodone, and vortioxetine. We will accept any of these antidepressants as experimental interventions irrespective of dose and duration of administration.
As control intervention, we will accept the following: placebo, ‘active placebo’ (a matching placebo that produces noticeable adverse effects that may convince the participant being treated and the blinded outcome assessors that the participants are receiving an active intervention), or no intervention. We will accept any of these control interventions irrespective of dose and duration of administration.
We will accept any cointervention, if the cointervention is planned to be delivered similarly in the intervention and control groups.
Depressive symptoms measured on the 17-item or 21-item Hamilton Depression Rating Scale (HDRS) 
Proportion of participants with one or more serious adverse events. We will use the International Conference on Harmonisation of technical requirements for registration of pharmaceuticals for human use—Good Clinical Practice (ICH-GCP) definition of a serious adverse event, which is any untoward medical occurrence that resulted in death, was life-threatening, required hospitalisation or prolonging of existing hospitalisation, and resulted in persistent or significant disability or jeopardised the participant . If the trialists do not use the ICH-GCP definition, we will include the data if the trialists use the term ‘serious adverse event.’ If the trialists do not use the ICH-GCP definition nor use the term serious adverse event, then we will also include the data, if the event clearly fulfils the ICH-GCP definition for a serious adverse event. We will secondly assess each serious adverse event separately.
Quality of life
Proportion of participants with either a suicide or a suicide attempt (as defined by the trialists)
Proportion of participants with one or more non-serious adverse events (any adverse event not classified as serious). We will secondly assess each non-serious adverse event separately.
Proportion of participants in remission (as defined by trialists)
Proportion of participants achieving response (as defined by trialists)
Assessment time points
We will assess all our outcomes at maximum follow-up.
Search methods for identification of studies
We will search Cochrane Central Register of Controlled Trials (CENTRAL) in the Cochrane Library, MEDLINE Ovid, Embase Ovid, Latin American and Caribbean Health Sciences Literature (LILACS; Bireme), PsycINFO (EBSCO host), Science Citation Index Expanded (SCI-EXPANDED; Web of Science), Conference Proceedings Citation Index—Science (CPCI-S; Web of Science), Social Sciences Citation Index (SSCI; Web of Science), Conference Proceedings Citation Index—Social Science & Humanities (CPCI-SSH; Web of Science), Chinese Biomedical Literature Database (CBM), China Network Knowledge Information (CNKI), Chinese Science Journal Database (VIP), and Wafang Database to identify relevant trials. We will search all databases from their inception to the present. For a detailed search strategy for all electronic databases, see Additional file 2. The search strategies for the Chinese databases will be given at review stage. Trials will be included irrespective of language, publication status, publication year, and publication type.
Searching other resources
We will include the data from a recent systematic review on 21 antidepressants by Cipriani et al. . The authors of this comprehensive review made the data available in a public repository. This is the largest database of new-generation antidepressants for the acute treatment of major depressive disorder compiled so far. Further, the reference lists of relevant publications will be checked for any unidentified randomised trials. We will contact the authors of included trials by email asking for unpublished randomised trials. To identify unpublished trials, we will also search clinical trial registers, websites of pharmaceutical companies, websites of US Food and Drug Administration (FDA), and European Medicines Agency (EMA). We will request FDA, EMA, and national medicines agencies to provide all publicly releasable information about relevant randomised clinical trials of antidepressants that were submitted for marketing approval, including clinical study reports . Additionally, we will hand search conference abstracts from psychiatry conferences for relevant trials. We will also include unpublished and grey literature trials if we identify these and assess relevant retraction statements and errata for included trials.
Data collection and analysis
We will perform and report the review following the recommendations stated in the Cochrane Handbook for Systematic Reviews of Interventions . Analyses will be performed using Stata version 16.1 (StataCorp LLC, College Station, TX, USA)  and Trial Sequential Analysis [52, 53].
Selection of studies
Two review authors will independently screen titles and abstracts. We will retrieve all relevant full-text study reports/publications, and two review authors will independently screen the full text and identify and record reasons for exclusion of the ineligible studies. The two review authors will resolve any disagreement through discussion, or, if required, they will consult a third author.
Data extraction and management
Two authors will independently extract data from included trials in a dedicated data extraction sheet developed for this review. Disagreements will be resolved by discussion with a third author. The two review authors will assess duplicate publications and companion papers of a trial together to evaluate all available data simultaneously (maximise data extraction, correct bias assessment). We will contact the trial authors by email to specify any additional data, which may not have been reported sufficiently or at all in the publication.
We will extract the following data: bias risk components (as defined below), trial design (parallel, factorial, or crossover), number of intervention groups, length of follow-up, estimation of sample size, and inclusion and exclusion criteria.
We will extract the following data: number of randomised participants, number of analysed participants, number of participants lost to follow-up/withdrawals/crossover, age range (mean or median), and sex ratio.
We will extract the following data: type of antidepressant, dose of intervention, and duration of intervention.
We will extract the following data: type of control intervention, dose of intervention, and duration of intervention.
All outcomes listed above will be extracted from each randomised clinical trial, and we will identify if outcomes are incomplete or selectively reported according to the criteria described later in ‘incomplete outcome data’ bias domain and ‘selective outcome reporting’ bias domain.
We will search for information regarding industry funding of either personal or academic activities for each trial author. We will judge a publication at high risk of for-profit bias if a trial is sponsored by the industry or if just one author has affiliation to the industry. We will note in the ‘Characteristics of included studies’ table, if outcome data were not reported in a usable way. Two review authors will independently transfer data into the Stata file . Disagreements will be resolved through discussion, or if required, we will consult with a third author.
Assessment of risk of bias in the included trials
Our bias risk assessment will be based on the Cochrane Risk of Bias tool-version 2 (RoB 2) as recommended in The Cochrane Handbook of Systematic Reviews of Interventions . We will evaluate the methodology in respect of the following bias domains:
Bias arising from the randomisation process
Low risk of bias. Allocation was adequately concealed, AND there are no baseline imbalances across intervention groups at baseline appear to be compatible with chance, AND an adequate (random or otherwise unpredictable) method was used to generate allocation sequence, OR there is no information about the method used to generate the allocation sequence.
Some concerns. Allocation was adequately concealed, AND there is a problem with the method of sequence generation, OR baseline imbalances suggest a problem with the randomisation process, OR no information is provided about concealment of allocation, AND baseline imbalances across intervention groups appear to be compatible with chance, OR no information to answer any of the signalling questions,
High risk of bias. Allocation sequence was not concealed, OR no information is provided about concealment of allocation sequence, AND baseline imbalances suggest a problem with the randomisation process.
Bias due to deviation from intended interventions
Low risk of bias. Participants, carers, and personnel were unaware of intervention groups during the trial, OR participants, carers, or personnel were aware of intervention groups during the trial but any deviations from intended intervention reflected usual practice, OR participants, carers, or personnel were aware of intervention groups during the trial but any deviations from intended intervention were unlikely to impact on the outcome, AND no participants were analysed in the wrong intervention groups (that is, on the basis of intervention actually received rather than of randomised allocation).
Some concerns. Participants, carers, or personnel were aware of intervention groups and there is no information on whether there were deviations from usual practice that were likely to impact on the outcome and were imbalanced between intervention groups, OR some participants were analysed in the wrong intervention groups (on the basis of intervention actually received rather than of randomised allocation) but there was little potential for a substantial impact on the estimated effect of intervention.
High risk of bias. Participants, carers, or personnel were aware of intervention groups, and there were deviations from intended interventions that were unbalanced between the intervention groups and likely to have affected the outcome, OR some participants were analysed in the wrong intervention groups (on the basis of intervention actually received rather than of randomised allocation), and there was potential for a substantial impact on the estimated effect of intervention.
Bias due to missing outcome data
Low risk of bias. No missing data OR non-differential missing data (similar proportion of and similar reasons for missing data in compared groups) OR evidence of robustness of effect estimate to missing data (based on adequate statistical methods for handling missing data and sensitivity analysis)
Some concerns. An unclear degree of missing data or unclear information on proportion and reasons for missingness in compared groups AND there is no evidence that the effect estimate is robust to missing data
High risk of bias. A high degree of missing data AND differential missing data (different proportion of or different reasons for missing data in compared groups) AND there is no evidence that the effect estimate is robust to missing data
Bias in measurement of outcomes
Low risk of bias. The outcome assessors were unaware of the intervention received by study participants, OR the outcome assessors were aware of the intervention received by study participants, but the assessment of the outcome was unlikely to be influenced by knowledge of the intervention received.
Some concerns. There is no information available to determine whether the assessment of the outcome is likely to be influenced by knowledge of the intervention received.
High risk of bias. The assessment of the outcome was likely to be influenced by knowledge of the intervention received by study participants.
Bias arising from selective reporting of results
Low risk of bias. Reported outcome data are unlikely to have been selected, on the basis of the results, from multiple outcome measurements (e.g. scales, definitions, time points) within the outcome domain, and reported outcome data are unlikely to have been selected, on the basis of the results, from multiple analyses of the data.
Some concerns. There is insufficient information available to exclude the possibility that reported outcome data were selected, on the basis of the results, from multiple outcome measurements (e.g. scales, definitions, time points) within the outcome domain, or from multiple analyses of the data. Given that analysis intentions are often unavailable or not reported with sufficient detail, we anticipate that this will be the default judgement for most trials.
High risk of bias. Reported outcome data are likely to have been selected, on the basis of the results, from multiple outcome measurements (e.g. scales, definitions, time points) within the outcome domain, or from multiple analyses of the data (or both).
Overall assessment of risk of bias
Low risk of bias. The trial is judged to be at low risk of bias for all domains.
High risk of bias. The trial is judged to be at high risk of bias or to be at some concerns in at least one domain. Our subgroup analysis will compare the intervention effect of trials at low risk of bias with trials at high risk of bias, that is one or more domains at some concern or high risk of bias.
We will assess the domains ‘missing outcome data’, ‘risk of bias in measurement of the outcome’, and ‘risk of bias in selection of the reported result’ for each outcome result. Thus, we can assess the bias risk for each outcome assessed in addition to each trial. Our primary conclusions will be based on the results of our primary outcome results with overall low risk of bias. Both our primary and secondary conclusions will be presented in the ‘Summary of findings’ tables.
Differences between the protocol and the review
We will conduct the review according to this published protocol and report any deviations from it in the ‘Differences between the protocol and the review’ section of the systematic review.
Measurement of treatment effect
We will calculate risk ratios (RRs) with 95% confidence interval (CI) for dichotomous outcomes, as well as the Trial Sequential Analysis-adjusted CIs (see the following).
We will calculate the mean differences (MDs) and consider calculating the standardised mean difference (SMD) with 95% CI for continuous outcomes. We will also calculate Trial Sequential Analysis-adjusted CIs (see the following).
Dealing with missing data
We will use intention-to-treat data if provided by the trialists . We will, as the first option, contact all trial authors to obtain any relevant missing data (i.e. for data extraction and for assessment of risk of bias, as specified above).
We will not impute missing values for any outcomes in our primary analysis. In our sensitivity analyses (see the following paragraph), we will impute data.
We will primarily analyse scores assessed at single time points. If only changes from baseline scores are reported, we will analyse the results together with follow-up scores . If standard deviations (SDs) are not reported, we will calculate the SDs using trial data, if possible. We will not use intention-to-treat data if the original report did not contain such data. We will not impute missing values for any outcomes in our primary analysis. In our sensitivity analysis (see the following paragraph) for continuous outcomes, we will impute data.
Assessment of heterogeneity
We will primarily investigate forest plots to visually assess any sign of heterogeneity. We will secondly assess the presence of statistical heterogeneity by chi2 test (threshold P < 0.10) and measure the quantities of heterogeneity by the I2 statistic [55, 56]. We will investigate possible heterogeneity through subgroup analyses. We may ultimately decide that a meta-analysis should be avoided .
Assessment of reporting biases
We will use a funnel plot to assess reporting bias if ten or more trials are included. We will visually inspect funnel plots to assess the risk of bias. We are aware of the limitations of a funnel plot (i.e. a funnel plot assesses bias due to small sample size). From this information, we assess possible reporting bias. For dichotomous outcomes, we will test asymmetry with the Harbord test  if τ2 is less than 0.1 and with the Rücker test if τ2 is more than 0.1. For continuous outcomes, we will use the regression asymmetry test  and the adjusted rank correlation .
Unit of analysis issues
We will only include randomised clinical trials. For trials using crossover design, only data from the first period will be included [36, 60]. There will therefore not be any unit of analysis issues. We will not include cluster-randomised trials, due to their problems with randomisation, and blinding.
We will undertake the meta-analysis according to The Cochrane Handbook for Systematic Reviews of Interventions , Keus et al. , and our eight-step procedure suggested by Jakobsen et al. . We will use the statistical software Stata version 16 to analyse data . We will assess our intervention effects with both random-effects model meta-analyses (Hartung-Knapp-Sidik-Jonkman)  and fixed-effect model meta-analyses (Mantel-Haenszel for dichotomous outcomes and inverse variance for continuous outcomes) [36, 64]. We will use the more conservative point estimate of the two . The more conservative point estimate is the estimate with the highest p-value. We assess a total of six primary and secondary outcomes, and we will therefore consider a p-value of 0.014 or less as the threshold for statistical significance . We will investigate possible heterogeneity through subgroup analyses. We will use our eight-step procedure to assess if the thresholds for significance are crossed . This eight-step procedure comprise of the following steps: (1) obtain the 95% confidence intervals and the P-values from both fixed-effect and random-effects meta-analyses and report the most conservative results as the main results, (2) explore the reasons behind substantial statistical heterogeneity using subgroup and sensitivity analyses (see step 6), (3) to take account of problems with multiplicity adjust the thresholds for significance according to the number of primary outcomes (we will both adjust the thresholds for significance according to the number of primary and secondary outcomes), (4) calculate required information sizes (≈ the a priori required number of participants for a meta-analysis to be conclusive) for all outcomes and analyse each outcome with trial sequential analysis. Report whether the trial sequential monitoring boundaries for benefit, harm, or futility are crossed, (5) calculate Bayes factors for all primary outcomes, (6) use subgroup analyses and sensitivity analyses to assess the potential impact of bias on the review results, (7) assess the risk of publication bias, and (8) assess the clinical significance of the statistically significant review results .
Where multiple trial arms are reported in a single trial, we will include only the relevant arms. If two comparisons are combined in the same meta-analysis, we will halve the control group (participants and amount of evens to avoid double-counting). For continuous data, we will keep the main score . Trials with a factorial design will be included. In case of, e.g. a 2 × 2 factorial designed trial, the two groups receiving antidepressants will be considered experimental groups, while the two groups receiving placebo, ‘active placebo’, or no intervention will be considered control groups.
Trial Sequential Analysis
Traditional meta-analysis runs the risk of random errors due to sparse data and repetitive testing of accumulating data when updating reviews. We wish to control the risks of type I errors and type II errors. We will therefore perform Trial Sequential Analysis on all outcomes, in order to calculate the diversity-adjusted required information size (DARIS; that is, the number of participants needed in a meta-analysis to detect or reject a certain intervention effect) and the cumulative Z-curve’s breach of relevant trial sequential monitoring boundaries [52, 53, 65,66,67,68,69,70,71]. A more detailed description of Trial Sequential Analysis software can be found in the manual  and at http://www.ctu.dk/tsa/. For dichotomous outcomes, we will estimate the required information size based on the observed proportion of patients with an outcome in the control group (the cumulative proportion of patients with an event in the control groups relative to all patients in the control groups), a relative risk reduction or a relative risk increase of 20%, an alpha of 1.6% for all our outcomes, a beta of 10%, and the observed diversity as suggested by the trials in the meta-analysis. For continuous outcomes, we will in the Trial Sequential Analysis use the observed standard deviation (SD) in the control group, a mean difference of three HDRS points when assessing depressive symptoms (for other continuous outcomes the observed SD/2), an alpha of 1.6% for all outcomes, a beta of 10%, and the observed diversity as suggested by the trials in the meta-analysis.
Subgroup analysis and integration of heterogeneity
We will perform the following subgroup analyses when analysing the primary outcomes (depressive symptoms, serious adverse events, quality of life).
Trials at high risk of bias compared to trials at low risk of bias
Trials with for-profit bias compared to trials at unknown or known risk of for-profit bias 
Types of antidepressant drug
Types of comparator (placebo, ‘active placebo’, no intervention)
Age groups (18 to 24 years, 25 to 64 years, ≥ 65 years)
We will use the formal test for subgroup interactions in Stata .
To assess the potential impact of the missing data for dichotomous outcomes, we will perform the two following sensitivity analyses on both the primary and secondary dichotomous outcomes.
‘Best-worst-case’ scenario. We will assume that all participants lost to follow-up in the antidepressant group survived, had no serious adverse events, had no suicides or suicide attempts, and had no non-serious adverse events, and that all those participants lost to follow-up in the control group did not survive, had a serious adverse event, died by suicide or had a suicide attempt, and had a non-serious adverse event.
‘Worst-best-case’ scenario. We will assume that all participants lost to follow-up in the antidepressant group did not survive, had a serious adverse event, died by suicide or had a suicide attempt, and had a non-serious adverse event, and that all those participants lost to follow-up in the control group survived, had no serious adverse events, had no suicides or suicide attempts, and had no non-serious adverse events.
We will present results of both scenarios in our review. When analysing depressive symptoms, suicidal ideation, and quality of life, a ‘beneficial outcome’ will be the group mean plus two SDs (we will secondly use one SD in another sensitivity analysis) of the group mean and a ‘harmful outcome’ will be the group mean minus two SDs (we will secondly use one SD in another sensitivity analysis) of the group mean . To assess the potential impact of missing SDs for continuous outcomes, we will perform the following sensitivity analysis:
Where SDs are missing and it is not possible to calculate them, we will impute SDs from trials with similar populations and low risk of bias. If we find no such trials, we will impute SDs from trials with a similar population. As the final option, we will impute the mean SD from all included trials.
We will present results of this scenario in our review. Other post hoc sensitivity analyses might be warranted if unexpected clinical or statistical heterogeneity is identified during the analysis of the review results .
‘Summary of findings’ tables
We will create summary of findings tables for each comparison including each of the prespecified primary and secondary outcomes (depressive symptoms, serious adverse events, quality of life, suicide or suicide attempt, non-serious adverse events). We will use the five Grading Recommendations Assessment Development Evaluation (GRADE) considerations (bias risk, heterogeneity, imprecision, indirectness, and publication bias) to assess the certainty of evidence [62, 73,74,75]. We will assess imprecision using Trial Sequential Analysis. We will downgrade imprecision in GRADE by two levels if the accrued number of participants is below 50% of the DARIS, and one level if between 50 and 100% of DARIS. We will not downgrade if the cumulative Z-curve crosses the monitoring boundaries for benefit, harm, or futility, or if DARIS is reached. We will justify all decisions to downgrade the quality of evidence using footnotes, and we will make comments to aid the reader’s understanding of the assessment where necessary. Firstly, we will present our results in the summary of findings tables based on the results from the trials with overall low risk of bias, and secondly, we will present the results based on all trials.
We will publish separate systematic reviews assessing the beneficial and harmful effects of the most frequently used antidepressants. We will subsequently gather data from all these reviews, update the searches and analyses, and finally publish the overall results from all antidepressants in a large publication. We will publish the following protocols and systematic reviews separately: (1) tricyclic antidepressants, (2) SSRIs, (3) venlaflaxine, (4) mirtazapine, and (5) duloxetine.
This protocol aims at assessing the beneficial and harmful effects of antidepressants versus placebo, ‘active placebo’, or no intervention in adults with major depressive disorder. Primary outcomes will be depressive symptoms, serious adverse events, and quality of life. Secondary outcomes will be suicide or suicide attempts, suicidal ideation, and non-serious adverse events.
Our protocol has a number of strengths. The predefined methodology is based on Cochrane methodology , PRISMA [76, 77], Keus et al. , our eight-step assessment suggested by Jakobsen et al. , Trial Sequential Analysis , and GRADE assessment [73,74,75]. Hence, this protocol considers both risks of random errors and risks of systematic errors . Further, we increase the statistical power by pooling various antidepressants as the experimental intervention. Moreover, we will include both unpublished and published trials as well as clinical study reports .
Our protocol also has limitations. The primary limitation is the potential for high statistical heterogeneity as a result of including various antidepressants as the experimental intervention. To minimise this limitation, we will carefully look for signs of heterogeneity and ultimately decide if data ought to be pooled and meta-analysed, and we have planned several subgroup analyses. Another limitation is the large number of analyses which increases the risk of type 1 error. We have adjusted our thresholds for significance according to the number of primary and secondary outcomes, but we have not adjusted our thresholds for significance according to the total number of comparisons (e.g. subgroup analyses and sensitivity analyses). As mentioned in the ‘Background’ section, we expect inadequate reporting of harmful effects in the included trials, which increases the risk of underestimation of harmful effects. Finally, we expect short follow-up periods.
Availability of data and materials
Data sharing is not applicable to this protocol article.
Beck’s Depression Inventory
Chinese Biomedical Literature Database
Cochrane Central Register of Controlled Trials
China Network Knowledge Information
Conference Proceedings Citation Index—Social Science & Humanities
Conference Proceedings Citation Index—Science
diversity-adjusted required information size
European Medicines Agency
US Food and Drug Administration
Grading of Recommendations Assessment, Development and Evaluation
Hamilton Depression Rating Scale
Good Clinical Practice
Latin American and Caribbean Health Sciences Literature
Montgomery-Asberg Depression Rating Scale
Monoamine oxidase inhibitors
UK National Institute for Health and Care Excellence
Preferred Reporting Items for Systematic Reviews and Meta-Analysis Protocols
Cochrane Risk of Bias tool-version 2
Science Citation Index Expanded
Standardised mean difference
Serotonin and norepinephrine reuptake inhibitors
Social Sciences Citation Index
Selective serotonin reuptake inhibitors
Chinese Science Journal Database
World Health Organization
World Health Organization (WHO). Depression (fact sheet) 2020. Available at https://www.who.int/news-room/fact-sheets/detail/depression [Accessed November 6, 2020]
Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;396(10258):1204–22. https://doi.org/10.1016/S0140-6736(20)30925-9.
Lim GY, Tam WW, Lu Y, et al. Prevalence of depression in the community from 30 countries between 1994 and 2014. Sci Rep. 2018;8(1):1–10.
Hasin DS, Sarvet AL, Meyers JL, Saha TD, Ruan WJ, Stohl M, et al. Epidemiology of adult DSM-5 major depressive disorder and its specifiers in the United States. JAMA Psychiatry. 2018;75(4):336–46. https://doi.org/10.1001/jamapsychiatry.2017.4602.
Greenberg PE, Fournier AA, Sisitsky T, Pike CT, Kessler RC. The economic burden of adults with major depressive disorder in the United States (2005 and 2010). J Clin Psychiatry. 2015;76(2):155–62. https://doi.org/10.4088/JCP.14m09298.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (DSM-5®). Washington, DC: American Psychiatric Publishing; 2013. https://doi.org/10.1176/appi.books.9780890425596.
World Health Organization (WHO). International classification of diseases for mortality and morbidity statistics (11th Revision). Available at: https://icd.who.int/browse11/l-m/en. 2018 [Accessed November 6, 2020]
Holma KM, Melartin TK, Haukka J, Holma IAK, Sokero TP, Isometsä ET. Incidence and predictors of suicide attempts in DSM–IV major depressive disorder: a five-year prospective study. Am J Psychiatry. 2010;167(7):801–8. https://doi.org/10.1176/appi.ajp.2010.09050627.
Sokero TP, Melartin TK, Rytsälä HJ, Leskelä US, Lestelä-Mielonen PS, Isometsä ET. Prospective study of risk factors for attempted suicide among patients with DSM–IV major depressive disorder. Br J Psychiatry. 2005;186(4):314–8. https://doi.org/10.1192/bjp.186.4.314.
Jakobsen JC, Gluud C, Kirsch I. Should antidepressants be used for major depressive disorder? BMJ Evid Based Med. 2019;25(4):130–6. http://dx.doi.org/10.1136/bmjebm-2019-111238http://dx.doi.org/10.1136/bmjebm-2019-111238.
OECD. Antidepressant drugs consumption, 2000 and 2015 (or nearest year)2017. Paris: OECD Publishing; 2017. https://doi.org/10.1787/health_glance-2017-graph181-en.
Pratt LA, Brody DJ, Gu Q. Antidepressant use among persons aged 12 and over: United States, 2011-2014. NCHS Data Brief. Number 283. National Center Health Stat. 2017;(283):1–8. https://www.cdc.gov/nchs/data/databriefs/db283.pdf. Accessed 21 May 2021.
Cuijpers P, van Straten A, Warmerdam L, Andersson G. Psychotherapy versus the combination of psychotherapy and pharmacotherapy in the treatment of depression: a meta-analysis. Depress Anxiety. 2009;26(3):279–88. https://doi.org/10.1002/da.20519.
Bauer M, Severus E, Möller H-J, Young AH, WFSBP Task Force on Unipolar Depressive Disorders. Pharmacological treatment of unipolar depressive disorders: summary of WFSBP guidelines. Int J Psychiatry Clin Pract. 2017;21(3):166–76. https://doi.org/10.1080/13651501.2017.1306082.
Cleare A, Pariante CM, Young AH, Anderson IM, Christmas D, Cowen PJ, et al. Evidence-based guidelines for treating depressive disorders with antidepressants: a revision of the 2008 British Association for Psychopharmacology guidelines. J Psychopharmacol. 2015;29(5):459–525. https://doi.org/10.1177/0269881115581093.
Gelenberg A, Freeman M, Markowitz J, et al. American Psychiatric Association practice guidelines for the treatment of patients with major depressive disorder. Am J Psychiatry. 2010;167(Suppl. 10):9–118.
Lam RW, Kennedy SH, Grigoriadis S, McIntyre R, Milev R, Ramasubbu R, et al. Canadian Network for Mood and Anxiety Treatments (CANMAT) clinical guidelines for the management of major depressive disorder in adults.: III. Pharmacotherapy. J Affect Disord. 2009;117:S26–43. https://doi.org/10.1016/j.jad.2009.06.041.
Malhi GS, Bassett D, Boyce P, Bryant R, Fitzgerald PB, Fritz K, et al. Royal Australian and New Zealand College of Psychiatrists clinical practice guidelines for mood disorders. Aust N Z J Psychiatry. 2015;49(12):1087–206. https://doi.org/10.1177/0004867415617657.
The National Institute for Health and Care Excellence (NICE). Depression in adults: recognition and management. Clinical guideline [CG90] Published date: October 2009. Last updated: April 2018. Available at: https://www.nice.org.uk/guidance/cg90. [Accessed November 6, 2020]
Hirsch M, Birnbaum RJ. Switching antidepressant medications in adults, 2017. Available at: http://www.uptodate.com/index. [Accessed Nobember 6, 2020]
Furukawa TA, Salanti G, Atkinson LZ, Leucht S, Ruhe HG, Turner EH, et al. Comparative efficacy and acceptability of first-generation and second-generation antidepressants in the acute treatment of major depression: protocol for a network meta-analysis. BMJ Open. 2016;6(7):e010919. https://doi.org/10.1136/bmjopen-2015-010919.
Harmer CJ, Duman RS, Cowen PJ. How do antidepressants work? New perspectives for refining future treatment approaches. Lancet Psychiatry. 2017;4(5):409–18. https://doi.org/10.1016/S2215-0366(17)30015-9.
James GM, Baldinger-Melich P, Philippe C, et al. Effects of selective serotonin reuptake inhibitors on interregional relation of serotonin transporter availability in major depression. Front Hum Neurosci. 2017;11:48.
Andrews PW, Bharwani A, Lee KR, Fox M, Thomson JA Jr. Is serotonin an upper or a downer? The evolution of the serotonergic system and its role in depression and the antidepressant response. Neurosci Biobehav Rev. 2015;51:164–88. https://doi.org/10.1016/j.neubiorev.2015.01.018.
Warren JB. The trouble with antidepressants: why the evidence overplays benefits and underplays risks—an essay by John B Warren. BMJ. 2020;370:m3200.
Chávez-Castillo M, Nuñez V, Nava M, et al. Depression as a neuroendocrine disorder: emerging neuropsychopharmacological approaches beyond monoamines. Adv Pharmacol Pharmaceut Sci. 2019;2019:1–20. https://doi.org/10.1155/2019/7943481.
Hinz M, Stein A, Uncini T. The discrediting of the monoamine hypothesis. Int J Gen Med. 2012;5:135–42. https://doi.org/10.2147/IJGM.S27824.
Albert PR, Benkelfat C, Descarries L. The neurobiology of depression—revisiting the serotonin hypothesis. I. Cellular and molecular mechanisms. Philos Trans R Soc Lond B Biol Sci. 2012;367(1601):2378–81. https://doi.org/10.1098/rstb.2012.0190.
Jakobsen JC, Katakam KK, Schou A, et al. Selective serotonin reuptake inhibitors versus placebo in patients with major depressive disorder. A systematic review with meta-analysis and trial sequential analysis. BMC Psychiatry. 2017;17(1):58.
Hengartner MP, Jakobsen JC, Sorensen A, et al. Efficacy of new-generation antidepressants assessed with the Montgomery-Asberg Depression Rating Scale, the gold standard clinician rating scale: a meta-analysis of randomised placebo-controlled trials. PLoS One. 2019;15(2):e0229381. https://doi.org/10.1371/journal.pone.0229381 [Epub ahead of print].
Hengartner MP, Plöderl M. Estimates of the minimal important difference to evaluate the clinical significance of antidepressants in the acute treatment of moderate-to-severe depression. BMJ Evid Based Med. 2021:bmjebm-2020-111600. https://doi.org/10.1136/bmjebm-2020-111600 [Epub ahead of print].
Hieronymus F, Hieronymus M, Nilsson S, et al. Individual variability in treatment response to antidepressants in major depression: comparing trial-level and patient-level analyses. Acta Psychiatr Scand. 2020. https://doi.org/10.1111/ACPS.13205 [Pre-proof].
Ploderl M, Hengartner MP. What are the chances for personalised treatment with antidepressants? Detection of patient-by-treatment interaction with a variance ratio meta-analysis. BMJ Open. 2019;9(12):e034816. https://doi.org/10.1136/bmjopen-2019-034816.
Munkholm K, Winkelbeiner S, Homan P. Individual response to antidepressants for depression in adults-a meta-analysis and simulation study. PLoS One. 2020;15(8):e0237950. https://doi.org/10.1371/journal.pone.0237950.
Paludan-Muller AS, Sharma T, Rasmussen K, et al. Extensive selective reporting of quality of life in clinical study reports and publications of placebo-controlled trials of antidepressants. Int J Risk Saf Med. 2020. https://doi.org/10.3233/JRS-200051 [Pre-proof].
Higgins J, Thomas J, Chandler J, et al. Cochrane handbook for systematic reviews of interventions version 6.0 (updated July 2019). Cochrane, 2019. Available at: https://training.cochrane.org/cochrane-handbook-systematic-reviews-interventions [Accessed November, 6 2020]
Wieseler B, Kerekes MF, Vervoelgyi V, McGauran N, Kaiser T. Impact of document type on reporting quality of clinical drug trials: a comparison of registry reports, clinical study reports, and journal publications. BMJ. 2012;344(jan03 1):d8141. https://doi.org/10.1136/bmj.d8141.
de Vries YA, Roest AM, Beijers L, Turner EH, de Jonge P. Bias in the reporting of harms in clinical trials of second-generation antidepressants for depression and anxiety: a meta-analysis. Eur Neuropsychopharmacol. 2016;26(11):1752–9. https://doi.org/10.1016/j.euroneuro.2016.09.370.
Hengartner MP, Ploderl M. Newer-generation antidepressants and suicide risk in randomized controlled trials: a re-analysis of the FDA database. Psychother Psychosom. 2019;88(4):247–8. https://doi.org/10.1159/000501215.
Stone M, Laughren T, Jones ML, Levenson M, Holland PC, Hughes A, et al. Risk of suicidality in clinical trials of antidepressants in adults: analysis of proprietary data submitted to US Food and Drug Administration. BMJ. 2009;339(aug11 2):b2880. https://doi.org/10.1136/bmj.b2880.
Khan A, Mar KF, Gokul S, et al. Decreased suicide rates in recent antidepressant clinical trials. Psychopharmacol. 2018;235(5):1455–62. https://doi.org/10.1007/s00213-018-4856-1.
Cipriani A, Furukawa TA, Salanti G, Chaimani A, Atkinson LZ, Ogawa Y, et al. Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis. Lancet. 2018;391(10128):1357–66. https://doi.org/10.1016/S0140-6736(17)32802-7.
Moher D, Shamseer L, Clarke M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Sys Rev. 2015;4(1):1. https://doi.org/10.1186/2046-4053-4-1.
Shamseer L, Moher D, Clarke M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;349:7647.
Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry. 1960;23(1):56–62. https://doi.org/10.1136/jnnp.23.1.56.
International conference on harmonisation of technical requirements for registration of pharmaceuticals for human use. ICH harmonised guideline: integrated addemdum to ICH E6(R1): guideline for good clinical practice (ICH-GCP). 2015. Available at: https://ichgcp.net/ [Accessed November 6, 2020]
Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. 1979;134(4):382–9. https://doi.org/10.1192/bjp.134.4.382.
Beck AT, Steer RA, Brown GK. Beck depression inventory-II. San Antonio: Psychological Corporation. 1996;78(2):490–8.
Timmerby N, Andersen JH, Søndergaard S, Østergaard SD, Bech P. A systematic review of the clinimetric properties of the 6-item version of the Hamilton Depression Rating Scale (HAM-D6). Psychother Psychosom. 2017;86(3):141–9. https://doi.org/10.1159/000457131.
Maund E, Tendal B, Hróbjartsson A, et al. Benefits and harms in clinical trials of duloxetine for treatment of major depressive disorder: comparison of clinical study reports, trial registries, and publications. BMJ. 2014;348.
StataCorp. Stata Statistical Software: release 16 2019 [College Station, TX: StataCorp LLC http://www.stata.com]. Accessed 21 May 2021.
Copenhagen Trial Unit. TSA - trial sequential analysis. Available at: http://www.ctu.dk/tsa/ [Accessed November 6, 2020]
Thorlund K, Engstrøm J, Wetterslev J, et al. User manual for trial sequential analysis (TSA). Available at: http://www.ctu.dk/tsa/files/tsa_manual.pdf. [Accessed November 6, 2020]
Jakobsen JC, Gluud C, Wetterslev J, Winkel P. When and how should multiple imputation be used for handling missing data in randomised clinical trials–a practical guide with flowcharts. BMC Med Res Methodol. 2017;17(1):162. https://doi.org/10.1186/s12874-017-0442-1.
Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21(11):1539–58. https://doi.org/10.1002/sim.1186.
Higgins JP, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. 2003;327(7414):557–60. https://doi.org/10.1136/bmj.327.7414.557.
Harbord RM, Egger M, Sterne JA. A modified test for small-study effects in meta-analyses of controlled trials with binary endpoints. Stat Med. 2006;25(20):3443–57. https://doi.org/10.1002/sim.2380.
Egger M, Smith GD, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315(7109):629–34. https://doi.org/10.1136/bmj.315.7109.629.
Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics. 1994;50(4):1088–101. https://doi.org/10.2307/2533446.
Elbourne DR, Altman DG, Higgins JP, et al. Meta-analyses involving cross-over trials: methodological issues. Int J Epidemiol. 2002;31(1):140–9. https://doi.org/10.1093/ije/31.1.140.
Keus F, Wetterslev J, Gluud C, van Laarhoven CJHM. Evidence at a glance: error matrix approach for overviewing available evidence. BMC Med Res Methodol. 2010;10(1):90. https://doi.org/10.1186/1471-2288-10-90.
Jakobsen JC, Wetterslev J, Winkel P, Lange T, Gluud C. Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods. BMC Med Res Methodol. 2014;14(1):120. https://doi.org/10.1186/1471-2288-14-120.
IntHout J, Ioannidis JPA, Borm GF. The Hartung-Knapp-Sidik-Jonkman method for random effects meta-analysis is straightforward and considerably outperforms the standard DerSimonian-Laird method. BMC Med Res Methodol. 2014;14(1):25. https://doi.org/10.1186/1471-2288-14-25.
DeMets DL. Methods for combining randomized clinical trials: strengths and limitations. Stat Med. 1987;6(3):341–8. https://doi.org/10.1002/sim.4780060325.
Wetterslev J, Thorlund K, Brok J, Gluud C. Trial sequential analysis may establish when firm evidence is reached in cumulative meta-analysis. J Clin Epidemiol. 2008;61(1):64–75. https://doi.org/10.1016/j.jclinepi.2007.03.013.
Brok J, Thorlund K, Gluud C, Wetterslev J. Trial sequential analysis reveals insufficient information size and potentially false positive results in many meta-analyses. J Clin Epidemiol. 2008;61(8):763–9. https://doi.org/10.1016/j.jclinepi.2007.10.007.
Brok J, Thorlund K, Wetterslev J, Gluud C. Apparently conclusive meta-analyses may be inconclusive—trial sequential analysis adjustment of random error risk due to repetitive testing of accumulating data in apparently conclusive neonatal meta-analyses. Int J Epidemiol. 2008;38(1):287–98. https://doi.org/10.1093/ije/dyn188.
Thorlund K, Devereaux P, Wetterslev J, Guyatt G, Ioannidis JP, Thabane L, et al. Can trial sequential monitoring boundaries reduce spurious inferences from meta-analyses? Int J Epidemiol. 2008;38(1):276–86. https://doi.org/10.1093/ije/dyn179.
Wetterslev J, Thorlund K, Brok J, Gluud C. Estimating required information size by quantifying diversity in random-effects model meta-analyses. BMC Med Res Methodol. 2009;9(1):86. https://doi.org/10.1186/1471-2288-9-86.
Thorlund K, Anema A, Mills E. Interpreting meta-analysis according to the adequacy of sample size. An example using isoniazid chemoprophylaxis for tuberculosis in purified protein derivative negative HIV-infected individuals. Clin Epidemiol. 2010;2:57.
Imberger G, Thorlund K, Gluud C, Wetterslev J. False-positive findings in Cochrane meta-analyses with and without application of trial sequential analysis: an empirical review. BMJ Open. 2016;6(8):e011890. https://doi.org/10.1136/bmjopen-2016-011890.
Lundh A, Lexchin J, Mintzes B, Schroll JB, Bero L. Industry sponsorship and research outcome: systematic review with meta-analysis. Intensive Care Med. 2018;44(10):1603–12. https://doi.org/10.1007/s00134-018-5293-7.
Guyatt GH, Oxman AD, Vist GE, Kunz R, Falck-Ytter Y, Alonso-Coello P, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008;336(7650):924–6. https://doi.org/10.1136/bmj.39489.470347.AD.
Guyatt GH, Oxman AD, Schünemann HJ, Tugwell P, Knottnerus A. GRADE guidelines: a new series of articles in the Journal of Clinical Epidemiology. J Clin Epidemiol. 2011;64(4):380–2. https://doi.org/10.1016/j.jclinepi.2010.09.011.
Schünemann HJ, Best D, Vist G, Oxman AD, GRADE Working Group. Letters, numbers, symbols and words: how to communicate grades of evidence and recommendations. Can Med Assoc J. 2003;169(7):677–80.
Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Syst Rev. 2021;10(1):89. https://doi.org/10.1186/s13643-021-01626-4.
Page MJ, Moher D, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021;372:n160.
The expert help from Sarah Louise Klingenberg (Information Specialist, The Cochrane Hepato-Biliary Group, Copenhagen Trial Unit, Copenhagen, Denmark) in making the search strategy is hugely appreciated.
This protocol has not received any funding.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Juul, S., Siddiqui, F., Barbateskovic, M. et al. Beneficial and harmful effects of antidepressants versus placebo, ‘active placebo’, or no intervention for adults with major depressive disorder: a protocol for a systematic review of published and unpublished data with meta-analyses and trial sequential analyses. Syst Rev 10, 154 (2021). https://doi.org/10.1186/s13643-021-01705-6