- Open Access
FLT3 inhibitors in acute myeloid leukaemia: assessment of clinical effectiveness, adverse events and future research—a systematic review and meta-analysis
Systematic Reviews volume 9, Article number: 285 (2020)
FMS-like tyrosine kinase 3 (FLT3) is the most frequent mutation in AML. With two FLT3 inhibitors recently approved by the FDA (midostaurin and gilteritinib), there is a need to evaluate these targeted agents.
To assess the clinical effectiveness of FLT3 inhibitors in AML patients.
Standard systematic review methods were utilised. Searches were conducted to July 2020 for completed and in-progress randomised controlled trials of FLT3 inhibitors in AML. A fixed-effect meta-analysis was undertaken.
Eight completed trials involving 2656 patients and assessing five different FLT3 inhibitors (sorafenib, lestaurtinib, midostaurin, gilteritinib and quizartinib) were included. The pooled results were as follows (FLT3 inhibitor/control): overall survival hazard ratio (HR) = 0.83 (95% confidence interval [CI] 0.75 to 0.92, p = 0.0005), event-free survival HR = 0.85 (95% CI 0.77 to 0.94, p = 0.002), relapse-free survival HR = 0.76 (95% CI 0.64 to 0.90, p = 0.001), complete remission relative risk (RR) = 1.11 (95% CI 1.00 to 1.22. p = 0.05) and 60-day mortality RR = 1.04 (95% CI 0.77 to 1.40, p = 0.79).
Relative risk of grade 3 and above vascular, dermatological, respiratory and hepatobiliary adverse events were found to be statistically significantly higher in the FLT3 inhibitor group compared to control, but the actual numbers of events were relatively small. Nineteen ongoing trials are still in progress, only one of which specifically targets older patients with AML.
There is evidence to support the use of FLT3 inhibitors in patients with AML, but more data is needed to verify the optimum use of the drugs regarding type of inhibitor, disease stage and patient characteristics, not only in relation to disease control, but adverse events and quality of life. There are a large number of ongoing trials; therefore, the results of this review are not a fait accompli; thus, is it recommended that the review be updated in a couple of years’ time. Given the challenges in extracting the complete data set required to assess clinical effectiveness, it is highly recommended that ongoing and future trials improve transparency and consistency of reporting of all trial outcomes, particularly disease control and adverse events, to enable a global clinical effectiveness assessment.
Systematic review registration
With an incidence of over 20,000 in the USA, acute myeloid leukaemia (AML) is the most common acute leukaemia in adults [1, 2]. Prognosis is poor, with 5-year overall survival (OS) of 40% in patients 60 years or younger and just 20% in patients aged over 60 years .
FMS-like tyrosine kinase 3 (FLT3) is the most frequent mutation in AML, found in one-third of patients with de novo AML. FLT3 internal tandem duplications (ITD) occur in almost a quarter of newly diagnosed cases of AML and approximately 7% of FLT3 mutations are tyrosine kinase domain (TKD) point mutations [4, 5]. Patients with FLT3 mutations, particularly FLT3-ITD, have a poor prognosis with an increased risk of relapse . FLT3 inhibitors are tyrosine kinase inhibitors and are classified into first- and second-generation inhibitors based on their kinase specificity and potency. First-generation inhibitors include midostaurin and sorafenib. Second-generation inhibitors include quizartinib and gilteritinib, which are thought to be more potent and have less off-target effects. A number of different FLT3 inhibitors have been developed in the last twenty years and trialled in various treatment stages (remission induction, consolidation, maintenance, and in a relapsed/refractory setting) with varying evidence of efficacy, and there are also a number of FLT3 inhibitors undergoing assessment in ongoing trials. Two FLT3 inhibitors (midostaurin and gilteritinib) have recently been approved by the U.S. Food and Drug Administration (FDA) for use in patients with FLT3-mutated AML [7, 8]. Therefore, it is timely to undertake a systematic review and meta-analysis to assess the clinical effectiveness of FLT3 inhibitors in the treatment of patients with AML.
To investigate the clinical effectiveness (including survival, disease response and adverse events) of FLT3 inhibitors in the treatment of patients with AML.
Standard systematic review methods were employed and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines . The review was based on an a priori protocol registered on PROSPERO (CRD42017055581) .
Study inclusion criteria
The review included randomised controlled trials (RCTs) assessing the clinical effectiveness of any FLT3 inhibitor in patients of all ages with any type of AML. Comparators could include either standard care, another experimental comparator or placebo.
Search strategy and study selection
A search of the bibliographic databases MEDLINE, EMBASE and MEDLINE In-Process (Ovid platform) was undertaken from 2000 to July 2020 using keywords such as ‘acute myeloid leukaemia’, ‘AML’, ‘fms-like tyrosine kinase 3’ and ‘FLT3’ (Supplementary File 1). The Ovid best balance filter of sensitivity and specificity was used, and no language restrictions were applied. Reference lists from included studies were citation-checked. Conference proceedings and ongoing trial databases were also searched between 2015 and July 2020 for completeness and to reduce the risk of publication and reporting bias. Study selection was undertaken by two reviewers independently with disagreements resolved by discussion.
Data extraction and quality assessment
Data were extracted onto standardised and piloted forms. Relevant outcomes included OS, event-free survival (EFS), relapse-free survival (RFS), complete remission (CR) (including its variants complete remission with incomplete recovery [CRi], complete remission with incomplete platelet recovery [CRp] and overall response rate [ORR = CR + CRi/CRp]), early death, 30-day mortality, 60-day mortality, treatment-related mortality and adverse events (AEs). The Cochrane Risk of Bias Tool for RCTs was used for quality assessment . Data extraction and quality assessment were based on information from published studies, conference abstracts, protocols and contact with study authors where relevant and undertaken by one reviewer (SM) with a second reviewer checking (DA/JW/SS).
Ongoing trials and conference abstracts were tabulated with trial characteristics, completion dates and results, where available.
Time-to-event data were extracted based on methods from Tierney  and Parmar : hazard ratios (HR), risk ratios (RR), 95% confidence intervals (CI), p values and survival proportions/events were used to calculate observed minus expected data and variance. Data were extracted from Kaplan-Meier curves where appropriate.
A fixed-effect meta-analysis was undertaken using Review Manager Version 5.4 . Main analyses were based on uncensored data. Treatment effect was measured using HR or RR as appropriate; a p value of less than 0.05 indicated statistical significance. Heterogeneity of treatment effect was measured using the χ2 and I2 statistics and was explored using subgroup analysis, utilising the test for subgroup differences. Publication bias was tested for the outcome OS. Subgroup analyses investigated the effects of FLT3 inhibitor type, FLT3 mutation, disease stage, and age on OS, EFS and RFS, where data was available, with a sensitivity analysis undertaken for censored populations. Censoring was defined from the point when patients underwent stem cell transplantation (SCT). In trials lacking EFS data, EFS was calculated (RFS + [N-CR], where N represents the total number of patients).
Grade 3 and above AEs as defined by the National Cancer Institute Common Toxicity Criteria (CTC version 3.0) were grouped by biological system and tabulated. Where more than one trial reported an AE, these were pooled using fixed-effect meta-analysis.
Nineteen ongoing trials potentially involving 8429 patients also met the inclusion criteria (Supplementary Table 1). A search of conference proceedings found eight abstracts of six trials (Supplementary Table 2), most of which were trial protocols, but two included primary and final trial results (SORMAIN  and RADIUS ).
All eight completed trials were multicentre; five were phase III [17, 19,20,21] and three phase II [15, 16, 18] trials. Five FLT3 inhibitors were investigated: sorafenib in two trials [15, 16], lestaurtinib in three [17, 18] and midostaurin , quizartinib  and gilteritinib  in one trial each. As part of their eligibility criteria, six trials required patients to have a FLT3 mutation [17,18,19,20,21].
Median follow-up was reported in seven trials and ranged from 17.8 to 59 months [15,16,17, 19,20,21]. Two trials included patients aged 18–60 years [15, 19] and one trial exclusively recruited patients aged over 60 years . The remaining five trials included a small number of patients outside of their eligibility criteria, which was patients aged under 60 years for two trials  and aged 18 years or over for three trials [18, 20, 21].
An abridged table of characteristics can be found in Table 1, with full details in Supplementary Table 3. Of note, one paper reported two separate lestaurtinib trials ; in both, trial participants were recruited from two larger trials (AML15 and AML17) [24, 25], where patients were treated with a variety of chemotherapy treatments. Patients from AML15  and AML17  were eligible for lestaurtinib randomisation if they had a FLT3 mutation. Patients from AML15 received intensive chemotherapy with or without lestaurtinib and patients from AML17 received intensive chemotherapy with lestaurtinib or placebo . Full chemotherapy and treatment schedules for all included trials are detailed in Supplementary Table 4.
The pooled HR, based on data from eight trials [15,16,17,18,19,20,21], favoured FLT3 inhibitors and was statistically significant (HR = 0.83, 95% CI 0.75 to 0.92, p = 0.0005, Fig. 2). There was moderate heterogeneity of treatment effect between trials (I2 = 32%). Use of midostaurin  showed a statistically significant increase in OS in newly diagnosed patients (HR = 0.78, 95% CI 0.63 to 0.96, p = 0.02), and both gilteritinib  and quizartinib  showed a statistically significant increase in OS in relapsed/refractory patients (HR = 0.64, 95% CI 0.49 to 0.83, p = 0.0009; and HR = 0.76, 95% CI 0.58 to 0.99, p = 0.04, respectively). No statistically significant improvement in OS was found with use of sorafenib or lestaurtinib. χ2 test for subgroup differences was not significant (χ2 = 8.68, p = 0.07, Fig. 2), but an I2 value of 53.9% suggests there was some subgroup differences between the FLT3 agents. A funnel plot of all eight studies did not indicate publication bias (Supplementary Figure 1).
Response data and RFS data were used to estimate EFS in two lestaurtinib trials  that did not report EFS. One trial of lestaurtinib lacked both EFS and RFS data therefore estimation was not possible . Pooling of data from seven trials [15,16,17, 19,20,21], including two for which EFS data was estimated , produced a statistically significant EFS in favour of FLT3 inhibitor group (HR = 0.85, 95% CI 0.77 to 0.94, p = 0.002, Fig. 3). EFS from five trials which directly reported EFS [15, 16, 19,20,21] was very similar (HR = 0.84, 95% CI 0.75 to 0.94, p = 0.002, Supplementary Figure 2). Overall heterogeneity was moderate for estimated EFS (seven trials: I2 = 46%) and substantial for reported EFS (five trials: I2 = 62%). Regarding individual FLT3 inhibitors, only midostaurin showed a benefit (HR = 0.78, 95% CI 0.66 to 0.93, p = 0.005, Fig. 3) , but the test for subgroup differences was not significant (χ2 = 2.36, p = 0.67, I2 = 0%).
RFS was improved with the use of FLT3 inhibitors based on pooled data from four trials [15, 17, 19] (HR = 0.76, 95% CI 0.64 to 0.90, p = 0.001) with low heterogeneity (I2 = 16%, Supplementary Figure 3). Sorafenib and midostaurin showed statistically significant benefit in RFS. χ2 test for subgroup differences between the different FLT3 inhibitors was not statistically significant (χ2 = 3.50, p = 0.17), but I2 at 42.9% suggested a moderate difference of effect between different FLT3 agents.
Pooling data on CR from six trials [15, 16, 18,19,20,21] (of all FLT3 inhibitors) was borderline in favour of FLT3 inhibitors (RR = 1.11, 95% CI 1.00 to 1.22, p = 0.05, Supplementary Figure 4) with substantial heterogeneity (I2 = 63%). Six trials [16,17,18, 20, 21] (one sorafenib, three lestaurtinib, one gilteritinib and one quizartinib) were pooled using ORR data (i.e. CR + CRi or CR + CRp) with results statistically significantly in favour of FLT3 inhibitor (RR = 1.20, 95% CI 1.10 to 1.29, p < 0.00001, Supplementary Figure 5), although with considerable heterogeneity (I2 = 95%). Both analyses (CR and ORR) tested positive for subgroup differences.
Subgroup analyses based on disease stage resulted in a statistically significant effect in OS in favour of FLT3 inhibitor for patients with newly diagnosed or secondary disease [15,16,17, 19] and relapsed/refractory disease [18, 20, 21] (HR = 0.86, 95% CI 0.75 to 0.99, p = 0.03, I2 = 0%, 5 trials; and HR = 0.80, 95% CI 0.69 to 0.93, p = 0.005, I2 = 74%, 3 trials, respectively; Supplementary Figure 6). Test for subgroup differences was not significant (χ2 = 0.49, p = 0.49).
Estimated EFS was found to be statistically significant and in favour of FLT3 inhibitors for newly diagnosed or secondary disease [15,16,17, 19], but not in relapsed/refractory disease [20, 21] (HR = 0.85, 95% CI 0.76 to 0.96, p = 0.007, I2 = 63%, 5 trials; and HR = 0.86, 95% CI 0.70 to 1.04, p = 0.12, I2 = 0%, 2 trials, respectively; Supplementary Figure 7). Test for subgroup differences was not statistically significant (χ2 = 0.00, p = 0.99).
No outcome data for RFS was provided for the subgroup of relapsed patients.
Three subgroups were categorised by age: 18–60 years, greater than 60 years and 18 years to unspecified. Patients aged 18–60 years [15, 17, 19] and older than 18 years [18, 20, 21] had statistically significantly improved OS (HR = 0.83, 95% CI 0.72 to 0.97, p = 0.02, I2 = 0%, 4 trials; and HR = 0.80, 95% CI 0.69 to 0.93, p = 0.005, I2 = 74%, 3 trials, respectively; Supplementary Figure 8), unlike patients older than 60 years  (HR = 1.03, 95% CI 0.73 to 1.45, p = 0.87, 1 trial). The test for subgroup differences was not statistically significant (χ2 = 1.75, p = 0.42, Supplementary Figure 8).
EFS in younger patients (18-60 years) also favoured FLT3 inhibitors (HR = 0.80, 95% CI 0.71 to 0.90, p = 0.0004, I2 = 0%, 4 trials, Supplementary Figure 9), but for 18 years to unspecified, which included older patients within the cohort [20, 21], this result was not statistically significant (HR = 0.86, 95% CI 0.70 to 1.04, p = 0.12, I2 = 0%, 2 trials). There was only one trial where only patients aged over 60 years were recruited , and the point estimate favoured control although this was not statistically significant (HR = 1.26, 95% CI 0.94 to 1.69, p = 0.13, Supplementary Figure 9). Heterogeneity was moderate (I2 = 46%) and test for subgroup differences was statistically significant (χ2 = 7.77, p = 0.02).
RFS data was not available for patients older than 60 years.
Pooled analyses for patients with FLT3 mutations (i.e. FLT3-ITD, FLT3-TKD, etc.) were not possible given the limited data and inconsistencies in reporting, however individual trial results are shown in Supplementary Tables 7-15.
Within included trials, patients were censored at the point of SCT. Six trials reported censored data for OS [15, 17,18,19,20,21] (HR = 0.82, 95% CI 0.73 to 0.93, p = 0.002, Supplementary Figure 10), which was in keeping with the results for the uncensored population; however, heterogeneity increased from an I2 of 32 to 44% and test for subgroup differences was borderline significant.
Only one sorafenib trial reported censored data for both EFS and RFS and again trends were in keeping with the pooled uncensored findings which were statistically significantly in favour of FLT3 inhibitor group (EFS, HR = 0.64, 95% CI 0.45 to 0.91, p = 0.01, and RFS, HR = 0.53, 95% CI 0.30 to 0.93, p = 0.03, data not shown) .
Early mortality and adverse events
Mortality data from each study was pooled (six trials reporting 60-day mortality [15,16,17, 19, 21] and one trial reporting 30-day mortality ). The actual number of deaths was low (88 in the FLT3 inhibitor group and 67 in the control group) and there was no statistically significant difference observed between FLT3 inhibitor group and control group (RR = 1.04, 95% CI 0.77 to 1.40, p = 0.79, Fig. 4). A moderate/substantial amount of heterogeneity was present (I2 = 68%) and test for subgroup differences was highly significant (χ2 = 17.41, p = 0.0006, Fig. 4). All mortality data is given in Supplementary Figure 11. Two trials [15, 16] reporting ‘early death’ (which was not defined and undistinguishable from 30- and 60-day mortality) showed a statistically significant increased risk in mortality with FLT3 inhibitor use compared to control (RR = 2.20, 95% CI 1.04 to 4.65, p = 0.04, I2 = 0%, Supplementary Figure 11). There was no evidence of effect for the use of FLT3 inhibitors in the reduction of 30-day or 60-day mortality (RR = 0.95, 95% CI 0.58 to 1.57, p = 0.85, I2 = 73%, 5 trials; and RR = 0.96, 95% CI 0.70 to 1.32, p = 0.80, I2 = 70%, 6 trials, respectively, Supplementary Figure 11). One trial reported treatment-related mortality (RR = 1.32, 95% CI 0.30 to 5.80, p = 0.71), although this was not statistically significant and numbers of events were low (4 events in sorafenib and 3 events in control group, data not shown) .
Grade 3 and above AEs are presented in Table 2 from seven trials [15,16,17, 19,20,21]. Reporting of AEs was extremely inconsistent between trials. Though percentages of some AEs were high, rates were similar between FLT3 inhibitor arms and their respective control groups. Pooling of AE data was undertaken according to biological system (Supplementary Figures 12-19). AEs from AML15 and AML17 were not reported separately in the primary lestaurtinib publication  and therefore have remained combined in the meta-analyses (shown as AML17 on data plots).
Statistically significant increases were seen in vascular AEs overall with use of FLT3 inhibitors (RR = 1.52, 95% CI 1.05 to 2.19, p = 0.02, I2 = 0%, Supplementary Figure 12) though individual vascular AEs (e.g. cardiac event, hypertension, etc.) were no different between FLT3 inhibitor and control groups. Dermatological AEs were also statistically significantly increased in FLT3 inhibitor group versus control group (RR = 1.55, 95% CI 1.28 to 1.87, p < 0.00001, I2 = 36%, Supplementary Figure 13) including all subgroups (alopecia p = 0.04, hand-foot-skin reaction p = 0.01, and rash, desquamation, petechiae, itching p = 0.0005). There were statistically significantly more respiratory AEs in FLT3 inhibitor group versus control group (RR = 1.25, 95% CI 1.03 to 1.52, p = 0.02, I2 = 0%, Supplementary Figure 14). Hepatobiliary AEs were also statistically significantly increased in the FLT3 inhibitor group compared to control group (RR = 1.98, 95% CI 1.46 to 2.69, p < 0.0001, I2 = 68%, Supplementary Figure 15). Overall, haematological AEs, metabolism and nutritional disorders, gastrointestinal disorders and constitutional symptoms were not significantly different between FLT3 inhibitor and control groups (RR = 1.03, 95% CI 1.00 to 1.07, p = 0.06, I2 = 51%, Supplementary Figure 16; RR = 1.10, 95% CI 0.86 to 1.41, p = 0.46, I2 = 17%, Supplementary Figure 17; RR = 1.07, 95% CI 0.88 to 1.29, p = 0.51, I2 = 35%, Supplementary Figure 18; RR = 1.05, 95% CI 0.95 to 1.17, p = 0.35, I2 = 0%, Supplementary Figure 19, respectively). Generally, the actual numbers of events were low across FLT3 inhibitor and control groups.
Risk of bias
The overall quality of included trials was good to moderate, particularly with regards to random sequence generation and allocation concealment. Attrition bias was noted in both sorafenib trials [15, 16] whereby not all randomised patients were included in the intention-to-treat analyses; however, the numbers excluded were small. Outcome reporting bias was identified in two trials [15, 18] where outcomes stated in the protocol/methods were not reported and/or outcomes not predefined in protocol/methods were reported (Supplementary Figures 20 and 21).
Nineteen ongoing RCTs were identified, with predicted study completion dates ranging from 2018 to 2028. Of these, three are investigating sorafenib, five midostaurin, five gilteritinib and one quizartinib. All of the sorafenib trials involve patients with newly diagnosed AML. Two (NCT01371981, NCT03164057) are multi-arm and in younger people aged below 29 and 21 years, with an estimated sample size of 1750 and 200, respectively. The third (ACTRN12611001112954) involves patients aged 15–65 years and aims to recruit 99 patients. Three midostaurin trials involve patients with newly diagnosed AML. All involve adults, with one (NCT03092674) aiming to recruit patients older than 60 years. Of the five gilteritinib trials, one is newly diagnosed AML patients aged under 22 years; the remaining are in adult populations, one each in newly diagnosed, at first complete response, post SCT and relapsed and refractory populations. The ongoing quizartinib trial is in adults with newly diagnosed disease. The remaining ongoing trials are investigating: crenolanib (3 trials), ponatinib (1 trial) and nintedanib (1 trial), with estimated sample sizes ranging from 9 to 510 participants (Supplementary Table 1).
Eight conference abstracts not related to the trials included in the main part of this review were identified. Six were protocols for ongoing trials. Two conference abstracts gave results, both were small trials. The RADIUS  trial randomised 60 patients aged between 18 and 70 years to midostaurin or standard care post SCT. RFS was 89% versus 79%, respectively at 18 months. The SORMAIN  trial assessed sorafenib also as maintenance therapy following allogeneic SCT in 83 patients. Median RFS was not reached in the sorafenib group and was 30.9 months in the placebo group, with a 2-year RFS of 85.0% and 53.3%, respectively (HR = 0.39, p = 0.0139, Supplementary Table 2).
The aim of this systematic review was to assess the clinical effectiveness of FLT3 inhibitors, as a class of drugs, in patients with any type of AML. Eight completed RCTs and 19 ongoing trials were identified from the review searches. Three FLT3 inhibitors, sorafenib, lestaurtinib and midostaurin, were first-generation and two, gilteritinib and quizartinib, were second-generation inhibitors. In five trials [15,16,17, 19], patients had a mix of primary, de novo and secondary AML, with FLT3 inhibitors given as part of induction and/or consolidation treatment in three trials [16, 17] plus maintenance up to 12 months in two [15, 19]. Three trials included patients with relapsed/refractory disease [18, 20, 21]. There was a wide age range within the trials with four trials [15, 17, 19] aiming to recruit patients aged 18 to 60 years, one over 60 years  and three recruiting patients aged 18 years or older with no upper age limit specified [18, 20, 21]. All but the two sorafenib trials required patients to have a FLT3 mutation [17,18,19,20,21].
The pooled HR for OS for all eight trials in the uncensored population showed a 17% benefit in survival, which was statistically significant [15,16,17,18,19,20,21]. A moderate level of heterogeneity was observed, which was explored using subgroup analysis. For type of FLT3 inhibitor, subgroup analysis indicated a slight difference of effect between inhibitors, with χ2 at 8.68 (p = 0.07) and I2 at 53.9%. Whilst all of the FLT3 inhibitors showed benefit as demonstrated by their subgroup point estimates, HRs for sorafenib and lestaurtinib were not statistically significant [15,16,17,18]. Interestingly, when censored data was pooled, whilst OS remained statistically significant, the sorafenib subgroup pooled estimate favoured control and of those favouring FLT3 only, gilteritinib remained statistically significant. This could be due to a clinical effect with SCT enhancing the survival benefit of FLT3 inhibitors whether in synergy or as an additional therapy, or it could be due to a statistical anomaly due to loss of power in the censored analysis. Subgroup analyses comparing disease stage—newly diagnosed or secondary AML versus relapsed or refractory disease—found no difference between subgroups, with both showing a benefit in favour of FLT3. However, there was substantial heterogeneity within the relapsed and refractory group (I2 = 74%), with lestaurtinib showing no benefit. With four ongoing trials in patients with relapsed disease, more data may establish if there is a benefit in relapsed/refractory patients. For subgroup analyses investigating age, a favourable HR for FLT3 inhibitors was demonstrated for patients younger than 60 years (HR = 0.83, 95% CI 0.72 to 0.97) and also in the group with the very wide age range from 18 years to unspecified (HR = 0.80, 95% CI 0.69 to 0.93). In the trial which excluded patients under 60 years, a non-significant difference in favour of control was observed (HR = 1.03, 95% CI 0.73 to 1.45). The test for subgroup differences did not indicate a subgroup difference, but heterogeneity was substantial in the 18 years to unspecified group suggesting that age may have a role in the effectiveness of FLT3 inhibitors as this grouping included patients in their eighties. Just one ongoing trial (NCT03092674; suspended June 2020) is specifically aimed at patients over 60 years, so there is a need to define and consider age in future trials.
Ultimately, survival improvements are the goal of therapy for patients with AML, but disease control is also an important aspect of care. EFS, RFS and response data can help to gauge disease control and the risk of relapse, which for patients with FLT3-ITD mutations is substantial. Both EFS (HR = 0.85, 95% CI 0.77 to 0.94) and RFS (HR = 0.76, 95% CI 0.64 to 0.90) suggest an anti-leukaemic effect, and both measures of response (complete response and overall response) also showed a pooled estimate in favour of FLT3; however, both also report substantial heterogeneity. Unfortunately, all of these results may have been affected by reporting bias, as not all of the trials reported all of these outcome measures.
Other biases may also have affected the trial results and interacted with the clinical heterogeneity within and across trials. For example, in the older patients in the Serve trial, there was a significant difference in patients stopping therapy in the sorafenib arm compared to the control arm due to toxicity or refusal (p < 0.001) , with the consequence that patients randomised to sorafenib did not receive their full planned courses, and therefore may have not received a full therapeutic dose thus reducing estimates of effectiveness. FLT3 mutation status may also have affected outcomes.
In our review, FLT3 mutation was not assessed in relation to OS due to lack of data; however, a recent meta-analysis which included non-randomised data has concluded that patients with FLT3-ITD-positive mutations are more sensitive to FLT3 inhibitor treatment, thereby achieving a better CR (OR = 1.89, 95% CI 1.06 to 3.37, p= 0.03) and ORR (OR = 3.07, 95% CI 2.13 to 4.43, p < 0.001) .
The mechanism of action of the agents used may also have affected the findings of this review. Six of the completed trials involved first-generation FLT3 inhibitors—sorafenib, lestaurtinib and midostaurin—and two involved second-generation FLT3 inhibitors—gilteritinib and quizartinib. First-generation inhibitors lack specificity to FLT3 and are therefore not as potent as second-generation FLT3 inhibitors which have been designed to only target FLT3. However, first-generation FLT3 inhibitors can target downstream of FLT3 and may also be effective in parallel signalling pathways and in other targets in AML cells . This may enhance their anti-leukaemic efficacy, particularly in patients who do not have FLT3 mutations, as was the case for a proportion of patients in the older trials. FLT3 inhibitors are also classed as type I and type II. Lestaurtinib, midostaurin and gilteritinib are type I inhibitors whereas sorafenib and quizartinib are type II inhibitors. Type I and II inhibitors differ in the way they interact with the ATP binding site . The consequence being that type I inhibitors can bind to both ITD and TKD mutations, whereas type II inhibitors can only inhibit ITD mutations, with resistance to type II inhibitors occurring when TKD mutations develop. There are nine ongoing trials investigating second-generation FLT3 inhibitors, with eight utilising type I inhibitors (crenolanib—3 trials, gilteritinib—5 trials) and one investigating a type II inhibitor (quizartinib—1 trial), which may eventually show a difference in effect size compared to first-generation inhibitors.
First-generation FLT3 inhibitors have been associated with toxic effects due to their off-target activity. There were broadly similar levels of AEs in both the intervention and control groups across most of the AEs reported for the first-generation inhibitors. As most of the trial participants had previously or were concurrently receiving chemotherapy, it is likely that many of the AEs reported were as a consequence to this.
For second-generation inhibitors, concerns have been raised about cardiac events, particularly an increase in the QT interval, which can lead to cardiac arrhythmia and sudden death. The QuANTUM-R study reported ten patients in the quizartinib arm who had had a grade 3 prolonged QT interval episode which was reported within the adverse events table, whereas episodes of the prolonged QT interval within the ADMIRAL trial were reported within the text, with 12 patients showing a prolonged QT interval, of which one was a QT greater than 501 ms and 6 were greater than 60 ms, which equates to grades 3 and 4 . This example demonstrates how challenging it is to extract data on AEs within these publications, with all the trials reporting either different AEs, or the same events in different formats making an accurate and valid comparison very difficult. Of the second-generation inhibitors, only the ADMIRAL trial (gilteritinib) reported 30- and 60-day mortality, with a substantial benefit in the gilteritinib arm, although patient numbers were relatively low. QuANTUM-R did not report 30- and 60-day mortality but reported ‘treatment-emergent deaths’, of which 80 (33%) were attributable to quizartinib and 16 (17%) to the control treatment. This equates to a RR of 1.93 (95% CI 1.17 to 3.17, p = 0.009), which is a substantial difference; however, they did not define ‘treatment-emergent’; therefore, this was not inputted into the pooled early death/mortality analysis.
Only the ADMIRAL trial reported quality of life data, and this was as a conference abstract; therefore, it is difficult to establish the wider implications of the treatment, particularly in this group of life-limited patients.
Strengths and limitations
Following a systematic methodology and comprehensive search strategy, eight completed RCTs were included in the review for full analysis. Trial publications, conference abstracts and protocols were utilised to augment the comprehensiveness of the review; despite this the review was limited by poor reporting and missing data. Where survival data was not explicitly reported, it was extracted from graphs and/or calculated, which is more imprecise than reported HRs, CIs and p values and may introduce error. Additionally, subgroup analyses were also restricted, with some subgroup estimates based solely on the results of a single trial, or not being possible.
Moderate to substantial heterogeneity was found within the pooled analysis. As a fixed-effects model was used, there could be room for criticism; however, the fixed-effect meta-analysis was planned a priori to the findings of the meta-analysis. Whilst there remains debate as to which model is best, the fixed-effect model has its advantages. Firstly, it does not give as much weight to the smaller trials as the random-effects model does, which is beneficial as there is evidence that small trials have a tendency to overestimate treatment effects; using the fixed-effect model can help to mitigate this overestimation. Secondly, although the random-effects model assumes that trials are not linked, prior trials will influence new trial designs. In fixed-effects, the model does not actually assume fixed-effect, but is statistically assumption free; therefore, the ‘assumption-free model’ might be a better term.
Implications for practice
Midostaurin was approved by the FDA in April 2017  and by the European Medicines Agency (EMA) in September 2017  for patients with newly diagnosed FLT3-mutated AML on the basis of the Stone trial . The marketing authorisation states that patients can be treated with midostaurin as part of induction therapy beginning on day 8 ; therefore, this has implications for practice as patients will need early and rapid testing for FLT3-mutation status. Lestaurtinib is no longer in clinical development  and sorafenib remains unlicensed for patients with AML. In November 2018, the FDA approved gilteritinib on the basis of interim data (response data and blood transfusion requirements) from the ADMIRAL trial (NCT02421939)  in relapsed and refractory patients, whereas quizartinib has been rejected by both the FDA and EMA for marketing authorisation for various reasons such as dropouts (23% of the control group did not receive chemotherapy), censoring and concerns about cardiac and infection AEs .
Implications for future research
There are questions remaining, particularly regarding when to give FLT3 inhibitors and what the effect of prior treatment with FLT3 inhibitor will be on subsequent FLT3 treatment. The completed trials administered FLT3 inhibitors during induction, consolidation and maintenance, and several ongoing trials are administering FLT3 inhibitors after SCT. This suggests that currently there is no evidence regarding when best to use these novel agents in the treatment pathway. In the relapsed and refractory populations, the majority of patients were FLT3 naïve; however, with midostaurin now approved for the treatment of newly diagnosed patients, this will change in future trial cohorts.
We identified 19 ongoing trials, aiming to recruit approximately 8500 patients. The success of these trials may be limited by the number of patients they are able to recruit, given the increasing number of new treatments requiring testing in AML. It may be that novel trial designs such as multi-arm, multi-stage platform trials need to be employed (see NCT01371981 and NCT03164057) in order to maximise resources.
There is evidence to support the use of FLT3 inhibitors in patients with AML in both the newly diagnosed and relapsed and refractory setting, although there remain questions regarding the optimum use of the drugs and better understanding of the toxicity profile and quality of life outcomes. Currently, this is an active area of research and this review is not a fait accompli; therefore, it is advised that this review is updated once ongoing research has been completed; however, future trials should endeavour to improve reporting to make trial results more transparent and accessible, in order that clinicians can differentiate between treatments to maximise patient benefit.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Acute myeloid leukaemia
American Society for Hematology
Complete remission with incomplete recovery
Complete remission with incomplete platelet recovery
Common Toxicity Criteria (National Cancer Institute)
European Medicines Agency
U.S. Food and Drug Administration
FMS-like tyrosine kinase 3
Internal tandem duplication
Overall response rate
Preferred Reporting Items for Systematic Reviews and Meta-Analyses
Randomised controlled trial
Stem cell transplant
Tyrosine kinase domain
De Kouchkovsky I, Abdul-Hay M. Acute myeloid leukemia: a comprehensive review and 2016 update. Blood Cancer Journal. 2016;6:e441.
Cancer Research UK. Acute myeloid leukaemia (AML) statistics; 2015.
Burnett A. Treatment of acute myeloid leukemia: are we making progress? Hematol Am Soc Hematol Educ Program. 2012;2012:1–6.
Levis M. FLT3 mutations in acute myeloid leukemia: what is the best approach in 2013? Hematol Am Soc Hematol Educ Program. 2013;2013:220–6.
Wander SA, Levis MJ, Fathi AT. The evolving role of FLT3 inhibitors in acute myeloid leukemia: quizartinib and beyond. Ther Adv Hematol. 2014;5:65–77.
Kottaridis PD, Gale RE, Langabeer SE, et al. Studies of FLT3 mutations in paired presentation and relapse samples from patients with acute myeloid leukemia: implications for the role of FLT3 mutations in leukemogenesis, minimal residual disease detection, and possible therapy with FLT3 inhibitors. Blood. 2002;100:2393–8.
US Food & Drug Administration 2017: Midostaurin. Available from: https://www.fda.gov/Drugs/InformationOnDrugs/ApprovedDrugs/ucm555756.htm. Accessed 18 Dec 2018.
US Food & Drug Administration 2018: FDA approves gilteritinib for relapsed or refractory acute myeloid leukemia (AML) with a FLT3 mutation. Available from: https://www.fda.gov/Drugs/InformationOnDrugs/ApprovedDrugs/ucm627045.htm. Accessed 18 Dec 2018.
Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med. 2009;6(7):e1000097. https://doi.org/10.1371/journal.pmed1000097.
Saimma Majothi, David Adams, Jayne Wilson, Laura Buckley, Keith Wheatley. The clinical effectiveness of FLT3 inhibitors for the treatment of acute myeloid leukaemia: a systematic review and meta-analysis. PROSPERO 2017 CRD42017055581 Available from: https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42017055581.
Higgins JPT, Altman DG, Sterne JAC. Chapter 8: assessing risk of bias in included studies. In: Higgins JPT, Green S, editors. Cochrane Handbook for Systematic Reviews of Interventions. Chicester: John Wiley & Sons Ltd; 2008. p. 187–241.
Tierney JF, Stewart LA, Ghersi D, et al. Practical methods for incorporating summary time-to-event data into meta-analysis. Trials. 2007;8:1–16.
Parmar MK, Torri V, Stewart L. Extracting summary statistics to perform meta-analyses of the published literature for survival endpoints. Stat Med. 1998;17:2815–34.
Review Manager (RevMan) (2014) Version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration.
Röllig C, Serve H, Hüttmann A, et al. Addition of sorafenib versus placebo to standard therapy in patients aged 60 years or younger with newly diagnosed acute myeloid leukaemia (SORAML): a multicentre, phase 2, randomised controlled trial. Lancet Oncol. 2015;16(16):1691–9.
Serve H, Krug U, Wagner R, et al. Sorafenib in combination with intensive chemotherapy in elderly patients with acute myeloid leukemia: results from a randomized, placebo-controlled trial. J Clin Oncol. 2013;31(25):3110–8.
Knapper S, Russell N, Gilkes A, et al. A randomized assessment of adding the kinase inhibitor lestaurtinib to first-line chemotherapy for FLT3-mutated AML. Blood. 2017;129:1143–54.
Levis M, Ravandi F, Wang ES, et al. Results from a randomized trial of salvage chemotherapy followed by lestaurtinib for patients with FLT3 mutant AML in first relapse. Blood. 2011;117:3294–301.
Stone RM, Mandrekar SJ, Sanford BL, et al. Midostaurin plus chemotherapy for acute myeloid leukemia with a FLT3 mutation. N Engl J Med. 2017;377:454-64.
Cortes JE, Khaled S, Martinelli G, et al. Quizartinib versus salvage chemotherapy in relapsed or refractory FLT3-ITD acute myeloid leukaemia (QuANTUM-R): a multicentre, randomised, controlled, open-label, phase 3 trial. Lancet Oncol. 2019;20:984–97.
Perl AE, Martinelli G, Cortes JE, et al. Gilteritinib or chemotherapy for relapsed or refractory FLT3-mutated AML. N Engl J Med. 2019;381:1728–40.
Burchert A, Bug G, Finke J, et al. 661 Sorafenib as maintenance therapy post allogeneic stem cell transplantation for FLT3-ITD positive AML: results from the randomized, double-blind, placebo-controlled multicentre sormain trial. Blood. 2018;132(Supplement 1):661.
Maziarz RT, Patnaik MM, Scott BL, et al. 662 Radius: a phase 2 randomized trial investigating standard of care ± midostaurin after allogeneic stem cell transplant in FLT3-ITD–mutated AML. Blood. 2018;132(Supplement 1):662.
Burnett AK, Russell NH, Hills RK, et al. Optimization of chemotherapy for younger patients with acute myeloid leukemia: results of the medical research council AML15 trial. J Clin Oncol. 2013;31:3360–8.
Burnett AK, Russell NH, Hills RK, et al. A randomized comparison of daunorubicin 90 mg/m2 vs 60 mg/m2 in AML induction: results from the UK NCRI AML17 trial in 1206 patients. Blood. 2015;125:3878–85.
Yang M, Zhao T, Lui T, et al. Use of FLT3 inhibitors in acute myeloid leukemia remission induction or salvage therapy: systematic review and meta-analysis. Cancer Manage Res. 2018;10:2635–52.
Larrosa-Garcia M, Baer MR. FLT3 inhibitors in acute myeloid leukemia: current status and future directions. Mol Cancer Ther. 2017;16:991–1001.
Setteyova L, Bacharova L, Mladosievicova B. QT prolongation due to targeted anticancer therapy. OnCOReview. 2016;6:A103–12.
Midostaurin for the treatment of acute myeloid leukaemia. EU/3/04/214, European Medicines Agency. Available from: https://www.ema.europa.eu/en/medicines/human/orphan-designations/eu304214. Accessed 18 Dec 2018.
National Institute for Health and Care Excellence (2018) Midostaurin for untreated acute myeloid leukaemia (Technology appraisal guidance TA523). Available from: www.nice.org.uk/guidance/ta523. Accessed 30 Jan 2019.
This work was supported by Cancer Research UK, which provides core funding to the Cancer Research UK Clinical Trials Unit, University of Birmingham.
This work was supported by grants from Cancer Research UK (JSW, SM and SS).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
MEDLINE (Ovid) search strategy for FLT3 inhibitors in patients with AML. Table S1. Ongoing trials meeting review inclusion criteria (ordered alphabetically by FLT3 inhibitor [column name: FLT3 inhibitor group details]. Table S2. Conference abstracts of trials meeting review inclusion criteria (ordered alphabetically by intervention)*. Table S3. Characteristics of included studies. Table S4. Treatment schedules for included trials. Table S5. Outcome definitions according to trials included in review. Table S6. Data sources and methods of calculation for all included outcomes. Table S7. Baseline and outcome data for FLT3-mutated patients from Rollig C, 2015. Table S8. Baseline and outcome data for FLT3-mutated patients from Serve H, 2013. Table S9. Baseline data for FLT3-mutated patients from Knapper S, 2017. Table S10. Outcome data for FLT3-mutated patients from Knapper S, 2017. Table S11. Baseline and outcome data for FLT3-mutated patients from Levis M, 2011. Table S12. Baseline and outcome data for FLT3-mutated patients from Stone RM, 2017. Table S13. Baseline and outcome data for FLT3-mutated patients from Perl AE, 2019. Table S14. Baseline data from FLT3-ITD-mutated patients from Cortes JE, 2019. Table S15. Outcome data for FLT3-mutated AML patients from Cortes JE, 2019. Fig. S1. Funnel plot of all eight included trials for overall survival. Fig. S2. Forest plot of event-free survival data (uncensored population), grouped by FLT3 inhibitor. Fig. S3. Forest plot of relapse-free survival data (uncensored population), grouped by FLT3 inhibitor. Fig. S4 Forest plot of complete remission data, grouped by FLT3 inhibitor. Fig. S5. Forest plot of overall response rate (CR + CRi/CRp) data, grouped by FLT3 inhibitor. Fig. S6. Forest plot of overall survival data, grouped by disease stage Fig. S7. Forest plot of event-free survival (calculated and reported data), grouped by disease stage. Fig. S8. Forest plot of overall survival, grouped by age categories. Fig. S9. Forest plot of event-free survival (calculated and reported data), grouped by age categories. Fig. S10. Forest plot of overall survival data (censored population), grouped by FLT3 inhibitor. Fig. S11. Forest plot of early death/mortality data. Fig. S12. Forest plot of vascular adverse events data. Fig. S13. Forest plot of dermatological adverse events data. Fig. S14. Forest plot of respiratory adverse events data. Fig. S15. Forest plot of liver/hepatobiliary adverse events data. Fig. S16. Forest plot of haematological adverse events data. Fig. S17. Forest plot of metabolism and nutritional disorders data. Fig. S18. Forest plot of gastrointestinal disorders data. Fig. S19. Forest plot of constitutional symptoms data. Fig. S20. Risk of bias graph of included trials. Fig. S21. Risk of bias summary of included trials.
About this article
Cite this article
Majothi, S., Adams, D., Loke, J. et al. FLT3 inhibitors in acute myeloid leukaemia: assessment of clinical effectiveness, adverse events and future research—a systematic review and meta-analysis. Syst Rev 9, 285 (2020). https://doi.org/10.1186/s13643-020-01540-1
- FLT3 inhibitors
- Systematic review
- Adverse events