Skip to main content

Measurement properties of the EQ-5D in children and adolescents: a systematic review protocol

Abstract

Background

Although the EQ-5D instruments have been initially designed for adult populations, there are new studies evaluating and applying these instruments to children and adolescents. The EuroQol Group adapted and created two versions designed for these groups, i.e., the EQ-5D-Y versions. The measurement properties of the EQ-5D have been systematically reviewed in different health conditions. However, there is a lack of a proper systematic assessment including the studies’ risk of bias and focusing on recent studies assessing the EQ-5D instruments in children and adolescents. The lack of a systematic assessment of the EQ-5D versions does not allow us to have a comprehensive evaluation of the validity, reliability, and responsiveness of these instruments among children and adolescents. This systematic review aims to critically appraise and summarize the evidence on the measurement properties of the EQ-5D instruments (self-reported version – answered by children and adolescents; and proxy versions – versions reported by parents, caregivers, or health professionals) in children and adolescents.

Methods

A systematic review searching the following electronic databases: MEDLINE, EMBASE, CINAHL, EconLit, National Health Service Economic Evaluation Database (NHS-EED), Health Technology Assessment (HTA) database. Two independent reviewers will screen titles and abstracts and select full texts for eligibility. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology will be followed to conduct three main assessment steps: risk of bias, quality criteria for measurement properties, and evidence synthesis.

Discussion

This systematic review will provide comprehensive information about the evidence regarding the measurement properties of EQ-5D instruments in children and adolescents of different settings and countries.

Systematic review registration

Open Science Framework with Registration https://osf.io/r8kt9/ and PROSPERO: CRD42020218382.

Peer Review reports

Background

The development of health-related quality of life (HRQoL) measurements in children and adolescents have increased in the last twenty years [1]. In general, HRQoL measurements are multidimensional and include physical, psychological, and social functioning domains [1, 2]. Most of the instruments aiming to assess this domain are self-reported and, consequently, refer to self-perceived HRQoL [2]. HRQoL instruments usually target two different methods of measurement: 1) an indirect measurement assessing the quality of life as a health status (i.e., measured by specific or generic questionnaires); or 2) a value of health status to generate both health profiles and index values (i.e., using the utility score measured by the EQ-5D for example) [2].

The EQ-5D instruments are well-known, widely used in the literature, and have been translated into more than 170 languages [3]. The EQ-5D responses can be interpreted in three different manners: 1) descriptively by using the health profiles of the individual items (e.g., 11,111 for the descriptive system); 2) country-specific index values estimated from preference weights of the different health states (this index usually ranges from 0 – representing “dead” – to 1 – representing “full health”; 1 in this case also represents the best level of severity or no severity in the five dimensions [11111] of the EQ-5D). This country-specific index is known as the value set of the EQ-5D for each country and it should be developed from a valuation study (e.g., composite time trade-off and discrete choice experiment) according to the recommendations from the EuroQol Group. The data should be collected from the general population in each country/region; and 3) self-rated health status measures (e.g., using the EQ-5D visual analogue scale [EQ-VAS]) [4]. A recent review of national health technology assessment guidelines showed that the EQ-5D was the most commonly used instrument in cost-utility analysis within economic evaluations [5].

The original 3-level version of EQ-5D (EQ-5D-3L) was developed in the 90's and, after its growth in use, the EuroQol Group developed in 2009 the five-level version (EQ-5D-5L) in order to improve its sensitivity and reduce ceiling effects [3, 6, 7]. After decades from the EQ-5D initial publication, the EuroQol Group adapted and created two versions specifically designed for children and adolescents, known as EQ-5D-Y-3L and EQ-5D-Y-5L [3, 8, 9]. For both instruments, there is the self-reported version (answered by children and adolescents) and a proxy version (generally reported by parents, caregivers or health professionals) [10]. The measurement properties of these instruments have been tested in children and adolescents with different health conditions (such as, type 1 diabetes mellitus, asthma, general population, psychiatric disorders, and idiopathic scoliosis) [11,12,13,14,15,16]. One systematic literature review evaluated the measurement properties of the EQ-5D instruments (EQ-5D adult version, EQ-5D-Y-3L, Dutch EQ-5D child version and, an extended questionnaire with cognitive dimension EQ-5D + C) in children and adolescents, but without assessing the risk of bias of the included studies [17]. This review included studies between 1999 and 2010, and a large amount of new studies of the EQ-5D-Y-3L [12, 17,18,19,20,21,22,23,24] have been published in the last ten years, including studies focusing on the more recent version, the EQ-5D-Y-5L [8, 14,15,16, 25]. Moreover, the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) initiative developed a recent methodological guideline for systematic reviews of patient-reported outcome measures (PROMs) like the EQ-5D, including a tool to assess the risk of bias of studies on measurement properties and to evaluate the quality of development of patient-reported outcome measures [26,27,28]. The lack of a systematic review with recent data assessing the risk of bias of the studies and the lack of a systematic assessment of the newly EQ-5D-Y versions does not allow to have a comprehensive picture on the validity, reliability and responsiveness of these instruments among children and adolescents.

Objective

The aim of this systematic review is to critically appraise and summarize the evidence on the measurement properties of the EQ-5D instruments (self-reported version – answered by children and adolescents; and, proxy versions – versions reported by parents, caregivers, or health professionals) in children and adolescents.

Methods

This protocol was registered on the Open Science Framework (OSF) https://osf.io/r8kt9/. This protocol also was registered on International Prospective Register of Systematic Reviews (PROSPERO) with registration number CRD42020218382. This review has been prepared following the Preferred Reporting Items for Systematic Reviews and Meta-Analysis Protocol (PRISMA-P) for the reporting, and the COSMIN guidance in the conduction [27, 29]. The details of PRISMA-P Checklist are presented in an Additional file 1.

Measurement instruments of interest

The instruments of interest are the EQ-5D instruments developed by the EuroQol Group, including adult and children versions: EQ-5D-3L, EQ-5D-5L, EQ-5D-Y-3L, and EQ-5D-Y-5L [6,7,8,9]. We decided to include the adult versions of the EQ-5D (EQ-5D-3L and EQ-5D-5L) because both were the first instruments of the EuroQol Group to be created and already had their measurement properties tested in children and adolescents [3, 18, 19, 30,31,32]. The evaluation of the EQ-5D instruments in all languages and with various modes of administration is of interest [3]

The EQ-5D-3L was developed for the adult population and it measures HRQoL in five dimensions: mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Each dimension has three levels of severity: ‘no problems’, ‘some problems’, and ‘extreme problems’ [4]. The EQ-5D-5L was also developed for the adult population and also measures HRQoL in the same five dimensions, however, each dimension has five levels of severity: ‘no problems’, ‘slight problems’, ‘moderate problems’, ‘severe problems’, and ‘extreme problems’ [6]. Both instruments can provide a health state of a descriptive system (e.g., 11,111 – meaning no problems in all dimensions), which can be converted to an index value (ranging from 0 – representing “dead” – to 1 – representing “full health”) of the general population of a country. Both instruments also include a visual analogue scale (EQ-VAS) ranging from 0 to 100, in which, zero represents the worse health-perceived status and 100 represents the best health-perceived status.

The EQ-5D-Y-3L and EQ-5D-Y-5L were created based on the EQ-5D-3L and EQ-5D-5L, respectively [8, 9]. Both instruments have a self-reported version (answered by children and adolescents) and proxy versions (reported by parents, caregivers, or health professionals when children are unable to report their HRQoL) and have been developed specifically for children and adolescents aged 8–15 years. Both self-reported versions (EQ-5D-Y-3L and EQ-5D-Y-5L) had language modifications to warrant better understanding by children and adolescents, but both have the same structure as the original versions of the instrument.

Eligibility criteria

We will include studies that measured HRQoL through one or more EQ-5D instruments in children and adolescents up to 19 years old (or at least 80% of the sample with mean age up to 19 years old). We will include studies that used either the self-reported version (answered by children and adolescents) or proxy versions (reported by parents, caregivers, or health professionals when children are unable to report their HRQoL) of the instruments, and that tested at least one measurement property in the study (e.g., internal consistency, reliability, responsiveness). We will include any types of studies of measurement properties/clinimetric (e.g., cross-sectional and cohort with test–retest). There will be no restrictions on the type of setting or health conditions of the population.

We will include studies from 1990 because it was in this period that the EuroQol Group published the first version of the EQ-5D [7]. There will be no restrictions on language, and we will only include full-text articles published in peer-review journals (we will not include any grey literature).

Literature search

We will perform the search strategy on the following databases: MEDLINE, EMBASE, CINAHL, EconLit, National Health Service Economic Evaluation Database (NHS-EED), and Health Technology Assessment (HTA) databases. To ensure a comprehensive search, we will conduct manual searches on EuroQol Research Foundation Website for potentially eligible studies [33]. We will also screen the reference lists of the included studies for other potentially eligible studies. In addition, we will search for studies meeting our inclusion criteria among the studies included in a previous systematic review [17].

Search strategy

The search strategy of this systematic review will be created based on three key elements, according to the COSMIN recommendations: 1) ‘children and adolescents’; 2) ‘EQ-5D’; and 3) a sensitive filter to identify studies about measurement properties of EQ-5D instruments [27]. The filter to identify studies of measurement properties is highly sensitive and it was previously validated [34]. The details of the search strategy on MEDLINE are presented in an Additional file 2.

Study selection

The results from the searches will be uploaded and managed in the EndNote Software version X9 for the removal of duplicates [35]. Two review authors (CMES and VSS) will independently screen all titles and abstracts, and also the full texts for potentially eligible studies. Any disagreement will be discussed and, if consensus cannot be reached, a third review author (TPY or GCM) will make a decision. Screening and data extraction will be performed using DistillerSR (Evidence Partners, Ottawa, Canada).

Data extraction, data items and outcomes

Two authors (CMES and VSS) will independently extract the data for each eligible study. Disagreements between the reviewers will be discussed and a third reviewer (TPY or GCM) will be contacted when a consensus cannot be reached. If necessary, we will contact the study’s authors to ask for additional information not reported in the article. We will make three attempts through email, over a period of one month. Two authors (TPY and GCM) with experience in conducting systematic reviews developed a data extraction form, and this form was based on those used to conduct COSMIN systematic reviews [27]. The main outcome of interest is any measurement properties of the EQ-5D instruments (self-reported or proxy versions) used to measure HRQoL in children and adolescents. We will extract the following data:

  • EQ-5D version (e.g., EQ-5D-3L);

  • Scores used (e.g., VAS, utility score);

  • Reference, year;

  • Language, country;

  • Population (sample size, female [%], age, description of health condition);

  • Context of use;

  • Mode of administration (e.g., self-report, parent/proxy report);

  • Number of missing data and response rates;

  • Measurement properties of interest (e.g., internal consistency, reliability, responsiveness) according to the COSMIN taxonomy;

  • Results of measurement properties;

  • Feasibility and interpretability data;

  • Recall period.

Evaluation of measurement properties and specific assessment of content validity

Measurement properties will be defined in line with the COSMIN taxonomy for PROMs [36]. Their evaluation will be performed through three components: 1) the risk of bias of the individual studies (methodological quality); 2) the application of quality criteria for (in)sufficient measurement properties; and 3) the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) assessment to rate the quality of evidence for each measurement property of each instrument (evidence synthesis) [26,27,28]. All of these three components will be assessed by two authors (CMES and VSS). The disagreements between the reviewers will be discussed and a third reviewer (TPY or GCM) will be contacted when a consensus cannot be reached.

Risk of bias

The risk of bias assessment of the included articles will be performed through the COSMIN Risk of Bias checklist for PROMs [28]. The checklist has standards referring to design requirements and preferred statistical methods to evaluate the methodological quality of studies on measurement properties [27]. According to this checklist, the first step of the assessment is to identify each measurement property assessed in the study (i.e., one study can describe one or more different measurement properties) according to the COSMIN taxonomy of measurement properties [36].

For each measurement property, there is a COSMIN box with all standards needed to assess the quality of a study on that specific measurement property. The COSMIN Risk of Bias Checklist encompasses ten boxes with standards for PROM development (box 1) and for nine measurement properties: content validity (box 2); structural validity (box 3); internal consistency (box 4); cross‐cultural validity\measurement invariance (box 5); reliability (box 6); measurement error (box 7); criterion validity (box 8); hypotheses testing for construct validity (box 9) and responsiveness (box 10) [28].

Content validity is assessed through the use of boxes 1 and 2, as the original studies for developing the instruments are necessary to assess this measurement property [26]. The content validity will be scored separately (steps 1 and 2 of the COSMIN methodology for assessing the content validity of PROMs) in order to: 1) assess the quality of the PROM development, using COSMIN box 1 (considering the concept elicitation and the cognitive interview); and 2) assess the quality of content validity studies on the PROM (if available), using COSMIN box 2 (according to the relevance, comprehensiveness, and comprehensibility from professionals and patients) [26].

The internal structure is evaluated by the content in boxes 3 to 5, and boxes 6 to 10 address the remaining measurement properties. As in one article, one or more different measurement properties can be studied, the COSMIN Risk of Bias checklist is a modular tool. This means that each measurement property studied should be assessed separately according to each corresponding box. It will not be necessary to complete the whole checklist when evaluating the methodological quality of an included study, instead, the measurement property evaluated in each article will determine which boxes need to be completed. For example, if an article evaluates internal consistency, construct validity, and reliability, then three boxes needed to be completed (i.e., box 4 to internal consistency, box 9 to construct validity, and box 6 to reliability).

Each standard within a COSMIN box will be rated in a four-point rating system as ‘very good’, ‘adequate’, ‘doubtful’, or ‘inadequate’ methodological quality [28]. To determine an overall score for each study the lowest rating of any standard in the correspondent boxes will need to be taken into consideration [37]. For example, if there is one standard rated as ‘adequate’ and another rated as ‘inadequate’, then the measurement property represented by this box will have an overall score of ‘inadequate’ methodological quality. This overall score of the risk of bias will be then used to grade the quality of the evidence (for each measurement property) [38]. In line with the COSMIN user manual, since the development studies of the EQ-5D-3L and the EQ-5D-5L were already scored in a previous systematic review, those ratings will be adopted in this review [39, 40].

Conventional psychometric tests of validity in the generic preference-based measures for use in economic evaluation can be considered inappropriate [41]. Thus, we will complement our methodological quality assessment using a checklist to describe the practically, reliability and validity for judging the appropriateness of the EQ-5D of the included studies, which incorporates economists’ notion of preferences. This checklist is composed of: practicality (three items); reliability (four items); validity subdivided into three parts: description (three items); valuation (four items); and empirical (one item) [41]. If one study tested reliability, then we will complete the four items of reliability on this checklist. This checklist can be found elsewhere [41].

Quality criteria for measurement properties

We will also assess the quality of all measurement properties of each included study using quality criteria [42, 43]. For each measurement property studied (in each included study), a criterion is defined as: sufficient ( +), insufficient (-), or indeterminate (?) [27].

Evidence synthesis

The quality of evidence will be measured through a modified version of the GRADE [27]. In line with the GRADE approach for systematic reviews of intervention studies, the quality of the evidence will be graded as high, moderate, low, or very low evidence. The results of each EQ-5D instrument (EQ-5D-3L, EQ-5D-5L, EQ-5D-Y-3L, and EQ-5D-Y-5L) will be assessed individually [44].

A rating system was developed specifically to summarize the quality of the evidence for content validity, step 3 of the COSMIN methodology for assessing the content validity of PROMs [26]. It consists on rating the studies' results of the PROM development and content validity following ten criteria. Then we will summarize all available evidence and grade the quality of the evidence. This rating system was divided in three steps:

  1. a)

    Rating the result of the single studies on PROM development and content validity (if available) against the 10 criteria for good content validity (5 on relevance, 1 on comprehensiveness, 4 on comprehensibility). The ratings will range as sufficient ( +), insufficient (-) or indeterminate (?);

  2. b)

    The results of the available studies are qualitatively summarized to determine whether overall, the relevance, comprehensiveness, comprehensibility, and overall content validity of the PROM is sufficient ( +), insufficient (–), or inconsistent ( ±). We will determine if the overall content validity of the PROM is sufficient or insufficient. The overall rating will be sufficient or insufficient if all the ratings per study were sufficient or insufficient, respectively. If the ratings present inconsistency between studies, we will explain the reasons for inconsistency and summarize the overall rating of the studies per subgroup. If no explanation is found, the overall rating will be determined as inconsistent ( ±);

  3. c)

    The overall ratings (relevance, comprehensiveness, comprehensibility, and content validity) will be accompanied by grading for the quality of the evidence. The evidence can be high, moderate, low, or very low. We will use the GRADE depending on the presence of three domains: risk of bias, inconsistency, and indirectness. We will start assuming that there is high-quality evidence for content validity. The levels of quality of evidence will be downgraded with one or more levels (to moderate, low, or very low) if there is a (serious or very serious) risk of bias, unexplained inconsistency in results, and/or indirect findings. We will follow a minimized version of the flow chart as additional guidance that can be consulted elsewhere [26].

Regarding the other measurement properties, the results of all available studies on a measurement property can be quantitatively pooled or qualitatively summarized. We will use the criteria for good measurement properties to determine whether overall the measurement properties of the EQ-5D instruments are sufficient ( +), insufficient (-), inconsistent ( ±) or indeterminate (?) [27]. We will then determine if the included studies are consistent or inconsistent [38]. Studies will be considered consistent if they showed at least 75% of the results of good measurement properties in accordance with each measurement property, i.e., measurement error insufficient (-) for at least 75% of cases. If the studies are consistent (at least 75% of the results sufficient or insufficient), the results of each measurement property will be pooled in a random effect meta-analysis. The pooled estimates will be assessed by calculating weighted means and 95% confidence intervals [27]. If the studies are inconsistent, we will try to explain the reasons for inconsistency and summarize per subgroup. In this case, we will qualitatively summarize the results. The results will be shown by range (lowest and highest) of values or percentage of confirmed hypotheses (when construct validity) [27]. If no explanation is found, the overall rating will be classified as inconsistent ( ±). If not enough information is available, the overall rating will be classified as indeterminate (?) [27].

Previously, we formulated a hypothesis to construct validity and responsiveness [27]. Related construct validity: 1) if the instruments measure a similar construct the correlation should be ≥ 0.50; 2) if measure a related, but not similar construct the correlation should be ≥ 0.30 and < 0.50; and 3) if instruments measure an unrelated construct the correlation should be < 0.30 (instruments that do not measure HRQoL in children and adolescents). Related to the responsiveness, we hypothesized an area under the curve (AUC) ≥ 0.70 (e.g., global perceived effect scales) [27].

For these other measurement properties, the downgrading in GRADE will be applied depending on the presence of four factors (instead of five): 1) risk of bias of the studies will be downgraded one level if there is serious risk of bias (i.e., multiple studies of doubtful quality available, or if there is only one study with adequate quality), two levels if there is very serious risk of bias (multiple studies of inadequate quality, or there is only one study with doubtful quality available, or three levels if there is extremely risk of bias – only one study with inadequate quality available); 2) inconsistency, i.e., unexplained inconsistency of results across studies (downgraded if the most studies was inconsistent according good measurement properties, we will decided if is necessary downgraded one or two levels according inconsistent found); 3) imprecision, i.e., total sample size of the available studies (downgraded one level if the total sample is below 100 and two level if the total sample is below 50); and 4) indirectness, i.e., evidence from different populations than the population of interest in the review (downgraded if the sample of studies will be very different, we will decided if is necessary downgraded one or two levels according heterogeneity found) [27].

Data synthesis

We will provide an overall quality criteria for good measurement properties (i.e., sufficient ( +), insufficient (-), inconsistent ( ±) or indeterminate (?)) as well as the respectively certainty of evidence (i.e., high, moderate, low and very low). The results of evidence synthesis will be categorized for each version of the EQ-5D instruments (EQ-5D-3L, EQ-5D-5L, EQ-5D-Y-3L, EQ-5D-Y-5L) in order to provide the best version of the instrument for children and adolescents. In addition, we will categorize each version of the instruments according to the mode of administration (i.e., self-report or proxy-report).

Discussion

Purpose

The aim of this systematic review is to critically appraise and summarize the evidence on measurement properties of the EQ-5D instruments (self- and proxy-reported version) in children and adolescents. Although similar systematic reviews have been published [17, 45, 46], there is no clear information on the literature regarding the risk of bias of the studies on measurement properties of the EQ-5D instruments used for children and adolescents. In addition, there is no detailed and robust assessment of the content validity for newly instruments (especially the EQ-5D-Y-3L and the EQ-5D-Y-5L), no information regarding other measurement properties (i.e., reliability, validity and responsiveness), and no information on the quality criteria and certainty of evidence following the COSMIN methodology [27].

Strengths and limitations

This systematic review has strengths and limitations. A strength of this systematic review is that we will assess the risk of bias in the included studies using the COSMIN Risk of Bias Checklist [28]. One systematic literature review of the EQ-5D (EQ-5D adult version, EQ-5D-Y-3L, Dutch EQ-5D child version, and, an extended questionnaire with cognitive dimension EQ-5D + C) in children and adolescents conducted in 2011 did not assess the risk of bias [17]. Therefore, it is necessary to understand the methodological quality of these studies. Another strength of this systematic review is to perform an update of this scenario of the EQ-5D instruments, and the use of the EQ-5D instruments among children and adolescents due to several studies that were published over 10 years. Studies have tested measurement properties of adult versions EQ-5D-3L and EQ-5D-5L in children and adolescents, and specific versions such as EQ-5D-Y-3L and EQ-5D-Y-5L [11, 12, 14, 16, 18, 19, 23, 25, 47, 48]. The results of this study will summarize the evidence of the measurement properties of the EQ-5D and their different interpretations (descriptive system, EQ-VAS, utility score) among children and adolescents. This systematic review is strengthened also by the adherence to the recommendations from PRISMA-P for the reporting, and the methodology COSMIN to conduct a systematic review of measurement properties of the EQ-5D instruments [27, 29, 36]. Our aim is specifically to assess the measurement properties of the EuroQol instruments among children and adolescents, but we believe a limitation can be the inclusion of only EQ-5D instruments. There are other generic instruments that aim to measure HRQoL in the youth population, such as the Pediatric Quality of Life Inventory (PedsQL), KIDSCREEN for example, as well as, the generic preference-based measures for children, such as the Child Health Utility 9D (CHU9D) for example [49,50,51]. Another limitation of this study is that although the EQ-5D instruments have been considered as a PROM for HRQoL, the EQ-5D instruments have their features in line with generic preference-based measures [52, 53]. Despite the literature showing that COSMIN guidelines can be limited to assess generic preference-based measures, we choose to base our assessment of measurement properties of the EQ-5D instruments using this guideline because there is no gold standard to assess generic preference-based measures [27]. Thus, in order to complement our assessment, we decided to add the use of a few measurement properties covered by COSMIN, but according to a specific checklist to describe the generic preference-based measures of HRQoL instruments [41].

Implications for practice and research

The clinical relevance of the EQ-5D (especially EQ-5D-Y-3L and EQ-5D-Y-5L) is to assess HRQoL and health-perceived status in children and adolescents aged 8–15 years. Furthermore, the EQ-5D can facilitate the calculation of the quality-adjusted life of years (QALYs) through the country-specific index values for example, that are used in economic evaluations. The EQ-5D instruments are useful to assess the HRQoL outcome in different design studies (e.g., cross-sectional, cohort, randomized controlled trial), in economic evaluation studies (e.g., cost-effectiveness, cost-utility analysis), and also in clinical practice by health professionals [3]. Thus, this systematic review will provide updated evidence on the measurement properties of EQ-5D in children and adolescents with several health conditions conducted in different cultural contexts and countries. This will help the choice between EQ-5D versions based on their measurement properties results and overall quality of evidence. It will also provide the risk of bias in studies and overall quality evidence of EQ-5D instruments in children and adolescents that, to our knowledge, is not available at the moment. In addition, an updated systematic review will help to disseminate the importance of the EQ-5D instruments in children and adolescents and to signalize the gap to guide future measurement properties studies.

Availability of data and materials

Not applicable.

Abbreviations

HRQoL:

Health-related quality of life

EQ-VAS:

EQ-5D visual analogue scale

COSMIN:

COnsensus-based Standards for the selection of health Measurement Instruments

OSF:

Open Science Framework

PROSPERO:

International Prospective Register of Systematic Reviews

PRISMA-P:

Preferred Reporting Items for Systematic Reviews and Meta-Analysis Protocol

MEDLINE:

Medical Literature Analysis and Retrieval System Online

EMBASE:

Excerpta Medica database

CINAHL:

Cumulative Index of Nursing and Allied Health Literature

NHS-EED:

National Health Service Economic Evaluation Database

HTA:

Health Technology Assessment

GRADE:

Grading of Recommendations Assessment, Development, and Evaluation

PROMs:

Patient-reported outcome measures

AUC:

Area under the curve

PedsQL:

Pediatric Quality of Life Inventory

CHU9D:

Child Health Utility 9D

QALYs:

Quality-adjusted life of years

References

  1. Solans M, Pane S, Estrada MD, Serra-Sutton V, Berra S, Herdman M, et al. Health-related quality of life measurement in children and adolescents: a systematic review of generic and disease-specific instruments. Value Health. 2008;11:742–64. https://doi.org/10.1111/j.1524-4733.2007.00293.x.

    Article  PubMed  Google Scholar 

  2. Karimi M, Brazier J. Health, health-related quality of life, and quality of life: what is the difference? Pharmacoeconomics. 2016;34:645–9.

    Article  PubMed  Google Scholar 

  3. Devlin NJ, Brooks R. EQ-5D and the EuroQol group: past, present and future. Appl Health Econ Health Policy. 2017;15:127–37.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Brooks R. EuroQol: the current state of play. Health Policy (New York). 1996;37:53–72.

    Article  CAS  Google Scholar 

  5. Kennedy-Martin M, Slaap B, Herdman M, van Reenen M, Kennedy-Martin T, Greiner W, et al. Which multi-attribute utility instruments are recommended for use in cost-utility analysis? a review of national health technology assessment (HTA) guidelines. Eur J Health Econ. 2020;21:1245–57. https://doi.org/10.1007/s10198-020-01195-8.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Herdman M, Gudex C, Lloyd A, Janssen M, Kind P, Parkin D, et al. Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual Life Res. 2011;20:1727–36.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. EuroQol, Group. EuroQol - a new facility for the measurement of health-related quality of life. Health Policy (New York). 1990;16:199–208.

    Article  Google Scholar 

  8. Kreimeier S, Åström M, Burström K, Egmar A-C, Gusi N, Herdman M, et al. EQ-5D-Y-5L: developing a revised EQ-5D-Y with increased response categories. Qual Life Res. 2019;28:1951–61. https://doi.org/10.1007/s11136-019-02115-x.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Wille N, Badia X, Bonsel G, Burström K, Cavrini G, Devlin N, et al. Development of the EQ-5D-Y: A child-friendly version of the EQ-5D. Qual Life Res. 2010;19:875–86.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Kreimeier S, Greiner W. EQ-5D-Y as a Health-related quality of life instrument for children and adolescents: the instrument’s characteristics, development, current use, and challenges of developing its value set. Value Health. 2019;22:31–7.

    Article  PubMed  Google Scholar 

  11. Mayoral K, Rajmil L, Murillo M, Garin O, Pont A, Alonso J, et al. Measurement properties of the online EuroQol-5D-youth instrument in children and adolescents with type 1 diabetes mellitus: questionnaire study. J Med Internet Res. 2019;21:e14947.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Bergfors S, Åström M, Burström K, Egmar AC. Measuring health-related quality of life with the EQ-5D-Y instrument in children and adolescents with asthma. Acta Paediatr Int J Paediatr. 2015;104:167–73.

    Article  Google Scholar 

  13. Ravens-Sieberer U, Wille N, Badia X, Bonsel G, Burström K, Cavrini G, et al. Feasibility, reliability, and validity of the EQ-5D-Y: results from a multinational study. Qual Life Res. 2010;19:887–97.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Åström M, Åström M, Åström M, Krig S, Ryding S, Cleland N, et al. EQ-5D-Y-5L as a patient-reported outcome measure in psychiatric inpatient care for children and adolescents - a cross-sectional study. Health Qual Life Outcomes. 2020;18:1–14.

    Article  Google Scholar 

  15. Wong CKH, Cheung PWH, Luo N, Lin J, Cheung JPY. Responsiveness of EQ-5D Youth version 5-level (EQ-5D-5L-Y) and 3-level (EQ-5D-3L-Y) in Patients with Idiopathic Scoliosis. Spine (Phila Pa 1976). 2019;44:1507–14. Available from: https://journals.lww.com/spinejournal/Abstract/2019/11010/Responsiveness_of_EQ_5D_Youth_version_5_level.11.aspx

  16. Wong CKH, Cheung PWH, Luo N, Cheung JPY. A head-to-head comparison of five-level (EQ-5D-5L-Y) and three-level EQ-5D-Y questionnaires in paediatric patients. Eur J Health Econ. 2019;20:647–56.

    Article  PubMed  Google Scholar 

  17. Noyes J, Edwards RT. EQ-5D for the assessment of health-related quality of life and resource allocation in children: a systematic methodological review. Value Health. 2011;14:1117–29. https://doi.org/10.1016/j.jval.2011.07.011.

    Article  CAS  PubMed  Google Scholar 

  18. Cheung PWH, Wong CKH, Lau ST, Cheung JPY. Responsiveness of the EuroQoL 5-dimension (EQ-5D) in adolescent idiopathic scoliosis. Eur Spine J. 2018;27:278–85. Available from: https://link.springer.com/article/10.1007%2Fs00586-017-5330-1

  19. Cheung PWH, Wong CKH, Samartzis D, Luk KDK, Lam CLK, Cheung KMC, et al. Psychometric validation of the EuroQoL 5-Dimension 5-Level (EQ-5D-5L) in Chinese patients with adolescent idiopathic scoliosis. Scoliosis Spinal Disord. 2016;11:1–12. https://doi.org/10.1186/s13013-016-0083-x.

    Article  CAS  Google Scholar 

  20. Scott D, Scott C, Jelsma J, Abraham D, Verstraete J. Validity and feasibility of the self-report EQ-5D-Y as a generic health-related quality of life outcome measure in children and adolescents with juvenile idiopathic arthritis in Western Cape, South Africa. S Afr J Physiother. 2019;75:1–9.

    Article  Google Scholar 

  21. Scott D, Ferguson GD, Jelsma J. The use of the EQ-5D-Y health related quality of life outcome measure in children in the Western Cape, South Africa: Psychometric properties, feasibility and usefulness - a longitudinal, analytical study. Health Qual Life Outcomes. 2017;15:1–14.

    Article  Google Scholar 

  22. Robles N, Rajmil L, Rodriguez-Arjona D, Azuara M, Codina F, Raat H, et al. Development of the web-based Spanish and Catalan versions of the Euroqol 5D-Y (EQ-5D-Y) and comparison of results with the paper version. Health Qual Life Outcomes. 2015;13:1–9.

    Article  Google Scholar 

  23. Shiroiwa T, Fukuda T, Shimozuma K. Psychometric properties of the Japanese version of the EQ-5D-Y by self-report and proxy-report: reliability and construct validity. Quality Life Res. 2019;28:3093–105. https://doi.org/10.1007/s11136-019-02238-1.

    Article  CAS  Google Scholar 

  24. Burström K, Bartonek A, Broström EW, Sun S, Egmar AC. EQ-5D-Y as a health-related quality of life measure in children and adolescents with functional disability in Sweden: testing feasibility and validity. Acta Paediatr Int J Paediatr. 2014;103:426–35.

    Article  Google Scholar 

  25. Lin J, Wong CKH, Cheung PWH, Luo N, Cheung JPY. Feasibility of Proxy-reported EQ-5D-3L-Y and Its Agreement in Self-reported EQ-5D-3L-Y for Patients with Adolescent Idiopathic Scoliosis. Spine (Phila Pa 1976). 2020;45:E799–807. Available from: https://journals.lww.com/spinejournal/Fulltext/9000/Feasibility_of_Proxy_reported_EQ_5D_3L_Y_and_Its.94320.aspx

  26. Terwee CB, Prinsen CAC, Chiarotto A, Westerman MJ, Patrick DL, Alonso J, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Quality Life Res. 2018;27:1159–70. https://doi.org/10.1007/s11136-018-1829-0.

    Article  CAS  Google Scholar 

  27. Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Quality Life Res. 2018;27:1147–57. https://doi.org/10.1007/s11136-018-1798-3.

    Article  CAS  Google Scholar 

  28. Mokkink LB, de Vet HCW, Prinsen CAC, Patrick DL, Alonso J, Bouter LM, et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Quality Life Res. 2018;27:1171–9. https://doi.org/10.1007/s11136-017-1765-4.

    Article  CAS  Google Scholar 

  29. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Rev. 2015;4:1–9. Available from: https://systematicreviewsjournal.biomedcentral.com/articles/https://doi.org/10.1186/2046-4053-4-1

  30. Dams J, Rimane E, Steil R, Renneberg B, Rosner R, König HH. Reliability, validity and responsiveness of the EQ-5D-5L in assessing and valuing health status in adolescents and young adults with posttraumatic stress disorder: a randomized controlled trail. Psychiatr Q. 2021;92:459–71.

    Article  PubMed  Google Scholar 

  31. Willems DCM, Joore MA, Nieman FHM, Severens JL, Wouters EFM, Hendriks JJE. Using EQ-5D in children with asthma, rheumatic disorders, diabetes, and speech/language and/or hearing disorders. Int J Technol Assess Health Care. 2009;25:391–9.

    Article  PubMed  Google Scholar 

  32. Stolk EA, Busschbach JJ v, Vogels T. Performance of the EuroQol in Children with Imperforate Anus. Quality of Life Research. 2000;9:29–38. Available from: https://link.springer.com/article/https://doi.org/10.1023/A:1008923029886

  33. EuroQol Research Foundation. EuroQol Group. 2020. Available from: https://euroqol.org/. Cited 2020 May 16.

  34. Terwee CB, Jansma EP, Riphagen II, de Vet HCW. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Qual Life Res. 2009;18:1115–23.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Bramer WM, Milic J, Mast F. Reviewing retrieved references for inclusion in systematic reviews using endnote. Journal of the Medical Library Association. 2017;105:84–7. Available from: http://jmla.pitt.edu/ojs/jmla/article/view/111

  36. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737–45. Available from: https://www.jclinepi.com/article/S0895-4356(10)00090-9/fulltext

  37. Terwee CB, Mokkink LB, Knol DL, Ostelo RWJG, Bouter LM, de Vet HCW. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21:651–7.

    Article  PubMed  Google Scholar 

  38. Mokkink LB, Prinsen CAC, Patrick DL, Alonso J, Bouter LM, de Vet HCW, et al. COSMIN methodology for systematic reviews of Patient-Reported Outcome Measures (PROMs) - user manual. Amsterdam; 2018. Available from: https://www.cosmin.nl/

  39. Chiarotto A, Terwee CB, Kamper SJ, Boers M, Ostelo RW. Evidence on the measurement properties of health-related quality of life instruments is largely missing in patients with low back pain: a systematic review. J Clin Epidemiol. 2018;102:23–37. https://doi.org/10.1016/j.jclinepi.2018.05.006.

    Article  PubMed  Google Scholar 

  40. Terwee CB, Prinsen CAC, Chiarotto A, de Vet HCW, Bouter LM, Alonso J, et al. COSMIN methodology for assessing the content validity of PROMs - User manual. Amsterdam; 2018. Available from: https://www.cosmin.nl/wp-content/uploads/COSMIN-methodology-for-content-validity-user-manual-v1.pdf

  41. Brazier J, Deverill M. A checklist for judging preference-based measures of health related quality of life: Learning from psychometrics. Health Econ. 1999;8:41–51.

    Article  CAS  PubMed  Google Scholar 

  42. Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42. Available from: https://www.jclinepi.com/article/S0895-4356(06)00174-0/fulltext

  43. Prinsen CAC, Vohra S, Rose MR, Boers M, Tugwell P, Clarke M, et al. How to select outcome measurement instruments for outcomes included in a “Core Outcome Set” - a practical guideline. Trials. 2016;17:1–10. https://doi.org/10.1186/s13063-016-1555-2.

    Article  Google Scholar 

  44. Guyatt GH, Oxman AD, Vist GE, Kunz R, Falck-Ytter Y, Alonso-Coello P, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008;336:924–6. Available from: https://www.bmj.com/content/336/7650/924

  45. Rowen D, Keetharuth AD, Poku E, Wong R, Pennington B, Wailoo A. A Review of the psychometric performance of selected child and adolescent preference-based measures used to produce utilities for child and adolescent health. Value in Health. 2021;24:443–60. https://doi.org/10.1016/j.jval.2020.09.012.

    Article  PubMed  Google Scholar 

  46. Golicki D, Młyńczak K. Measurement Properties of the EQ-5D-Y: a systematic review. Value Health. 2022;1–12.

  47. Brussoni M, Kruse S, Walker K. Validity and reliability of the EQ-5D-3LTM among a paediatric injury population. Health Qual Life Outcomes. 2013;11:1–14. Available from: https://hqlo.biomedcentral.com/articles/https://doi.org/10.1186/1477-7525-11-157

  48. Bouwmans C, van der Kolk A, Oppe M, Schawo S, Stolk E, van Agthoven M, et al. Validity and responsiveness of the EQ-5D and the KIDSCREEN-10 in children with ADHD. Eur J Health Econ. 2014;15:967–77. Available from: https://link.springer.com/article/10.1007%2Fs10198-013-0540-x

  49. Stevens K. Developing a descriptive system for a new preference-based measure of health-related quality of life for children. Qual Life Res. 2009;18:1105–13.

    Article  PubMed  Google Scholar 

  50. Varni JW, Seid M, Kurtin PS. PedsQLTM 4.0: reliability and validity of the pediatric quality of life inventoryTM Version 4.0 generic core scales in healthy and patient populations. Med Care. 2001;39:800–12.

    Article  CAS  PubMed  Google Scholar 

  51. Ravens-Sieberer U, Erhart M, Rajmil L, Herdman M, Auquier P, Bruil J, et al. Reliability, construct and criterion validity of the KIDSCREEN-10 score: a short measure for children and adolescents’ well-being and health-related quality of life. Qual Life Res. 2010;19:1487–500.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Finch AP, Brazier JE, Mukuria C. What is the evidence for the performance of generic preference-based measures? a systematic overview of reviews. Eur J Health Econ. 2018;19:557–70.

    Article  PubMed  Google Scholar 

  53. Rabin R, Gudex C, Selai C, Herdman M. From translation to version management: a history and review of methods for the cultural adaptation of the euroqol five-dimensional questionnaire. Value in Health. 2014;17:70–6. https://doi.org/10.1016/j.jval.2013.10.006.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

CMES receives a master scholarship from the Sao Paulo Research Foundation (FAPESP) – process number 2020/06864–0; VSS receives a PhD scholarship from the Sao Paulo Research Foundation (FAPESP) – process number 2019/12049–0; TPY receives a research grant from the Sao Paulo Research Foundation (FAPESP) – process number 2017/17484–1. The funding agency of this study has no influence in the design of the study and collection, analysis, and interpretation of data and in writing of the manuscript. The study has not received external funding from any funding agency in the public, commercial or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Contributions

TY is the guarantor. CM, VS, GM AC and TY drafted the manuscript. CM, VS, GM, AC and TY contributed to the development of the selection criteria, risk of bias assessment and data extraction criteria. CM, VS and TY developed the search strategy. TY, GM, AC and MS provided statistical expertise. All authors read, provided feedback and approved the final manuscript.

Corresponding author

Correspondence to Caique de Melo do Espirito Santo.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

do Espirito Santo, C.d.M., Santos, V.S., Miyamoto, G.C. et al. Measurement properties of the EQ-5D in children and adolescents: a systematic review protocol. Syst Rev 13, 18 (2024). https://doi.org/10.1186/s13643-023-02443-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13643-023-02443-7

Keywords