4,357
Views
29
CrossRef citations to date
0
Altmetric
Pharmacoeconomic Trends

A systematic review of pharmacoeconomic guidelines

, , , , &
Pages 85-96 | Received 22 Jul 2017, Accepted 27 Sep 2017, Published online: 15 Oct 2017

Abstract

Objective: To review, summarize, and analyze both similarities and differences of pharmacoeconomic (PE) guidelines, to enable researchers to access their characteristics and the current state of PE guidelines; furthermore, to learn which methodological issues still remain contested and to promote the methodological development of PE guidelines.

Materials and methods: The authors performed a search for PE guidelines using PubMed, the Cochrane library database, and the websites of the International Society for Pharmacoeconomics and Outcomes Research. Information of each guideline was extracted using a pre-designed extraction template, which included 22 aspects; the guidelines were summarized in the forms of charts, and their characteristics have been described.

Results: A total of 40 PE guidelines were studied. The most common methodological issues include the types of analysis, sources for effectiveness, use of quality-adjusted life-years (QALYs) to measure outcomes, and use of incremental cost effectiveness ratios to present results. The majority of the guidelines preferred a cost utility analysis with outcomes expressed in terms of QALYs. Most of the guidelines preferred meta-analysis or meta-analysis of the randomized controlled trials, and required a systematic review of all evidence. Issues that varied most in the guidelines were the choice of the comparator, recommended costs to be included, methods related to indirect cost calculations, methods of sensitivity analysis, and discounting rate.

Conclusion: A comparison of these guidelines revealed that a number of differences exist among them in several key aspects, and some critical methodological issues still exist, for which no best solution is available. Furthermore, efforts need to be made to develop harmonious methods for the PE, and to improve the transferability of the outcomes of PE evaluations.

Introduction

Health resources are undoubtedly limited, and certainly cannot provide adequate treatment for all patients in all circumstancesCitation1. At the same time, health costs have been increasing over recent years. For example, it has been reported that national health expenditures in the US are projected to grow at an average annual rate of 5.6% from 2016 to 2025, accounting for 19.9% of the gross domestic product (GDP) by 2025Citation2. Moreover, seen from global resource allocation, total health expenditure accounts for about an average of 9.7% of the global GDPCitation3. Considering this situation, the question arises of how to best utilize these limited health resources and, consequently, how to reduce inferior programs and control health expenditures has become a challenging and hot research topic. Health economics (HE), developed to assess the outcomes of alternative treatment scenarios and policies, and to select optimal proposals while eliminating inferior programs, has become an effective method to solve such problemsCitation4. Pharmacoeconomics (PE) is a branch of HE, particularly focusing on the costs and benefits of pharmacy and medical technologyCitation3,Citation5. PE can effectively avoid resource waste and irrational distribution of medical resources and, thus, help to effectively allocate resources to maximize net health benefitsCitation6. This area is also the focus of this study.

Based on this, more attention has been focused on the development of the PE theory. An enormous increase in the PE literature could be observed over the past decades. However, reviews are divided about the quality of published PE studiesCitation7. A reason for this is that designs and implementation within PE studies are often unscientific. PE studies face several challenges, e.g. complex research methodology, or the fact that economic data can display substantial national, regional, or institutional variations, which can limit “transferability” and “comparability”. Moreover, many researchers are unfamiliar with the research methods of PE. To address these issues, guidelines for the improvement of the quality and comparability of PE research have been developed by several countries and organizationsCitation8. In 1992, Australia was the first country to introduce a mandatory methodological guideline for health economic evaluations of pharmaceuticalsCitation9,Citation10. After that, several countries and organizations developed their own HE/PE evaluation guidelines. Nevertheless, although these guidelines share a common basis, some of them require the presentation of local data, and integrated their own opinions with regard to methodological issues, especially for those for which there is still no consensus regarding appropriate or optimal methodsCitation11,Citation12. As a result, requirements stated in different guidelines are not always comparable, since they have their own characteristicsCitation11.

A review of national HE guidelines by Hjelmgren et al.Citation9 identified a lack of consensus on the key aspects of HE evaluations, such as choices of the perspective, and costs that should be included in the analysis. Furthermore, the study by Bracco and KrolCitation13 reviewed economic evaluations in European submission guidelines and showed a large variation among these guidelines in terms of recommendations regarding costs to be included and methods related to cost calculations and discounting. More recently, a similar study conducted by the EUnetHTA Joint Action et al.Citation12 compared existing methodological guidelines for the HE in countries that have organizations involved in the EUnetHTA. The study arrived at similar conclusions, revealing that these guidelines varied extensively in terms of recommendations regarding the sources for effectiveness and the choice of the perspective. However, most similar studies mainly focused on the HE, and the guidelines of European countries, thus ignoring guidelines for Africa, Asia, and Oceania.

We conducted a comprehensive overview and comparison of PE guidelines. The objective of this study was to update and add to previous similar research. In addition, we focus more on PE guidelines and differences among countries and organizations. We examined differences and similarities among guidelines to highlight the current state of PE guidelines, as well as methodological issues that are still contested or developed. By conducting this study, we hope to contribute to the further development of PE guidelines.

Materials and methods

Search strategy

This study is based on literature originating from various sources.

  1. Relevant guidelines were obtained by searching both PubMed and the Cochrane library databases up to March 15, 2017. The following terms were used (alone and in combination): “Pharmacoeconomic*”, “Health economic*”, “Pharmaceutical”, “Economic*”, “Health technology assessment*”, “cost effectiveness*”, “cost benefit*”, “cost utility*”, “recommend*”, “checklist*”, “guide*”. We combined the Medical Subject Heading (MeSH) and Title/Abstract Fields search. The full search strategy can be found in Supplementary Appendix 1.

  2. The websites of the International Society for Pharmacoeconomics and Outcomes Research (ISPOR), the primarily ISPOR’s “Pharmacoeconomic Guidelines Around The World” database was searched. All retrieved articles were imported into the bibliographic database of EndnoteX6.

Selection criteria

Inclusion criteria

Articles included satisfied the following criterion: PE or HE guidelines specifying that their area of application includes pharmaceuticals.

Exclusion criteria

  1. Older versions of guidelines.

  2. Papers reproducing or adapting others’ guidelines or recommendations as third-part content rather than providing original PE guidelines or recommendations.

  3. Guidelines which were only used to guide specific methodological issues or a sub-section rather than complete PE analysis (e.g. PE guidelines for modeling, budget impact analysis, cost analysis, expert opinions, “Piggyback” studies).

  4. Articles that describe how to read economic evaluations.

  5. Guidelines for a standard reporting format or question lists/tools for the evaluation of the quality of PE.

  6. Publications not in English or Chinese.

Study selection and information extraction

Initial screening by one reviewer (Ji Qu) removed all duplicates. Then, two blinded reviewers (Ye Zhao and Wen-juan Ma) assessed the titles and abstracts of the articles that were about PE/HE guidelines. After that, full-text articles of the remaining studies were browsed. Studies that met all of the selection criteria were finally reviewed. If differences occurred, consensus was sought, and, if this could not be reached, a third reviewer (Jin-hui Tian) was consulted and a final decision was achieved via discussion. The authors were contacted if more information was required to determine eligibility for inclusion. To obtain the necessary data, a template was pre-designed, which included the headings: year, primarily used for, affiliation of authors, type of guideline, purpose of the guidelines, target audience of author’s interests, study question, target population, perspective, comparator, types of analyses, preferred data sources for effectiveness, choices of outcome measures, costs to be included, modeling, time horizon, discounting rate, uncertainty, presentation of results, efficacy vs effectiveness, equity, and conflicts of interests. The template was based on items from a few examples of the existing guidelinesCitation14,Citation15. Two reviewers (Ye Zhao and Hai-ming Feng) independently collected and extracted the data. All data were compared between reviewers to check for accuracy and agreement prior to analysis. Inconsistencies between reviewers were reconciled via discussion. The results were tabulated under the following main topics: (1) General information about these guidelines; and (2) comparison of the main contents of these guidelines. In terms of the type of guideline in the general information section, the guidelines were divided into three groups according to their roles in the national reimbursement processes, following definitions used by Hjelmgren et al.Citation9, based on a prior work by DrummondCitation16. Formalized guidelines were defined as those whose recommendations were required or mandatory prior to reimbursement submissions. Informal guidelines were defined as those whose recommendations have been voluntary prior to reimbursement submissions. Guidelines for economic evaluations were defined as general recommendations intended to improve the methodology of PE evaluationsCitation9,Citation13. In addition (for recording convenience), we used the country name to replace the national PE/HE guideline listed in the ISPOR website. This does not, however, mean that the listed guideline was the only guideline for the country.

Results

A flowchart can be found in Supplementary Appendix 2, which followed a PRISMA flow diagramCitation17 and described the selection process of studies included in the paper. The electronic searches identified 8,418 citation records (PubMed = 6,311 and Cochrane library = 2,107); in addition, PE/HE guidelines found on the ISPOR website were also included (n = 31). After removal of duplicate copies (n = 53) via ENDNOTE X6, 8,396 studies remained. After screening the titles and abstracts, 311 articles were retrieved for full-text review. Of these 311 articles, 255 articles were included (220 did not comply with the inclusion criteria, 18 were not in English or Chinese, eight did not provide the original guidelines, six only described how to read guidelines, and three were standard reporting format or question lists/tools), which resulted in 56 articles that met the inclusion criteria. Of these 56 articles, two were excluded because we found an updated version, and 14 were excluded because they did not provide a complete guideline. Finally, the literature search resulted in 40 articles that met both the inclusion and exclusion criteria.

General information of the included guidelines

A structured summary of the general information of all 40 guidelines is presented in . Among these 40 guidelines, the first was published by Guyatt et al.Citation22 in 1986 and was aimed to guide PE evaluations. The most recent guidelines are the EUnetHTA Joint ActionCitation12, AustraliaCitation14, the NetherlandsCitation51, and the USCitation52, published in 2016. Among all 40 guidelines, 19Citation3,Citation14,Citation18,Citation19,Citation23,Citation26,Citation29,Citation30,Citation32,Citation37,Citation39,Citation40,Citation41,Citation43–46,Citation48,Citation49 were mainly developed for evaluations of pharmaceuticals, while 21Citation7,Citation12,Citation15,Citation20–22,Citation24,Citation25,Citation27,Citation28,Citation31,Citation33–36,Citation38,Citation42,Citation47,Citation50–52 were developed for all types of technologies. Of the guidelines, 45% were formal, 22.5% were informal, and 32.5% were guidelines for economic evaluations. All details are presented in and .

Table 1. General information of the included guidelines.

Table 2. Comparison of main content in these guidelines, categorized by topic.

Comparison of main content in these guidelines

A comparison of the detailed content of the 40 guidelines is presented in Supplementary Appendix 3. To simplify the understanding of the methodological issues that were recommended by the guidelines and their variation among countries, we categorized and presented the methodological issues according to topics in .

Purpose of these guidelines

The major purpose of these guidelines can be summarized as follows: (1) to provide standards for the conduct, reporting, and quality valuation of PE research, which can in turn increase the methodological quality, transparency, comparability, and uniformity of PE evaluations; (2) to provide a basis for drug reimbursement, and to assist policymakers in decision-making. Although the guidelines have a common basis, their purposes varied slightly, which becomes apparent when they are compared with each other.

Target audience of guideline’s interests

The main target audience of the guideline’s interests can be summarized as follows: pharmaceutical companies and manufacturers, academics, consumers, decision-makers, and researchers. Every guideline has its respective target audience and the information is presented in Supplementary Appendix 3.

Study question

Among all 40 investigated guidelines, 27 guidelines (67.5%)Citation3,Citation6,Citation14,Citation15,Citation18–21,Citation24–28,Citation30,Citation31,Citation34,Citation36,Citation38–43,Citation45,Citation47,Citation50,Citation51 indicated that the study questions should be clearly stated. Twenty-one guidelines (52.5%)Citation3,Citation4,Citation27,Citation30–35,Citation37,Citation39–41,Citation43–46,Citation49–52 stated that a clear description of the studied disease should be included, which may include epidemiological data, the burden of the disease, current treatments, and existing clinical guidelines for the disease. Twenty guidelines (50%)Citation3,Citation14,Citation15,Citation23,Citation27,Citation30–32,Citation35–37,Citation39–42,Citation45,Citation46,Citation50–52 required statement of the details of the studied medicines and technologies, which may include information about pharmacology, pharmacokinetics, pharmacodynamics, contraindications, and adverse events.

Target population

Thirty-five guidelines (87.5%)Citation3,Citation14,Citation15,Citation18–21,Citation23,Citation25–28,Citation30–52 required specification of the target population of the new health technology. Thirty guidelines (75%)Citation3,Citation14,Citation15,Citation18,Citation19,Citation21,Citation25–28,Citation30,Citation31,Citation33–36,Citation38–43,Citation45–52 recommended sub-group analysis if the need exists.

Perspective

Four guidelines (10%)Citation7,Citation18,Citation19,Citation25 only required that the perspective should be clearly stated without requiring a specific recommendation. Fourteen guidelines (35%)Citation3,Citation20–22,Citation24,Citation29,Citation33,Citation41,Citation43,Citation47–51 recommended inclusion of societal perspectives. Nineteen guidelines (47.5%)Citation12,Citation14,Citation15,Citation23,Citation26–28,Citation30–32,Citation34–37,Citation39,Citation40,Citation42,Citation45,Citation46 recommended inclusion of healthcare perspectives.

Comparator

Three guidelines (7.5%)Citation7,Citation18,Citation22 stated that the utilized comparator should be described in detail without requiring specific recommendations. The majority (15) of the guidelinesCitation15,Citation19–21,Citation24,Citation28,Citation30,Citation33,Citation36,Citation38,Citation43,Citation44,Citation48–50 recommended that the comparator should mainly be chosen from the most commonly used treatment, the least expensive, or the most effective currently available of the treatments (see ). Seven guidelines (17.5%)Citation14,Citation23,Citation40–42,Citation45,Citation52 indicated treatments that would most likely be replaced by new technologies could be chosen as comparators. Eleven guidelines (27.5%)Citation3,Citation25,Citation26,Citation29,Citation31,Citation35,Citation37,Citation39,Citation46,Citation47,Citation51 recommended that standard treatments should be chosen as the comparator. Two guidelines (5%)Citation27,Citation34 required that all therapeutic alternatives should be included, as a necessary means to estimate a cost-effectiveness frontier for multiple interventionsCitation53. Furthermore, two guidelines (5%)Citation32,Citation48 recommended the use of the most appropriate treatment (e.g. the most frequently used treatments, the minimum therapy methods, or no treatment) as comparators.

Types of economic analysis

The primary viewpoints of 14 guidelines (35%)Citation7,Citation18–24,Citation23,Citation33,Citation39,Citation44,Citation46,Citation47 were that the cost effectiveness analysis (CEA), cost utility analysis (CUA), cost minimization analysis (CMA), or cost benefit analysis (CBA) can be chosen, where the choice mainly depends on the research question; however, the choice should be justified. Twelve guidelines (30%)Citation14,Citation31,Citation32,Citation36,Citation37,Citation40,Citation41,Citation43,Citation45,Citation48,Citation50,Citation51 stated the CUA as the preferred type of economic analysis. Eleven guidelines (27.5%)Citation12,Citation15,Citation26–30,Citation35,Citation38,Citation49,Citation52 indicated either CEA or CUA as preferred analyses, and four guidelines (10%)Citation27,Citation35,Citation38,Citation42 indicated the CBA unlikely to be helpful. Only the Chinese guidelineCitation3 stated that the CUA or CBA are preferred. The German guidelineCitation34 provided no accurate description of the types of economic analysisCitation53. The CMA can only be used when no differences exist in the effects among compared technologiesCitation12.

Sources for effectiveness

Randomized controlled trials (RCTs) or the meta-analysis of RCTs were the preferred sources for effectiveness in 26 guidelines (65%)Citation12,Citation14,Citation19,Citation21,Citation24–32,Citation35,Citation36,Citation39–44,Citation46,Citation49–52. Although Mathes et al.Citation53 considered that the Polish guideline preferred using data from observational studies, the guideline itself stated that data from the highest available level of evidence has to be used, and systematic reviews (with or without meta-analysis) or direct comparisons of RCTs are of higher priority in terms of credibilityCitation42. Nine guidelines (22.5%)Citation12,Citation14,Citation18,Citation27,Citation31,Citation40,Citation42,Citation44,Citation51 required assessment of the quality of evidence. Twenty-three guidelines (57.5%)Citation12,Citation14,Citation15,Citation20,Citation23,Citation27,Citation28,Citation30–32,Citation35–37,Citation39–42,Citation44–46,Citation49,Citation51,Citation52 required a systematic literature search. Three guidelines (7.5%)Citation7,Citation38,Citation47 explicitly stated the attractivity of pragmatic trials. Fifteen guidelines (37.5%)Citation14,Citation23,Citation28,Citation31–35,Citation37,Citation41,Citation42,Citation45,Citation46,Citation51,Citation52 preferred direct comparisons. Eight guidelines (20%)Citation3,Citation15,Citation18,Citation23,Citation33,Citation34,Citation37,Citation45 did not further clarify which sources are preferred.

Choices of outcome measures

If the CEA was chosen as the method of PE analysis, the results should be expressed in terms of natural units, such as the years of life gained, hospital days prevented, and clinical parametersCitation3,Citation7,Citation12,Citation14,Citation15,Citation18,Citation21–24,Citation26–28,Citation30,Citation33,Citation35,Citation36,Citation38,Citation39,Citation42,Citation43,Citation46,Citation47,Citation49,Citation51. Almost all of the guidelines predominantly described the CUA with outcomes expressed in terms of quality-adjusted life-years (QALYs). The QALYs combine changes in both the quantity and quality-of-life (mortality and morbidity) into one composite measureCitation40, which is one of the reasons why the QALYs is particularly useful. The QALYs can be calculated by multiplying the quantity of life-years gained due to the intervention by health state utilities that reflect the health-related quality-of-life (HRQoL) during that timeCitation15. Health state utilities can be directly or indirectly measured. Direct measurement is a complex and expensive task. The main methods which can be used for direct measurement include: standard gamble (SG), time trade-off (TTO), and visual analog scale (VAS). The indirect measurement exists to use some instruments to indirectly obtain utilities-such as “Off the shelf” instruments: the Health Utilities Index (HUI) and the EQ-5DCitation15. Eighteen guidelines (45%)Citation14,Citation15,Citation23,Citation26–28,Citation30,Citation31,Citation35,Citation36,Citation40–42,Citation45,Citation47,Citation48,Citation50,Citation51 explicitly stated that indirect methods are recommended. Four guidelines (10%)Citation20,Citation22,Citation29,Citation33 recommended direct methods. Twenty-two guidelines (55%)Citation3,Citation7,Citation14,Citation15,Citation20,Citation21,Citation23,Citation27,Citation28,Citation30,Citation31,Citation36,Citation37,Citation39,Citation41,Citation43,Citation47–52 required statement of the population from which the utilities originate. The majority of guidelines recommended that the utilities should originate from the general population. Nine guidelines (22.5%)Citation3,Citation15,Citation27,Citation28,Citation31,Citation34,Citation36,Citation47,Citation49 stated that the quantity and quality-of-life should be presented in disaggregated and aggregated forms. Ten guidelines (25%)Citation14,Citation27,Citation30,Citation31,Citation35,Citation36,Citation40,Citation41,Citation45,Citation50 accepted mapping from disease-specific quality-of-life (QoL) measures to preference-based values if no data from the preferred instrument was available. If the CBA was chosen, the results should be expressed in a monetary form. Eight guidelines (20%)Citation3,Citation15,Citation19,Citation22,Citation33,Citation43,Citation47,Citation49 recommended the willingness to pay (WTP) when the results need to be monetized. Seven guidelines (17.5%)Citation3,Citation24,Citation29,Citation33,Citation36,Citation38,Citation40 suggested that intangible costs should be included in outcomes rather than in costs.

With regard to the type of endpoints, 28 guidelines (70%)Citation3,Citation7,Citation14,Citation15,Citation18,Citation23–25,Citation27,Citation28,Citation30,Citation31,Citation33,Citation35,Citation36–38,Citation40–44,Citation46–50,Citation52 stated that final endpoints are preferred and, if intermediate endpoints were to be used, the connection between them and final endpoints should be clear.

Costs

The choice of the perspective determines the costs to be included. Thirty-two guidelines (80%)Citation3,Citation7,Citation12,Citation15,Citation18–21,Citation23–27,Citation29,Citation30,Citation32,Citation34–38,Citation40,Citation42,Citation43,Citation45–51 required clear recording of the amount of resources or both quantities of resources and unit costs. Eight guidelines (20%)Citation22,Citation28,Citation31,Citation33,Citation39,Citation41,Citation44,Citation52 either did not clearly specify the presentation of resources or only roughly outlined that resources used should be recorded.

Even in those guidelines that recommended the same perspective, the included costs differed. Among the guidelines that recommended a societal perspective, 11 guidelines (78.5%)Citation3,Citation20,Citation21,Citation24,Citation29,Citation33,Citation38,Citation47–50 required that direct and indirect costs should be included. Only the Portuguese guidelineCitation43 suggested that intangible costs should also be included, which was not mandatory. This is because the Portuguese guideline had also recognized that intangible costs are not strictly economic costs, and can never be quantified or valuedCitation43. Direct costs can primarily be divided into medical and non-medical costs. In addition, the NetherlandsCitation51 used different requirements from the above-mentioned guidelines (see ).

From a healthcare perspective, 12 guidelines (63.1%)Citation14,Citation23,Citation26,Citation27,Citation28,Citation31,Citation36,Citation37,Citation40,Citation44–46 required only direct healthcare costs to be included. CanadaCitation15, EgyptCitation30, FinlandCitation32, GermanyCitation34, and HungaryCitation35 differed from the above-mentioned guidelines (see ).

In addition, 10 guidelines (25%)Citation7,Citation12,Citation18,Citation19,Citation22,Citation25,Citation39,Citation41,Citation42,Citation52 merely stated that all relevant costs should be included that are related to the chosen perspective, or that had recommendations not in accordance with the choice of perspective (e.g. PolandCitation42 recommended the healthcare perspective, while providing a recommendation regarding costs from a societal perspective).

Seventeen guidelines (42.5%)Citation3,Citation15,Citation20,Citation21,Citation25,Citation27,Citation29,Citation33,Citation34,Citation38,Citation39,Citation42,Citation47–51 provided different methods for the measurement of lost productivity, with little consensus. Four guidelines (10%)Citation15,Citation20,Citation21,Citation51 recommended friction cost methods. Eight guidelines (20%)Citation3,Citation25,Citation34,Citation38,Citation42,Citation48–50 recommended the human capital method (HCM). Five guidelines (12.5%)Citation27,Citation29,Citation33,Citation39,Citation47 suggested that lost productivity can be evaluated using the HCM or friction cost methods. Twelve guidelines (30%)Citation14,Citation15,Citation22,Citation26,Citation27,Citation30,Citation34–36,Citation40,Citation50,Citation51 required that costs should be adjusted for inflation and/or currency conversions. For future costs (which are costs associated with patients living longer as a result of a given intervention and, hence, consume additional health resources), numerous different opinions exist, and most of the guidelines divided them into direct and indirect. Nine (22.5%)Citation3,Citation15,Citation24,Citation27,Citation30,Citation34,Citation35,Citation40,Citation43 guidelines considered direct future costs (directly related to the consequence of the intervention or the treatment of the studied disease) should be included, among them, threeCitation31,Citation32,Citation40 guidelines stated the inclusion about indirect future costs (unrelated to their initial diagnosis or treatment) to still be a contentious issue. However, five (12.5%)Citation18,Citation27,Citation34,Citation40,Citation43 guidelines did not mention direct future costs, and only stated indirect future costs, which should not be included. The French guidelineCitation33 stated the inclusion about future costs to be a contentious issue. ThailandCitation50 considered future costs should not be included, while the NetherlandsCitation51 stated that all future costs should be included.

Modeling

All guidelines, except for Guyatt et al.Citation7, stated that the use of models is accepted in PE evaluations. Twenty-seven (67.5%)Citation3,Citation7,Citation14,Citation15,Citation21,Citation23,Citation24,Citation27,Citation28,Citation30,Citation31,Citation33,Citation34,Citation36–43,Citation45–47,Citation49,Citation50,Citation52 guidelines required that the details of models and justifications for the choice of modeling technique should be provided. Eight (20%)Citation12,Citation19,Citation25,Citation26,Citation32,Citation35,Citation44,Citation51 guidelines required that the details of models should be provided, but did not require any justification for the choice of the utilized modeling technique. Four (10%)Citation18,Citation20,Citation29,Citation48 guidelines only mentioned that models can be used. Fifteen (37.5%)Citation14,Citation15,Citation27,Citation30,Citation31,Citation33,Citation34,Citation36,Citation41,Citation42,Citation48–52 guidelines stated that a validation of the utilized model should be provided. Nine (60%)Citation15,Citation30,Citation31,Citation33,Citation34,Citation36,Citation41,Citation42,Citation48 of these further divided validity into both internal validity (to ensure the robustness of model) and external validity (can be tested in numerous ways including a comparison of the results with those of other models or via direct empirical evidence).

Time horizon

Twenty-five guidelines (62.5%)Citation3,Citation7,Citation12,Citation15,Citation19,Citation20,Citation26,Citation28,Citation30–33,Citation35,Citation36,Citation39–42,Citation45–47,Citation49–52 recommended the use of a sufficiently long time horizon of economic evaluations to reflect all important differences in either costs or outcomes among technologies that are compared. Four guidelines (10%)Citation18,Citation21,Citation25,Citation27 stated that the time horizon depends on the studied technology or the research question. Three guidelines (7.5%)Citation24,Citation37,Citation43 required the time horizon to coincide with clinical conditions and the effects of treatment. AustraliaCitation14 required that the time horizon should capture all important differences in both costs and outcomes; however, these do not have to unnecessarily extend beyond this. GermanyCitation34 stated that the time horizon should at least cover the duration of RCTs. SwedenCitation48 required that the time horizon should cover the period during which the main health effects and costs arise.

Discounting

Thirty guidelines (75%)Citation3,Citation7,Citation12,Citation14,Citation15,Citation20,Citation21,Citation24–26,Citation28,Citation30–41,Citation43,Citation45–50 recommended to use the same discount rate for both costs and outcomes, while three guidelines (7.5%)Citation27,Citation42,Citation51 recommended a higher discount rate for costs than for outcomes, and RussiaCitation44 required only costs to be discounted at a flat 5% rate. Five guidelines (12.5%)Citation12,Citation18,Citation22,Citation29,Citation52 did not further specify any rates for discounting. Most guidelines used a standard discount rate of 3–5% for both costs and outcomes. In addition, 30 guidelines (75%)Citation3,Citation7,Citation12,Citation14,Citation15,Citation20,Citation21,Citation24,Citation25,Citation27–31,Citation33–36,Citation38–40,Citation42,Citation43,Citation45–51 required inclusion of a sensitivity analysis (SA) of the discount rate. The ranges of the discount rate mostly ranged between 0–10%.

Uncertainty

All guidelines recommended the SA to handle uncertainty. For parameter uncertainty, six guidelines (15%)Citation28,Citation31,Citation36,Citation41,Citation50,Citation51 chose the probabilistic sensitivity analysis (PSA) as a preferred method. However, three guidelines (7.5%)Citation14,Citation40,Citation45 suggested the use of the deterministic sensitivity analysis (DSA) for routine examination, and the PSA should only be used additionally. Three guidelines (7.5%)Citation3,Citation15,Citation27 stated that parameter uncertainty should be handled by the DSA and PSA. Seven guidelines (17.5%)Citation3,Citation14,Citation15,Citation27,Citation36,Citation40,Citation41 required the model uncertainty to be handled by the DSA. The remaining guidelines did not distinguish between the types of uncertainty, implying that this can be handled by the SA, the PSA, or the DSA, while three of themCitation46,Citation47,Citation49 required 1-way SA as the basic method. The USCitation52 considered both types of SA. Several guidelines recommended results to be presented using confidence intervals (CIs) around the main variable, a cost effectiveness (CE) plane, or cost effectiveness acceptability curves (CEAC).

Presentation of results

The presentation of results has been detailed by 36 guidelines (90%)Citation3,Citation7,Citation12,Citation14,Citation15,Citation18,Citation19,Citation21–23,Citation25–28,Citation30–43,Citation45–52. Most of the guidelines recommend presenting the results in disaggregated and/or aggregated forms. Thirty-one guidelines (77.5%)Citation3,Citation7,Citation12,Citation14,Citation15,Citation19,Citation23,Citation25–28,Citation30–33,Citation35–40,Citation42,Citation43,Citation45–52 recommended using incremental cost effectiveness ratios (ICERs) for the presentation of results.

Efficacy vs effectiveness

Twenty-four guidelines (60%)Citation3,Citation7,Citation15,Citation19,Citation23–25,Citation27,Citation28,Citation30,Citation33,Citation35,Citation36,Citation38,Citation39,Citation42–44,Citation46,Citation47,Citation49,Citation52 explicitly emphasized differences between efficacy and effectiveness. Twenty-two guidelines (55%)Citation3,Citation7,Citation15,Citation19,Citation23,Citation25,Citation27,Citation28,Citation30,Citation33,Citation35,Citation36,Citation38,Citation39,Citation42–44,Citation46,Citation49–52 specifically preferred effectiveness. SpainCitation42 recommended that both efficacy and effectiveness should be used.

Equity

Equity mainly means that all patients should have a fair opportunity for participation and for obtaining the expected treatment outcomesCitation30. For example, to determine equity in PE, an additional QALY should receive the same weight, regardless of any other utilized characteristics (e.g. their socio-demographic characteristics, their age, or their level of health) of the people receiving the health benefitCitation31,Citation36. Sixteen guidelines (40%)Citation3,Citation14,Citation15,Citation19,Citation20,Citation22,Citation27,Citation28,Citation30,Citation31,Citation33,Citation35,Citation36,Citation45,Citation47,Citation49,Citation50 required that equity should be discussed.

Conflicts of interests

Thirteen guidelines (32.5%)Citation15,Citation19,Citation21,Citation29,Citation33,Citation35,Citation39,Citation42,Citation43,Citation47–50 indicated that any existing conflicts of interests should be discussed.

Discussion and conclusions

This paper presents a systematic review and comparison of PE guidelines. By analyzing similarities and differences among a total of 40 PE guidelines that were included in this study, we identified their individual characteristics to provide a way for researchers and experts in the PE to access the present methodological development of the PE. Methodological issues that substantially varied among the guidelines were mainly those that presented controversy or no consensus regarding an appropriate methodCitation53. The present study is intended to provide a preliminary understanding of these issues, and to enable PE specialists to focus more on them to contribute to the next phase of PE guideline development.

According to our study results, various countries all over the world produced numerous PE guidelines. Some of these were updated every few years, such as the guidelines of AustraliaCitation14, the NetherlandsCitation51, and the USCitation52; however, some guidelines have never been updated since they were produced, such as DetskyCitation19, Guyatt et al.Citation22, and the guideline of PortugalCitation43. However, it is important that PE guidelines develop rigorous, evidence-based, clinical, and policy-relevant processes. Therefore, PE guidelines should be reviewed periodically to ensure that they keep pace with methodological developments and changing environmentsCitation54. Many countries produced compulsory PE guidelines for use prior to pharmaceutical reimbursement submissions. However, some guidelines remain voluntary. The purpose of PE guidelines can be summarized as to provide standards for PE research, and to ultimately improve the quality of PE evaluations. Furthermore, many guidelines intend to provide assistance for policymakers in their decision-making. It is important for PE research to introduce good study questions and to provide a clear description of the studied medicines and technologies. We pre-conceived that these issues are so important that they would be mentioned in all guidelines. However, the results are unsatisfactory. Among the 40 guidelines, only 27 guidelines mentioned that study questions should be clearly stated, and 20 guidelines indicated that there should be detailed descriptions of studied interventions. We suggest that the reason might be that these issues are so important that guideline makers consider there is no need to specifically describe them. The choice of the perspective is vital, because it provides a basis for the measurement of costs and outcomes. Although the societal perspective was often recommended in the literature, since it is comparatively ideal in theory (because it takes all relevant consequences into account at a broader scope, thus providing insightful information for decision makers), only 14 guidelines recommended itCitation27,Citation50. The reason might be that this perspective is very difficult to implement. Consequently, most guidelines recommended the healthcare perspective. In addition, we found that the majority of guidelines recommending the societal perspective were informal, while most formalized guidelines recommended a healthcare perspective. Bracco and KrolCitation13 suggested that the reason might be that the target audience of formalized guidelines more likely cared about costs that they must bear than those falling outside their responsibility. Although recommended costs to be included depend on the choice of the perspective, some unjustifiable differences exist among the guidelinesCitation53. Several guidelines did not detail which costs should be included, instead briefly describing all relevant costs related to the perspective. We do not think that such vague descriptions are scientifically valid or help in improving the quality of PE research. Even in those guidelines that recommended the same perspective, the included costs differed. For example, from the societal perspective, some guidelines recommended direct medical and non-medical costs, as well as indirect costs, while others stated that direct and indirect costs should be included just in general terms. Moreover, some guidelines, such as the NetherlandsCitation32, recommended that costs for patients and families, costs within the healthcare system, and costs in other sectors should be included. Eight guidelines specifically mentioned intangible costs, and all guidelines, except for PortugalCitation43, suggested that intangible costs should be included in the outcome to avoid double-counting. Bracco and KrolCitation13 indicated that there is little agreement on methods for lost productivity evaluation. We agree with their conclusion, as our results showed that the measurement of lost productivity basically involves two methods: the HCM and friction cost methods. Most guidelines recommended the HCM. Some guidelines did not recommend friction cost methods because they considered them to be challenging for data collection, due to requiring far more empirical work before they can be appliedCitation33,Citation49. However, the guidelines supporting friction cost methods stated that the HCM rely on the assumption of full employment being unrealistic, leading to a possible over-estimation of lost productionCitation15,Citation33. Therefore, these guidelines suggest friction cost methods to be more realistic. In general, both methods have flaws, and can only be used to evaluate lost productivity of paid labor. For unpaid work or leisure time, limited guidelines existCitation13. From a healthcare perspective, some guidelines only included direct healthcare costs, while some mentioned additional costs. There are arguments about the inclusion of future costs. Most guidelines agreed with the inclusion of direct future costs. However, whether these indirect future costs should be included has not been decided or reached a general agreement. Only the NetherlandsCitation51 stated that all future costs should be included. Sanders et al.Citation55 agreed with the NetherlandsCitation51, and also recommended that all related and unrelated future costs should be included. Regarding the choice of the comparator, there are still some different ideas. However, most guidelines recommended that standard treatments, or treatments that would most likely be replaced by new technologies, should be chosen as a comparator. Only two guidelinesCitation27,Citation34 required that all therapeutic alternatives should be included. Drummond et al.Citation54 suggest all relevant technologies should be included in HTA, since potential inefficiencies exist in all forms of healthcare. Otherwise, decision-making concerning the use of resources may be distortedCitation54. Another issue that varied substantially among the guidelines was regarding the methods of SA. All guidelines recommended using SA to handle uncertainty. However, the majority of them only roughly outlined that uncertainty should be handled by SA or the DSA/PSA, but did not distinguish specific types of uncertainty to provide more targeted recommendations. In addition, there is no unanimous opinion on how to handle parameter uncertainty. However, most guidelines recommended the PSA over the DSA. Methods of SA were also based on different statistical assumptions, providing different implications for decisions. As a result, restrictions on certain types of SA can be less helpfulCitation53. Not surprisingly, almost all of the guidelines accepted the use of models, because these indeed form a centrally important sub-section of PE evaluations, and most of the guidelines (87.5%) specifically required that the details of models should be provided. However, only 15 guidelines required that the validity of the model should be provided, and only six of them briefly mentioned the validity rather than divided it into internal and external validity in more detail. Such rough descriptions may confuse researchers of PE evaluations. Equity and conflicts of interests had received far less attention than other issues. PE guidelines should focus more on these issues, as they might affect the allocation of resources. All such methodological issues that vary substantially among the guidelines are likely to impact the transferability of results in PE studies. The transferability of PE researches may be a problem, which should be brought to our attention. This may be particularly important in small jurisdictions and/or regarding decision problems that affect a small number of individuals. The methodological guidelines should recognize that limited resources are available for all activitiesCitation56. Therefore, if we knew that the values individuals place on health states are fairly similar across countries, it would be unnecessary for researchers or decision-makers in a given jurisdiction to specify that health state valuations must originate from the local population, thus avoiding unnecessary duplication of work and expenseCitation53,Citation56.

Issues with relatively unified recommendations were the preferred type of analysis, preferred sources for effectiveness, the use of the QALYs to measure outcomes, and the ICER to present results. The majority of the guidelines preferred the CUA, with outcomes expressed in terms of the QALYs, and an important reason was that the QALYs enable one to compare outcomes of different technologies across different activities in the healthcare sectorCitation36. However, there are some limitations of QALYs. A concern when using QALYs is that they do not discriminate between conditions with different severityCitation15. For instance, QALYs may not accurately reflect the burden of the diseases, which is short-lived, but typically intense. As a result, the QALYs gained because of an intervention that generates marginal gains for many people could be equal to the QALYs gained due to an intervention that generates substantial gains for a small number of peopleCitation55. These issues should be further explored. Most guidelines preferred meta-analysis, and required a systematic review of all evidence, because meta-analysis and systematic reviews were considered to be of the highest priorityCitation42. Moreover, although almost all guidelines agreed that costs and outcomes should be discounted, the discount rate varied substantially among guidelines.

In conclusion, this systematic review of PE guidelines identified areas of agreements and disagreements by comparing PE guidelines. Using this comparison, it can be summarized that methodological issues still remain and require further investigation.

Limitations

We have included a substantial number of PE/HE guidelines. It is, however, possible that we have missed some guidelines, e.g. guidelines published in non-English versions. We have also excluded guidelines that were not available at the time of our analysis. Furthermore, the accuracy of the data extracted from these guidelines depends on the interpretations of individual extractors. It may be the case that some extractors did not interpret the information consistently or accurately. However, we have reduced the bias, and improved the accuracy of our results. We believe our conclusions are still sufficiently accurate to provide a valuable reference.

Transparency

Declaration of funding

This manuscript has no sponsorship/funding.

Declaration of financial/other relationships

The authors declare that they have no conflicts of interests nor financial relationships that need to be disclosed. Peer reviewers on this manuscript have received an honorarium from JME for their review work, but have no other relevant financial relationships to disclose.

Supplemental material

Supplemental material

Download Zip (151.8 KB)

Acknowledgements

No assistance in the preparation of this article is to be declared.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.