Addressing racial bias in pulse oximetry

Article Type
Changed
Fri, 08/05/2022 - 15:17

Pulse oximetry is a vital monitoring tool in the ICU and in pulmonary medicine. Regrettably, re-emerging data show that pulse oximeters do not accurately measure blood oxygen levels in Black patients, presumably due to their skin tone. Patients with darker skin are, therefore, more likely to experience occult hypoxemia (i.e., low arterial oxygen saturation despite a seemingly normal pulse oximetry reading). While inaccuracy of pulse oximeter measurements in patients with darker skin has been recognized for decades, recent studies have highlighted this as an ongoing problem with potentially severe consequences for Black patients and other patients of color.

One recent study found that Black patients had almost three times the likelihood of occult hypoxemia compared with White patients (Sjoding, MW, et al. N Engl J Med. 2020;383[25]:2477-8).

Subsequent studies have confirmed this to be a widespread problem across various clinical settings in hundreds of hospitals (Wong AI, et al. JAMA Netw Open. 2021;4[11]:e2131674; Valbuena VS, et al. Chest. 2022;161[4]:971-8). A recent retrospective cohort study of patients with COVID-19 found that occult hypoxemia in Black and Hispanic patients was associated with delayed eligibility for potentially lifesaving COVID-19 therapies (Fawzy AF, et al. JAMA Intern Med. 2022; published online May 31, 2022).

Dr. Emily A. Harlan, University of Michigan, Ann Arbor
Dr. Emily A. Harlan

Now that numerous studies have demonstrated the inaccuracy of pulse oximetry with the potential to cause harm to historically marginalized racial and ethnic groups, must we abandon the use of pulse oximetry? We would argue that pulse oximeters remain valuable tools, but for now, we must adapt our practice until better devices are widely adopted.

First, it is crucial that health professionals are aware that pulse oximeters may underestimate the true extent of hypoxemia for all patients, but particularly for patients with darker skin. Acknowledging this device flaw is essential to avoid harm to our patients.

Second, clinicians must have heightened skepticism for seemingly normal pulse oximetry values when caring for symptomatic patients at risk of occult hypoxemia.

Until better pulse oximeters are widely available, clinicians must consider workarounds aimed at ensuring timely identification of hypoxemia in Black patients and other patients of color.

These patients may need invasive monitoring of arterial oxygenation, including arterial blood gas checks or an arterial catheter. However, invasive monitoring comes at the cost of discomfort to patients and potential complications, such as vessel or nerve damage.

Dr. Daniel Colon Hidalgo, University of Colorado, Aurora
Dr. Daniel Colon Hidalgo

Invasive monitoring of patients at risk for occult hypoxemia is not an equitable or acceptable long-term solution for this problem. As advocates for patients, clinicians and professional organizations should lobby regulatory bodies to ensure pulse oximeters are accurate for all patients.

We must also call on government leaders to move this process forward. For example, in response to efforts by the United Kingdom’s Intensive Care Society, the Health Secretary of the UK, Sajid Javid, has called for a review of pulse oximeters as part of a larger review assessing structural issues in health care that lead to worse outcomes in racial and ethnic minorities (BBC News. https://www.bbc.com/news/uk-59363544. Published online Nov. 21, 2021).

Device companies are largely for-profit corporations with obligations to their shareholders. It seems that existing incentives are insufficient to motivate investment in less biased technology and real-world evaluations of their devices.

We previously called for buyers of pulse oximeters to change the incentives of device companies – that is, for “hospitals to commit to only purchasing pulse oximeters that have been shown to work equally well in patients of colour.” (Hidalgo DC, et al. Lancet Respir Med. 2021;9[4]:E37). And, indeed, we worry that hospitals are putting themselves at medicolegal risk by not raising their purchasing standards. Since it is now widely known that pulse oximeters are inaccurate in certain patients, could there be liability for hospitals that continue to use devices we know to be disproportionately inaccurate by race?

Dr. Thomas S. Valley


Device companies must commit to fixing racial bias in pulse oximeters. Change is feasible, and pulse oximeters can be redesigned to be accurate and reliable among all patients using existing technology that is decades-old.

In the 1960s and 1970s, Hewlett Packard worked with NASA to noninvasively measure oxygen saturation in astronauts (Moran-Thomas, M. Wired. Published online June 4, 2021. https://www.wired.com/story/pulse-oximeters-equity). The device was designed to work for all skin tones and could be calibrated based on an individual’s skin tone. However, Hewlett Packard moved away from medical devices in the 1980s, shelving their design while other companies took over the oximeter market.

Lastly, as new devices are designed, they must be proven to work for all patients. Testing should be conducted in real-world clinical settings using metrics aligned with clinical care, since we know testing in artificial environments may not generalize to critically ill patients. Testing standards historically used by the FDA, such as only requiring device testing in a small number of non-White individuals, may miss clinically relevant hypoxemia. Non-inferiority studies are particularly susceptible to poor design or under-powering, and rigorous standards are needed from unbiased sources.

While potential solutions are currently being evaluated, the fact remains that the inaccuracy of pulse oximeters has been known for decades without any meaningful action taken to correct the problem.

As Valeria Valbuena, author of a study demonstrating inaccuracy of pulse oximetry in patients about to undergo ECMO, points out, “Using White patients as the standard in biomedical design has led to both differential care and innovation inertia for optimizing the way devices and algorithms work for patients of racial and ethnic minoritized groups” (Valbuena VS. JAMA Intern Med. 2022. doi: 10.1001/jamainternmed.2022.1903).

We know that hypoxemia is dangerous for our patients and that this is only one example of the long-standing systemic racism leading to harm in historically marginalized racial and ethnic groups. It is unacceptable that the devices we rely on to care for our patients are disproportionately inaccurate in non-White patients.

We hope that with increased awareness of this problem, meaningful action will be taken by device companies to ensure pulse oximeters work accurately for all patients.




From the Division of Pulmonary and Critical Care, Department of Medicine and the Center for Bioethics and Social Sciences in Medicine, University of Michigan Medical School (Drs. Harlan and Valley), and the Institute for Healthcare Policy and Innovation (Dr. Valley), University of Michigan, Ann Arbor, MI; and the Division of Pulmonary Sciences and Critical Care Medicine, University of Colorado School of Medicine, Aurora, CO (Dr. Colon Hidalgo).

Publications
Topics
Sections

Pulse oximetry is a vital monitoring tool in the ICU and in pulmonary medicine. Regrettably, re-emerging data show that pulse oximeters do not accurately measure blood oxygen levels in Black patients, presumably due to their skin tone. Patients with darker skin are, therefore, more likely to experience occult hypoxemia (i.e., low arterial oxygen saturation despite a seemingly normal pulse oximetry reading). While inaccuracy of pulse oximeter measurements in patients with darker skin has been recognized for decades, recent studies have highlighted this as an ongoing problem with potentially severe consequences for Black patients and other patients of color.

One recent study found that Black patients had almost three times the likelihood of occult hypoxemia compared with White patients (Sjoding, MW, et al. N Engl J Med. 2020;383[25]:2477-8).

Subsequent studies have confirmed this to be a widespread problem across various clinical settings in hundreds of hospitals (Wong AI, et al. JAMA Netw Open. 2021;4[11]:e2131674; Valbuena VS, et al. Chest. 2022;161[4]:971-8). A recent retrospective cohort study of patients with COVID-19 found that occult hypoxemia in Black and Hispanic patients was associated with delayed eligibility for potentially lifesaving COVID-19 therapies (Fawzy AF, et al. JAMA Intern Med. 2022; published online May 31, 2022).

Dr. Emily A. Harlan, University of Michigan, Ann Arbor
Dr. Emily A. Harlan

Now that numerous studies have demonstrated the inaccuracy of pulse oximetry with the potential to cause harm to historically marginalized racial and ethnic groups, must we abandon the use of pulse oximetry? We would argue that pulse oximeters remain valuable tools, but for now, we must adapt our practice until better devices are widely adopted.

First, it is crucial that health professionals are aware that pulse oximeters may underestimate the true extent of hypoxemia for all patients, but particularly for patients with darker skin. Acknowledging this device flaw is essential to avoid harm to our patients.

Second, clinicians must have heightened skepticism for seemingly normal pulse oximetry values when caring for symptomatic patients at risk of occult hypoxemia.

Until better pulse oximeters are widely available, clinicians must consider workarounds aimed at ensuring timely identification of hypoxemia in Black patients and other patients of color.

These patients may need invasive monitoring of arterial oxygenation, including arterial blood gas checks or an arterial catheter. However, invasive monitoring comes at the cost of discomfort to patients and potential complications, such as vessel or nerve damage.

Dr. Daniel Colon Hidalgo, University of Colorado, Aurora
Dr. Daniel Colon Hidalgo

Invasive monitoring of patients at risk for occult hypoxemia is not an equitable or acceptable long-term solution for this problem. As advocates for patients, clinicians and professional organizations should lobby regulatory bodies to ensure pulse oximeters are accurate for all patients.

We must also call on government leaders to move this process forward. For example, in response to efforts by the United Kingdom’s Intensive Care Society, the Health Secretary of the UK, Sajid Javid, has called for a review of pulse oximeters as part of a larger review assessing structural issues in health care that lead to worse outcomes in racial and ethnic minorities (BBC News. https://www.bbc.com/news/uk-59363544. Published online Nov. 21, 2021).

Device companies are largely for-profit corporations with obligations to their shareholders. It seems that existing incentives are insufficient to motivate investment in less biased technology and real-world evaluations of their devices.

We previously called for buyers of pulse oximeters to change the incentives of device companies – that is, for “hospitals to commit to only purchasing pulse oximeters that have been shown to work equally well in patients of colour.” (Hidalgo DC, et al. Lancet Respir Med. 2021;9[4]:E37). And, indeed, we worry that hospitals are putting themselves at medicolegal risk by not raising their purchasing standards. Since it is now widely known that pulse oximeters are inaccurate in certain patients, could there be liability for hospitals that continue to use devices we know to be disproportionately inaccurate by race?

Dr. Thomas S. Valley


Device companies must commit to fixing racial bias in pulse oximeters. Change is feasible, and pulse oximeters can be redesigned to be accurate and reliable among all patients using existing technology that is decades-old.

In the 1960s and 1970s, Hewlett Packard worked with NASA to noninvasively measure oxygen saturation in astronauts (Moran-Thomas, M. Wired. Published online June 4, 2021. https://www.wired.com/story/pulse-oximeters-equity). The device was designed to work for all skin tones and could be calibrated based on an individual’s skin tone. However, Hewlett Packard moved away from medical devices in the 1980s, shelving their design while other companies took over the oximeter market.

Lastly, as new devices are designed, they must be proven to work for all patients. Testing should be conducted in real-world clinical settings using metrics aligned with clinical care, since we know testing in artificial environments may not generalize to critically ill patients. Testing standards historically used by the FDA, such as only requiring device testing in a small number of non-White individuals, may miss clinically relevant hypoxemia. Non-inferiority studies are particularly susceptible to poor design or under-powering, and rigorous standards are needed from unbiased sources.

While potential solutions are currently being evaluated, the fact remains that the inaccuracy of pulse oximeters has been known for decades without any meaningful action taken to correct the problem.

As Valeria Valbuena, author of a study demonstrating inaccuracy of pulse oximetry in patients about to undergo ECMO, points out, “Using White patients as the standard in biomedical design has led to both differential care and innovation inertia for optimizing the way devices and algorithms work for patients of racial and ethnic minoritized groups” (Valbuena VS. JAMA Intern Med. 2022. doi: 10.1001/jamainternmed.2022.1903).

We know that hypoxemia is dangerous for our patients and that this is only one example of the long-standing systemic racism leading to harm in historically marginalized racial and ethnic groups. It is unacceptable that the devices we rely on to care for our patients are disproportionately inaccurate in non-White patients.

We hope that with increased awareness of this problem, meaningful action will be taken by device companies to ensure pulse oximeters work accurately for all patients.




From the Division of Pulmonary and Critical Care, Department of Medicine and the Center for Bioethics and Social Sciences in Medicine, University of Michigan Medical School (Drs. Harlan and Valley), and the Institute for Healthcare Policy and Innovation (Dr. Valley), University of Michigan, Ann Arbor, MI; and the Division of Pulmonary Sciences and Critical Care Medicine, University of Colorado School of Medicine, Aurora, CO (Dr. Colon Hidalgo).

Pulse oximetry is a vital monitoring tool in the ICU and in pulmonary medicine. Regrettably, re-emerging data show that pulse oximeters do not accurately measure blood oxygen levels in Black patients, presumably due to their skin tone. Patients with darker skin are, therefore, more likely to experience occult hypoxemia (i.e., low arterial oxygen saturation despite a seemingly normal pulse oximetry reading). While inaccuracy of pulse oximeter measurements in patients with darker skin has been recognized for decades, recent studies have highlighted this as an ongoing problem with potentially severe consequences for Black patients and other patients of color.

One recent study found that Black patients had almost three times the likelihood of occult hypoxemia compared with White patients (Sjoding, MW, et al. N Engl J Med. 2020;383[25]:2477-8).

Subsequent studies have confirmed this to be a widespread problem across various clinical settings in hundreds of hospitals (Wong AI, et al. JAMA Netw Open. 2021;4[11]:e2131674; Valbuena VS, et al. Chest. 2022;161[4]:971-8). A recent retrospective cohort study of patients with COVID-19 found that occult hypoxemia in Black and Hispanic patients was associated with delayed eligibility for potentially lifesaving COVID-19 therapies (Fawzy AF, et al. JAMA Intern Med. 2022; published online May 31, 2022).

Dr. Emily A. Harlan, University of Michigan, Ann Arbor
Dr. Emily A. Harlan

Now that numerous studies have demonstrated the inaccuracy of pulse oximetry with the potential to cause harm to historically marginalized racial and ethnic groups, must we abandon the use of pulse oximetry? We would argue that pulse oximeters remain valuable tools, but for now, we must adapt our practice until better devices are widely adopted.

First, it is crucial that health professionals are aware that pulse oximeters may underestimate the true extent of hypoxemia for all patients, but particularly for patients with darker skin. Acknowledging this device flaw is essential to avoid harm to our patients.

Second, clinicians must have heightened skepticism for seemingly normal pulse oximetry values when caring for symptomatic patients at risk of occult hypoxemia.

Until better pulse oximeters are widely available, clinicians must consider workarounds aimed at ensuring timely identification of hypoxemia in Black patients and other patients of color.

These patients may need invasive monitoring of arterial oxygenation, including arterial blood gas checks or an arterial catheter. However, invasive monitoring comes at the cost of discomfort to patients and potential complications, such as vessel or nerve damage.

Dr. Daniel Colon Hidalgo, University of Colorado, Aurora
Dr. Daniel Colon Hidalgo

Invasive monitoring of patients at risk for occult hypoxemia is not an equitable or acceptable long-term solution for this problem. As advocates for patients, clinicians and professional organizations should lobby regulatory bodies to ensure pulse oximeters are accurate for all patients.

We must also call on government leaders to move this process forward. For example, in response to efforts by the United Kingdom’s Intensive Care Society, the Health Secretary of the UK, Sajid Javid, has called for a review of pulse oximeters as part of a larger review assessing structural issues in health care that lead to worse outcomes in racial and ethnic minorities (BBC News. https://www.bbc.com/news/uk-59363544. Published online Nov. 21, 2021).

Device companies are largely for-profit corporations with obligations to their shareholders. It seems that existing incentives are insufficient to motivate investment in less biased technology and real-world evaluations of their devices.

We previously called for buyers of pulse oximeters to change the incentives of device companies – that is, for “hospitals to commit to only purchasing pulse oximeters that have been shown to work equally well in patients of colour.” (Hidalgo DC, et al. Lancet Respir Med. 2021;9[4]:E37). And, indeed, we worry that hospitals are putting themselves at medicolegal risk by not raising their purchasing standards. Since it is now widely known that pulse oximeters are inaccurate in certain patients, could there be liability for hospitals that continue to use devices we know to be disproportionately inaccurate by race?

Dr. Thomas S. Valley


Device companies must commit to fixing racial bias in pulse oximeters. Change is feasible, and pulse oximeters can be redesigned to be accurate and reliable among all patients using existing technology that is decades-old.

In the 1960s and 1970s, Hewlett Packard worked with NASA to noninvasively measure oxygen saturation in astronauts (Moran-Thomas, M. Wired. Published online June 4, 2021. https://www.wired.com/story/pulse-oximeters-equity). The device was designed to work for all skin tones and could be calibrated based on an individual’s skin tone. However, Hewlett Packard moved away from medical devices in the 1980s, shelving their design while other companies took over the oximeter market.

Lastly, as new devices are designed, they must be proven to work for all patients. Testing should be conducted in real-world clinical settings using metrics aligned with clinical care, since we know testing in artificial environments may not generalize to critically ill patients. Testing standards historically used by the FDA, such as only requiring device testing in a small number of non-White individuals, may miss clinically relevant hypoxemia. Non-inferiority studies are particularly susceptible to poor design or under-powering, and rigorous standards are needed from unbiased sources.

While potential solutions are currently being evaluated, the fact remains that the inaccuracy of pulse oximeters has been known for decades without any meaningful action taken to correct the problem.

As Valeria Valbuena, author of a study demonstrating inaccuracy of pulse oximetry in patients about to undergo ECMO, points out, “Using White patients as the standard in biomedical design has led to both differential care and innovation inertia for optimizing the way devices and algorithms work for patients of racial and ethnic minoritized groups” (Valbuena VS. JAMA Intern Med. 2022. doi: 10.1001/jamainternmed.2022.1903).

We know that hypoxemia is dangerous for our patients and that this is only one example of the long-standing systemic racism leading to harm in historically marginalized racial and ethnic groups. It is unacceptable that the devices we rely on to care for our patients are disproportionately inaccurate in non-White patients.

We hope that with increased awareness of this problem, meaningful action will be taken by device companies to ensure pulse oximeters work accurately for all patients.




From the Division of Pulmonary and Critical Care, Department of Medicine and the Center for Bioethics and Social Sciences in Medicine, University of Michigan Medical School (Drs. Harlan and Valley), and the Institute for Healthcare Policy and Innovation (Dr. Valley), University of Michigan, Ann Arbor, MI; and the Division of Pulmonary Sciences and Critical Care Medicine, University of Colorado School of Medicine, Aurora, CO (Dr. Colon Hidalgo).

Publications
Publications
Topics
Article Type
Sections
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Use ProPublica
Hide sidebar & use full width
render the right sidebar.
Conference Recap Checkbox
Not Conference Recap
Clinical Edge
Display the Slideshow in this Article
Medscape Article
Display survey writer
Reuters content
Disable Inline Native ads
WebMD Article

Appraising the Evidence Supporting Choosing Wisely® Recommendations

Article Type
Changed
Fri, 10/04/2019 - 16:31

As healthcare costs rise, physicians and other stakeholders are now seeking innovative and effective ways to reduce the provision of low-value services.1,2 The Choosing Wisely® campaign aims to further this goal by promoting lists of specific procedures, tests, and treatments that providers should avoid in selected clinical settings.3 On February 21, 2013, the Society of Hospital Medicine (SHM) released 2 Choosing Wisely® lists consisting of adult and pediatric services that are seen as costly to consumers and to the healthcare system, but which are often nonbeneficial or even harmful.4,5 A total of 80 physician and nurse specialty societies have joined in submitting additional lists.

Despite the growing enthusiasm for this effort, questions remain regarding the Choosing Wisely® campaign’s ability to initiate the meaningful de-adoption of low-value services. Specifically, prior efforts to reduce the use of services deemed to be of questionable benefit have met several challenges.2,6 Early analyses of the Choosing Wisely® recommendations reveal similar roadblocks and variable uptakes of several recommendations.7-10 While the reasons for difficulties in achieving de-adoption are broad, one important factor in whether clinicians are willing to follow guideline recommendations from such initiatives as Choosing Wisely®is the extent to which they believe in the underlying evidence.11 The current work seeks to formally evaluate the evidence supporting the Choosing Wisely® recommendations, and to compare the quality of evidence supporting SHM lists to other published Choosing Wisely® lists.

METHODS

Data Sources

Using the online listing of published Choosing Wisely® recommendations, a dataset was generated incorporating all 320 recommendations comprising the 58 lists published through August, 2014; these include both the adult and pediatric hospital medicine lists released by the SHM.4,5,12 Although data collection ended at this point, this represents a majority of all 81 lists and 535 recommendations published through December, 2017. The reviewers (A.J.A., A.G., M.W., T.S.V., M.S., and C.R.C) extracted information about the references cited for each recommendation.

Data Analysis

The reviewers obtained each reference cited by a Choosing Wisely® recommendation and categorized it by evidence strength along the following hierarchy: clinical practice guideline (CPG), primary research, review article, expert opinion, book, or others/unknown. CPGs were used as the highest level of evidence based on standard expectations for methodological rigor.13 Primary research was further rated as follows: systematic reviews and meta-analyses, randomized controlled trials (RCTs), observational studies, and case series. Each recommendation was graded using only the strongest piece of evidence cited.

Guideline Appraisal

We further sought to evaluate the strength of referenced CPGs. To accomplish this, a 10% random sample of the Choosing Wisely® recommendations citing CPGs was selected, and the referenced CPGs were obtained. Separately, CPGs referenced by the SHM-published adult and pediatric lists were also obtained. For both groups, one CPG was randomly selected when a recommendation cited more than one CPG. These guidelines were assessed using the Appraisal of Guidelines for Research and Evaluation (AGREE) II instrument, a widely used instrument designed to assess CPG quality.14,15 AGREE II consists of 25 questions categorized into 6 domains: scope and purpose, stakeholder involvement, rigor of development, clarity of presentation, applicability, and editorial independence. Guidelines are also assigned an overall score. Two trained reviewers (A.J.A. and A.G.) assessed each of the sampled CPGs using a standardized form. Scores were then standardized using the method recommended by the instrument and reported as a percentage of available points. Although a standard interpretation of scores is not provided by the instrument, prior applications deemed scores below 50% as deficient16,17. When a recommendation item cited multiple CPGs, one was randomly selected. We also abstracted data on the year of publication, the evidence grade assigned to specific items recommended by Choosing Wisely®, and whether the CPG addressed the referring recommendation. All data management and analysis were conducted using Stata (V14.2, StataCorp, College Station, Texas).

 

 

RESULTS

A total of 320 recommendations were considered in our analysis, including 10 published across the 2 hospital medicine lists. When limited to the highest quality citation for each of the recommendations, 225 (70.3%) cited CPGs, whereas 71 (22.2%) cited primary research articles (Table 1). Specifically, 29 (9.1%) cited systematic reviews and meta-analyses, 28 (8.8%) cited observational studies, and 13 (4.1%) cited RCTs. One recommendation (0.3%) cited a case series as its highest level of evidence, 7 (2.2%) cited review articles, 7 (2.2%) cited editorials or opinion pieces, and 10 (3.1%) cited other types of documents, such as websites or books. Among hospital medicine recommendations, 9 (90%) referenced CPGs and 1 (10%) cited an observational study.

For the AGREE II assessment, we included 23 CPGs from the 225 referenced across all recommendations, after which we separately selected 6 CPGs from the hospital medicine recommendations. There was no overlap. Notably, 4 hospital medicine recommendations referenced a common CPG. Among the random sample of referenced CPGs, the median overall score obtained by using AGREE II was 54.2% (IQR 33.3%-70.8%, Table 2). This was similar to the median overall among hospital medicine guidelines (58.2%, IQR 50.0%-83.3%). Both hospital medicine and other sampled guidelines tended to score poorly in stakeholder involvement (48.6%, IQR 44.1%-61.1% and 47.2%, IQR 38.9%-61.1%, respectively). There were no significant differences between hospital medicine-referenced CPGs and the larger sample of CPGs in any AGREE II subdomains. The median age from the CPG publication to the list publication was 7 years (IQR 4–7) for hospital medicine recommendations and 3 years (IQR 2–6) for the nonhospital medicine recommendations. Substantial agreement was found between raters on the overall guideline assessment (ICC 0.80, 95% CI 0.58-0.91; Supplementary Table 1).



In terms of recommendation strengths and evidence grades, several recommendations were backed by Grades II–III (on a scale of I-III) evidence and level C (on a scale of A–C) recommendations in the reviewed CPG (Society of Maternal-Fetal Medicine, Recommendation 4, and Heart Rhythm Society, Recommendation 1). In one other case, the cited CPG did not directly address the Choosing Wisely® item (Society of Vascular Medicine, Recommendation 2).

DISCUSSION

Given the rising costs and the potential for iatrogenic harm, curbing ineffective practices has become an urgent concern. To achieve this, the Choosing Wisely® campaign has taken an important step by targeting certain low-value practices for de-adoption. However, the evidence supporting recommendations is variable. Specifically, 25 recommendations cited case series, review articles, or lower quality evidence as their highest level of support; moreover, among recommendations citing CPGs, quality, timeliness, and support for the recommendation item were variable. Although the hospital medicine lists tended to cite higher-quality evidence in the form of CPGs, these CPGs were often less recent than the guidelines referenced by other lists.

Our findings parallel those of other works that evaluate evidence among Choosing Wisely® recommendations and, more broadly, among CPGs.18–21 Lin and Yancey evaluated the quality of primary care-focused Choosing Wisely® recommendations using the Strength of Recommendation Taxonomy, a ranking system that evaluates evidence quality, consistency, and patient-centeredness.18 In their analysis, the authors found that many recommendations were based on lower quality evidence or relied on nonpatent-centered intermediate outcomes. Several groups, meanwhile, have evaluated the quality of evidence supporting CPG recommendations, finding them to be highly variable as well.19–21 These findings likely reflect inherent difficulties in the process, by which guideline development groups distill a broad evidence base into useful clinical recommendations, a reality that may have influenced the Choosing Wisely® list development groups seeking to make similar recommendations on low-value services.

These data should be taken in context due to several limitations. First, our sample of referenced CPGs includes only a small sample of all CPGs cited; thus, it may not be representative of all referenced guidelines. Second, the AGREE II assessment is inherently subjective, despite the availability of training materials. Third, data collection ended in April, 2014. Although this represents a majority of published lists to date, it is possible that more recent Choosing Wisely®lists include a stronger focus on evidence quality. Finally, references cited by Choosing Wisely®may not be representative of the entirety of the dataset that was considered when formulating the recommendations.

Despite these limitations, our findings suggest that Choosing Wisely®recommendations vary in terms of evidence strength. Although our results reveal that the majority of recommendations cite guidelines or high-quality original research, evidence gaps remain, with a small number citing low-quality evidence or low-quality CPGs as their highest form of support. Given the barriers to the successful de-implementation of low-value services, such campaigns as Choosing Wisely®face an uphill battle in their attempt to prompt behavior changes among providers and consumers.6-9 As a result, it is incumbent on funding agencies and medical journals to promote studies evaluating the harms and overall value of the care we deliver.

 

 

CONCLUSIONS

Although a majority of Choosing Wisely® recommendations cite high-quality evidence, some reference low-quality evidence or low-quality CPGs as their highest form of support. To overcome clinical inertia and other barriers to the successful de-implementation of low-value services, a clear rationale for the impetus to eradicate entrenched practices is critical.2,22 Choosing Wisely® has provided visionary leadership and a powerful platform to question low-value care. To expand the campaign’s efforts, the medical field must be able to generate the high-quality evidence necessary to support these efforts; further, list development groups must consider the availability of strong evidence when targeting services for de-implementation.

ACKNOWLEDGMENT

This work was supported, in part, by a grant from the Agency for Healthcare Research and Quality (No. K08HS020672, Dr. Cooke).

Disclosures

The authors have nothing to disclose.

Files
References

1. Institute of Medicine Roundtable on Evidence-Based Medicine. The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary. Yong P, Saudners R, Olsen L, editors. Washington, D.C.: National Academies Press; 2010. PubMed
2. Weinberger SE. Providing high-value, cost-conscious care: a critical seventh general competency for physicians. Ann Intern Med. 2011;155(6):386-388. PubMed
3. Cassel CK, Guest JA. Choosing wisely: Helping physicians and patients make smart decisions about their care. JAMA. 2012;307(17):1801-1802. PubMed
4. Bulger J, Nickel W, Messler J, Goldstein J, O’Callaghan J, Auron M, et al. Choosing wisely in adult hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):486-492. PubMed
5. Quinonez RA, Garber MD, Schroeder AR, Alverson BK, Nickel W, Goldstein J, et al. Choosing wisely in pediatric hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):479-485. PubMed
6. Prasad V, Ioannidis JP. Evidence-based de-implementation for contradicted, unproven, and aspiring healthcare practices. Implement Sci. 2014;9:1. PubMed
7. Rosenberg A, Agiro A, Gottlieb M, Barron J, Brady P, Liu Y, et al. Early trends among seven recommendations from the Choosing Wisely campaign. JAMA Intern Med. 2015;175(12):1913-1920. PubMed
8. Zikmund-Fisher BJ, Kullgren JT, Fagerlin A, Klamerus ML, Bernstein SJ, Kerr EA. Perceived barriers to implementing individual Choosing Wisely® recommendations in two national surveys of primary care providers. J Gen Intern Med. 2017;32(2):210-217. PubMed
9. Bishop TF, Cea M, Miranda Y, Kim R, Lash-Dardia M, Lee JI, et al. Academic physicians’ views on low-value services and the choosing wisely campaign: A qualitative study. Healthc (Amsterdam, Netherlands). 2017;5(1-2):17-22. PubMed
10. Prochaska MT, Hohmann SF, Modes M, Arora VM. Trends in Troponin-only testing for AMI in academic teaching hospitals and the impact of Choosing Wisely®. J Hosp Med. 2017;12(12):957-962. PubMed
11. Cabana MD, Rand CS, Powe NR, Wu AW, Wilson MH, Abboud PA, et al. Why don’t physicians follow clinical practice guidelines? A framework for improvement. JAMA. 1999;282(15):1458-1465. PubMed
12. ABIM Foundation. ChoosingWisely.org Search Recommendations. 2014. 
13. Institute of Medicine (US) Committee on Standards for Developing Trustworthy Clinical Practice Guidelines. Clinical Practice Guidelines We Can Trust. Graham R, Mancher M, Miller Wolman D, Greenfield S, Steinberg E, editors. Washington, D.C.: National Academies Press; 2011. PubMed
14. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. AGREE II: Advancing guideline development, reporting, and evaluation in health care. Prev Med (Baltim). 2010;51(5):421-424. PubMed
15. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. Development of the AGREE II, part 2: Assessment of validity of items and tools to support application. CMAJ. 2010;182(10):E472-E478. PubMed
16. He Z, Tian H, Song A, Jin L, Zhou X, Liu X, et al. Quality appraisal of clinical practice guidelines on pancreatic cancer. Medicine (Baltimore). 2015;94(12):e635. PubMed
17. Isaac A, Saginur M, Hartling L, Robinson JL. Quality of reporting and evidence in American Academy of Pediatrics guidelines. Pediatrics. 2013;131(4):732-738. PubMed
18. Lin KW, Yancey JR. Evaluating the Evidence for Choosing WiselyTM in Primary Care Using the Strength of Recommendation Taxonomy (SORT). J Am Board Fam Med. 2016;29(4):512-515. PubMed
19. McAlister FA, van Diepen S, Padwal RS, Johnson JA, Majumdar SR. How evidence-based are the recommendations in evidence-based guidelines? PLoS Med. 2007;4(8):e250. PubMed
20. Tricoci P, Allen JM, Kramer JM, Califf RM, Smith SC. Scientific evidence underlying the ACC/AHA clinical practice guidelines. JAMA. 2009;301(8):831-841. PubMed
21. Feuerstein JD, Gifford AE, Akbari M, Goldman J, Leffler DA, Sheth SG, et al. Systematic analysis underlying the quality of the scientific evidence and conflicts of interest in gastroenterology practice guidelines. Am J Gastroenterol. 2013;108(11):1686-1693. PubMed
22. Robert G, Harlock J, Williams I. Disentangling rhetoric and reality: an international Delphi study of factors and processes that facilitate the successful implementation of decisions to decommission healthcare services. Implement Sci. 2014;9:123. PubMed

Article PDF
Issue
Journal of Hospital Medicine 13(10)
Publications
Topics
Page Number
688-691. Published online first April 25, 2018
Sections
Files
Files
Article PDF
Article PDF
Related Articles

As healthcare costs rise, physicians and other stakeholders are now seeking innovative and effective ways to reduce the provision of low-value services.1,2 The Choosing Wisely® campaign aims to further this goal by promoting lists of specific procedures, tests, and treatments that providers should avoid in selected clinical settings.3 On February 21, 2013, the Society of Hospital Medicine (SHM) released 2 Choosing Wisely® lists consisting of adult and pediatric services that are seen as costly to consumers and to the healthcare system, but which are often nonbeneficial or even harmful.4,5 A total of 80 physician and nurse specialty societies have joined in submitting additional lists.

Despite the growing enthusiasm for this effort, questions remain regarding the Choosing Wisely® campaign’s ability to initiate the meaningful de-adoption of low-value services. Specifically, prior efforts to reduce the use of services deemed to be of questionable benefit have met several challenges.2,6 Early analyses of the Choosing Wisely® recommendations reveal similar roadblocks and variable uptakes of several recommendations.7-10 While the reasons for difficulties in achieving de-adoption are broad, one important factor in whether clinicians are willing to follow guideline recommendations from such initiatives as Choosing Wisely®is the extent to which they believe in the underlying evidence.11 The current work seeks to formally evaluate the evidence supporting the Choosing Wisely® recommendations, and to compare the quality of evidence supporting SHM lists to other published Choosing Wisely® lists.

METHODS

Data Sources

Using the online listing of published Choosing Wisely® recommendations, a dataset was generated incorporating all 320 recommendations comprising the 58 lists published through August, 2014; these include both the adult and pediatric hospital medicine lists released by the SHM.4,5,12 Although data collection ended at this point, this represents a majority of all 81 lists and 535 recommendations published through December, 2017. The reviewers (A.J.A., A.G., M.W., T.S.V., M.S., and C.R.C) extracted information about the references cited for each recommendation.

Data Analysis

The reviewers obtained each reference cited by a Choosing Wisely® recommendation and categorized it by evidence strength along the following hierarchy: clinical practice guideline (CPG), primary research, review article, expert opinion, book, or others/unknown. CPGs were used as the highest level of evidence based on standard expectations for methodological rigor.13 Primary research was further rated as follows: systematic reviews and meta-analyses, randomized controlled trials (RCTs), observational studies, and case series. Each recommendation was graded using only the strongest piece of evidence cited.

Guideline Appraisal

We further sought to evaluate the strength of referenced CPGs. To accomplish this, a 10% random sample of the Choosing Wisely® recommendations citing CPGs was selected, and the referenced CPGs were obtained. Separately, CPGs referenced by the SHM-published adult and pediatric lists were also obtained. For both groups, one CPG was randomly selected when a recommendation cited more than one CPG. These guidelines were assessed using the Appraisal of Guidelines for Research and Evaluation (AGREE) II instrument, a widely used instrument designed to assess CPG quality.14,15 AGREE II consists of 25 questions categorized into 6 domains: scope and purpose, stakeholder involvement, rigor of development, clarity of presentation, applicability, and editorial independence. Guidelines are also assigned an overall score. Two trained reviewers (A.J.A. and A.G.) assessed each of the sampled CPGs using a standardized form. Scores were then standardized using the method recommended by the instrument and reported as a percentage of available points. Although a standard interpretation of scores is not provided by the instrument, prior applications deemed scores below 50% as deficient16,17. When a recommendation item cited multiple CPGs, one was randomly selected. We also abstracted data on the year of publication, the evidence grade assigned to specific items recommended by Choosing Wisely®, and whether the CPG addressed the referring recommendation. All data management and analysis were conducted using Stata (V14.2, StataCorp, College Station, Texas).

 

 

RESULTS

A total of 320 recommendations were considered in our analysis, including 10 published across the 2 hospital medicine lists. When limited to the highest quality citation for each of the recommendations, 225 (70.3%) cited CPGs, whereas 71 (22.2%) cited primary research articles (Table 1). Specifically, 29 (9.1%) cited systematic reviews and meta-analyses, 28 (8.8%) cited observational studies, and 13 (4.1%) cited RCTs. One recommendation (0.3%) cited a case series as its highest level of evidence, 7 (2.2%) cited review articles, 7 (2.2%) cited editorials or opinion pieces, and 10 (3.1%) cited other types of documents, such as websites or books. Among hospital medicine recommendations, 9 (90%) referenced CPGs and 1 (10%) cited an observational study.

For the AGREE II assessment, we included 23 CPGs from the 225 referenced across all recommendations, after which we separately selected 6 CPGs from the hospital medicine recommendations. There was no overlap. Notably, 4 hospital medicine recommendations referenced a common CPG. Among the random sample of referenced CPGs, the median overall score obtained by using AGREE II was 54.2% (IQR 33.3%-70.8%, Table 2). This was similar to the median overall among hospital medicine guidelines (58.2%, IQR 50.0%-83.3%). Both hospital medicine and other sampled guidelines tended to score poorly in stakeholder involvement (48.6%, IQR 44.1%-61.1% and 47.2%, IQR 38.9%-61.1%, respectively). There were no significant differences between hospital medicine-referenced CPGs and the larger sample of CPGs in any AGREE II subdomains. The median age from the CPG publication to the list publication was 7 years (IQR 4–7) for hospital medicine recommendations and 3 years (IQR 2–6) for the nonhospital medicine recommendations. Substantial agreement was found between raters on the overall guideline assessment (ICC 0.80, 95% CI 0.58-0.91; Supplementary Table 1).



In terms of recommendation strengths and evidence grades, several recommendations were backed by Grades II–III (on a scale of I-III) evidence and level C (on a scale of A–C) recommendations in the reviewed CPG (Society of Maternal-Fetal Medicine, Recommendation 4, and Heart Rhythm Society, Recommendation 1). In one other case, the cited CPG did not directly address the Choosing Wisely® item (Society of Vascular Medicine, Recommendation 2).

DISCUSSION

Given the rising costs and the potential for iatrogenic harm, curbing ineffective practices has become an urgent concern. To achieve this, the Choosing Wisely® campaign has taken an important step by targeting certain low-value practices for de-adoption. However, the evidence supporting recommendations is variable. Specifically, 25 recommendations cited case series, review articles, or lower quality evidence as their highest level of support; moreover, among recommendations citing CPGs, quality, timeliness, and support for the recommendation item were variable. Although the hospital medicine lists tended to cite higher-quality evidence in the form of CPGs, these CPGs were often less recent than the guidelines referenced by other lists.

Our findings parallel those of other works that evaluate evidence among Choosing Wisely® recommendations and, more broadly, among CPGs.18–21 Lin and Yancey evaluated the quality of primary care-focused Choosing Wisely® recommendations using the Strength of Recommendation Taxonomy, a ranking system that evaluates evidence quality, consistency, and patient-centeredness.18 In their analysis, the authors found that many recommendations were based on lower quality evidence or relied on nonpatent-centered intermediate outcomes. Several groups, meanwhile, have evaluated the quality of evidence supporting CPG recommendations, finding them to be highly variable as well.19–21 These findings likely reflect inherent difficulties in the process, by which guideline development groups distill a broad evidence base into useful clinical recommendations, a reality that may have influenced the Choosing Wisely® list development groups seeking to make similar recommendations on low-value services.

These data should be taken in context due to several limitations. First, our sample of referenced CPGs includes only a small sample of all CPGs cited; thus, it may not be representative of all referenced guidelines. Second, the AGREE II assessment is inherently subjective, despite the availability of training materials. Third, data collection ended in April, 2014. Although this represents a majority of published lists to date, it is possible that more recent Choosing Wisely®lists include a stronger focus on evidence quality. Finally, references cited by Choosing Wisely®may not be representative of the entirety of the dataset that was considered when formulating the recommendations.

Despite these limitations, our findings suggest that Choosing Wisely®recommendations vary in terms of evidence strength. Although our results reveal that the majority of recommendations cite guidelines or high-quality original research, evidence gaps remain, with a small number citing low-quality evidence or low-quality CPGs as their highest form of support. Given the barriers to the successful de-implementation of low-value services, such campaigns as Choosing Wisely®face an uphill battle in their attempt to prompt behavior changes among providers and consumers.6-9 As a result, it is incumbent on funding agencies and medical journals to promote studies evaluating the harms and overall value of the care we deliver.

 

 

CONCLUSIONS

Although a majority of Choosing Wisely® recommendations cite high-quality evidence, some reference low-quality evidence or low-quality CPGs as their highest form of support. To overcome clinical inertia and other barriers to the successful de-implementation of low-value services, a clear rationale for the impetus to eradicate entrenched practices is critical.2,22 Choosing Wisely® has provided visionary leadership and a powerful platform to question low-value care. To expand the campaign’s efforts, the medical field must be able to generate the high-quality evidence necessary to support these efforts; further, list development groups must consider the availability of strong evidence when targeting services for de-implementation.

ACKNOWLEDGMENT

This work was supported, in part, by a grant from the Agency for Healthcare Research and Quality (No. K08HS020672, Dr. Cooke).

Disclosures

The authors have nothing to disclose.

As healthcare costs rise, physicians and other stakeholders are now seeking innovative and effective ways to reduce the provision of low-value services.1,2 The Choosing Wisely® campaign aims to further this goal by promoting lists of specific procedures, tests, and treatments that providers should avoid in selected clinical settings.3 On February 21, 2013, the Society of Hospital Medicine (SHM) released 2 Choosing Wisely® lists consisting of adult and pediatric services that are seen as costly to consumers and to the healthcare system, but which are often nonbeneficial or even harmful.4,5 A total of 80 physician and nurse specialty societies have joined in submitting additional lists.

Despite the growing enthusiasm for this effort, questions remain regarding the Choosing Wisely® campaign’s ability to initiate the meaningful de-adoption of low-value services. Specifically, prior efforts to reduce the use of services deemed to be of questionable benefit have met several challenges.2,6 Early analyses of the Choosing Wisely® recommendations reveal similar roadblocks and variable uptakes of several recommendations.7-10 While the reasons for difficulties in achieving de-adoption are broad, one important factor in whether clinicians are willing to follow guideline recommendations from such initiatives as Choosing Wisely®is the extent to which they believe in the underlying evidence.11 The current work seeks to formally evaluate the evidence supporting the Choosing Wisely® recommendations, and to compare the quality of evidence supporting SHM lists to other published Choosing Wisely® lists.

METHODS

Data Sources

Using the online listing of published Choosing Wisely® recommendations, a dataset was generated incorporating all 320 recommendations comprising the 58 lists published through August, 2014; these include both the adult and pediatric hospital medicine lists released by the SHM.4,5,12 Although data collection ended at this point, this represents a majority of all 81 lists and 535 recommendations published through December, 2017. The reviewers (A.J.A., A.G., M.W., T.S.V., M.S., and C.R.C) extracted information about the references cited for each recommendation.

Data Analysis

The reviewers obtained each reference cited by a Choosing Wisely® recommendation and categorized it by evidence strength along the following hierarchy: clinical practice guideline (CPG), primary research, review article, expert opinion, book, or others/unknown. CPGs were used as the highest level of evidence based on standard expectations for methodological rigor.13 Primary research was further rated as follows: systematic reviews and meta-analyses, randomized controlled trials (RCTs), observational studies, and case series. Each recommendation was graded using only the strongest piece of evidence cited.

Guideline Appraisal

We further sought to evaluate the strength of referenced CPGs. To accomplish this, a 10% random sample of the Choosing Wisely® recommendations citing CPGs was selected, and the referenced CPGs were obtained. Separately, CPGs referenced by the SHM-published adult and pediatric lists were also obtained. For both groups, one CPG was randomly selected when a recommendation cited more than one CPG. These guidelines were assessed using the Appraisal of Guidelines for Research and Evaluation (AGREE) II instrument, a widely used instrument designed to assess CPG quality.14,15 AGREE II consists of 25 questions categorized into 6 domains: scope and purpose, stakeholder involvement, rigor of development, clarity of presentation, applicability, and editorial independence. Guidelines are also assigned an overall score. Two trained reviewers (A.J.A. and A.G.) assessed each of the sampled CPGs using a standardized form. Scores were then standardized using the method recommended by the instrument and reported as a percentage of available points. Although a standard interpretation of scores is not provided by the instrument, prior applications deemed scores below 50% as deficient16,17. When a recommendation item cited multiple CPGs, one was randomly selected. We also abstracted data on the year of publication, the evidence grade assigned to specific items recommended by Choosing Wisely®, and whether the CPG addressed the referring recommendation. All data management and analysis were conducted using Stata (V14.2, StataCorp, College Station, Texas).

 

 

RESULTS

A total of 320 recommendations were considered in our analysis, including 10 published across the 2 hospital medicine lists. When limited to the highest quality citation for each of the recommendations, 225 (70.3%) cited CPGs, whereas 71 (22.2%) cited primary research articles (Table 1). Specifically, 29 (9.1%) cited systematic reviews and meta-analyses, 28 (8.8%) cited observational studies, and 13 (4.1%) cited RCTs. One recommendation (0.3%) cited a case series as its highest level of evidence, 7 (2.2%) cited review articles, 7 (2.2%) cited editorials or opinion pieces, and 10 (3.1%) cited other types of documents, such as websites or books. Among hospital medicine recommendations, 9 (90%) referenced CPGs and 1 (10%) cited an observational study.

For the AGREE II assessment, we included 23 CPGs from the 225 referenced across all recommendations, after which we separately selected 6 CPGs from the hospital medicine recommendations. There was no overlap. Notably, 4 hospital medicine recommendations referenced a common CPG. Among the random sample of referenced CPGs, the median overall score obtained by using AGREE II was 54.2% (IQR 33.3%-70.8%, Table 2). This was similar to the median overall among hospital medicine guidelines (58.2%, IQR 50.0%-83.3%). Both hospital medicine and other sampled guidelines tended to score poorly in stakeholder involvement (48.6%, IQR 44.1%-61.1% and 47.2%, IQR 38.9%-61.1%, respectively). There were no significant differences between hospital medicine-referenced CPGs and the larger sample of CPGs in any AGREE II subdomains. The median age from the CPG publication to the list publication was 7 years (IQR 4–7) for hospital medicine recommendations and 3 years (IQR 2–6) for the nonhospital medicine recommendations. Substantial agreement was found between raters on the overall guideline assessment (ICC 0.80, 95% CI 0.58-0.91; Supplementary Table 1).



In terms of recommendation strengths and evidence grades, several recommendations were backed by Grades II–III (on a scale of I-III) evidence and level C (on a scale of A–C) recommendations in the reviewed CPG (Society of Maternal-Fetal Medicine, Recommendation 4, and Heart Rhythm Society, Recommendation 1). In one other case, the cited CPG did not directly address the Choosing Wisely® item (Society of Vascular Medicine, Recommendation 2).

DISCUSSION

Given the rising costs and the potential for iatrogenic harm, curbing ineffective practices has become an urgent concern. To achieve this, the Choosing Wisely® campaign has taken an important step by targeting certain low-value practices for de-adoption. However, the evidence supporting recommendations is variable. Specifically, 25 recommendations cited case series, review articles, or lower quality evidence as their highest level of support; moreover, among recommendations citing CPGs, quality, timeliness, and support for the recommendation item were variable. Although the hospital medicine lists tended to cite higher-quality evidence in the form of CPGs, these CPGs were often less recent than the guidelines referenced by other lists.

Our findings parallel those of other works that evaluate evidence among Choosing Wisely® recommendations and, more broadly, among CPGs.18–21 Lin and Yancey evaluated the quality of primary care-focused Choosing Wisely® recommendations using the Strength of Recommendation Taxonomy, a ranking system that evaluates evidence quality, consistency, and patient-centeredness.18 In their analysis, the authors found that many recommendations were based on lower quality evidence or relied on nonpatent-centered intermediate outcomes. Several groups, meanwhile, have evaluated the quality of evidence supporting CPG recommendations, finding them to be highly variable as well.19–21 These findings likely reflect inherent difficulties in the process, by which guideline development groups distill a broad evidence base into useful clinical recommendations, a reality that may have influenced the Choosing Wisely® list development groups seeking to make similar recommendations on low-value services.

These data should be taken in context due to several limitations. First, our sample of referenced CPGs includes only a small sample of all CPGs cited; thus, it may not be representative of all referenced guidelines. Second, the AGREE II assessment is inherently subjective, despite the availability of training materials. Third, data collection ended in April, 2014. Although this represents a majority of published lists to date, it is possible that more recent Choosing Wisely®lists include a stronger focus on evidence quality. Finally, references cited by Choosing Wisely®may not be representative of the entirety of the dataset that was considered when formulating the recommendations.

Despite these limitations, our findings suggest that Choosing Wisely®recommendations vary in terms of evidence strength. Although our results reveal that the majority of recommendations cite guidelines or high-quality original research, evidence gaps remain, with a small number citing low-quality evidence or low-quality CPGs as their highest form of support. Given the barriers to the successful de-implementation of low-value services, such campaigns as Choosing Wisely®face an uphill battle in their attempt to prompt behavior changes among providers and consumers.6-9 As a result, it is incumbent on funding agencies and medical journals to promote studies evaluating the harms and overall value of the care we deliver.

 

 

CONCLUSIONS

Although a majority of Choosing Wisely® recommendations cite high-quality evidence, some reference low-quality evidence or low-quality CPGs as their highest form of support. To overcome clinical inertia and other barriers to the successful de-implementation of low-value services, a clear rationale for the impetus to eradicate entrenched practices is critical.2,22 Choosing Wisely® has provided visionary leadership and a powerful platform to question low-value care. To expand the campaign’s efforts, the medical field must be able to generate the high-quality evidence necessary to support these efforts; further, list development groups must consider the availability of strong evidence when targeting services for de-implementation.

ACKNOWLEDGMENT

This work was supported, in part, by a grant from the Agency for Healthcare Research and Quality (No. K08HS020672, Dr. Cooke).

Disclosures

The authors have nothing to disclose.

References

1. Institute of Medicine Roundtable on Evidence-Based Medicine. The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary. Yong P, Saudners R, Olsen L, editors. Washington, D.C.: National Academies Press; 2010. PubMed
2. Weinberger SE. Providing high-value, cost-conscious care: a critical seventh general competency for physicians. Ann Intern Med. 2011;155(6):386-388. PubMed
3. Cassel CK, Guest JA. Choosing wisely: Helping physicians and patients make smart decisions about their care. JAMA. 2012;307(17):1801-1802. PubMed
4. Bulger J, Nickel W, Messler J, Goldstein J, O’Callaghan J, Auron M, et al. Choosing wisely in adult hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):486-492. PubMed
5. Quinonez RA, Garber MD, Schroeder AR, Alverson BK, Nickel W, Goldstein J, et al. Choosing wisely in pediatric hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):479-485. PubMed
6. Prasad V, Ioannidis JP. Evidence-based de-implementation for contradicted, unproven, and aspiring healthcare practices. Implement Sci. 2014;9:1. PubMed
7. Rosenberg A, Agiro A, Gottlieb M, Barron J, Brady P, Liu Y, et al. Early trends among seven recommendations from the Choosing Wisely campaign. JAMA Intern Med. 2015;175(12):1913-1920. PubMed
8. Zikmund-Fisher BJ, Kullgren JT, Fagerlin A, Klamerus ML, Bernstein SJ, Kerr EA. Perceived barriers to implementing individual Choosing Wisely® recommendations in two national surveys of primary care providers. J Gen Intern Med. 2017;32(2):210-217. PubMed
9. Bishop TF, Cea M, Miranda Y, Kim R, Lash-Dardia M, Lee JI, et al. Academic physicians’ views on low-value services and the choosing wisely campaign: A qualitative study. Healthc (Amsterdam, Netherlands). 2017;5(1-2):17-22. PubMed
10. Prochaska MT, Hohmann SF, Modes M, Arora VM. Trends in Troponin-only testing for AMI in academic teaching hospitals and the impact of Choosing Wisely®. J Hosp Med. 2017;12(12):957-962. PubMed
11. Cabana MD, Rand CS, Powe NR, Wu AW, Wilson MH, Abboud PA, et al. Why don’t physicians follow clinical practice guidelines? A framework for improvement. JAMA. 1999;282(15):1458-1465. PubMed
12. ABIM Foundation. ChoosingWisely.org Search Recommendations. 2014. 
13. Institute of Medicine (US) Committee on Standards for Developing Trustworthy Clinical Practice Guidelines. Clinical Practice Guidelines We Can Trust. Graham R, Mancher M, Miller Wolman D, Greenfield S, Steinberg E, editors. Washington, D.C.: National Academies Press; 2011. PubMed
14. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. AGREE II: Advancing guideline development, reporting, and evaluation in health care. Prev Med (Baltim). 2010;51(5):421-424. PubMed
15. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. Development of the AGREE II, part 2: Assessment of validity of items and tools to support application. CMAJ. 2010;182(10):E472-E478. PubMed
16. He Z, Tian H, Song A, Jin L, Zhou X, Liu X, et al. Quality appraisal of clinical practice guidelines on pancreatic cancer. Medicine (Baltimore). 2015;94(12):e635. PubMed
17. Isaac A, Saginur M, Hartling L, Robinson JL. Quality of reporting and evidence in American Academy of Pediatrics guidelines. Pediatrics. 2013;131(4):732-738. PubMed
18. Lin KW, Yancey JR. Evaluating the Evidence for Choosing WiselyTM in Primary Care Using the Strength of Recommendation Taxonomy (SORT). J Am Board Fam Med. 2016;29(4):512-515. PubMed
19. McAlister FA, van Diepen S, Padwal RS, Johnson JA, Majumdar SR. How evidence-based are the recommendations in evidence-based guidelines? PLoS Med. 2007;4(8):e250. PubMed
20. Tricoci P, Allen JM, Kramer JM, Califf RM, Smith SC. Scientific evidence underlying the ACC/AHA clinical practice guidelines. JAMA. 2009;301(8):831-841. PubMed
21. Feuerstein JD, Gifford AE, Akbari M, Goldman J, Leffler DA, Sheth SG, et al. Systematic analysis underlying the quality of the scientific evidence and conflicts of interest in gastroenterology practice guidelines. Am J Gastroenterol. 2013;108(11):1686-1693. PubMed
22. Robert G, Harlock J, Williams I. Disentangling rhetoric and reality: an international Delphi study of factors and processes that facilitate the successful implementation of decisions to decommission healthcare services. Implement Sci. 2014;9:123. PubMed

References

1. Institute of Medicine Roundtable on Evidence-Based Medicine. The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary. Yong P, Saudners R, Olsen L, editors. Washington, D.C.: National Academies Press; 2010. PubMed
2. Weinberger SE. Providing high-value, cost-conscious care: a critical seventh general competency for physicians. Ann Intern Med. 2011;155(6):386-388. PubMed
3. Cassel CK, Guest JA. Choosing wisely: Helping physicians and patients make smart decisions about their care. JAMA. 2012;307(17):1801-1802. PubMed
4. Bulger J, Nickel W, Messler J, Goldstein J, O’Callaghan J, Auron M, et al. Choosing wisely in adult hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):486-492. PubMed
5. Quinonez RA, Garber MD, Schroeder AR, Alverson BK, Nickel W, Goldstein J, et al. Choosing wisely in pediatric hospital medicine: Five opportunities for improved healthcare value. J Hosp Med. 2013;8(9):479-485. PubMed
6. Prasad V, Ioannidis JP. Evidence-based de-implementation for contradicted, unproven, and aspiring healthcare practices. Implement Sci. 2014;9:1. PubMed
7. Rosenberg A, Agiro A, Gottlieb M, Barron J, Brady P, Liu Y, et al. Early trends among seven recommendations from the Choosing Wisely campaign. JAMA Intern Med. 2015;175(12):1913-1920. PubMed
8. Zikmund-Fisher BJ, Kullgren JT, Fagerlin A, Klamerus ML, Bernstein SJ, Kerr EA. Perceived barriers to implementing individual Choosing Wisely® recommendations in two national surveys of primary care providers. J Gen Intern Med. 2017;32(2):210-217. PubMed
9. Bishop TF, Cea M, Miranda Y, Kim R, Lash-Dardia M, Lee JI, et al. Academic physicians’ views on low-value services and the choosing wisely campaign: A qualitative study. Healthc (Amsterdam, Netherlands). 2017;5(1-2):17-22. PubMed
10. Prochaska MT, Hohmann SF, Modes M, Arora VM. Trends in Troponin-only testing for AMI in academic teaching hospitals and the impact of Choosing Wisely®. J Hosp Med. 2017;12(12):957-962. PubMed
11. Cabana MD, Rand CS, Powe NR, Wu AW, Wilson MH, Abboud PA, et al. Why don’t physicians follow clinical practice guidelines? A framework for improvement. JAMA. 1999;282(15):1458-1465. PubMed
12. ABIM Foundation. ChoosingWisely.org Search Recommendations. 2014. 
13. Institute of Medicine (US) Committee on Standards for Developing Trustworthy Clinical Practice Guidelines. Clinical Practice Guidelines We Can Trust. Graham R, Mancher M, Miller Wolman D, Greenfield S, Steinberg E, editors. Washington, D.C.: National Academies Press; 2011. PubMed
14. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. AGREE II: Advancing guideline development, reporting, and evaluation in health care. Prev Med (Baltim). 2010;51(5):421-424. PubMed
15. Brouwers MC, Kho ME, Browman GP, Burgers JS, Cluzeau F, Feder G, et al. Development of the AGREE II, part 2: Assessment of validity of items and tools to support application. CMAJ. 2010;182(10):E472-E478. PubMed
16. He Z, Tian H, Song A, Jin L, Zhou X, Liu X, et al. Quality appraisal of clinical practice guidelines on pancreatic cancer. Medicine (Baltimore). 2015;94(12):e635. PubMed
17. Isaac A, Saginur M, Hartling L, Robinson JL. Quality of reporting and evidence in American Academy of Pediatrics guidelines. Pediatrics. 2013;131(4):732-738. PubMed
18. Lin KW, Yancey JR. Evaluating the Evidence for Choosing WiselyTM in Primary Care Using the Strength of Recommendation Taxonomy (SORT). J Am Board Fam Med. 2016;29(4):512-515. PubMed
19. McAlister FA, van Diepen S, Padwal RS, Johnson JA, Majumdar SR. How evidence-based are the recommendations in evidence-based guidelines? PLoS Med. 2007;4(8):e250. PubMed
20. Tricoci P, Allen JM, Kramer JM, Califf RM, Smith SC. Scientific evidence underlying the ACC/AHA clinical practice guidelines. JAMA. 2009;301(8):831-841. PubMed
21. Feuerstein JD, Gifford AE, Akbari M, Goldman J, Leffler DA, Sheth SG, et al. Systematic analysis underlying the quality of the scientific evidence and conflicts of interest in gastroenterology practice guidelines. Am J Gastroenterol. 2013;108(11):1686-1693. PubMed
22. Robert G, Harlock J, Williams I. Disentangling rhetoric and reality: an international Delphi study of factors and processes that facilitate the successful implementation of decisions to decommission healthcare services. Implement Sci. 2014;9:123. PubMed

Issue
Journal of Hospital Medicine 13(10)
Issue
Journal of Hospital Medicine 13(10)
Page Number
688-691. Published online first April 25, 2018
Page Number
688-691. Published online first April 25, 2018
Publications
Publications
Topics
Article Type
Sections
Article Source

© 2018 Society of Hospital Medicine

Citation Override
Published online first April 25, 2018
Disallow All Ads
Correspondence Location
"Andrew J. Admon, MD, MPH", North Campus Research Complex, Buillding 16 127W, 2800 Plymouth Road, Ann Arbor, MI, 48109-2800; E-mail: ajadmon@umich.edu
Content Gating
Gated (full article locked unless allowed per User)
Alternative CME
Disqus Comments
Default
Gate On Date
Wed, 05/23/2018 - 05:00
Use ProPublica
Hide sidebar & use full width
render the right sidebar.
Gating Strategy
First Peek Free
Article PDF Media
Media Files