Comparison of Methods to Define High Use of Inpatient Services Using Population-Based Data

Article Type
Changed
Display Headline
Comparison of Methods to Define High Use of Inpatient Services Using Population-Based Data

As healthcare system use and costs continue to rise, increased importance has been placed on identifying the small subgroup of patients that drive this trend.1 It is estimated that 5% of healthcare users account for over 60% of healthcare spending.2-6 Furthermore, care for these “high users” is expensive due to an over-reliance on inpatient services. Approximately 40% of all health spending is for inpatient care, the largest single category of health spending, which is similarly skewed toward high users.1,3,5 Improving our understanding of this population may provide an opportunity to direct improvement efforts to a select group of patients with a potentially high benefit, as well as move care away from the costly inpatient setting.

 

However, the development of effective interventions to improve patient experience and outcomes while decreasing costs (referred to as the “Triple Aim” by the Institute for Health Improvement) for high users of inpatient services hinges on the methodology used to identify this high-risk population.7 There is substantial variability in definitions of high users; the most common definitions are based on the number of hospital encounters, days spent in the hospital, and hospital costs.8-15 Definitions have intrinsic differences in their implications around appropriateness, efficiency, and financial sustainability of inpatient resource use. Though the constructs underlying these definitions are highly variable, direct comparisons of differences in patient capture are limited.

A recent study from a single US center explored the clinical characteristics of hospital patients based on definitions of use vs cost and observed important differences in patients’ profiles and outcomes.12 While this suggests that the choice of definition may have major implications for whom to target (and the efficacy of any proposed interventions), this concept has not been explored at the population level. Therefore, we used population-based administrative data from a single-payer healthcare system to compare 3 common definitions of high inpatient service use and their influence on patient capture, health outcomes, and inpatient system burden.

METHODS

Data Sources and Study Population

We conducted a retrospective population-based study using administrative and clinical data for the province of Alberta, including the discharge abstracts database, physician claims, ambulatory care records, population health registry file, and aggregated data from the Canadian census.16 We identified all adults who had 1 or more hospitalizations with a discharge date between April 1, 2012, and March 31, 2013, though the admission date could be prior to April 1, 2012.

Definition of High-Inpatient Use

High-inpatient use was defined using 3 metrics: number of inpatient episodes, length of stay, and cost. As in prior studies, for each definition, individuals in the upper5th percentile of the relevant distribution were designated “high users,”2,15 while patients in the lower 95th percentile were considered “nonhigh users.” Patients could be defined as a high user in more than 1 definition.

Patients with 3 or more hospital episodes were defined as high users for the “number of inpatient episodes” definition. A hospital episode of care was defined as an event that resulted in discharge (or death) from an inpatient facility. If an individual was admitted to a hospital and transferred to another facility within 1 day of discharge, the hospitalizations were considered part of the same episode of care.

The “length of stay” definition refers to the cumulative number of days spent in an inpatient facility for all eligible episodes of care. Patients with 56 or more days in hospital during the study period were considered high users. Day of admission and discharge were considered full inpatient days, regardless of the time of admission and discharge.

The “cost” definition considered the cumulative estimated cost of every eligible episode of care. We estimated costs for each hospitalization using resource intensity weights (RIW). This is a relative weighted value for the average inpatient case after taking factors such as age, comorbidity, and procedures into account. The RIW for each episode was multiplied by the national average inpatient cost.17 Based on this definition, patients with a cumulative hospital cost of ≥ $63,597 were deemed high users. All costs were calculated in Canadian Dollars (CAD, $) and adjusted to 2013 dollars based on Statistics Canada’s Consumer Price Index.18

 

 

Demographic, Clinical, and Encounter Characteristics

Individual characteristics were measured using a combination of provincial administrative data sources. All measures were recorded as of the admission date of the first eligible hospitalization. Demographic characteristics included age, sex, First Nations status, urban/rural status (based on the individual’s residential postal code), and median neighborhood income quintile. Clinical characteristics included 28 comorbid conditions defined based on separate validated International Statistical Classification of Disease and Health Related Problems, Tenth Revision, Canada (ICD-10-CA) coding algorithms reported individually and cumulatively (categorized as 0, 1, 2–3, and 4+).19 Primary care attachment was defined as the percentage of all outpatient primary care visits made to a single practitioner in the 2-year period prior to their first hospitalization (among those with ≥3 visits). Attachment was categorized as 75%-100% (good attachment), 50%-74% (moderate attachment), or <50% (low attachment).20,21

We also identified hospital encounter-level characteristics. These included the most responsible diagnosis, admission category (elective or urgent/emergent), and discharge disposition for each hospital episode. Reported health outcomes included the proportion of patients with in-hospital mortality and those with at least one 30-day, all-cause readmission to hospital.

Analysis

Patient characteristics were described using proportions and means (standard deviation) as appropriate for high users and nonhigh users within and across each definition. Encounter characteristics were also described and stratified by age category (18-64 or 65+ years). Comparison of patient capture was then analyzed among patients who were high use by at least 1 definition. The overlap and agreement of the 3 definitions were compared using a Venn diagram and kappa statistic. The 10 most responsible diagnoses (based on frequency) were also compared across definitions and stratified by age.

Finally, the percentage of system burden accounted for by each measure was calculated as the amount used by high users divided by the total amount used by the entire study population (x 100). To assess the potential modifying effect of age, results were stratified by age category for each definition.

All analyses were conducted using Stata 11.2 (StataCorp LP, College Station, TX).22 The Conjoint Health Research Ethics Board of the University of Calgary approved this study and granted waiver of patient consent. This manuscript is written in accordance with reporting guidelines for studies conducted using observational routinely collected health data (RECORD statement).23

RESULTS

Comparison of Patient and Encounter-level Characterist ics

A total of 219,106 adults had 283,204 inpatient episodes of care within the study timeframe. There were 12,707 (5.8%), 11,095 (5.1%), and 10,956 (5.0%) patients defined as high users based on number of inpatient episodes, length of stay, and cost, respectively (supplementary Figure 1). Regardless of definition, when compared to their non–high use counterparts, patients classified as high use were more likely to be male, older, in a lower median neighborhood income quintile, and have a higher level of comorbidity. Comparing across definitions of high use, those defined by number of inpatient episodes were more likely to be younger, live in rural areas, have better primary care attachment, and have fewer comorbidities, compared to the other definitions. High users by length of stay were more likely to be older and had a higher proportion of mental health–related comorbidities, including dementia and depression, as compared with the other definitions. Results were largely similar for those defined by cost (Table 1).

Encounter-level analyses

showed that high users were more likely to die within hospital (range 3.6%-9.3%) or be discharged to a long-term care setting (range 4.2%-15.2%) ,compared with nonhigh users. High users were also more likely to be readmitted within 30 days during the study period. Comparing across definitions, those defined by number of inpatient episodes were more often discharged home. High users defined by length of stay were more likely to have been discharged to a long-term care facility, while those defined by cost were more likely to have died in hospital (Table 2). Similar trends were observed across definitions when stratified by age with proportions increasing with advancing age (supplementary Table 1).

Comparison of Patient Capture and Inpatient Burden

Of the 22,691 individuals who were defined as high use by at least 1 definition, 2,331 (10.3%) were consistently high use across all 3 definitions (kappa = 0.38; Figure 1). Of the 13,682 individuals classified as high use by at least 1 of length of stay or cost, 8369 (61.2%) were defined as high use by both definitions (kappa = 0.75). However, of the 12,707 defined as high use by the number of inpatient episodes, only 3698 (29.1%) were also defined as high use by another definition. Exploration of the most responsible diagnoses across definitions showed that congestive heart failure (2.8%-3.5%), chronic obstructive pulmonary disease (1.6%-3.2%), and dementia (0.6%-2.2%) were the most frequent. Acute medical conditions (eg, pneumonia [1.8%] or gastroenteritis [0.7%]) that may result in multiple shorter hospitalizations were observed at higher frequencies among high users defined by inpatient episodes, while conditions commonly requiring rehabilitation (eg, fracture [1.8%] and stroke [1.7%]) were more common among high users defined by length of stay and cost (supplementary Table 2). Stratification by age showed marked differences in the diagnoses across high-use definitions. Among hi

gh users defined by inpatient episodes, patients aged 18-64 years had a wide range of medical diagnoses, including several for complications of childbirth. Major diagnoses among high users by length of stay aged 18-64 years were dominated by mental health–related conditions. Diagnoses among older adults (65+) were often related to degenerative neurological conditions (dementia and Alzheimer’s disease). Diagnoses among high users by cost showed similar trends to length of stay (supplementary Table 3).

 

 

When assessing inpatient system burden, high users by number of inpatient episodes accounted for 47,044 (16.6%) of the 283,204 episodes. High users defined by length of stay accounted for 1,286,539 (46.4%) days of 2,773,561 total days, while high users defined by cost accumulated $1.4 billion (38.9%) of the estimated $3.7 billion in inpatient expenditures. High users defined by cost and length of stay each accounted for comparatively few episode

s (8.5% and 8.2%, respectively), while high-cost individuals accounted for 42.8% of length of stay, and high length of stay individuals accounted for 35.8% of cost. High users by number of inpatient episodes accounted for a lower burden of the other definitions (Figure 2). High-user system burden was higher among elderly patients (65+) for all definitions.

DISCUSSION

Using a large population-based cohort of all adults with at least 1 hospitalization in the province of Alberta, Canada, within a 12-month period, we compared 3 commonly used definitions of high inpatient use. The choice of definition had a substantial influence on the types of patients categorized as high use, as well as the proportion of total inpatient utilization that was associated with high users. The definition based on number of inpatient episodes captured a distinct population of high users, while the populations identified using cumulative length of stay or cost were similar.

Differences within and between definitions were especially apparent in age-stratified analyses: Greater length of stay or higher cost among patients aged 18-64 years identifies a large proportion of psychological conditions, while a greater number of inpatient episodes identifies acute conditions and childbirth or labor-related complications. Conversely, definitions based on length of stay and cost in the elderly (65+) identified groups with chronic conditions that result in progressive functional decline (often requiring increasing supportive services upon discharge) or conditions that require significant rehabilitation prior to discharge. Regarding inpatient system burden, high users defined by number of inpatient episodes accounted for a small proportion of total inpatient episodes, while high users defined by length of stay and cost accounted for nearly half of the accumulated hospital days and cost for each. These findings highlight the need for careful consideration of how high use is defined when studying high-user populations and implications for targeting subpopulations for intervention.

Our results add to those from previous studies. A US-based, single-center study of 2566 individuals compared definitions of high inpatient use based on cost and frequency of admission and found that patients defined by cost were predominantly hospitalized for surgical conditions, while those fulfilling the episode-based definition were often hospitalized for medical conditions.12 The most responsible diagnoses for patient hospitalizations in our study reflect this. We extended this comparison to consider the impact of age on outcomes and inpatient system burden and found that older age was also linked to poorer outcomes and increased burden. We also considered a third definition (cumulative length of stay), which provided another opportunity for comparison. The presence of chronic conditions requiring rehabilitation and possible alternate level of care days within our cohort highlights the utility of this length of stay-based approach when considering definitions. Although there were similarities between patients defined by length of stay and cost, partly due to cost being largely a function of length of stay, there were also important differences in their patient profiles. Those defined by cost tended to have conditions requiring surgical procedures not requiring extended in-hospital rehabilitation. Furthermore, the higher proportion of in-hospital mortality among those defined by cost may also reflect the fact that patients tend to accrue the majority of their healthcare expenditures during the final 120 days of life.24

Each definition of high use identified complex patients; however, the differences between the various types of high users identified by these definitions suggest that they are not interchangeable. Arguably, selection of the most appropriate definition should depend on the objective of measuring high users, particularly if an intervention is planned. Interventions for high users are complex, requiring both medical and nonmedical components. The current literature in this area has often focused on case management programs, collaboration with community-based social support programs, and improving coordination and transitions of care.25-27 While many of these approaches require considerable involvement outside of the inpatient setting, these interventions can be informed by defining who high users of inpatient services are. Our findings show several possible subgroups of high users, which could be targeted for intervention. For example, an inpatient episode-based definition, which identifies patients with frequent encounters for acute conditions (eg, pneumonia and urinary tract infections), would be informative if an intervention targeted reductions in inpatient use and readmission rates. Alternatively, an intervention designed to improve community-based mental health programs would best be informed by a definition based on length of stay in which high users with underlying mental health conditions were prevalent. Such interventions are rarely mutually exclusive and require multiple perspectives to inform their objectives. A well-designed intervention will not only address the medical characteristics of high users but also the social determinants of health that place patients at risk of high inpatient use.

Our study should be interpreted in light of its limitations. First, measures of disease severity were not available to further characterize similarities and differences across high-use groups. Furthermore, we were unable to account for other social determinants of health that may be relevant to inpatient system usage. Second, direct cost of hospitalizations was estimated based on RIW and is thus reflective of expected rather than actual costs. However, this will have minimal impact on capture, as patients defined by this metric require substantial costs to be included in the top fifth percentile, and thus deviations in individual hospitalization costs will have minimal influence on the cumulative cost. Finally, while inpatient spending makes up a large proportion of healthcare spending, there is likely a number of different high-use profiles found outside of the acute care setting. Despite these limitations, our study includes several key strengths. The use of population-level data allows for analysis that is robust and more generalizable than studies from single centers. Additionally, the comparison of 3 independent definitions allows for a greater comparison of the nuances of each definition. Our study also considers the important impact of age as an effect modifier of inpatient use in the general population and identifies distinct patient profiles that exist across each definition.

 

 

CONCLUSIONS

Definitions of high use of inpatient services based on number of inpatient episodes, days spent in hospital, and total hospital costs identify patient populations with different characteristics and differ substantially in their impact on health outcomes and inpatient burden. These results highlight the need for careful consideration of the context of the study or intervention and the implications of selecting a specific definition of high inpatient use at study conception. Ultimately, the performance of an intervention in high-use populations is likely to be conditional on the fit of the patient population generated by the chosen definition of high inpatient use to the objectives of a study.

Acknowledgments

This study is based in part on data provided by Alberta Health and Alberta Health Services. The interpretation and conclusions are those of the researchers and do not represent the views of the Government of Alberta. Neither the Government of Alberta nor Alberta Health express any opinion in relation to this study.

Disclosure

Dr. Hemmelgarn is supported by the Roy and Vi Baay Chair in Kidney Research. Dr. Manns is supported by the Svare Professorship in Health Economics and by a Health Scholar Award by Alberta Innovates Health Solutions (AIHS). Dr. Tonelli is supported by the David Freeze chair in Health Services Research. The Interdisciplinary Chronic Disease Collaboration is funded by AIHS—Collaborative Research and Innovation Opportunity (CRIO) Team Grants Program.

 

Files
References

1. National Health Expenditure Trends, 1975 to 2015. Canadian Institute for Health Information. 2015. https://secure.cihi.ca/free_products/nhex_trends_narrative_report_2015_en.pdf. Accessed on June 23, 2016.
2. Berk ML, Monheit AC. The concentration of health care expenditures, revisited. Health Aff (Millwood). 2001;20:9-18. PubMed
3. Wodchis WP, Austin PC, Henry DA. A 3-year study of high-cost users of health care. CMAJ. 2016;188(3):182-188. PubMed
4. Forget EL, Roos LL, Deber RB, Wald R. Variations in Lifetime Healthcare Costs across a Population. Healthc Policy. 2008;4:e148-e167. PubMed
5. Joynt KE, Gawande AA, Orav EJ, Jha AK. Contribution of preventable acute care spending to total spending for high-cost Medicare patients. JAMA. 2013;309:2572-2578. PubMed
6. Riley GF. Long-term trends in the concentration of Medicare spending. Health Aff (Millwood). 2007;26:808-816. PubMed
7. IHI Triple Aim Initiative. Institute for Healthcare Improvement. 2015. http://www.ihi.org/engage/initiatives/TripleAim/Pages/default.aspx. Accessed on June 17, 2016.
8. Johansen H, Nair C, Bond J. Who goes to the hospital? An investigation of high users of hospital days. Health Reports. 1994;6(2):253-277. PubMed
9. Conwell LJ, Cohen JW. Characteristics of persons with high medical expenditures in the US civilian noninstitutionalized population. MEPS Statistical Brief# 73. 2002. 
10. Lemstra M, Mackenbach J, Neudorf C, Nannapaneni U. High health care utilization and costs associated with lower socio-economic status: Results from a linked dataset. CJPH. 2009;100(3):180-183. PubMed
11. Macnee CL, McCabe S, Clarke PN, Fiske M, Campbell S. Typology of high users of health services among a rural medicaid population. Pub Health Nurs. 2009;26(5):396-404. PubMed
12. Nguyen OK, Tang N, Hillman JM, Gonzales R. What’s cost got to do with it? Association between hospital costs and frequency of admissions among “high users” of hospital care. J. Hosp Med. 2013;8(12):665-671. PubMed
13. Rosella LC, Fitzpatrick T, Wodchis WP, Calzavara A, Manson H, Goel V. High-cost health care users in Ontario, Canada: Demographic, socio-economic, and health status characteristics. BMC Health Serv Res. 2014;14(1):532. PubMed
14. Cohen SB. The Concentration of Health Care Expenditures and Related Expenses for Costly Medical Conditions, 2009. Agency for Healthcare Research and Quality Statistical Brief #359; 2012. 

15. Ronksley PE, McKay JA, Kobewka DM, Mulpuru S, Forster AJ. Patterns of health care use in a high-cost inpatient population in Ottawa, Ontario: A retrospective observational study. CMAJ Open. 2015; 3:E111-E118. PubMed
16. Hemmelgarn BR, Clement F, Manns BJ, et al. Overview of the Alberta Kidney Disease Network. BMC Nephrol. 2009;10:30. PubMed
17. DAD Resource Intensity Weights and Expected Length of Stay. Canadian Institute for Health Information. 2016. https://www.cihi.ca/en/data-and-standards/standards/case-mix/resource-indicators-dad-resource-intensity-weights-and. Accessed on June 24, 2016.
18. Statistics Canada. The Canadian Consumer Price Index Reference Paper, Statistics Canada Catalogue no. 62-553-X.
19. Tonelli M, Wiebe N, Fortin M, et al. Methods for identifying 30 chronic conditions: Application to administrative data. BMC Med Inform Decis Mak. 2015;17:15(1):1. PubMed
20. Jaakkimainen RL, Klein-Geltink J, Guttmann A, Barnsley J, Jagorski B, Kopp A. Indicators of primary care based on administrative data. In Primary Care in Ontario: ICES Atlas. Toronto, Ontario: Institute for Clinical Evaluative Sciences; 2006. 
21. Jee SH, Cabana MD. Indices for continuity of care: A systematic review of the literature. Med Care Res Rev. 2006;63:158-188. PubMed
22. Stata Statistical Software: Release 11. College Station, TX: StataCorp LP. 2009. 
23. Benchimol EI, Smeeth L, Guttmann A, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 2015;12(10):e1001885. PubMed
24. Tanuseputro P, Wodchis WP, Fowler R, et al. The health care cost of dying: A population-based retrospective cohort study of the last year of life in ontario, canada. PLoS One. 2015;10(3):e0121759. PubMed
25. Hong CS, Siegel AL, Ferris TG. Caring for high-need, high-cost patients: What makes for a successful care management program? Issue Brief (Commonw Fund). 2014;19:1-19. PubMed
26. Birnbaum M, Halper DE. Rethinking service delivery for high-cost Medicaid patients. Medicaid Institute. 2009. http://shnny.org/research/rethinking-service-delivery-for-high-cost-medicaid-patients/. Accessed on Jan 11, 2017.
27. Pan-Canadian forum on high users of health care. Canadian Institute for Health Information. 2014. https://secure.cihi.ca/free_products/highusers_summary_report_revised_EN_web.pdf. Accessed on Jan 11, 2017.

Article PDF
Issue
Journal of Hospital Medicine 12 (8)
Topics
Page Number
596-602
Sections
Files
Files
Article PDF
Article PDF

As healthcare system use and costs continue to rise, increased importance has been placed on identifying the small subgroup of patients that drive this trend.1 It is estimated that 5% of healthcare users account for over 60% of healthcare spending.2-6 Furthermore, care for these “high users” is expensive due to an over-reliance on inpatient services. Approximately 40% of all health spending is for inpatient care, the largest single category of health spending, which is similarly skewed toward high users.1,3,5 Improving our understanding of this population may provide an opportunity to direct improvement efforts to a select group of patients with a potentially high benefit, as well as move care away from the costly inpatient setting.

 

However, the development of effective interventions to improve patient experience and outcomes while decreasing costs (referred to as the “Triple Aim” by the Institute for Health Improvement) for high users of inpatient services hinges on the methodology used to identify this high-risk population.7 There is substantial variability in definitions of high users; the most common definitions are based on the number of hospital encounters, days spent in the hospital, and hospital costs.8-15 Definitions have intrinsic differences in their implications around appropriateness, efficiency, and financial sustainability of inpatient resource use. Though the constructs underlying these definitions are highly variable, direct comparisons of differences in patient capture are limited.

A recent study from a single US center explored the clinical characteristics of hospital patients based on definitions of use vs cost and observed important differences in patients’ profiles and outcomes.12 While this suggests that the choice of definition may have major implications for whom to target (and the efficacy of any proposed interventions), this concept has not been explored at the population level. Therefore, we used population-based administrative data from a single-payer healthcare system to compare 3 common definitions of high inpatient service use and their influence on patient capture, health outcomes, and inpatient system burden.

METHODS

Data Sources and Study Population

We conducted a retrospective population-based study using administrative and clinical data for the province of Alberta, including the discharge abstracts database, physician claims, ambulatory care records, population health registry file, and aggregated data from the Canadian census.16 We identified all adults who had 1 or more hospitalizations with a discharge date between April 1, 2012, and March 31, 2013, though the admission date could be prior to April 1, 2012.

Definition of High-Inpatient Use

High-inpatient use was defined using 3 metrics: number of inpatient episodes, length of stay, and cost. As in prior studies, for each definition, individuals in the upper5th percentile of the relevant distribution were designated “high users,”2,15 while patients in the lower 95th percentile were considered “nonhigh users.” Patients could be defined as a high user in more than 1 definition.

Patients with 3 or more hospital episodes were defined as high users for the “number of inpatient episodes” definition. A hospital episode of care was defined as an event that resulted in discharge (or death) from an inpatient facility. If an individual was admitted to a hospital and transferred to another facility within 1 day of discharge, the hospitalizations were considered part of the same episode of care.

The “length of stay” definition refers to the cumulative number of days spent in an inpatient facility for all eligible episodes of care. Patients with 56 or more days in hospital during the study period were considered high users. Day of admission and discharge were considered full inpatient days, regardless of the time of admission and discharge.

The “cost” definition considered the cumulative estimated cost of every eligible episode of care. We estimated costs for each hospitalization using resource intensity weights (RIW). This is a relative weighted value for the average inpatient case after taking factors such as age, comorbidity, and procedures into account. The RIW for each episode was multiplied by the national average inpatient cost.17 Based on this definition, patients with a cumulative hospital cost of ≥ $63,597 were deemed high users. All costs were calculated in Canadian Dollars (CAD, $) and adjusted to 2013 dollars based on Statistics Canada’s Consumer Price Index.18

 

 

Demographic, Clinical, and Encounter Characteristics

Individual characteristics were measured using a combination of provincial administrative data sources. All measures were recorded as of the admission date of the first eligible hospitalization. Demographic characteristics included age, sex, First Nations status, urban/rural status (based on the individual’s residential postal code), and median neighborhood income quintile. Clinical characteristics included 28 comorbid conditions defined based on separate validated International Statistical Classification of Disease and Health Related Problems, Tenth Revision, Canada (ICD-10-CA) coding algorithms reported individually and cumulatively (categorized as 0, 1, 2–3, and 4+).19 Primary care attachment was defined as the percentage of all outpatient primary care visits made to a single practitioner in the 2-year period prior to their first hospitalization (among those with ≥3 visits). Attachment was categorized as 75%-100% (good attachment), 50%-74% (moderate attachment), or <50% (low attachment).20,21

We also identified hospital encounter-level characteristics. These included the most responsible diagnosis, admission category (elective or urgent/emergent), and discharge disposition for each hospital episode. Reported health outcomes included the proportion of patients with in-hospital mortality and those with at least one 30-day, all-cause readmission to hospital.

Analysis

Patient characteristics were described using proportions and means (standard deviation) as appropriate for high users and nonhigh users within and across each definition. Encounter characteristics were also described and stratified by age category (18-64 or 65+ years). Comparison of patient capture was then analyzed among patients who were high use by at least 1 definition. The overlap and agreement of the 3 definitions were compared using a Venn diagram and kappa statistic. The 10 most responsible diagnoses (based on frequency) were also compared across definitions and stratified by age.

Finally, the percentage of system burden accounted for by each measure was calculated as the amount used by high users divided by the total amount used by the entire study population (x 100). To assess the potential modifying effect of age, results were stratified by age category for each definition.

All analyses were conducted using Stata 11.2 (StataCorp LP, College Station, TX).22 The Conjoint Health Research Ethics Board of the University of Calgary approved this study and granted waiver of patient consent. This manuscript is written in accordance with reporting guidelines for studies conducted using observational routinely collected health data (RECORD statement).23

RESULTS

Comparison of Patient and Encounter-level Characterist ics

A total of 219,106 adults had 283,204 inpatient episodes of care within the study timeframe. There were 12,707 (5.8%), 11,095 (5.1%), and 10,956 (5.0%) patients defined as high users based on number of inpatient episodes, length of stay, and cost, respectively (supplementary Figure 1). Regardless of definition, when compared to their non–high use counterparts, patients classified as high use were more likely to be male, older, in a lower median neighborhood income quintile, and have a higher level of comorbidity. Comparing across definitions of high use, those defined by number of inpatient episodes were more likely to be younger, live in rural areas, have better primary care attachment, and have fewer comorbidities, compared to the other definitions. High users by length of stay were more likely to be older and had a higher proportion of mental health–related comorbidities, including dementia and depression, as compared with the other definitions. Results were largely similar for those defined by cost (Table 1).

Encounter-level analyses

showed that high users were more likely to die within hospital (range 3.6%-9.3%) or be discharged to a long-term care setting (range 4.2%-15.2%) ,compared with nonhigh users. High users were also more likely to be readmitted within 30 days during the study period. Comparing across definitions, those defined by number of inpatient episodes were more often discharged home. High users defined by length of stay were more likely to have been discharged to a long-term care facility, while those defined by cost were more likely to have died in hospital (Table 2). Similar trends were observed across definitions when stratified by age with proportions increasing with advancing age (supplementary Table 1).

Comparison of Patient Capture and Inpatient Burden

Of the 22,691 individuals who were defined as high use by at least 1 definition, 2,331 (10.3%) were consistently high use across all 3 definitions (kappa = 0.38; Figure 1). Of the 13,682 individuals classified as high use by at least 1 of length of stay or cost, 8369 (61.2%) were defined as high use by both definitions (kappa = 0.75). However, of the 12,707 defined as high use by the number of inpatient episodes, only 3698 (29.1%) were also defined as high use by another definition. Exploration of the most responsible diagnoses across definitions showed that congestive heart failure (2.8%-3.5%), chronic obstructive pulmonary disease (1.6%-3.2%), and dementia (0.6%-2.2%) were the most frequent. Acute medical conditions (eg, pneumonia [1.8%] or gastroenteritis [0.7%]) that may result in multiple shorter hospitalizations were observed at higher frequencies among high users defined by inpatient episodes, while conditions commonly requiring rehabilitation (eg, fracture [1.8%] and stroke [1.7%]) were more common among high users defined by length of stay and cost (supplementary Table 2). Stratification by age showed marked differences in the diagnoses across high-use definitions. Among hi

gh users defined by inpatient episodes, patients aged 18-64 years had a wide range of medical diagnoses, including several for complications of childbirth. Major diagnoses among high users by length of stay aged 18-64 years were dominated by mental health–related conditions. Diagnoses among older adults (65+) were often related to degenerative neurological conditions (dementia and Alzheimer’s disease). Diagnoses among high users by cost showed similar trends to length of stay (supplementary Table 3).

 

 

When assessing inpatient system burden, high users by number of inpatient episodes accounted for 47,044 (16.6%) of the 283,204 episodes. High users defined by length of stay accounted for 1,286,539 (46.4%) days of 2,773,561 total days, while high users defined by cost accumulated $1.4 billion (38.9%) of the estimated $3.7 billion in inpatient expenditures. High users defined by cost and length of stay each accounted for comparatively few episode

s (8.5% and 8.2%, respectively), while high-cost individuals accounted for 42.8% of length of stay, and high length of stay individuals accounted for 35.8% of cost. High users by number of inpatient episodes accounted for a lower burden of the other definitions (Figure 2). High-user system burden was higher among elderly patients (65+) for all definitions.

DISCUSSION

Using a large population-based cohort of all adults with at least 1 hospitalization in the province of Alberta, Canada, within a 12-month period, we compared 3 commonly used definitions of high inpatient use. The choice of definition had a substantial influence on the types of patients categorized as high use, as well as the proportion of total inpatient utilization that was associated with high users. The definition based on number of inpatient episodes captured a distinct population of high users, while the populations identified using cumulative length of stay or cost were similar.

Differences within and between definitions were especially apparent in age-stratified analyses: Greater length of stay or higher cost among patients aged 18-64 years identifies a large proportion of psychological conditions, while a greater number of inpatient episodes identifies acute conditions and childbirth or labor-related complications. Conversely, definitions based on length of stay and cost in the elderly (65+) identified groups with chronic conditions that result in progressive functional decline (often requiring increasing supportive services upon discharge) or conditions that require significant rehabilitation prior to discharge. Regarding inpatient system burden, high users defined by number of inpatient episodes accounted for a small proportion of total inpatient episodes, while high users defined by length of stay and cost accounted for nearly half of the accumulated hospital days and cost for each. These findings highlight the need for careful consideration of how high use is defined when studying high-user populations and implications for targeting subpopulations for intervention.

Our results add to those from previous studies. A US-based, single-center study of 2566 individuals compared definitions of high inpatient use based on cost and frequency of admission and found that patients defined by cost were predominantly hospitalized for surgical conditions, while those fulfilling the episode-based definition were often hospitalized for medical conditions.12 The most responsible diagnoses for patient hospitalizations in our study reflect this. We extended this comparison to consider the impact of age on outcomes and inpatient system burden and found that older age was also linked to poorer outcomes and increased burden. We also considered a third definition (cumulative length of stay), which provided another opportunity for comparison. The presence of chronic conditions requiring rehabilitation and possible alternate level of care days within our cohort highlights the utility of this length of stay-based approach when considering definitions. Although there were similarities between patients defined by length of stay and cost, partly due to cost being largely a function of length of stay, there were also important differences in their patient profiles. Those defined by cost tended to have conditions requiring surgical procedures not requiring extended in-hospital rehabilitation. Furthermore, the higher proportion of in-hospital mortality among those defined by cost may also reflect the fact that patients tend to accrue the majority of their healthcare expenditures during the final 120 days of life.24

Each definition of high use identified complex patients; however, the differences between the various types of high users identified by these definitions suggest that they are not interchangeable. Arguably, selection of the most appropriate definition should depend on the objective of measuring high users, particularly if an intervention is planned. Interventions for high users are complex, requiring both medical and nonmedical components. The current literature in this area has often focused on case management programs, collaboration with community-based social support programs, and improving coordination and transitions of care.25-27 While many of these approaches require considerable involvement outside of the inpatient setting, these interventions can be informed by defining who high users of inpatient services are. Our findings show several possible subgroups of high users, which could be targeted for intervention. For example, an inpatient episode-based definition, which identifies patients with frequent encounters for acute conditions (eg, pneumonia and urinary tract infections), would be informative if an intervention targeted reductions in inpatient use and readmission rates. Alternatively, an intervention designed to improve community-based mental health programs would best be informed by a definition based on length of stay in which high users with underlying mental health conditions were prevalent. Such interventions are rarely mutually exclusive and require multiple perspectives to inform their objectives. A well-designed intervention will not only address the medical characteristics of high users but also the social determinants of health that place patients at risk of high inpatient use.

Our study should be interpreted in light of its limitations. First, measures of disease severity were not available to further characterize similarities and differences across high-use groups. Furthermore, we were unable to account for other social determinants of health that may be relevant to inpatient system usage. Second, direct cost of hospitalizations was estimated based on RIW and is thus reflective of expected rather than actual costs. However, this will have minimal impact on capture, as patients defined by this metric require substantial costs to be included in the top fifth percentile, and thus deviations in individual hospitalization costs will have minimal influence on the cumulative cost. Finally, while inpatient spending makes up a large proportion of healthcare spending, there is likely a number of different high-use profiles found outside of the acute care setting. Despite these limitations, our study includes several key strengths. The use of population-level data allows for analysis that is robust and more generalizable than studies from single centers. Additionally, the comparison of 3 independent definitions allows for a greater comparison of the nuances of each definition. Our study also considers the important impact of age as an effect modifier of inpatient use in the general population and identifies distinct patient profiles that exist across each definition.

 

 

CONCLUSIONS

Definitions of high use of inpatient services based on number of inpatient episodes, days spent in hospital, and total hospital costs identify patient populations with different characteristics and differ substantially in their impact on health outcomes and inpatient burden. These results highlight the need for careful consideration of the context of the study or intervention and the implications of selecting a specific definition of high inpatient use at study conception. Ultimately, the performance of an intervention in high-use populations is likely to be conditional on the fit of the patient population generated by the chosen definition of high inpatient use to the objectives of a study.

Acknowledgments

This study is based in part on data provided by Alberta Health and Alberta Health Services. The interpretation and conclusions are those of the researchers and do not represent the views of the Government of Alberta. Neither the Government of Alberta nor Alberta Health express any opinion in relation to this study.

Disclosure

Dr. Hemmelgarn is supported by the Roy and Vi Baay Chair in Kidney Research. Dr. Manns is supported by the Svare Professorship in Health Economics and by a Health Scholar Award by Alberta Innovates Health Solutions (AIHS). Dr. Tonelli is supported by the David Freeze chair in Health Services Research. The Interdisciplinary Chronic Disease Collaboration is funded by AIHS—Collaborative Research and Innovation Opportunity (CRIO) Team Grants Program.

 

As healthcare system use and costs continue to rise, increased importance has been placed on identifying the small subgroup of patients that drive this trend.1 It is estimated that 5% of healthcare users account for over 60% of healthcare spending.2-6 Furthermore, care for these “high users” is expensive due to an over-reliance on inpatient services. Approximately 40% of all health spending is for inpatient care, the largest single category of health spending, which is similarly skewed toward high users.1,3,5 Improving our understanding of this population may provide an opportunity to direct improvement efforts to a select group of patients with a potentially high benefit, as well as move care away from the costly inpatient setting.

 

However, the development of effective interventions to improve patient experience and outcomes while decreasing costs (referred to as the “Triple Aim” by the Institute for Health Improvement) for high users of inpatient services hinges on the methodology used to identify this high-risk population.7 There is substantial variability in definitions of high users; the most common definitions are based on the number of hospital encounters, days spent in the hospital, and hospital costs.8-15 Definitions have intrinsic differences in their implications around appropriateness, efficiency, and financial sustainability of inpatient resource use. Though the constructs underlying these definitions are highly variable, direct comparisons of differences in patient capture are limited.

A recent study from a single US center explored the clinical characteristics of hospital patients based on definitions of use vs cost and observed important differences in patients’ profiles and outcomes.12 While this suggests that the choice of definition may have major implications for whom to target (and the efficacy of any proposed interventions), this concept has not been explored at the population level. Therefore, we used population-based administrative data from a single-payer healthcare system to compare 3 common definitions of high inpatient service use and their influence on patient capture, health outcomes, and inpatient system burden.

METHODS

Data Sources and Study Population

We conducted a retrospective population-based study using administrative and clinical data for the province of Alberta, including the discharge abstracts database, physician claims, ambulatory care records, population health registry file, and aggregated data from the Canadian census.16 We identified all adults who had 1 or more hospitalizations with a discharge date between April 1, 2012, and March 31, 2013, though the admission date could be prior to April 1, 2012.

Definition of High-Inpatient Use

High-inpatient use was defined using 3 metrics: number of inpatient episodes, length of stay, and cost. As in prior studies, for each definition, individuals in the upper5th percentile of the relevant distribution were designated “high users,”2,15 while patients in the lower 95th percentile were considered “nonhigh users.” Patients could be defined as a high user in more than 1 definition.

Patients with 3 or more hospital episodes were defined as high users for the “number of inpatient episodes” definition. A hospital episode of care was defined as an event that resulted in discharge (or death) from an inpatient facility. If an individual was admitted to a hospital and transferred to another facility within 1 day of discharge, the hospitalizations were considered part of the same episode of care.

The “length of stay” definition refers to the cumulative number of days spent in an inpatient facility for all eligible episodes of care. Patients with 56 or more days in hospital during the study period were considered high users. Day of admission and discharge were considered full inpatient days, regardless of the time of admission and discharge.

The “cost” definition considered the cumulative estimated cost of every eligible episode of care. We estimated costs for each hospitalization using resource intensity weights (RIW). This is a relative weighted value for the average inpatient case after taking factors such as age, comorbidity, and procedures into account. The RIW for each episode was multiplied by the national average inpatient cost.17 Based on this definition, patients with a cumulative hospital cost of ≥ $63,597 were deemed high users. All costs were calculated in Canadian Dollars (CAD, $) and adjusted to 2013 dollars based on Statistics Canada’s Consumer Price Index.18

 

 

Demographic, Clinical, and Encounter Characteristics

Individual characteristics were measured using a combination of provincial administrative data sources. All measures were recorded as of the admission date of the first eligible hospitalization. Demographic characteristics included age, sex, First Nations status, urban/rural status (based on the individual’s residential postal code), and median neighborhood income quintile. Clinical characteristics included 28 comorbid conditions defined based on separate validated International Statistical Classification of Disease and Health Related Problems, Tenth Revision, Canada (ICD-10-CA) coding algorithms reported individually and cumulatively (categorized as 0, 1, 2–3, and 4+).19 Primary care attachment was defined as the percentage of all outpatient primary care visits made to a single practitioner in the 2-year period prior to their first hospitalization (among those with ≥3 visits). Attachment was categorized as 75%-100% (good attachment), 50%-74% (moderate attachment), or <50% (low attachment).20,21

We also identified hospital encounter-level characteristics. These included the most responsible diagnosis, admission category (elective or urgent/emergent), and discharge disposition for each hospital episode. Reported health outcomes included the proportion of patients with in-hospital mortality and those with at least one 30-day, all-cause readmission to hospital.

Analysis

Patient characteristics were described using proportions and means (standard deviation) as appropriate for high users and nonhigh users within and across each definition. Encounter characteristics were also described and stratified by age category (18-64 or 65+ years). Comparison of patient capture was then analyzed among patients who were high use by at least 1 definition. The overlap and agreement of the 3 definitions were compared using a Venn diagram and kappa statistic. The 10 most responsible diagnoses (based on frequency) were also compared across definitions and stratified by age.

Finally, the percentage of system burden accounted for by each measure was calculated as the amount used by high users divided by the total amount used by the entire study population (x 100). To assess the potential modifying effect of age, results were stratified by age category for each definition.

All analyses were conducted using Stata 11.2 (StataCorp LP, College Station, TX).22 The Conjoint Health Research Ethics Board of the University of Calgary approved this study and granted waiver of patient consent. This manuscript is written in accordance with reporting guidelines for studies conducted using observational routinely collected health data (RECORD statement).23

RESULTS

Comparison of Patient and Encounter-level Characterist ics

A total of 219,106 adults had 283,204 inpatient episodes of care within the study timeframe. There were 12,707 (5.8%), 11,095 (5.1%), and 10,956 (5.0%) patients defined as high users based on number of inpatient episodes, length of stay, and cost, respectively (supplementary Figure 1). Regardless of definition, when compared to their non–high use counterparts, patients classified as high use were more likely to be male, older, in a lower median neighborhood income quintile, and have a higher level of comorbidity. Comparing across definitions of high use, those defined by number of inpatient episodes were more likely to be younger, live in rural areas, have better primary care attachment, and have fewer comorbidities, compared to the other definitions. High users by length of stay were more likely to be older and had a higher proportion of mental health–related comorbidities, including dementia and depression, as compared with the other definitions. Results were largely similar for those defined by cost (Table 1).

Encounter-level analyses

showed that high users were more likely to die within hospital (range 3.6%-9.3%) or be discharged to a long-term care setting (range 4.2%-15.2%) ,compared with nonhigh users. High users were also more likely to be readmitted within 30 days during the study period. Comparing across definitions, those defined by number of inpatient episodes were more often discharged home. High users defined by length of stay were more likely to have been discharged to a long-term care facility, while those defined by cost were more likely to have died in hospital (Table 2). Similar trends were observed across definitions when stratified by age with proportions increasing with advancing age (supplementary Table 1).

Comparison of Patient Capture and Inpatient Burden

Of the 22,691 individuals who were defined as high use by at least 1 definition, 2,331 (10.3%) were consistently high use across all 3 definitions (kappa = 0.38; Figure 1). Of the 13,682 individuals classified as high use by at least 1 of length of stay or cost, 8369 (61.2%) were defined as high use by both definitions (kappa = 0.75). However, of the 12,707 defined as high use by the number of inpatient episodes, only 3698 (29.1%) were also defined as high use by another definition. Exploration of the most responsible diagnoses across definitions showed that congestive heart failure (2.8%-3.5%), chronic obstructive pulmonary disease (1.6%-3.2%), and dementia (0.6%-2.2%) were the most frequent. Acute medical conditions (eg, pneumonia [1.8%] or gastroenteritis [0.7%]) that may result in multiple shorter hospitalizations were observed at higher frequencies among high users defined by inpatient episodes, while conditions commonly requiring rehabilitation (eg, fracture [1.8%] and stroke [1.7%]) were more common among high users defined by length of stay and cost (supplementary Table 2). Stratification by age showed marked differences in the diagnoses across high-use definitions. Among hi

gh users defined by inpatient episodes, patients aged 18-64 years had a wide range of medical diagnoses, including several for complications of childbirth. Major diagnoses among high users by length of stay aged 18-64 years were dominated by mental health–related conditions. Diagnoses among older adults (65+) were often related to degenerative neurological conditions (dementia and Alzheimer’s disease). Diagnoses among high users by cost showed similar trends to length of stay (supplementary Table 3).

 

 

When assessing inpatient system burden, high users by number of inpatient episodes accounted for 47,044 (16.6%) of the 283,204 episodes. High users defined by length of stay accounted for 1,286,539 (46.4%) days of 2,773,561 total days, while high users defined by cost accumulated $1.4 billion (38.9%) of the estimated $3.7 billion in inpatient expenditures. High users defined by cost and length of stay each accounted for comparatively few episode

s (8.5% and 8.2%, respectively), while high-cost individuals accounted for 42.8% of length of stay, and high length of stay individuals accounted for 35.8% of cost. High users by number of inpatient episodes accounted for a lower burden of the other definitions (Figure 2). High-user system burden was higher among elderly patients (65+) for all definitions.

DISCUSSION

Using a large population-based cohort of all adults with at least 1 hospitalization in the province of Alberta, Canada, within a 12-month period, we compared 3 commonly used definitions of high inpatient use. The choice of definition had a substantial influence on the types of patients categorized as high use, as well as the proportion of total inpatient utilization that was associated with high users. The definition based on number of inpatient episodes captured a distinct population of high users, while the populations identified using cumulative length of stay or cost were similar.

Differences within and between definitions were especially apparent in age-stratified analyses: Greater length of stay or higher cost among patients aged 18-64 years identifies a large proportion of psychological conditions, while a greater number of inpatient episodes identifies acute conditions and childbirth or labor-related complications. Conversely, definitions based on length of stay and cost in the elderly (65+) identified groups with chronic conditions that result in progressive functional decline (often requiring increasing supportive services upon discharge) or conditions that require significant rehabilitation prior to discharge. Regarding inpatient system burden, high users defined by number of inpatient episodes accounted for a small proportion of total inpatient episodes, while high users defined by length of stay and cost accounted for nearly half of the accumulated hospital days and cost for each. These findings highlight the need for careful consideration of how high use is defined when studying high-user populations and implications for targeting subpopulations for intervention.

Our results add to those from previous studies. A US-based, single-center study of 2566 individuals compared definitions of high inpatient use based on cost and frequency of admission and found that patients defined by cost were predominantly hospitalized for surgical conditions, while those fulfilling the episode-based definition were often hospitalized for medical conditions.12 The most responsible diagnoses for patient hospitalizations in our study reflect this. We extended this comparison to consider the impact of age on outcomes and inpatient system burden and found that older age was also linked to poorer outcomes and increased burden. We also considered a third definition (cumulative length of stay), which provided another opportunity for comparison. The presence of chronic conditions requiring rehabilitation and possible alternate level of care days within our cohort highlights the utility of this length of stay-based approach when considering definitions. Although there were similarities between patients defined by length of stay and cost, partly due to cost being largely a function of length of stay, there were also important differences in their patient profiles. Those defined by cost tended to have conditions requiring surgical procedures not requiring extended in-hospital rehabilitation. Furthermore, the higher proportion of in-hospital mortality among those defined by cost may also reflect the fact that patients tend to accrue the majority of their healthcare expenditures during the final 120 days of life.24

Each definition of high use identified complex patients; however, the differences between the various types of high users identified by these definitions suggest that they are not interchangeable. Arguably, selection of the most appropriate definition should depend on the objective of measuring high users, particularly if an intervention is planned. Interventions for high users are complex, requiring both medical and nonmedical components. The current literature in this area has often focused on case management programs, collaboration with community-based social support programs, and improving coordination and transitions of care.25-27 While many of these approaches require considerable involvement outside of the inpatient setting, these interventions can be informed by defining who high users of inpatient services are. Our findings show several possible subgroups of high users, which could be targeted for intervention. For example, an inpatient episode-based definition, which identifies patients with frequent encounters for acute conditions (eg, pneumonia and urinary tract infections), would be informative if an intervention targeted reductions in inpatient use and readmission rates. Alternatively, an intervention designed to improve community-based mental health programs would best be informed by a definition based on length of stay in which high users with underlying mental health conditions were prevalent. Such interventions are rarely mutually exclusive and require multiple perspectives to inform their objectives. A well-designed intervention will not only address the medical characteristics of high users but also the social determinants of health that place patients at risk of high inpatient use.

Our study should be interpreted in light of its limitations. First, measures of disease severity were not available to further characterize similarities and differences across high-use groups. Furthermore, we were unable to account for other social determinants of health that may be relevant to inpatient system usage. Second, direct cost of hospitalizations was estimated based on RIW and is thus reflective of expected rather than actual costs. However, this will have minimal impact on capture, as patients defined by this metric require substantial costs to be included in the top fifth percentile, and thus deviations in individual hospitalization costs will have minimal influence on the cumulative cost. Finally, while inpatient spending makes up a large proportion of healthcare spending, there is likely a number of different high-use profiles found outside of the acute care setting. Despite these limitations, our study includes several key strengths. The use of population-level data allows for analysis that is robust and more generalizable than studies from single centers. Additionally, the comparison of 3 independent definitions allows for a greater comparison of the nuances of each definition. Our study also considers the important impact of age as an effect modifier of inpatient use in the general population and identifies distinct patient profiles that exist across each definition.

 

 

CONCLUSIONS

Definitions of high use of inpatient services based on number of inpatient episodes, days spent in hospital, and total hospital costs identify patient populations with different characteristics and differ substantially in their impact on health outcomes and inpatient burden. These results highlight the need for careful consideration of the context of the study or intervention and the implications of selecting a specific definition of high inpatient use at study conception. Ultimately, the performance of an intervention in high-use populations is likely to be conditional on the fit of the patient population generated by the chosen definition of high inpatient use to the objectives of a study.

Acknowledgments

This study is based in part on data provided by Alberta Health and Alberta Health Services. The interpretation and conclusions are those of the researchers and do not represent the views of the Government of Alberta. Neither the Government of Alberta nor Alberta Health express any opinion in relation to this study.

Disclosure

Dr. Hemmelgarn is supported by the Roy and Vi Baay Chair in Kidney Research. Dr. Manns is supported by the Svare Professorship in Health Economics and by a Health Scholar Award by Alberta Innovates Health Solutions (AIHS). Dr. Tonelli is supported by the David Freeze chair in Health Services Research. The Interdisciplinary Chronic Disease Collaboration is funded by AIHS—Collaborative Research and Innovation Opportunity (CRIO) Team Grants Program.

 

References

1. National Health Expenditure Trends, 1975 to 2015. Canadian Institute for Health Information. 2015. https://secure.cihi.ca/free_products/nhex_trends_narrative_report_2015_en.pdf. Accessed on June 23, 2016.
2. Berk ML, Monheit AC. The concentration of health care expenditures, revisited. Health Aff (Millwood). 2001;20:9-18. PubMed
3. Wodchis WP, Austin PC, Henry DA. A 3-year study of high-cost users of health care. CMAJ. 2016;188(3):182-188. PubMed
4. Forget EL, Roos LL, Deber RB, Wald R. Variations in Lifetime Healthcare Costs across a Population. Healthc Policy. 2008;4:e148-e167. PubMed
5. Joynt KE, Gawande AA, Orav EJ, Jha AK. Contribution of preventable acute care spending to total spending for high-cost Medicare patients. JAMA. 2013;309:2572-2578. PubMed
6. Riley GF. Long-term trends in the concentration of Medicare spending. Health Aff (Millwood). 2007;26:808-816. PubMed
7. IHI Triple Aim Initiative. Institute for Healthcare Improvement. 2015. http://www.ihi.org/engage/initiatives/TripleAim/Pages/default.aspx. Accessed on June 17, 2016.
8. Johansen H, Nair C, Bond J. Who goes to the hospital? An investigation of high users of hospital days. Health Reports. 1994;6(2):253-277. PubMed
9. Conwell LJ, Cohen JW. Characteristics of persons with high medical expenditures in the US civilian noninstitutionalized population. MEPS Statistical Brief# 73. 2002. 
10. Lemstra M, Mackenbach J, Neudorf C, Nannapaneni U. High health care utilization and costs associated with lower socio-economic status: Results from a linked dataset. CJPH. 2009;100(3):180-183. PubMed
11. Macnee CL, McCabe S, Clarke PN, Fiske M, Campbell S. Typology of high users of health services among a rural medicaid population. Pub Health Nurs. 2009;26(5):396-404. PubMed
12. Nguyen OK, Tang N, Hillman JM, Gonzales R. What’s cost got to do with it? Association between hospital costs and frequency of admissions among “high users” of hospital care. J. Hosp Med. 2013;8(12):665-671. PubMed
13. Rosella LC, Fitzpatrick T, Wodchis WP, Calzavara A, Manson H, Goel V. High-cost health care users in Ontario, Canada: Demographic, socio-economic, and health status characteristics. BMC Health Serv Res. 2014;14(1):532. PubMed
14. Cohen SB. The Concentration of Health Care Expenditures and Related Expenses for Costly Medical Conditions, 2009. Agency for Healthcare Research and Quality Statistical Brief #359; 2012. 

15. Ronksley PE, McKay JA, Kobewka DM, Mulpuru S, Forster AJ. Patterns of health care use in a high-cost inpatient population in Ottawa, Ontario: A retrospective observational study. CMAJ Open. 2015; 3:E111-E118. PubMed
16. Hemmelgarn BR, Clement F, Manns BJ, et al. Overview of the Alberta Kidney Disease Network. BMC Nephrol. 2009;10:30. PubMed
17. DAD Resource Intensity Weights and Expected Length of Stay. Canadian Institute for Health Information. 2016. https://www.cihi.ca/en/data-and-standards/standards/case-mix/resource-indicators-dad-resource-intensity-weights-and. Accessed on June 24, 2016.
18. Statistics Canada. The Canadian Consumer Price Index Reference Paper, Statistics Canada Catalogue no. 62-553-X.
19. Tonelli M, Wiebe N, Fortin M, et al. Methods for identifying 30 chronic conditions: Application to administrative data. BMC Med Inform Decis Mak. 2015;17:15(1):1. PubMed
20. Jaakkimainen RL, Klein-Geltink J, Guttmann A, Barnsley J, Jagorski B, Kopp A. Indicators of primary care based on administrative data. In Primary Care in Ontario: ICES Atlas. Toronto, Ontario: Institute for Clinical Evaluative Sciences; 2006. 
21. Jee SH, Cabana MD. Indices for continuity of care: A systematic review of the literature. Med Care Res Rev. 2006;63:158-188. PubMed
22. Stata Statistical Software: Release 11. College Station, TX: StataCorp LP. 2009. 
23. Benchimol EI, Smeeth L, Guttmann A, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 2015;12(10):e1001885. PubMed
24. Tanuseputro P, Wodchis WP, Fowler R, et al. The health care cost of dying: A population-based retrospective cohort study of the last year of life in ontario, canada. PLoS One. 2015;10(3):e0121759. PubMed
25. Hong CS, Siegel AL, Ferris TG. Caring for high-need, high-cost patients: What makes for a successful care management program? Issue Brief (Commonw Fund). 2014;19:1-19. PubMed
26. Birnbaum M, Halper DE. Rethinking service delivery for high-cost Medicaid patients. Medicaid Institute. 2009. http://shnny.org/research/rethinking-service-delivery-for-high-cost-medicaid-patients/. Accessed on Jan 11, 2017.
27. Pan-Canadian forum on high users of health care. Canadian Institute for Health Information. 2014. https://secure.cihi.ca/free_products/highusers_summary_report_revised_EN_web.pdf. Accessed on Jan 11, 2017.

References

1. National Health Expenditure Trends, 1975 to 2015. Canadian Institute for Health Information. 2015. https://secure.cihi.ca/free_products/nhex_trends_narrative_report_2015_en.pdf. Accessed on June 23, 2016.
2. Berk ML, Monheit AC. The concentration of health care expenditures, revisited. Health Aff (Millwood). 2001;20:9-18. PubMed
3. Wodchis WP, Austin PC, Henry DA. A 3-year study of high-cost users of health care. CMAJ. 2016;188(3):182-188. PubMed
4. Forget EL, Roos LL, Deber RB, Wald R. Variations in Lifetime Healthcare Costs across a Population. Healthc Policy. 2008;4:e148-e167. PubMed
5. Joynt KE, Gawande AA, Orav EJ, Jha AK. Contribution of preventable acute care spending to total spending for high-cost Medicare patients. JAMA. 2013;309:2572-2578. PubMed
6. Riley GF. Long-term trends in the concentration of Medicare spending. Health Aff (Millwood). 2007;26:808-816. PubMed
7. IHI Triple Aim Initiative. Institute for Healthcare Improvement. 2015. http://www.ihi.org/engage/initiatives/TripleAim/Pages/default.aspx. Accessed on June 17, 2016.
8. Johansen H, Nair C, Bond J. Who goes to the hospital? An investigation of high users of hospital days. Health Reports. 1994;6(2):253-277. PubMed
9. Conwell LJ, Cohen JW. Characteristics of persons with high medical expenditures in the US civilian noninstitutionalized population. MEPS Statistical Brief# 73. 2002. 
10. Lemstra M, Mackenbach J, Neudorf C, Nannapaneni U. High health care utilization and costs associated with lower socio-economic status: Results from a linked dataset. CJPH. 2009;100(3):180-183. PubMed
11. Macnee CL, McCabe S, Clarke PN, Fiske M, Campbell S. Typology of high users of health services among a rural medicaid population. Pub Health Nurs. 2009;26(5):396-404. PubMed
12. Nguyen OK, Tang N, Hillman JM, Gonzales R. What’s cost got to do with it? Association between hospital costs and frequency of admissions among “high users” of hospital care. J. Hosp Med. 2013;8(12):665-671. PubMed
13. Rosella LC, Fitzpatrick T, Wodchis WP, Calzavara A, Manson H, Goel V. High-cost health care users in Ontario, Canada: Demographic, socio-economic, and health status characteristics. BMC Health Serv Res. 2014;14(1):532. PubMed
14. Cohen SB. The Concentration of Health Care Expenditures and Related Expenses for Costly Medical Conditions, 2009. Agency for Healthcare Research and Quality Statistical Brief #359; 2012. 

15. Ronksley PE, McKay JA, Kobewka DM, Mulpuru S, Forster AJ. Patterns of health care use in a high-cost inpatient population in Ottawa, Ontario: A retrospective observational study. CMAJ Open. 2015; 3:E111-E118. PubMed
16. Hemmelgarn BR, Clement F, Manns BJ, et al. Overview of the Alberta Kidney Disease Network. BMC Nephrol. 2009;10:30. PubMed
17. DAD Resource Intensity Weights and Expected Length of Stay. Canadian Institute for Health Information. 2016. https://www.cihi.ca/en/data-and-standards/standards/case-mix/resource-indicators-dad-resource-intensity-weights-and. Accessed on June 24, 2016.
18. Statistics Canada. The Canadian Consumer Price Index Reference Paper, Statistics Canada Catalogue no. 62-553-X.
19. Tonelli M, Wiebe N, Fortin M, et al. Methods for identifying 30 chronic conditions: Application to administrative data. BMC Med Inform Decis Mak. 2015;17:15(1):1. PubMed
20. Jaakkimainen RL, Klein-Geltink J, Guttmann A, Barnsley J, Jagorski B, Kopp A. Indicators of primary care based on administrative data. In Primary Care in Ontario: ICES Atlas. Toronto, Ontario: Institute for Clinical Evaluative Sciences; 2006. 
21. Jee SH, Cabana MD. Indices for continuity of care: A systematic review of the literature. Med Care Res Rev. 2006;63:158-188. PubMed
22. Stata Statistical Software: Release 11. College Station, TX: StataCorp LP. 2009. 
23. Benchimol EI, Smeeth L, Guttmann A, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 2015;12(10):e1001885. PubMed
24. Tanuseputro P, Wodchis WP, Fowler R, et al. The health care cost of dying: A population-based retrospective cohort study of the last year of life in ontario, canada. PLoS One. 2015;10(3):e0121759. PubMed
25. Hong CS, Siegel AL, Ferris TG. Caring for high-need, high-cost patients: What makes for a successful care management program? Issue Brief (Commonw Fund). 2014;19:1-19. PubMed
26. Birnbaum M, Halper DE. Rethinking service delivery for high-cost Medicaid patients. Medicaid Institute. 2009. http://shnny.org/research/rethinking-service-delivery-for-high-cost-medicaid-patients/. Accessed on Jan 11, 2017.
27. Pan-Canadian forum on high users of health care. Canadian Institute for Health Information. 2014. https://secure.cihi.ca/free_products/highusers_summary_report_revised_EN_web.pdf. Accessed on Jan 11, 2017.

Issue
Journal of Hospital Medicine 12 (8)
Issue
Journal of Hospital Medicine 12 (8)
Page Number
596-602
Page Number
596-602
Topics
Article Type
Display Headline
Comparison of Methods to Define High Use of Inpatient Services Using Population-Based Data
Display Headline
Comparison of Methods to Define High Use of Inpatient Services Using Population-Based Data
Sections
Article Source

© 2017 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
*Address for correspondence and reprint requests: Dr. Paul E. Ronksley, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, HSC G239, 3330 Hospital Drive NW, Calgary, Alberta, Canada T2N 4N1; Telephone: 403-220-8820; Fax: 403-210-9165; E-mail: [email protected]

Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Article PDF Media
Media Files

Is Simultaneous Bilateral Total Knee Arthroplasty (BTKA) as Safe as Staged BTKA?

Article Type
Changed
Display Headline
Is Simultaneous Bilateral Total Knee Arthroplasty (BTKA) as Safe as Staged BTKA?

Take-Home Points

  • Complication rates did not statistically significantly differ between simultaneous and staged TKA.
  • Length of stay of 2 TKA admissions was greater than 1 BTKA admission.
  • Transfusion requirements were greater in BTKA.
  • Avoid bilateral procedures in ASA 3 patients.
  • Develop institutional protocols for BTKA with multidisciplinary input.

In the United States, osteoarthritis is the most common cause of knee pain and one of the leading causes of disability.1 Total knee arthroplasty (TKA) is an effective treatment for end-stage osteoarthritis of the knee.2 Whether patients with severe, debilitating bilateral disease should undergo simultaneous bilateral TKA (BTKA) or staged BTKA (2 separate procedures during separate hospital admissions) continues to be debated. The relative risks and benefits of simultaneous BTKA relative to staged BTKA or unilateral TKA are controversial.3-6 Proponents of simultaneous BTKA have argued that this surgery results in shorter hospital length of stay (LOS) and higher patient satisfaction without increased risk of perioperative complications,7-9 and opponents have argued that it leads to increased perioperative mortality and complications and should not be performed routinely.10,11

The safety of simultaneous BTKA cannot necessarily be extrapolated from data on unilateral TKA. Authors have argued that the complication rate for simultaneous BTKA is not comparable to the rate for unilateral TKA but instead is double the rate.12 Although a doubled rate may more closely approximate the true risk of simultaneous BTKA, it still does not account for the increased surgical impact of 2 procedures (vs 1 procedure) on a patient. In this regard, comparing simultaneous and staged BTKA provides a more accurate assessment of risk, as long as the interval between surgeries is not excessive. The major stress experienced during TKA affects the cardiovascular, pulmonary, and musculoskeletal systems, and full recovery may take up to 6 months.13-15 Outcome studies have found significant improvement in validated measures of function and pain up to but not past 6 months.13,15 Furthermore, a large study comparing American Society of Anesthesiologists (ASA) scores with morbidity and mortality rates recorded in the New Zealand Total Joint Database established 6 months as a best approximation of postoperative mortality and morbidity risk.14 Given these data, we propose that the most accurate analysis of postoperative morbidity and mortality would be a comparison of simultaneous BTKA with BTKA staged <6 months apart. The staged procedures fall within the crucial postoperative period when increased morbidity and mortality would more likely be present. A between-surgeries interval >6 months would effectively separate the 2 procedures, rendering their risks not truly representative.

We retrospectively analyzed all simultaneous BTKA and staged BTKA (<6 months apart) surgeries performed at our orthopedic specialty hospital between 2005 and 2009. We hypothesized there would be no significant difference in perioperative morbidity or mortality between the groups.

Methods and Materials

Our institution’s Institutional Review Board approved this study. All patients who underwent either simultaneous BTKA or staged BTKA (<6 months apart) at a single orthopedic specialty hospital between 2005 and 2009 were retrospectively identified. Twenty-five surgeons performed the procedures. Which procedure to perform (simultaneous or staged) was decided by the attending surgeon in consultation with an anesthesiologist. Preoperative medical diagnostic testing was determined by the internist, who provided medical clearance, and was subject to review by the anesthesiologist. A patient was excluded from simultaneous BTKA only if the medical or anesthesiology consultant deemed the patient too high risk for bilateral procedures. Revision TKAs were excluded from the study.

Implant, approach, tourniquet use, and TKA technique were selected by the individual surgeons. Strategies for the simultaneous procedures were (1) single surgeon, single team, sequential, start second knee after closure of first, and (2) single surgeon, single team, sequential, start second knee after implantation of first but before closure. The decision to proceed with the second knee was confirmed in consultation with the anesthesiologist after implantation and deflation of the tourniquet on the first knee.

Individual electronic patient charts were reviewed for information on demographics, comorbidities, anesthesia type, antibiotics, and postoperative venous thromboembolism prophylaxis. Demographic variables included age, sex, height, weight, and body mass index (BMI). Comorbidities recorded were diabetes mellitus, coronary artery disease, prior myocardial infarction, stroke, and endocrinopathies. In addition, available ASA scores were recorded. The primary outcome was perioperative complications, defined as any complications that occurred within 6 months after surgery. These included death, pulmonary embolism (PE), and deep surgical-site infections (SSIs). Secondary outcome measures were LOS, discharge location (rehabilitation or home), and blood transfusion requirements.

The 2 groups (simultaneous BTKA, staged BTKA) were compared using Student t test for continuous variables and χ2 test for categorical variables. Subgroup analysis was performed to compare healthier patients (ASA score 1 or 2) with patients who had more severe comorbidities (ASA score 3). Statistical significance was set at P < .05.

Results

Between 2005 and 2009, 371 patients had simultaneous BTKA, and 67 had staged BTKA (134 procedures) <6 months apart (Table 1).

Mean recovery interval between staged procedures was 4.3 months (range, 2-6 months). Mean age was 63.9 years (range, 44-88 years) for the simultaneous BTKA patients and 63.1 years (range, 35-81 years) for the staged BTKA patients (P = .105). Both groups had proportionately more female patients (69.8% in the simultaneous BTKA group, 64.2% in the staged BTKA group), but there was no sex difference between the groups (P = .359). There were 71 (19.1%) morbidly obese patients (body mass index [BMI], ≥40 kg/m2) in the simultaneous group and 14 (20.9%) in the staged group (P = .739). The groups had statistically similar proportions of diabetes mellitus and coronary artery disease (P = .283).

Most surgeries (84.4% simultaneous, 90.3% staged) were performed with the patient under spinal anesthesia, and there was a trend (P = .167) toward more frequent use of general anesthesia in the simultaneous group relative to the staged group (Table 2).

Intraoperative antibiotics were given in all cases, and there were no significant differences in antibiotic type between the groups. Postoperative chemical venous thromboembolism prophylaxis was administered to all patients, depending on surgeon preference, and there were no significant differences between the groups.

The 2 cohorts’ perioperative complication rates were not statistically significantly different (P = .97) (Table 3). The simultaneous BTKA group had 13 complications: 7 PEs (1.9%), 5 deep SSIs (1.08%), and 1 respiratory arrest (0.27%). The staged BTKA group had only 1 complication, a deep SSI (0.75%). There were no significant differences in rates of individual complications (deep vein thrombosis, PE, SSI; P = .697) or intensive care unit admission (P = .312). Mean number of transfusion units was 1.39 for simultaneous BTKA and 0.66 for both staged TKAs combined (P = .042). Mean aggregated LOS for both procedures in the staged BTKA was 8.93 days per patient, and mean LOS for simultaneous BTKA was 4.94 days per patient, significantly shorter (P = .0001). The percentage of postoperative discharges from hospital to an inpatient acute rehabilitation center was significantly higher (P = .0001) in the simultaneous BTKA group (92.7%) than in the staged BTKA group (50.7%).

There was no statistically significant difference (P = .398) in occurrence of postoperative complications between the 2 cohorts compared on ASA scores, and the difference between patients with ASA score 1 or 2 and those with ASA score 3 was not statistically significant (P = .200) (Table 4). There was a trend (P = .161) toward more complications in 85 patients with BMI of ≥40 kg/m2 (morbidly obese), of whom 5 (5.9%) had a complication, than in 9 patients (2.6%) with BMI of <40 kg/m2, but the difference was not statistically significant because of the sample size.

Discussion

Although there was no significant difference in postoperative complication rates within 6 months after surgery between the simultaneous and staged BTKA groups, the incidence of complications in the simultaneous group was notable. The disproportionate size of the 2 comparison groups limited the power of our study to analyze individual perioperative complications. This study may be underpowered to detect differences in complications occurring relatively infrequently, which may explain why the difference in number of complications (13 in simultaneous group, 1 in staged group) did not achieve statistical significance (β = 0.89). Post hoc power analysis showed 956 patients would be needed in each group to adequately power for such small complication rates. However, our results are consistent with those of other studies.13-15 The 1.9% PE rate in our simultaneous BTKA group does not vary from the average PE rate for TKA in the literature and is actually lower than the PE rate in a previous study at our institution.16 Fat embolism traditionally is considered more of a concern in bilateral cases than in unilateral cases. Although fat embolism surely is inherent to the physiologic alterations caused by TKA, we did not find clinically significant fat embolism in either cohort.

Similarly, the 1.08% rate of deep SSIs is within the range for postoperative TKA infections at our institution and others.17 Our staged BTKA group’s complication rate, 0.75% (1 SSI), was slightly lower than expected. However, 0.75% is in keeping with institutional norms (typical rate, ~1%). We would have expected a nonzero rate for venous thromboembolism, and perhaps such a rate would have come with an inclusion period longer than 6 months. Last, the death in the simultaneous BTKA group was not an outlier, given the published rate of mortality after elective total joint surgery.18The characteristics of our simultaneous and staged BTKA groups were very similar (Table 1), though the larger number of staged-group patients with diabetes mellitus and coronary artery disease may represent selection bias. Nevertheless, the proportions of patients with each of 3 ASA scores were similar. It is also important to note that, in this context, a high percentage of patients in each group (33.6% simultaneous, 37.5% staged) received ASA score 3 from the anesthesiologist (P > .05). This may be an important factor in explaining the larger though not statistically significant number of complications in the simultaneous group (13) relative to the staged group (1).

We therefore consider ASA score 3 to be a contraindication to a bilateral procedure, and for simultaneous BTKA we have developed a set of exclusion criteria that include ASA score 3 or 4 (Table 5). These criteria reflect input from our surgeons, anesthesiologist, and medical specialists, as well as the data presented here.

Other authors have studied the safety of simultaneous vs staged BTKA and drawn conflicting conclusions.11,19-21 Walmsley and colleagues21 found no differences in 90-day mortality between 3 groups: patients with simultaneous BTKA, patients with BTKA staged within 5 years, and patients with unilateral TKA. Stefánsdóttir and colleagues11 found that, compared with simultaneous BTKA, BTKA staged within 1 year had a lower 30-day mortality rate. Meehan and colleagues20 compared simultaneous BTKA with BTKA staged within 1 year and found a lower risk of infection and device malfunction and a higher risk of adverse cardiovascular outcomes in the simultaneous group. A recent meta-analysis found that, compared with staged BTKA, simultaneous BTKA had a higher risk of perioperative complications.19 A systematic review of retrospective studies found simultaneous BTKA had higher rates of mortality, PE, and transfusion and lower rates of deep SSI and revision.22 A survey of Medicare data found higher 90-day mortality and myocardial infarction rates for simultaneous BTKA but no difference in infection and revision rates.23 Clearly, there is no consensus as to whether simultaneous BTKA carries higher risks relative to staged BTKA.

The amount of blood transfused in our simultaneous BTKA group was more than double that in the 2 staged TKAs combined. It is intuitive that the blood loss in 2 concurrent TKAs is always more than in 1 TKA, but the clinical relevance of this fact is unknown. Transfusions have potential complications, and this risk needs to be addressed in the preoperative discussion.

LOS for simultaneous BTKA was on average 4 days shorter than the combined LOS (2 hospitalizations) for staged BTKA. This shorter LOS has been shown to provide the healthcare system with a cost savings.8 However, not considered in the equation is the difference in cost of rehabilitations, 2 vs 1. In the present study, 92.7% of simultaneous BTKA patients and only 50.7% of staged BTKA patients were discharged to an inpatient acute rehabilitation unit. Interestingly, the majority of the staged patients who went to inpatient rehabilitation did so after the second surgery. At our institution at the time of this study, simultaneous BTKA patients, and staged BTKA patients with the second surgery completed, were more likely than unilateral TKA patients to qualify for inpatient acute rehabilitation. Staged BTKA patients’ higher cost for 2 rehabilitations, rather than 1, adds to the cost savings realized with simultaneous BTKA. In the context of an episode-based payment system, the cost of posthospital rehabilitation enters the overall cost equation and may lead to an increase in the number of simultaneous BTKAs being performed.

Conclusion

In this study, the incidence of postoperative complications was higher for simultaneous BTKA than for staged BTKA performed <6 months apart, but the difference was not significantly different. There were significant differences in LOS and blood transfusion rates between the groups, as expected. At present, only patients with ASA score 1 or 2 are considered for simultaneous BTKA at our institution. Patients with ASA score 3 or higher are not eligible.

Am J Orthop. 2017;46(4):E224-E229. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

References

1. Hootman JM, Helmick CG. Projections of US prevalence of arthritis and associated activity limitations. Arthritis Rheum. 2006;54(1):226-229.

2. Kolettis GT, Wixson RL, Peruzzi WT, Blake MJ, Wardell S, Stulberg SD. Safety of 1-stage bilateral total knee arthroplasty. Clin Orthop Relat Res. 1994;(309):102-109.

3. Kim YH, Choi YW, Kim JS. Simultaneous bilateral sequential total knee replacement is as safe as unilateral total knee replacement. J Bone Joint Surg Br. 2009;91(1):64-68.

4. Luscombe JC, Theivendran K, Abudu A, Carter SR. The relative safety of one-stage bilateral total knee arthroplasty. Int Orthop. 2009;33(1):101-104.

5. Memtsoudis SG, Ma Y, González Della Valle A, et al. Perioperative outcomes after unilateral and bilateral total knee arthroplasty. Anesthesiology. 2009;111(6):1206-1216.

6. Zeni JA Jr, Snyder-Mackler L. Clinical outcomes after simultaneous bilateral total knee arthroplasty: comparison to unilateral total knee arthroplasty and healthy controls. J Arthroplasty. 2010;25(4):541-546.

7. March LM, Cross M, Tribe KL, et al; Arthritis C.O.S.T. Study Project Group. Two knees or not two knees? Patient costs and outcomes following bilateral and unilateral total knee joint replacement surgery for OA. Osteoarthritis Cartilage. 2004;12(5):400-408.

8. Reuben JD, Meyers SJ, Cox DD, Elliott M, Watson M, Shim SD. Cost comparison between bilateral simultaneous, staged, and unilateral total joint arthroplasty. J Arthroplasty. 1998;13(2):172-179.

9. Ritter MA, Harty LD. Debate: simultaneous bilateral knee replacements: the outcomes justify its use. Clin Orthop Relat Res. 2004;(428):84-86.

10. Restrepo C, Parvizi J, Dietrich T, Einhorn TA. Safety of simultaneous bilateral total knee arthroplasty. A meta-analysis. J Bone Joint Surg Am. 2007;89(6):1220-1226.

11. Stefánsdóttir A, Lidgren L, Robertsson O. Higher early mortality with simultaneous rather than staged bilateral TKAs: results from the Swedish Knee Arthroplasty Register. Clin Orthop Relat Res. 2008;466(12):3066-3070.

12. Noble J, Goodall J, Noble D. Simultaneous bilateral total knee replacement: a persistent controversy. Knee. 2009;16(6):420-426.

13. Fortin PR, Penrod JR, Clarke AE, et al. Timing of total joint replacement affects clinical outcomes among patients with osteoarthritis of the hip or knee. Arthritis Rheum. 2002;46(12):3327-3330.

14. Hooper GJ, Rothwell AG, Hooper NM, Frampton C. The relationship between the American Society of Anesthesiologists physical rating and outcome following total hip and knee arthroplasty: an analysis of the New Zealand Joint Registry. J Bone Joint Surg Am. 2012;94(12):1065-1070.

15. MacWilliam CH, Yood MU, Verner JJ, McCarthy BD, Ward RE. Patient-related risk factors that predict poor outcome after total hip replacement. Health Serv Res. 1996;31(5):623-638.

16. Hadley SR, Lee M, Reid M, Dweck E, Steiger D. Predictors of pulmonary embolism in orthopaedic patient population. Abstract presented at: 43rd Annual Meeting of the Eastern Orthopaedic Association; June 20-23, 2012; Bolton Landing, NY.

17. Hadley S, Immerman I, Hutzler L, Slover J, Bosco J. Staphylococcus aureus decolonization protocol decreases surgical site infections for total joint replacement. Arthritis. 2010;2010:924518.

18. Singh JA, Lewallen DG. Ninety-day mortality in patients undergoing elective total hip or total knee arthroplasty. J Arthroplasty. 2012;27(8):1417-1422.e1.

19. Hu J, Liu Y, Lv Z, Li X, Qin X, Fan W. Mortality and morbidity associated with simultaneous bilateral or staged bilateral total knee arthroplasty: a meta-analysis. Arch Orthop Trauma Surg. 2011;131(9):1291-1298.

20. Meehan JP, Danielsen B, Tancredi DJ, Kim S, Jamali AA, White RH. A population-based comparison of the incidence of adverse outcomes after simultaneous-bilateral and staged-bilateral total knee arthroplasty. J Bone Joint Surg Am. 2011;93(23):2203-2213.

21. Walmsley P, Murray A, Brenkel IJ. The practice of bilateral, simultaneous total knee replacement in Scotland over the last decade. Data from the Scottish Arthroplasty Project. Knee. 2006;13(2):102-105.

22. Fu D, Li G, Chen K, Zeng H, Zhang X, Cai Z. Comparison of clinical outcome between simultaneous-bilateral and staged-bilateral total knee arthroplasty: a systematic review of retrospective studies. J Arthroplasty. 2013;28(7):1141-1147.

23. Bolognesi MP, Watters TS, Attarian DE, Wellman SS, Setoguchi S. Simultaneous vs staged bilateral total knee arthroplasty among Medicare beneficiaries, 2000–2009. J Arthroplasty. 2013;28(8 suppl):87-91.

Article PDF
Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Acknowledgment: The authors thank Emmanuel Koli, BS, for his help with data collection.

Issue
The American Journal of Orthopedics - 46(4)
Publications
Topics
Page Number
E224-E229
Sections
Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Acknowledgment: The authors thank Emmanuel Koli, BS, for his help with data collection.

Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Acknowledgment: The authors thank Emmanuel Koli, BS, for his help with data collection.

Article PDF
Article PDF

Take-Home Points

  • Complication rates did not statistically significantly differ between simultaneous and staged TKA.
  • Length of stay of 2 TKA admissions was greater than 1 BTKA admission.
  • Transfusion requirements were greater in BTKA.
  • Avoid bilateral procedures in ASA 3 patients.
  • Develop institutional protocols for BTKA with multidisciplinary input.

In the United States, osteoarthritis is the most common cause of knee pain and one of the leading causes of disability.1 Total knee arthroplasty (TKA) is an effective treatment for end-stage osteoarthritis of the knee.2 Whether patients with severe, debilitating bilateral disease should undergo simultaneous bilateral TKA (BTKA) or staged BTKA (2 separate procedures during separate hospital admissions) continues to be debated. The relative risks and benefits of simultaneous BTKA relative to staged BTKA or unilateral TKA are controversial.3-6 Proponents of simultaneous BTKA have argued that this surgery results in shorter hospital length of stay (LOS) and higher patient satisfaction without increased risk of perioperative complications,7-9 and opponents have argued that it leads to increased perioperative mortality and complications and should not be performed routinely.10,11

The safety of simultaneous BTKA cannot necessarily be extrapolated from data on unilateral TKA. Authors have argued that the complication rate for simultaneous BTKA is not comparable to the rate for unilateral TKA but instead is double the rate.12 Although a doubled rate may more closely approximate the true risk of simultaneous BTKA, it still does not account for the increased surgical impact of 2 procedures (vs 1 procedure) on a patient. In this regard, comparing simultaneous and staged BTKA provides a more accurate assessment of risk, as long as the interval between surgeries is not excessive. The major stress experienced during TKA affects the cardiovascular, pulmonary, and musculoskeletal systems, and full recovery may take up to 6 months.13-15 Outcome studies have found significant improvement in validated measures of function and pain up to but not past 6 months.13,15 Furthermore, a large study comparing American Society of Anesthesiologists (ASA) scores with morbidity and mortality rates recorded in the New Zealand Total Joint Database established 6 months as a best approximation of postoperative mortality and morbidity risk.14 Given these data, we propose that the most accurate analysis of postoperative morbidity and mortality would be a comparison of simultaneous BTKA with BTKA staged <6 months apart. The staged procedures fall within the crucial postoperative period when increased morbidity and mortality would more likely be present. A between-surgeries interval >6 months would effectively separate the 2 procedures, rendering their risks not truly representative.

We retrospectively analyzed all simultaneous BTKA and staged BTKA (<6 months apart) surgeries performed at our orthopedic specialty hospital between 2005 and 2009. We hypothesized there would be no significant difference in perioperative morbidity or mortality between the groups.

Methods and Materials

Our institution’s Institutional Review Board approved this study. All patients who underwent either simultaneous BTKA or staged BTKA (<6 months apart) at a single orthopedic specialty hospital between 2005 and 2009 were retrospectively identified. Twenty-five surgeons performed the procedures. Which procedure to perform (simultaneous or staged) was decided by the attending surgeon in consultation with an anesthesiologist. Preoperative medical diagnostic testing was determined by the internist, who provided medical clearance, and was subject to review by the anesthesiologist. A patient was excluded from simultaneous BTKA only if the medical or anesthesiology consultant deemed the patient too high risk for bilateral procedures. Revision TKAs were excluded from the study.

Implant, approach, tourniquet use, and TKA technique were selected by the individual surgeons. Strategies for the simultaneous procedures were (1) single surgeon, single team, sequential, start second knee after closure of first, and (2) single surgeon, single team, sequential, start second knee after implantation of first but before closure. The decision to proceed with the second knee was confirmed in consultation with the anesthesiologist after implantation and deflation of the tourniquet on the first knee.

Individual electronic patient charts were reviewed for information on demographics, comorbidities, anesthesia type, antibiotics, and postoperative venous thromboembolism prophylaxis. Demographic variables included age, sex, height, weight, and body mass index (BMI). Comorbidities recorded were diabetes mellitus, coronary artery disease, prior myocardial infarction, stroke, and endocrinopathies. In addition, available ASA scores were recorded. The primary outcome was perioperative complications, defined as any complications that occurred within 6 months after surgery. These included death, pulmonary embolism (PE), and deep surgical-site infections (SSIs). Secondary outcome measures were LOS, discharge location (rehabilitation or home), and blood transfusion requirements.

The 2 groups (simultaneous BTKA, staged BTKA) were compared using Student t test for continuous variables and χ2 test for categorical variables. Subgroup analysis was performed to compare healthier patients (ASA score 1 or 2) with patients who had more severe comorbidities (ASA score 3). Statistical significance was set at P < .05.

Results

Between 2005 and 2009, 371 patients had simultaneous BTKA, and 67 had staged BTKA (134 procedures) <6 months apart (Table 1).

Mean recovery interval between staged procedures was 4.3 months (range, 2-6 months). Mean age was 63.9 years (range, 44-88 years) for the simultaneous BTKA patients and 63.1 years (range, 35-81 years) for the staged BTKA patients (P = .105). Both groups had proportionately more female patients (69.8% in the simultaneous BTKA group, 64.2% in the staged BTKA group), but there was no sex difference between the groups (P = .359). There were 71 (19.1%) morbidly obese patients (body mass index [BMI], ≥40 kg/m2) in the simultaneous group and 14 (20.9%) in the staged group (P = .739). The groups had statistically similar proportions of diabetes mellitus and coronary artery disease (P = .283).

Most surgeries (84.4% simultaneous, 90.3% staged) were performed with the patient under spinal anesthesia, and there was a trend (P = .167) toward more frequent use of general anesthesia in the simultaneous group relative to the staged group (Table 2).

Intraoperative antibiotics were given in all cases, and there were no significant differences in antibiotic type between the groups. Postoperative chemical venous thromboembolism prophylaxis was administered to all patients, depending on surgeon preference, and there were no significant differences between the groups.

The 2 cohorts’ perioperative complication rates were not statistically significantly different (P = .97) (Table 3). The simultaneous BTKA group had 13 complications: 7 PEs (1.9%), 5 deep SSIs (1.08%), and 1 respiratory arrest (0.27%). The staged BTKA group had only 1 complication, a deep SSI (0.75%). There were no significant differences in rates of individual complications (deep vein thrombosis, PE, SSI; P = .697) or intensive care unit admission (P = .312). Mean number of transfusion units was 1.39 for simultaneous BTKA and 0.66 for both staged TKAs combined (P = .042). Mean aggregated LOS for both procedures in the staged BTKA was 8.93 days per patient, and mean LOS for simultaneous BTKA was 4.94 days per patient, significantly shorter (P = .0001). The percentage of postoperative discharges from hospital to an inpatient acute rehabilitation center was significantly higher (P = .0001) in the simultaneous BTKA group (92.7%) than in the staged BTKA group (50.7%).

There was no statistically significant difference (P = .398) in occurrence of postoperative complications between the 2 cohorts compared on ASA scores, and the difference between patients with ASA score 1 or 2 and those with ASA score 3 was not statistically significant (P = .200) (Table 4). There was a trend (P = .161) toward more complications in 85 patients with BMI of ≥40 kg/m2 (morbidly obese), of whom 5 (5.9%) had a complication, than in 9 patients (2.6%) with BMI of <40 kg/m2, but the difference was not statistically significant because of the sample size.

Discussion

Although there was no significant difference in postoperative complication rates within 6 months after surgery between the simultaneous and staged BTKA groups, the incidence of complications in the simultaneous group was notable. The disproportionate size of the 2 comparison groups limited the power of our study to analyze individual perioperative complications. This study may be underpowered to detect differences in complications occurring relatively infrequently, which may explain why the difference in number of complications (13 in simultaneous group, 1 in staged group) did not achieve statistical significance (β = 0.89). Post hoc power analysis showed 956 patients would be needed in each group to adequately power for such small complication rates. However, our results are consistent with those of other studies.13-15 The 1.9% PE rate in our simultaneous BTKA group does not vary from the average PE rate for TKA in the literature and is actually lower than the PE rate in a previous study at our institution.16 Fat embolism traditionally is considered more of a concern in bilateral cases than in unilateral cases. Although fat embolism surely is inherent to the physiologic alterations caused by TKA, we did not find clinically significant fat embolism in either cohort.

Similarly, the 1.08% rate of deep SSIs is within the range for postoperative TKA infections at our institution and others.17 Our staged BTKA group’s complication rate, 0.75% (1 SSI), was slightly lower than expected. However, 0.75% is in keeping with institutional norms (typical rate, ~1%). We would have expected a nonzero rate for venous thromboembolism, and perhaps such a rate would have come with an inclusion period longer than 6 months. Last, the death in the simultaneous BTKA group was not an outlier, given the published rate of mortality after elective total joint surgery.18The characteristics of our simultaneous and staged BTKA groups were very similar (Table 1), though the larger number of staged-group patients with diabetes mellitus and coronary artery disease may represent selection bias. Nevertheless, the proportions of patients with each of 3 ASA scores were similar. It is also important to note that, in this context, a high percentage of patients in each group (33.6% simultaneous, 37.5% staged) received ASA score 3 from the anesthesiologist (P > .05). This may be an important factor in explaining the larger though not statistically significant number of complications in the simultaneous group (13) relative to the staged group (1).

We therefore consider ASA score 3 to be a contraindication to a bilateral procedure, and for simultaneous BTKA we have developed a set of exclusion criteria that include ASA score 3 or 4 (Table 5). These criteria reflect input from our surgeons, anesthesiologist, and medical specialists, as well as the data presented here.

Other authors have studied the safety of simultaneous vs staged BTKA and drawn conflicting conclusions.11,19-21 Walmsley and colleagues21 found no differences in 90-day mortality between 3 groups: patients with simultaneous BTKA, patients with BTKA staged within 5 years, and patients with unilateral TKA. Stefánsdóttir and colleagues11 found that, compared with simultaneous BTKA, BTKA staged within 1 year had a lower 30-day mortality rate. Meehan and colleagues20 compared simultaneous BTKA with BTKA staged within 1 year and found a lower risk of infection and device malfunction and a higher risk of adverse cardiovascular outcomes in the simultaneous group. A recent meta-analysis found that, compared with staged BTKA, simultaneous BTKA had a higher risk of perioperative complications.19 A systematic review of retrospective studies found simultaneous BTKA had higher rates of mortality, PE, and transfusion and lower rates of deep SSI and revision.22 A survey of Medicare data found higher 90-day mortality and myocardial infarction rates for simultaneous BTKA but no difference in infection and revision rates.23 Clearly, there is no consensus as to whether simultaneous BTKA carries higher risks relative to staged BTKA.

The amount of blood transfused in our simultaneous BTKA group was more than double that in the 2 staged TKAs combined. It is intuitive that the blood loss in 2 concurrent TKAs is always more than in 1 TKA, but the clinical relevance of this fact is unknown. Transfusions have potential complications, and this risk needs to be addressed in the preoperative discussion.

LOS for simultaneous BTKA was on average 4 days shorter than the combined LOS (2 hospitalizations) for staged BTKA. This shorter LOS has been shown to provide the healthcare system with a cost savings.8 However, not considered in the equation is the difference in cost of rehabilitations, 2 vs 1. In the present study, 92.7% of simultaneous BTKA patients and only 50.7% of staged BTKA patients were discharged to an inpatient acute rehabilitation unit. Interestingly, the majority of the staged patients who went to inpatient rehabilitation did so after the second surgery. At our institution at the time of this study, simultaneous BTKA patients, and staged BTKA patients with the second surgery completed, were more likely than unilateral TKA patients to qualify for inpatient acute rehabilitation. Staged BTKA patients’ higher cost for 2 rehabilitations, rather than 1, adds to the cost savings realized with simultaneous BTKA. In the context of an episode-based payment system, the cost of posthospital rehabilitation enters the overall cost equation and may lead to an increase in the number of simultaneous BTKAs being performed.

Conclusion

In this study, the incidence of postoperative complications was higher for simultaneous BTKA than for staged BTKA performed <6 months apart, but the difference was not significantly different. There were significant differences in LOS and blood transfusion rates between the groups, as expected. At present, only patients with ASA score 1 or 2 are considered for simultaneous BTKA at our institution. Patients with ASA score 3 or higher are not eligible.

Am J Orthop. 2017;46(4):E224-E229. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

Take-Home Points

  • Complication rates did not statistically significantly differ between simultaneous and staged TKA.
  • Length of stay of 2 TKA admissions was greater than 1 BTKA admission.
  • Transfusion requirements were greater in BTKA.
  • Avoid bilateral procedures in ASA 3 patients.
  • Develop institutional protocols for BTKA with multidisciplinary input.

In the United States, osteoarthritis is the most common cause of knee pain and one of the leading causes of disability.1 Total knee arthroplasty (TKA) is an effective treatment for end-stage osteoarthritis of the knee.2 Whether patients with severe, debilitating bilateral disease should undergo simultaneous bilateral TKA (BTKA) or staged BTKA (2 separate procedures during separate hospital admissions) continues to be debated. The relative risks and benefits of simultaneous BTKA relative to staged BTKA or unilateral TKA are controversial.3-6 Proponents of simultaneous BTKA have argued that this surgery results in shorter hospital length of stay (LOS) and higher patient satisfaction without increased risk of perioperative complications,7-9 and opponents have argued that it leads to increased perioperative mortality and complications and should not be performed routinely.10,11

The safety of simultaneous BTKA cannot necessarily be extrapolated from data on unilateral TKA. Authors have argued that the complication rate for simultaneous BTKA is not comparable to the rate for unilateral TKA but instead is double the rate.12 Although a doubled rate may more closely approximate the true risk of simultaneous BTKA, it still does not account for the increased surgical impact of 2 procedures (vs 1 procedure) on a patient. In this regard, comparing simultaneous and staged BTKA provides a more accurate assessment of risk, as long as the interval between surgeries is not excessive. The major stress experienced during TKA affects the cardiovascular, pulmonary, and musculoskeletal systems, and full recovery may take up to 6 months.13-15 Outcome studies have found significant improvement in validated measures of function and pain up to but not past 6 months.13,15 Furthermore, a large study comparing American Society of Anesthesiologists (ASA) scores with morbidity and mortality rates recorded in the New Zealand Total Joint Database established 6 months as a best approximation of postoperative mortality and morbidity risk.14 Given these data, we propose that the most accurate analysis of postoperative morbidity and mortality would be a comparison of simultaneous BTKA with BTKA staged <6 months apart. The staged procedures fall within the crucial postoperative period when increased morbidity and mortality would more likely be present. A between-surgeries interval >6 months would effectively separate the 2 procedures, rendering their risks not truly representative.

We retrospectively analyzed all simultaneous BTKA and staged BTKA (<6 months apart) surgeries performed at our orthopedic specialty hospital between 2005 and 2009. We hypothesized there would be no significant difference in perioperative morbidity or mortality between the groups.

Methods and Materials

Our institution’s Institutional Review Board approved this study. All patients who underwent either simultaneous BTKA or staged BTKA (<6 months apart) at a single orthopedic specialty hospital between 2005 and 2009 were retrospectively identified. Twenty-five surgeons performed the procedures. Which procedure to perform (simultaneous or staged) was decided by the attending surgeon in consultation with an anesthesiologist. Preoperative medical diagnostic testing was determined by the internist, who provided medical clearance, and was subject to review by the anesthesiologist. A patient was excluded from simultaneous BTKA only if the medical or anesthesiology consultant deemed the patient too high risk for bilateral procedures. Revision TKAs were excluded from the study.

Implant, approach, tourniquet use, and TKA technique were selected by the individual surgeons. Strategies for the simultaneous procedures were (1) single surgeon, single team, sequential, start second knee after closure of first, and (2) single surgeon, single team, sequential, start second knee after implantation of first but before closure. The decision to proceed with the second knee was confirmed in consultation with the anesthesiologist after implantation and deflation of the tourniquet on the first knee.

Individual electronic patient charts were reviewed for information on demographics, comorbidities, anesthesia type, antibiotics, and postoperative venous thromboembolism prophylaxis. Demographic variables included age, sex, height, weight, and body mass index (BMI). Comorbidities recorded were diabetes mellitus, coronary artery disease, prior myocardial infarction, stroke, and endocrinopathies. In addition, available ASA scores were recorded. The primary outcome was perioperative complications, defined as any complications that occurred within 6 months after surgery. These included death, pulmonary embolism (PE), and deep surgical-site infections (SSIs). Secondary outcome measures were LOS, discharge location (rehabilitation or home), and blood transfusion requirements.

The 2 groups (simultaneous BTKA, staged BTKA) were compared using Student t test for continuous variables and χ2 test for categorical variables. Subgroup analysis was performed to compare healthier patients (ASA score 1 or 2) with patients who had more severe comorbidities (ASA score 3). Statistical significance was set at P < .05.

Results

Between 2005 and 2009, 371 patients had simultaneous BTKA, and 67 had staged BTKA (134 procedures) <6 months apart (Table 1).

Mean recovery interval between staged procedures was 4.3 months (range, 2-6 months). Mean age was 63.9 years (range, 44-88 years) for the simultaneous BTKA patients and 63.1 years (range, 35-81 years) for the staged BTKA patients (P = .105). Both groups had proportionately more female patients (69.8% in the simultaneous BTKA group, 64.2% in the staged BTKA group), but there was no sex difference between the groups (P = .359). There were 71 (19.1%) morbidly obese patients (body mass index [BMI], ≥40 kg/m2) in the simultaneous group and 14 (20.9%) in the staged group (P = .739). The groups had statistically similar proportions of diabetes mellitus and coronary artery disease (P = .283).

Most surgeries (84.4% simultaneous, 90.3% staged) were performed with the patient under spinal anesthesia, and there was a trend (P = .167) toward more frequent use of general anesthesia in the simultaneous group relative to the staged group (Table 2).

Intraoperative antibiotics were given in all cases, and there were no significant differences in antibiotic type between the groups. Postoperative chemical venous thromboembolism prophylaxis was administered to all patients, depending on surgeon preference, and there were no significant differences between the groups.

The 2 cohorts’ perioperative complication rates were not statistically significantly different (P = .97) (Table 3). The simultaneous BTKA group had 13 complications: 7 PEs (1.9%), 5 deep SSIs (1.08%), and 1 respiratory arrest (0.27%). The staged BTKA group had only 1 complication, a deep SSI (0.75%). There were no significant differences in rates of individual complications (deep vein thrombosis, PE, SSI; P = .697) or intensive care unit admission (P = .312). Mean number of transfusion units was 1.39 for simultaneous BTKA and 0.66 for both staged TKAs combined (P = .042). Mean aggregated LOS for both procedures in the staged BTKA was 8.93 days per patient, and mean LOS for simultaneous BTKA was 4.94 days per patient, significantly shorter (P = .0001). The percentage of postoperative discharges from hospital to an inpatient acute rehabilitation center was significantly higher (P = .0001) in the simultaneous BTKA group (92.7%) than in the staged BTKA group (50.7%).

There was no statistically significant difference (P = .398) in occurrence of postoperative complications between the 2 cohorts compared on ASA scores, and the difference between patients with ASA score 1 or 2 and those with ASA score 3 was not statistically significant (P = .200) (Table 4). There was a trend (P = .161) toward more complications in 85 patients with BMI of ≥40 kg/m2 (morbidly obese), of whom 5 (5.9%) had a complication, than in 9 patients (2.6%) with BMI of <40 kg/m2, but the difference was not statistically significant because of the sample size.

Discussion

Although there was no significant difference in postoperative complication rates within 6 months after surgery between the simultaneous and staged BTKA groups, the incidence of complications in the simultaneous group was notable. The disproportionate size of the 2 comparison groups limited the power of our study to analyze individual perioperative complications. This study may be underpowered to detect differences in complications occurring relatively infrequently, which may explain why the difference in number of complications (13 in simultaneous group, 1 in staged group) did not achieve statistical significance (β = 0.89). Post hoc power analysis showed 956 patients would be needed in each group to adequately power for such small complication rates. However, our results are consistent with those of other studies.13-15 The 1.9% PE rate in our simultaneous BTKA group does not vary from the average PE rate for TKA in the literature and is actually lower than the PE rate in a previous study at our institution.16 Fat embolism traditionally is considered more of a concern in bilateral cases than in unilateral cases. Although fat embolism surely is inherent to the physiologic alterations caused by TKA, we did not find clinically significant fat embolism in either cohort.

Similarly, the 1.08% rate of deep SSIs is within the range for postoperative TKA infections at our institution and others.17 Our staged BTKA group’s complication rate, 0.75% (1 SSI), was slightly lower than expected. However, 0.75% is in keeping with institutional norms (typical rate, ~1%). We would have expected a nonzero rate for venous thromboembolism, and perhaps such a rate would have come with an inclusion period longer than 6 months. Last, the death in the simultaneous BTKA group was not an outlier, given the published rate of mortality after elective total joint surgery.18The characteristics of our simultaneous and staged BTKA groups were very similar (Table 1), though the larger number of staged-group patients with diabetes mellitus and coronary artery disease may represent selection bias. Nevertheless, the proportions of patients with each of 3 ASA scores were similar. It is also important to note that, in this context, a high percentage of patients in each group (33.6% simultaneous, 37.5% staged) received ASA score 3 from the anesthesiologist (P > .05). This may be an important factor in explaining the larger though not statistically significant number of complications in the simultaneous group (13) relative to the staged group (1).

We therefore consider ASA score 3 to be a contraindication to a bilateral procedure, and for simultaneous BTKA we have developed a set of exclusion criteria that include ASA score 3 or 4 (Table 5). These criteria reflect input from our surgeons, anesthesiologist, and medical specialists, as well as the data presented here.

Other authors have studied the safety of simultaneous vs staged BTKA and drawn conflicting conclusions.11,19-21 Walmsley and colleagues21 found no differences in 90-day mortality between 3 groups: patients with simultaneous BTKA, patients with BTKA staged within 5 years, and patients with unilateral TKA. Stefánsdóttir and colleagues11 found that, compared with simultaneous BTKA, BTKA staged within 1 year had a lower 30-day mortality rate. Meehan and colleagues20 compared simultaneous BTKA with BTKA staged within 1 year and found a lower risk of infection and device malfunction and a higher risk of adverse cardiovascular outcomes in the simultaneous group. A recent meta-analysis found that, compared with staged BTKA, simultaneous BTKA had a higher risk of perioperative complications.19 A systematic review of retrospective studies found simultaneous BTKA had higher rates of mortality, PE, and transfusion and lower rates of deep SSI and revision.22 A survey of Medicare data found higher 90-day mortality and myocardial infarction rates for simultaneous BTKA but no difference in infection and revision rates.23 Clearly, there is no consensus as to whether simultaneous BTKA carries higher risks relative to staged BTKA.

The amount of blood transfused in our simultaneous BTKA group was more than double that in the 2 staged TKAs combined. It is intuitive that the blood loss in 2 concurrent TKAs is always more than in 1 TKA, but the clinical relevance of this fact is unknown. Transfusions have potential complications, and this risk needs to be addressed in the preoperative discussion.

LOS for simultaneous BTKA was on average 4 days shorter than the combined LOS (2 hospitalizations) for staged BTKA. This shorter LOS has been shown to provide the healthcare system with a cost savings.8 However, not considered in the equation is the difference in cost of rehabilitations, 2 vs 1. In the present study, 92.7% of simultaneous BTKA patients and only 50.7% of staged BTKA patients were discharged to an inpatient acute rehabilitation unit. Interestingly, the majority of the staged patients who went to inpatient rehabilitation did so after the second surgery. At our institution at the time of this study, simultaneous BTKA patients, and staged BTKA patients with the second surgery completed, were more likely than unilateral TKA patients to qualify for inpatient acute rehabilitation. Staged BTKA patients’ higher cost for 2 rehabilitations, rather than 1, adds to the cost savings realized with simultaneous BTKA. In the context of an episode-based payment system, the cost of posthospital rehabilitation enters the overall cost equation and may lead to an increase in the number of simultaneous BTKAs being performed.

Conclusion

In this study, the incidence of postoperative complications was higher for simultaneous BTKA than for staged BTKA performed <6 months apart, but the difference was not significantly different. There were significant differences in LOS and blood transfusion rates between the groups, as expected. At present, only patients with ASA score 1 or 2 are considered for simultaneous BTKA at our institution. Patients with ASA score 3 or higher are not eligible.

Am J Orthop. 2017;46(4):E224-E229. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

References

1. Hootman JM, Helmick CG. Projections of US prevalence of arthritis and associated activity limitations. Arthritis Rheum. 2006;54(1):226-229.

2. Kolettis GT, Wixson RL, Peruzzi WT, Blake MJ, Wardell S, Stulberg SD. Safety of 1-stage bilateral total knee arthroplasty. Clin Orthop Relat Res. 1994;(309):102-109.

3. Kim YH, Choi YW, Kim JS. Simultaneous bilateral sequential total knee replacement is as safe as unilateral total knee replacement. J Bone Joint Surg Br. 2009;91(1):64-68.

4. Luscombe JC, Theivendran K, Abudu A, Carter SR. The relative safety of one-stage bilateral total knee arthroplasty. Int Orthop. 2009;33(1):101-104.

5. Memtsoudis SG, Ma Y, González Della Valle A, et al. Perioperative outcomes after unilateral and bilateral total knee arthroplasty. Anesthesiology. 2009;111(6):1206-1216.

6. Zeni JA Jr, Snyder-Mackler L. Clinical outcomes after simultaneous bilateral total knee arthroplasty: comparison to unilateral total knee arthroplasty and healthy controls. J Arthroplasty. 2010;25(4):541-546.

7. March LM, Cross M, Tribe KL, et al; Arthritis C.O.S.T. Study Project Group. Two knees or not two knees? Patient costs and outcomes following bilateral and unilateral total knee joint replacement surgery for OA. Osteoarthritis Cartilage. 2004;12(5):400-408.

8. Reuben JD, Meyers SJ, Cox DD, Elliott M, Watson M, Shim SD. Cost comparison between bilateral simultaneous, staged, and unilateral total joint arthroplasty. J Arthroplasty. 1998;13(2):172-179.

9. Ritter MA, Harty LD. Debate: simultaneous bilateral knee replacements: the outcomes justify its use. Clin Orthop Relat Res. 2004;(428):84-86.

10. Restrepo C, Parvizi J, Dietrich T, Einhorn TA. Safety of simultaneous bilateral total knee arthroplasty. A meta-analysis. J Bone Joint Surg Am. 2007;89(6):1220-1226.

11. Stefánsdóttir A, Lidgren L, Robertsson O. Higher early mortality with simultaneous rather than staged bilateral TKAs: results from the Swedish Knee Arthroplasty Register. Clin Orthop Relat Res. 2008;466(12):3066-3070.

12. Noble J, Goodall J, Noble D. Simultaneous bilateral total knee replacement: a persistent controversy. Knee. 2009;16(6):420-426.

13. Fortin PR, Penrod JR, Clarke AE, et al. Timing of total joint replacement affects clinical outcomes among patients with osteoarthritis of the hip or knee. Arthritis Rheum. 2002;46(12):3327-3330.

14. Hooper GJ, Rothwell AG, Hooper NM, Frampton C. The relationship between the American Society of Anesthesiologists physical rating and outcome following total hip and knee arthroplasty: an analysis of the New Zealand Joint Registry. J Bone Joint Surg Am. 2012;94(12):1065-1070.

15. MacWilliam CH, Yood MU, Verner JJ, McCarthy BD, Ward RE. Patient-related risk factors that predict poor outcome after total hip replacement. Health Serv Res. 1996;31(5):623-638.

16. Hadley SR, Lee M, Reid M, Dweck E, Steiger D. Predictors of pulmonary embolism in orthopaedic patient population. Abstract presented at: 43rd Annual Meeting of the Eastern Orthopaedic Association; June 20-23, 2012; Bolton Landing, NY.

17. Hadley S, Immerman I, Hutzler L, Slover J, Bosco J. Staphylococcus aureus decolonization protocol decreases surgical site infections for total joint replacement. Arthritis. 2010;2010:924518.

18. Singh JA, Lewallen DG. Ninety-day mortality in patients undergoing elective total hip or total knee arthroplasty. J Arthroplasty. 2012;27(8):1417-1422.e1.

19. Hu J, Liu Y, Lv Z, Li X, Qin X, Fan W. Mortality and morbidity associated with simultaneous bilateral or staged bilateral total knee arthroplasty: a meta-analysis. Arch Orthop Trauma Surg. 2011;131(9):1291-1298.

20. Meehan JP, Danielsen B, Tancredi DJ, Kim S, Jamali AA, White RH. A population-based comparison of the incidence of adverse outcomes after simultaneous-bilateral and staged-bilateral total knee arthroplasty. J Bone Joint Surg Am. 2011;93(23):2203-2213.

21. Walmsley P, Murray A, Brenkel IJ. The practice of bilateral, simultaneous total knee replacement in Scotland over the last decade. Data from the Scottish Arthroplasty Project. Knee. 2006;13(2):102-105.

22. Fu D, Li G, Chen K, Zeng H, Zhang X, Cai Z. Comparison of clinical outcome between simultaneous-bilateral and staged-bilateral total knee arthroplasty: a systematic review of retrospective studies. J Arthroplasty. 2013;28(7):1141-1147.

23. Bolognesi MP, Watters TS, Attarian DE, Wellman SS, Setoguchi S. Simultaneous vs staged bilateral total knee arthroplasty among Medicare beneficiaries, 2000–2009. J Arthroplasty. 2013;28(8 suppl):87-91.

References

1. Hootman JM, Helmick CG. Projections of US prevalence of arthritis and associated activity limitations. Arthritis Rheum. 2006;54(1):226-229.

2. Kolettis GT, Wixson RL, Peruzzi WT, Blake MJ, Wardell S, Stulberg SD. Safety of 1-stage bilateral total knee arthroplasty. Clin Orthop Relat Res. 1994;(309):102-109.

3. Kim YH, Choi YW, Kim JS. Simultaneous bilateral sequential total knee replacement is as safe as unilateral total knee replacement. J Bone Joint Surg Br. 2009;91(1):64-68.

4. Luscombe JC, Theivendran K, Abudu A, Carter SR. The relative safety of one-stage bilateral total knee arthroplasty. Int Orthop. 2009;33(1):101-104.

5. Memtsoudis SG, Ma Y, González Della Valle A, et al. Perioperative outcomes after unilateral and bilateral total knee arthroplasty. Anesthesiology. 2009;111(6):1206-1216.

6. Zeni JA Jr, Snyder-Mackler L. Clinical outcomes after simultaneous bilateral total knee arthroplasty: comparison to unilateral total knee arthroplasty and healthy controls. J Arthroplasty. 2010;25(4):541-546.

7. March LM, Cross M, Tribe KL, et al; Arthritis C.O.S.T. Study Project Group. Two knees or not two knees? Patient costs and outcomes following bilateral and unilateral total knee joint replacement surgery for OA. Osteoarthritis Cartilage. 2004;12(5):400-408.

8. Reuben JD, Meyers SJ, Cox DD, Elliott M, Watson M, Shim SD. Cost comparison between bilateral simultaneous, staged, and unilateral total joint arthroplasty. J Arthroplasty. 1998;13(2):172-179.

9. Ritter MA, Harty LD. Debate: simultaneous bilateral knee replacements: the outcomes justify its use. Clin Orthop Relat Res. 2004;(428):84-86.

10. Restrepo C, Parvizi J, Dietrich T, Einhorn TA. Safety of simultaneous bilateral total knee arthroplasty. A meta-analysis. J Bone Joint Surg Am. 2007;89(6):1220-1226.

11. Stefánsdóttir A, Lidgren L, Robertsson O. Higher early mortality with simultaneous rather than staged bilateral TKAs: results from the Swedish Knee Arthroplasty Register. Clin Orthop Relat Res. 2008;466(12):3066-3070.

12. Noble J, Goodall J, Noble D. Simultaneous bilateral total knee replacement: a persistent controversy. Knee. 2009;16(6):420-426.

13. Fortin PR, Penrod JR, Clarke AE, et al. Timing of total joint replacement affects clinical outcomes among patients with osteoarthritis of the hip or knee. Arthritis Rheum. 2002;46(12):3327-3330.

14. Hooper GJ, Rothwell AG, Hooper NM, Frampton C. The relationship between the American Society of Anesthesiologists physical rating and outcome following total hip and knee arthroplasty: an analysis of the New Zealand Joint Registry. J Bone Joint Surg Am. 2012;94(12):1065-1070.

15. MacWilliam CH, Yood MU, Verner JJ, McCarthy BD, Ward RE. Patient-related risk factors that predict poor outcome after total hip replacement. Health Serv Res. 1996;31(5):623-638.

16. Hadley SR, Lee M, Reid M, Dweck E, Steiger D. Predictors of pulmonary embolism in orthopaedic patient population. Abstract presented at: 43rd Annual Meeting of the Eastern Orthopaedic Association; June 20-23, 2012; Bolton Landing, NY.

17. Hadley S, Immerman I, Hutzler L, Slover J, Bosco J. Staphylococcus aureus decolonization protocol decreases surgical site infections for total joint replacement. Arthritis. 2010;2010:924518.

18. Singh JA, Lewallen DG. Ninety-day mortality in patients undergoing elective total hip or total knee arthroplasty. J Arthroplasty. 2012;27(8):1417-1422.e1.

19. Hu J, Liu Y, Lv Z, Li X, Qin X, Fan W. Mortality and morbidity associated with simultaneous bilateral or staged bilateral total knee arthroplasty: a meta-analysis. Arch Orthop Trauma Surg. 2011;131(9):1291-1298.

20. Meehan JP, Danielsen B, Tancredi DJ, Kim S, Jamali AA, White RH. A population-based comparison of the incidence of adverse outcomes after simultaneous-bilateral and staged-bilateral total knee arthroplasty. J Bone Joint Surg Am. 2011;93(23):2203-2213.

21. Walmsley P, Murray A, Brenkel IJ. The practice of bilateral, simultaneous total knee replacement in Scotland over the last decade. Data from the Scottish Arthroplasty Project. Knee. 2006;13(2):102-105.

22. Fu D, Li G, Chen K, Zeng H, Zhang X, Cai Z. Comparison of clinical outcome between simultaneous-bilateral and staged-bilateral total knee arthroplasty: a systematic review of retrospective studies. J Arthroplasty. 2013;28(7):1141-1147.

23. Bolognesi MP, Watters TS, Attarian DE, Wellman SS, Setoguchi S. Simultaneous vs staged bilateral total knee arthroplasty among Medicare beneficiaries, 2000–2009. J Arthroplasty. 2013;28(8 suppl):87-91.

Issue
The American Journal of Orthopedics - 46(4)
Issue
The American Journal of Orthopedics - 46(4)
Page Number
E224-E229
Page Number
E224-E229
Publications
Publications
Topics
Article Type
Display Headline
Is Simultaneous Bilateral Total Knee Arthroplasty (BTKA) as Safe as Staged BTKA?
Display Headline
Is Simultaneous Bilateral Total Knee Arthroplasty (BTKA) as Safe as Staged BTKA?
Sections
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Article PDF Media

Biceps Tenodesis: An Evolution of Treatment

Article Type
Changed
Display Headline
Biceps Tenodesis: An Evolution of Treatment

Take-Home Points

  • The LHB tendon has been shown to be a significant pain generator in the shoulder.
  • At our institution, the number of LHB tenodeses significantly increased from 2004 to 2014.
  • The age of patients who underwent a LHB tenodesis did not change significantly over the study period.
  • Furthermore, the percentage of shoulder procedures that involved a LHB tenodesis significantly increased over the study period.
  • Biceps tenodesis has become a more common procedure to treat shoulder pathology.

Although the exact function of the long head of the biceps (LHB) tendon is not completely understood, it is accepted that the LHB tendon can be a significant source of pain within the shoulder.1-4 Patients with symptoms related to biceps pathology often present with anterior shoulder pain that worsens with flexion and supination of the affected elbow and wrist.5 Although the sensitivity and specificity of physical examination maneuvers have been called into question, special tests have been developed to aid in the diagnosis of tendonitis of the LHB. These tests include the Speed, Yergason, bear hug, and uppercut tests as well as the O’Brien test (cross-body adduction).6,7 Recent studies have found LHB pathology in 45% of patients who undergo rotator cuff repair and in 63% of patients with a subscapularis tear.8,9

Pathology of the LHB tendon, including superior labrum anterior to posterior (SLAP) tears, can be treated in many ways.5,10,11 Options include SLAP repair, biceps tenodesis, débridement, and biceps tenotomy.11,12 Results of SLAP repairs have been less than optimal, but biceps tenodesis has been effective, and avoids the issue of cramping as can be seen with biceps tenotomy and débridement.10,12,13 Surgical methods for biceps tenodesis include open subpectoral and all-arthroscopic.11,12 Both methods have had good, reliable outcomes, but the all-arthroscopic technique is relatively new.11,12,14We conducted a study to determine LHB tenodesis trends, including patient age at time of surgery. We used surgical data from fellowship-trained sports or shoulder/elbow orthopedic surgeons at a busy subspecialty-based shoulder orthopedic practice. We hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis.

Methods

Our Institutional Review Board exempted this study. To determine the number of LHB tenodesis procedures performed at our institution, overall and in comparison with other common arthroscopic shoulder procedures, we queried the surgical database of 4 fellowship-trained orthopedic surgeons (shoulder/elbow, Drs. Nicholson and Cole; sports, Drs. Romeo and Verma) for the period January 1, 2004 to December 31, 2014. We used Current Procedural Terminology (CPT) code 23430 to determine the number of LHB tenodesis cases, as the surgeons primarily perform an open subpectoral biceps tenodesis. Patient age at time of surgery and the date of surgery were recorded. All patients who underwent LHB tenodesis between January 1, 2004 and December 31, 2014 were included. Number of procedures performed each year by each surgeon was recorded, as were concomitant procedures performed at the same time as the LHB tenodesis. To get the denominator (and reference point) for the number of arthroscopic shoulder surgeries performed by these 4 surgeons during the study period, and thereby determine the rate of LHB tenodesis, we selected the most common shoulder arthroscopy CPT codes used in our practice: 23430, 29806, 29807, 29822, 29823, 29825, 29826, and 29827. For a patient who underwent multiple procedures on the same day (multiple CPT codes entered on the same day), only one code was counted for that day. If 23430 was among the codes, it was included, and the case was placed in the numerator; if 23430 was not among the codes, the case was placed in the denominator.

The Arthroscopy Association of North America provides descriptions for the CPT codes: 23430 (tenodesis of long tendon of biceps), 29806 (arthroscopy, shoulder, surgical; capsulorrhaphy), 29807 (arthroscopy, shoulder, surgical; repair of SLAP lesion), 29822 (arthroscopy, shoulder, surgical; débridement, limited), 29823 (arthroscopy, shoulder, surgical; débridement, extensive), 29825 (arthroscopy, shoulder, surgical; with lysis and resection of adhesions, with or without manipulation), 29826 (arthroscopy, shoulder, surgical; decompression of subacromial space with partial acromioplasty, with or without coracoacromial release), and 29827 (arthroscopy, shoulder, surgical; with rotator cuff repair).

For analysis, we divided the data into total number of arthroscopic shoulder procedures performed by each surgeon each year and number of LHB tenodesis procedures performed by each surgeon each year. Total number of patients who had an arthroscopic procedure was used to create a denominator, and number of LHB tenodesis procedures showed the percentage of arthroscopic shoulder surgery patients who underwent LHB tenodesis. (All patients who undergo biceps tenodesis also have, at the least, diagnostic shoulder arthroscopy with or without tenotomy; if the tendon is ruptured, tenotomy is unnecessary.)

Descriptive statistics were calculated as means (SDs) for continuous variables and as frequencies with percentages for categorical variables. Linear regression analysis was used to determine whether the number of LHB tenodesis procedures changed during the study period and whether patient age changed over time. Significance was set at P < .05.

 

Results

Of the 7640 patients who underwent arthroscopic shoulder procedures between 2004 and 2014, 2125 had LHB tenodesis (CPT code 23430).

Mean (SD) age of the subgroup was 49.33 (13.2) years, and mean (SD) number of LHB tenodesis cases per year was 193.2 (130.5). Over time, mean age of patients who had these procedures did not change significantly (P = .934) (Figure 1), mean number of LHB tenodesis cases increased significantly (P = .0024) (Figure 2A), and percentage of LHB tenodesis cases increased significantly relative to percentage of all arthroscopic shoulder procedures (P = .0099) (Figure 2B). The concomitant procedures performed with LHB tenodesis during the study period are listed in the Table.

Discussion

Tenodesis has become a common treatment option for several pathologic shoulder conditions involving the LHB tendon.5 We set out to determine trends in LHB tenodesis at a subspecialty-focused shoulder orthopedic practice and hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis. Our hypotheses were confirmed: The number of LHB tenodesis cases increased significantly without a significant change in patient age.

Treatment options for LHB pathology and SLAP tears include simple tenotomy, débridement, open biceps tenodesis, and arthroscopic tenodesis.11,12,15

Several fixation options have been used in open subpectoral biceps tenodesis. In this technique, which was used by all the surgeons in this study, the biceps tendon is fixed such that the musculotendinous junction of the biceps rests at the inferior border of the pectoralis major in the bicipital groove.16-19 Studies have found good, reliable outcomes with both the open and the arthroscopic surgical techniques.12,18 Comparing the LHB tenodesis trends in the present study with the SLAP repair trends we found at our institution in a previous study,20 we discovered that overall number of LHB tenodesis cases and percentage of LHB tenodesis cases relative to percentage of all arthroscopic shoulder procedures increased significantly more than for SLAP repairs.

Recent evidence has called into question the results of SLAP repairs and suggested biceps tenodesis may be a better treatment option for SLAP tears.10,13,21 Studies have found excellent outcomes with open subpectoral biceps tenodesis in the treatment of SLAP tears, and others have found better restoration of pitchers’ thoracic rotation with open subpectoral biceps tenodesis than with SLAP repair.13,14 Similarly, comparison studies have largely favored biceps tenodesis over SLAP repair, particularly in patients older than 35 years to 40 years.22 Given these results, it is not surprising that, querying the American Board of Orthopaedic Surgeons (ABOS) part II database for isolated SLAP lesions treated between 2002 and 2011, Patterson and colleagues23 found the percentage of SLAP repairs decreased from 69.3% to 44.8% (P < .0001), whereas the percentage of biceps tenodesis procedures increased from 1.9% to 18.8% (P < .0001), indicating the realization of improved outcomes with LHB tenodesis in the treatment of SLAP tears. On the other hand, in the ABOS part II database for the period 2003 to 2008, Weber and colleagues24 found that, despite a decrease in the percentage of SLAP repairs, total number of SLAP repairs increased from 9.4% to 10.1% (P = .0163). According to our study results, the number of SLAP repairs is decreasing over time, whereas the number of LHB tenodesis procedures is continuing to rise. The practice patterns seen in our study correlate with those in previous studies of the treatment of SLAP tears: good results in tenodesis groups and poor results in SLAP repair groups.10,13Werner and colleagues25 recently used the large PearlDiver database, which includes information from both private payers and Medicare, to determine overall LHB tenodesis trends in the United States for the period 2008 to 2011. Over those years, the incidence of LHB tenodesis increased 1.7-fold, and the rate of arthroscopic LHB tenodesis increased significantly more than the rate of open LHB tenodesis. These results are similar to ours in that the number of LHB tenodesis cases increased significantly over time. However, as the overwhelming majority of patients in our practice undergo open biceps tenodesis, the faster rate of growth in the arthroscopic cohort relative to the open cohort cannot be assessed. Additional randomized studies comparing biceps tenodesis, both open and arthroscopic, with SLAP repair are needed to properly determine the superiority of LHB tenodesis over SLAP repair.

One strength of this database study was the number of patients: more than 7000, 2125 of whom underwent biceps tenodesis performed by 1 of 4 fellowship-trained orthopedic surgeons. There were several study limitations. First, because the original diagnoses were not recorded, it was unclear exactly which pathologies were treated with tenodesis, limiting our ability to make recommendations regarding treatment trends for specific pathologies. Similarly, we did not assess outcome variables, which would have allowed us to draw conclusions about the effectiveness of the biceps tenodesis procedures. Furthermore, some procedures may have been coded incorrectly, and therefore some patients may have been erroneously included or excluded. In addition, using data from only one institution may have introduced bias into our conclusions, though the results are consistent with national trends. Finally, there was some variability among the 4 surgeons in the number of LHB tenodesis procedures performed, and this variability may have confounded results, though these surgeons treat biceps pathology in similar ways.

Am J Orthop. 2017;46(4):E219-E223. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

References

1. Denard PJ, Dai X, Hanypsiak BT, Burkhart SS. Anatomy of the biceps tendon: implications for restoring physiological length–tension relation during biceps tenodesis with interference screw fixation. Arthroscopy. 2012;28(10):1352-1358.

2. Ejnisman B, Monteiro GC, Andreoli CV, de Castro Pochini A. Disorder of the long head of the biceps tendon. Br J Sports Med. 2010;44(5):347-354.

3. Mellano CR, Shin JJ, Yanke AB, Verma NN. Disorders of the long head of the biceps tendon. Instr Course Lect. 2015;64:567-576.

4. Szabo I, Boileau P, Walch G. The proximal biceps as a pain generator and results of tenotomy. Sports Med Arthrosc Rev. 2008;16(3):180-186.

5. Harwin SF, Birns ME, Mbabuike JJ, Porter DA, Galano GJ. Arthroscopic tenodesis of the long head of the biceps. Orthopedics. 2014;37(11):743-747.

6. Holtby R, Razmjou H. Accuracy of the Speed’s and Yergason’s tests in detecting biceps pathology and SLAP lesions: comparison with arthroscopic findings. Arthroscopy. 2004;20(3):231-236.

7. Ben Kibler W, Sciascia AD, Hester P, Dome D, Jacobs C. Clinical utility of traditional and new tests in the diagnosis of biceps tendon injuries and superior labrum anterior and posterior lesions in the shoulder. Am J Sports Med. 2009;37(9):1840-1847.

8. Lafosse L, Reiland Y, Baier GP, Toussaint B, Jost B. Anterior and posterior instability of the long head of the biceps tendon in rotator cuff tears: a new classification based on arthroscopic observations. Arthroscopy. 2007;23(1):73-80.

9. Adams CR, Schoolfield JD, Burkhart SS. The results of arthroscopic subscapularis tendon repairs. Arthroscopy. 2008;24(12):1381-1389.

10. Provencher MT, McCormick F, Dewing C, McIntire S, Solomon D. A prospective analysis of 179 type 2 superior labrum anterior and posterior repairs: outcomes and factors associated with success and failure. Am J Sports Med. 2013;41(4):880-886.

11. Gombera MM, Kahlenberg CA, Nair R, Saltzman MD, Terry MA. All-arthroscopic suprapectoral versus open subpectoral tenodesis of the long head of the biceps brachii. Am J Sports Med. 2015;43(5):1077-1083.

12. Delle Rose G, Borroni M, Silvestro A, et al. The long head of biceps as a source of pain in active population: tenotomy or tenodesis? A comparison of 2 case series with isolated lesions. Musculoskelet Surg. 2012;96(suppl 1):S47-S52.

13. Chalmers PN, Trombley R, Cip J, et al. Postoperative restoration of upper extremity motion and neuromuscular control during the overhand pitch: evaluation of tenodesis and repair for superior labral anterior-posterior tears. Am J Sports Med. 2014;42(12):2825-2836.

14. Gupta AK, Chalmers PN, Klosterman EL, et al. Subpectoral biceps tenodesis for bicipital tendonitis with SLAP tear. Orthopedics. 2015;38(1):e48-e53.

15. Ge H, Zhang Q, Sun Y, Li J, Sun L, Cheng B. Tenotomy or tenodesis for the long head of biceps lesions in shoulders: a systematic review and meta-analysis. PLoS One. 2015;10(3):e0121286.

16. Kaback LA, Gowda AL, Paller D, Green A, Blaine T. Long head biceps tenodesis with a knotless cinch suture anchor: a biomechanical analysis. Arthroscopy. 2015;31(5):831-835.

17. Kany J, Guinand R, Amaravathi RS, Alassaf I. The keyhole technique for arthroscopic tenodesis of the long head of the biceps tendon. In vivo prospective study with a radio-opaque marker. Orthop Traumatol Surg Res. 2015;101(1):31-34.

18. Mazzocca AD, Cote MP, Arciero CL, Romeo AA, Arciero RA. Clinical outcomes after subpectoral biceps tenodesis with an interference screw. Am J Sports Med. 2008;36(10):1922-1929.

19. Provencher MT, LeClere LE, Romeo AA. Subpectoral biceps tenodesis. Sports Med Arthrosc Rev. 2008;16(3):170-176.

20. Erickson BJ, Jain A, Abrams GD, et al. SLAP lesions: trends in treatment. Arthroscopy. 2016;32(6):976-981.

21. Erickson J, Lavery K, Monica J, Gatt C, Dhawan A. Surgical treatment of symptomatic superior labrum anterior-posterior tears in patients older than 40 years: a systematic review. Am J Sports Med. 2015;43(5):1274-1282.

22. Denard PJ, Ladermann A, Parsley BK, Burkhart SS. Arthroscopic biceps tenodesis compared with repair of isolated type II SLAP lesions in patients older than 35 years. Orthopedics. 2014;37(3):e292-e297.

23. Patterson BM, Creighton RA, Spang JT, Roberson JR, Kamath GV. Surgical trends in the treatment of superior labrum anterior and posterior lesions of the shoulder: analysis of data from the American Board of Orthopaedic Surgery certification examination database. Am J Sports Med. 2014;42(8):1904-1910.

24. Weber SC, Martin DF, Seiler JG 3rd, Harrast JJ. Superior labrum anterior and posterior lesions of the shoulder: incidence rates, complications, and outcomes as reported by American Board of Orthopedic Surgery. Part II candidates. Am J Sports Med. 2012;40(7):1538-1543.

25. Werner BC, Brockmeier SF, Gwathmey FW. Trends in long head biceps tenodesis. Am J Sports Med. 2015;43(3):570-578.

Article PDF
Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Issue
The American Journal of Orthopedics - 46(4)
Publications
Topics
Page Number
E219-E223
Sections
Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Author and Disclosure Information

Authors’ Disclosure Statement: The authors report no actual or potential conflict of interest in relation to this article.

Article PDF
Article PDF

Take-Home Points

  • The LHB tendon has been shown to be a significant pain generator in the shoulder.
  • At our institution, the number of LHB tenodeses significantly increased from 2004 to 2014.
  • The age of patients who underwent a LHB tenodesis did not change significantly over the study period.
  • Furthermore, the percentage of shoulder procedures that involved a LHB tenodesis significantly increased over the study period.
  • Biceps tenodesis has become a more common procedure to treat shoulder pathology.

Although the exact function of the long head of the biceps (LHB) tendon is not completely understood, it is accepted that the LHB tendon can be a significant source of pain within the shoulder.1-4 Patients with symptoms related to biceps pathology often present with anterior shoulder pain that worsens with flexion and supination of the affected elbow and wrist.5 Although the sensitivity and specificity of physical examination maneuvers have been called into question, special tests have been developed to aid in the diagnosis of tendonitis of the LHB. These tests include the Speed, Yergason, bear hug, and uppercut tests as well as the O’Brien test (cross-body adduction).6,7 Recent studies have found LHB pathology in 45% of patients who undergo rotator cuff repair and in 63% of patients with a subscapularis tear.8,9

Pathology of the LHB tendon, including superior labrum anterior to posterior (SLAP) tears, can be treated in many ways.5,10,11 Options include SLAP repair, biceps tenodesis, débridement, and biceps tenotomy.11,12 Results of SLAP repairs have been less than optimal, but biceps tenodesis has been effective, and avoids the issue of cramping as can be seen with biceps tenotomy and débridement.10,12,13 Surgical methods for biceps tenodesis include open subpectoral and all-arthroscopic.11,12 Both methods have had good, reliable outcomes, but the all-arthroscopic technique is relatively new.11,12,14We conducted a study to determine LHB tenodesis trends, including patient age at time of surgery. We used surgical data from fellowship-trained sports or shoulder/elbow orthopedic surgeons at a busy subspecialty-based shoulder orthopedic practice. We hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis.

Methods

Our Institutional Review Board exempted this study. To determine the number of LHB tenodesis procedures performed at our institution, overall and in comparison with other common arthroscopic shoulder procedures, we queried the surgical database of 4 fellowship-trained orthopedic surgeons (shoulder/elbow, Drs. Nicholson and Cole; sports, Drs. Romeo and Verma) for the period January 1, 2004 to December 31, 2014. We used Current Procedural Terminology (CPT) code 23430 to determine the number of LHB tenodesis cases, as the surgeons primarily perform an open subpectoral biceps tenodesis. Patient age at time of surgery and the date of surgery were recorded. All patients who underwent LHB tenodesis between January 1, 2004 and December 31, 2014 were included. Number of procedures performed each year by each surgeon was recorded, as were concomitant procedures performed at the same time as the LHB tenodesis. To get the denominator (and reference point) for the number of arthroscopic shoulder surgeries performed by these 4 surgeons during the study period, and thereby determine the rate of LHB tenodesis, we selected the most common shoulder arthroscopy CPT codes used in our practice: 23430, 29806, 29807, 29822, 29823, 29825, 29826, and 29827. For a patient who underwent multiple procedures on the same day (multiple CPT codes entered on the same day), only one code was counted for that day. If 23430 was among the codes, it was included, and the case was placed in the numerator; if 23430 was not among the codes, the case was placed in the denominator.

The Arthroscopy Association of North America provides descriptions for the CPT codes: 23430 (tenodesis of long tendon of biceps), 29806 (arthroscopy, shoulder, surgical; capsulorrhaphy), 29807 (arthroscopy, shoulder, surgical; repair of SLAP lesion), 29822 (arthroscopy, shoulder, surgical; débridement, limited), 29823 (arthroscopy, shoulder, surgical; débridement, extensive), 29825 (arthroscopy, shoulder, surgical; with lysis and resection of adhesions, with or without manipulation), 29826 (arthroscopy, shoulder, surgical; decompression of subacromial space with partial acromioplasty, with or without coracoacromial release), and 29827 (arthroscopy, shoulder, surgical; with rotator cuff repair).

For analysis, we divided the data into total number of arthroscopic shoulder procedures performed by each surgeon each year and number of LHB tenodesis procedures performed by each surgeon each year. Total number of patients who had an arthroscopic procedure was used to create a denominator, and number of LHB tenodesis procedures showed the percentage of arthroscopic shoulder surgery patients who underwent LHB tenodesis. (All patients who undergo biceps tenodesis also have, at the least, diagnostic shoulder arthroscopy with or without tenotomy; if the tendon is ruptured, tenotomy is unnecessary.)

Descriptive statistics were calculated as means (SDs) for continuous variables and as frequencies with percentages for categorical variables. Linear regression analysis was used to determine whether the number of LHB tenodesis procedures changed during the study period and whether patient age changed over time. Significance was set at P < .05.

 

Results

Of the 7640 patients who underwent arthroscopic shoulder procedures between 2004 and 2014, 2125 had LHB tenodesis (CPT code 23430).

Mean (SD) age of the subgroup was 49.33 (13.2) years, and mean (SD) number of LHB tenodesis cases per year was 193.2 (130.5). Over time, mean age of patients who had these procedures did not change significantly (P = .934) (Figure 1), mean number of LHB tenodesis cases increased significantly (P = .0024) (Figure 2A), and percentage of LHB tenodesis cases increased significantly relative to percentage of all arthroscopic shoulder procedures (P = .0099) (Figure 2B). The concomitant procedures performed with LHB tenodesis during the study period are listed in the Table.

Discussion

Tenodesis has become a common treatment option for several pathologic shoulder conditions involving the LHB tendon.5 We set out to determine trends in LHB tenodesis at a subspecialty-focused shoulder orthopedic practice and hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis. Our hypotheses were confirmed: The number of LHB tenodesis cases increased significantly without a significant change in patient age.

Treatment options for LHB pathology and SLAP tears include simple tenotomy, débridement, open biceps tenodesis, and arthroscopic tenodesis.11,12,15

Several fixation options have been used in open subpectoral biceps tenodesis. In this technique, which was used by all the surgeons in this study, the biceps tendon is fixed such that the musculotendinous junction of the biceps rests at the inferior border of the pectoralis major in the bicipital groove.16-19 Studies have found good, reliable outcomes with both the open and the arthroscopic surgical techniques.12,18 Comparing the LHB tenodesis trends in the present study with the SLAP repair trends we found at our institution in a previous study,20 we discovered that overall number of LHB tenodesis cases and percentage of LHB tenodesis cases relative to percentage of all arthroscopic shoulder procedures increased significantly more than for SLAP repairs.

Recent evidence has called into question the results of SLAP repairs and suggested biceps tenodesis may be a better treatment option for SLAP tears.10,13,21 Studies have found excellent outcomes with open subpectoral biceps tenodesis in the treatment of SLAP tears, and others have found better restoration of pitchers’ thoracic rotation with open subpectoral biceps tenodesis than with SLAP repair.13,14 Similarly, comparison studies have largely favored biceps tenodesis over SLAP repair, particularly in patients older than 35 years to 40 years.22 Given these results, it is not surprising that, querying the American Board of Orthopaedic Surgeons (ABOS) part II database for isolated SLAP lesions treated between 2002 and 2011, Patterson and colleagues23 found the percentage of SLAP repairs decreased from 69.3% to 44.8% (P < .0001), whereas the percentage of biceps tenodesis procedures increased from 1.9% to 18.8% (P < .0001), indicating the realization of improved outcomes with LHB tenodesis in the treatment of SLAP tears. On the other hand, in the ABOS part II database for the period 2003 to 2008, Weber and colleagues24 found that, despite a decrease in the percentage of SLAP repairs, total number of SLAP repairs increased from 9.4% to 10.1% (P = .0163). According to our study results, the number of SLAP repairs is decreasing over time, whereas the number of LHB tenodesis procedures is continuing to rise. The practice patterns seen in our study correlate with those in previous studies of the treatment of SLAP tears: good results in tenodesis groups and poor results in SLAP repair groups.10,13Werner and colleagues25 recently used the large PearlDiver database, which includes information from both private payers and Medicare, to determine overall LHB tenodesis trends in the United States for the period 2008 to 2011. Over those years, the incidence of LHB tenodesis increased 1.7-fold, and the rate of arthroscopic LHB tenodesis increased significantly more than the rate of open LHB tenodesis. These results are similar to ours in that the number of LHB tenodesis cases increased significantly over time. However, as the overwhelming majority of patients in our practice undergo open biceps tenodesis, the faster rate of growth in the arthroscopic cohort relative to the open cohort cannot be assessed. Additional randomized studies comparing biceps tenodesis, both open and arthroscopic, with SLAP repair are needed to properly determine the superiority of LHB tenodesis over SLAP repair.

One strength of this database study was the number of patients: more than 7000, 2125 of whom underwent biceps tenodesis performed by 1 of 4 fellowship-trained orthopedic surgeons. There were several study limitations. First, because the original diagnoses were not recorded, it was unclear exactly which pathologies were treated with tenodesis, limiting our ability to make recommendations regarding treatment trends for specific pathologies. Similarly, we did not assess outcome variables, which would have allowed us to draw conclusions about the effectiveness of the biceps tenodesis procedures. Furthermore, some procedures may have been coded incorrectly, and therefore some patients may have been erroneously included or excluded. In addition, using data from only one institution may have introduced bias into our conclusions, though the results are consistent with national trends. Finally, there was some variability among the 4 surgeons in the number of LHB tenodesis procedures performed, and this variability may have confounded results, though these surgeons treat biceps pathology in similar ways.

Am J Orthop. 2017;46(4):E219-E223. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

Take-Home Points

  • The LHB tendon has been shown to be a significant pain generator in the shoulder.
  • At our institution, the number of LHB tenodeses significantly increased from 2004 to 2014.
  • The age of patients who underwent a LHB tenodesis did not change significantly over the study period.
  • Furthermore, the percentage of shoulder procedures that involved a LHB tenodesis significantly increased over the study period.
  • Biceps tenodesis has become a more common procedure to treat shoulder pathology.

Although the exact function of the long head of the biceps (LHB) tendon is not completely understood, it is accepted that the LHB tendon can be a significant source of pain within the shoulder.1-4 Patients with symptoms related to biceps pathology often present with anterior shoulder pain that worsens with flexion and supination of the affected elbow and wrist.5 Although the sensitivity and specificity of physical examination maneuvers have been called into question, special tests have been developed to aid in the diagnosis of tendonitis of the LHB. These tests include the Speed, Yergason, bear hug, and uppercut tests as well as the O’Brien test (cross-body adduction).6,7 Recent studies have found LHB pathology in 45% of patients who undergo rotator cuff repair and in 63% of patients with a subscapularis tear.8,9

Pathology of the LHB tendon, including superior labrum anterior to posterior (SLAP) tears, can be treated in many ways.5,10,11 Options include SLAP repair, biceps tenodesis, débridement, and biceps tenotomy.11,12 Results of SLAP repairs have been less than optimal, but biceps tenodesis has been effective, and avoids the issue of cramping as can be seen with biceps tenotomy and débridement.10,12,13 Surgical methods for biceps tenodesis include open subpectoral and all-arthroscopic.11,12 Both methods have had good, reliable outcomes, but the all-arthroscopic technique is relatively new.11,12,14We conducted a study to determine LHB tenodesis trends, including patient age at time of surgery. We used surgical data from fellowship-trained sports or shoulder/elbow orthopedic surgeons at a busy subspecialty-based shoulder orthopedic practice. We hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis.

Methods

Our Institutional Review Board exempted this study. To determine the number of LHB tenodesis procedures performed at our institution, overall and in comparison with other common arthroscopic shoulder procedures, we queried the surgical database of 4 fellowship-trained orthopedic surgeons (shoulder/elbow, Drs. Nicholson and Cole; sports, Drs. Romeo and Verma) for the period January 1, 2004 to December 31, 2014. We used Current Procedural Terminology (CPT) code 23430 to determine the number of LHB tenodesis cases, as the surgeons primarily perform an open subpectoral biceps tenodesis. Patient age at time of surgery and the date of surgery were recorded. All patients who underwent LHB tenodesis between January 1, 2004 and December 31, 2014 were included. Number of procedures performed each year by each surgeon was recorded, as were concomitant procedures performed at the same time as the LHB tenodesis. To get the denominator (and reference point) for the number of arthroscopic shoulder surgeries performed by these 4 surgeons during the study period, and thereby determine the rate of LHB tenodesis, we selected the most common shoulder arthroscopy CPT codes used in our practice: 23430, 29806, 29807, 29822, 29823, 29825, 29826, and 29827. For a patient who underwent multiple procedures on the same day (multiple CPT codes entered on the same day), only one code was counted for that day. If 23430 was among the codes, it was included, and the case was placed in the numerator; if 23430 was not among the codes, the case was placed in the denominator.

The Arthroscopy Association of North America provides descriptions for the CPT codes: 23430 (tenodesis of long tendon of biceps), 29806 (arthroscopy, shoulder, surgical; capsulorrhaphy), 29807 (arthroscopy, shoulder, surgical; repair of SLAP lesion), 29822 (arthroscopy, shoulder, surgical; débridement, limited), 29823 (arthroscopy, shoulder, surgical; débridement, extensive), 29825 (arthroscopy, shoulder, surgical; with lysis and resection of adhesions, with or without manipulation), 29826 (arthroscopy, shoulder, surgical; decompression of subacromial space with partial acromioplasty, with or without coracoacromial release), and 29827 (arthroscopy, shoulder, surgical; with rotator cuff repair).

For analysis, we divided the data into total number of arthroscopic shoulder procedures performed by each surgeon each year and number of LHB tenodesis procedures performed by each surgeon each year. Total number of patients who had an arthroscopic procedure was used to create a denominator, and number of LHB tenodesis procedures showed the percentage of arthroscopic shoulder surgery patients who underwent LHB tenodesis. (All patients who undergo biceps tenodesis also have, at the least, diagnostic shoulder arthroscopy with or without tenotomy; if the tendon is ruptured, tenotomy is unnecessary.)

Descriptive statistics were calculated as means (SDs) for continuous variables and as frequencies with percentages for categorical variables. Linear regression analysis was used to determine whether the number of LHB tenodesis procedures changed during the study period and whether patient age changed over time. Significance was set at P < .05.

 

Results

Of the 7640 patients who underwent arthroscopic shoulder procedures between 2004 and 2014, 2125 had LHB tenodesis (CPT code 23430).

Mean (SD) age of the subgroup was 49.33 (13.2) years, and mean (SD) number of LHB tenodesis cases per year was 193.2 (130.5). Over time, mean age of patients who had these procedures did not change significantly (P = .934) (Figure 1), mean number of LHB tenodesis cases increased significantly (P = .0024) (Figure 2A), and percentage of LHB tenodesis cases increased significantly relative to percentage of all arthroscopic shoulder procedures (P = .0099) (Figure 2B). The concomitant procedures performed with LHB tenodesis during the study period are listed in the Table.

Discussion

Tenodesis has become a common treatment option for several pathologic shoulder conditions involving the LHB tendon.5 We set out to determine trends in LHB tenodesis at a subspecialty-focused shoulder orthopedic practice and hypothesized that the rate of LHB tenodesis would increase significantly over time and that there would be no significant change in the age of patients who underwent LHB tenodesis. Our hypotheses were confirmed: The number of LHB tenodesis cases increased significantly without a significant change in patient age.

Treatment options for LHB pathology and SLAP tears include simple tenotomy, débridement, open biceps tenodesis, and arthroscopic tenodesis.11,12,15

Several fixation options have been used in open subpectoral biceps tenodesis. In this technique, which was used by all the surgeons in this study, the biceps tendon is fixed such that the musculotendinous junction of the biceps rests at the inferior border of the pectoralis major in the bicipital groove.16-19 Studies have found good, reliable outcomes with both the open and the arthroscopic surgical techniques.12,18 Comparing the LHB tenodesis trends in the present study with the SLAP repair trends we found at our institution in a previous study,20 we discovered that overall number of LHB tenodesis cases and percentage of LHB tenodesis cases relative to percentage of all arthroscopic shoulder procedures increased significantly more than for SLAP repairs.

Recent evidence has called into question the results of SLAP repairs and suggested biceps tenodesis may be a better treatment option for SLAP tears.10,13,21 Studies have found excellent outcomes with open subpectoral biceps tenodesis in the treatment of SLAP tears, and others have found better restoration of pitchers’ thoracic rotation with open subpectoral biceps tenodesis than with SLAP repair.13,14 Similarly, comparison studies have largely favored biceps tenodesis over SLAP repair, particularly in patients older than 35 years to 40 years.22 Given these results, it is not surprising that, querying the American Board of Orthopaedic Surgeons (ABOS) part II database for isolated SLAP lesions treated between 2002 and 2011, Patterson and colleagues23 found the percentage of SLAP repairs decreased from 69.3% to 44.8% (P < .0001), whereas the percentage of biceps tenodesis procedures increased from 1.9% to 18.8% (P < .0001), indicating the realization of improved outcomes with LHB tenodesis in the treatment of SLAP tears. On the other hand, in the ABOS part II database for the period 2003 to 2008, Weber and colleagues24 found that, despite a decrease in the percentage of SLAP repairs, total number of SLAP repairs increased from 9.4% to 10.1% (P = .0163). According to our study results, the number of SLAP repairs is decreasing over time, whereas the number of LHB tenodesis procedures is continuing to rise. The practice patterns seen in our study correlate with those in previous studies of the treatment of SLAP tears: good results in tenodesis groups and poor results in SLAP repair groups.10,13Werner and colleagues25 recently used the large PearlDiver database, which includes information from both private payers and Medicare, to determine overall LHB tenodesis trends in the United States for the period 2008 to 2011. Over those years, the incidence of LHB tenodesis increased 1.7-fold, and the rate of arthroscopic LHB tenodesis increased significantly more than the rate of open LHB tenodesis. These results are similar to ours in that the number of LHB tenodesis cases increased significantly over time. However, as the overwhelming majority of patients in our practice undergo open biceps tenodesis, the faster rate of growth in the arthroscopic cohort relative to the open cohort cannot be assessed. Additional randomized studies comparing biceps tenodesis, both open and arthroscopic, with SLAP repair are needed to properly determine the superiority of LHB tenodesis over SLAP repair.

One strength of this database study was the number of patients: more than 7000, 2125 of whom underwent biceps tenodesis performed by 1 of 4 fellowship-trained orthopedic surgeons. There were several study limitations. First, because the original diagnoses were not recorded, it was unclear exactly which pathologies were treated with tenodesis, limiting our ability to make recommendations regarding treatment trends for specific pathologies. Similarly, we did not assess outcome variables, which would have allowed us to draw conclusions about the effectiveness of the biceps tenodesis procedures. Furthermore, some procedures may have been coded incorrectly, and therefore some patients may have been erroneously included or excluded. In addition, using data from only one institution may have introduced bias into our conclusions, though the results are consistent with national trends. Finally, there was some variability among the 4 surgeons in the number of LHB tenodesis procedures performed, and this variability may have confounded results, though these surgeons treat biceps pathology in similar ways.

Am J Orthop. 2017;46(4):E219-E223. Copyright Frontline Medical Communications Inc. 2017. All rights reserved.

References

1. Denard PJ, Dai X, Hanypsiak BT, Burkhart SS. Anatomy of the biceps tendon: implications for restoring physiological length–tension relation during biceps tenodesis with interference screw fixation. Arthroscopy. 2012;28(10):1352-1358.

2. Ejnisman B, Monteiro GC, Andreoli CV, de Castro Pochini A. Disorder of the long head of the biceps tendon. Br J Sports Med. 2010;44(5):347-354.

3. Mellano CR, Shin JJ, Yanke AB, Verma NN. Disorders of the long head of the biceps tendon. Instr Course Lect. 2015;64:567-576.

4. Szabo I, Boileau P, Walch G. The proximal biceps as a pain generator and results of tenotomy. Sports Med Arthrosc Rev. 2008;16(3):180-186.

5. Harwin SF, Birns ME, Mbabuike JJ, Porter DA, Galano GJ. Arthroscopic tenodesis of the long head of the biceps. Orthopedics. 2014;37(11):743-747.

6. Holtby R, Razmjou H. Accuracy of the Speed’s and Yergason’s tests in detecting biceps pathology and SLAP lesions: comparison with arthroscopic findings. Arthroscopy. 2004;20(3):231-236.

7. Ben Kibler W, Sciascia AD, Hester P, Dome D, Jacobs C. Clinical utility of traditional and new tests in the diagnosis of biceps tendon injuries and superior labrum anterior and posterior lesions in the shoulder. Am J Sports Med. 2009;37(9):1840-1847.

8. Lafosse L, Reiland Y, Baier GP, Toussaint B, Jost B. Anterior and posterior instability of the long head of the biceps tendon in rotator cuff tears: a new classification based on arthroscopic observations. Arthroscopy. 2007;23(1):73-80.

9. Adams CR, Schoolfield JD, Burkhart SS. The results of arthroscopic subscapularis tendon repairs. Arthroscopy. 2008;24(12):1381-1389.

10. Provencher MT, McCormick F, Dewing C, McIntire S, Solomon D. A prospective analysis of 179 type 2 superior labrum anterior and posterior repairs: outcomes and factors associated with success and failure. Am J Sports Med. 2013;41(4):880-886.

11. Gombera MM, Kahlenberg CA, Nair R, Saltzman MD, Terry MA. All-arthroscopic suprapectoral versus open subpectoral tenodesis of the long head of the biceps brachii. Am J Sports Med. 2015;43(5):1077-1083.

12. Delle Rose G, Borroni M, Silvestro A, et al. The long head of biceps as a source of pain in active population: tenotomy or tenodesis? A comparison of 2 case series with isolated lesions. Musculoskelet Surg. 2012;96(suppl 1):S47-S52.

13. Chalmers PN, Trombley R, Cip J, et al. Postoperative restoration of upper extremity motion and neuromuscular control during the overhand pitch: evaluation of tenodesis and repair for superior labral anterior-posterior tears. Am J Sports Med. 2014;42(12):2825-2836.

14. Gupta AK, Chalmers PN, Klosterman EL, et al. Subpectoral biceps tenodesis for bicipital tendonitis with SLAP tear. Orthopedics. 2015;38(1):e48-e53.

15. Ge H, Zhang Q, Sun Y, Li J, Sun L, Cheng B. Tenotomy or tenodesis for the long head of biceps lesions in shoulders: a systematic review and meta-analysis. PLoS One. 2015;10(3):e0121286.

16. Kaback LA, Gowda AL, Paller D, Green A, Blaine T. Long head biceps tenodesis with a knotless cinch suture anchor: a biomechanical analysis. Arthroscopy. 2015;31(5):831-835.

17. Kany J, Guinand R, Amaravathi RS, Alassaf I. The keyhole technique for arthroscopic tenodesis of the long head of the biceps tendon. In vivo prospective study with a radio-opaque marker. Orthop Traumatol Surg Res. 2015;101(1):31-34.

18. Mazzocca AD, Cote MP, Arciero CL, Romeo AA, Arciero RA. Clinical outcomes after subpectoral biceps tenodesis with an interference screw. Am J Sports Med. 2008;36(10):1922-1929.

19. Provencher MT, LeClere LE, Romeo AA. Subpectoral biceps tenodesis. Sports Med Arthrosc Rev. 2008;16(3):170-176.

20. Erickson BJ, Jain A, Abrams GD, et al. SLAP lesions: trends in treatment. Arthroscopy. 2016;32(6):976-981.

21. Erickson J, Lavery K, Monica J, Gatt C, Dhawan A. Surgical treatment of symptomatic superior labrum anterior-posterior tears in patients older than 40 years: a systematic review. Am J Sports Med. 2015;43(5):1274-1282.

22. Denard PJ, Ladermann A, Parsley BK, Burkhart SS. Arthroscopic biceps tenodesis compared with repair of isolated type II SLAP lesions in patients older than 35 years. Orthopedics. 2014;37(3):e292-e297.

23. Patterson BM, Creighton RA, Spang JT, Roberson JR, Kamath GV. Surgical trends in the treatment of superior labrum anterior and posterior lesions of the shoulder: analysis of data from the American Board of Orthopaedic Surgery certification examination database. Am J Sports Med. 2014;42(8):1904-1910.

24. Weber SC, Martin DF, Seiler JG 3rd, Harrast JJ. Superior labrum anterior and posterior lesions of the shoulder: incidence rates, complications, and outcomes as reported by American Board of Orthopedic Surgery. Part II candidates. Am J Sports Med. 2012;40(7):1538-1543.

25. Werner BC, Brockmeier SF, Gwathmey FW. Trends in long head biceps tenodesis. Am J Sports Med. 2015;43(3):570-578.

References

1. Denard PJ, Dai X, Hanypsiak BT, Burkhart SS. Anatomy of the biceps tendon: implications for restoring physiological length–tension relation during biceps tenodesis with interference screw fixation. Arthroscopy. 2012;28(10):1352-1358.

2. Ejnisman B, Monteiro GC, Andreoli CV, de Castro Pochini A. Disorder of the long head of the biceps tendon. Br J Sports Med. 2010;44(5):347-354.

3. Mellano CR, Shin JJ, Yanke AB, Verma NN. Disorders of the long head of the biceps tendon. Instr Course Lect. 2015;64:567-576.

4. Szabo I, Boileau P, Walch G. The proximal biceps as a pain generator and results of tenotomy. Sports Med Arthrosc Rev. 2008;16(3):180-186.

5. Harwin SF, Birns ME, Mbabuike JJ, Porter DA, Galano GJ. Arthroscopic tenodesis of the long head of the biceps. Orthopedics. 2014;37(11):743-747.

6. Holtby R, Razmjou H. Accuracy of the Speed’s and Yergason’s tests in detecting biceps pathology and SLAP lesions: comparison with arthroscopic findings. Arthroscopy. 2004;20(3):231-236.

7. Ben Kibler W, Sciascia AD, Hester P, Dome D, Jacobs C. Clinical utility of traditional and new tests in the diagnosis of biceps tendon injuries and superior labrum anterior and posterior lesions in the shoulder. Am J Sports Med. 2009;37(9):1840-1847.

8. Lafosse L, Reiland Y, Baier GP, Toussaint B, Jost B. Anterior and posterior instability of the long head of the biceps tendon in rotator cuff tears: a new classification based on arthroscopic observations. Arthroscopy. 2007;23(1):73-80.

9. Adams CR, Schoolfield JD, Burkhart SS. The results of arthroscopic subscapularis tendon repairs. Arthroscopy. 2008;24(12):1381-1389.

10. Provencher MT, McCormick F, Dewing C, McIntire S, Solomon D. A prospective analysis of 179 type 2 superior labrum anterior and posterior repairs: outcomes and factors associated with success and failure. Am J Sports Med. 2013;41(4):880-886.

11. Gombera MM, Kahlenberg CA, Nair R, Saltzman MD, Terry MA. All-arthroscopic suprapectoral versus open subpectoral tenodesis of the long head of the biceps brachii. Am J Sports Med. 2015;43(5):1077-1083.

12. Delle Rose G, Borroni M, Silvestro A, et al. The long head of biceps as a source of pain in active population: tenotomy or tenodesis? A comparison of 2 case series with isolated lesions. Musculoskelet Surg. 2012;96(suppl 1):S47-S52.

13. Chalmers PN, Trombley R, Cip J, et al. Postoperative restoration of upper extremity motion and neuromuscular control during the overhand pitch: evaluation of tenodesis and repair for superior labral anterior-posterior tears. Am J Sports Med. 2014;42(12):2825-2836.

14. Gupta AK, Chalmers PN, Klosterman EL, et al. Subpectoral biceps tenodesis for bicipital tendonitis with SLAP tear. Orthopedics. 2015;38(1):e48-e53.

15. Ge H, Zhang Q, Sun Y, Li J, Sun L, Cheng B. Tenotomy or tenodesis for the long head of biceps lesions in shoulders: a systematic review and meta-analysis. PLoS One. 2015;10(3):e0121286.

16. Kaback LA, Gowda AL, Paller D, Green A, Blaine T. Long head biceps tenodesis with a knotless cinch suture anchor: a biomechanical analysis. Arthroscopy. 2015;31(5):831-835.

17. Kany J, Guinand R, Amaravathi RS, Alassaf I. The keyhole technique for arthroscopic tenodesis of the long head of the biceps tendon. In vivo prospective study with a radio-opaque marker. Orthop Traumatol Surg Res. 2015;101(1):31-34.

18. Mazzocca AD, Cote MP, Arciero CL, Romeo AA, Arciero RA. Clinical outcomes after subpectoral biceps tenodesis with an interference screw. Am J Sports Med. 2008;36(10):1922-1929.

19. Provencher MT, LeClere LE, Romeo AA. Subpectoral biceps tenodesis. Sports Med Arthrosc Rev. 2008;16(3):170-176.

20. Erickson BJ, Jain A, Abrams GD, et al. SLAP lesions: trends in treatment. Arthroscopy. 2016;32(6):976-981.

21. Erickson J, Lavery K, Monica J, Gatt C, Dhawan A. Surgical treatment of symptomatic superior labrum anterior-posterior tears in patients older than 40 years: a systematic review. Am J Sports Med. 2015;43(5):1274-1282.

22. Denard PJ, Ladermann A, Parsley BK, Burkhart SS. Arthroscopic biceps tenodesis compared with repair of isolated type II SLAP lesions in patients older than 35 years. Orthopedics. 2014;37(3):e292-e297.

23. Patterson BM, Creighton RA, Spang JT, Roberson JR, Kamath GV. Surgical trends in the treatment of superior labrum anterior and posterior lesions of the shoulder: analysis of data from the American Board of Orthopaedic Surgery certification examination database. Am J Sports Med. 2014;42(8):1904-1910.

24. Weber SC, Martin DF, Seiler JG 3rd, Harrast JJ. Superior labrum anterior and posterior lesions of the shoulder: incidence rates, complications, and outcomes as reported by American Board of Orthopedic Surgery. Part II candidates. Am J Sports Med. 2012;40(7):1538-1543.

25. Werner BC, Brockmeier SF, Gwathmey FW. Trends in long head biceps tenodesis. Am J Sports Med. 2015;43(3):570-578.

Issue
The American Journal of Orthopedics - 46(4)
Issue
The American Journal of Orthopedics - 46(4)
Page Number
E219-E223
Page Number
E219-E223
Publications
Publications
Topics
Article Type
Display Headline
Biceps Tenodesis: An Evolution of Treatment
Display Headline
Biceps Tenodesis: An Evolution of Treatment
Sections
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Article PDF Media

Sarcopenia and the New ICD-10-CM Code: Screening, Staging, and Diagnosis Considerations

Article Type
Changed
The CDC recently recognized sarcopenia as a reportable medical condition necessitating better screening and diagnosis of this geriatric syndrome.

Sarcopenia is an age-related loss of skeletal muscle that may result in diminished muscle strength and functional performance. The prevalence of sarcopenia varies based on the cohort and the assessment criteria. According to the Health Aging and Body Composition (ABC) study, the prevalence of sarcopenia in community-dwelling older adults is about 14% to 18%, whereas the estimate may exceed 30% for those in longterm care.1,2 This geriatric syndrome may disproportionately affect veterans given that they are older than the civilian population and may have disabling comorbid conditions associated with military service.3

Recently, there has been a call to action to systematically address sarcopenia by interdisciplinary organizations such as the European Society for Clinical and Economic Aspects of Osteoporosis and Osteoarthritis (ESCEO) and the International Working Group on Sarcopenia (IWGS).4,5 This call to action is due to the association of sarcopenia with increased health care costs, higher disability incidence, and elevated risk of mortality.6,7 The consequences of sarcopenia may include serious complications, such as hip fracture or a loss of functional independence.8,9 The CDC now recognizes sarcopenia as an independently reportable medical condition. Consequently, physicians, nurse practitioners (NPs), and other associated health professionals within the VA will need to better understand clinically viable and valid methods to screen and diagnose this geriatric syndrome.

The purpose of this paper is to inform practitioners how sarcopenia screening is aided by the new ICD-10-CM code and briefly review recent VA initiatives for proactive care. Additional objectives include identifying common methods used to assess sarcopenia and providing general recommendations to the VHA National Center for Health Promotion and Disease Prevention (NCP) concerning the management of sarcopenia.

 

Addressing Sarcopenia

While the age-related decline in muscle size and performance has long been recognized by geriatricians, sustained advocacy by several organizations was required to realize the formal recognition of sarcopenia. Aging in Motion (AIM), a coalition of organizations focused on advancing research and treatment for conditions associated with age-related muscle dysfunction, sought the formal recognition of sarcopenia. The CDC established the ICD-10-CM code for sarcopenia in October of 2016, which allowed the syndrome to be designated as a primary or secondary condition.10

The ubiquitous nature of agerelated changes in muscle and the mandate to engage in proactive care by all levels of VA leadership led to the focus on addressing sarcopenia. The recognition of sarcopenia by the CDC comes at an opportune time given recent VA efforts to transform itself from a facilitator mainly of care delivery to an active partner in fostering the health and well-being of veterans. Initiatives that are emblematic of this attempt to shift the organizational culture across the VHA include establishing the VA Center for Innovation (VACI) and issuing guidance documents, such as the Blueprint for Excellence, which was introduced in 2014 by then VA Secretary Robert McDonald.11,12 Many of the following Blueprint themes and strategies potentially impact sarcopenia screening and treatment within the VA:

  • Delivering high-quality, veteran-centered care: A major Blueprint theme is attaining the “Triple Aims” of a health care system by promoting better health among veterans, improving the provision of care, and lowering costs through operational efficiency. The management of sarcopenia has clear clinical value given the association of age-related muscle loss with fall risk and decreased mobility.13 Financial value also may be associated with the effort to decrease disability related to sarcopenia and the use of a team approach featuring associated health professionals to help screen for this geriatric syndrome.14,15 (Strategy 2)
  • Leveraging health care informatics to optimize individual and population health outcomes: The inclusion of the most basic muscle performance and functional status measures in the electronic medical record (EMR), such as grip strength and gait speed, would help to identify the risk factors and determinants of sarcopenia among the veteran population. (Strategy 3)
  • Advancing personalized, proactive health care that fosters a high level of health and well-being: The long-term promotion of musculoskeletal health and optimal management of sarcopenia cannot be sustained through episodic medical interactions. Instead, a contemporary approach to health services marked by the continuous promotion of health education, physical activity and exercise, and proper nutrition has demonstrated value in the management of chronic conditions.16,17 (Strategy 6)

The new sarcopenia ICD-10-CM code along with elements of the VHA Blueprint can serve to support the systematic assessment and management of veterans with age-related muscle dysfunction. Nevertheless, renewed calls for health promotion and screening programs are often counterbalanced by the need for cost containment and the cautionary tales concerning the potential harms or errors associated with some forms of medical screening. The American Board of Internal Medicine Foundation has spearheaded the Choosing Wisely campaign to raise awareness about excessive medical testing. However, the Institute of Medicine has linked the provision of quality health care to a diagnostic process that is both timely and accurate.18 Careful consideration of these health care challenges may help guide practitioners within the VA concerning the screening and diagnosis of sarcopenia.

Sarcopenia Assessment

Sarcopenia can have several underlying causes in some individuals and result in varied patterns of clinical presentation and differing degrees of severity. The European Working Group on Sarcopenia in Older People first met in 2009 and used a consensus-based decision-making process to determine operational definitions for sarcopenia and create a staging algorithm for the syndrome.19 This consensus group developed a conceptual staging model with 3 categories: presarcopenia, sarcopenia, and severe sarcopenia (Table 1). The impetus for sarcopenia staging was the emerging research findings suggesting that lean body mass (LBM) alone did not provide a high degree of clinical value in outpatient settings due to the nonlinear relationship between LBM and muscle function in older adults.20,21 Using the consensus model approach, an individual is classified as sarcopenic on presenting with both low LBM and low muscle function.

Screening: A Place to Start

Findings from the Health ABC Study suggested that older adults who maintained high levels of LBM were less likely to become sarcopenic. Whereas, older adults in the cohort with low levels of LBM tended to remain in a sarcopenic state.6 Consequently, the early detection of sarcopenia may have important health promotion implications for older adults. Sarcopenia is a syndrome with a continuum of clinical features; it is not a disease with a clear or singular etiololgy. Therefore, the result of the screening examination should identify those who would most benefit from a formal diagnostic assessment.

One approach to screening for sarcopenia involves the use of questionnaires, such as the SARC-F (sluggishness, assistance in walking, rise from a chair, climb stairs, falls), which is a brief 5-item questionnaire with Likert scoring for patient responses.13 In a cohort of National Health and Nutrition Examination Survey (NHANES) participants, SARC-F scores ≥ 4 were associated with slower gait speed, lower strength, and an increased likelihood of hospitalization within a year of the test response.22 However, rather than stratify patients by risk, the SARC-F exhibits a high degree of test specificity regarding the major consensus-based sarcopenia classification criteria (specificity = 94.2% to 99.1%; sensitivity = 3.8% to 9.9%).13 Given the known limitations of screening tools with low sensitivity, organizations such as the ESCEO have recommended supplementing the SARC-F questionnaire with other forms of assessment.4 Supplements to the screening examination may range from the use of “red flag” questions concerning changes in nutritional status, body weight, and physical activity, to conducting standard gait speed and grip-strength testing.4,19,23

Performance-based testing, including habitual gait speed and grip-strength dynamometry, also may be used in both the screening and classification of sarcopenia.2 Although walking speed below 1.0 m/s has been used by the IWGS as a criterion to prompt further assessment, many people within the VA health care system may have gait abnormalities independent of LBM status, and others may be nonambulatory.24,25 As a result, grip-strength testing should be considered as a supplementary or alternate screening assessment tool.26,27

Hand-grip dynamometry is often used diagnostically given its previous test validation, low expense, and ease of use.23 Moreover, recent evidence suggests that muscle strength surpasses gait speed as a means of identifying people with sarcopenia. Grip strength is associated with all-cause mortality, even when adjusting for age, sex, and body size,28 while slow gait speed (< .82 m/s) has a reported sensitivity of 63% and specificity of 70% for mortality in population-based studies involving older adults.29

 

 

Gait speed (in those who are ambulatory) and grip-strength values could be entered into the EMR evaluation note by the primary care provider (PCP). Elements of the VA EMR, such as the ability to review the diagnosis of sarcopenia on the Problem List or the nominal enhancement of providing LBM estimates within the Cumulative Vitals and Measurements Report would support the management of sarcopenia. See Table 2 for cutoff values for frequently used sarcopenia screening and staging tests.

The pitfalls of excessive or inappropriate screening are well documented. The efforts to screen for prostate cancer have highlighted instances when inappropriate followup tests and treatment fail to alter mortality rates and ultimately yield more harm than good.30 However, there are several points of departure concerning the screening for sarcopenia vs screening for prostate cancer. The screening assessments for sarcopenia are low-cost procedures that are associated with a low patient burden. These procedures may include questionnaires, functional testing, or the assessment of muscle performance. Additionally, there is a low propensity for adverse effects stemming from treatment due to disease misclassification given the common nonpharmacologic approaches used to manage sarcopenia.31 Nonetheless, the best screening examination—even one that has low patient burden and cost—may prove to be a poor use of medical resources if the process is not linked to a viable intervention.

Screening people aged ≥ 65 years may strike a balance between controlling health care expenditures and identifying people with the initial signs of sarcopenia early enough to begin monitoring key outcomes and providing a formalized exercise prescription. Presuming an annual age-related decline in LBM of 1.5%, and considering the standard error measurement of the most frequently used methods of strength and LBM assessment, recurrent screening could occur every 2 years.21,32

Earlier screening may be considered for patient populations with a higher pretest probability. These groups include those with conditions associated with accelerated muscle loss, such as chronic kidney disease, peripheral artery disease, and diabetes mellitus (DM).32 Although accelerated muscle loss characterized by an inflammatory motif (eg, cancer-related cachexia) may share some features of the sarcopenia screening and assessment approach, important differences exist regarding the etiology, medical evaluation, and ICD-10-CM code designation.

Staging and Classification

Staging criteria are generally used to denote the severity of a given disease or syndrome, whereas classification criteria are used to define homogenous patient groups based on specific pathologic or clinical features of a disorder. Although classification schemes may incorporate an element of severity, they are primarily used to characterize fairly distinct phenotypic forms of disease or specific clinical presentation patterns associated with a well-defined syndrome. Although not universally adopted, the European consensus group sarcopenia staging criteria are increasingly used to provide a staging algorithm presumably driven by the severity of the condition.19

The assessment of functional performance for use in sarcopenia staging often involves measuring habitual gait speed or completing the Short Physical Performance Battery (SPPB).23 The SPPB involves a variety of performance-based activities for balance, gait, strength, and endurance. This test has predictive validity for the onset of disability and adverse health events, and it has been extensively used in research and clinical settings.33 Additional tests used to characterize function during the staging or diagnostic process include the timed get up and go test (TGUG) and the timed sit to stand test.34,35 The TGUG provides an estimate of dynamic balance, and the sit to stand test has been used as very basic proxy measure of muscular power.36 The sit to stand test and habitual gait speed are items included in the SPPB.33

Accepted methods to obtain the traditional index measure of sarcopenia—based on estimates of LBM—include bioimpedance analysis (BIA) and dual X-ray absorptiometry (DXA). The BIA uses the electrical impedance of body tissues and its 2 components, resistance and reactance, to derive its body composition estimates.37 Segmental BIA allows for isolated measurements of the limbs, which may be calibrated to DXA appendicular lean body mass (ALM) or magnetic resonance imaging-based estimates of LBM. This instrument is relatively safe for use, inexpensive for medical facilities, and useful for longitudinal studies, but it can be confounded by issues, such as varying levels of hydration, which may affect measurement validity in some instances.

Despite the precision of DXA for estimating densities for whole body composition analysis, the equipment is not very portable and involves low levels of radiation exposure, which limits its utility in some clinical settings. While each body composition assessment method has its advantages and disadvantages, DXA is regarded as an acceptable form of measurement for hospital settings, and BIA is frequently used in outpatient clinics and community settings. Other methods used to estimate LBM with greater accuracy, such as peripheral quantitative computed tomography, doubly labeled water, and whole body gamma ray counting, are not viable for clinical use. Other accessible methods such as anthropometric measures and skinfold measures have not been embraced by sarcopenia classification consensus groups.23,37

Alternative methods of estimating LBM, such as diagnostic ultrasound and multifrequency electrical impedance myography, are featured outcomes in ongoing clinical trials that involve veteran participants. These modalities may soon provide a clinically viable approach to assessing muscle quality via estimates of muscle tissue composition.37,38 Similar to the management of other geriatric syndromes, interprofessional collaboration provides an optimal approach to the assessment of sarcopenia. Physicians and other health care providers may draw on the standardized assessment of strength and function (via the SPPB and hand-grip dynamometry) by physical therapists (PTs), questionnaires administered by nursing staff (the SARC-F), or body composition estimates from other health professionals (ranging from BIA to DXA) to aid the diagnostic process and facilitate appropriate case management (Table 2).

Competing staging and classification definitions have been cited as a primary factor behind the CDC’s delayed recognition of the sarcopenia diagnosis, which in turn posed a barrier to formal clinical recognition by geriatricians.24 However, this reaction to the evolving sarcopenia staging criteria also may reveal the larger misapplication of the staging process to the diagnostic process. The application of classification and staging criteria results in a homogenous group of patients, whereas the application of diagnostic criteria results in a heterogeneous group of patients to account for variations in clinical presentation associated with a given disorder. Classification criteria may be equivalent to objective measures that are used in the diagnostic process when a given disease is characterized by a well-established biomarker.39

However, this is not the case for most geriatric syndromes and other disorders marked by varied clinical presentation patterns. On considering the commonly used sarcopenia staging criteria of LBM ≤ 8.50 kg/m2 or grip strength < 30 kg in men and LBM ≤ 5.75 kg/m2 or grip strength < 20 kg in women, it is easy to understand that such general cutoff values are far from diagnostic.40,41 Moreover, stringent cutoff values associated with classification and staging may not adequately capture those with an atypical presentation of the syndrome (eg, someone who exhibits age-related muscle weakness but has retained adequate LBM). Such criteria often prove to have high specificity and low sensitivity, which may yield a false negative rate that is appropriate for clinical research eligibility and group assignment but inadequate for clinical care.

Screening, staging, and classification criteria with high specificity may indeed be desirable for confirmatory imaging tests associated with radiation exposure concerns or for managing risk in experimental clinical trials involving pharmacologic treatment. For example, a SARC-F score ≥ 4 may prompt the formal assessment of LBM via a DXA examination.4 In contrast, those with a SARC-F score ≤ 3 with low gait speed or grip strength may benefit from consultation regarding regular physical activity and nutrition recommendations. Given the challenges of establishing sarcopenia classification criteria that perform consistently across populations and geographic regions, classification and staging criteria may be best viewed as clinical reasoning tools that supplement, but not supplant, the diagnostic process.7,42

Diagnosis

Geriatric syndromes do not lend themselves to a simple diagnostic process. Syndromes such as frailty and sarcopenia are multifactorial and lack a single distinguishing clinical feature or biomarker. The oft-cited refrain that sarcopenia is an underdiagnosed condition is partially explained by the recent ICD-10-CM code and varied classification and diagnostic criteria.5 This circumstance highlights the need to distinctly contrast the diagnostic process with the screening and staging classifications.

The diagnostic process involves the interpretation of the patient history, signs, and symptoms within the context of individual factors, local or regional disease prevalence, and the results of the best available and most appropriate laboratory tests. After all, a patient that presents with low LBM and a gradual loss of strength without a precipitating event would necessitate further workup to rule out many clinical possibilities under the aegis of a differential diagnosis. Clinical features, such as the magnitude of weakness and pattern of strength loss or muscle atrophy along with the determination of neurologic or autoimmune involvement, are among the key elements of the differential examination for a case involving the observation of frank muscle weakness. Older adults with low muscle strength may have additional risk factors for sarcopenia such as obesity, pain, poor nutrition, previous bone fracture, and a sedentary lifestyle. However, disease etiology with lower probabilities, such as myogenic or neurogenic conditions associated with advancing age, also may be under consideration during the clinical assessment.6

In many instances, the cutoff scores associated with the sarcopenia staging criteria may help to guide the diagnostic process and aid clinical decision making. Since individuals with a positive screening result based on the SARC-F questionnaire (score ≥ 4) have a high likelihood of meeting the staging criteria for severe sarcopenia, a PCP may opt to obtain a confirmatory estimate of LBM both to support the clinical assessment and to monitor change over the course of rehabilitation. Whereas people who present with a decline in strength (ie, grip strength < 30 kg for a male) without an observable loss of function or a positive SARC-F score may benefit from consultation from the physician, NP, or rehabilitation health professional regarding modifiable risk factors associated with sarcopenia.

Incorporating less frequently used sarcopenia classification schemes such as identifying those with sarcopenic obesity or secondary sarcopenia due to mitigating factors such as chronic kidney disease or DM (Table 3) may engender a more comprehensive approach to intervention that targets the primary disease while also addressing important secondary sequelae. Nevertheless, staging or classification criteria cannot be deemed equivalent to diagnostic criteria for sarcopenia due to the challenges posed by syndromes that have a heterogeneous clinical presentation.

The refinement of the staging and classification criteria along with the advances in imaging technology and mechanistic research are not unique to sarcopenia. Practitioners involved in the care of people with rheumatologic conditions or osteoporosis also have contended with continued refinements to their classification criteria and approach to risk stratification.39,43,44 Primary care providers will now have the option to use a new ICD-10-CM code (M62.84) for sarcopenia, which will allow them to properly document the clinical distinctions between people with impaired strength or function largely due to age-related muscle changes and those who have impaired muscle function due to cachexia, inflammatory myopathies, or forms of neuromuscular disease.

 

 

The ability to identify and document this geriatric syndrome in veterans will help to better define the scope of the problem within the VA health care system. The median age of veterans is 62 years compared with 43 years for nonveterans.3 Consequently, there may be value in the adoption of a formal approach to screening and diagnosis for sarcopenia among veterans who receive their primary care from VA facilities.7 Indeed, the exchange between the patient and the health professional regarding the screening and diagnostic process will provide valuable opportunities to promote exercise interventions before patients incur significant impairments.

One of the biggest threats burdening global health is noncommunicable diseases, and many chronic conditions, such as sarcopenia, can be prevented and managed with appropriate levels of physical activity.17 Increased physician involvement may prove to be critical given the identification of physical inactivity as a top 5 risk factor for general morbidity and mortality by World Health Organization and consensus group recommendations calling for physicians to serve a more prominent role in the provision of exercise and physical activity recommendations.16,17

This developing health care role should include NPs, PTs, physician assistants, and other associated health professionals. It also should include collaborative efforts between physicians and rehabilitation practitioners concerning provision of the formal exercise prescriptionprescription and monitoring of patient outcomes.

Individuals with severe forms of sarcopenia rarely improve without intervention.6 Although no pharmacologic treatment exists to specifically address sarcopenia, strengthening exercise has been shown to be an effective mode of prevention and conservative management.8 Progressive resistance exercise cannot abate the expected age-related changes in skeletal muscle, but it can significantly reverse the loss of LBM and strength in untrained older adults and slow the age-related decline in muscle performance in older adult athletes and trained individuals.45

Local senior centers and community organizations may prove to be valuable resources concerning group exercise options, and they provide the added benefit of social engagement and peer group accountability. Federal resources include the Go4Life exercise guide and online videos provided by the National Institute on Aging and the MOVE! Weight Management and Health Program provided at select VA community-based outpatient clinics. Ultimately, collaborative efforts with exercise specialists may serve to reduce the PCP burden during the provision of health services, minimize diagnostic errors associated with sarcopenia assessment and help to connect patients to valuable health promotion resources.17,18

Conclusion

While practitioners should remain keenly aware of the pernicious effects of overdiagnosis, sarcopenia has long existed as a known, but undiagnosed, condition. Of course, geriatricians have traditionally managed poor muscle performance and mobility limitations by addressing treatable symptoms and providing referrals to physical medicine specialists when warranted. Nevertheless, the advent of ICD-10-CM code M62.84 provides the VA with an opportunity to take a leading role in systematically addressing this geriatric syndrome within an aging veteran population.

The following items should be considered by NCP for the development of guidelines and recommendations concerning sarcopenia screening:

  1. Consider screening veterans aged > 65 years for sarcopenia every 2 years. Those with mitigating systemic conditions (eg, chronic kidney disease, DM, or malnutrition) or significant mobility limitations may be screened at any age.
  2. Sarcopenia screening procedures should include at a minimum the SARC-F questionnaire and gait speed (when appropriate). Including gait speed or grip strength testing in the screening exam is recommended given the low sensitivity of the SARC-F questionnaire.
  3. Veterans with positive SARC-F results (≥ 4) merit a physical therapy referral. In addition, these veterans should obtain confirmatory standardized assessments for LBM and functional status.
  4. Veterans at risk for sarcopenia based on patient age, medical history, and the physical examination (eg, obesity, sedentary lifestyle, a previous fracture, self-reported physical decline), but with negative SARC-F results should receive a formal exercise prescription from their PCP. Baseline assessment measures may be used for comparison with serial measures obtained during subsequent screening visits to support long-term case management.
  5. Interprofessional collaboration involving geriatricians, PTs, nurses, radiologists, and other health care professionals should be involved in the screening, diagnosis, and case management of veterans with sarcopenia.
  6. The VA EMR should be systematically documented with sarcopenia assessment data obtained from the gait speed tests, SARCF, SPPB, grip strength tests, and LBM estimates to better characterize this condition within the veteran population.

Any expansion in the provision of health care comes with anticipated benefits and potential costs. Broad guidance from NCP may encourage veterans to pursue selected screening tests, promote the appropriate use of preventative services, and facilitate timely treatment when needed.31 Clinicians who are informed about the screening, staging, classification, and diagnostic process for sarcopenia may partner with patients to make reasoned decisions about how to best manage this syndrome within the VA medical center environment.

References

1. Newman AB, Kupelian V, Visser M, et al; Health ABC Study Investigators. Sarcopenia: alternative definitions and associations with lower extremity function. J Am Geriatr Soc. 2003;51(11):1602-1609.

2. Cruz-Jentoft AJ, Landi F, Schneider SM, et al. Prevalence of and interventions for sarcopenia in ageing adults: a systematic review. Report of the International Sarcopenia Initiative (EWGSOP and IWGS). Age Ageing. 2014;43(6):748-759.

3. U.S. Department of Veterans Affairs, National Center for Veterans Analysis and Statistics. Profile of veterans: 2009. Data from the American Community Survey. http://www.va.gov/vetdata/docs/SpecialReports/Profile_of_Veterans_2009_FINAL.pdf. Published January 2011. Accessed May 18, 2017.

4. Beaudart C, McCloskey E, Bruyère O, et al. Sarcopenia in daily practice: assessment and management. BMC Geriatr. 2016;16(1):170.

5. Fielding RA, Vellas B, Evans WJ, et al. Sarcopenia: an undiagnosed condition in older adults. Current consensus definition: prevalence, etiology, and consequences. International working group on sarcopenia. J Am Med Dir Assoc. 2011;12(4):249-256.

6. Murphy RA, Ip EH, Zhang Q, et al; Health, Aging, and Body Composition Study. Transition to sarcopenia and determinants of transitions in older adults: a population-based study. J Gerontol A Biol Sci Med Sci. 2014;69(6):751-758.

7. Harris-Love MO, Adams B, Hernandez HJ, DiPietro L, Blackman MR. Disparities in the consequences of sarcopenia: implications for African American veterans. Front Physiol. 2014;5:250.

8. Morley JE. Sarcopenia in the elderly. Fam Pract. 2012;29(suppl 1):i44-i48.

9. Fragala MS, Dam TT, Barber V, et al. Strength and function response to clinical interventions of older women categorized by weakness and low lean mass using classifications from the Foundation for the National Institute of Health sarcopenia project. J Gerontol A Biol Sci Med Sci. 2015;70(2):202-209.

10. Aging in Motion. AIM coalition announces establishment of ICD-10-CM Code for Sarcopenia
by the Centers for Disease Control and Prevention [press release]. http://aginginmotion.org/news/2388-2/. Published April 28, 2016. Accessed June 7, 2017.

11. U.S. Department of Veterans Affairs, Veterans Health Administration. Blueprint for excellence. https://www.va.gov/HEALTH/docs/VHA _Blueprint_for_Excellence.pdf. Published September 21, 2014. Accessed June 7, 2017.

12. U.S. Department of Veterans Affairs. VA Center of Innovation 2010–2012 stakeholder report. https://www.innovation.va.gov/docs/VACI_2010-2012_Stakeholder_Report.pdf. Published 2012. Accessed June 14, 2017.

13. Woo J, Leung J, Morley JE. Validating the SARCF: a suitable community screening tool for sarcopenia? J Am Med Dir Assoc. 2014;15(9):630-634.

14. Sousa AS, Guerra RS, Fonseca I, Pichel F, Ferreira S, Amaral TF. Financial impact of sarcopenia on hospitalization costs. Eur J Clin Nutr. 2016;70(9):1046-1051.

15. Janssen I, Shepard DS, Katzmarzyk PT, Roubenoff R. The healthcare costs of sarcopenia in the United States. J Am Geriatr Soc. 2004;52(1):80-85.

16. Ekelund U, Steene-Johannessen J, Brown WJ, et al; Lancet Physical Activity Series 2 Executive Committe; Lancet Sedentary Behaviour Working Group. Does physical activity attenuate, or even eliminate, the detrimental association of sitting time with mortality? A harmonised meta-analysis of data from more than 1 million men and women. Lancet. 2016;388(10051):1302-1310.

17. Thornton JS, Frémont P, Khan K, et al. Physical activity prescription: a critical opportunity to address a modifiable risk factor for the prevention and management of chronic disease: a position statement by the Canadian Academy of Sport and Exercise Medicine. Clin J Sport Med.
2016;26(4):259-265.

18. The National Academies of Sciences, Engineering, and Medicine; Committee on Diagnostic Error in Health Care, Board on Health Care Services; Institute of Medicine. Improving Diagnosis in Health Care. Washington, DC: National Academies Press;2015.

19. Cruz-Jentoft AJ, Baeyens JP, Bauer JM, et al; European Working Group on Sarcopenia in Older People. Sarcopenia: European consensus on definition and diagnosis: report of the European Working Group on Sarcopenia in Older People. Age Ageing. 2010;39(4):412-423.

20. Ferrucci L, Guralnik JM, Buchner D, et al. Departures from linearity in the relationship between measures of muscular strength and physical performance of the lower extremities: the Women’s Health and Aging Study. J Gerontol A Biol Sci Med Sci. 1997;52(5):M275-M285.

21. Goodpaster BH, Park SW, Harris TB, et al. The loss of skeletal muscle strength, mass, and quality in older adults: the Health, Aging and Body Composition Study. J Gerontol A Biol Sci Med Sci. 2006;61(10):1059-1064.

22. Malmstrom TK, Miller DK, Simonsick EM, Ferrucci L, Morley JE. SARC-F: a symptom score to predict persons with sarcopenia at risk for poor functional outcomes. J Cachexia Sarcopenia Muscle. 2016;7(1):28-36.

23. Cooper C, Fielding R, Visser M, et al. Tools in the assessment of sarcopenia. Calcif Tissue Int. 2013;93(3):201-210.

24. Lee WJ, Liu LK, Peng LN, Lin MH, Chen LK; ILAS Research Group. Comparisons of sarcopenia defined by IWGS and EWGSOP criteria among older people: results from the I-Lan longitudinal aging study. J Am Med Dir Assoc. 2013;14(7):528.e1-e7.

25. Cesari M, Kritchevsky SB, Penninx BW, et al. Prognostic value of usual gait speed in well-functioning  older people—results from the Health, Aging and Body Composition Study. J Am Geriatr Soc. 2005;53(10):1675-1680.

26. Rossi AP, Fantin F, Micciolo R, et al. Identifying sarcopenia in acute care setting patients. J Am Med Dir Assoc. 2014;15(4):303.e7-e12.

27. Sánchez-Rodríguez D, Marco E, Miralles R, et al. Does gait speed contribute to sarcopenia casefinding in a postacute rehabilitation setting? Arch Gerontol Geriatr. 2015;61(2):176-181.

28. Strand BH, Cooper R, Bergland A, et al. The association of grip strength from midlife onwards with all-cause and cause-specific mortality over 17 years of follow-up in the Tromsø Study. J Epidemiol Community Health. 2016;70:1214-1221.

29. Stanaway FF, Gnjidic D, Blyth FM, et al. How fast does the Grim Reaper walk? Receiver operating characteristics curve analysis in healthy men aged 70 and over. BMJ. 2011;343:d7679.

30. Reiter RE. Risk stratification of prostate cancer 2016. Scand J Clin Lab Invest Suppl.  2016;245:S54-S59.

31. U.S. Department of Veterans Affairs, National Center for Health Promotion and Disease Prevention. Get recommended screening tests and immunizations. https://www.prevention.va.gov/Healthy_Living/Get_Recommended_Screening_Tests_and_Immunizations.asp. Updated September 9, 2016. Accessed June 7, 2017.

32. Buford TW, Anton SD, Judge AR, et al. Models of accelerated sarcopenia: critical pieces for solving the puzzle of age-related muscle atrophy. Ageing Res Rev. 2010;9(4):369-383.

33. Guralnik JM, Simonsick EM, Ferrucci L, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol. 1994;49(2):M85-M94.

34. Daubney ME, Culham EG. Lower-extremity muscle force and balance performance in adults aged 65 years and older. Phys Ther. 1999;79(12):1177-1185.

35. Bohannon RW. Reference values for the fiverepetition sit-to-stand test: a descriptive metaanalysis of data from elders. Percept Mot Skills. 2006;103(1):215-222.

36. Correa-de-Araujo R, Harris-Love MO, Miljkovic I, Fragala MS, Anthony BW, Manini TM. The need for standardized assessment of muscle quality in skeletal muscle function deficit and other agingrelated muscle dysfunctions: a symposium report. Front Physiol. 2017;8:87.

37. Heymsfield SB, Gonzalez MC, Lu J, Jia G, Zheng J. Skeletal muscle mass and quality: evolution of modern measurement concepts in the context of sarcopenia. Proc Nutr Soc. 2015;74(4):355-366.

38. Harris-Love MO, Monfaredi R, Ismail C, Blackman MR, Cleary K. Quantitative ultrasound: measurement considerations for the assessment of muscular dystrophy and sarcopenia. Front Aging Neurosci. 2014;6:172.

39. Fries JF, Hochberg MC, Medsger TA Jr, Hunder GG, Bombardier C. Criteria for rheumatic disease. Different types and different functions. The American College of Rheumatology Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum. 1994;37(4):454-462.

40. Janssen I, Baumgartner RN, Ross R, Rosenberg IH, Roubenoff R. Skeletal muscle cutpoints associated with elevated physical disability risk in older men and women. Am J Epidemiol.
2004;159(4):413-421.

41. Ismail C, Zabal J, Hernandez HJ, et al. Diagnostic ultrasound estimates of muscle mass and muscle quality discriminate between women with and without sarcopenia. Front Physiol. 2015;6:302.

42. Chen LK, Liu LK, Woo J, et al. Sarcopenia in Asia: consensus report of the Asian Working Group for Sarcopenia. J Am Med Dir Assoc. 2014;15(2):95-101.

43. Aggarwal R, Ringold S, Khanna D, et al. Distinctions between diagnostic and classification  criteria? Arthritis Care Res (Hoboken). 2015;67(7):891-897.

44. Licata A. Bone density vs bone quality: what’s a clinician to do? Cleve Clin J Med. 2009;76(6):331-336.

45. Pollock ML, Mengelkoch LJ, Graves JE, et al. Twenty-year follow-up of aerobic power and body composition of older track athletes. J Appl Physiol. 1997;82(5):1508-1516.

Article PDF
Author and Disclosure Information

Ms. Falcon is an exercise science intern in the Physical Medicine & Rehabilitation Service, and Dr. Harris-Love is the associate director of the Human Performance Research Unit in the Clinical Research Center, both at the Washington DC VAMC. Dr. Harris-Love also is an associate clinical professor at the George Washington University Milken Institute School of Public Health, and Ms. Falcon is a program coordinator with the Children’s National Health System, both in Washington DC.

Author disclosures
The authors report no actual or potential conflicts of interest with regard to this article.

Disclaimer
The opinions expressed herein are those of the authors and do not necessarily reflect those of
Federal Practitioner, Frontline Medical Communications Inc., the U.S. Government, or any of its agencies. This article may discuss unlabeled or investigational use of certain drugs. Please review the complete prescribing information for specific drugs or drug combinations—including indications, contraindications, warnings, and adverse effects—before administering pharmacologic therapy to patients.

Issue
Federal Practitioner - 34(7)
Publications
Topics
Page Number
24-32
Sections
Author and Disclosure Information

Ms. Falcon is an exercise science intern in the Physical Medicine & Rehabilitation Service, and Dr. Harris-Love is the associate director of the Human Performance Research Unit in the Clinical Research Center, both at the Washington DC VAMC. Dr. Harris-Love also is an associate clinical professor at the George Washington University Milken Institute School of Public Health, and Ms. Falcon is a program coordinator with the Children’s National Health System, both in Washington DC.

Author disclosures
The authors report no actual or potential conflicts of interest with regard to this article.

Disclaimer
The opinions expressed herein are those of the authors and do not necessarily reflect those of
Federal Practitioner, Frontline Medical Communications Inc., the U.S. Government, or any of its agencies. This article may discuss unlabeled or investigational use of certain drugs. Please review the complete prescribing information for specific drugs or drug combinations—including indications, contraindications, warnings, and adverse effects—before administering pharmacologic therapy to patients.

Author and Disclosure Information

Ms. Falcon is an exercise science intern in the Physical Medicine & Rehabilitation Service, and Dr. Harris-Love is the associate director of the Human Performance Research Unit in the Clinical Research Center, both at the Washington DC VAMC. Dr. Harris-Love also is an associate clinical professor at the George Washington University Milken Institute School of Public Health, and Ms. Falcon is a program coordinator with the Children’s National Health System, both in Washington DC.

Author disclosures
The authors report no actual or potential conflicts of interest with regard to this article.

Disclaimer
The opinions expressed herein are those of the authors and do not necessarily reflect those of
Federal Practitioner, Frontline Medical Communications Inc., the U.S. Government, or any of its agencies. This article may discuss unlabeled or investigational use of certain drugs. Please review the complete prescribing information for specific drugs or drug combinations—including indications, contraindications, warnings, and adverse effects—before administering pharmacologic therapy to patients.

Article PDF
Article PDF
Related Articles
The CDC recently recognized sarcopenia as a reportable medical condition necessitating better screening and diagnosis of this geriatric syndrome.
The CDC recently recognized sarcopenia as a reportable medical condition necessitating better screening and diagnosis of this geriatric syndrome.

Sarcopenia is an age-related loss of skeletal muscle that may result in diminished muscle strength and functional performance. The prevalence of sarcopenia varies based on the cohort and the assessment criteria. According to the Health Aging and Body Composition (ABC) study, the prevalence of sarcopenia in community-dwelling older adults is about 14% to 18%, whereas the estimate may exceed 30% for those in longterm care.1,2 This geriatric syndrome may disproportionately affect veterans given that they are older than the civilian population and may have disabling comorbid conditions associated with military service.3

Recently, there has been a call to action to systematically address sarcopenia by interdisciplinary organizations such as the European Society for Clinical and Economic Aspects of Osteoporosis and Osteoarthritis (ESCEO) and the International Working Group on Sarcopenia (IWGS).4,5 This call to action is due to the association of sarcopenia with increased health care costs, higher disability incidence, and elevated risk of mortality.6,7 The consequences of sarcopenia may include serious complications, such as hip fracture or a loss of functional independence.8,9 The CDC now recognizes sarcopenia as an independently reportable medical condition. Consequently, physicians, nurse practitioners (NPs), and other associated health professionals within the VA will need to better understand clinically viable and valid methods to screen and diagnose this geriatric syndrome.

The purpose of this paper is to inform practitioners how sarcopenia screening is aided by the new ICD-10-CM code and briefly review recent VA initiatives for proactive care. Additional objectives include identifying common methods used to assess sarcopenia and providing general recommendations to the VHA National Center for Health Promotion and Disease Prevention (NCP) concerning the management of sarcopenia.

 

Addressing Sarcopenia

While the age-related decline in muscle size and performance has long been recognized by geriatricians, sustained advocacy by several organizations was required to realize the formal recognition of sarcopenia. Aging in Motion (AIM), a coalition of organizations focused on advancing research and treatment for conditions associated with age-related muscle dysfunction, sought the formal recognition of sarcopenia. The CDC established the ICD-10-CM code for sarcopenia in October of 2016, which allowed the syndrome to be designated as a primary or secondary condition.10

The ubiquitous nature of agerelated changes in muscle and the mandate to engage in proactive care by all levels of VA leadership led to the focus on addressing sarcopenia. The recognition of sarcopenia by the CDC comes at an opportune time given recent VA efforts to transform itself from a facilitator mainly of care delivery to an active partner in fostering the health and well-being of veterans. Initiatives that are emblematic of this attempt to shift the organizational culture across the VHA include establishing the VA Center for Innovation (VACI) and issuing guidance documents, such as the Blueprint for Excellence, which was introduced in 2014 by then VA Secretary Robert McDonald.11,12 Many of the following Blueprint themes and strategies potentially impact sarcopenia screening and treatment within the VA:

  • Delivering high-quality, veteran-centered care: A major Blueprint theme is attaining the “Triple Aims” of a health care system by promoting better health among veterans, improving the provision of care, and lowering costs through operational efficiency. The management of sarcopenia has clear clinical value given the association of age-related muscle loss with fall risk and decreased mobility.13 Financial value also may be associated with the effort to decrease disability related to sarcopenia and the use of a team approach featuring associated health professionals to help screen for this geriatric syndrome.14,15 (Strategy 2)
  • Leveraging health care informatics to optimize individual and population health outcomes: The inclusion of the most basic muscle performance and functional status measures in the electronic medical record (EMR), such as grip strength and gait speed, would help to identify the risk factors and determinants of sarcopenia among the veteran population. (Strategy 3)
  • Advancing personalized, proactive health care that fosters a high level of health and well-being: The long-term promotion of musculoskeletal health and optimal management of sarcopenia cannot be sustained through episodic medical interactions. Instead, a contemporary approach to health services marked by the continuous promotion of health education, physical activity and exercise, and proper nutrition has demonstrated value in the management of chronic conditions.16,17 (Strategy 6)

The new sarcopenia ICD-10-CM code along with elements of the VHA Blueprint can serve to support the systematic assessment and management of veterans with age-related muscle dysfunction. Nevertheless, renewed calls for health promotion and screening programs are often counterbalanced by the need for cost containment and the cautionary tales concerning the potential harms or errors associated with some forms of medical screening. The American Board of Internal Medicine Foundation has spearheaded the Choosing Wisely campaign to raise awareness about excessive medical testing. However, the Institute of Medicine has linked the provision of quality health care to a diagnostic process that is both timely and accurate.18 Careful consideration of these health care challenges may help guide practitioners within the VA concerning the screening and diagnosis of sarcopenia.

Sarcopenia Assessment

Sarcopenia can have several underlying causes in some individuals and result in varied patterns of clinical presentation and differing degrees of severity. The European Working Group on Sarcopenia in Older People first met in 2009 and used a consensus-based decision-making process to determine operational definitions for sarcopenia and create a staging algorithm for the syndrome.19 This consensus group developed a conceptual staging model with 3 categories: presarcopenia, sarcopenia, and severe sarcopenia (Table 1). The impetus for sarcopenia staging was the emerging research findings suggesting that lean body mass (LBM) alone did not provide a high degree of clinical value in outpatient settings due to the nonlinear relationship between LBM and muscle function in older adults.20,21 Using the consensus model approach, an individual is classified as sarcopenic on presenting with both low LBM and low muscle function.

Screening: A Place to Start

Findings from the Health ABC Study suggested that older adults who maintained high levels of LBM were less likely to become sarcopenic. Whereas, older adults in the cohort with low levels of LBM tended to remain in a sarcopenic state.6 Consequently, the early detection of sarcopenia may have important health promotion implications for older adults. Sarcopenia is a syndrome with a continuum of clinical features; it is not a disease with a clear or singular etiololgy. Therefore, the result of the screening examination should identify those who would most benefit from a formal diagnostic assessment.

One approach to screening for sarcopenia involves the use of questionnaires, such as the SARC-F (sluggishness, assistance in walking, rise from a chair, climb stairs, falls), which is a brief 5-item questionnaire with Likert scoring for patient responses.13 In a cohort of National Health and Nutrition Examination Survey (NHANES) participants, SARC-F scores ≥ 4 were associated with slower gait speed, lower strength, and an increased likelihood of hospitalization within a year of the test response.22 However, rather than stratify patients by risk, the SARC-F exhibits a high degree of test specificity regarding the major consensus-based sarcopenia classification criteria (specificity = 94.2% to 99.1%; sensitivity = 3.8% to 9.9%).13 Given the known limitations of screening tools with low sensitivity, organizations such as the ESCEO have recommended supplementing the SARC-F questionnaire with other forms of assessment.4 Supplements to the screening examination may range from the use of “red flag” questions concerning changes in nutritional status, body weight, and physical activity, to conducting standard gait speed and grip-strength testing.4,19,23

Performance-based testing, including habitual gait speed and grip-strength dynamometry, also may be used in both the screening and classification of sarcopenia.2 Although walking speed below 1.0 m/s has been used by the IWGS as a criterion to prompt further assessment, many people within the VA health care system may have gait abnormalities independent of LBM status, and others may be nonambulatory.24,25 As a result, grip-strength testing should be considered as a supplementary or alternate screening assessment tool.26,27

Hand-grip dynamometry is often used diagnostically given its previous test validation, low expense, and ease of use.23 Moreover, recent evidence suggests that muscle strength surpasses gait speed as a means of identifying people with sarcopenia. Grip strength is associated with all-cause mortality, even when adjusting for age, sex, and body size,28 while slow gait speed (< .82 m/s) has a reported sensitivity of 63% and specificity of 70% for mortality in population-based studies involving older adults.29

 

 

Gait speed (in those who are ambulatory) and grip-strength values could be entered into the EMR evaluation note by the primary care provider (PCP). Elements of the VA EMR, such as the ability to review the diagnosis of sarcopenia on the Problem List or the nominal enhancement of providing LBM estimates within the Cumulative Vitals and Measurements Report would support the management of sarcopenia. See Table 2 for cutoff values for frequently used sarcopenia screening and staging tests.

The pitfalls of excessive or inappropriate screening are well documented. The efforts to screen for prostate cancer have highlighted instances when inappropriate followup tests and treatment fail to alter mortality rates and ultimately yield more harm than good.30 However, there are several points of departure concerning the screening for sarcopenia vs screening for prostate cancer. The screening assessments for sarcopenia are low-cost procedures that are associated with a low patient burden. These procedures may include questionnaires, functional testing, or the assessment of muscle performance. Additionally, there is a low propensity for adverse effects stemming from treatment due to disease misclassification given the common nonpharmacologic approaches used to manage sarcopenia.31 Nonetheless, the best screening examination—even one that has low patient burden and cost—may prove to be a poor use of medical resources if the process is not linked to a viable intervention.

Screening people aged ≥ 65 years may strike a balance between controlling health care expenditures and identifying people with the initial signs of sarcopenia early enough to begin monitoring key outcomes and providing a formalized exercise prescription. Presuming an annual age-related decline in LBM of 1.5%, and considering the standard error measurement of the most frequently used methods of strength and LBM assessment, recurrent screening could occur every 2 years.21,32

Earlier screening may be considered for patient populations with a higher pretest probability. These groups include those with conditions associated with accelerated muscle loss, such as chronic kidney disease, peripheral artery disease, and diabetes mellitus (DM).32 Although accelerated muscle loss characterized by an inflammatory motif (eg, cancer-related cachexia) may share some features of the sarcopenia screening and assessment approach, important differences exist regarding the etiology, medical evaluation, and ICD-10-CM code designation.

Staging and Classification

Staging criteria are generally used to denote the severity of a given disease or syndrome, whereas classification criteria are used to define homogenous patient groups based on specific pathologic or clinical features of a disorder. Although classification schemes may incorporate an element of severity, they are primarily used to characterize fairly distinct phenotypic forms of disease or specific clinical presentation patterns associated with a well-defined syndrome. Although not universally adopted, the European consensus group sarcopenia staging criteria are increasingly used to provide a staging algorithm presumably driven by the severity of the condition.19

The assessment of functional performance for use in sarcopenia staging often involves measuring habitual gait speed or completing the Short Physical Performance Battery (SPPB).23 The SPPB involves a variety of performance-based activities for balance, gait, strength, and endurance. This test has predictive validity for the onset of disability and adverse health events, and it has been extensively used in research and clinical settings.33 Additional tests used to characterize function during the staging or diagnostic process include the timed get up and go test (TGUG) and the timed sit to stand test.34,35 The TGUG provides an estimate of dynamic balance, and the sit to stand test has been used as very basic proxy measure of muscular power.36 The sit to stand test and habitual gait speed are items included in the SPPB.33

Accepted methods to obtain the traditional index measure of sarcopenia—based on estimates of LBM—include bioimpedance analysis (BIA) and dual X-ray absorptiometry (DXA). The BIA uses the electrical impedance of body tissues and its 2 components, resistance and reactance, to derive its body composition estimates.37 Segmental BIA allows for isolated measurements of the limbs, which may be calibrated to DXA appendicular lean body mass (ALM) or magnetic resonance imaging-based estimates of LBM. This instrument is relatively safe for use, inexpensive for medical facilities, and useful for longitudinal studies, but it can be confounded by issues, such as varying levels of hydration, which may affect measurement validity in some instances.

Despite the precision of DXA for estimating densities for whole body composition analysis, the equipment is not very portable and involves low levels of radiation exposure, which limits its utility in some clinical settings. While each body composition assessment method has its advantages and disadvantages, DXA is regarded as an acceptable form of measurement for hospital settings, and BIA is frequently used in outpatient clinics and community settings. Other methods used to estimate LBM with greater accuracy, such as peripheral quantitative computed tomography, doubly labeled water, and whole body gamma ray counting, are not viable for clinical use. Other accessible methods such as anthropometric measures and skinfold measures have not been embraced by sarcopenia classification consensus groups.23,37

Alternative methods of estimating LBM, such as diagnostic ultrasound and multifrequency electrical impedance myography, are featured outcomes in ongoing clinical trials that involve veteran participants. These modalities may soon provide a clinically viable approach to assessing muscle quality via estimates of muscle tissue composition.37,38 Similar to the management of other geriatric syndromes, interprofessional collaboration provides an optimal approach to the assessment of sarcopenia. Physicians and other health care providers may draw on the standardized assessment of strength and function (via the SPPB and hand-grip dynamometry) by physical therapists (PTs), questionnaires administered by nursing staff (the SARC-F), or body composition estimates from other health professionals (ranging from BIA to DXA) to aid the diagnostic process and facilitate appropriate case management (Table 2).

Competing staging and classification definitions have been cited as a primary factor behind the CDC’s delayed recognition of the sarcopenia diagnosis, which in turn posed a barrier to formal clinical recognition by geriatricians.24 However, this reaction to the evolving sarcopenia staging criteria also may reveal the larger misapplication of the staging process to the diagnostic process. The application of classification and staging criteria results in a homogenous group of patients, whereas the application of diagnostic criteria results in a heterogeneous group of patients to account for variations in clinical presentation associated with a given disorder. Classification criteria may be equivalent to objective measures that are used in the diagnostic process when a given disease is characterized by a well-established biomarker.39

However, this is not the case for most geriatric syndromes and other disorders marked by varied clinical presentation patterns. On considering the commonly used sarcopenia staging criteria of LBM ≤ 8.50 kg/m2 or grip strength < 30 kg in men and LBM ≤ 5.75 kg/m2 or grip strength < 20 kg in women, it is easy to understand that such general cutoff values are far from diagnostic.40,41 Moreover, stringent cutoff values associated with classification and staging may not adequately capture those with an atypical presentation of the syndrome (eg, someone who exhibits age-related muscle weakness but has retained adequate LBM). Such criteria often prove to have high specificity and low sensitivity, which may yield a false negative rate that is appropriate for clinical research eligibility and group assignment but inadequate for clinical care.

Screening, staging, and classification criteria with high specificity may indeed be desirable for confirmatory imaging tests associated with radiation exposure concerns or for managing risk in experimental clinical trials involving pharmacologic treatment. For example, a SARC-F score ≥ 4 may prompt the formal assessment of LBM via a DXA examination.4 In contrast, those with a SARC-F score ≤ 3 with low gait speed or grip strength may benefit from consultation regarding regular physical activity and nutrition recommendations. Given the challenges of establishing sarcopenia classification criteria that perform consistently across populations and geographic regions, classification and staging criteria may be best viewed as clinical reasoning tools that supplement, but not supplant, the diagnostic process.7,42

Diagnosis

Geriatric syndromes do not lend themselves to a simple diagnostic process. Syndromes such as frailty and sarcopenia are multifactorial and lack a single distinguishing clinical feature or biomarker. The oft-cited refrain that sarcopenia is an underdiagnosed condition is partially explained by the recent ICD-10-CM code and varied classification and diagnostic criteria.5 This circumstance highlights the need to distinctly contrast the diagnostic process with the screening and staging classifications.

The diagnostic process involves the interpretation of the patient history, signs, and symptoms within the context of individual factors, local or regional disease prevalence, and the results of the best available and most appropriate laboratory tests. After all, a patient that presents with low LBM and a gradual loss of strength without a precipitating event would necessitate further workup to rule out many clinical possibilities under the aegis of a differential diagnosis. Clinical features, such as the magnitude of weakness and pattern of strength loss or muscle atrophy along with the determination of neurologic or autoimmune involvement, are among the key elements of the differential examination for a case involving the observation of frank muscle weakness. Older adults with low muscle strength may have additional risk factors for sarcopenia such as obesity, pain, poor nutrition, previous bone fracture, and a sedentary lifestyle. However, disease etiology with lower probabilities, such as myogenic or neurogenic conditions associated with advancing age, also may be under consideration during the clinical assessment.6

In many instances, the cutoff scores associated with the sarcopenia staging criteria may help to guide the diagnostic process and aid clinical decision making. Since individuals with a positive screening result based on the SARC-F questionnaire (score ≥ 4) have a high likelihood of meeting the staging criteria for severe sarcopenia, a PCP may opt to obtain a confirmatory estimate of LBM both to support the clinical assessment and to monitor change over the course of rehabilitation. Whereas people who present with a decline in strength (ie, grip strength < 30 kg for a male) without an observable loss of function or a positive SARC-F score may benefit from consultation from the physician, NP, or rehabilitation health professional regarding modifiable risk factors associated with sarcopenia.

Incorporating less frequently used sarcopenia classification schemes such as identifying those with sarcopenic obesity or secondary sarcopenia due to mitigating factors such as chronic kidney disease or DM (Table 3) may engender a more comprehensive approach to intervention that targets the primary disease while also addressing important secondary sequelae. Nevertheless, staging or classification criteria cannot be deemed equivalent to diagnostic criteria for sarcopenia due to the challenges posed by syndromes that have a heterogeneous clinical presentation.

The refinement of the staging and classification criteria along with the advances in imaging technology and mechanistic research are not unique to sarcopenia. Practitioners involved in the care of people with rheumatologic conditions or osteoporosis also have contended with continued refinements to their classification criteria and approach to risk stratification.39,43,44 Primary care providers will now have the option to use a new ICD-10-CM code (M62.84) for sarcopenia, which will allow them to properly document the clinical distinctions between people with impaired strength or function largely due to age-related muscle changes and those who have impaired muscle function due to cachexia, inflammatory myopathies, or forms of neuromuscular disease.

 

 

The ability to identify and document this geriatric syndrome in veterans will help to better define the scope of the problem within the VA health care system. The median age of veterans is 62 years compared with 43 years for nonveterans.3 Consequently, there may be value in the adoption of a formal approach to screening and diagnosis for sarcopenia among veterans who receive their primary care from VA facilities.7 Indeed, the exchange between the patient and the health professional regarding the screening and diagnostic process will provide valuable opportunities to promote exercise interventions before patients incur significant impairments.

One of the biggest threats burdening global health is noncommunicable diseases, and many chronic conditions, such as sarcopenia, can be prevented and managed with appropriate levels of physical activity.17 Increased physician involvement may prove to be critical given the identification of physical inactivity as a top 5 risk factor for general morbidity and mortality by World Health Organization and consensus group recommendations calling for physicians to serve a more prominent role in the provision of exercise and physical activity recommendations.16,17

This developing health care role should include NPs, PTs, physician assistants, and other associated health professionals. It also should include collaborative efforts between physicians and rehabilitation practitioners concerning provision of the formal exercise prescriptionprescription and monitoring of patient outcomes.

Individuals with severe forms of sarcopenia rarely improve without intervention.6 Although no pharmacologic treatment exists to specifically address sarcopenia, strengthening exercise has been shown to be an effective mode of prevention and conservative management.8 Progressive resistance exercise cannot abate the expected age-related changes in skeletal muscle, but it can significantly reverse the loss of LBM and strength in untrained older adults and slow the age-related decline in muscle performance in older adult athletes and trained individuals.45

Local senior centers and community organizations may prove to be valuable resources concerning group exercise options, and they provide the added benefit of social engagement and peer group accountability. Federal resources include the Go4Life exercise guide and online videos provided by the National Institute on Aging and the MOVE! Weight Management and Health Program provided at select VA community-based outpatient clinics. Ultimately, collaborative efforts with exercise specialists may serve to reduce the PCP burden during the provision of health services, minimize diagnostic errors associated with sarcopenia assessment and help to connect patients to valuable health promotion resources.17,18

Conclusion

While practitioners should remain keenly aware of the pernicious effects of overdiagnosis, sarcopenia has long existed as a known, but undiagnosed, condition. Of course, geriatricians have traditionally managed poor muscle performance and mobility limitations by addressing treatable symptoms and providing referrals to physical medicine specialists when warranted. Nevertheless, the advent of ICD-10-CM code M62.84 provides the VA with an opportunity to take a leading role in systematically addressing this geriatric syndrome within an aging veteran population.

The following items should be considered by NCP for the development of guidelines and recommendations concerning sarcopenia screening:

  1. Consider screening veterans aged > 65 years for sarcopenia every 2 years. Those with mitigating systemic conditions (eg, chronic kidney disease, DM, or malnutrition) or significant mobility limitations may be screened at any age.
  2. Sarcopenia screening procedures should include at a minimum the SARC-F questionnaire and gait speed (when appropriate). Including gait speed or grip strength testing in the screening exam is recommended given the low sensitivity of the SARC-F questionnaire.
  3. Veterans with positive SARC-F results (≥ 4) merit a physical therapy referral. In addition, these veterans should obtain confirmatory standardized assessments for LBM and functional status.
  4. Veterans at risk for sarcopenia based on patient age, medical history, and the physical examination (eg, obesity, sedentary lifestyle, a previous fracture, self-reported physical decline), but with negative SARC-F results should receive a formal exercise prescription from their PCP. Baseline assessment measures may be used for comparison with serial measures obtained during subsequent screening visits to support long-term case management.
  5. Interprofessional collaboration involving geriatricians, PTs, nurses, radiologists, and other health care professionals should be involved in the screening, diagnosis, and case management of veterans with sarcopenia.
  6. The VA EMR should be systematically documented with sarcopenia assessment data obtained from the gait speed tests, SARCF, SPPB, grip strength tests, and LBM estimates to better characterize this condition within the veteran population.

Any expansion in the provision of health care comes with anticipated benefits and potential costs. Broad guidance from NCP may encourage veterans to pursue selected screening tests, promote the appropriate use of preventative services, and facilitate timely treatment when needed.31 Clinicians who are informed about the screening, staging, classification, and diagnostic process for sarcopenia may partner with patients to make reasoned decisions about how to best manage this syndrome within the VA medical center environment.

Sarcopenia is an age-related loss of skeletal muscle that may result in diminished muscle strength and functional performance. The prevalence of sarcopenia varies based on the cohort and the assessment criteria. According to the Health Aging and Body Composition (ABC) study, the prevalence of sarcopenia in community-dwelling older adults is about 14% to 18%, whereas the estimate may exceed 30% for those in longterm care.1,2 This geriatric syndrome may disproportionately affect veterans given that they are older than the civilian population and may have disabling comorbid conditions associated with military service.3

Recently, there has been a call to action to systematically address sarcopenia by interdisciplinary organizations such as the European Society for Clinical and Economic Aspects of Osteoporosis and Osteoarthritis (ESCEO) and the International Working Group on Sarcopenia (IWGS).4,5 This call to action is due to the association of sarcopenia with increased health care costs, higher disability incidence, and elevated risk of mortality.6,7 The consequences of sarcopenia may include serious complications, such as hip fracture or a loss of functional independence.8,9 The CDC now recognizes sarcopenia as an independently reportable medical condition. Consequently, physicians, nurse practitioners (NPs), and other associated health professionals within the VA will need to better understand clinically viable and valid methods to screen and diagnose this geriatric syndrome.

The purpose of this paper is to inform practitioners how sarcopenia screening is aided by the new ICD-10-CM code and briefly review recent VA initiatives for proactive care. Additional objectives include identifying common methods used to assess sarcopenia and providing general recommendations to the VHA National Center for Health Promotion and Disease Prevention (NCP) concerning the management of sarcopenia.

 

Addressing Sarcopenia

While the age-related decline in muscle size and performance has long been recognized by geriatricians, sustained advocacy by several organizations was required to realize the formal recognition of sarcopenia. Aging in Motion (AIM), a coalition of organizations focused on advancing research and treatment for conditions associated with age-related muscle dysfunction, sought the formal recognition of sarcopenia. The CDC established the ICD-10-CM code for sarcopenia in October of 2016, which allowed the syndrome to be designated as a primary or secondary condition.10

The ubiquitous nature of agerelated changes in muscle and the mandate to engage in proactive care by all levels of VA leadership led to the focus on addressing sarcopenia. The recognition of sarcopenia by the CDC comes at an opportune time given recent VA efforts to transform itself from a facilitator mainly of care delivery to an active partner in fostering the health and well-being of veterans. Initiatives that are emblematic of this attempt to shift the organizational culture across the VHA include establishing the VA Center for Innovation (VACI) and issuing guidance documents, such as the Blueprint for Excellence, which was introduced in 2014 by then VA Secretary Robert McDonald.11,12 Many of the following Blueprint themes and strategies potentially impact sarcopenia screening and treatment within the VA:

  • Delivering high-quality, veteran-centered care: A major Blueprint theme is attaining the “Triple Aims” of a health care system by promoting better health among veterans, improving the provision of care, and lowering costs through operational efficiency. The management of sarcopenia has clear clinical value given the association of age-related muscle loss with fall risk and decreased mobility.13 Financial value also may be associated with the effort to decrease disability related to sarcopenia and the use of a team approach featuring associated health professionals to help screen for this geriatric syndrome.14,15 (Strategy 2)
  • Leveraging health care informatics to optimize individual and population health outcomes: The inclusion of the most basic muscle performance and functional status measures in the electronic medical record (EMR), such as grip strength and gait speed, would help to identify the risk factors and determinants of sarcopenia among the veteran population. (Strategy 3)
  • Advancing personalized, proactive health care that fosters a high level of health and well-being: The long-term promotion of musculoskeletal health and optimal management of sarcopenia cannot be sustained through episodic medical interactions. Instead, a contemporary approach to health services marked by the continuous promotion of health education, physical activity and exercise, and proper nutrition has demonstrated value in the management of chronic conditions.16,17 (Strategy 6)

The new sarcopenia ICD-10-CM code along with elements of the VHA Blueprint can serve to support the systematic assessment and management of veterans with age-related muscle dysfunction. Nevertheless, renewed calls for health promotion and screening programs are often counterbalanced by the need for cost containment and the cautionary tales concerning the potential harms or errors associated with some forms of medical screening. The American Board of Internal Medicine Foundation has spearheaded the Choosing Wisely campaign to raise awareness about excessive medical testing. However, the Institute of Medicine has linked the provision of quality health care to a diagnostic process that is both timely and accurate.18 Careful consideration of these health care challenges may help guide practitioners within the VA concerning the screening and diagnosis of sarcopenia.

Sarcopenia Assessment

Sarcopenia can have several underlying causes in some individuals and result in varied patterns of clinical presentation and differing degrees of severity. The European Working Group on Sarcopenia in Older People first met in 2009 and used a consensus-based decision-making process to determine operational definitions for sarcopenia and create a staging algorithm for the syndrome.19 This consensus group developed a conceptual staging model with 3 categories: presarcopenia, sarcopenia, and severe sarcopenia (Table 1). The impetus for sarcopenia staging was the emerging research findings suggesting that lean body mass (LBM) alone did not provide a high degree of clinical value in outpatient settings due to the nonlinear relationship between LBM and muscle function in older adults.20,21 Using the consensus model approach, an individual is classified as sarcopenic on presenting with both low LBM and low muscle function.

Screening: A Place to Start

Findings from the Health ABC Study suggested that older adults who maintained high levels of LBM were less likely to become sarcopenic. Whereas, older adults in the cohort with low levels of LBM tended to remain in a sarcopenic state.6 Consequently, the early detection of sarcopenia may have important health promotion implications for older adults. Sarcopenia is a syndrome with a continuum of clinical features; it is not a disease with a clear or singular etiololgy. Therefore, the result of the screening examination should identify those who would most benefit from a formal diagnostic assessment.

One approach to screening for sarcopenia involves the use of questionnaires, such as the SARC-F (sluggishness, assistance in walking, rise from a chair, climb stairs, falls), which is a brief 5-item questionnaire with Likert scoring for patient responses.13 In a cohort of National Health and Nutrition Examination Survey (NHANES) participants, SARC-F scores ≥ 4 were associated with slower gait speed, lower strength, and an increased likelihood of hospitalization within a year of the test response.22 However, rather than stratify patients by risk, the SARC-F exhibits a high degree of test specificity regarding the major consensus-based sarcopenia classification criteria (specificity = 94.2% to 99.1%; sensitivity = 3.8% to 9.9%).13 Given the known limitations of screening tools with low sensitivity, organizations such as the ESCEO have recommended supplementing the SARC-F questionnaire with other forms of assessment.4 Supplements to the screening examination may range from the use of “red flag” questions concerning changes in nutritional status, body weight, and physical activity, to conducting standard gait speed and grip-strength testing.4,19,23

Performance-based testing, including habitual gait speed and grip-strength dynamometry, also may be used in both the screening and classification of sarcopenia.2 Although walking speed below 1.0 m/s has been used by the IWGS as a criterion to prompt further assessment, many people within the VA health care system may have gait abnormalities independent of LBM status, and others may be nonambulatory.24,25 As a result, grip-strength testing should be considered as a supplementary or alternate screening assessment tool.26,27

Hand-grip dynamometry is often used diagnostically given its previous test validation, low expense, and ease of use.23 Moreover, recent evidence suggests that muscle strength surpasses gait speed as a means of identifying people with sarcopenia. Grip strength is associated with all-cause mortality, even when adjusting for age, sex, and body size,28 while slow gait speed (< .82 m/s) has a reported sensitivity of 63% and specificity of 70% for mortality in population-based studies involving older adults.29

 

 

Gait speed (in those who are ambulatory) and grip-strength values could be entered into the EMR evaluation note by the primary care provider (PCP). Elements of the VA EMR, such as the ability to review the diagnosis of sarcopenia on the Problem List or the nominal enhancement of providing LBM estimates within the Cumulative Vitals and Measurements Report would support the management of sarcopenia. See Table 2 for cutoff values for frequently used sarcopenia screening and staging tests.

The pitfalls of excessive or inappropriate screening are well documented. The efforts to screen for prostate cancer have highlighted instances when inappropriate followup tests and treatment fail to alter mortality rates and ultimately yield more harm than good.30 However, there are several points of departure concerning the screening for sarcopenia vs screening for prostate cancer. The screening assessments for sarcopenia are low-cost procedures that are associated with a low patient burden. These procedures may include questionnaires, functional testing, or the assessment of muscle performance. Additionally, there is a low propensity for adverse effects stemming from treatment due to disease misclassification given the common nonpharmacologic approaches used to manage sarcopenia.31 Nonetheless, the best screening examination—even one that has low patient burden and cost—may prove to be a poor use of medical resources if the process is not linked to a viable intervention.

Screening people aged ≥ 65 years may strike a balance between controlling health care expenditures and identifying people with the initial signs of sarcopenia early enough to begin monitoring key outcomes and providing a formalized exercise prescription. Presuming an annual age-related decline in LBM of 1.5%, and considering the standard error measurement of the most frequently used methods of strength and LBM assessment, recurrent screening could occur every 2 years.21,32

Earlier screening may be considered for patient populations with a higher pretest probability. These groups include those with conditions associated with accelerated muscle loss, such as chronic kidney disease, peripheral artery disease, and diabetes mellitus (DM).32 Although accelerated muscle loss characterized by an inflammatory motif (eg, cancer-related cachexia) may share some features of the sarcopenia screening and assessment approach, important differences exist regarding the etiology, medical evaluation, and ICD-10-CM code designation.

Staging and Classification

Staging criteria are generally used to denote the severity of a given disease or syndrome, whereas classification criteria are used to define homogenous patient groups based on specific pathologic or clinical features of a disorder. Although classification schemes may incorporate an element of severity, they are primarily used to characterize fairly distinct phenotypic forms of disease or specific clinical presentation patterns associated with a well-defined syndrome. Although not universally adopted, the European consensus group sarcopenia staging criteria are increasingly used to provide a staging algorithm presumably driven by the severity of the condition.19

The assessment of functional performance for use in sarcopenia staging often involves measuring habitual gait speed or completing the Short Physical Performance Battery (SPPB).23 The SPPB involves a variety of performance-based activities for balance, gait, strength, and endurance. This test has predictive validity for the onset of disability and adverse health events, and it has been extensively used in research and clinical settings.33 Additional tests used to characterize function during the staging or diagnostic process include the timed get up and go test (TGUG) and the timed sit to stand test.34,35 The TGUG provides an estimate of dynamic balance, and the sit to stand test has been used as very basic proxy measure of muscular power.36 The sit to stand test and habitual gait speed are items included in the SPPB.33

Accepted methods to obtain the traditional index measure of sarcopenia—based on estimates of LBM—include bioimpedance analysis (BIA) and dual X-ray absorptiometry (DXA). The BIA uses the electrical impedance of body tissues and its 2 components, resistance and reactance, to derive its body composition estimates.37 Segmental BIA allows for isolated measurements of the limbs, which may be calibrated to DXA appendicular lean body mass (ALM) or magnetic resonance imaging-based estimates of LBM. This instrument is relatively safe for use, inexpensive for medical facilities, and useful for longitudinal studies, but it can be confounded by issues, such as varying levels of hydration, which may affect measurement validity in some instances.

Despite the precision of DXA for estimating densities for whole body composition analysis, the equipment is not very portable and involves low levels of radiation exposure, which limits its utility in some clinical settings. While each body composition assessment method has its advantages and disadvantages, DXA is regarded as an acceptable form of measurement for hospital settings, and BIA is frequently used in outpatient clinics and community settings. Other methods used to estimate LBM with greater accuracy, such as peripheral quantitative computed tomography, doubly labeled water, and whole body gamma ray counting, are not viable for clinical use. Other accessible methods such as anthropometric measures and skinfold measures have not been embraced by sarcopenia classification consensus groups.23,37

Alternative methods of estimating LBM, such as diagnostic ultrasound and multifrequency electrical impedance myography, are featured outcomes in ongoing clinical trials that involve veteran participants. These modalities may soon provide a clinically viable approach to assessing muscle quality via estimates of muscle tissue composition.37,38 Similar to the management of other geriatric syndromes, interprofessional collaboration provides an optimal approach to the assessment of sarcopenia. Physicians and other health care providers may draw on the standardized assessment of strength and function (via the SPPB and hand-grip dynamometry) by physical therapists (PTs), questionnaires administered by nursing staff (the SARC-F), or body composition estimates from other health professionals (ranging from BIA to DXA) to aid the diagnostic process and facilitate appropriate case management (Table 2).

Competing staging and classification definitions have been cited as a primary factor behind the CDC’s delayed recognition of the sarcopenia diagnosis, which in turn posed a barrier to formal clinical recognition by geriatricians.24 However, this reaction to the evolving sarcopenia staging criteria also may reveal the larger misapplication of the staging process to the diagnostic process. The application of classification and staging criteria results in a homogenous group of patients, whereas the application of diagnostic criteria results in a heterogeneous group of patients to account for variations in clinical presentation associated with a given disorder. Classification criteria may be equivalent to objective measures that are used in the diagnostic process when a given disease is characterized by a well-established biomarker.39

However, this is not the case for most geriatric syndromes and other disorders marked by varied clinical presentation patterns. On considering the commonly used sarcopenia staging criteria of LBM ≤ 8.50 kg/m2 or grip strength < 30 kg in men and LBM ≤ 5.75 kg/m2 or grip strength < 20 kg in women, it is easy to understand that such general cutoff values are far from diagnostic.40,41 Moreover, stringent cutoff values associated with classification and staging may not adequately capture those with an atypical presentation of the syndrome (eg, someone who exhibits age-related muscle weakness but has retained adequate LBM). Such criteria often prove to have high specificity and low sensitivity, which may yield a false negative rate that is appropriate for clinical research eligibility and group assignment but inadequate for clinical care.

Screening, staging, and classification criteria with high specificity may indeed be desirable for confirmatory imaging tests associated with radiation exposure concerns or for managing risk in experimental clinical trials involving pharmacologic treatment. For example, a SARC-F score ≥ 4 may prompt the formal assessment of LBM via a DXA examination.4 In contrast, those with a SARC-F score ≤ 3 with low gait speed or grip strength may benefit from consultation regarding regular physical activity and nutrition recommendations. Given the challenges of establishing sarcopenia classification criteria that perform consistently across populations and geographic regions, classification and staging criteria may be best viewed as clinical reasoning tools that supplement, but not supplant, the diagnostic process.7,42

Diagnosis

Geriatric syndromes do not lend themselves to a simple diagnostic process. Syndromes such as frailty and sarcopenia are multifactorial and lack a single distinguishing clinical feature or biomarker. The oft-cited refrain that sarcopenia is an underdiagnosed condition is partially explained by the recent ICD-10-CM code and varied classification and diagnostic criteria.5 This circumstance highlights the need to distinctly contrast the diagnostic process with the screening and staging classifications.

The diagnostic process involves the interpretation of the patient history, signs, and symptoms within the context of individual factors, local or regional disease prevalence, and the results of the best available and most appropriate laboratory tests. After all, a patient that presents with low LBM and a gradual loss of strength without a precipitating event would necessitate further workup to rule out many clinical possibilities under the aegis of a differential diagnosis. Clinical features, such as the magnitude of weakness and pattern of strength loss or muscle atrophy along with the determination of neurologic or autoimmune involvement, are among the key elements of the differential examination for a case involving the observation of frank muscle weakness. Older adults with low muscle strength may have additional risk factors for sarcopenia such as obesity, pain, poor nutrition, previous bone fracture, and a sedentary lifestyle. However, disease etiology with lower probabilities, such as myogenic or neurogenic conditions associated with advancing age, also may be under consideration during the clinical assessment.6

In many instances, the cutoff scores associated with the sarcopenia staging criteria may help to guide the diagnostic process and aid clinical decision making. Since individuals with a positive screening result based on the SARC-F questionnaire (score ≥ 4) have a high likelihood of meeting the staging criteria for severe sarcopenia, a PCP may opt to obtain a confirmatory estimate of LBM both to support the clinical assessment and to monitor change over the course of rehabilitation. Whereas people who present with a decline in strength (ie, grip strength < 30 kg for a male) without an observable loss of function or a positive SARC-F score may benefit from consultation from the physician, NP, or rehabilitation health professional regarding modifiable risk factors associated with sarcopenia.

Incorporating less frequently used sarcopenia classification schemes such as identifying those with sarcopenic obesity or secondary sarcopenia due to mitigating factors such as chronic kidney disease or DM (Table 3) may engender a more comprehensive approach to intervention that targets the primary disease while also addressing important secondary sequelae. Nevertheless, staging or classification criteria cannot be deemed equivalent to diagnostic criteria for sarcopenia due to the challenges posed by syndromes that have a heterogeneous clinical presentation.

The refinement of the staging and classification criteria along with the advances in imaging technology and mechanistic research are not unique to sarcopenia. Practitioners involved in the care of people with rheumatologic conditions or osteoporosis also have contended with continued refinements to their classification criteria and approach to risk stratification.39,43,44 Primary care providers will now have the option to use a new ICD-10-CM code (M62.84) for sarcopenia, which will allow them to properly document the clinical distinctions between people with impaired strength or function largely due to age-related muscle changes and those who have impaired muscle function due to cachexia, inflammatory myopathies, or forms of neuromuscular disease.

 

 

The ability to identify and document this geriatric syndrome in veterans will help to better define the scope of the problem within the VA health care system. The median age of veterans is 62 years compared with 43 years for nonveterans.3 Consequently, there may be value in the adoption of a formal approach to screening and diagnosis for sarcopenia among veterans who receive their primary care from VA facilities.7 Indeed, the exchange between the patient and the health professional regarding the screening and diagnostic process will provide valuable opportunities to promote exercise interventions before patients incur significant impairments.

One of the biggest threats burdening global health is noncommunicable diseases, and many chronic conditions, such as sarcopenia, can be prevented and managed with appropriate levels of physical activity.17 Increased physician involvement may prove to be critical given the identification of physical inactivity as a top 5 risk factor for general morbidity and mortality by World Health Organization and consensus group recommendations calling for physicians to serve a more prominent role in the provision of exercise and physical activity recommendations.16,17

This developing health care role should include NPs, PTs, physician assistants, and other associated health professionals. It also should include collaborative efforts between physicians and rehabilitation practitioners concerning provision of the formal exercise prescriptionprescription and monitoring of patient outcomes.

Individuals with severe forms of sarcopenia rarely improve without intervention.6 Although no pharmacologic treatment exists to specifically address sarcopenia, strengthening exercise has been shown to be an effective mode of prevention and conservative management.8 Progressive resistance exercise cannot abate the expected age-related changes in skeletal muscle, but it can significantly reverse the loss of LBM and strength in untrained older adults and slow the age-related decline in muscle performance in older adult athletes and trained individuals.45

Local senior centers and community organizations may prove to be valuable resources concerning group exercise options, and they provide the added benefit of social engagement and peer group accountability. Federal resources include the Go4Life exercise guide and online videos provided by the National Institute on Aging and the MOVE! Weight Management and Health Program provided at select VA community-based outpatient clinics. Ultimately, collaborative efforts with exercise specialists may serve to reduce the PCP burden during the provision of health services, minimize diagnostic errors associated with sarcopenia assessment and help to connect patients to valuable health promotion resources.17,18

Conclusion

While practitioners should remain keenly aware of the pernicious effects of overdiagnosis, sarcopenia has long existed as a known, but undiagnosed, condition. Of course, geriatricians have traditionally managed poor muscle performance and mobility limitations by addressing treatable symptoms and providing referrals to physical medicine specialists when warranted. Nevertheless, the advent of ICD-10-CM code M62.84 provides the VA with an opportunity to take a leading role in systematically addressing this geriatric syndrome within an aging veteran population.

The following items should be considered by NCP for the development of guidelines and recommendations concerning sarcopenia screening:

  1. Consider screening veterans aged > 65 years for sarcopenia every 2 years. Those with mitigating systemic conditions (eg, chronic kidney disease, DM, or malnutrition) or significant mobility limitations may be screened at any age.
  2. Sarcopenia screening procedures should include at a minimum the SARC-F questionnaire and gait speed (when appropriate). Including gait speed or grip strength testing in the screening exam is recommended given the low sensitivity of the SARC-F questionnaire.
  3. Veterans with positive SARC-F results (≥ 4) merit a physical therapy referral. In addition, these veterans should obtain confirmatory standardized assessments for LBM and functional status.
  4. Veterans at risk for sarcopenia based on patient age, medical history, and the physical examination (eg, obesity, sedentary lifestyle, a previous fracture, self-reported physical decline), but with negative SARC-F results should receive a formal exercise prescription from their PCP. Baseline assessment measures may be used for comparison with serial measures obtained during subsequent screening visits to support long-term case management.
  5. Interprofessional collaboration involving geriatricians, PTs, nurses, radiologists, and other health care professionals should be involved in the screening, diagnosis, and case management of veterans with sarcopenia.
  6. The VA EMR should be systematically documented with sarcopenia assessment data obtained from the gait speed tests, SARCF, SPPB, grip strength tests, and LBM estimates to better characterize this condition within the veteran population.

Any expansion in the provision of health care comes with anticipated benefits and potential costs. Broad guidance from NCP may encourage veterans to pursue selected screening tests, promote the appropriate use of preventative services, and facilitate timely treatment when needed.31 Clinicians who are informed about the screening, staging, classification, and diagnostic process for sarcopenia may partner with patients to make reasoned decisions about how to best manage this syndrome within the VA medical center environment.

References

1. Newman AB, Kupelian V, Visser M, et al; Health ABC Study Investigators. Sarcopenia: alternative definitions and associations with lower extremity function. J Am Geriatr Soc. 2003;51(11):1602-1609.

2. Cruz-Jentoft AJ, Landi F, Schneider SM, et al. Prevalence of and interventions for sarcopenia in ageing adults: a systematic review. Report of the International Sarcopenia Initiative (EWGSOP and IWGS). Age Ageing. 2014;43(6):748-759.

3. U.S. Department of Veterans Affairs, National Center for Veterans Analysis and Statistics. Profile of veterans: 2009. Data from the American Community Survey. http://www.va.gov/vetdata/docs/SpecialReports/Profile_of_Veterans_2009_FINAL.pdf. Published January 2011. Accessed May 18, 2017.

4. Beaudart C, McCloskey E, Bruyère O, et al. Sarcopenia in daily practice: assessment and management. BMC Geriatr. 2016;16(1):170.

5. Fielding RA, Vellas B, Evans WJ, et al. Sarcopenia: an undiagnosed condition in older adults. Current consensus definition: prevalence, etiology, and consequences. International working group on sarcopenia. J Am Med Dir Assoc. 2011;12(4):249-256.

6. Murphy RA, Ip EH, Zhang Q, et al; Health, Aging, and Body Composition Study. Transition to sarcopenia and determinants of transitions in older adults: a population-based study. J Gerontol A Biol Sci Med Sci. 2014;69(6):751-758.

7. Harris-Love MO, Adams B, Hernandez HJ, DiPietro L, Blackman MR. Disparities in the consequences of sarcopenia: implications for African American veterans. Front Physiol. 2014;5:250.

8. Morley JE. Sarcopenia in the elderly. Fam Pract. 2012;29(suppl 1):i44-i48.

9. Fragala MS, Dam TT, Barber V, et al. Strength and function response to clinical interventions of older women categorized by weakness and low lean mass using classifications from the Foundation for the National Institute of Health sarcopenia project. J Gerontol A Biol Sci Med Sci. 2015;70(2):202-209.

10. Aging in Motion. AIM coalition announces establishment of ICD-10-CM Code for Sarcopenia
by the Centers for Disease Control and Prevention [press release]. http://aginginmotion.org/news/2388-2/. Published April 28, 2016. Accessed June 7, 2017.

11. U.S. Department of Veterans Affairs, Veterans Health Administration. Blueprint for excellence. https://www.va.gov/HEALTH/docs/VHA _Blueprint_for_Excellence.pdf. Published September 21, 2014. Accessed June 7, 2017.

12. U.S. Department of Veterans Affairs. VA Center of Innovation 2010–2012 stakeholder report. https://www.innovation.va.gov/docs/VACI_2010-2012_Stakeholder_Report.pdf. Published 2012. Accessed June 14, 2017.

13. Woo J, Leung J, Morley JE. Validating the SARCF: a suitable community screening tool for sarcopenia? J Am Med Dir Assoc. 2014;15(9):630-634.

14. Sousa AS, Guerra RS, Fonseca I, Pichel F, Ferreira S, Amaral TF. Financial impact of sarcopenia on hospitalization costs. Eur J Clin Nutr. 2016;70(9):1046-1051.

15. Janssen I, Shepard DS, Katzmarzyk PT, Roubenoff R. The healthcare costs of sarcopenia in the United States. J Am Geriatr Soc. 2004;52(1):80-85.

16. Ekelund U, Steene-Johannessen J, Brown WJ, et al; Lancet Physical Activity Series 2 Executive Committe; Lancet Sedentary Behaviour Working Group. Does physical activity attenuate, or even eliminate, the detrimental association of sitting time with mortality? A harmonised meta-analysis of data from more than 1 million men and women. Lancet. 2016;388(10051):1302-1310.

17. Thornton JS, Frémont P, Khan K, et al. Physical activity prescription: a critical opportunity to address a modifiable risk factor for the prevention and management of chronic disease: a position statement by the Canadian Academy of Sport and Exercise Medicine. Clin J Sport Med.
2016;26(4):259-265.

18. The National Academies of Sciences, Engineering, and Medicine; Committee on Diagnostic Error in Health Care, Board on Health Care Services; Institute of Medicine. Improving Diagnosis in Health Care. Washington, DC: National Academies Press;2015.

19. Cruz-Jentoft AJ, Baeyens JP, Bauer JM, et al; European Working Group on Sarcopenia in Older People. Sarcopenia: European consensus on definition and diagnosis: report of the European Working Group on Sarcopenia in Older People. Age Ageing. 2010;39(4):412-423.

20. Ferrucci L, Guralnik JM, Buchner D, et al. Departures from linearity in the relationship between measures of muscular strength and physical performance of the lower extremities: the Women’s Health and Aging Study. J Gerontol A Biol Sci Med Sci. 1997;52(5):M275-M285.

21. Goodpaster BH, Park SW, Harris TB, et al. The loss of skeletal muscle strength, mass, and quality in older adults: the Health, Aging and Body Composition Study. J Gerontol A Biol Sci Med Sci. 2006;61(10):1059-1064.

22. Malmstrom TK, Miller DK, Simonsick EM, Ferrucci L, Morley JE. SARC-F: a symptom score to predict persons with sarcopenia at risk for poor functional outcomes. J Cachexia Sarcopenia Muscle. 2016;7(1):28-36.

23. Cooper C, Fielding R, Visser M, et al. Tools in the assessment of sarcopenia. Calcif Tissue Int. 2013;93(3):201-210.

24. Lee WJ, Liu LK, Peng LN, Lin MH, Chen LK; ILAS Research Group. Comparisons of sarcopenia defined by IWGS and EWGSOP criteria among older people: results from the I-Lan longitudinal aging study. J Am Med Dir Assoc. 2013;14(7):528.e1-e7.

25. Cesari M, Kritchevsky SB, Penninx BW, et al. Prognostic value of usual gait speed in well-functioning  older people—results from the Health, Aging and Body Composition Study. J Am Geriatr Soc. 2005;53(10):1675-1680.

26. Rossi AP, Fantin F, Micciolo R, et al. Identifying sarcopenia in acute care setting patients. J Am Med Dir Assoc. 2014;15(4):303.e7-e12.

27. Sánchez-Rodríguez D, Marco E, Miralles R, et al. Does gait speed contribute to sarcopenia casefinding in a postacute rehabilitation setting? Arch Gerontol Geriatr. 2015;61(2):176-181.

28. Strand BH, Cooper R, Bergland A, et al. The association of grip strength from midlife onwards with all-cause and cause-specific mortality over 17 years of follow-up in the Tromsø Study. J Epidemiol Community Health. 2016;70:1214-1221.

29. Stanaway FF, Gnjidic D, Blyth FM, et al. How fast does the Grim Reaper walk? Receiver operating characteristics curve analysis in healthy men aged 70 and over. BMJ. 2011;343:d7679.

30. Reiter RE. Risk stratification of prostate cancer 2016. Scand J Clin Lab Invest Suppl.  2016;245:S54-S59.

31. U.S. Department of Veterans Affairs, National Center for Health Promotion and Disease Prevention. Get recommended screening tests and immunizations. https://www.prevention.va.gov/Healthy_Living/Get_Recommended_Screening_Tests_and_Immunizations.asp. Updated September 9, 2016. Accessed June 7, 2017.

32. Buford TW, Anton SD, Judge AR, et al. Models of accelerated sarcopenia: critical pieces for solving the puzzle of age-related muscle atrophy. Ageing Res Rev. 2010;9(4):369-383.

33. Guralnik JM, Simonsick EM, Ferrucci L, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol. 1994;49(2):M85-M94.

34. Daubney ME, Culham EG. Lower-extremity muscle force and balance performance in adults aged 65 years and older. Phys Ther. 1999;79(12):1177-1185.

35. Bohannon RW. Reference values for the fiverepetition sit-to-stand test: a descriptive metaanalysis of data from elders. Percept Mot Skills. 2006;103(1):215-222.

36. Correa-de-Araujo R, Harris-Love MO, Miljkovic I, Fragala MS, Anthony BW, Manini TM. The need for standardized assessment of muscle quality in skeletal muscle function deficit and other agingrelated muscle dysfunctions: a symposium report. Front Physiol. 2017;8:87.

37. Heymsfield SB, Gonzalez MC, Lu J, Jia G, Zheng J. Skeletal muscle mass and quality: evolution of modern measurement concepts in the context of sarcopenia. Proc Nutr Soc. 2015;74(4):355-366.

38. Harris-Love MO, Monfaredi R, Ismail C, Blackman MR, Cleary K. Quantitative ultrasound: measurement considerations for the assessment of muscular dystrophy and sarcopenia. Front Aging Neurosci. 2014;6:172.

39. Fries JF, Hochberg MC, Medsger TA Jr, Hunder GG, Bombardier C. Criteria for rheumatic disease. Different types and different functions. The American College of Rheumatology Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum. 1994;37(4):454-462.

40. Janssen I, Baumgartner RN, Ross R, Rosenberg IH, Roubenoff R. Skeletal muscle cutpoints associated with elevated physical disability risk in older men and women. Am J Epidemiol.
2004;159(4):413-421.

41. Ismail C, Zabal J, Hernandez HJ, et al. Diagnostic ultrasound estimates of muscle mass and muscle quality discriminate between women with and without sarcopenia. Front Physiol. 2015;6:302.

42. Chen LK, Liu LK, Woo J, et al. Sarcopenia in Asia: consensus report of the Asian Working Group for Sarcopenia. J Am Med Dir Assoc. 2014;15(2):95-101.

43. Aggarwal R, Ringold S, Khanna D, et al. Distinctions between diagnostic and classification  criteria? Arthritis Care Res (Hoboken). 2015;67(7):891-897.

44. Licata A. Bone density vs bone quality: what’s a clinician to do? Cleve Clin J Med. 2009;76(6):331-336.

45. Pollock ML, Mengelkoch LJ, Graves JE, et al. Twenty-year follow-up of aerobic power and body composition of older track athletes. J Appl Physiol. 1997;82(5):1508-1516.

References

1. Newman AB, Kupelian V, Visser M, et al; Health ABC Study Investigators. Sarcopenia: alternative definitions and associations with lower extremity function. J Am Geriatr Soc. 2003;51(11):1602-1609.

2. Cruz-Jentoft AJ, Landi F, Schneider SM, et al. Prevalence of and interventions for sarcopenia in ageing adults: a systematic review. Report of the International Sarcopenia Initiative (EWGSOP and IWGS). Age Ageing. 2014;43(6):748-759.

3. U.S. Department of Veterans Affairs, National Center for Veterans Analysis and Statistics. Profile of veterans: 2009. Data from the American Community Survey. http://www.va.gov/vetdata/docs/SpecialReports/Profile_of_Veterans_2009_FINAL.pdf. Published January 2011. Accessed May 18, 2017.

4. Beaudart C, McCloskey E, Bruyère O, et al. Sarcopenia in daily practice: assessment and management. BMC Geriatr. 2016;16(1):170.

5. Fielding RA, Vellas B, Evans WJ, et al. Sarcopenia: an undiagnosed condition in older adults. Current consensus definition: prevalence, etiology, and consequences. International working group on sarcopenia. J Am Med Dir Assoc. 2011;12(4):249-256.

6. Murphy RA, Ip EH, Zhang Q, et al; Health, Aging, and Body Composition Study. Transition to sarcopenia and determinants of transitions in older adults: a population-based study. J Gerontol A Biol Sci Med Sci. 2014;69(6):751-758.

7. Harris-Love MO, Adams B, Hernandez HJ, DiPietro L, Blackman MR. Disparities in the consequences of sarcopenia: implications for African American veterans. Front Physiol. 2014;5:250.

8. Morley JE. Sarcopenia in the elderly. Fam Pract. 2012;29(suppl 1):i44-i48.

9. Fragala MS, Dam TT, Barber V, et al. Strength and function response to clinical interventions of older women categorized by weakness and low lean mass using classifications from the Foundation for the National Institute of Health sarcopenia project. J Gerontol A Biol Sci Med Sci. 2015;70(2):202-209.

10. Aging in Motion. AIM coalition announces establishment of ICD-10-CM Code for Sarcopenia
by the Centers for Disease Control and Prevention [press release]. http://aginginmotion.org/news/2388-2/. Published April 28, 2016. Accessed June 7, 2017.

11. U.S. Department of Veterans Affairs, Veterans Health Administration. Blueprint for excellence. https://www.va.gov/HEALTH/docs/VHA _Blueprint_for_Excellence.pdf. Published September 21, 2014. Accessed June 7, 2017.

12. U.S. Department of Veterans Affairs. VA Center of Innovation 2010–2012 stakeholder report. https://www.innovation.va.gov/docs/VACI_2010-2012_Stakeholder_Report.pdf. Published 2012. Accessed June 14, 2017.

13. Woo J, Leung J, Morley JE. Validating the SARCF: a suitable community screening tool for sarcopenia? J Am Med Dir Assoc. 2014;15(9):630-634.

14. Sousa AS, Guerra RS, Fonseca I, Pichel F, Ferreira S, Amaral TF. Financial impact of sarcopenia on hospitalization costs. Eur J Clin Nutr. 2016;70(9):1046-1051.

15. Janssen I, Shepard DS, Katzmarzyk PT, Roubenoff R. The healthcare costs of sarcopenia in the United States. J Am Geriatr Soc. 2004;52(1):80-85.

16. Ekelund U, Steene-Johannessen J, Brown WJ, et al; Lancet Physical Activity Series 2 Executive Committe; Lancet Sedentary Behaviour Working Group. Does physical activity attenuate, or even eliminate, the detrimental association of sitting time with mortality? A harmonised meta-analysis of data from more than 1 million men and women. Lancet. 2016;388(10051):1302-1310.

17. Thornton JS, Frémont P, Khan K, et al. Physical activity prescription: a critical opportunity to address a modifiable risk factor for the prevention and management of chronic disease: a position statement by the Canadian Academy of Sport and Exercise Medicine. Clin J Sport Med.
2016;26(4):259-265.

18. The National Academies of Sciences, Engineering, and Medicine; Committee on Diagnostic Error in Health Care, Board on Health Care Services; Institute of Medicine. Improving Diagnosis in Health Care. Washington, DC: National Academies Press;2015.

19. Cruz-Jentoft AJ, Baeyens JP, Bauer JM, et al; European Working Group on Sarcopenia in Older People. Sarcopenia: European consensus on definition and diagnosis: report of the European Working Group on Sarcopenia in Older People. Age Ageing. 2010;39(4):412-423.

20. Ferrucci L, Guralnik JM, Buchner D, et al. Departures from linearity in the relationship between measures of muscular strength and physical performance of the lower extremities: the Women’s Health and Aging Study. J Gerontol A Biol Sci Med Sci. 1997;52(5):M275-M285.

21. Goodpaster BH, Park SW, Harris TB, et al. The loss of skeletal muscle strength, mass, and quality in older adults: the Health, Aging and Body Composition Study. J Gerontol A Biol Sci Med Sci. 2006;61(10):1059-1064.

22. Malmstrom TK, Miller DK, Simonsick EM, Ferrucci L, Morley JE. SARC-F: a symptom score to predict persons with sarcopenia at risk for poor functional outcomes. J Cachexia Sarcopenia Muscle. 2016;7(1):28-36.

23. Cooper C, Fielding R, Visser M, et al. Tools in the assessment of sarcopenia. Calcif Tissue Int. 2013;93(3):201-210.

24. Lee WJ, Liu LK, Peng LN, Lin MH, Chen LK; ILAS Research Group. Comparisons of sarcopenia defined by IWGS and EWGSOP criteria among older people: results from the I-Lan longitudinal aging study. J Am Med Dir Assoc. 2013;14(7):528.e1-e7.

25. Cesari M, Kritchevsky SB, Penninx BW, et al. Prognostic value of usual gait speed in well-functioning  older people—results from the Health, Aging and Body Composition Study. J Am Geriatr Soc. 2005;53(10):1675-1680.

26. Rossi AP, Fantin F, Micciolo R, et al. Identifying sarcopenia in acute care setting patients. J Am Med Dir Assoc. 2014;15(4):303.e7-e12.

27. Sánchez-Rodríguez D, Marco E, Miralles R, et al. Does gait speed contribute to sarcopenia casefinding in a postacute rehabilitation setting? Arch Gerontol Geriatr. 2015;61(2):176-181.

28. Strand BH, Cooper R, Bergland A, et al. The association of grip strength from midlife onwards with all-cause and cause-specific mortality over 17 years of follow-up in the Tromsø Study. J Epidemiol Community Health. 2016;70:1214-1221.

29. Stanaway FF, Gnjidic D, Blyth FM, et al. How fast does the Grim Reaper walk? Receiver operating characteristics curve analysis in healthy men aged 70 and over. BMJ. 2011;343:d7679.

30. Reiter RE. Risk stratification of prostate cancer 2016. Scand J Clin Lab Invest Suppl.  2016;245:S54-S59.

31. U.S. Department of Veterans Affairs, National Center for Health Promotion and Disease Prevention. Get recommended screening tests and immunizations. https://www.prevention.va.gov/Healthy_Living/Get_Recommended_Screening_Tests_and_Immunizations.asp. Updated September 9, 2016. Accessed June 7, 2017.

32. Buford TW, Anton SD, Judge AR, et al. Models of accelerated sarcopenia: critical pieces for solving the puzzle of age-related muscle atrophy. Ageing Res Rev. 2010;9(4):369-383.

33. Guralnik JM, Simonsick EM, Ferrucci L, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol. 1994;49(2):M85-M94.

34. Daubney ME, Culham EG. Lower-extremity muscle force and balance performance in adults aged 65 years and older. Phys Ther. 1999;79(12):1177-1185.

35. Bohannon RW. Reference values for the fiverepetition sit-to-stand test: a descriptive metaanalysis of data from elders. Percept Mot Skills. 2006;103(1):215-222.

36. Correa-de-Araujo R, Harris-Love MO, Miljkovic I, Fragala MS, Anthony BW, Manini TM. The need for standardized assessment of muscle quality in skeletal muscle function deficit and other agingrelated muscle dysfunctions: a symposium report. Front Physiol. 2017;8:87.

37. Heymsfield SB, Gonzalez MC, Lu J, Jia G, Zheng J. Skeletal muscle mass and quality: evolution of modern measurement concepts in the context of sarcopenia. Proc Nutr Soc. 2015;74(4):355-366.

38. Harris-Love MO, Monfaredi R, Ismail C, Blackman MR, Cleary K. Quantitative ultrasound: measurement considerations for the assessment of muscular dystrophy and sarcopenia. Front Aging Neurosci. 2014;6:172.

39. Fries JF, Hochberg MC, Medsger TA Jr, Hunder GG, Bombardier C. Criteria for rheumatic disease. Different types and different functions. The American College of Rheumatology Diagnostic and Therapeutic Criteria Committee. Arthritis Rheum. 1994;37(4):454-462.

40. Janssen I, Baumgartner RN, Ross R, Rosenberg IH, Roubenoff R. Skeletal muscle cutpoints associated with elevated physical disability risk in older men and women. Am J Epidemiol.
2004;159(4):413-421.

41. Ismail C, Zabal J, Hernandez HJ, et al. Diagnostic ultrasound estimates of muscle mass and muscle quality discriminate between women with and without sarcopenia. Front Physiol. 2015;6:302.

42. Chen LK, Liu LK, Woo J, et al. Sarcopenia in Asia: consensus report of the Asian Working Group for Sarcopenia. J Am Med Dir Assoc. 2014;15(2):95-101.

43. Aggarwal R, Ringold S, Khanna D, et al. Distinctions between diagnostic and classification  criteria? Arthritis Care Res (Hoboken). 2015;67(7):891-897.

44. Licata A. Bone density vs bone quality: what’s a clinician to do? Cleve Clin J Med. 2009;76(6):331-336.

45. Pollock ML, Mengelkoch LJ, Graves JE, et al. Twenty-year follow-up of aerobic power and body composition of older track athletes. J Appl Physiol. 1997;82(5):1508-1516.

Issue
Federal Practitioner - 34(7)
Issue
Federal Practitioner - 34(7)
Page Number
24-32
Page Number
24-32
Publications
Publications
Topics
Article Type
Sections
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Use ProPublica
Article PDF Media

Lack of Significant Anti-inflammatory Activity With Clindamycin in the Treatment of Rosacea: Results of 2 Randomized, Vehicle-Controlled Trials

Article Type
Changed
Display Headline
Lack of Significant Anti-inflammatory Activity With Clindamycin in the Treatment of Rosacea: Results of 2 Randomized, Vehicle-Controlled Trials

Rosacea is a chronic inflammatory skin disease characterized by central facial erythema with or without intermittent papules and pustules (described as the inflammatory lesions of rosacea). Although twice-daily clindamycin 1% solution or gel has been used in the treatment of acne, few studies have investigated the use of clindamycin in rosacea.1,2 In one study comparing twice-daily clindamycin lotion 1% with oral tetracycline in 43 rosacea patients, clindamycin was found to be superior in the eradication of pustules.3 A combination therapy of clindamycin 1% and benzoyl peroxide 5% was found to be more effective than the vehicle in inflammatory lesions and erythema of rosacea in a 12-week randomized controlled trial; however, a definitive advantage over US Food and Drug Administration-approved topical agents used to treat papulopustular rosacea was not established.4,5 Two further studies evaluated clindamycin phosphate 1.2%-tretinoin 0.025% combination gel in the treatment of rosacea, but only 1 showed any effect on papulopustular lesions.6-8 The objective of the studies reported here was to evaluate the efficacy and safety of clindamycin in the treatment of patients with moderate to severe rosacea.

Methods

Study Design

Two multicenter (study A, 20 centers; study B, 10 centers), randomized, investigator-blinded, vehicle-controlled studies were conducted in the United States between 1999 and 2002 in accordance with the Declaration of Helsinki, International Conference on Harmonisation Good Clinical Practice guidelines, and local regulatory requirements. The studies were reviewed and approved by the respective institutional review boards, and all participants provided written informed consent.

In study A, moderate to severe rosacea patients with erythema, telangiectasia, and at least 8 inflammatory lesions were randomized to receive clindamycin cream 1% or vehicle cream once (in the evening) or twice daily (in the morning and evening) or clindamycin cream 0.3% once daily (in the evening) for 12 weeks (1:1:1:1:1 ratio). All study treatments were supplied in identical tubes with blinded labels.

In study B, patients with moderate to severe rosacea and at least 8 inflammatory lesions were randomized in a 1:1 ratio with instructions to apply clindamycin gel 1% or vehicle gel to the affected areas twice daily (morning and evening) for 12 weeks.

Efficacy Evaluation

Evaluations were performed at baseline and weeks 2, 4, 8, and 12 on the intention-to-treat population with the last observation carried forward.

Efficacy assessments in both studies included inflammatory lesion counts (papules and pustules) of 5 facial regions--forehead, chin, nose, right cheek, left cheek--counted separately and then combined to give the total inflammatory lesion count (both studies), as well as improvement in the investigator global rosacea severity score (0=none/clear; 1=mild, detectable erythema with ≤7 papules/pustules; 2=moderate, prominent erythema with ≥8 papules/pustules; 3=severe, intense erythema with ≥10 to <50 papules/pustules; 3.5 [study A] or 4 [study B]=very severe, intense erythema with >50 papules/pustules). In study B, the proportion of participants dichotomized to success (a score of 0 [none/clear] or 1 [mild/almost clear]) or failure (a score of ≥2) on the 5-point investigator global rosacea severity scale at week 12 was evaluated. In study A, investigator global improvement assessment at week 12, based on photographs taken at baseline, was graded on a 7-point scale (from -1 [worse], 0 [no change], and 1 [minimal improvement] to 5 [clear]). In both studies, erythema severity was graded on a 7-point scale in increments of 0.5 (from 0=no erythema to 3.5=very severe redness, very intense redness). Skin irritation also was graded as none, mild, moderate, or severe. 

Safety Evaluation

Safety was assessed by the incidence of adverse events (AEs).

Statistical Analysis

Studies were powered assuming 60% reduction in inflammatory lesion counts with active and 40% with vehicle, based on historical data from a prior study with metronidazole cream 0.75% versus vehicle; 64 participants were required in each treatment group to detect this effect using a 2-sided t test (&#945;=.017). Pairwise comparisons (clindamycin vs respective vehicle) were performed using the Cochran-Mantel-Haenszel test for combined lesion count percentage change.

 

 

Results

Participant Disposition and Baseline Characteristics

Overall, a total of 629 participants were randomized across both studies. In study A, a total of 416 participants were randomized into 5 treatment arms, with 369 participants (88.7%) completing the study; 47 (11.3%) participants discontinued study A, mainly due to participant request (19/47 [40.4%]) or lost to follow-up (11/47 [23.4%]). In study B, a total of 213 participants were randomized to receive either clindamycin gel 1% (n=109 [51.2%]) twice daily or vehicle gel (n=104 [48.8%]) twice daily, with 193 participants (90.6%) completing the study; 20 (9.4%) participants discontinued study B, mainly due to participant request (6/20 [30%]) or lost to follow-up (4/20 [20%]). Participants in studies A and B were similar in demographics and baseline disease characteristics (Table). The majority of participants were white females. 

Efficacy

No statistically significant difference was observed in all pairwise comparisons (clindamycin cream twice daily vs vehicle twice daily, clindamycin cream once daily vs vehicle once daily, clindamycin gel vs vehicle gel) for the primary end point of mean percentage change from baseline in inflammatory lesion counts at week 12 (Figure 1; P>.5 for all pairwise comparisons). 

Figure 1. Mean percentage decrease from baseline in total inflammatory lesion count for clindamycin cream 1% twice daily (n=81) versus vehicle cream twice daily (n=81)(A), clindamycin cream 1% once daily (n=87) and clindamycin cream 0.3% once daily (n=85) versus vehicle cream once daily (n=82)(B), and clindamycin gel 1% twice daily (n=109) versus vehicle gel twice daily (n=104)(C). All P values were not significant.

At week 12, the proportion of participants in study B deemed as a success (none/clear or mild/almost clear [investigator global rosacea severity score of 0 or 1]) in the clindamycin gel 1% and vehicle gel groups were 45% versus 38%, respectively (P=.347) (Figure 2). 

Figure 2. Study B success rate (score of 0 [none/clear] or 1 [mild/almost clear]) of participants after 12 weeks of treatment with clindamycin gel 1% twice daily or vehicle gel twice daily based on the 5-point investigator global rosacea severity score (P=.347).

For the secondary end point of mean investigator global rosacea severity assessment at week 12 (study A), there were no significant differences between the active and vehicle control groups (P>.5 for all pairwise comparisons)(Figure 3). Also, the proportion of participants with at least a moderate investigator global improvement assessment from baseline to week 12 ranged from 45% for clindamycin cream 1% twice daily to 56% for clindamycin cream 0.3% cream once daily and from 45% for vehicle cream once daily to 51% for vehicle cream twice daily (P>.5 for all pairwise comparisons).

Figure 3. Study A mean investigator global rosacea severity score at baseline and week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (A) and clindamycin cream 1% once daily and clindamycin cream 0.3% once daily versus vehicle cream once daily (B). All P values were not significant.

There were no significant differences in the mean total erythema severity scores at week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (6.3 vs 6.0; P>.5), clindamycin cream 1% once daily versus vehicle cream once daily (6.2 vs 6.0; P>.5), clindamycin cream 0.3% once daily versus vehicle cream once daily (5.9 vs 6.0; P>.5), and clindamycin gel 1% twice daily versus vehicle gel twice daily (6.7 vs 6.2; P>.5). 

There were no relevant differences between any of the clindamycin cream groups and their respective vehicle group at week 12 for skin irritation, including desquamation, edema, dryness, pruritus, and stinging/burning.

Safety

In study A, the majority of AEs in all 5 treatment arms were nondermatologic, mild in intensity, and not considered to be related to the study treatment by the investigator. Overall, 12 participants had AEs considered by the investigator as possibly or probably related to the study treatment: 4.9% in the clindamycin cream 1% twice daily group, 4.6% in the clindamycin cream 1% once daily group, 3.7% in the vehicle cream twice daily group, 1.2% in the clindamycin cream 0.3% once daily group, and 0% in the vehicle cream once daily group. Two treatment-related AEs led to treatment discontinuation, including dermatitis in 1 participant from the clindamycin cream 1% once daily group and contact dermatitis in 1 participant from the clindamycin cream 1% twice daily group.

Comment

No evidence of increased efficacy over the respective vehicles was observed with clindamycin cream or gel, whatever the regimen, in the treatment of rosacea patients in either of these well-designed and well-powered, blinded studies. Slight improvements in the various efficacy criteria were observed, even in the vehicle groups, highlighting the importance of using a good basic skin care regimen in the management of rosacea.9 In contrast to our observations of lack of efficacy in the treatment of rosacea, clinical efficacy of clindamycin has been demonstrated in acne,10-12 albeit with low efficacy for clindamycin monotherapy.13 It is noteworthy that oral or topical antibiotics are no longer recommended as monotherapy for acne to prevent and minimize antibiotic resistance and to preserve the therapeutic value of antibiotics.14

Acne and rosacea are both chronic inflammatory disorders of the skin associated with papules and pustules, and they share some common inflammatory patterns.15-19 Furthermore, the intrinsic anti-inflammatory activity of clindamycin in addition to its antibiotic effects has been suggested by some authors as the main reason for treating acne with clindamycin.20 However, the relative contributions of antibacterial and/or anti-inflammatory properties remain to be fully elucidated, and evidence for direct anti-inflammatory effects of clindamycin remains heterogeneous.21,22 Several pathophysiological factors have been implicated in acne, including hormonal effects, abnormal keratinocyte function, increased sebum production, and microbial components (eg, hypercolonization of the skin follicles by Propionibacterium acnes).23,24 The antibiotic activity of clindamycin against P acnes may be the key factor responsible for the clinical effects in acne.25,26 Although clindamycin may have anti-inflammatory effects in acne via a different inflammatory pathway not shared by rosacea, a purely antibiotic mechanism of action of clindamycin also could explain why we observed no evidence of efficacy in the treatment of rosacea, as no causative bacterial component has been clearly demonstrated in rosacea.27

Conclusion

In these studies, clindamycin cream 0.3% once daily, clindamycin cream 1% once or twice daily, and clindamycin gel 1% twice daily were all well tolerated; however, they were no more effective than the vehicles in the treatment of moderate to severe rosacea.  

Acknowledgment

The authors would like to thank Helen Simpson, PhD, of Galderma R&D (Sophia Antipolis, France), for editorial and medical writing assistance.

References
  1. Whitney KM, Ditre CM. Anti-inflammatory properties of clindamycin: a review of its use in the treatment of acne vulgaris. Clinical Medicine Insights: Dermatology. 2011;4:27-41.  
  2. Mays RM, Gordon RA, Wilson JM, et al. New antibiotic therapies for acne and rosacea. Dermatol Ther. 2012;25:23-37.
  3. Wilkin JK, DeWitt S. Treatment of rosacea: topical clindamycin versus oral tetracycline. Int J Dermatol. 1993;32:65-67.
  4. Breneman D, Savin R, VandePol C, et al. Double-blind, randomized, vehicle-controlled clinical trial of once-daily benzoyl peroxide/clindamycin topical gel in the treatment of patients with moderate to severe rosacea. Int J Dermatol. 2004;43:381-387.
  5. Leyden JJ, Thiboutot D, Shalita A. Photographic review of results from a clinical study comparing benzoyl peroxide 5%/clindamycin 1% topical gel with vehicle in the treatment of rosacea. Cutis. 2004;73(6 suppl):11-17.
  6. Chang AL, Alora-Palli M, Lima XT, et al. A randomized, double-blind, placebo-controlled, pilot study to assess the efficacy and safety of clindamycin 1.2% and tretinoin 0.025% combination gel for the treatment of acne rosacea over 12 weeks. J Drugs Dermatol. 2012;11:333-339.
  7. Freeman SA, Moon SD, Spencer JM. Clindamycin phosphate 1.2% and tretinoin 0.025% gel for rosacea: summary of a placebo-controlled, double-blind trial. J Drugs Dermatol. 2012;11:1410-1414.
  8. van Zuuren EJ, Fedorowicz Z, Carter B, et al. Interventions for rosacea. Cochrane Database Syst Rev. 2015;4:CD003262.
  9. Laquieze S, Czernielewski J, Baltas E. Beneficial use of Cetaphil moisturizing cream as part of a daily skin care regimen for individuals with rosacea. J Dermatolog Treat. 2007;18:158-162.
  10. Lookingbill DP, Chalker DK, Lindholm JS, et al. Treatment of acne with a combination clindamycin/benzoyl peroxide gel compared with clindamycin gel, benzoyl peroxide gel and vehicle gel: combined results of two double-blind investigations. J Am Acad Dermatol. 1997;37:590-595.
  11. Alirezaï M, Gerlach B, Horvath A, et al. Results of a randomised, multicentre study comparing a new water-based gel of clindamycin 1% versus clindamycin 1% topical solution in the treatment of acne vulgaris. Eur J Dermatol. 2005;15:274-278.
  12. Jarratt MT, Brundage T. Efficacy and safety of clindamycin-tretinoin gel versus clindamycin or tretinoin alone in acne vulgaris: a randomized, double-blind, vehicle-controlled study. J Drugs Dermatol. 2012;11:318-326.
  13. Benzaclin. Med Library website. http://medlibrary.org/lib/rx/meds/benzaclin-3. Updated May 8, 2013. Accessed January 24, 2017.
  14. Walsh TR, Efthimiou J, Dréno B. Systematic review of antibiotic resistance in acne: an increasing topical and oral threat. Lancet Infect Dis. 2016;16:E23-E33.
  15. Jeremy AH, Holland DB, Roberts SG, et al. Inflammatory events are involved in acne lesion initiation. J Invest Dermatol. 2003;121:20-27.  
  16. Kircik LH. Re-evaluating treatment targets in acne vulgaris: adapting to a new understanding of pathophysiology. J Drugs Dermatol. 2014;13:S57-S60.  
  17. Salzer S, Kresse S, Hirai Y, et al. Cathelicidin peptide LL-37 increases UVB-triggered inflammasome activation: possible implications for rosacea. J Dermatol Sci. 2014;76:173-179.
  18. Buhl T, Sulk M, Nowak P, et al. Molecular and morphological characterization of inflammatory infiltrate in rosacea reveals activation of Th1/Th17 pathways. J Invest Dermatol. 2015;135:2198-2208.
  19. Kistowska M, Meier B, Proust T, et al. Propionibacterium acnes promotes Th17 and Th17/Th1 responses in acne patients. J Invest Dermatol. 2015;135:110-118.  
  20. Zeichner JA. Inflammatory acne treatment: review of current and new topical therapeutic options. J Drugs Dermatol. 2016;15(1 suppl 1):S11-S16.  
  21. Nakano T, Hiramatsu K, Kishi K, et al. Clindamycin modulates inflammatory-cytokine induction in lipopolysaccharide-stimulated mouse peritoneal macrophages. Antimicrob Agents Chemother. 2003;47:363-367.
  22. Orman KL, English BK. Effects of antibiotic class on the macrophage inflammatory response to Streptococcus pneumoniae. J Infect Dis. 2000;182:1561-1565.
  23. Taylor M, Gonzalez M, Porter R. Pathways to inflammation: acne pathophysiology. Eur J Dermatol. 2011;21:323-333.
  24. Del Rosso JQ, Kircik LH. The sequence of inflammation, relevant biomarkers, and the pathogenesis of acne vulgaris: what does recent research show and what does it mean to the clinician? J Drugs Dermatol. 2013;12(8 suppl):S109-S115.  
  25. Leyden J, Kaidbey K, Levy SF. The combination formulation of clindamycin 1% plus benzoyl peroxide 5% versus 3 different formulations of topical clindamycin alone in the reduction of Propionibacterium acnes. an in vivo comparative study. Am J Clin Dermatol. 2001;2:263-266.
  26. Wang WL, Everett ED, Johnson M, et al. Susceptibility of Propionibacterium acnes to seventeen antibiotics. Antimicrob Agents Chemother. 1977;11:171-173.
  27. Steinhoff M, Schauber J, Leyden JJ. New insights into rosacea pathophysiology: a review of recent findings. J Am Acad Dermatol. 2013;69(6 suppl 1):S15-S26.  
Article PDF
Author and Disclosure Information

Dr. Martel and Ms. Carlavan are from Galderma R&D, Sophia Antipolis, France. Dr. Jarratt is from DermResearch Inc, Austin, Texas. Dr. Weiss is from Gwinnett Dermatology, PC, and Gwinnett Clinical Research Center, Inc, Snellville, Georgia.

The studies were sponsored by Galderma R&D. Dr. Martel and Ms. Carlavan are employees of Galderma R&D. Dr. Jarratt has been a consultant, investigator, and received honoraria from Allergan; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also is a consultant for Athenex. Dr. Weiss has been an advisory board member and researcher for Foamix; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also has been a researcher for Allergan, Inc. 

Correspondence: Philippe Martel, MD, Galderma R&D, 2400 Route des Colles, F-06410 Biot, France ([email protected]).

Issue
Cutis - 100(1)
Publications
Topics
Page Number
53-58
Sections
Author and Disclosure Information

Dr. Martel and Ms. Carlavan are from Galderma R&D, Sophia Antipolis, France. Dr. Jarratt is from DermResearch Inc, Austin, Texas. Dr. Weiss is from Gwinnett Dermatology, PC, and Gwinnett Clinical Research Center, Inc, Snellville, Georgia.

The studies were sponsored by Galderma R&D. Dr. Martel and Ms. Carlavan are employees of Galderma R&D. Dr. Jarratt has been a consultant, investigator, and received honoraria from Allergan; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also is a consultant for Athenex. Dr. Weiss has been an advisory board member and researcher for Foamix; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also has been a researcher for Allergan, Inc. 

Correspondence: Philippe Martel, MD, Galderma R&D, 2400 Route des Colles, F-06410 Biot, France ([email protected]).

Author and Disclosure Information

Dr. Martel and Ms. Carlavan are from Galderma R&D, Sophia Antipolis, France. Dr. Jarratt is from DermResearch Inc, Austin, Texas. Dr. Weiss is from Gwinnett Dermatology, PC, and Gwinnett Clinical Research Center, Inc, Snellville, Georgia.

The studies were sponsored by Galderma R&D. Dr. Martel and Ms. Carlavan are employees of Galderma R&D. Dr. Jarratt has been a consultant, investigator, and received honoraria from Allergan; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also is a consultant for Athenex. Dr. Weiss has been an advisory board member and researcher for Foamix; Galderma R&D; and Valeant Pharmaceuticals International, Inc. He also has been a researcher for Allergan, Inc. 

Correspondence: Philippe Martel, MD, Galderma R&D, 2400 Route des Colles, F-06410 Biot, France ([email protected]).

Article PDF
Article PDF
Related Articles

Rosacea is a chronic inflammatory skin disease characterized by central facial erythema with or without intermittent papules and pustules (described as the inflammatory lesions of rosacea). Although twice-daily clindamycin 1% solution or gel has been used in the treatment of acne, few studies have investigated the use of clindamycin in rosacea.1,2 In one study comparing twice-daily clindamycin lotion 1% with oral tetracycline in 43 rosacea patients, clindamycin was found to be superior in the eradication of pustules.3 A combination therapy of clindamycin 1% and benzoyl peroxide 5% was found to be more effective than the vehicle in inflammatory lesions and erythema of rosacea in a 12-week randomized controlled trial; however, a definitive advantage over US Food and Drug Administration-approved topical agents used to treat papulopustular rosacea was not established.4,5 Two further studies evaluated clindamycin phosphate 1.2%-tretinoin 0.025% combination gel in the treatment of rosacea, but only 1 showed any effect on papulopustular lesions.6-8 The objective of the studies reported here was to evaluate the efficacy and safety of clindamycin in the treatment of patients with moderate to severe rosacea.

Methods

Study Design

Two multicenter (study A, 20 centers; study B, 10 centers), randomized, investigator-blinded, vehicle-controlled studies were conducted in the United States between 1999 and 2002 in accordance with the Declaration of Helsinki, International Conference on Harmonisation Good Clinical Practice guidelines, and local regulatory requirements. The studies were reviewed and approved by the respective institutional review boards, and all participants provided written informed consent.

In study A, moderate to severe rosacea patients with erythema, telangiectasia, and at least 8 inflammatory lesions were randomized to receive clindamycin cream 1% or vehicle cream once (in the evening) or twice daily (in the morning and evening) or clindamycin cream 0.3% once daily (in the evening) for 12 weeks (1:1:1:1:1 ratio). All study treatments were supplied in identical tubes with blinded labels.

In study B, patients with moderate to severe rosacea and at least 8 inflammatory lesions were randomized in a 1:1 ratio with instructions to apply clindamycin gel 1% or vehicle gel to the affected areas twice daily (morning and evening) for 12 weeks.

Efficacy Evaluation

Evaluations were performed at baseline and weeks 2, 4, 8, and 12 on the intention-to-treat population with the last observation carried forward.

Efficacy assessments in both studies included inflammatory lesion counts (papules and pustules) of 5 facial regions--forehead, chin, nose, right cheek, left cheek--counted separately and then combined to give the total inflammatory lesion count (both studies), as well as improvement in the investigator global rosacea severity score (0=none/clear; 1=mild, detectable erythema with ≤7 papules/pustules; 2=moderate, prominent erythema with ≥8 papules/pustules; 3=severe, intense erythema with ≥10 to <50 papules/pustules; 3.5 [study A] or 4 [study B]=very severe, intense erythema with >50 papules/pustules). In study B, the proportion of participants dichotomized to success (a score of 0 [none/clear] or 1 [mild/almost clear]) or failure (a score of ≥2) on the 5-point investigator global rosacea severity scale at week 12 was evaluated. In study A, investigator global improvement assessment at week 12, based on photographs taken at baseline, was graded on a 7-point scale (from -1 [worse], 0 [no change], and 1 [minimal improvement] to 5 [clear]). In both studies, erythema severity was graded on a 7-point scale in increments of 0.5 (from 0=no erythema to 3.5=very severe redness, very intense redness). Skin irritation also was graded as none, mild, moderate, or severe. 

Safety Evaluation

Safety was assessed by the incidence of adverse events (AEs).

Statistical Analysis

Studies were powered assuming 60% reduction in inflammatory lesion counts with active and 40% with vehicle, based on historical data from a prior study with metronidazole cream 0.75% versus vehicle; 64 participants were required in each treatment group to detect this effect using a 2-sided t test (&#945;=.017). Pairwise comparisons (clindamycin vs respective vehicle) were performed using the Cochran-Mantel-Haenszel test for combined lesion count percentage change.

 

 

Results

Participant Disposition and Baseline Characteristics

Overall, a total of 629 participants were randomized across both studies. In study A, a total of 416 participants were randomized into 5 treatment arms, with 369 participants (88.7%) completing the study; 47 (11.3%) participants discontinued study A, mainly due to participant request (19/47 [40.4%]) or lost to follow-up (11/47 [23.4%]). In study B, a total of 213 participants were randomized to receive either clindamycin gel 1% (n=109 [51.2%]) twice daily or vehicle gel (n=104 [48.8%]) twice daily, with 193 participants (90.6%) completing the study; 20 (9.4%) participants discontinued study B, mainly due to participant request (6/20 [30%]) or lost to follow-up (4/20 [20%]). Participants in studies A and B were similar in demographics and baseline disease characteristics (Table). The majority of participants were white females. 

Efficacy

No statistically significant difference was observed in all pairwise comparisons (clindamycin cream twice daily vs vehicle twice daily, clindamycin cream once daily vs vehicle once daily, clindamycin gel vs vehicle gel) for the primary end point of mean percentage change from baseline in inflammatory lesion counts at week 12 (Figure 1; P>.5 for all pairwise comparisons). 

Figure 1. Mean percentage decrease from baseline in total inflammatory lesion count for clindamycin cream 1% twice daily (n=81) versus vehicle cream twice daily (n=81)(A), clindamycin cream 1% once daily (n=87) and clindamycin cream 0.3% once daily (n=85) versus vehicle cream once daily (n=82)(B), and clindamycin gel 1% twice daily (n=109) versus vehicle gel twice daily (n=104)(C). All P values were not significant.

At week 12, the proportion of participants in study B deemed as a success (none/clear or mild/almost clear [investigator global rosacea severity score of 0 or 1]) in the clindamycin gel 1% and vehicle gel groups were 45% versus 38%, respectively (P=.347) (Figure 2). 

Figure 2. Study B success rate (score of 0 [none/clear] or 1 [mild/almost clear]) of participants after 12 weeks of treatment with clindamycin gel 1% twice daily or vehicle gel twice daily based on the 5-point investigator global rosacea severity score (P=.347).

For the secondary end point of mean investigator global rosacea severity assessment at week 12 (study A), there were no significant differences between the active and vehicle control groups (P>.5 for all pairwise comparisons)(Figure 3). Also, the proportion of participants with at least a moderate investigator global improvement assessment from baseline to week 12 ranged from 45% for clindamycin cream 1% twice daily to 56% for clindamycin cream 0.3% cream once daily and from 45% for vehicle cream once daily to 51% for vehicle cream twice daily (P>.5 for all pairwise comparisons).

Figure 3. Study A mean investigator global rosacea severity score at baseline and week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (A) and clindamycin cream 1% once daily and clindamycin cream 0.3% once daily versus vehicle cream once daily (B). All P values were not significant.

There were no significant differences in the mean total erythema severity scores at week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (6.3 vs 6.0; P>.5), clindamycin cream 1% once daily versus vehicle cream once daily (6.2 vs 6.0; P>.5), clindamycin cream 0.3% once daily versus vehicle cream once daily (5.9 vs 6.0; P>.5), and clindamycin gel 1% twice daily versus vehicle gel twice daily (6.7 vs 6.2; P>.5). 

There were no relevant differences between any of the clindamycin cream groups and their respective vehicle group at week 12 for skin irritation, including desquamation, edema, dryness, pruritus, and stinging/burning.

Safety

In study A, the majority of AEs in all 5 treatment arms were nondermatologic, mild in intensity, and not considered to be related to the study treatment by the investigator. Overall, 12 participants had AEs considered by the investigator as possibly or probably related to the study treatment: 4.9% in the clindamycin cream 1% twice daily group, 4.6% in the clindamycin cream 1% once daily group, 3.7% in the vehicle cream twice daily group, 1.2% in the clindamycin cream 0.3% once daily group, and 0% in the vehicle cream once daily group. Two treatment-related AEs led to treatment discontinuation, including dermatitis in 1 participant from the clindamycin cream 1% once daily group and contact dermatitis in 1 participant from the clindamycin cream 1% twice daily group.

Comment

No evidence of increased efficacy over the respective vehicles was observed with clindamycin cream or gel, whatever the regimen, in the treatment of rosacea patients in either of these well-designed and well-powered, blinded studies. Slight improvements in the various efficacy criteria were observed, even in the vehicle groups, highlighting the importance of using a good basic skin care regimen in the management of rosacea.9 In contrast to our observations of lack of efficacy in the treatment of rosacea, clinical efficacy of clindamycin has been demonstrated in acne,10-12 albeit with low efficacy for clindamycin monotherapy.13 It is noteworthy that oral or topical antibiotics are no longer recommended as monotherapy for acne to prevent and minimize antibiotic resistance and to preserve the therapeutic value of antibiotics.14

Acne and rosacea are both chronic inflammatory disorders of the skin associated with papules and pustules, and they share some common inflammatory patterns.15-19 Furthermore, the intrinsic anti-inflammatory activity of clindamycin in addition to its antibiotic effects has been suggested by some authors as the main reason for treating acne with clindamycin.20 However, the relative contributions of antibacterial and/or anti-inflammatory properties remain to be fully elucidated, and evidence for direct anti-inflammatory effects of clindamycin remains heterogeneous.21,22 Several pathophysiological factors have been implicated in acne, including hormonal effects, abnormal keratinocyte function, increased sebum production, and microbial components (eg, hypercolonization of the skin follicles by Propionibacterium acnes).23,24 The antibiotic activity of clindamycin against P acnes may be the key factor responsible for the clinical effects in acne.25,26 Although clindamycin may have anti-inflammatory effects in acne via a different inflammatory pathway not shared by rosacea, a purely antibiotic mechanism of action of clindamycin also could explain why we observed no evidence of efficacy in the treatment of rosacea, as no causative bacterial component has been clearly demonstrated in rosacea.27

Conclusion

In these studies, clindamycin cream 0.3% once daily, clindamycin cream 1% once or twice daily, and clindamycin gel 1% twice daily were all well tolerated; however, they were no more effective than the vehicles in the treatment of moderate to severe rosacea.  

Acknowledgment

The authors would like to thank Helen Simpson, PhD, of Galderma R&D (Sophia Antipolis, France), for editorial and medical writing assistance.

Rosacea is a chronic inflammatory skin disease characterized by central facial erythema with or without intermittent papules and pustules (described as the inflammatory lesions of rosacea). Although twice-daily clindamycin 1% solution or gel has been used in the treatment of acne, few studies have investigated the use of clindamycin in rosacea.1,2 In one study comparing twice-daily clindamycin lotion 1% with oral tetracycline in 43 rosacea patients, clindamycin was found to be superior in the eradication of pustules.3 A combination therapy of clindamycin 1% and benzoyl peroxide 5% was found to be more effective than the vehicle in inflammatory lesions and erythema of rosacea in a 12-week randomized controlled trial; however, a definitive advantage over US Food and Drug Administration-approved topical agents used to treat papulopustular rosacea was not established.4,5 Two further studies evaluated clindamycin phosphate 1.2%-tretinoin 0.025% combination gel in the treatment of rosacea, but only 1 showed any effect on papulopustular lesions.6-8 The objective of the studies reported here was to evaluate the efficacy and safety of clindamycin in the treatment of patients with moderate to severe rosacea.

Methods

Study Design

Two multicenter (study A, 20 centers; study B, 10 centers), randomized, investigator-blinded, vehicle-controlled studies were conducted in the United States between 1999 and 2002 in accordance with the Declaration of Helsinki, International Conference on Harmonisation Good Clinical Practice guidelines, and local regulatory requirements. The studies were reviewed and approved by the respective institutional review boards, and all participants provided written informed consent.

In study A, moderate to severe rosacea patients with erythema, telangiectasia, and at least 8 inflammatory lesions were randomized to receive clindamycin cream 1% or vehicle cream once (in the evening) or twice daily (in the morning and evening) or clindamycin cream 0.3% once daily (in the evening) for 12 weeks (1:1:1:1:1 ratio). All study treatments were supplied in identical tubes with blinded labels.

In study B, patients with moderate to severe rosacea and at least 8 inflammatory lesions were randomized in a 1:1 ratio with instructions to apply clindamycin gel 1% or vehicle gel to the affected areas twice daily (morning and evening) for 12 weeks.

Efficacy Evaluation

Evaluations were performed at baseline and weeks 2, 4, 8, and 12 on the intention-to-treat population with the last observation carried forward.

Efficacy assessments in both studies included inflammatory lesion counts (papules and pustules) of 5 facial regions--forehead, chin, nose, right cheek, left cheek--counted separately and then combined to give the total inflammatory lesion count (both studies), as well as improvement in the investigator global rosacea severity score (0=none/clear; 1=mild, detectable erythema with ≤7 papules/pustules; 2=moderate, prominent erythema with ≥8 papules/pustules; 3=severe, intense erythema with ≥10 to <50 papules/pustules; 3.5 [study A] or 4 [study B]=very severe, intense erythema with >50 papules/pustules). In study B, the proportion of participants dichotomized to success (a score of 0 [none/clear] or 1 [mild/almost clear]) or failure (a score of ≥2) on the 5-point investigator global rosacea severity scale at week 12 was evaluated. In study A, investigator global improvement assessment at week 12, based on photographs taken at baseline, was graded on a 7-point scale (from -1 [worse], 0 [no change], and 1 [minimal improvement] to 5 [clear]). In both studies, erythema severity was graded on a 7-point scale in increments of 0.5 (from 0=no erythema to 3.5=very severe redness, very intense redness). Skin irritation also was graded as none, mild, moderate, or severe. 

Safety Evaluation

Safety was assessed by the incidence of adverse events (AEs).

Statistical Analysis

Studies were powered assuming 60% reduction in inflammatory lesion counts with active and 40% with vehicle, based on historical data from a prior study with metronidazole cream 0.75% versus vehicle; 64 participants were required in each treatment group to detect this effect using a 2-sided t test (&#945;=.017). Pairwise comparisons (clindamycin vs respective vehicle) were performed using the Cochran-Mantel-Haenszel test for combined lesion count percentage change.

 

 

Results

Participant Disposition and Baseline Characteristics

Overall, a total of 629 participants were randomized across both studies. In study A, a total of 416 participants were randomized into 5 treatment arms, with 369 participants (88.7%) completing the study; 47 (11.3%) participants discontinued study A, mainly due to participant request (19/47 [40.4%]) or lost to follow-up (11/47 [23.4%]). In study B, a total of 213 participants were randomized to receive either clindamycin gel 1% (n=109 [51.2%]) twice daily or vehicle gel (n=104 [48.8%]) twice daily, with 193 participants (90.6%) completing the study; 20 (9.4%) participants discontinued study B, mainly due to participant request (6/20 [30%]) or lost to follow-up (4/20 [20%]). Participants in studies A and B were similar in demographics and baseline disease characteristics (Table). The majority of participants were white females. 

Efficacy

No statistically significant difference was observed in all pairwise comparisons (clindamycin cream twice daily vs vehicle twice daily, clindamycin cream once daily vs vehicle once daily, clindamycin gel vs vehicle gel) for the primary end point of mean percentage change from baseline in inflammatory lesion counts at week 12 (Figure 1; P>.5 for all pairwise comparisons). 

Figure 1. Mean percentage decrease from baseline in total inflammatory lesion count for clindamycin cream 1% twice daily (n=81) versus vehicle cream twice daily (n=81)(A), clindamycin cream 1% once daily (n=87) and clindamycin cream 0.3% once daily (n=85) versus vehicle cream once daily (n=82)(B), and clindamycin gel 1% twice daily (n=109) versus vehicle gel twice daily (n=104)(C). All P values were not significant.

At week 12, the proportion of participants in study B deemed as a success (none/clear or mild/almost clear [investigator global rosacea severity score of 0 or 1]) in the clindamycin gel 1% and vehicle gel groups were 45% versus 38%, respectively (P=.347) (Figure 2). 

Figure 2. Study B success rate (score of 0 [none/clear] or 1 [mild/almost clear]) of participants after 12 weeks of treatment with clindamycin gel 1% twice daily or vehicle gel twice daily based on the 5-point investigator global rosacea severity score (P=.347).

For the secondary end point of mean investigator global rosacea severity assessment at week 12 (study A), there were no significant differences between the active and vehicle control groups (P>.5 for all pairwise comparisons)(Figure 3). Also, the proportion of participants with at least a moderate investigator global improvement assessment from baseline to week 12 ranged from 45% for clindamycin cream 1% twice daily to 56% for clindamycin cream 0.3% cream once daily and from 45% for vehicle cream once daily to 51% for vehicle cream twice daily (P>.5 for all pairwise comparisons).

Figure 3. Study A mean investigator global rosacea severity score at baseline and week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (A) and clindamycin cream 1% once daily and clindamycin cream 0.3% once daily versus vehicle cream once daily (B). All P values were not significant.

There were no significant differences in the mean total erythema severity scores at week 12 for clindamycin cream 1% twice daily versus vehicle cream twice daily (6.3 vs 6.0; P>.5), clindamycin cream 1% once daily versus vehicle cream once daily (6.2 vs 6.0; P>.5), clindamycin cream 0.3% once daily versus vehicle cream once daily (5.9 vs 6.0; P>.5), and clindamycin gel 1% twice daily versus vehicle gel twice daily (6.7 vs 6.2; P>.5). 

There were no relevant differences between any of the clindamycin cream groups and their respective vehicle group at week 12 for skin irritation, including desquamation, edema, dryness, pruritus, and stinging/burning.

Safety

In study A, the majority of AEs in all 5 treatment arms were nondermatologic, mild in intensity, and not considered to be related to the study treatment by the investigator. Overall, 12 participants had AEs considered by the investigator as possibly or probably related to the study treatment: 4.9% in the clindamycin cream 1% twice daily group, 4.6% in the clindamycin cream 1% once daily group, 3.7% in the vehicle cream twice daily group, 1.2% in the clindamycin cream 0.3% once daily group, and 0% in the vehicle cream once daily group. Two treatment-related AEs led to treatment discontinuation, including dermatitis in 1 participant from the clindamycin cream 1% once daily group and contact dermatitis in 1 participant from the clindamycin cream 1% twice daily group.

Comment

No evidence of increased efficacy over the respective vehicles was observed with clindamycin cream or gel, whatever the regimen, in the treatment of rosacea patients in either of these well-designed and well-powered, blinded studies. Slight improvements in the various efficacy criteria were observed, even in the vehicle groups, highlighting the importance of using a good basic skin care regimen in the management of rosacea.9 In contrast to our observations of lack of efficacy in the treatment of rosacea, clinical efficacy of clindamycin has been demonstrated in acne,10-12 albeit with low efficacy for clindamycin monotherapy.13 It is noteworthy that oral or topical antibiotics are no longer recommended as monotherapy for acne to prevent and minimize antibiotic resistance and to preserve the therapeutic value of antibiotics.14

Acne and rosacea are both chronic inflammatory disorders of the skin associated with papules and pustules, and they share some common inflammatory patterns.15-19 Furthermore, the intrinsic anti-inflammatory activity of clindamycin in addition to its antibiotic effects has been suggested by some authors as the main reason for treating acne with clindamycin.20 However, the relative contributions of antibacterial and/or anti-inflammatory properties remain to be fully elucidated, and evidence for direct anti-inflammatory effects of clindamycin remains heterogeneous.21,22 Several pathophysiological factors have been implicated in acne, including hormonal effects, abnormal keratinocyte function, increased sebum production, and microbial components (eg, hypercolonization of the skin follicles by Propionibacterium acnes).23,24 The antibiotic activity of clindamycin against P acnes may be the key factor responsible for the clinical effects in acne.25,26 Although clindamycin may have anti-inflammatory effects in acne via a different inflammatory pathway not shared by rosacea, a purely antibiotic mechanism of action of clindamycin also could explain why we observed no evidence of efficacy in the treatment of rosacea, as no causative bacterial component has been clearly demonstrated in rosacea.27

Conclusion

In these studies, clindamycin cream 0.3% once daily, clindamycin cream 1% once or twice daily, and clindamycin gel 1% twice daily were all well tolerated; however, they were no more effective than the vehicles in the treatment of moderate to severe rosacea.  

Acknowledgment

The authors would like to thank Helen Simpson, PhD, of Galderma R&D (Sophia Antipolis, France), for editorial and medical writing assistance.

References
  1. Whitney KM, Ditre CM. Anti-inflammatory properties of clindamycin: a review of its use in the treatment of acne vulgaris. Clinical Medicine Insights: Dermatology. 2011;4:27-41.  
  2. Mays RM, Gordon RA, Wilson JM, et al. New antibiotic therapies for acne and rosacea. Dermatol Ther. 2012;25:23-37.
  3. Wilkin JK, DeWitt S. Treatment of rosacea: topical clindamycin versus oral tetracycline. Int J Dermatol. 1993;32:65-67.
  4. Breneman D, Savin R, VandePol C, et al. Double-blind, randomized, vehicle-controlled clinical trial of once-daily benzoyl peroxide/clindamycin topical gel in the treatment of patients with moderate to severe rosacea. Int J Dermatol. 2004;43:381-387.
  5. Leyden JJ, Thiboutot D, Shalita A. Photographic review of results from a clinical study comparing benzoyl peroxide 5%/clindamycin 1% topical gel with vehicle in the treatment of rosacea. Cutis. 2004;73(6 suppl):11-17.
  6. Chang AL, Alora-Palli M, Lima XT, et al. A randomized, double-blind, placebo-controlled, pilot study to assess the efficacy and safety of clindamycin 1.2% and tretinoin 0.025% combination gel for the treatment of acne rosacea over 12 weeks. J Drugs Dermatol. 2012;11:333-339.
  7. Freeman SA, Moon SD, Spencer JM. Clindamycin phosphate 1.2% and tretinoin 0.025% gel for rosacea: summary of a placebo-controlled, double-blind trial. J Drugs Dermatol. 2012;11:1410-1414.
  8. van Zuuren EJ, Fedorowicz Z, Carter B, et al. Interventions for rosacea. Cochrane Database Syst Rev. 2015;4:CD003262.
  9. Laquieze S, Czernielewski J, Baltas E. Beneficial use of Cetaphil moisturizing cream as part of a daily skin care regimen for individuals with rosacea. J Dermatolog Treat. 2007;18:158-162.
  10. Lookingbill DP, Chalker DK, Lindholm JS, et al. Treatment of acne with a combination clindamycin/benzoyl peroxide gel compared with clindamycin gel, benzoyl peroxide gel and vehicle gel: combined results of two double-blind investigations. J Am Acad Dermatol. 1997;37:590-595.
  11. Alirezaï M, Gerlach B, Horvath A, et al. Results of a randomised, multicentre study comparing a new water-based gel of clindamycin 1% versus clindamycin 1% topical solution in the treatment of acne vulgaris. Eur J Dermatol. 2005;15:274-278.
  12. Jarratt MT, Brundage T. Efficacy and safety of clindamycin-tretinoin gel versus clindamycin or tretinoin alone in acne vulgaris: a randomized, double-blind, vehicle-controlled study. J Drugs Dermatol. 2012;11:318-326.
  13. Benzaclin. Med Library website. http://medlibrary.org/lib/rx/meds/benzaclin-3. Updated May 8, 2013. Accessed January 24, 2017.
  14. Walsh TR, Efthimiou J, Dréno B. Systematic review of antibiotic resistance in acne: an increasing topical and oral threat. Lancet Infect Dis. 2016;16:E23-E33.
  15. Jeremy AH, Holland DB, Roberts SG, et al. Inflammatory events are involved in acne lesion initiation. J Invest Dermatol. 2003;121:20-27.  
  16. Kircik LH. Re-evaluating treatment targets in acne vulgaris: adapting to a new understanding of pathophysiology. J Drugs Dermatol. 2014;13:S57-S60.  
  17. Salzer S, Kresse S, Hirai Y, et al. Cathelicidin peptide LL-37 increases UVB-triggered inflammasome activation: possible implications for rosacea. J Dermatol Sci. 2014;76:173-179.
  18. Buhl T, Sulk M, Nowak P, et al. Molecular and morphological characterization of inflammatory infiltrate in rosacea reveals activation of Th1/Th17 pathways. J Invest Dermatol. 2015;135:2198-2208.
  19. Kistowska M, Meier B, Proust T, et al. Propionibacterium acnes promotes Th17 and Th17/Th1 responses in acne patients. J Invest Dermatol. 2015;135:110-118.  
  20. Zeichner JA. Inflammatory acne treatment: review of current and new topical therapeutic options. J Drugs Dermatol. 2016;15(1 suppl 1):S11-S16.  
  21. Nakano T, Hiramatsu K, Kishi K, et al. Clindamycin modulates inflammatory-cytokine induction in lipopolysaccharide-stimulated mouse peritoneal macrophages. Antimicrob Agents Chemother. 2003;47:363-367.
  22. Orman KL, English BK. Effects of antibiotic class on the macrophage inflammatory response to Streptococcus pneumoniae. J Infect Dis. 2000;182:1561-1565.
  23. Taylor M, Gonzalez M, Porter R. Pathways to inflammation: acne pathophysiology. Eur J Dermatol. 2011;21:323-333.
  24. Del Rosso JQ, Kircik LH. The sequence of inflammation, relevant biomarkers, and the pathogenesis of acne vulgaris: what does recent research show and what does it mean to the clinician? J Drugs Dermatol. 2013;12(8 suppl):S109-S115.  
  25. Leyden J, Kaidbey K, Levy SF. The combination formulation of clindamycin 1% plus benzoyl peroxide 5% versus 3 different formulations of topical clindamycin alone in the reduction of Propionibacterium acnes. an in vivo comparative study. Am J Clin Dermatol. 2001;2:263-266.
  26. Wang WL, Everett ED, Johnson M, et al. Susceptibility of Propionibacterium acnes to seventeen antibiotics. Antimicrob Agents Chemother. 1977;11:171-173.
  27. Steinhoff M, Schauber J, Leyden JJ. New insights into rosacea pathophysiology: a review of recent findings. J Am Acad Dermatol. 2013;69(6 suppl 1):S15-S26.  
References
  1. Whitney KM, Ditre CM. Anti-inflammatory properties of clindamycin: a review of its use in the treatment of acne vulgaris. Clinical Medicine Insights: Dermatology. 2011;4:27-41.  
  2. Mays RM, Gordon RA, Wilson JM, et al. New antibiotic therapies for acne and rosacea. Dermatol Ther. 2012;25:23-37.
  3. Wilkin JK, DeWitt S. Treatment of rosacea: topical clindamycin versus oral tetracycline. Int J Dermatol. 1993;32:65-67.
  4. Breneman D, Savin R, VandePol C, et al. Double-blind, randomized, vehicle-controlled clinical trial of once-daily benzoyl peroxide/clindamycin topical gel in the treatment of patients with moderate to severe rosacea. Int J Dermatol. 2004;43:381-387.
  5. Leyden JJ, Thiboutot D, Shalita A. Photographic review of results from a clinical study comparing benzoyl peroxide 5%/clindamycin 1% topical gel with vehicle in the treatment of rosacea. Cutis. 2004;73(6 suppl):11-17.
  6. Chang AL, Alora-Palli M, Lima XT, et al. A randomized, double-blind, placebo-controlled, pilot study to assess the efficacy and safety of clindamycin 1.2% and tretinoin 0.025% combination gel for the treatment of acne rosacea over 12 weeks. J Drugs Dermatol. 2012;11:333-339.
  7. Freeman SA, Moon SD, Spencer JM. Clindamycin phosphate 1.2% and tretinoin 0.025% gel for rosacea: summary of a placebo-controlled, double-blind trial. J Drugs Dermatol. 2012;11:1410-1414.
  8. van Zuuren EJ, Fedorowicz Z, Carter B, et al. Interventions for rosacea. Cochrane Database Syst Rev. 2015;4:CD003262.
  9. Laquieze S, Czernielewski J, Baltas E. Beneficial use of Cetaphil moisturizing cream as part of a daily skin care regimen for individuals with rosacea. J Dermatolog Treat. 2007;18:158-162.
  10. Lookingbill DP, Chalker DK, Lindholm JS, et al. Treatment of acne with a combination clindamycin/benzoyl peroxide gel compared with clindamycin gel, benzoyl peroxide gel and vehicle gel: combined results of two double-blind investigations. J Am Acad Dermatol. 1997;37:590-595.
  11. Alirezaï M, Gerlach B, Horvath A, et al. Results of a randomised, multicentre study comparing a new water-based gel of clindamycin 1% versus clindamycin 1% topical solution in the treatment of acne vulgaris. Eur J Dermatol. 2005;15:274-278.
  12. Jarratt MT, Brundage T. Efficacy and safety of clindamycin-tretinoin gel versus clindamycin or tretinoin alone in acne vulgaris: a randomized, double-blind, vehicle-controlled study. J Drugs Dermatol. 2012;11:318-326.
  13. Benzaclin. Med Library website. http://medlibrary.org/lib/rx/meds/benzaclin-3. Updated May 8, 2013. Accessed January 24, 2017.
  14. Walsh TR, Efthimiou J, Dréno B. Systematic review of antibiotic resistance in acne: an increasing topical and oral threat. Lancet Infect Dis. 2016;16:E23-E33.
  15. Jeremy AH, Holland DB, Roberts SG, et al. Inflammatory events are involved in acne lesion initiation. J Invest Dermatol. 2003;121:20-27.  
  16. Kircik LH. Re-evaluating treatment targets in acne vulgaris: adapting to a new understanding of pathophysiology. J Drugs Dermatol. 2014;13:S57-S60.  
  17. Salzer S, Kresse S, Hirai Y, et al. Cathelicidin peptide LL-37 increases UVB-triggered inflammasome activation: possible implications for rosacea. J Dermatol Sci. 2014;76:173-179.
  18. Buhl T, Sulk M, Nowak P, et al. Molecular and morphological characterization of inflammatory infiltrate in rosacea reveals activation of Th1/Th17 pathways. J Invest Dermatol. 2015;135:2198-2208.
  19. Kistowska M, Meier B, Proust T, et al. Propionibacterium acnes promotes Th17 and Th17/Th1 responses in acne patients. J Invest Dermatol. 2015;135:110-118.  
  20. Zeichner JA. Inflammatory acne treatment: review of current and new topical therapeutic options. J Drugs Dermatol. 2016;15(1 suppl 1):S11-S16.  
  21. Nakano T, Hiramatsu K, Kishi K, et al. Clindamycin modulates inflammatory-cytokine induction in lipopolysaccharide-stimulated mouse peritoneal macrophages. Antimicrob Agents Chemother. 2003;47:363-367.
  22. Orman KL, English BK. Effects of antibiotic class on the macrophage inflammatory response to Streptococcus pneumoniae. J Infect Dis. 2000;182:1561-1565.
  23. Taylor M, Gonzalez M, Porter R. Pathways to inflammation: acne pathophysiology. Eur J Dermatol. 2011;21:323-333.
  24. Del Rosso JQ, Kircik LH. The sequence of inflammation, relevant biomarkers, and the pathogenesis of acne vulgaris: what does recent research show and what does it mean to the clinician? J Drugs Dermatol. 2013;12(8 suppl):S109-S115.  
  25. Leyden J, Kaidbey K, Levy SF. The combination formulation of clindamycin 1% plus benzoyl peroxide 5% versus 3 different formulations of topical clindamycin alone in the reduction of Propionibacterium acnes. an in vivo comparative study. Am J Clin Dermatol. 2001;2:263-266.
  26. Wang WL, Everett ED, Johnson M, et al. Susceptibility of Propionibacterium acnes to seventeen antibiotics. Antimicrob Agents Chemother. 1977;11:171-173.
  27. Steinhoff M, Schauber J, Leyden JJ. New insights into rosacea pathophysiology: a review of recent findings. J Am Acad Dermatol. 2013;69(6 suppl 1):S15-S26.  
Issue
Cutis - 100(1)
Issue
Cutis - 100(1)
Page Number
53-58
Page Number
53-58
Publications
Publications
Topics
Article Type
Display Headline
Lack of Significant Anti-inflammatory Activity With Clindamycin in the Treatment of Rosacea: Results of 2 Randomized, Vehicle-Controlled Trials
Display Headline
Lack of Significant Anti-inflammatory Activity With Clindamycin in the Treatment of Rosacea: Results of 2 Randomized, Vehicle-Controlled Trials
Sections
Inside the Article

Practice Points

  • Clindamycin cream 0.3% and 1% and clindamycin gel 1% were no more effective than their respective vehicles in the treatment of moderate to severe rosacea.
  • Clindamycin may have no intrinsic anti-inflammatory activity in rosacea.
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
PubMed ID
28873109
Disqus Comments
Default
Article PDF Media

Comparison of Salicylic Acid 30% Peel and Pneumatic Broadband Light in the Treatment of Mild to Moderately Severe Facial Acne Vulgaris

Article Type
Changed
Display Headline
Comparison of Salicylic Acid 30% Peel and Pneumatic Broadband Light in the Treatment of Mild to Moderately Severe Facial Acne Vulgaris

Facial acne vulgaris is a common skin disease among teenagers and adolescents that may negatively affect self-esteem, perceived facial attractiveness, and social participation.1 Treatments for acne often are multimodal and require the utmost adherence. For these reasons, acne treatments have been challenging to clinicians and patients alike, as patient compliance in maintaining the use of prescribed topical and oral medications remains essential to attain improvement in quality of life (QOL).

Salicylic acid is a popular medicament for acne treatment that frequently is used as monotherapy or as an adjuvant for other acne treatments, especially in patients with oily skin.2 Salicylic acid has a keratolytic effect, causing corneocyte discohesion in clogged pores or congested follicles,2 and it is effective in treating both inflammatory and noninflammatory acne.3,4

Light therapy, particularly with visible light, has been demonstrated to improve acne outcomes.5 Pneumatic broadband light (PBBL) is a therapeutic light treatment in the broadband range (400–1200 nm) that is combined with vacuum suction, which creates a mechanical lysis of thin-walled pustules and dislodges pore impaction. Additionally, the blue light portion of the PBBL spectrum targets endogenous porphyrins in Propionibacterium acnes, resulting in bacterial destruction.6-8

The purpose of this study was to compare the efficacy, tolerability, and safety of salicylic acid 30% peel versus PBBL in the treatment of mild to moderately severe facial acne vulgaris.

METHODS

Study Design

This single-blind, randomized, split-face pilot study was approved by the institutional review board of the University of Pennsylvania (Philadelphia, Pennsylvania). All patients provided informed consent before entering the study. The single-blind evaluation was performed by one dermatologist (C.T.) who examined the participants on every visit prior to PBBL treatment.

Before the study started, participants were randomized for which side of the face was to be treated with PBBL using a number assigned to each participant. Participants received both treatments—salicylic acid 30% peel on one side of the face and PBBL treatment on the other side of the face—once weekly for a total of 6 treatments. They were then asked to return for 2 follow-up evaluations at weeks 3 and 6 following the last treatment session and were instructed not to use any topical or oral acne medications during these follow-up periods.

Inclusion and Exclusion Criteria

Patients aged 18 years and older of any race and sex with noninflammatory papules, some inflammatory papules, and no more than 1 nodule (considered as mild to moderately severe facial acne) were included in the study. Participants had not been on any topical acne medications for at least 1 month and/or oral retinoids for at least 1 year prior to the study period. All women completed urine pregnancy tests prior to the study and were advised to utilize birth control during the study period.

Study Treatments

Salicylic Acid 30% Peel

The participant’s face was cleansed thoroughly before application of salicylic acid 30% (1.5 g/2.5 mL) to half of the face and left on for 5 minutes before being carefully rinsed off by spraying with spring water. Prior to initiating PBBL therapy, the peeled side of the participant’s face was covered with a towel.

Pneumatic Broadband Light

On the other side of the face, PBBL was performed to deliver broadband light within the spectrum range of 400 to 1200 nm at a setting approximately equivalent to a fluence of 4 to 6 J/cm2 and a vacuum setting approximately equivalent to a negative pressure of 3 lb/in2. The power setting was increased on each subsequent visit depending on each participant’s tolerability.

Participants were required to apply a moisturizer and sunscreen to the face and avoid excessive sun exposure between study visits.

Efficacy Evaluation

A comparison of the efficacy of the treatments was determined by clinical evaluation and examining the results of the outcome measurements with the modified Global Acne Grading Score (mGAGS) and Acne QOL Scale during each treatment visit. Facial photographs were taken at each visit.

Modified Global Acne Grading Score

The mGAGS is a modification of the Global Acne Grading Scale (GAGS) that has been used to evaluate acne severity in many studies.9-11 The GAGS considers 6 locations on the face with a grading factor for each location. The local score is obtained by multiplying the factor rated by location with the factor of clinical assessment: local score = factor rated by location × factor rated by clinical assessment. The total score is the sum of the individual local scores (Table 1).

Although the original GAGS incorporated the type and location of the lesions in its calculation, we felt that the number of lesions also was important to add to our grading score. Therefore, we modified the GAGS by adding a factor rated by the number of lesions to improve the accuracy of the test. Accordingly, the local mGAGS scores were calculated by multiplying the location factor by the lesion type and number of lesions factors: local score = location factor × lesion type factor × number of lesions factor.

Acne QOL Questionnaire

Acne QOL was assessed during each visit to demonstrate if the treatment results affected participants’ socialization due to appearance.12 Participants were asked to complete the questionnaire, which consisted of 9 questions with 4 rating answers (0=not affected; 1=mildly affected; 2=moderately affected; 3=markedly affected). A total score of 9 or higher (high score) indicated that acne had a substantial negative impact on the participant, while a total score below 9 (low score) meant acne scarcely impacted social aspects and daily activities of the patient.

Safety Evaluation

The safety of the treatments was evaluated by clinical inspection and by comparing the results of the Wong-Baker FACES Pain Rating Scale (WBPRS)13 after treatment. The WBPRS is used worldwide among researchers to assess pain, particularly in children.14,15 It is composed of 6 faces expressing pain with word descriptions with a corresponding number range reflecting pain severity from 0 to 5 (0=no hurt; 1=hurts little bit; 2=hurts little more; 3=hurts even more; 4=hurts whole lot; 5=hurts worst).13

Statistical Analysis

All variables were presented as the median (range). A Wilcoxon signed rank test was used to compare clinical responses between the salicylic acid 30% peel and PBBL therapies. SPSS software version 12.0 was used for all statistical analysis. A 2-tailed P value of ≤.05 was considered statistically significant.

 

 

RESULTS

Study Population

Twelve participants (2 males, 10 females) aged 17 to 36 years (median age, 22 years; mean age [SD], 23.33 [1.65] years) with both comedonal and inflammatory acne were enrolled into this study for 6 split-face treatments of salicylic acid 30% peel and PBBL at 1-week intervals for 6 weeks, with 2 subsequent follow-up sessions at weeks 3 and 6 posttreatment. Of the 12 participants, 11 were white and 1 was Asian American, with Fitzpatrick skin types II to IV. Nine participants (75%) completed the study. One participant dropped out of the study after the fourth treatment due to a scheduling conflict, and the other 2 participants did not return for follow-up. No participants withdrew from the study because of adverse therapeutic events.

Efficacy Evaluation

Comparisons between the salicylic acid 30% peel and PBBL procedures for mGAGS at each visit are shown in Table 2. There was no significant difference in treatment efficacy between the salicylic acid 30% peel and PBBL therapies during the study’s treatment and follow-up events; however, both procedures contributed to a major improvement in acne symptoms by the third treatment session and through to the last follow-up session (P≤.05). Clinical photographs at baseline, at last treatment visit (week 6), and at last follow-up (week 12) are shown in Figures 1 and 2.

Figure 1. A 19-year-old woman with mild acne who was treated with salicylic acid 30% peel on the right side of the face at baseline (A), week 6 (B), and week 12 (C).

Figure 2. A 19-year-old woman with mild acne who was treated with pneumatic broadband light on the left side of the face at baseline (A), week 6 (B), and week 12 (C).

The results of the acne QOL questionnaire are shown in Table 2. Lower scores reflect a higher QOL. Median QOL scores at each visit ranged from 0.5 to 4.5. There was no significant difference found between the peel agent or PBBL based on the baseline QOL and subsequent visit assessments; however, the differences between the 2 treatments were significant at weeks 3 (P=.05) and 5 (P=.03) of treatment as well as at the last follow-up visit (P=.05).

According to the QOL scores, by the third treatment session participants were more satisfied with their improved acne condition from the PBBL procedure than the salicylic acid 30% peel as demonstrated by a positive range of the QOL assessments between PBBL and salicylic acid 30% peel (as shown in the difference in QOL in Table 2: week 3, 0–6; week 4, 0–3; week 5, 0–7). On the other hand, participants saw more improvement from the salicylic acid 30% peel than from PBBL by the last follow-up evaluation, as the differences in QOL scores between the 2 treatments resulted in a negative range (5–0).

Safety

Pain assessment by the WBPRS at every visit showed a low pain rating associated with both salicylic acid 30% peel (range, 0–0.5) and PBBL (range, 1.0–1.5) treatments. The median pain score of the salicylic acid 30% peel appeared higher compared to the PBBL treatment, yet a significant difference between both treatments was seen only at weeks 1, 3, and 6 of treatment (P≤.05).

There were no unexpected therapeutic reactions reported in our study, and no participants withdrew from the study due to adverse events. Most participants experienced only mild adverse reactions, including redness, stinging, and a burning sensation on the salicylic acid 30% peel side, which were transient and disappeared in minutes; only redness occurred on the PBBL-treated side.

Comment

Facial acne treatment is challenging, as prolonged and/or severe acne contributes to scarring, declining self-confidence, and undesirable financial consequences. Even though salicylic acid peel is a commonly used acne treatment choice, the PBBL methodology was approved by the US Food and Drug Administration6 and has become an alternative procedure for acne treatment.

The pharmacological effects of salicylic acid are related to its corneocyte desquamation and exfoliative actions, thereby reducing corneocyte cohesion and unclogging follicular pores.16 Salicylic acid has been demonstrated to ameliorate inflammatory acne by its effects on the arachidonic acid cascade.2,4,17 In our study, salicylic acid 30% peel met participants’ satisfaction in acne improvement similar to a study showing a 50% improvement in acne scores after just 2 treatments.18 Our data support and corroborate that salicylic acid 30% peel renders an improvement in acne sequelae reported in several other studies.2,17,18

Pneumatic broadband light has been known to treat acne by the mechanism of pneumatic suction combined with photodynamic therapy using broadband-pulsed light (400–1200 nm).6-8 By applying the pneumatic device, a vacuum is created on the skin to remove sebum contents from follicles, whereas broadband light is emitted simultaneously to destroy bacteria and decrease the inflammatory process.7 During the vacuum process, the skin is stretched to reduce pain and avoid competitive chromophores (eg, hemoglobin), while the broadband light is administered.7 Broadband light encompasses 2 main light spectrums: blue light (415 nm) activates coproporphyrin III, which induces reactive free radicals and singlet oxygen species and has been reported to be the cause of bacterial cell death,19 and red light (633 nm), which renders an increase of fibroblast growth factors to work against the inflammatory processes.20 There are numerous studies showing a reduction of acne lesions after photopneumatic therapy with minimal side effects.6-8

In our study, we compared the efficacy of salicylic acid 30% peel with PBBL in the treatment of acne. Both treatments showed significant reduction of mGAGS compared to baseline starting from week 3 and lasting until week 12. Remarkably, although there were some participants who reported acne recurrence after completing all treatments at week 6, which could have happened when the treatments were ended, the final acne score at week 12 was still significantly lower than baseline. It is clear that the participants continued their acne improvement up to the 6-week follow-up period without any topical or oral medication. We do not propose that either salicylic acid peel or PBBL treatment is a solitary option but speculate that the combination of both treatments may initiate a faster resolution in the disappearance of acne.

Although there was no statistically significant difference in efficacy between salicylic acid 30% peel and PBBL procedures at each visit, QOL assessments related to treatment satisfaction did yield significant differences between baseline and the end of treatment. We noticed that participants had more positive attitudes toward the PBBL side at week 3 and week 5 but only mild satisfaction at week 4, as the differences in QOL scores between both treatments showed positive ranging values. This finding is most likely related to the immediate reduction of acne pustules by the PBBL vacuum lysis of these lesions. The differences in the QOL scores between both treatments at week 12 (the last follow-up evaluation) provided opposite findings, which meant patients had nearly even improvement in both PBBL method and salicylic acid 30% peel. Therefore, according to QOL data, acne disappeared quickly with the application of PBBL therapy but reappeared on the PBBL-treated side by the follow-up evaluations, though the acne score between both sides showed no statistically significant difference.

We reason that the PBBL therapy works better than salicylic acid 30% peel because the pneumatic system may help to unclog the pores through mechanical debridement via suctioning versus desquamation from salicylic acid 30% peel. Nonetheless, salicylic acid 30% peel sustained improvement when compared to PBBL through the follow-up periods. Both salicylic acid 30% peel and PBBL treatments are well tolerated and may initiate a faster resolution in the improvement of acne when incorporated with a medical program.

Because of the recurrence of acne after treatments were stopped, additional medical therapies are advised to be used along with this study’s clinical treatments to help mitigate the acne symptoms. These treatments should be considered in patients concerned about antibiotic resistance or those who cannot take oral antibiotics or retinoids. Salicylic acid peel is more accessible and affordable than PBBL, whereas PBBL is slightly more tolerable and less irritating than salicylic acid peel. Nevertheless, the cost of investment in PBBL is quite high—as much as $70,000—and does not include disposable, single-use tips, which cost $30 each. The machine is easy to set up, weighs about 40 lb, and requires little space to store. The average cost per visit of PBBL treatment in office is $150.00 and $75.00 for salicylic acid peel (unpublished data, Hospital of the University of Pennsylvania, 2010). Most patients may select salicylic acid peel over PBBL due to the cost and convenience of the treatment. Neither procedure should be considered as a solitary treatment option but rather as adjunctive procedures combined with oral and/or topical acne medications. After this study’s treatments were stopped and without other medications to maintain treatment effectiveness, the lesions reappeared, trending back toward baseline.

 

 

Conclusion

Both salicylic acid 30% peel and PBBL procedures are effective, safe, and well tolerated in treating acne. Although there was no significant difference in the efficacy between both treatments in this study, the small sample size and short follow-up intervals warrant further studies to support the observed outstanding outcomes and should be considered in combination with other medical treatment options. These procedures may be beneficial in holding the patient compliant until their medical therapies have an opportunity to work.

Acknowledgment

The authors would like to thank Joyce Okawa, RN (Philadelphia, Pennsylvania), for her assistance in the submission to the institutional review board of the University of Pennsylvania.

References
  1. Rapp DA, Brenes GA, Feldman SR, et al. Anger and acne: implications for quality of life, patient satisfaction and clinical care. Br J Dermatol. 2004;151:183-189.
  2. Zakopoulou N, Kontochristopoulos G. Superficial chemical peels. J Cosmet Dermatol. 2006;5:246-253.
  3. Berson DS, Cohen JL, Rendon MI, et al. Clinical role and application of superficial chemical peels in today’s practice. J Drugs Dermatol. 2009;8:803-811.
  4. Shalita AR. Treatment of mild and moderate acne vulgaris with salicylic acid in an alcohol-detergent vehicle. Cutis. 1981;28:556-558, 561.
  5. Sakamoto FH, Lopes JD, Anderson RR. Photodynamic therapy for acne vulgaris: a critical review from basics to clinical practice: part I. acne vulgaris: when and why consider photodynamic therapy? J Am Acad Dermatol. 2010;63:183-193; quiz 93-94.
  6. Gold MH, Biron J. Efficacy of a novel combination of pneumatic energy and broadband light for the treatment of acne. J Drugs Dermatol. 2008;7:639-642.
  7. Shamban AT, Enokibori M, Narurkar V, et al. Photopneumatic technology for the treatment of acne vulgaris. J Drugs Dermatol. 2008;7:139-145.
  8. Wanitphakdeedecha R, Tanzi EL, Alster TS. Photopneumatic therapy for the treatment of acne. J Drugs Dermatol. 2009;8:239-241.
  9. Doshi A, Zaheer A, Stiller MJ. A comparison of current acne grading systems and proposal of a novel system. Int J Dermatol. 1997;36:416-418.
  10. Weiss JW, Shavin J, Davis M. Preliminary results of a nonrandomized, multicenter, open-label study of patient satisfaction after treatment with combination benzoyl peroxide/clindamycin topical gel for mild to moderate acne. Clin Ther. 2002;24:1706-1717.
  11. Demircay Z, Kus S, Sur H. Predictive factors for acne flare during isotretinoin treatment. Eur J Dermatol. 2008;18:452-456.
  12. Gupta MA, Johnson AM, Gupta AK. The development of an Acne Quality of Life scale: reliability, validity, and relation to subjective acne severity in mild to moderate acne vulgaris. Acta Derm Venereol. 1998;78:451-456.
  13. Wong DL, Baker CM. Pain in children: comparison of assessment scales. Pediatr Nurs. 1988;14:9-17.
  14. Wong DL, Hockenberry-Eaton M, Wilson D, et al. Wong’s Essentials of Pediatric Nursing. 6th ed. St. Louis, MO: Mosby; 2001:1301.
  15. Zempsky WT, Robbins B, McKay K. Reduction of topical anesthetic onset time using ultrasound: a randomized controlled trial prior to venipuncture in young children. Pain Med. 2008;9:795-802.
  16. Imayama S, Ueda S, Isoda M. Histologic changes in the skin of hairless mice following peeling with salicylic acid. Arch Dermatol. 2000;136:1390-1395.
  17. Lee H, Kim I. Salicylic acid peels for the treatment of acne vulgaris in Asian patients. Dermatol Surg. 2003;29:1196-1199.
  18. Kessler E, Flanagan K, Chia C, et al. Comparison of alpha- and beta-hydroxy acid chemical peels in the treatment of mild to moderately severe facial acne vulgaris. Dermatol Surg. 2008;34:45-50.
  19. Omi T, Munavalli GS, Kawana S, et al. Ultrastructural evidencefor thermal injury to pilosebaceous units during the treatment of acne using photopneumatic (PPX) therapy. J Cosmet Laser Ther. 2008;10:7-11.
  20. Papageorgiou P, Katsambas A, Chu A. Phototherapy with blue (415 nm) and red (660 nm) light in the treatment of acne vulgaris. Br J Dermatol. 2000;142:973-978.
Article PDF
Author and Disclosure Information

Drs. Thuangtong and Rattanaumpawan are from the Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand. Dr. Thuangtong is from the Department of Dermatology, and Dr. Rattanaumpawan is from the Department of Medicine. Dr. Tangjaturonrusamee is from the Institute of Dermatology, Department of Medical Services, Ministry of Public Health, Bangkok. Dr. Ditre is from the Department of Dermatology, Perelman School of Medicine at University of Pennsylvania, Philadelphia, and Penn Medicine Radnor, Pennsylvania.

The authors report no conflict of interest.

Correspondence: Chérie M. Ditre, MD, Department of Dermatology, Perelman School of Medicine, University of Pennsylvania, 250 King of Prussia Rd, Radnor, PA 19087 ([email protected]).

Issue
Cutis - 100(1)
Publications
Topics
Page Number
43-48
Sections
Author and Disclosure Information

Drs. Thuangtong and Rattanaumpawan are from the Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand. Dr. Thuangtong is from the Department of Dermatology, and Dr. Rattanaumpawan is from the Department of Medicine. Dr. Tangjaturonrusamee is from the Institute of Dermatology, Department of Medical Services, Ministry of Public Health, Bangkok. Dr. Ditre is from the Department of Dermatology, Perelman School of Medicine at University of Pennsylvania, Philadelphia, and Penn Medicine Radnor, Pennsylvania.

The authors report no conflict of interest.

Correspondence: Chérie M. Ditre, MD, Department of Dermatology, Perelman School of Medicine, University of Pennsylvania, 250 King of Prussia Rd, Radnor, PA 19087 ([email protected]).

Author and Disclosure Information

Drs. Thuangtong and Rattanaumpawan are from the Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand. Dr. Thuangtong is from the Department of Dermatology, and Dr. Rattanaumpawan is from the Department of Medicine. Dr. Tangjaturonrusamee is from the Institute of Dermatology, Department of Medical Services, Ministry of Public Health, Bangkok. Dr. Ditre is from the Department of Dermatology, Perelman School of Medicine at University of Pennsylvania, Philadelphia, and Penn Medicine Radnor, Pennsylvania.

The authors report no conflict of interest.

Correspondence: Chérie M. Ditre, MD, Department of Dermatology, Perelman School of Medicine, University of Pennsylvania, 250 King of Prussia Rd, Radnor, PA 19087 ([email protected]).

Article PDF
Article PDF
Related Articles

Facial acne vulgaris is a common skin disease among teenagers and adolescents that may negatively affect self-esteem, perceived facial attractiveness, and social participation.1 Treatments for acne often are multimodal and require the utmost adherence. For these reasons, acne treatments have been challenging to clinicians and patients alike, as patient compliance in maintaining the use of prescribed topical and oral medications remains essential to attain improvement in quality of life (QOL).

Salicylic acid is a popular medicament for acne treatment that frequently is used as monotherapy or as an adjuvant for other acne treatments, especially in patients with oily skin.2 Salicylic acid has a keratolytic effect, causing corneocyte discohesion in clogged pores or congested follicles,2 and it is effective in treating both inflammatory and noninflammatory acne.3,4

Light therapy, particularly with visible light, has been demonstrated to improve acne outcomes.5 Pneumatic broadband light (PBBL) is a therapeutic light treatment in the broadband range (400–1200 nm) that is combined with vacuum suction, which creates a mechanical lysis of thin-walled pustules and dislodges pore impaction. Additionally, the blue light portion of the PBBL spectrum targets endogenous porphyrins in Propionibacterium acnes, resulting in bacterial destruction.6-8

The purpose of this study was to compare the efficacy, tolerability, and safety of salicylic acid 30% peel versus PBBL in the treatment of mild to moderately severe facial acne vulgaris.

METHODS

Study Design

This single-blind, randomized, split-face pilot study was approved by the institutional review board of the University of Pennsylvania (Philadelphia, Pennsylvania). All patients provided informed consent before entering the study. The single-blind evaluation was performed by one dermatologist (C.T.) who examined the participants on every visit prior to PBBL treatment.

Before the study started, participants were randomized for which side of the face was to be treated with PBBL using a number assigned to each participant. Participants received both treatments—salicylic acid 30% peel on one side of the face and PBBL treatment on the other side of the face—once weekly for a total of 6 treatments. They were then asked to return for 2 follow-up evaluations at weeks 3 and 6 following the last treatment session and were instructed not to use any topical or oral acne medications during these follow-up periods.

Inclusion and Exclusion Criteria

Patients aged 18 years and older of any race and sex with noninflammatory papules, some inflammatory papules, and no more than 1 nodule (considered as mild to moderately severe facial acne) were included in the study. Participants had not been on any topical acne medications for at least 1 month and/or oral retinoids for at least 1 year prior to the study period. All women completed urine pregnancy tests prior to the study and were advised to utilize birth control during the study period.

Study Treatments

Salicylic Acid 30% Peel

The participant’s face was cleansed thoroughly before application of salicylic acid 30% (1.5 g/2.5 mL) to half of the face and left on for 5 minutes before being carefully rinsed off by spraying with spring water. Prior to initiating PBBL therapy, the peeled side of the participant’s face was covered with a towel.

Pneumatic Broadband Light

On the other side of the face, PBBL was performed to deliver broadband light within the spectrum range of 400 to 1200 nm at a setting approximately equivalent to a fluence of 4 to 6 J/cm2 and a vacuum setting approximately equivalent to a negative pressure of 3 lb/in2. The power setting was increased on each subsequent visit depending on each participant’s tolerability.

Participants were required to apply a moisturizer and sunscreen to the face and avoid excessive sun exposure between study visits.

Efficacy Evaluation

A comparison of the efficacy of the treatments was determined by clinical evaluation and examining the results of the outcome measurements with the modified Global Acne Grading Score (mGAGS) and Acne QOL Scale during each treatment visit. Facial photographs were taken at each visit.

Modified Global Acne Grading Score

The mGAGS is a modification of the Global Acne Grading Scale (GAGS) that has been used to evaluate acne severity in many studies.9-11 The GAGS considers 6 locations on the face with a grading factor for each location. The local score is obtained by multiplying the factor rated by location with the factor of clinical assessment: local score = factor rated by location × factor rated by clinical assessment. The total score is the sum of the individual local scores (Table 1).

Although the original GAGS incorporated the type and location of the lesions in its calculation, we felt that the number of lesions also was important to add to our grading score. Therefore, we modified the GAGS by adding a factor rated by the number of lesions to improve the accuracy of the test. Accordingly, the local mGAGS scores were calculated by multiplying the location factor by the lesion type and number of lesions factors: local score = location factor × lesion type factor × number of lesions factor.

Acne QOL Questionnaire

Acne QOL was assessed during each visit to demonstrate if the treatment results affected participants’ socialization due to appearance.12 Participants were asked to complete the questionnaire, which consisted of 9 questions with 4 rating answers (0=not affected; 1=mildly affected; 2=moderately affected; 3=markedly affected). A total score of 9 or higher (high score) indicated that acne had a substantial negative impact on the participant, while a total score below 9 (low score) meant acne scarcely impacted social aspects and daily activities of the patient.

Safety Evaluation

The safety of the treatments was evaluated by clinical inspection and by comparing the results of the Wong-Baker FACES Pain Rating Scale (WBPRS)13 after treatment. The WBPRS is used worldwide among researchers to assess pain, particularly in children.14,15 It is composed of 6 faces expressing pain with word descriptions with a corresponding number range reflecting pain severity from 0 to 5 (0=no hurt; 1=hurts little bit; 2=hurts little more; 3=hurts even more; 4=hurts whole lot; 5=hurts worst).13

Statistical Analysis

All variables were presented as the median (range). A Wilcoxon signed rank test was used to compare clinical responses between the salicylic acid 30% peel and PBBL therapies. SPSS software version 12.0 was used for all statistical analysis. A 2-tailed P value of ≤.05 was considered statistically significant.

 

 

RESULTS

Study Population

Twelve participants (2 males, 10 females) aged 17 to 36 years (median age, 22 years; mean age [SD], 23.33 [1.65] years) with both comedonal and inflammatory acne were enrolled into this study for 6 split-face treatments of salicylic acid 30% peel and PBBL at 1-week intervals for 6 weeks, with 2 subsequent follow-up sessions at weeks 3 and 6 posttreatment. Of the 12 participants, 11 were white and 1 was Asian American, with Fitzpatrick skin types II to IV. Nine participants (75%) completed the study. One participant dropped out of the study after the fourth treatment due to a scheduling conflict, and the other 2 participants did not return for follow-up. No participants withdrew from the study because of adverse therapeutic events.

Efficacy Evaluation

Comparisons between the salicylic acid 30% peel and PBBL procedures for mGAGS at each visit are shown in Table 2. There was no significant difference in treatment efficacy between the salicylic acid 30% peel and PBBL therapies during the study’s treatment and follow-up events; however, both procedures contributed to a major improvement in acne symptoms by the third treatment session and through to the last follow-up session (P≤.05). Clinical photographs at baseline, at last treatment visit (week 6), and at last follow-up (week 12) are shown in Figures 1 and 2.

Figure 1. A 19-year-old woman with mild acne who was treated with salicylic acid 30% peel on the right side of the face at baseline (A), week 6 (B), and week 12 (C).

Figure 2. A 19-year-old woman with mild acne who was treated with pneumatic broadband light on the left side of the face at baseline (A), week 6 (B), and week 12 (C).

The results of the acne QOL questionnaire are shown in Table 2. Lower scores reflect a higher QOL. Median QOL scores at each visit ranged from 0.5 to 4.5. There was no significant difference found between the peel agent or PBBL based on the baseline QOL and subsequent visit assessments; however, the differences between the 2 treatments were significant at weeks 3 (P=.05) and 5 (P=.03) of treatment as well as at the last follow-up visit (P=.05).

According to the QOL scores, by the third treatment session participants were more satisfied with their improved acne condition from the PBBL procedure than the salicylic acid 30% peel as demonstrated by a positive range of the QOL assessments between PBBL and salicylic acid 30% peel (as shown in the difference in QOL in Table 2: week 3, 0–6; week 4, 0–3; week 5, 0–7). On the other hand, participants saw more improvement from the salicylic acid 30% peel than from PBBL by the last follow-up evaluation, as the differences in QOL scores between the 2 treatments resulted in a negative range (5–0).

Safety

Pain assessment by the WBPRS at every visit showed a low pain rating associated with both salicylic acid 30% peel (range, 0–0.5) and PBBL (range, 1.0–1.5) treatments. The median pain score of the salicylic acid 30% peel appeared higher compared to the PBBL treatment, yet a significant difference between both treatments was seen only at weeks 1, 3, and 6 of treatment (P≤.05).

There were no unexpected therapeutic reactions reported in our study, and no participants withdrew from the study due to adverse events. Most participants experienced only mild adverse reactions, including redness, stinging, and a burning sensation on the salicylic acid 30% peel side, which were transient and disappeared in minutes; only redness occurred on the PBBL-treated side.

Comment

Facial acne treatment is challenging, as prolonged and/or severe acne contributes to scarring, declining self-confidence, and undesirable financial consequences. Even though salicylic acid peel is a commonly used acne treatment choice, the PBBL methodology was approved by the US Food and Drug Administration6 and has become an alternative procedure for acne treatment.

The pharmacological effects of salicylic acid are related to its corneocyte desquamation and exfoliative actions, thereby reducing corneocyte cohesion and unclogging follicular pores.16 Salicylic acid has been demonstrated to ameliorate inflammatory acne by its effects on the arachidonic acid cascade.2,4,17 In our study, salicylic acid 30% peel met participants’ satisfaction in acne improvement similar to a study showing a 50% improvement in acne scores after just 2 treatments.18 Our data support and corroborate that salicylic acid 30% peel renders an improvement in acne sequelae reported in several other studies.2,17,18

Pneumatic broadband light has been known to treat acne by the mechanism of pneumatic suction combined with photodynamic therapy using broadband-pulsed light (400–1200 nm).6-8 By applying the pneumatic device, a vacuum is created on the skin to remove sebum contents from follicles, whereas broadband light is emitted simultaneously to destroy bacteria and decrease the inflammatory process.7 During the vacuum process, the skin is stretched to reduce pain and avoid competitive chromophores (eg, hemoglobin), while the broadband light is administered.7 Broadband light encompasses 2 main light spectrums: blue light (415 nm) activates coproporphyrin III, which induces reactive free radicals and singlet oxygen species and has been reported to be the cause of bacterial cell death,19 and red light (633 nm), which renders an increase of fibroblast growth factors to work against the inflammatory processes.20 There are numerous studies showing a reduction of acne lesions after photopneumatic therapy with minimal side effects.6-8

In our study, we compared the efficacy of salicylic acid 30% peel with PBBL in the treatment of acne. Both treatments showed significant reduction of mGAGS compared to baseline starting from week 3 and lasting until week 12. Remarkably, although there were some participants who reported acne recurrence after completing all treatments at week 6, which could have happened when the treatments were ended, the final acne score at week 12 was still significantly lower than baseline. It is clear that the participants continued their acne improvement up to the 6-week follow-up period without any topical or oral medication. We do not propose that either salicylic acid peel or PBBL treatment is a solitary option but speculate that the combination of both treatments may initiate a faster resolution in the disappearance of acne.

Although there was no statistically significant difference in efficacy between salicylic acid 30% peel and PBBL procedures at each visit, QOL assessments related to treatment satisfaction did yield significant differences between baseline and the end of treatment. We noticed that participants had more positive attitudes toward the PBBL side at week 3 and week 5 but only mild satisfaction at week 4, as the differences in QOL scores between both treatments showed positive ranging values. This finding is most likely related to the immediate reduction of acne pustules by the PBBL vacuum lysis of these lesions. The differences in the QOL scores between both treatments at week 12 (the last follow-up evaluation) provided opposite findings, which meant patients had nearly even improvement in both PBBL method and salicylic acid 30% peel. Therefore, according to QOL data, acne disappeared quickly with the application of PBBL therapy but reappeared on the PBBL-treated side by the follow-up evaluations, though the acne score between both sides showed no statistically significant difference.

We reason that the PBBL therapy works better than salicylic acid 30% peel because the pneumatic system may help to unclog the pores through mechanical debridement via suctioning versus desquamation from salicylic acid 30% peel. Nonetheless, salicylic acid 30% peel sustained improvement when compared to PBBL through the follow-up periods. Both salicylic acid 30% peel and PBBL treatments are well tolerated and may initiate a faster resolution in the improvement of acne when incorporated with a medical program.

Because of the recurrence of acne after treatments were stopped, additional medical therapies are advised to be used along with this study’s clinical treatments to help mitigate the acne symptoms. These treatments should be considered in patients concerned about antibiotic resistance or those who cannot take oral antibiotics or retinoids. Salicylic acid peel is more accessible and affordable than PBBL, whereas PBBL is slightly more tolerable and less irritating than salicylic acid peel. Nevertheless, the cost of investment in PBBL is quite high—as much as $70,000—and does not include disposable, single-use tips, which cost $30 each. The machine is easy to set up, weighs about 40 lb, and requires little space to store. The average cost per visit of PBBL treatment in office is $150.00 and $75.00 for salicylic acid peel (unpublished data, Hospital of the University of Pennsylvania, 2010). Most patients may select salicylic acid peel over PBBL due to the cost and convenience of the treatment. Neither procedure should be considered as a solitary treatment option but rather as adjunctive procedures combined with oral and/or topical acne medications. After this study’s treatments were stopped and without other medications to maintain treatment effectiveness, the lesions reappeared, trending back toward baseline.

 

 

Conclusion

Both salicylic acid 30% peel and PBBL procedures are effective, safe, and well tolerated in treating acne. Although there was no significant difference in the efficacy between both treatments in this study, the small sample size and short follow-up intervals warrant further studies to support the observed outstanding outcomes and should be considered in combination with other medical treatment options. These procedures may be beneficial in holding the patient compliant until their medical therapies have an opportunity to work.

Acknowledgment

The authors would like to thank Joyce Okawa, RN (Philadelphia, Pennsylvania), for her assistance in the submission to the institutional review board of the University of Pennsylvania.

Facial acne vulgaris is a common skin disease among teenagers and adolescents that may negatively affect self-esteem, perceived facial attractiveness, and social participation.1 Treatments for acne often are multimodal and require the utmost adherence. For these reasons, acne treatments have been challenging to clinicians and patients alike, as patient compliance in maintaining the use of prescribed topical and oral medications remains essential to attain improvement in quality of life (QOL).

Salicylic acid is a popular medicament for acne treatment that frequently is used as monotherapy or as an adjuvant for other acne treatments, especially in patients with oily skin.2 Salicylic acid has a keratolytic effect, causing corneocyte discohesion in clogged pores or congested follicles,2 and it is effective in treating both inflammatory and noninflammatory acne.3,4

Light therapy, particularly with visible light, has been demonstrated to improve acne outcomes.5 Pneumatic broadband light (PBBL) is a therapeutic light treatment in the broadband range (400–1200 nm) that is combined with vacuum suction, which creates a mechanical lysis of thin-walled pustules and dislodges pore impaction. Additionally, the blue light portion of the PBBL spectrum targets endogenous porphyrins in Propionibacterium acnes, resulting in bacterial destruction.6-8

The purpose of this study was to compare the efficacy, tolerability, and safety of salicylic acid 30% peel versus PBBL in the treatment of mild to moderately severe facial acne vulgaris.

METHODS

Study Design

This single-blind, randomized, split-face pilot study was approved by the institutional review board of the University of Pennsylvania (Philadelphia, Pennsylvania). All patients provided informed consent before entering the study. The single-blind evaluation was performed by one dermatologist (C.T.) who examined the participants on every visit prior to PBBL treatment.

Before the study started, participants were randomized for which side of the face was to be treated with PBBL using a number assigned to each participant. Participants received both treatments—salicylic acid 30% peel on one side of the face and PBBL treatment on the other side of the face—once weekly for a total of 6 treatments. They were then asked to return for 2 follow-up evaluations at weeks 3 and 6 following the last treatment session and were instructed not to use any topical or oral acne medications during these follow-up periods.

Inclusion and Exclusion Criteria

Patients aged 18 years and older of any race and sex with noninflammatory papules, some inflammatory papules, and no more than 1 nodule (considered as mild to moderately severe facial acne) were included in the study. Participants had not been on any topical acne medications for at least 1 month and/or oral retinoids for at least 1 year prior to the study period. All women completed urine pregnancy tests prior to the study and were advised to utilize birth control during the study period.

Study Treatments

Salicylic Acid 30% Peel

The participant’s face was cleansed thoroughly before application of salicylic acid 30% (1.5 g/2.5 mL) to half of the face and left on for 5 minutes before being carefully rinsed off by spraying with spring water. Prior to initiating PBBL therapy, the peeled side of the participant’s face was covered with a towel.

Pneumatic Broadband Light

On the other side of the face, PBBL was performed to deliver broadband light within the spectrum range of 400 to 1200 nm at a setting approximately equivalent to a fluence of 4 to 6 J/cm2 and a vacuum setting approximately equivalent to a negative pressure of 3 lb/in2. The power setting was increased on each subsequent visit depending on each participant’s tolerability.

Participants were required to apply a moisturizer and sunscreen to the face and avoid excessive sun exposure between study visits.

Efficacy Evaluation

A comparison of the efficacy of the treatments was determined by clinical evaluation and examining the results of the outcome measurements with the modified Global Acne Grading Score (mGAGS) and Acne QOL Scale during each treatment visit. Facial photographs were taken at each visit.

Modified Global Acne Grading Score

The mGAGS is a modification of the Global Acne Grading Scale (GAGS) that has been used to evaluate acne severity in many studies.9-11 The GAGS considers 6 locations on the face with a grading factor for each location. The local score is obtained by multiplying the factor rated by location with the factor of clinical assessment: local score = factor rated by location × factor rated by clinical assessment. The total score is the sum of the individual local scores (Table 1).

Although the original GAGS incorporated the type and location of the lesions in its calculation, we felt that the number of lesions also was important to add to our grading score. Therefore, we modified the GAGS by adding a factor rated by the number of lesions to improve the accuracy of the test. Accordingly, the local mGAGS scores were calculated by multiplying the location factor by the lesion type and number of lesions factors: local score = location factor × lesion type factor × number of lesions factor.

Acne QOL Questionnaire

Acne QOL was assessed during each visit to demonstrate if the treatment results affected participants’ socialization due to appearance.12 Participants were asked to complete the questionnaire, which consisted of 9 questions with 4 rating answers (0=not affected; 1=mildly affected; 2=moderately affected; 3=markedly affected). A total score of 9 or higher (high score) indicated that acne had a substantial negative impact on the participant, while a total score below 9 (low score) meant acne scarcely impacted social aspects and daily activities of the patient.

Safety Evaluation

The safety of the treatments was evaluated by clinical inspection and by comparing the results of the Wong-Baker FACES Pain Rating Scale (WBPRS)13 after treatment. The WBPRS is used worldwide among researchers to assess pain, particularly in children.14,15 It is composed of 6 faces expressing pain with word descriptions with a corresponding number range reflecting pain severity from 0 to 5 (0=no hurt; 1=hurts little bit; 2=hurts little more; 3=hurts even more; 4=hurts whole lot; 5=hurts worst).13

Statistical Analysis

All variables were presented as the median (range). A Wilcoxon signed rank test was used to compare clinical responses between the salicylic acid 30% peel and PBBL therapies. SPSS software version 12.0 was used for all statistical analysis. A 2-tailed P value of ≤.05 was considered statistically significant.

 

 

RESULTS

Study Population

Twelve participants (2 males, 10 females) aged 17 to 36 years (median age, 22 years; mean age [SD], 23.33 [1.65] years) with both comedonal and inflammatory acne were enrolled into this study for 6 split-face treatments of salicylic acid 30% peel and PBBL at 1-week intervals for 6 weeks, with 2 subsequent follow-up sessions at weeks 3 and 6 posttreatment. Of the 12 participants, 11 were white and 1 was Asian American, with Fitzpatrick skin types II to IV. Nine participants (75%) completed the study. One participant dropped out of the study after the fourth treatment due to a scheduling conflict, and the other 2 participants did not return for follow-up. No participants withdrew from the study because of adverse therapeutic events.

Efficacy Evaluation

Comparisons between the salicylic acid 30% peel and PBBL procedures for mGAGS at each visit are shown in Table 2. There was no significant difference in treatment efficacy between the salicylic acid 30% peel and PBBL therapies during the study’s treatment and follow-up events; however, both procedures contributed to a major improvement in acne symptoms by the third treatment session and through to the last follow-up session (P≤.05). Clinical photographs at baseline, at last treatment visit (week 6), and at last follow-up (week 12) are shown in Figures 1 and 2.

Figure 1. A 19-year-old woman with mild acne who was treated with salicylic acid 30% peel on the right side of the face at baseline (A), week 6 (B), and week 12 (C).

Figure 2. A 19-year-old woman with mild acne who was treated with pneumatic broadband light on the left side of the face at baseline (A), week 6 (B), and week 12 (C).

The results of the acne QOL questionnaire are shown in Table 2. Lower scores reflect a higher QOL. Median QOL scores at each visit ranged from 0.5 to 4.5. There was no significant difference found between the peel agent or PBBL based on the baseline QOL and subsequent visit assessments; however, the differences between the 2 treatments were significant at weeks 3 (P=.05) and 5 (P=.03) of treatment as well as at the last follow-up visit (P=.05).

According to the QOL scores, by the third treatment session participants were more satisfied with their improved acne condition from the PBBL procedure than the salicylic acid 30% peel as demonstrated by a positive range of the QOL assessments between PBBL and salicylic acid 30% peel (as shown in the difference in QOL in Table 2: week 3, 0–6; week 4, 0–3; week 5, 0–7). On the other hand, participants saw more improvement from the salicylic acid 30% peel than from PBBL by the last follow-up evaluation, as the differences in QOL scores between the 2 treatments resulted in a negative range (5–0).

Safety

Pain assessment by the WBPRS at every visit showed a low pain rating associated with both salicylic acid 30% peel (range, 0–0.5) and PBBL (range, 1.0–1.5) treatments. The median pain score of the salicylic acid 30% peel appeared higher compared to the PBBL treatment, yet a significant difference between both treatments was seen only at weeks 1, 3, and 6 of treatment (P≤.05).

There were no unexpected therapeutic reactions reported in our study, and no participants withdrew from the study due to adverse events. Most participants experienced only mild adverse reactions, including redness, stinging, and a burning sensation on the salicylic acid 30% peel side, which were transient and disappeared in minutes; only redness occurred on the PBBL-treated side.

Comment

Facial acne treatment is challenging, as prolonged and/or severe acne contributes to scarring, declining self-confidence, and undesirable financial consequences. Even though salicylic acid peel is a commonly used acne treatment choice, the PBBL methodology was approved by the US Food and Drug Administration6 and has become an alternative procedure for acne treatment.

The pharmacological effects of salicylic acid are related to its corneocyte desquamation and exfoliative actions, thereby reducing corneocyte cohesion and unclogging follicular pores.16 Salicylic acid has been demonstrated to ameliorate inflammatory acne by its effects on the arachidonic acid cascade.2,4,17 In our study, salicylic acid 30% peel met participants’ satisfaction in acne improvement similar to a study showing a 50% improvement in acne scores after just 2 treatments.18 Our data support and corroborate that salicylic acid 30% peel renders an improvement in acne sequelae reported in several other studies.2,17,18

Pneumatic broadband light has been known to treat acne by the mechanism of pneumatic suction combined with photodynamic therapy using broadband-pulsed light (400–1200 nm).6-8 By applying the pneumatic device, a vacuum is created on the skin to remove sebum contents from follicles, whereas broadband light is emitted simultaneously to destroy bacteria and decrease the inflammatory process.7 During the vacuum process, the skin is stretched to reduce pain and avoid competitive chromophores (eg, hemoglobin), while the broadband light is administered.7 Broadband light encompasses 2 main light spectrums: blue light (415 nm) activates coproporphyrin III, which induces reactive free radicals and singlet oxygen species and has been reported to be the cause of bacterial cell death,19 and red light (633 nm), which renders an increase of fibroblast growth factors to work against the inflammatory processes.20 There are numerous studies showing a reduction of acne lesions after photopneumatic therapy with minimal side effects.6-8

In our study, we compared the efficacy of salicylic acid 30% peel with PBBL in the treatment of acne. Both treatments showed significant reduction of mGAGS compared to baseline starting from week 3 and lasting until week 12. Remarkably, although there were some participants who reported acne recurrence after completing all treatments at week 6, which could have happened when the treatments were ended, the final acne score at week 12 was still significantly lower than baseline. It is clear that the participants continued their acne improvement up to the 6-week follow-up period without any topical or oral medication. We do not propose that either salicylic acid peel or PBBL treatment is a solitary option but speculate that the combination of both treatments may initiate a faster resolution in the disappearance of acne.

Although there was no statistically significant difference in efficacy between salicylic acid 30% peel and PBBL procedures at each visit, QOL assessments related to treatment satisfaction did yield significant differences between baseline and the end of treatment. We noticed that participants had more positive attitudes toward the PBBL side at week 3 and week 5 but only mild satisfaction at week 4, as the differences in QOL scores between both treatments showed positive ranging values. This finding is most likely related to the immediate reduction of acne pustules by the PBBL vacuum lysis of these lesions. The differences in the QOL scores between both treatments at week 12 (the last follow-up evaluation) provided opposite findings, which meant patients had nearly even improvement in both PBBL method and salicylic acid 30% peel. Therefore, according to QOL data, acne disappeared quickly with the application of PBBL therapy but reappeared on the PBBL-treated side by the follow-up evaluations, though the acne score between both sides showed no statistically significant difference.

We reason that the PBBL therapy works better than salicylic acid 30% peel because the pneumatic system may help to unclog the pores through mechanical debridement via suctioning versus desquamation from salicylic acid 30% peel. Nonetheless, salicylic acid 30% peel sustained improvement when compared to PBBL through the follow-up periods. Both salicylic acid 30% peel and PBBL treatments are well tolerated and may initiate a faster resolution in the improvement of acne when incorporated with a medical program.

Because of the recurrence of acne after treatments were stopped, additional medical therapies are advised to be used along with this study’s clinical treatments to help mitigate the acne symptoms. These treatments should be considered in patients concerned about antibiotic resistance or those who cannot take oral antibiotics or retinoids. Salicylic acid peel is more accessible and affordable than PBBL, whereas PBBL is slightly more tolerable and less irritating than salicylic acid peel. Nevertheless, the cost of investment in PBBL is quite high—as much as $70,000—and does not include disposable, single-use tips, which cost $30 each. The machine is easy to set up, weighs about 40 lb, and requires little space to store. The average cost per visit of PBBL treatment in office is $150.00 and $75.00 for salicylic acid peel (unpublished data, Hospital of the University of Pennsylvania, 2010). Most patients may select salicylic acid peel over PBBL due to the cost and convenience of the treatment. Neither procedure should be considered as a solitary treatment option but rather as adjunctive procedures combined with oral and/or topical acne medications. After this study’s treatments were stopped and without other medications to maintain treatment effectiveness, the lesions reappeared, trending back toward baseline.

 

 

Conclusion

Both salicylic acid 30% peel and PBBL procedures are effective, safe, and well tolerated in treating acne. Although there was no significant difference in the efficacy between both treatments in this study, the small sample size and short follow-up intervals warrant further studies to support the observed outstanding outcomes and should be considered in combination with other medical treatment options. These procedures may be beneficial in holding the patient compliant until their medical therapies have an opportunity to work.

Acknowledgment

The authors would like to thank Joyce Okawa, RN (Philadelphia, Pennsylvania), for her assistance in the submission to the institutional review board of the University of Pennsylvania.

References
  1. Rapp DA, Brenes GA, Feldman SR, et al. Anger and acne: implications for quality of life, patient satisfaction and clinical care. Br J Dermatol. 2004;151:183-189.
  2. Zakopoulou N, Kontochristopoulos G. Superficial chemical peels. J Cosmet Dermatol. 2006;5:246-253.
  3. Berson DS, Cohen JL, Rendon MI, et al. Clinical role and application of superficial chemical peels in today’s practice. J Drugs Dermatol. 2009;8:803-811.
  4. Shalita AR. Treatment of mild and moderate acne vulgaris with salicylic acid in an alcohol-detergent vehicle. Cutis. 1981;28:556-558, 561.
  5. Sakamoto FH, Lopes JD, Anderson RR. Photodynamic therapy for acne vulgaris: a critical review from basics to clinical practice: part I. acne vulgaris: when and why consider photodynamic therapy? J Am Acad Dermatol. 2010;63:183-193; quiz 93-94.
  6. Gold MH, Biron J. Efficacy of a novel combination of pneumatic energy and broadband light for the treatment of acne. J Drugs Dermatol. 2008;7:639-642.
  7. Shamban AT, Enokibori M, Narurkar V, et al. Photopneumatic technology for the treatment of acne vulgaris. J Drugs Dermatol. 2008;7:139-145.
  8. Wanitphakdeedecha R, Tanzi EL, Alster TS. Photopneumatic therapy for the treatment of acne. J Drugs Dermatol. 2009;8:239-241.
  9. Doshi A, Zaheer A, Stiller MJ. A comparison of current acne grading systems and proposal of a novel system. Int J Dermatol. 1997;36:416-418.
  10. Weiss JW, Shavin J, Davis M. Preliminary results of a nonrandomized, multicenter, open-label study of patient satisfaction after treatment with combination benzoyl peroxide/clindamycin topical gel for mild to moderate acne. Clin Ther. 2002;24:1706-1717.
  11. Demircay Z, Kus S, Sur H. Predictive factors for acne flare during isotretinoin treatment. Eur J Dermatol. 2008;18:452-456.
  12. Gupta MA, Johnson AM, Gupta AK. The development of an Acne Quality of Life scale: reliability, validity, and relation to subjective acne severity in mild to moderate acne vulgaris. Acta Derm Venereol. 1998;78:451-456.
  13. Wong DL, Baker CM. Pain in children: comparison of assessment scales. Pediatr Nurs. 1988;14:9-17.
  14. Wong DL, Hockenberry-Eaton M, Wilson D, et al. Wong’s Essentials of Pediatric Nursing. 6th ed. St. Louis, MO: Mosby; 2001:1301.
  15. Zempsky WT, Robbins B, McKay K. Reduction of topical anesthetic onset time using ultrasound: a randomized controlled trial prior to venipuncture in young children. Pain Med. 2008;9:795-802.
  16. Imayama S, Ueda S, Isoda M. Histologic changes in the skin of hairless mice following peeling with salicylic acid. Arch Dermatol. 2000;136:1390-1395.
  17. Lee H, Kim I. Salicylic acid peels for the treatment of acne vulgaris in Asian patients. Dermatol Surg. 2003;29:1196-1199.
  18. Kessler E, Flanagan K, Chia C, et al. Comparison of alpha- and beta-hydroxy acid chemical peels in the treatment of mild to moderately severe facial acne vulgaris. Dermatol Surg. 2008;34:45-50.
  19. Omi T, Munavalli GS, Kawana S, et al. Ultrastructural evidencefor thermal injury to pilosebaceous units during the treatment of acne using photopneumatic (PPX) therapy. J Cosmet Laser Ther. 2008;10:7-11.
  20. Papageorgiou P, Katsambas A, Chu A. Phototherapy with blue (415 nm) and red (660 nm) light in the treatment of acne vulgaris. Br J Dermatol. 2000;142:973-978.
References
  1. Rapp DA, Brenes GA, Feldman SR, et al. Anger and acne: implications for quality of life, patient satisfaction and clinical care. Br J Dermatol. 2004;151:183-189.
  2. Zakopoulou N, Kontochristopoulos G. Superficial chemical peels. J Cosmet Dermatol. 2006;5:246-253.
  3. Berson DS, Cohen JL, Rendon MI, et al. Clinical role and application of superficial chemical peels in today’s practice. J Drugs Dermatol. 2009;8:803-811.
  4. Shalita AR. Treatment of mild and moderate acne vulgaris with salicylic acid in an alcohol-detergent vehicle. Cutis. 1981;28:556-558, 561.
  5. Sakamoto FH, Lopes JD, Anderson RR. Photodynamic therapy for acne vulgaris: a critical review from basics to clinical practice: part I. acne vulgaris: when and why consider photodynamic therapy? J Am Acad Dermatol. 2010;63:183-193; quiz 93-94.
  6. Gold MH, Biron J. Efficacy of a novel combination of pneumatic energy and broadband light for the treatment of acne. J Drugs Dermatol. 2008;7:639-642.
  7. Shamban AT, Enokibori M, Narurkar V, et al. Photopneumatic technology for the treatment of acne vulgaris. J Drugs Dermatol. 2008;7:139-145.
  8. Wanitphakdeedecha R, Tanzi EL, Alster TS. Photopneumatic therapy for the treatment of acne. J Drugs Dermatol. 2009;8:239-241.
  9. Doshi A, Zaheer A, Stiller MJ. A comparison of current acne grading systems and proposal of a novel system. Int J Dermatol. 1997;36:416-418.
  10. Weiss JW, Shavin J, Davis M. Preliminary results of a nonrandomized, multicenter, open-label study of patient satisfaction after treatment with combination benzoyl peroxide/clindamycin topical gel for mild to moderate acne. Clin Ther. 2002;24:1706-1717.
  11. Demircay Z, Kus S, Sur H. Predictive factors for acne flare during isotretinoin treatment. Eur J Dermatol. 2008;18:452-456.
  12. Gupta MA, Johnson AM, Gupta AK. The development of an Acne Quality of Life scale: reliability, validity, and relation to subjective acne severity in mild to moderate acne vulgaris. Acta Derm Venereol. 1998;78:451-456.
  13. Wong DL, Baker CM. Pain in children: comparison of assessment scales. Pediatr Nurs. 1988;14:9-17.
  14. Wong DL, Hockenberry-Eaton M, Wilson D, et al. Wong’s Essentials of Pediatric Nursing. 6th ed. St. Louis, MO: Mosby; 2001:1301.
  15. Zempsky WT, Robbins B, McKay K. Reduction of topical anesthetic onset time using ultrasound: a randomized controlled trial prior to venipuncture in young children. Pain Med. 2008;9:795-802.
  16. Imayama S, Ueda S, Isoda M. Histologic changes in the skin of hairless mice following peeling with salicylic acid. Arch Dermatol. 2000;136:1390-1395.
  17. Lee H, Kim I. Salicylic acid peels for the treatment of acne vulgaris in Asian patients. Dermatol Surg. 2003;29:1196-1199.
  18. Kessler E, Flanagan K, Chia C, et al. Comparison of alpha- and beta-hydroxy acid chemical peels in the treatment of mild to moderately severe facial acne vulgaris. Dermatol Surg. 2008;34:45-50.
  19. Omi T, Munavalli GS, Kawana S, et al. Ultrastructural evidencefor thermal injury to pilosebaceous units during the treatment of acne using photopneumatic (PPX) therapy. J Cosmet Laser Ther. 2008;10:7-11.
  20. Papageorgiou P, Katsambas A, Chu A. Phototherapy with blue (415 nm) and red (660 nm) light in the treatment of acne vulgaris. Br J Dermatol. 2000;142:973-978.
Issue
Cutis - 100(1)
Issue
Cutis - 100(1)
Page Number
43-48
Page Number
43-48
Publications
Publications
Topics
Article Type
Display Headline
Comparison of Salicylic Acid 30% Peel and Pneumatic Broadband Light in the Treatment of Mild to Moderately Severe Facial Acne Vulgaris
Display Headline
Comparison of Salicylic Acid 30% Peel and Pneumatic Broadband Light in the Treatment of Mild to Moderately Severe Facial Acne Vulgaris
Sections
Inside the Article

Practice Points

  • Salicylic acid peel and pneumatic broadband light (PBBL) are good alternative options in treating acne in addition to regular oral and topical treatments.
  • Both salicylic acid peel and PBBL are effective, safe, and tolerable.
Disallow All Ads
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
PubMed ID
28873106
Disqus Comments
Default
Article PDF Media

Techniques and behaviors associated with exemplary inpatient general medicine teaching: an exploratory qualitative study

Article Type
Changed
Display Headline
Techniques and behaviors associated with exemplary inpatient general medicine teaching: an exploratory qualitative study

Clinician educators face numerous obstacles to their joint mission of facilitating learning while also ensuring high-quality and patient-centered care. Time constraints, including the institution of house officer duty hour limitations,1 shorter lengths of stay for hospitalized patients,2 and competing career responsibilities, combine to create a dynamic learning environment. Additionally, clinician educators must balance the autonomy of their learners with the safety of their patients. They must teach to multiple learning levels and work collaboratively with multiple disciplines to foster an effective team-based approach to patient care. Yet, many clinician educators have no formal training in pedagogical methods.3 Such challenges necessitate increased attention to the work of excellent clinician educators and their respective teaching approaches.

Many studies of clinical teaching rely primarily on survey data of attributes of good clinical teachers.3-7 While some studies have incorporated direct observations of teaching8,9 or interviews with clinician educators or learners,10,11 few have incorporated multiple perspectives from the current team and from former learners in order to provide a comprehensive picture of team-based learning.12

The goal of this study was to gain a thorough understanding, through multiple perspectives, of the techniques and behaviors used by exemplary educators within actual clinical environments. We studied attitudes, behaviors, and approaches of 12 such inpatient clinician educators.

METHODS

Study Design and Sampling

This was a multisite study using an exploratory qualitative approach to inquiry. This approach was used to study the techniques and behaviors of excellent attendings during inpatient general medicine rounds. A modified snowball sampling approach13 was used, meaning individuals known to one member of the research team (SS) were initially contacted and asked to identify clinician educators (also referred to as attendings) for potential inclusion in the study. In an effort to identify attendings from a broad range of medical schools, the “2015 U.S. News and World Report Top Medical Schools: Research” rankings14 were also reviewed, with priority given to the top 25, as these are widely used to represent the best US hospitals. In an attempt to invite attendings from diverse institutions, additional medical schools not in the top 25 as well as historically black medical schools were also included. Division chiefs and chairs of internal medicine and/or directors of internal medicine residency programs at these schools were contacted and asked for recommendations of attendings, both within and outside their institutions, who they considered to be great inpatient teachers. In addition, key experts who have won teaching awards or were known to be specialists in the field of medical education were asked to nominate one or two other outstanding attendings.

Table 1
 

 

By using this sampling method, 59 potential participants were identified. An internet search was conducted to obtain information about the potential participants and their institutions. Organizational characteristics such as geographic location, hospital size and affiliation, and patient population, as well as individual characteristics such as gender, medical education and training, and educational awards received were considered so that a diversity of organizations and backgrounds was represented. The list was narrowed down to 16 attendings who were contacted via e-mail and asked to participate. Interested participants were asked for a list of their current team members and 6 to 10 former learners to contact for interviews and focus groups. Former learners were included in an effort to better understand lasting effects on learners from their exemplary teaching attendings. A total of 12 attending physicians agreed to participate (Table 1). Literature on field methods has shown that 12 interviews are found to be adequate in accomplishing data saturation.15 Although 2 attendings were located at the same institution, we decided to include them given that both are recognized as master clinician educators and were each recommended by several individuals from various institutions. Hospitals were located throughout the US and included both university-affiliated hospitals and Veterans Affairs medical centers. Despite efforts to include physicians from historically black colleges and universities, only one attending was identified, and they declined the request to participate.

Data Collection

Observations. The one-day site visits were mainly conducted by two research team members, a physician (SS) and a medical anthropologist (MH), both of whom have extensive experience in qualitative methods. Teams were not uniform but were generally comprised of 1 attending, 1 senior medical resident, 1 to 2 interns, and approximately 2 medical students. Occasionally, a pharmacist, clinical assistant, or other health professional accompanied the team on rounds. Not infrequently, the bedside nurse would explicitly be included in the discussion regarding his or her specific patient. Each site visit began with observing attendings (N = 12) and current learners (N = 57) during rounds. Each research team member recorded their own observations via handwritten field notes, paying particular attention to group interactions, teaching approach, conversations occurring within and peripheral to the team, patient-team interactions, and the physical environment. By standing outside of the medical team circle and remaining silent during rounds, research team members remained unobtrusive to the discussion and process of rounds. Materials the attendings used during their teaching rounds were also documented and collected. Rounds generally lasted 2 to 3 hours. After each site visit, the research team met to compare and combine field notes.

Interviews and Focus Groups. The research team then conducted individual, semi-structured interviews with the attendings, focus groups with their current team (N = 46), and interviews or focus groups with their former learners (N = 26; Supplement 1). Eleven of the current team members observed during rounds were unable to participate in the focus groups due to clinical duties. Because the current learners who participated in the focus groups were also observed during rounds, the research team was able to ask them open-ended questions regarding teaching rounds and their roles as learners within this environment. Former learners who were still at the hospital participated in separate focus groups or interviews. Former learners who were no longer present at the hospital were contacted by telephone and individually interviewed by one research team member (MH). All interviews and focus groups were audio-recorded and transcribed.

This study was determined to be exempt by the University of Michigan Institutional Review Board. All participants were informed that their participation was completely voluntary and that they could terminate their involvement at any time.

Data Analysis

Data were analyzed using a thematic analysis approach.16 Thematic analysis entails reading through the data to identify patterns (and create codes) that relate to behaviors, experiences, meanings, and activities. Once patterns have been identified, they are grouped according to similarity into themes, which help to further explain the findings.17

After the first site visit was completed, the research team members that participated (SS and MH) met to develop initial ideas about meanings and possible patterns. All transcripts were read by one team member (MH) and, based on review of the data, codes were developed, defined, and documented in a codebook. This process was repeated after every site visit using the codebook to expand or combine codes and refine definitions as necessary. If a new code was added, the previously coded data were reviewed to apply the new code. NVivo® 10 software (QSR International; Melbourne, Australia) was used to manage the data.

Once all field notes and transcripts were coded (MH), the code reports, which list all data described within a specific code, were run to ensure consistency and identify relationships between codes. Once coding was verified, codes were grouped based on similarities and relationships into salient themes by 3 members of the research team (NH, MH, and SM). Themes, along with their supporting codes, were then further defined to understand how these attendings worked to facilitate excellent teaching in clinical settings.

Table 2
 

 

RESULTS

The coded interview data and field notes were categorized into broad, overlapping themes. Three of these major themes include (1) fostering positive relationships, (2) patient-centered teaching, and (3) collaboration and coaching. Table 2 lists each theme, salient behaviors, examples, and selected quotes that further elucidate its meaning.

Fostering Positive Relationships

Attending physicians took observable steps to develop positive relationships with their team members, which in turn created a safe learning environment. For instance, attendings used learners’ first names, demonstrated interest in their well-being, deployed humor, and generally displayed informal actions—uncrossed arms, “fist bump” when recognizing learners’ success, standing outside the circle of team members and leaning in to listen—during learner interactions. Attendings also made it a priority to get to know individuals on a personal level. As one current learner put it, “He asks about where we are from. He will try to find some kind of connection that he can establish with not only each of the team members but also with each of the patients.”

Additionally, attendings built positive relationships with their learners by responding thoughtfully to their input, even when learners’ evaluations of patients required modification. In turn, learners reported feeling safe to ask questions, admit uncertainty, and respectfully disagree with their attendings. As one attending reflected, “If I can get them into a place where they feel like the learning environment is someplace where they can make a mistake and know that that mistake does not necessarily mean that it’s going to cost them in their evaluation part, then I feel like that’s why it’s important.”

To build rapport and create a safe learning environment, attendings used a number of strategies to position themselves as learners alongside their team members. For instance, attendings indicated that they wanted their ideas questioned because they saw it as an opportunity to learn. Moreover, in conversations with learners, attendings demonstrated humility, admitting when they did not know something. One former learner noted, “There have been times when he has asked [a] question…nobody knows and then he admits that he doesn’t know either. So everybody goes and looks it up…The whole thing turns out to be a fun learning experience.”

Attendings demonstrated respect for their team members’ time by reading about patients before rounds, identifying learning opportunities during rounds, and integrating teaching points into the daily work of patient care. Teaching was not relegated exclusively to the conference room or confined to the traditional “chalk talk” before or after rounds but rather was assimilated into daily workflow. They appeared to be responsive to the needs of individual patients and the team, which allowed attendings to both directly oversee their patients’ care and overcome the challenges of multiple competing demands for time. The importance of this approach was made clear by one current learner who stated “…she does prepare before, especially you know on call days, she does prepare for the new patients before coming in to staff, which is really appreciated… it saves a lot of time on rounds.”

Attendings also included other health professionals in team discussions. Attendings used many of the same relationship-building techniques with these professionals as they did with learners and patients. They consistently asked these professionals to provide insight and direction in patients’ plans of care. A former learner commented, “He always asks the [nurse] what is her impression of the patient...he truly values the [nurse’s] opinion of the patient.” One attending reiterated this approach, stating “I don’t want them to think that anything I have to say is more valuable than our pharmacist or the [nurse].”

Patient-Centered Teaching

Attending physicians modeled numerous teaching techniques that focused learning around the patient. Attendings knew their patients well through review of the medical records, discussion with the patient, and personal examination. This preparation allowed attendings to focus on key teaching points in the context of the patient. One former learner noted, “He tended to bring up a variety of things that really fit well into the clinical scenario. So whether that is talking about what is the differential for a new symptom that just came up for this patient or kind of here is a new paper talking about this condition or maybe some other pearl of physical exam for a patient that has a certain physical condition.”

Attendings served as effective role models by being directly involved in examining and talking with patients as well as demonstrating excellent physical examination and communication techniques. One current learner articulated the importance of learning these skills by observing them done well: “I think he teaches by example and by doing, again, those little things: being attentive to the patients and being very careful during exams…I think those are things that you teach people by doing them, not by saying you need to do this better during the patient encounter.”

 

 

Collaboration and Coaching

Attending physicians used varied collaboration and coaching techniques to facilitate learning across the entire care team. During rounds, attendings utilized visual aids to reinforce key concepts and simplify complex topics. They also collaborated by using discussion rather than lecture to engage with team members. For instance, attendings used Socratic questioning, asking questions that lead learners through critical thinking and allow them to solve problems themselves, to guide learners’ decision-making. One former learner reported, “He never gives you the answer, and he always asks your opinion; ‘So what are your thoughts on this?’”

Coaching for success, rather than directing the various team members, was emphasized. Attendings did not wish to be seen as the “leaders” of the team. During rounds, one attending was noted to explain his role in ensuring that the team was building connections with others: “When we have a bad outcome, if it feels like your soul has been ripped out, then you’ve done something right. You’ve made that connection with the patient. My job, as your coach, was to build communication between all of us so we feel vested in each other and our patients.”

Attendings also fostered clinical reasoning skills in their learners by encouraging them to verbalize their thought processes aloud in order to clarify and check for understanding. Attendings also placed emphasis not simply on memorizing content but rather prioritization of the patient’s problems and thinking step by step through individual medical problems. One current learner applauded an attending who could “come up with schematics of how to approach problems rather than feeding us factual information of this paper or this trial.”

Additionally, attendings facilitated learning across the entire care team by differentiating their teaching to meet the needs of multiple learning levels. While the entire team was explicitly included in the learning process, attendings encouraged learners to play various roles, execute tasks, and answer questions depending on their educational level. Attendings positioned learners as leaders of the team by allowing them to talk without interruption and by encouraging them to take ownership of their patients’ care. One former learner stated, “She set expectations…we would be the ones who would be running the team, that you know it would very much be our team and that she is there to advise us and provide supervision but also safety for the patients as well.”

Table 3

CONCLUSION

This study reveals the complex ways effective attendings build rapport, create a safe learning environment, utilize patient-centered teaching strategies, and engage in collaboration and coaching with all members of the team. These findings provide a framework of shared themes and their salient behaviors that may influence the success of inpatient general medicine clinician educators (Table 3).

There is a broad and voluminous literature on the subject of outstanding clinical teaching characteristics, much of which has shaped various faculty development curricula for decades. This study sought not to identify novel approaches of inpatient teaching necessarily but rather to closely examine the techniques and behaviors of clinician educators identified as exemplary. The findings affirm and reinforce the numerous, well-documented lists of personal attributes, techniques, and behaviors that resonate with learners, including creating a positive environment, demonstrating enthusiasm and interest in the learner, reading facial expressions, being student-centered, maintaining a high level of clinical knowledge, and utilizing effective communication skills.18-24 The strengths of this study lie within the nuanced and rich observations and discussions that move beyond learners’ Likert scale evaluations and responses.3-7,12 Input was sought from multiple perspectives on the care team, which provided detail from key stakeholders. Out of these comprehensive data arose several conclusions that extend the research literature on medical education.

In their seminal review, Sutkin et al.18 demonstrate that two thirds of characteristics of outstanding clinical teachers are “noncognitive” and that, “Perhaps what makes a clinical educator truly great depends less on the acquisition of cognitive skills such as medical knowledge and formulating learning objectives, and more on inherent, relationship-based, noncognitive attributes. Whereas cognitive abilities generally involve skills that may be taught and learned, albeit with difficulty, noncognitive abilities represent personal attributes, such as relationship skills, personality types, and emotional states, which are more difficult to develop and teach.”18 Our study, thus, adds to the literature by (1) highlighting examples of techniques and behaviors that encompass the crucial “noncognitive” arena and (2) informing best practices in teaching clinical medicine, especially those that resonate with learners, for future faculty development.

The findings highlight the role that relationships play in the teaching and learning of team-based medicine. Building rapport and sustaining successful relationships are cornerstones of effective teaching.18 For the attendings in this study, this manifested in observable, tangible behaviors such as greeting others by name, joking, using physical touch, and actively involving all team members, regardless of role or level of education. Previous literature has highlighted the importance of showing interest in learners.7,19,25-27 This study provides multiple and varied examples of ways in which interest might be displayed.

For patients, the critical role of relationships was evidenced through rapport building and attention to patients as people outside their acute hospitalization. For instance, attendings regularly put patients’ medical issues into context and anticipated future outpatient challenges. To the authors’ knowledge, previous scholarship has not significantly emphasized this form of contextualized medicine, which involves the mindful consideration of the ongoing needs patients may experience upon transitions of care.

Several participants highlighted humility as an important characteristic of effective clinician educators. Attendings recognized that the field produces more new knowledge than can possibly be assimilated and that uncertainty is a mainstay of modern medical care. Attendings frequently utilized self-deprecation to acknowledge doubt, a technique that created a collaborative environment in which learners also felt safe to ask questions. These findings support the viewpoints by Reilly and Beckman that humility and an appreciation for questions and push-back from learners encourage lifelong learning through role modeling.19,23 In responding to the interviewer’s question “And what happens when [the attending] is wrong?” one learner simply stated, “He makes fun of himself.”

This study has several limitations. First, it was conducted in a limited number of US based healthcare systems. The majority of institutions represented were larger, research intensive hospitals. While these hospitals were purposefully selected to provide a range in geography, size, type, and access to resources, the findings may differ in other settings. Second, it was conducted with a limited number of attendings and learners, which may limit the study’s generalizability. However, enough interviews were conducted to reach data saturation.15 Because evidence for a causal relationship between quality teaching and student and patient outcomes is lacking,18 we must rely on imperfect proxies for teaching excellence, including awards and recognition. This study attempted to identify exemplary educators through various means, but it is recognized that bias is likely. Third, because attendings provided lists of former learners, selection and recall biases may have been introduced, as attendings may have more readily identified former learners with whom they formed strong relationships. Fourth, focus was placed exclusively on teaching and learning within general medicine rounds. This was because there would be ample opportunity for teaching on this service, the structure of the teams and the types of patients would be comparable across sites, and the principal investigator was also a general medicine attending and would have a frame of reference for these types of rounds. Due to this narrow focus, the findings may not be generalizable to other subspecialties. Fifth, attendings were selected through a nonexhaustive method. However, the multisite design, the modified snowball sampling, and the inclusion of several types of institutions in the final participant pool introduced diversity to the final list. Finally, although we cannot discount the potential role of a Hawthorne effect on our data collection, the research team did attempt to mitigate this by standing apart from the care teams and remaining unobtrusive during observations.

Using a combination of interviews, focus group discussions, and direct observation, we identified consistent techniques and behaviors of excellent teaching attendings during inpatient general medicine rounds. We hope that all levels of clinician educators may use them to elevate their own teaching.

 

 

Disclosure

Dr. Saint is on a medical advisory board of Doximity, a new social networking site for physicians, and receives an honorarium. He is also on the scientific advisory board of Jvion, a healthcare technology company. Drs. Houchens, Harrod, Moody, and Ms. Fowler have no conflicts of interest.

Files
References

1. Accreditation Council for Graduate Medical Education. Common program requirements. 2011. http://www.acgme.org/Portals/0/PDFs/Common_Program_Requirements_07012011[2].pdf. Accessed September 16, 2016.
2. Healthcare Cost and Utilization Project. Overview statistics for inpatient hospital stays. HCUP Facts and Figures: Statistics on Hospital-Based Care in the United States, 2009. Rockville, MD: Agency for Healthcare Research and Quality; 2011.
3. Busari JO, W eggelaar NM, Knottnerus AC, Greidanus PM, Scherpbier AJ. How medical residents perceive the quality of supervision provided by attending doctors in the clinical setting. Med Educ. 2005;39(7):696-703. PubMed
4. Smith CA, Varkey AB, Evans AT, Reilly BM. Evaluating the performance of inpatient attending physicians: a new instrument for today’s teaching hospitals. J Gen Intern Med. 2004;19(7):766-771. PubMed
5. Elnicki DM, Cooper A. Medical students’ perceptions of the elements of effective inpatient teaching by attending physicians and housestaff. J Gen Intern Med. 2005;20(7):635-639. PubMed
6. Buchel TL, Edwards FD. Characteristics of effective clinical teachers. Fam Med. 2005;37(1):30-35. PubMed
7. Guarino CM, Ko CY, Baker LC, Klein DJ, Quiter ES, Escarce JJ. Impact of instructional practices on student satisfaction with attendings’ teaching in the inpatient component of internal medicine clerkships. J Gen Intern Med. 2006;21(1):7-12. PubMed
8. Irby DM. How attending physicians make instructional decisions when conducting teaching rounds. Acad Med. 1992;67(10):630-638. PubMed
9. Beckman TJ. Lessons learned from a peer review of bedside teaching. Acad Med. 2004;79(4):343-346. PubMed
10. Wright SM, Carrese JA. Excellence in role modelling: insight and perspectives from the pros. CMAJ. 2002;167(6):638-643. PubMed
11. Castiglioni A, Shewchuk RM, Willett LL, Heudebert GR, Centor RM. A pilot study using nominal group technique to assess residents’ perceptions of successful attending rounds. J Gen Intern Med. 2008;23(7):1060-1065. PubMed
12. Bergman K, Gaitskill T. Faculty and student perceptions of effective clinical teachers: an extension study. J Prof Nurs. 1990;6(1):33-44. PubMed
13. Richards L, Morse J. README FIRST for a User’s Guide to Qualitative Methods. 3rd ed. Los Angeles, CA: SAGE Publications, Inc.; 2013. 
14. U.S. News and World Report. Best Medical Schools: Research. 2014. http://grad-schools.usnews.rankingsandreviews.com/best-graduate-schools/top-medical-schools/research-rankings. Accessed September 16, 2016.
15. Guest G, Bunce A, Johnson L. How many interviews are enough? An experiment with data saturation and variability. Field Methods. 2006;18(1):59-82. 
16. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol. 2006;3(2):77-101. 
17. Aronson J. A pragmatic view of thematic analysis. Qual Rep. 1995;2(1):1-3. 
18. Sutkin G, Wagner E, Harris I, Schiffer R. What makes a good clinical teacher in medicine? A review of the literature. Acad Med. 2008;83(5):452-466. PubMed
19. Beckman TJ, Lee MC. Proposal for a collaborative approach to clinical teaching. Mayo Clin Proc. 2009;84(4):339-344. PubMed
20. Ramani S. Twelve tips to improve bedside teaching. Med Teach. 2003;25(2):112-115. PubMed
21. Irby DM. What clinical teachers in medicine need to know. Acad Med. 1994;69(5):333-342. PubMed
22. Wiese J, ed. Teaching in the Hospital. Philadelphia, PA: American College of Physicians; 2010. 
23. Reilly BM. Inconvenient truths about effective clinical teaching. Lancet. 2007;370(9588):705-711. PubMed
24. Branch WT Jr, Kern D, Haidet P, et al. The patient-physician relationship. Teaching the human dimensions of care in clinical settings. JAMA. 2001;286(9):1067-1074. PubMed
25. McLeod PJ, Harden RM. Clinical teaching strategies for physicians. Med Teach. 1985;7(2):173-189. PubMed
26. Pinsky LE, Monson D, Irby DM. How excellent teachers are made: reflecting on success to improve teaching. Adv Health Sci Educ Theory Pract. 1998;3(3):207-215. PubMed
27. Ullian JA, Bland CJ, Simpson DE. An alternative approach to defining the role of the clinical teacher. Acad Med. 1994;69(10):832-838. PubMed

Article PDF
Issue
Journal of Hospital Medicine 12(7)
Topics
Page Number
503-509
Sections
Files
Files
Article PDF
Article PDF

Clinician educators face numerous obstacles to their joint mission of facilitating learning while also ensuring high-quality and patient-centered care. Time constraints, including the institution of house officer duty hour limitations,1 shorter lengths of stay for hospitalized patients,2 and competing career responsibilities, combine to create a dynamic learning environment. Additionally, clinician educators must balance the autonomy of their learners with the safety of their patients. They must teach to multiple learning levels and work collaboratively with multiple disciplines to foster an effective team-based approach to patient care. Yet, many clinician educators have no formal training in pedagogical methods.3 Such challenges necessitate increased attention to the work of excellent clinician educators and their respective teaching approaches.

Many studies of clinical teaching rely primarily on survey data of attributes of good clinical teachers.3-7 While some studies have incorporated direct observations of teaching8,9 or interviews with clinician educators or learners,10,11 few have incorporated multiple perspectives from the current team and from former learners in order to provide a comprehensive picture of team-based learning.12

The goal of this study was to gain a thorough understanding, through multiple perspectives, of the techniques and behaviors used by exemplary educators within actual clinical environments. We studied attitudes, behaviors, and approaches of 12 such inpatient clinician educators.

METHODS

Study Design and Sampling

This was a multisite study using an exploratory qualitative approach to inquiry. This approach was used to study the techniques and behaviors of excellent attendings during inpatient general medicine rounds. A modified snowball sampling approach13 was used, meaning individuals known to one member of the research team (SS) were initially contacted and asked to identify clinician educators (also referred to as attendings) for potential inclusion in the study. In an effort to identify attendings from a broad range of medical schools, the “2015 U.S. News and World Report Top Medical Schools: Research” rankings14 were also reviewed, with priority given to the top 25, as these are widely used to represent the best US hospitals. In an attempt to invite attendings from diverse institutions, additional medical schools not in the top 25 as well as historically black medical schools were also included. Division chiefs and chairs of internal medicine and/or directors of internal medicine residency programs at these schools were contacted and asked for recommendations of attendings, both within and outside their institutions, who they considered to be great inpatient teachers. In addition, key experts who have won teaching awards or were known to be specialists in the field of medical education were asked to nominate one or two other outstanding attendings.

Table 1
 

 

By using this sampling method, 59 potential participants were identified. An internet search was conducted to obtain information about the potential participants and their institutions. Organizational characteristics such as geographic location, hospital size and affiliation, and patient population, as well as individual characteristics such as gender, medical education and training, and educational awards received were considered so that a diversity of organizations and backgrounds was represented. The list was narrowed down to 16 attendings who were contacted via e-mail and asked to participate. Interested participants were asked for a list of their current team members and 6 to 10 former learners to contact for interviews and focus groups. Former learners were included in an effort to better understand lasting effects on learners from their exemplary teaching attendings. A total of 12 attending physicians agreed to participate (Table 1). Literature on field methods has shown that 12 interviews are found to be adequate in accomplishing data saturation.15 Although 2 attendings were located at the same institution, we decided to include them given that both are recognized as master clinician educators and were each recommended by several individuals from various institutions. Hospitals were located throughout the US and included both university-affiliated hospitals and Veterans Affairs medical centers. Despite efforts to include physicians from historically black colleges and universities, only one attending was identified, and they declined the request to participate.

Data Collection

Observations. The one-day site visits were mainly conducted by two research team members, a physician (SS) and a medical anthropologist (MH), both of whom have extensive experience in qualitative methods. Teams were not uniform but were generally comprised of 1 attending, 1 senior medical resident, 1 to 2 interns, and approximately 2 medical students. Occasionally, a pharmacist, clinical assistant, or other health professional accompanied the team on rounds. Not infrequently, the bedside nurse would explicitly be included in the discussion regarding his or her specific patient. Each site visit began with observing attendings (N = 12) and current learners (N = 57) during rounds. Each research team member recorded their own observations via handwritten field notes, paying particular attention to group interactions, teaching approach, conversations occurring within and peripheral to the team, patient-team interactions, and the physical environment. By standing outside of the medical team circle and remaining silent during rounds, research team members remained unobtrusive to the discussion and process of rounds. Materials the attendings used during their teaching rounds were also documented and collected. Rounds generally lasted 2 to 3 hours. After each site visit, the research team met to compare and combine field notes.

Interviews and Focus Groups. The research team then conducted individual, semi-structured interviews with the attendings, focus groups with their current team (N = 46), and interviews or focus groups with their former learners (N = 26; Supplement 1). Eleven of the current team members observed during rounds were unable to participate in the focus groups due to clinical duties. Because the current learners who participated in the focus groups were also observed during rounds, the research team was able to ask them open-ended questions regarding teaching rounds and their roles as learners within this environment. Former learners who were still at the hospital participated in separate focus groups or interviews. Former learners who were no longer present at the hospital were contacted by telephone and individually interviewed by one research team member (MH). All interviews and focus groups were audio-recorded and transcribed.

This study was determined to be exempt by the University of Michigan Institutional Review Board. All participants were informed that their participation was completely voluntary and that they could terminate their involvement at any time.

Data Analysis

Data were analyzed using a thematic analysis approach.16 Thematic analysis entails reading through the data to identify patterns (and create codes) that relate to behaviors, experiences, meanings, and activities. Once patterns have been identified, they are grouped according to similarity into themes, which help to further explain the findings.17

After the first site visit was completed, the research team members that participated (SS and MH) met to develop initial ideas about meanings and possible patterns. All transcripts were read by one team member (MH) and, based on review of the data, codes were developed, defined, and documented in a codebook. This process was repeated after every site visit using the codebook to expand or combine codes and refine definitions as necessary. If a new code was added, the previously coded data were reviewed to apply the new code. NVivo® 10 software (QSR International; Melbourne, Australia) was used to manage the data.

Once all field notes and transcripts were coded (MH), the code reports, which list all data described within a specific code, were run to ensure consistency and identify relationships between codes. Once coding was verified, codes were grouped based on similarities and relationships into salient themes by 3 members of the research team (NH, MH, and SM). Themes, along with their supporting codes, were then further defined to understand how these attendings worked to facilitate excellent teaching in clinical settings.

Table 2
 

 

RESULTS

The coded interview data and field notes were categorized into broad, overlapping themes. Three of these major themes include (1) fostering positive relationships, (2) patient-centered teaching, and (3) collaboration and coaching. Table 2 lists each theme, salient behaviors, examples, and selected quotes that further elucidate its meaning.

Fostering Positive Relationships

Attending physicians took observable steps to develop positive relationships with their team members, which in turn created a safe learning environment. For instance, attendings used learners’ first names, demonstrated interest in their well-being, deployed humor, and generally displayed informal actions—uncrossed arms, “fist bump” when recognizing learners’ success, standing outside the circle of team members and leaning in to listen—during learner interactions. Attendings also made it a priority to get to know individuals on a personal level. As one current learner put it, “He asks about where we are from. He will try to find some kind of connection that he can establish with not only each of the team members but also with each of the patients.”

Additionally, attendings built positive relationships with their learners by responding thoughtfully to their input, even when learners’ evaluations of patients required modification. In turn, learners reported feeling safe to ask questions, admit uncertainty, and respectfully disagree with their attendings. As one attending reflected, “If I can get them into a place where they feel like the learning environment is someplace where they can make a mistake and know that that mistake does not necessarily mean that it’s going to cost them in their evaluation part, then I feel like that’s why it’s important.”

To build rapport and create a safe learning environment, attendings used a number of strategies to position themselves as learners alongside their team members. For instance, attendings indicated that they wanted their ideas questioned because they saw it as an opportunity to learn. Moreover, in conversations with learners, attendings demonstrated humility, admitting when they did not know something. One former learner noted, “There have been times when he has asked [a] question…nobody knows and then he admits that he doesn’t know either. So everybody goes and looks it up…The whole thing turns out to be a fun learning experience.”

Attendings demonstrated respect for their team members’ time by reading about patients before rounds, identifying learning opportunities during rounds, and integrating teaching points into the daily work of patient care. Teaching was not relegated exclusively to the conference room or confined to the traditional “chalk talk” before or after rounds but rather was assimilated into daily workflow. They appeared to be responsive to the needs of individual patients and the team, which allowed attendings to both directly oversee their patients’ care and overcome the challenges of multiple competing demands for time. The importance of this approach was made clear by one current learner who stated “…she does prepare before, especially you know on call days, she does prepare for the new patients before coming in to staff, which is really appreciated… it saves a lot of time on rounds.”

Attendings also included other health professionals in team discussions. Attendings used many of the same relationship-building techniques with these professionals as they did with learners and patients. They consistently asked these professionals to provide insight and direction in patients’ plans of care. A former learner commented, “He always asks the [nurse] what is her impression of the patient...he truly values the [nurse’s] opinion of the patient.” One attending reiterated this approach, stating “I don’t want them to think that anything I have to say is more valuable than our pharmacist or the [nurse].”

Patient-Centered Teaching

Attending physicians modeled numerous teaching techniques that focused learning around the patient. Attendings knew their patients well through review of the medical records, discussion with the patient, and personal examination. This preparation allowed attendings to focus on key teaching points in the context of the patient. One former learner noted, “He tended to bring up a variety of things that really fit well into the clinical scenario. So whether that is talking about what is the differential for a new symptom that just came up for this patient or kind of here is a new paper talking about this condition or maybe some other pearl of physical exam for a patient that has a certain physical condition.”

Attendings served as effective role models by being directly involved in examining and talking with patients as well as demonstrating excellent physical examination and communication techniques. One current learner articulated the importance of learning these skills by observing them done well: “I think he teaches by example and by doing, again, those little things: being attentive to the patients and being very careful during exams…I think those are things that you teach people by doing them, not by saying you need to do this better during the patient encounter.”

 

 

Collaboration and Coaching

Attending physicians used varied collaboration and coaching techniques to facilitate learning across the entire care team. During rounds, attendings utilized visual aids to reinforce key concepts and simplify complex topics. They also collaborated by using discussion rather than lecture to engage with team members. For instance, attendings used Socratic questioning, asking questions that lead learners through critical thinking and allow them to solve problems themselves, to guide learners’ decision-making. One former learner reported, “He never gives you the answer, and he always asks your opinion; ‘So what are your thoughts on this?’”

Coaching for success, rather than directing the various team members, was emphasized. Attendings did not wish to be seen as the “leaders” of the team. During rounds, one attending was noted to explain his role in ensuring that the team was building connections with others: “When we have a bad outcome, if it feels like your soul has been ripped out, then you’ve done something right. You’ve made that connection with the patient. My job, as your coach, was to build communication between all of us so we feel vested in each other and our patients.”

Attendings also fostered clinical reasoning skills in their learners by encouraging them to verbalize their thought processes aloud in order to clarify and check for understanding. Attendings also placed emphasis not simply on memorizing content but rather prioritization of the patient’s problems and thinking step by step through individual medical problems. One current learner applauded an attending who could “come up with schematics of how to approach problems rather than feeding us factual information of this paper or this trial.”

Additionally, attendings facilitated learning across the entire care team by differentiating their teaching to meet the needs of multiple learning levels. While the entire team was explicitly included in the learning process, attendings encouraged learners to play various roles, execute tasks, and answer questions depending on their educational level. Attendings positioned learners as leaders of the team by allowing them to talk without interruption and by encouraging them to take ownership of their patients’ care. One former learner stated, “She set expectations…we would be the ones who would be running the team, that you know it would very much be our team and that she is there to advise us and provide supervision but also safety for the patients as well.”

Table 3

CONCLUSION

This study reveals the complex ways effective attendings build rapport, create a safe learning environment, utilize patient-centered teaching strategies, and engage in collaboration and coaching with all members of the team. These findings provide a framework of shared themes and their salient behaviors that may influence the success of inpatient general medicine clinician educators (Table 3).

There is a broad and voluminous literature on the subject of outstanding clinical teaching characteristics, much of which has shaped various faculty development curricula for decades. This study sought not to identify novel approaches of inpatient teaching necessarily but rather to closely examine the techniques and behaviors of clinician educators identified as exemplary. The findings affirm and reinforce the numerous, well-documented lists of personal attributes, techniques, and behaviors that resonate with learners, including creating a positive environment, demonstrating enthusiasm and interest in the learner, reading facial expressions, being student-centered, maintaining a high level of clinical knowledge, and utilizing effective communication skills.18-24 The strengths of this study lie within the nuanced and rich observations and discussions that move beyond learners’ Likert scale evaluations and responses.3-7,12 Input was sought from multiple perspectives on the care team, which provided detail from key stakeholders. Out of these comprehensive data arose several conclusions that extend the research literature on medical education.

In their seminal review, Sutkin et al.18 demonstrate that two thirds of characteristics of outstanding clinical teachers are “noncognitive” and that, “Perhaps what makes a clinical educator truly great depends less on the acquisition of cognitive skills such as medical knowledge and formulating learning objectives, and more on inherent, relationship-based, noncognitive attributes. Whereas cognitive abilities generally involve skills that may be taught and learned, albeit with difficulty, noncognitive abilities represent personal attributes, such as relationship skills, personality types, and emotional states, which are more difficult to develop and teach.”18 Our study, thus, adds to the literature by (1) highlighting examples of techniques and behaviors that encompass the crucial “noncognitive” arena and (2) informing best practices in teaching clinical medicine, especially those that resonate with learners, for future faculty development.

The findings highlight the role that relationships play in the teaching and learning of team-based medicine. Building rapport and sustaining successful relationships are cornerstones of effective teaching.18 For the attendings in this study, this manifested in observable, tangible behaviors such as greeting others by name, joking, using physical touch, and actively involving all team members, regardless of role or level of education. Previous literature has highlighted the importance of showing interest in learners.7,19,25-27 This study provides multiple and varied examples of ways in which interest might be displayed.

For patients, the critical role of relationships was evidenced through rapport building and attention to patients as people outside their acute hospitalization. For instance, attendings regularly put patients’ medical issues into context and anticipated future outpatient challenges. To the authors’ knowledge, previous scholarship has not significantly emphasized this form of contextualized medicine, which involves the mindful consideration of the ongoing needs patients may experience upon transitions of care.

Several participants highlighted humility as an important characteristic of effective clinician educators. Attendings recognized that the field produces more new knowledge than can possibly be assimilated and that uncertainty is a mainstay of modern medical care. Attendings frequently utilized self-deprecation to acknowledge doubt, a technique that created a collaborative environment in which learners also felt safe to ask questions. These findings support the viewpoints by Reilly and Beckman that humility and an appreciation for questions and push-back from learners encourage lifelong learning through role modeling.19,23 In responding to the interviewer’s question “And what happens when [the attending] is wrong?” one learner simply stated, “He makes fun of himself.”

This study has several limitations. First, it was conducted in a limited number of US based healthcare systems. The majority of institutions represented were larger, research intensive hospitals. While these hospitals were purposefully selected to provide a range in geography, size, type, and access to resources, the findings may differ in other settings. Second, it was conducted with a limited number of attendings and learners, which may limit the study’s generalizability. However, enough interviews were conducted to reach data saturation.15 Because evidence for a causal relationship between quality teaching and student and patient outcomes is lacking,18 we must rely on imperfect proxies for teaching excellence, including awards and recognition. This study attempted to identify exemplary educators through various means, but it is recognized that bias is likely. Third, because attendings provided lists of former learners, selection and recall biases may have been introduced, as attendings may have more readily identified former learners with whom they formed strong relationships. Fourth, focus was placed exclusively on teaching and learning within general medicine rounds. This was because there would be ample opportunity for teaching on this service, the structure of the teams and the types of patients would be comparable across sites, and the principal investigator was also a general medicine attending and would have a frame of reference for these types of rounds. Due to this narrow focus, the findings may not be generalizable to other subspecialties. Fifth, attendings were selected through a nonexhaustive method. However, the multisite design, the modified snowball sampling, and the inclusion of several types of institutions in the final participant pool introduced diversity to the final list. Finally, although we cannot discount the potential role of a Hawthorne effect on our data collection, the research team did attempt to mitigate this by standing apart from the care teams and remaining unobtrusive during observations.

Using a combination of interviews, focus group discussions, and direct observation, we identified consistent techniques and behaviors of excellent teaching attendings during inpatient general medicine rounds. We hope that all levels of clinician educators may use them to elevate their own teaching.

 

 

Disclosure

Dr. Saint is on a medical advisory board of Doximity, a new social networking site for physicians, and receives an honorarium. He is also on the scientific advisory board of Jvion, a healthcare technology company. Drs. Houchens, Harrod, Moody, and Ms. Fowler have no conflicts of interest.

Clinician educators face numerous obstacles to their joint mission of facilitating learning while also ensuring high-quality and patient-centered care. Time constraints, including the institution of house officer duty hour limitations,1 shorter lengths of stay for hospitalized patients,2 and competing career responsibilities, combine to create a dynamic learning environment. Additionally, clinician educators must balance the autonomy of their learners with the safety of their patients. They must teach to multiple learning levels and work collaboratively with multiple disciplines to foster an effective team-based approach to patient care. Yet, many clinician educators have no formal training in pedagogical methods.3 Such challenges necessitate increased attention to the work of excellent clinician educators and their respective teaching approaches.

Many studies of clinical teaching rely primarily on survey data of attributes of good clinical teachers.3-7 While some studies have incorporated direct observations of teaching8,9 or interviews with clinician educators or learners,10,11 few have incorporated multiple perspectives from the current team and from former learners in order to provide a comprehensive picture of team-based learning.12

The goal of this study was to gain a thorough understanding, through multiple perspectives, of the techniques and behaviors used by exemplary educators within actual clinical environments. We studied attitudes, behaviors, and approaches of 12 such inpatient clinician educators.

METHODS

Study Design and Sampling

This was a multisite study using an exploratory qualitative approach to inquiry. This approach was used to study the techniques and behaviors of excellent attendings during inpatient general medicine rounds. A modified snowball sampling approach13 was used, meaning individuals known to one member of the research team (SS) were initially contacted and asked to identify clinician educators (also referred to as attendings) for potential inclusion in the study. In an effort to identify attendings from a broad range of medical schools, the “2015 U.S. News and World Report Top Medical Schools: Research” rankings14 were also reviewed, with priority given to the top 25, as these are widely used to represent the best US hospitals. In an attempt to invite attendings from diverse institutions, additional medical schools not in the top 25 as well as historically black medical schools were also included. Division chiefs and chairs of internal medicine and/or directors of internal medicine residency programs at these schools were contacted and asked for recommendations of attendings, both within and outside their institutions, who they considered to be great inpatient teachers. In addition, key experts who have won teaching awards or were known to be specialists in the field of medical education were asked to nominate one or two other outstanding attendings.

Table 1
 

 

By using this sampling method, 59 potential participants were identified. An internet search was conducted to obtain information about the potential participants and their institutions. Organizational characteristics such as geographic location, hospital size and affiliation, and patient population, as well as individual characteristics such as gender, medical education and training, and educational awards received were considered so that a diversity of organizations and backgrounds was represented. The list was narrowed down to 16 attendings who were contacted via e-mail and asked to participate. Interested participants were asked for a list of their current team members and 6 to 10 former learners to contact for interviews and focus groups. Former learners were included in an effort to better understand lasting effects on learners from their exemplary teaching attendings. A total of 12 attending physicians agreed to participate (Table 1). Literature on field methods has shown that 12 interviews are found to be adequate in accomplishing data saturation.15 Although 2 attendings were located at the same institution, we decided to include them given that both are recognized as master clinician educators and were each recommended by several individuals from various institutions. Hospitals were located throughout the US and included both university-affiliated hospitals and Veterans Affairs medical centers. Despite efforts to include physicians from historically black colleges and universities, only one attending was identified, and they declined the request to participate.

Data Collection

Observations. The one-day site visits were mainly conducted by two research team members, a physician (SS) and a medical anthropologist (MH), both of whom have extensive experience in qualitative methods. Teams were not uniform but were generally comprised of 1 attending, 1 senior medical resident, 1 to 2 interns, and approximately 2 medical students. Occasionally, a pharmacist, clinical assistant, or other health professional accompanied the team on rounds. Not infrequently, the bedside nurse would explicitly be included in the discussion regarding his or her specific patient. Each site visit began with observing attendings (N = 12) and current learners (N = 57) during rounds. Each research team member recorded their own observations via handwritten field notes, paying particular attention to group interactions, teaching approach, conversations occurring within and peripheral to the team, patient-team interactions, and the physical environment. By standing outside of the medical team circle and remaining silent during rounds, research team members remained unobtrusive to the discussion and process of rounds. Materials the attendings used during their teaching rounds were also documented and collected. Rounds generally lasted 2 to 3 hours. After each site visit, the research team met to compare and combine field notes.

Interviews and Focus Groups. The research team then conducted individual, semi-structured interviews with the attendings, focus groups with their current team (N = 46), and interviews or focus groups with their former learners (N = 26; Supplement 1). Eleven of the current team members observed during rounds were unable to participate in the focus groups due to clinical duties. Because the current learners who participated in the focus groups were also observed during rounds, the research team was able to ask them open-ended questions regarding teaching rounds and their roles as learners within this environment. Former learners who were still at the hospital participated in separate focus groups or interviews. Former learners who were no longer present at the hospital were contacted by telephone and individually interviewed by one research team member (MH). All interviews and focus groups were audio-recorded and transcribed.

This study was determined to be exempt by the University of Michigan Institutional Review Board. All participants were informed that their participation was completely voluntary and that they could terminate their involvement at any time.

Data Analysis

Data were analyzed using a thematic analysis approach.16 Thematic analysis entails reading through the data to identify patterns (and create codes) that relate to behaviors, experiences, meanings, and activities. Once patterns have been identified, they are grouped according to similarity into themes, which help to further explain the findings.17

After the first site visit was completed, the research team members that participated (SS and MH) met to develop initial ideas about meanings and possible patterns. All transcripts were read by one team member (MH) and, based on review of the data, codes were developed, defined, and documented in a codebook. This process was repeated after every site visit using the codebook to expand or combine codes and refine definitions as necessary. If a new code was added, the previously coded data were reviewed to apply the new code. NVivo® 10 software (QSR International; Melbourne, Australia) was used to manage the data.

Once all field notes and transcripts were coded (MH), the code reports, which list all data described within a specific code, were run to ensure consistency and identify relationships between codes. Once coding was verified, codes were grouped based on similarities and relationships into salient themes by 3 members of the research team (NH, MH, and SM). Themes, along with their supporting codes, were then further defined to understand how these attendings worked to facilitate excellent teaching in clinical settings.

Table 2
 

 

RESULTS

The coded interview data and field notes were categorized into broad, overlapping themes. Three of these major themes include (1) fostering positive relationships, (2) patient-centered teaching, and (3) collaboration and coaching. Table 2 lists each theme, salient behaviors, examples, and selected quotes that further elucidate its meaning.

Fostering Positive Relationships

Attending physicians took observable steps to develop positive relationships with their team members, which in turn created a safe learning environment. For instance, attendings used learners’ first names, demonstrated interest in their well-being, deployed humor, and generally displayed informal actions—uncrossed arms, “fist bump” when recognizing learners’ success, standing outside the circle of team members and leaning in to listen—during learner interactions. Attendings also made it a priority to get to know individuals on a personal level. As one current learner put it, “He asks about where we are from. He will try to find some kind of connection that he can establish with not only each of the team members but also with each of the patients.”

Additionally, attendings built positive relationships with their learners by responding thoughtfully to their input, even when learners’ evaluations of patients required modification. In turn, learners reported feeling safe to ask questions, admit uncertainty, and respectfully disagree with their attendings. As one attending reflected, “If I can get them into a place where they feel like the learning environment is someplace where they can make a mistake and know that that mistake does not necessarily mean that it’s going to cost them in their evaluation part, then I feel like that’s why it’s important.”

To build rapport and create a safe learning environment, attendings used a number of strategies to position themselves as learners alongside their team members. For instance, attendings indicated that they wanted their ideas questioned because they saw it as an opportunity to learn. Moreover, in conversations with learners, attendings demonstrated humility, admitting when they did not know something. One former learner noted, “There have been times when he has asked [a] question…nobody knows and then he admits that he doesn’t know either. So everybody goes and looks it up…The whole thing turns out to be a fun learning experience.”

Attendings demonstrated respect for their team members’ time by reading about patients before rounds, identifying learning opportunities during rounds, and integrating teaching points into the daily work of patient care. Teaching was not relegated exclusively to the conference room or confined to the traditional “chalk talk” before or after rounds but rather was assimilated into daily workflow. They appeared to be responsive to the needs of individual patients and the team, which allowed attendings to both directly oversee their patients’ care and overcome the challenges of multiple competing demands for time. The importance of this approach was made clear by one current learner who stated “…she does prepare before, especially you know on call days, she does prepare for the new patients before coming in to staff, which is really appreciated… it saves a lot of time on rounds.”

Attendings also included other health professionals in team discussions. Attendings used many of the same relationship-building techniques with these professionals as they did with learners and patients. They consistently asked these professionals to provide insight and direction in patients’ plans of care. A former learner commented, “He always asks the [nurse] what is her impression of the patient...he truly values the [nurse’s] opinion of the patient.” One attending reiterated this approach, stating “I don’t want them to think that anything I have to say is more valuable than our pharmacist or the [nurse].”

Patient-Centered Teaching

Attending physicians modeled numerous teaching techniques that focused learning around the patient. Attendings knew their patients well through review of the medical records, discussion with the patient, and personal examination. This preparation allowed attendings to focus on key teaching points in the context of the patient. One former learner noted, “He tended to bring up a variety of things that really fit well into the clinical scenario. So whether that is talking about what is the differential for a new symptom that just came up for this patient or kind of here is a new paper talking about this condition or maybe some other pearl of physical exam for a patient that has a certain physical condition.”

Attendings served as effective role models by being directly involved in examining and talking with patients as well as demonstrating excellent physical examination and communication techniques. One current learner articulated the importance of learning these skills by observing them done well: “I think he teaches by example and by doing, again, those little things: being attentive to the patients and being very careful during exams…I think those are things that you teach people by doing them, not by saying you need to do this better during the patient encounter.”

 

 

Collaboration and Coaching

Attending physicians used varied collaboration and coaching techniques to facilitate learning across the entire care team. During rounds, attendings utilized visual aids to reinforce key concepts and simplify complex topics. They also collaborated by using discussion rather than lecture to engage with team members. For instance, attendings used Socratic questioning, asking questions that lead learners through critical thinking and allow them to solve problems themselves, to guide learners’ decision-making. One former learner reported, “He never gives you the answer, and he always asks your opinion; ‘So what are your thoughts on this?’”

Coaching for success, rather than directing the various team members, was emphasized. Attendings did not wish to be seen as the “leaders” of the team. During rounds, one attending was noted to explain his role in ensuring that the team was building connections with others: “When we have a bad outcome, if it feels like your soul has been ripped out, then you’ve done something right. You’ve made that connection with the patient. My job, as your coach, was to build communication between all of us so we feel vested in each other and our patients.”

Attendings also fostered clinical reasoning skills in their learners by encouraging them to verbalize their thought processes aloud in order to clarify and check for understanding. Attendings also placed emphasis not simply on memorizing content but rather prioritization of the patient’s problems and thinking step by step through individual medical problems. One current learner applauded an attending who could “come up with schematics of how to approach problems rather than feeding us factual information of this paper or this trial.”

Additionally, attendings facilitated learning across the entire care team by differentiating their teaching to meet the needs of multiple learning levels. While the entire team was explicitly included in the learning process, attendings encouraged learners to play various roles, execute tasks, and answer questions depending on their educational level. Attendings positioned learners as leaders of the team by allowing them to talk without interruption and by encouraging them to take ownership of their patients’ care. One former learner stated, “She set expectations…we would be the ones who would be running the team, that you know it would very much be our team and that she is there to advise us and provide supervision but also safety for the patients as well.”

Table 3

CONCLUSION

This study reveals the complex ways effective attendings build rapport, create a safe learning environment, utilize patient-centered teaching strategies, and engage in collaboration and coaching with all members of the team. These findings provide a framework of shared themes and their salient behaviors that may influence the success of inpatient general medicine clinician educators (Table 3).

There is a broad and voluminous literature on the subject of outstanding clinical teaching characteristics, much of which has shaped various faculty development curricula for decades. This study sought not to identify novel approaches of inpatient teaching necessarily but rather to closely examine the techniques and behaviors of clinician educators identified as exemplary. The findings affirm and reinforce the numerous, well-documented lists of personal attributes, techniques, and behaviors that resonate with learners, including creating a positive environment, demonstrating enthusiasm and interest in the learner, reading facial expressions, being student-centered, maintaining a high level of clinical knowledge, and utilizing effective communication skills.18-24 The strengths of this study lie within the nuanced and rich observations and discussions that move beyond learners’ Likert scale evaluations and responses.3-7,12 Input was sought from multiple perspectives on the care team, which provided detail from key stakeholders. Out of these comprehensive data arose several conclusions that extend the research literature on medical education.

In their seminal review, Sutkin et al.18 demonstrate that two thirds of characteristics of outstanding clinical teachers are “noncognitive” and that, “Perhaps what makes a clinical educator truly great depends less on the acquisition of cognitive skills such as medical knowledge and formulating learning objectives, and more on inherent, relationship-based, noncognitive attributes. Whereas cognitive abilities generally involve skills that may be taught and learned, albeit with difficulty, noncognitive abilities represent personal attributes, such as relationship skills, personality types, and emotional states, which are more difficult to develop and teach.”18 Our study, thus, adds to the literature by (1) highlighting examples of techniques and behaviors that encompass the crucial “noncognitive” arena and (2) informing best practices in teaching clinical medicine, especially those that resonate with learners, for future faculty development.

The findings highlight the role that relationships play in the teaching and learning of team-based medicine. Building rapport and sustaining successful relationships are cornerstones of effective teaching.18 For the attendings in this study, this manifested in observable, tangible behaviors such as greeting others by name, joking, using physical touch, and actively involving all team members, regardless of role or level of education. Previous literature has highlighted the importance of showing interest in learners.7,19,25-27 This study provides multiple and varied examples of ways in which interest might be displayed.

For patients, the critical role of relationships was evidenced through rapport building and attention to patients as people outside their acute hospitalization. For instance, attendings regularly put patients’ medical issues into context and anticipated future outpatient challenges. To the authors’ knowledge, previous scholarship has not significantly emphasized this form of contextualized medicine, which involves the mindful consideration of the ongoing needs patients may experience upon transitions of care.

Several participants highlighted humility as an important characteristic of effective clinician educators. Attendings recognized that the field produces more new knowledge than can possibly be assimilated and that uncertainty is a mainstay of modern medical care. Attendings frequently utilized self-deprecation to acknowledge doubt, a technique that created a collaborative environment in which learners also felt safe to ask questions. These findings support the viewpoints by Reilly and Beckman that humility and an appreciation for questions and push-back from learners encourage lifelong learning through role modeling.19,23 In responding to the interviewer’s question “And what happens when [the attending] is wrong?” one learner simply stated, “He makes fun of himself.”

This study has several limitations. First, it was conducted in a limited number of US based healthcare systems. The majority of institutions represented were larger, research intensive hospitals. While these hospitals were purposefully selected to provide a range in geography, size, type, and access to resources, the findings may differ in other settings. Second, it was conducted with a limited number of attendings and learners, which may limit the study’s generalizability. However, enough interviews were conducted to reach data saturation.15 Because evidence for a causal relationship between quality teaching and student and patient outcomes is lacking,18 we must rely on imperfect proxies for teaching excellence, including awards and recognition. This study attempted to identify exemplary educators through various means, but it is recognized that bias is likely. Third, because attendings provided lists of former learners, selection and recall biases may have been introduced, as attendings may have more readily identified former learners with whom they formed strong relationships. Fourth, focus was placed exclusively on teaching and learning within general medicine rounds. This was because there would be ample opportunity for teaching on this service, the structure of the teams and the types of patients would be comparable across sites, and the principal investigator was also a general medicine attending and would have a frame of reference for these types of rounds. Due to this narrow focus, the findings may not be generalizable to other subspecialties. Fifth, attendings were selected through a nonexhaustive method. However, the multisite design, the modified snowball sampling, and the inclusion of several types of institutions in the final participant pool introduced diversity to the final list. Finally, although we cannot discount the potential role of a Hawthorne effect on our data collection, the research team did attempt to mitigate this by standing apart from the care teams and remaining unobtrusive during observations.

Using a combination of interviews, focus group discussions, and direct observation, we identified consistent techniques and behaviors of excellent teaching attendings during inpatient general medicine rounds. We hope that all levels of clinician educators may use them to elevate their own teaching.

 

 

Disclosure

Dr. Saint is on a medical advisory board of Doximity, a new social networking site for physicians, and receives an honorarium. He is also on the scientific advisory board of Jvion, a healthcare technology company. Drs. Houchens, Harrod, Moody, and Ms. Fowler have no conflicts of interest.

References

1. Accreditation Council for Graduate Medical Education. Common program requirements. 2011. http://www.acgme.org/Portals/0/PDFs/Common_Program_Requirements_07012011[2].pdf. Accessed September 16, 2016.
2. Healthcare Cost and Utilization Project. Overview statistics for inpatient hospital stays. HCUP Facts and Figures: Statistics on Hospital-Based Care in the United States, 2009. Rockville, MD: Agency for Healthcare Research and Quality; 2011.
3. Busari JO, W eggelaar NM, Knottnerus AC, Greidanus PM, Scherpbier AJ. How medical residents perceive the quality of supervision provided by attending doctors in the clinical setting. Med Educ. 2005;39(7):696-703. PubMed
4. Smith CA, Varkey AB, Evans AT, Reilly BM. Evaluating the performance of inpatient attending physicians: a new instrument for today’s teaching hospitals. J Gen Intern Med. 2004;19(7):766-771. PubMed
5. Elnicki DM, Cooper A. Medical students’ perceptions of the elements of effective inpatient teaching by attending physicians and housestaff. J Gen Intern Med. 2005;20(7):635-639. PubMed
6. Buchel TL, Edwards FD. Characteristics of effective clinical teachers. Fam Med. 2005;37(1):30-35. PubMed
7. Guarino CM, Ko CY, Baker LC, Klein DJ, Quiter ES, Escarce JJ. Impact of instructional practices on student satisfaction with attendings’ teaching in the inpatient component of internal medicine clerkships. J Gen Intern Med. 2006;21(1):7-12. PubMed
8. Irby DM. How attending physicians make instructional decisions when conducting teaching rounds. Acad Med. 1992;67(10):630-638. PubMed
9. Beckman TJ. Lessons learned from a peer review of bedside teaching. Acad Med. 2004;79(4):343-346. PubMed
10. Wright SM, Carrese JA. Excellence in role modelling: insight and perspectives from the pros. CMAJ. 2002;167(6):638-643. PubMed
11. Castiglioni A, Shewchuk RM, Willett LL, Heudebert GR, Centor RM. A pilot study using nominal group technique to assess residents’ perceptions of successful attending rounds. J Gen Intern Med. 2008;23(7):1060-1065. PubMed
12. Bergman K, Gaitskill T. Faculty and student perceptions of effective clinical teachers: an extension study. J Prof Nurs. 1990;6(1):33-44. PubMed
13. Richards L, Morse J. README FIRST for a User’s Guide to Qualitative Methods. 3rd ed. Los Angeles, CA: SAGE Publications, Inc.; 2013. 
14. U.S. News and World Report. Best Medical Schools: Research. 2014. http://grad-schools.usnews.rankingsandreviews.com/best-graduate-schools/top-medical-schools/research-rankings. Accessed September 16, 2016.
15. Guest G, Bunce A, Johnson L. How many interviews are enough? An experiment with data saturation and variability. Field Methods. 2006;18(1):59-82. 
16. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol. 2006;3(2):77-101. 
17. Aronson J. A pragmatic view of thematic analysis. Qual Rep. 1995;2(1):1-3. 
18. Sutkin G, Wagner E, Harris I, Schiffer R. What makes a good clinical teacher in medicine? A review of the literature. Acad Med. 2008;83(5):452-466. PubMed
19. Beckman TJ, Lee MC. Proposal for a collaborative approach to clinical teaching. Mayo Clin Proc. 2009;84(4):339-344. PubMed
20. Ramani S. Twelve tips to improve bedside teaching. Med Teach. 2003;25(2):112-115. PubMed
21. Irby DM. What clinical teachers in medicine need to know. Acad Med. 1994;69(5):333-342. PubMed
22. Wiese J, ed. Teaching in the Hospital. Philadelphia, PA: American College of Physicians; 2010. 
23. Reilly BM. Inconvenient truths about effective clinical teaching. Lancet. 2007;370(9588):705-711. PubMed
24. Branch WT Jr, Kern D, Haidet P, et al. The patient-physician relationship. Teaching the human dimensions of care in clinical settings. JAMA. 2001;286(9):1067-1074. PubMed
25. McLeod PJ, Harden RM. Clinical teaching strategies for physicians. Med Teach. 1985;7(2):173-189. PubMed
26. Pinsky LE, Monson D, Irby DM. How excellent teachers are made: reflecting on success to improve teaching. Adv Health Sci Educ Theory Pract. 1998;3(3):207-215. PubMed
27. Ullian JA, Bland CJ, Simpson DE. An alternative approach to defining the role of the clinical teacher. Acad Med. 1994;69(10):832-838. PubMed

References

1. Accreditation Council for Graduate Medical Education. Common program requirements. 2011. http://www.acgme.org/Portals/0/PDFs/Common_Program_Requirements_07012011[2].pdf. Accessed September 16, 2016.
2. Healthcare Cost and Utilization Project. Overview statistics for inpatient hospital stays. HCUP Facts and Figures: Statistics on Hospital-Based Care in the United States, 2009. Rockville, MD: Agency for Healthcare Research and Quality; 2011.
3. Busari JO, W eggelaar NM, Knottnerus AC, Greidanus PM, Scherpbier AJ. How medical residents perceive the quality of supervision provided by attending doctors in the clinical setting. Med Educ. 2005;39(7):696-703. PubMed
4. Smith CA, Varkey AB, Evans AT, Reilly BM. Evaluating the performance of inpatient attending physicians: a new instrument for today’s teaching hospitals. J Gen Intern Med. 2004;19(7):766-771. PubMed
5. Elnicki DM, Cooper A. Medical students’ perceptions of the elements of effective inpatient teaching by attending physicians and housestaff. J Gen Intern Med. 2005;20(7):635-639. PubMed
6. Buchel TL, Edwards FD. Characteristics of effective clinical teachers. Fam Med. 2005;37(1):30-35. PubMed
7. Guarino CM, Ko CY, Baker LC, Klein DJ, Quiter ES, Escarce JJ. Impact of instructional practices on student satisfaction with attendings’ teaching in the inpatient component of internal medicine clerkships. J Gen Intern Med. 2006;21(1):7-12. PubMed
8. Irby DM. How attending physicians make instructional decisions when conducting teaching rounds. Acad Med. 1992;67(10):630-638. PubMed
9. Beckman TJ. Lessons learned from a peer review of bedside teaching. Acad Med. 2004;79(4):343-346. PubMed
10. Wright SM, Carrese JA. Excellence in role modelling: insight and perspectives from the pros. CMAJ. 2002;167(6):638-643. PubMed
11. Castiglioni A, Shewchuk RM, Willett LL, Heudebert GR, Centor RM. A pilot study using nominal group technique to assess residents’ perceptions of successful attending rounds. J Gen Intern Med. 2008;23(7):1060-1065. PubMed
12. Bergman K, Gaitskill T. Faculty and student perceptions of effective clinical teachers: an extension study. J Prof Nurs. 1990;6(1):33-44. PubMed
13. Richards L, Morse J. README FIRST for a User’s Guide to Qualitative Methods. 3rd ed. Los Angeles, CA: SAGE Publications, Inc.; 2013. 
14. U.S. News and World Report. Best Medical Schools: Research. 2014. http://grad-schools.usnews.rankingsandreviews.com/best-graduate-schools/top-medical-schools/research-rankings. Accessed September 16, 2016.
15. Guest G, Bunce A, Johnson L. How many interviews are enough? An experiment with data saturation and variability. Field Methods. 2006;18(1):59-82. 
16. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol. 2006;3(2):77-101. 
17. Aronson J. A pragmatic view of thematic analysis. Qual Rep. 1995;2(1):1-3. 
18. Sutkin G, Wagner E, Harris I, Schiffer R. What makes a good clinical teacher in medicine? A review of the literature. Acad Med. 2008;83(5):452-466. PubMed
19. Beckman TJ, Lee MC. Proposal for a collaborative approach to clinical teaching. Mayo Clin Proc. 2009;84(4):339-344. PubMed
20. Ramani S. Twelve tips to improve bedside teaching. Med Teach. 2003;25(2):112-115. PubMed
21. Irby DM. What clinical teachers in medicine need to know. Acad Med. 1994;69(5):333-342. PubMed
22. Wiese J, ed. Teaching in the Hospital. Philadelphia, PA: American College of Physicians; 2010. 
23. Reilly BM. Inconvenient truths about effective clinical teaching. Lancet. 2007;370(9588):705-711. PubMed
24. Branch WT Jr, Kern D, Haidet P, et al. The patient-physician relationship. Teaching the human dimensions of care in clinical settings. JAMA. 2001;286(9):1067-1074. PubMed
25. McLeod PJ, Harden RM. Clinical teaching strategies for physicians. Med Teach. 1985;7(2):173-189. PubMed
26. Pinsky LE, Monson D, Irby DM. How excellent teachers are made: reflecting on success to improve teaching. Adv Health Sci Educ Theory Pract. 1998;3(3):207-215. PubMed
27. Ullian JA, Bland CJ, Simpson DE. An alternative approach to defining the role of the clinical teacher. Acad Med. 1994;69(10):832-838. PubMed

Issue
Journal of Hospital Medicine 12(7)
Issue
Journal of Hospital Medicine 12(7)
Page Number
503-509
Page Number
503-509
Topics
Article Type
Display Headline
Techniques and behaviors associated with exemplary inpatient general medicine teaching: an exploratory qualitative study
Display Headline
Techniques and behaviors associated with exemplary inpatient general medicine teaching: an exploratory qualitative study
Sections
Article Source

© 2017 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
*Address for correspondence and reprint requests: Nathan Houchens, MD, University of Michigan and Veterans Affairs Ann Arbor Healthcare System, 2215 Fuller Road, Mail Code 111, Ann Arbor, MI 48105; Telephone: 734-845-5922; Fax: 734-913-0883; E-mail: [email protected]
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Disqus Comments
Default
Article PDF Media
Media Files

Improving the readability of pediatric hospital medicine discharge instructions

Article Type
Changed
Display Headline
Improving the readability of pediatric hospital medicine discharge instructions

The transition from hospital to home can be overwhelming for caregivers.1 Stress of hospitalization coupled with the expectation of families to execute postdischarge care plans make understandable discharge communication critical. Communication failures, inadequate education, absence of caregiver confidence, and lack of clarity regarding care plans may prohibit smooth transitions and lead to adverse postdischarge outcomes.2-4

Health literacy plays a pivotal role in caregivers’ capacity to navigate the healthcare system, comprehend, and execute care plans. An estimated 90 million Americans have limited health literacy that may negatively impact the provision of safe and quality care5,6 and be a risk factor for poor outcomes, including increased emergency department (ED) utilization and readmission rates.7-9 Readability strongly influences the effectiveness of written materials.10 However, written medical information for patients and families are frequently between the 10th and 12th grade reading levels; more than 75% of all pediatric health information is written at or above 10th grade reading level.11 Government agencies recommend between a 6th and 8th grade reading level, for written material;5,12,13 written discharge instructions have been identified as an important quality metric for hospital-to-home transitions.14-16

At our center, we found that discharge instructions were commonly written at high reading levels and often incomplete.17 Poor discharge instructions may contribute to increased readmission rates and unnecessary ED visits.9,18 Our global aim targeted improved health-literate written information, including understandability and completeness.

Our specific aim was to increase the percentage of discharge instructions written at or below the 7th grade level for hospital medicine (HM) patients on a community hospital pediatric unit from 13% to 80% in 6 months.

METHODS

Context

The improvement work took place at a 42-bed inpatient pediatric unit at a community satellite of our large, urban, academic hospital. The unit is staffed by medical providers including attendings, fellows, nurse practitioners (NPs), and senior pediatric residents, and had more than 1000 HM discharges in fiscal year 2016. Children with common general pediatric diagnoses are admitted to this service; postsurgical patients are not admitted primarily to the HM service. In Cincinnati, the neighborhood-level high school drop-out rates are as high as 64%.19 Discharge instructions are written by medical providers in the electronic health record (EHR). A printed copy is given to families and verbally reviewed by a bedside nurse prior to discharge. Quality improvement (QI) efforts focused on discharge instructions were ignited by a prior review of 200 discharge instructions that showed they were difficult to read (median reading level of 10th grade), poorly understandable (36% of instructions met the threshold of understandability as measured by the Patient Education Materials Assessment Tool20) and were missing key elements of information.17

 

 

Improvement Team

The improvement team consisted of 4 pediatric hospitalists, 2 NPs, 1 nurse educator with health literacy expertise, 1 pediatric resident, 1 fourth-year medical student, 1 QI consultant, and 2 parents who had first-hand experience on the HM service. The improvement team observed the discharge process, including roles of the provider, nurse and family, outlined a process map, and created a modified failure mode and effect analysis.21 Prior to our work, discharge instructions written by providers often occurred as a last step, and the content was created as free text or from nonstandardized templates. Key drivers that informed interventions were determined and revised over time (Figure 1). The study was reviewed by our institutional review board and deemed not human subjects research.

Figure 1
Improvement Activities

Key drivers were identified, and interventions were executed using Plan-Do Study-Act cycles.22 The key drivers thought to be critical for the success of the QI efforts were family engagement; standardization of discharge instructions; medical staff engagement; and audit and feedback of data. The corresponding interventions were as follows:

Family Engagement

Understanding the discharge information families desired. Prior to testing, 10 families admitted to the HM service were asked about the discharge experience. We asked families about information they wanted in written discharge instructions: 1) reasons to call your primary doctor or return to the hospital; 2) when to see your primary doctor for a follow-up visit; 3) the phone number to reach your child’s doctor; 4) more information about why your child was admitted; 5) information about new medications; and 6) what to do to help your child continue to recover at home.

Development of templates. We engaged families throughout the process of creating general and disease-specific discharge templates. After a specific template was created and reviewed by the parents on our team, it was sent to members of the institutional Patient Education Committee, which includes parents and local health literacy experts, to review and critique. Feedback from the reviewers was incorporated into the templates prior to use in the EHR.

Postdischarge phone calls.A convenience sample of families discharged from the satellite campus was called 24 to 48 hours after discharge over a 2-week period in January, 2016. A member of our improvement team solicited feedback from families about the quality of the discharge instructions. Families were asked if discharge instructions were reviewed with them prior to going home, if they were given a copy of the instructions, how they would rate the ability to read and use the information, and if there were additional pieces of information that would have improved the instructions.

Standardization of Instructions

Education. A presentation was created and shared with medical providers; it was re-disseminated monthly to new residents rotating onto the service and to the attendings, fellows, and NPs scheduled for shifts during the month. This education continued for the duration of the study. The presentation included the definition of health literacy, scope of the problem, examples of poorly written discharge instructions, and tips on how to write readable and understandable instructions. Laminated cards that included tips on how to write instructions were also placed on work stations.

Figure 2
Creation of discharge instruction templates in the EHR.A general discharge instruction template that was initially created and tested in the EHR (Figure 2) included text written below the 7th grade and employed 14 point font, bolded words for emphasis, and lists with bullet points. Asterisks were used to indicate where providers needed to include patient-specific information. The sections included in the general template were informed by feedback from providers and parents prior to testing, parents on the improvement team, and parents of patients admitted to our satellite campus. The sections reflect components critical to successful postdischarge care: discharge diagnosis and its brief description, postdischarge care information, new medications, signs and symptoms that would warrant escalation of care to the patient’s primary care provider or the ED, and follow-up instructions and contact information for the patent’s primary care doctor.

While the general template was an important first step, the content relied heavily on free text by providers, which could still lead to instructions written at a high reading level. Thus, disease-specific discharge instruction templates were created with prepopulated information that was written at a reading level at or below 7th grade level (Figure 2). The diseases were prioritized based on the most common diagnoses on our HM service. Each template included information under each of the subheadings noted in the general template. Twelve disease-specific templates were tested and ultimately embedded in the EHR; the general template remained for use when the discharge diagnosis was not covered by a disease-specific template.

 

 

Medical Staff Engagement

Previously described tests of change also aimed to enhance staff engagement. These included frequent e-mails, discussion of the QI efforts at specific team meetings, and the creation of visual cues posted at computer work stations, which prompted staff to begin to work on discharge instructions soon after admission.

Audit and Feedback of Data

Weekly phone calls. One team updated clinicians through a regularly scheduled bi-weekly phone conference. The phone conference was established prior to our work and was designed to relay pertinent information to attendings and NPs who work at the satellite hospital. During the phone conferences, clinicians were notified of current performance on discharge instruction readability and specific tests of change for the week. Additionally, providers gave feedback about the improvement efforts. These updates continued for the first 6 months of the project until sustained improvements were observed.

E-mails. Weekly e-mails were sent to all providers scheduled for clinical time at the satellite campus. The e-mail contained information on current tests of change, a list of discharge instruction templates that were available in the EHR, and the annotated run chart illustrating readability levels over time.

Additionally, individual e-mails were sent to each provider after review of the written discharge instructions for the week. Providers were given information on the number of discharge instructions they personally composed, the percentage of those instructions that were written at or below 7th grade level, and specific feedback on how their written instructions could be improved. We also encouraged feedback from each provider to better identify barriers to achieving our goal.

Study of the Interventions

Baseline data included a review of all instructions for patients discharged from the satellite campus from the end of April 2015 through mid-September 2015. The time period for testing of interventions during the fall and winter months allowed for rapid cycle learning due to higher patient census and predictability of admissions for specific diagnosis (ie, asthma and bronchiolitis). An automated report was generated from the EHR weekly with specific demographics and identifiers for patient discharged over the past 7 days, including patient age, gender, length of stay, discharge diagnosis, and insurance classification. Data was collected during the intervention period via structured review of the discharge instructions in the EHR by the principal investigator or a trained research coordinator. Discharge instructions for medically cleared mental health patients admitted to hospital medicine while awaiting psychiatric bed availability and patients and parents who were non-English speaking were excluded from review. All other instructions for patients discharged from the HM service at our Liberty Campus were included for review.

Measures

Readability, our primary measure of interest, was calculated using the mean score from the following formulas: Flesch Kincaid Grade Level,23 Simple Measure of Gobbledygook Index,24 Coleman-Liau Index,25 Gunning-Fog Index,26 and Automated Readability Index27 by means of an online platform (https://readability-score.com).28 This platform was chosen because it incorporated a variety of formulas, was user-friendly, and required minimal data cleaning. Each of the readability formulas have been used to assesses readability of health information given to patients and families.29,30 The threshold of 7th grade is in alignment with our institutional policy for educational materials and with recommendations from several government agencies.5,12

Analysis

A statistical process control p-chart was used to analyze our primary measure of readability, dichotomized as percent discharge instructions written at or below 7th grade level. Run charts were used to follow mean reading level of discharge instructions and our process measure of percent of discharge instruction written with a general or disease-specific standardized template. Run chart and control chart rules for identifying special cause were used for midline shifts.31

Table

RESULTS

The Table includes the demographic and clinical information of patients included in our analyses. Through sequential interventions, the percentage of discharge instructions written at or below 7th grade readability level increased from a mean of 13% to more than 80% in 3 months (Figure 3). Furthermore, the mean was sustained above 90% for 10 months and at 98% for the last 4 months. The use of 1 of the 13 EHR templates increased from 0% to 96% and was associated with the largest impact on the overall improvements (Supplemental Figure 1). Additionally, the average reading level of the discharge instructions decreased from 10th grade to 6th grade level (Supplemental Figure 2).

Figure 3

Qualitative comments from providers about the discharge instructions included:

“Are these [discharge instructions] available at base??  Great resource for interns.”
“These [discharge] instructions make the [discharge] process so easy!!! Love these...”
“Also feel like they have helped my discharge teaching in the room!”

Qualitative comments from families postdischarge included:
“I thought the instructions were very clear and easy to read. I especially thought that highlighting the important areas really helped.”
“I think this form looks great, and I really like the idea of having your child’s name on it.”

 

 

DISCUSSION

Through sequential Plan-Do Study-Act cycles, we increased the percentage of discharge instructions written at or below 7th grade reading level from 13% to 98%. Our most impactful intervention was the creation and dissemination of standardized disease-specific discharge instruction templates. Our findings complement evidence in the adult and pediatric literature that the use of standardized, disease-specific discharge instruction templates may improve readability of instructions.32,33 And, while quality improvement efforts have been employed to improve the discharge process for patients,34-36 this is the first study in the inpatient setting that, to our knowledge, specifically addresses discharge instructions using quality improvement methods.

Our work targeted the critical intersection between individual health literacy, an individual’s capacity to acquire, interpret, and use health information, and the necessary changes needed within our healthcare system to ensure that appropriately written instructions are given to patients and families.17,37 Our efforts focused on improving discharge instructions answer the call to consider health literacy a modifiable clinical risk factor.37 Furthermore, we address the 6 aims for quality healthcare delivery: 1) safe, timely, efficient and equitable delivery of care through the creation and dissemination of standardized instructions that are written at the appropriate reading level for families to ease hospital-to-home transitions and streamline the workflow of medical providers; 2) effective education of medical providers on health literacy concepts; and 3) family-centeredness through the involvement of families in our QI efforts. While previous QI efforts to improve hospital-to-home transitions have focused on medication reconciliation, communication with primary care physicians, follow-up appointments, and timely discharges of patients, none have specifically focused on the quality of discharge instructions.34-36

Most physicians do not receive education about how to write information that is readable and understandable; more than half of providers desired more education in this area.38 Furthermore, pediatric providers may overestimate parental health literacy levels,39 which may contribute to variability in the readability of written health materials. While education alone can contribute to a provider’s ability to create readable instructions, we note the improvement after the introduction of disease templates to demonstrate the importance of workflow-integrated higher reliability interventions to sustain improvements.

Our baseline poor readability rates were due to limited knowledge by frontline providers composing the instructions and a system in which an important element for successful hospital-to-home transitions was not tackled until patients were ready for discharge. Streamlining of the discharge process, including the creation of discharge instructions, may lead to improved efficiency, fewer discrepancies, more effective communication, and an enhanced family experience. Moreover, the success of our improvement work was due to key stakeholders, including parents, being a part of the team and the notable buy-in from providers.

Our work was not without limitations. We excluded non-English speaking families from the study. We were unable to measure reading level of our population directly and instead based our goals on national estimates. Our primary measure was readability, which is only 1 piece contributing to quality discharge instructions. Understandability and actionability are also important considerations; 17,20,29,40 however, improvements in these areas were limited by our design options within the EHR. Our efforts focused on children with common general pediatric diagnoses, and it is unclear how our interventions would generalize to medically complex patients with more volume of information to communicate at discharge and with uncommon diagnoses that are less readily incorporated into standardized templates. Relatedly, our work occurred at the satellite campus of our tertiary care center and may not represent generalizable material or methods to implement templates at our main campus location or at other hospitals. To begin to better understand this, we have spread to HM patients at our main campus, including medically complex patients with technology dependence and/or neurological impairments. Standardized, disease-specific templates most relevant to this population as well as several patient specific templates, for those with frequent readmissions due to medical complexity, have been created and are actively being tested.

CONCLUSION

In conclusion, in using interventions targeted at standardization of discharge instructions and timely feedback to staff, we saw rapid, dramatic, and sustained improvement in the readability of discharge instructions. Next steps include adaptation and spread to other patient populations and care teams, collaborations with other centers, and assessing the impact of effectively written discharge instructions on patient outcomes, such as adverse drug events, readmission rates, and family experience.

Disclosure

No external funding was secured for this study. Dr. Brady is supported by a Patient-Centered Outcomes Research Mentored Clinical Investigator Award from the Agency for Healthcare Research and Quality, Award Number K08HS023827. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding organizations. The funding organization had no role in the design, preparation, review, or approval of this paper; nor the decision to submit the manuscript for publication. The authors have no financial relationships relevant to this article to disclose.

Files
References

1. Solan LG, Beck AF, Brunswick SA, et al. The family perspective on hospital to
home transitions: a qualitative study. Pediatrics. 2015;136:e1539-e1549. PubMed
2. Engel KG, Buckley BA, Forth VE, et al. Patient understanding of emergency
department discharge instructions: where are knowledge deficits greatest? Acad
Emerg Med. 2012;19:E1035-E1044. PubMed
3. Ashbrook L, Mourad M, Sehgal N. Communicating discharge instructions to patients:
a survey of nurse, intern, and hospitalist practices. J Hosp Med. 2013;8:
36-41. PubMed
4. Kripalani S, Jacobson TA, Mugalla IC, Cawthon CR, Niesner KJ, Vaccarino V.
Health literacy and the quality of physician-patient communication during hospitalization.
J Hosp Med. 2010;5:269-275. PubMed
5. Institute of Medicine Committee on Health Literacy. Kindig D, Alfonso D, Chudler
E, et al, eds. Health Literacy: A Prescription to End Confusion. Washington,
DC: National Academies Press; 2004. 
6. Yin HS, Johnson M, Mendelsohn AL, Abrams MA, Sanders LM, Dreyer BP. The
health literacy of parents in the United States: a nationally representative study.
Pediatrics. 2009;124(suppl 3):S289-S298. PubMed
7. Rak EC, Hooper SR, Belsante MJ, et al. Caregiver word reading literacy and
health outcomes among children treated in a pediatric nephrology practice. Clin
Kid J. 2016;9:510-515. PubMed
8. Morrison AK, Schapira MM, Gorelick MH, Hoffmann RG, Brousseau DC. Low
caregiver health literacy is associated with higher pediatric emergency department
use and nonurgent visits. Acad Pediatr. 2014;14:309-314. PubMed
9. Howard-Anderson J, Busuttil A, Lonowski S, Vangala S, Afsar-Manesh N. From
discharge to readmission: Understanding the process from the patient perspective.
J Hosp Med. 2016;11:407-412. PubMed
10. Doak CC, Doak LG, Root JH. Teaching Patients with Low Literacy Skills. 2nd ed.
Philadelphia PA: J.B. Lippincott; 1996. PubMed
11. Berkman ND, Sheridan SL, Donahue KE, et al. Health literacy interventions and
outcomes: an updated systematic review. Evid Rep/Technol Assess. 2011;199:1-941. PubMed
12. Prevention CfDCa. Health Literacy for Public Health Professionals. In: Prevention
CfDCa, ed. Atlanta, GA2009. 
13. “What Did the Doctor Say?” Improving Health Literacy to Protect Patient Safety.
Oakbrook Terrace, IL: The Joint Commission, 2007. 
14. Desai AD, Burkhart Q, Parast L, et al. Development and pilot testing of caregiver-
reported pediatric quality measures for transitions between sites of care. Acad
Pediatr. 2016;16:760-769. PubMed
15. Leyenaar JK, Desai AD, Burkhart Q, et al. Quality measures to assess care transitions
for hospitalized children. Pediatrics. 2016;138(2). PubMed
16. Akinsola B, Cheng J, Zmitrovich A, Khan N, Jain S. Improving discharge instructions
in a pediatric emergency department: impact of a quality initiative. Pediatr
Emerg Care. 2017;33:10-13. PubMed
17. Unaka NI, Statile AM, Haney J, Beck AF, Brady PW, Jerardi K. Assessment of
the readability, understandability and completeness of pediatric hospital medicine
discharge instructions J Hosp Med. In press. PubMed
18. Stella SA, Allyn R, Keniston A, et al. Postdischarge problems identified by telephone
calls to an advice line. J Hosp Med. 2014;9:695-699. PubMed
19. Maloney M, Auffrey C. The social areas of Cincinnati.
20. The Patient Education Materials Assessment Tool (PEMAT) and User’s Guide:
An Instrument To Assess the Understandability and Actionability of Print and
Audiovisual Patient Education Materials. Available at: http://www.ahrq.gov/
professionals/prevention-chronic-care/improve/self-mgmt/pemat/index.html. Accessed
November 27, 2013.
21. Cohen MR, Senders J, Davis NM. Failure mode and effects analysis: a novel
approach to avoiding dangerous medication errors and accidents. Hosp Pharm.
1994;29:319-30. PubMed
22. Langley GJ, Moen R, Nolan KM, Nolan TW, Norman CL, Provost LP. The Improvement
Guide: A Practical Approach to Enhancing Organizational Performance.
San Franciso, CA: John Wiley & Sons; 2009. 
23. Flesch R. A new readability yardstick. J Appl Psychol. 1948;32:221-233. PubMed
24. McLaughlin GH. SMOG grading-a new readability formula. J Reading.
1969;12:639-646.
25. Coleman M, Liau TL. A computer readability formula designed for machine scoring.
J Appl Psych. 1975;60:283. 
26. Gunning R. {The Technique of Clear Writing}. 1952.
27. Smith EA, Senter R. Automated readability index. AMRL-TR Aerospace Medical
Research Laboratories (6570th) 1967:1. PubMed
28. How readable is your writing. 2011. https://readability-score.com. Accessed September
23, 2016.
An Official Publication of the Society of Hospital Medicine Journal of Hospital Medicine Vol 12 | No 7 | July 2017 557
Improving Readability of Discharge Instructions | Unaka et al
29. Yin HS, Gupta RS, Tomopoulos S, et al. Readability, suitability, and characteristics
of asthma action plans: examination of factors that may impair understanding.
Pediatrics. 2013;131:e116-E126. PubMed
30. Brigo F, Otte WM, Igwe SC, Tezzon F, Nardone R. Clearly written, easily comprehended?
The readability of websites providing information on epilepsy. Epilepsy
Behav. 2015;44:35-39. PubMed
31. Benneyan JC. Use and interpretation of statistical quality control charts. Int J
Qual Health Care. 1998;10:69-73. PubMed
32. Mueller SK, Giannelli K, Boxer R, Schnipper JL. Readability of patient discharge
instructions with and without the use of electronically available disease-specific
templates. J Am Med Inform Assoc. 2015;22:857-863. PubMed
33. Lauster CD, Gibson JM, DiNella JV, DiNardo M, Korytkowski MT, Donihi AC.
Implementation of standardized instructions for insulin at hospital discharge.
J Hosp Med. 2009;4:E41-E42. PubMed
34. Tuso P, Huynh DN, Garofalo L, et al. The readmission reduction program of
Kaiser Permanente Southern California-knowledge transfer and performance improvement.
Perm J. 2013;17:58-63. PubMed
35. White CM, Statile AM, White DL, et al. Using quality improvement to optimise
paediatric discharge efficiency. BMJ Qual Saf. 2014;23:428-436. PubMed
36. Mussman GM, Vossmeyer MT, Brady PW, Warrick DM, Simmons JM, White CM.
Improving the reliability of verbal communication between primary care physicians
and pediatric hospitalists at hospital discharge. J Hosp Med. 2015;10:574-
580. PubMed
37. Rothman RL, Yin HS, Mulvaney S, Co JP, Homer C, Lannon C. Health literacy
and quality: focus on chronic illness care and patient safety. Pediatrics
2009;124(suppl 3):S315-S326. PubMed
38. Turner T, Cull WL, Bayldon B, et al. Pediatricians and health literacy: descriptive
results from a national survey. Pediatrics. 2009;124(suppl 3):S299-S305. PubMed
39. Harrington KF, Haven KM, Bailey WC, Gerald LB. Provider perceptions of parent
health literacy and effect on asthma treatment: recommendations and instructions.
Pediatr Allergy immunol Pulmonol. 2013;26:69-75. PubMed
40. Yin HS, Parker RM, Wolf MS, et al. Health literacy assessment of labeling of
pediatric nonprescription medications: examination of characteristics that may
impair parent understanding. Acad Pediatr. 2012;12:288-296. PubMed

Article PDF
Issue
Journal of Hospital Medicine 12(7)
Topics
Page Number
551-557
Sections
Files
Files
Article PDF
Article PDF

The transition from hospital to home can be overwhelming for caregivers.1 Stress of hospitalization coupled with the expectation of families to execute postdischarge care plans make understandable discharge communication critical. Communication failures, inadequate education, absence of caregiver confidence, and lack of clarity regarding care plans may prohibit smooth transitions and lead to adverse postdischarge outcomes.2-4

Health literacy plays a pivotal role in caregivers’ capacity to navigate the healthcare system, comprehend, and execute care plans. An estimated 90 million Americans have limited health literacy that may negatively impact the provision of safe and quality care5,6 and be a risk factor for poor outcomes, including increased emergency department (ED) utilization and readmission rates.7-9 Readability strongly influences the effectiveness of written materials.10 However, written medical information for patients and families are frequently between the 10th and 12th grade reading levels; more than 75% of all pediatric health information is written at or above 10th grade reading level.11 Government agencies recommend between a 6th and 8th grade reading level, for written material;5,12,13 written discharge instructions have been identified as an important quality metric for hospital-to-home transitions.14-16

At our center, we found that discharge instructions were commonly written at high reading levels and often incomplete.17 Poor discharge instructions may contribute to increased readmission rates and unnecessary ED visits.9,18 Our global aim targeted improved health-literate written information, including understandability and completeness.

Our specific aim was to increase the percentage of discharge instructions written at or below the 7th grade level for hospital medicine (HM) patients on a community hospital pediatric unit from 13% to 80% in 6 months.

METHODS

Context

The improvement work took place at a 42-bed inpatient pediatric unit at a community satellite of our large, urban, academic hospital. The unit is staffed by medical providers including attendings, fellows, nurse practitioners (NPs), and senior pediatric residents, and had more than 1000 HM discharges in fiscal year 2016. Children with common general pediatric diagnoses are admitted to this service; postsurgical patients are not admitted primarily to the HM service. In Cincinnati, the neighborhood-level high school drop-out rates are as high as 64%.19 Discharge instructions are written by medical providers in the electronic health record (EHR). A printed copy is given to families and verbally reviewed by a bedside nurse prior to discharge. Quality improvement (QI) efforts focused on discharge instructions were ignited by a prior review of 200 discharge instructions that showed they were difficult to read (median reading level of 10th grade), poorly understandable (36% of instructions met the threshold of understandability as measured by the Patient Education Materials Assessment Tool20) and were missing key elements of information.17

 

 

Improvement Team

The improvement team consisted of 4 pediatric hospitalists, 2 NPs, 1 nurse educator with health literacy expertise, 1 pediatric resident, 1 fourth-year medical student, 1 QI consultant, and 2 parents who had first-hand experience on the HM service. The improvement team observed the discharge process, including roles of the provider, nurse and family, outlined a process map, and created a modified failure mode and effect analysis.21 Prior to our work, discharge instructions written by providers often occurred as a last step, and the content was created as free text or from nonstandardized templates. Key drivers that informed interventions were determined and revised over time (Figure 1). The study was reviewed by our institutional review board and deemed not human subjects research.

Figure 1
Improvement Activities

Key drivers were identified, and interventions were executed using Plan-Do Study-Act cycles.22 The key drivers thought to be critical for the success of the QI efforts were family engagement; standardization of discharge instructions; medical staff engagement; and audit and feedback of data. The corresponding interventions were as follows:

Family Engagement

Understanding the discharge information families desired. Prior to testing, 10 families admitted to the HM service were asked about the discharge experience. We asked families about information they wanted in written discharge instructions: 1) reasons to call your primary doctor or return to the hospital; 2) when to see your primary doctor for a follow-up visit; 3) the phone number to reach your child’s doctor; 4) more information about why your child was admitted; 5) information about new medications; and 6) what to do to help your child continue to recover at home.

Development of templates. We engaged families throughout the process of creating general and disease-specific discharge templates. After a specific template was created and reviewed by the parents on our team, it was sent to members of the institutional Patient Education Committee, which includes parents and local health literacy experts, to review and critique. Feedback from the reviewers was incorporated into the templates prior to use in the EHR.

Postdischarge phone calls.A convenience sample of families discharged from the satellite campus was called 24 to 48 hours after discharge over a 2-week period in January, 2016. A member of our improvement team solicited feedback from families about the quality of the discharge instructions. Families were asked if discharge instructions were reviewed with them prior to going home, if they were given a copy of the instructions, how they would rate the ability to read and use the information, and if there were additional pieces of information that would have improved the instructions.

Standardization of Instructions

Education. A presentation was created and shared with medical providers; it was re-disseminated monthly to new residents rotating onto the service and to the attendings, fellows, and NPs scheduled for shifts during the month. This education continued for the duration of the study. The presentation included the definition of health literacy, scope of the problem, examples of poorly written discharge instructions, and tips on how to write readable and understandable instructions. Laminated cards that included tips on how to write instructions were also placed on work stations.

Figure 2
Creation of discharge instruction templates in the EHR.A general discharge instruction template that was initially created and tested in the EHR (Figure 2) included text written below the 7th grade and employed 14 point font, bolded words for emphasis, and lists with bullet points. Asterisks were used to indicate where providers needed to include patient-specific information. The sections included in the general template were informed by feedback from providers and parents prior to testing, parents on the improvement team, and parents of patients admitted to our satellite campus. The sections reflect components critical to successful postdischarge care: discharge diagnosis and its brief description, postdischarge care information, new medications, signs and symptoms that would warrant escalation of care to the patient’s primary care provider or the ED, and follow-up instructions and contact information for the patent’s primary care doctor.

While the general template was an important first step, the content relied heavily on free text by providers, which could still lead to instructions written at a high reading level. Thus, disease-specific discharge instruction templates were created with prepopulated information that was written at a reading level at or below 7th grade level (Figure 2). The diseases were prioritized based on the most common diagnoses on our HM service. Each template included information under each of the subheadings noted in the general template. Twelve disease-specific templates were tested and ultimately embedded in the EHR; the general template remained for use when the discharge diagnosis was not covered by a disease-specific template.

 

 

Medical Staff Engagement

Previously described tests of change also aimed to enhance staff engagement. These included frequent e-mails, discussion of the QI efforts at specific team meetings, and the creation of visual cues posted at computer work stations, which prompted staff to begin to work on discharge instructions soon after admission.

Audit and Feedback of Data

Weekly phone calls. One team updated clinicians through a regularly scheduled bi-weekly phone conference. The phone conference was established prior to our work and was designed to relay pertinent information to attendings and NPs who work at the satellite hospital. During the phone conferences, clinicians were notified of current performance on discharge instruction readability and specific tests of change for the week. Additionally, providers gave feedback about the improvement efforts. These updates continued for the first 6 months of the project until sustained improvements were observed.

E-mails. Weekly e-mails were sent to all providers scheduled for clinical time at the satellite campus. The e-mail contained information on current tests of change, a list of discharge instruction templates that were available in the EHR, and the annotated run chart illustrating readability levels over time.

Additionally, individual e-mails were sent to each provider after review of the written discharge instructions for the week. Providers were given information on the number of discharge instructions they personally composed, the percentage of those instructions that were written at or below 7th grade level, and specific feedback on how their written instructions could be improved. We also encouraged feedback from each provider to better identify barriers to achieving our goal.

Study of the Interventions

Baseline data included a review of all instructions for patients discharged from the satellite campus from the end of April 2015 through mid-September 2015. The time period for testing of interventions during the fall and winter months allowed for rapid cycle learning due to higher patient census and predictability of admissions for specific diagnosis (ie, asthma and bronchiolitis). An automated report was generated from the EHR weekly with specific demographics and identifiers for patient discharged over the past 7 days, including patient age, gender, length of stay, discharge diagnosis, and insurance classification. Data was collected during the intervention period via structured review of the discharge instructions in the EHR by the principal investigator or a trained research coordinator. Discharge instructions for medically cleared mental health patients admitted to hospital medicine while awaiting psychiatric bed availability and patients and parents who were non-English speaking were excluded from review. All other instructions for patients discharged from the HM service at our Liberty Campus were included for review.

Measures

Readability, our primary measure of interest, was calculated using the mean score from the following formulas: Flesch Kincaid Grade Level,23 Simple Measure of Gobbledygook Index,24 Coleman-Liau Index,25 Gunning-Fog Index,26 and Automated Readability Index27 by means of an online platform (https://readability-score.com).28 This platform was chosen because it incorporated a variety of formulas, was user-friendly, and required minimal data cleaning. Each of the readability formulas have been used to assesses readability of health information given to patients and families.29,30 The threshold of 7th grade is in alignment with our institutional policy for educational materials and with recommendations from several government agencies.5,12

Analysis

A statistical process control p-chart was used to analyze our primary measure of readability, dichotomized as percent discharge instructions written at or below 7th grade level. Run charts were used to follow mean reading level of discharge instructions and our process measure of percent of discharge instruction written with a general or disease-specific standardized template. Run chart and control chart rules for identifying special cause were used for midline shifts.31

Table

RESULTS

The Table includes the demographic and clinical information of patients included in our analyses. Through sequential interventions, the percentage of discharge instructions written at or below 7th grade readability level increased from a mean of 13% to more than 80% in 3 months (Figure 3). Furthermore, the mean was sustained above 90% for 10 months and at 98% for the last 4 months. The use of 1 of the 13 EHR templates increased from 0% to 96% and was associated with the largest impact on the overall improvements (Supplemental Figure 1). Additionally, the average reading level of the discharge instructions decreased from 10th grade to 6th grade level (Supplemental Figure 2).

Figure 3

Qualitative comments from providers about the discharge instructions included:

“Are these [discharge instructions] available at base??  Great resource for interns.”
“These [discharge] instructions make the [discharge] process so easy!!! Love these...”
“Also feel like they have helped my discharge teaching in the room!”

Qualitative comments from families postdischarge included:
“I thought the instructions were very clear and easy to read. I especially thought that highlighting the important areas really helped.”
“I think this form looks great, and I really like the idea of having your child’s name on it.”

 

 

DISCUSSION

Through sequential Plan-Do Study-Act cycles, we increased the percentage of discharge instructions written at or below 7th grade reading level from 13% to 98%. Our most impactful intervention was the creation and dissemination of standardized disease-specific discharge instruction templates. Our findings complement evidence in the adult and pediatric literature that the use of standardized, disease-specific discharge instruction templates may improve readability of instructions.32,33 And, while quality improvement efforts have been employed to improve the discharge process for patients,34-36 this is the first study in the inpatient setting that, to our knowledge, specifically addresses discharge instructions using quality improvement methods.

Our work targeted the critical intersection between individual health literacy, an individual’s capacity to acquire, interpret, and use health information, and the necessary changes needed within our healthcare system to ensure that appropriately written instructions are given to patients and families.17,37 Our efforts focused on improving discharge instructions answer the call to consider health literacy a modifiable clinical risk factor.37 Furthermore, we address the 6 aims for quality healthcare delivery: 1) safe, timely, efficient and equitable delivery of care through the creation and dissemination of standardized instructions that are written at the appropriate reading level for families to ease hospital-to-home transitions and streamline the workflow of medical providers; 2) effective education of medical providers on health literacy concepts; and 3) family-centeredness through the involvement of families in our QI efforts. While previous QI efforts to improve hospital-to-home transitions have focused on medication reconciliation, communication with primary care physicians, follow-up appointments, and timely discharges of patients, none have specifically focused on the quality of discharge instructions.34-36

Most physicians do not receive education about how to write information that is readable and understandable; more than half of providers desired more education in this area.38 Furthermore, pediatric providers may overestimate parental health literacy levels,39 which may contribute to variability in the readability of written health materials. While education alone can contribute to a provider’s ability to create readable instructions, we note the improvement after the introduction of disease templates to demonstrate the importance of workflow-integrated higher reliability interventions to sustain improvements.

Our baseline poor readability rates were due to limited knowledge by frontline providers composing the instructions and a system in which an important element for successful hospital-to-home transitions was not tackled until patients were ready for discharge. Streamlining of the discharge process, including the creation of discharge instructions, may lead to improved efficiency, fewer discrepancies, more effective communication, and an enhanced family experience. Moreover, the success of our improvement work was due to key stakeholders, including parents, being a part of the team and the notable buy-in from providers.

Our work was not without limitations. We excluded non-English speaking families from the study. We were unable to measure reading level of our population directly and instead based our goals on national estimates. Our primary measure was readability, which is only 1 piece contributing to quality discharge instructions. Understandability and actionability are also important considerations; 17,20,29,40 however, improvements in these areas were limited by our design options within the EHR. Our efforts focused on children with common general pediatric diagnoses, and it is unclear how our interventions would generalize to medically complex patients with more volume of information to communicate at discharge and with uncommon diagnoses that are less readily incorporated into standardized templates. Relatedly, our work occurred at the satellite campus of our tertiary care center and may not represent generalizable material or methods to implement templates at our main campus location or at other hospitals. To begin to better understand this, we have spread to HM patients at our main campus, including medically complex patients with technology dependence and/or neurological impairments. Standardized, disease-specific templates most relevant to this population as well as several patient specific templates, for those with frequent readmissions due to medical complexity, have been created and are actively being tested.

CONCLUSION

In conclusion, in using interventions targeted at standardization of discharge instructions and timely feedback to staff, we saw rapid, dramatic, and sustained improvement in the readability of discharge instructions. Next steps include adaptation and spread to other patient populations and care teams, collaborations with other centers, and assessing the impact of effectively written discharge instructions on patient outcomes, such as adverse drug events, readmission rates, and family experience.

Disclosure

No external funding was secured for this study. Dr. Brady is supported by a Patient-Centered Outcomes Research Mentored Clinical Investigator Award from the Agency for Healthcare Research and Quality, Award Number K08HS023827. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding organizations. The funding organization had no role in the design, preparation, review, or approval of this paper; nor the decision to submit the manuscript for publication. The authors have no financial relationships relevant to this article to disclose.

The transition from hospital to home can be overwhelming for caregivers.1 Stress of hospitalization coupled with the expectation of families to execute postdischarge care plans make understandable discharge communication critical. Communication failures, inadequate education, absence of caregiver confidence, and lack of clarity regarding care plans may prohibit smooth transitions and lead to adverse postdischarge outcomes.2-4

Health literacy plays a pivotal role in caregivers’ capacity to navigate the healthcare system, comprehend, and execute care plans. An estimated 90 million Americans have limited health literacy that may negatively impact the provision of safe and quality care5,6 and be a risk factor for poor outcomes, including increased emergency department (ED) utilization and readmission rates.7-9 Readability strongly influences the effectiveness of written materials.10 However, written medical information for patients and families are frequently between the 10th and 12th grade reading levels; more than 75% of all pediatric health information is written at or above 10th grade reading level.11 Government agencies recommend between a 6th and 8th grade reading level, for written material;5,12,13 written discharge instructions have been identified as an important quality metric for hospital-to-home transitions.14-16

At our center, we found that discharge instructions were commonly written at high reading levels and often incomplete.17 Poor discharge instructions may contribute to increased readmission rates and unnecessary ED visits.9,18 Our global aim targeted improved health-literate written information, including understandability and completeness.

Our specific aim was to increase the percentage of discharge instructions written at or below the 7th grade level for hospital medicine (HM) patients on a community hospital pediatric unit from 13% to 80% in 6 months.

METHODS

Context

The improvement work took place at a 42-bed inpatient pediatric unit at a community satellite of our large, urban, academic hospital. The unit is staffed by medical providers including attendings, fellows, nurse practitioners (NPs), and senior pediatric residents, and had more than 1000 HM discharges in fiscal year 2016. Children with common general pediatric diagnoses are admitted to this service; postsurgical patients are not admitted primarily to the HM service. In Cincinnati, the neighborhood-level high school drop-out rates are as high as 64%.19 Discharge instructions are written by medical providers in the electronic health record (EHR). A printed copy is given to families and verbally reviewed by a bedside nurse prior to discharge. Quality improvement (QI) efforts focused on discharge instructions were ignited by a prior review of 200 discharge instructions that showed they were difficult to read (median reading level of 10th grade), poorly understandable (36% of instructions met the threshold of understandability as measured by the Patient Education Materials Assessment Tool20) and were missing key elements of information.17

 

 

Improvement Team

The improvement team consisted of 4 pediatric hospitalists, 2 NPs, 1 nurse educator with health literacy expertise, 1 pediatric resident, 1 fourth-year medical student, 1 QI consultant, and 2 parents who had first-hand experience on the HM service. The improvement team observed the discharge process, including roles of the provider, nurse and family, outlined a process map, and created a modified failure mode and effect analysis.21 Prior to our work, discharge instructions written by providers often occurred as a last step, and the content was created as free text or from nonstandardized templates. Key drivers that informed interventions were determined and revised over time (Figure 1). The study was reviewed by our institutional review board and deemed not human subjects research.

Figure 1
Improvement Activities

Key drivers were identified, and interventions were executed using Plan-Do Study-Act cycles.22 The key drivers thought to be critical for the success of the QI efforts were family engagement; standardization of discharge instructions; medical staff engagement; and audit and feedback of data. The corresponding interventions were as follows:

Family Engagement

Understanding the discharge information families desired. Prior to testing, 10 families admitted to the HM service were asked about the discharge experience. We asked families about information they wanted in written discharge instructions: 1) reasons to call your primary doctor or return to the hospital; 2) when to see your primary doctor for a follow-up visit; 3) the phone number to reach your child’s doctor; 4) more information about why your child was admitted; 5) information about new medications; and 6) what to do to help your child continue to recover at home.

Development of templates. We engaged families throughout the process of creating general and disease-specific discharge templates. After a specific template was created and reviewed by the parents on our team, it was sent to members of the institutional Patient Education Committee, which includes parents and local health literacy experts, to review and critique. Feedback from the reviewers was incorporated into the templates prior to use in the EHR.

Postdischarge phone calls.A convenience sample of families discharged from the satellite campus was called 24 to 48 hours after discharge over a 2-week period in January, 2016. A member of our improvement team solicited feedback from families about the quality of the discharge instructions. Families were asked if discharge instructions were reviewed with them prior to going home, if they were given a copy of the instructions, how they would rate the ability to read and use the information, and if there were additional pieces of information that would have improved the instructions.

Standardization of Instructions

Education. A presentation was created and shared with medical providers; it was re-disseminated monthly to new residents rotating onto the service and to the attendings, fellows, and NPs scheduled for shifts during the month. This education continued for the duration of the study. The presentation included the definition of health literacy, scope of the problem, examples of poorly written discharge instructions, and tips on how to write readable and understandable instructions. Laminated cards that included tips on how to write instructions were also placed on work stations.

Figure 2
Creation of discharge instruction templates in the EHR.A general discharge instruction template that was initially created and tested in the EHR (Figure 2) included text written below the 7th grade and employed 14 point font, bolded words for emphasis, and lists with bullet points. Asterisks were used to indicate where providers needed to include patient-specific information. The sections included in the general template were informed by feedback from providers and parents prior to testing, parents on the improvement team, and parents of patients admitted to our satellite campus. The sections reflect components critical to successful postdischarge care: discharge diagnosis and its brief description, postdischarge care information, new medications, signs and symptoms that would warrant escalation of care to the patient’s primary care provider or the ED, and follow-up instructions and contact information for the patent’s primary care doctor.

While the general template was an important first step, the content relied heavily on free text by providers, which could still lead to instructions written at a high reading level. Thus, disease-specific discharge instruction templates were created with prepopulated information that was written at a reading level at or below 7th grade level (Figure 2). The diseases were prioritized based on the most common diagnoses on our HM service. Each template included information under each of the subheadings noted in the general template. Twelve disease-specific templates were tested and ultimately embedded in the EHR; the general template remained for use when the discharge diagnosis was not covered by a disease-specific template.

 

 

Medical Staff Engagement

Previously described tests of change also aimed to enhance staff engagement. These included frequent e-mails, discussion of the QI efforts at specific team meetings, and the creation of visual cues posted at computer work stations, which prompted staff to begin to work on discharge instructions soon after admission.

Audit and Feedback of Data

Weekly phone calls. One team updated clinicians through a regularly scheduled bi-weekly phone conference. The phone conference was established prior to our work and was designed to relay pertinent information to attendings and NPs who work at the satellite hospital. During the phone conferences, clinicians were notified of current performance on discharge instruction readability and specific tests of change for the week. Additionally, providers gave feedback about the improvement efforts. These updates continued for the first 6 months of the project until sustained improvements were observed.

E-mails. Weekly e-mails were sent to all providers scheduled for clinical time at the satellite campus. The e-mail contained information on current tests of change, a list of discharge instruction templates that were available in the EHR, and the annotated run chart illustrating readability levels over time.

Additionally, individual e-mails were sent to each provider after review of the written discharge instructions for the week. Providers were given information on the number of discharge instructions they personally composed, the percentage of those instructions that were written at or below 7th grade level, and specific feedback on how their written instructions could be improved. We also encouraged feedback from each provider to better identify barriers to achieving our goal.

Study of the Interventions

Baseline data included a review of all instructions for patients discharged from the satellite campus from the end of April 2015 through mid-September 2015. The time period for testing of interventions during the fall and winter months allowed for rapid cycle learning due to higher patient census and predictability of admissions for specific diagnosis (ie, asthma and bronchiolitis). An automated report was generated from the EHR weekly with specific demographics and identifiers for patient discharged over the past 7 days, including patient age, gender, length of stay, discharge diagnosis, and insurance classification. Data was collected during the intervention period via structured review of the discharge instructions in the EHR by the principal investigator or a trained research coordinator. Discharge instructions for medically cleared mental health patients admitted to hospital medicine while awaiting psychiatric bed availability and patients and parents who were non-English speaking were excluded from review. All other instructions for patients discharged from the HM service at our Liberty Campus were included for review.

Measures

Readability, our primary measure of interest, was calculated using the mean score from the following formulas: Flesch Kincaid Grade Level,23 Simple Measure of Gobbledygook Index,24 Coleman-Liau Index,25 Gunning-Fog Index,26 and Automated Readability Index27 by means of an online platform (https://readability-score.com).28 This platform was chosen because it incorporated a variety of formulas, was user-friendly, and required minimal data cleaning. Each of the readability formulas have been used to assesses readability of health information given to patients and families.29,30 The threshold of 7th grade is in alignment with our institutional policy for educational materials and with recommendations from several government agencies.5,12

Analysis

A statistical process control p-chart was used to analyze our primary measure of readability, dichotomized as percent discharge instructions written at or below 7th grade level. Run charts were used to follow mean reading level of discharge instructions and our process measure of percent of discharge instruction written with a general or disease-specific standardized template. Run chart and control chart rules for identifying special cause were used for midline shifts.31

Table

RESULTS

The Table includes the demographic and clinical information of patients included in our analyses. Through sequential interventions, the percentage of discharge instructions written at or below 7th grade readability level increased from a mean of 13% to more than 80% in 3 months (Figure 3). Furthermore, the mean was sustained above 90% for 10 months and at 98% for the last 4 months. The use of 1 of the 13 EHR templates increased from 0% to 96% and was associated with the largest impact on the overall improvements (Supplemental Figure 1). Additionally, the average reading level of the discharge instructions decreased from 10th grade to 6th grade level (Supplemental Figure 2).

Figure 3

Qualitative comments from providers about the discharge instructions included:

“Are these [discharge instructions] available at base??  Great resource for interns.”
“These [discharge] instructions make the [discharge] process so easy!!! Love these...”
“Also feel like they have helped my discharge teaching in the room!”

Qualitative comments from families postdischarge included:
“I thought the instructions were very clear and easy to read. I especially thought that highlighting the important areas really helped.”
“I think this form looks great, and I really like the idea of having your child’s name on it.”

 

 

DISCUSSION

Through sequential Plan-Do Study-Act cycles, we increased the percentage of discharge instructions written at or below 7th grade reading level from 13% to 98%. Our most impactful intervention was the creation and dissemination of standardized disease-specific discharge instruction templates. Our findings complement evidence in the adult and pediatric literature that the use of standardized, disease-specific discharge instruction templates may improve readability of instructions.32,33 And, while quality improvement efforts have been employed to improve the discharge process for patients,34-36 this is the first study in the inpatient setting that, to our knowledge, specifically addresses discharge instructions using quality improvement methods.

Our work targeted the critical intersection between individual health literacy, an individual’s capacity to acquire, interpret, and use health information, and the necessary changes needed within our healthcare system to ensure that appropriately written instructions are given to patients and families.17,37 Our efforts focused on improving discharge instructions answer the call to consider health literacy a modifiable clinical risk factor.37 Furthermore, we address the 6 aims for quality healthcare delivery: 1) safe, timely, efficient and equitable delivery of care through the creation and dissemination of standardized instructions that are written at the appropriate reading level for families to ease hospital-to-home transitions and streamline the workflow of medical providers; 2) effective education of medical providers on health literacy concepts; and 3) family-centeredness through the involvement of families in our QI efforts. While previous QI efforts to improve hospital-to-home transitions have focused on medication reconciliation, communication with primary care physicians, follow-up appointments, and timely discharges of patients, none have specifically focused on the quality of discharge instructions.34-36

Most physicians do not receive education about how to write information that is readable and understandable; more than half of providers desired more education in this area.38 Furthermore, pediatric providers may overestimate parental health literacy levels,39 which may contribute to variability in the readability of written health materials. While education alone can contribute to a provider’s ability to create readable instructions, we note the improvement after the introduction of disease templates to demonstrate the importance of workflow-integrated higher reliability interventions to sustain improvements.

Our baseline poor readability rates were due to limited knowledge by frontline providers composing the instructions and a system in which an important element for successful hospital-to-home transitions was not tackled until patients were ready for discharge. Streamlining of the discharge process, including the creation of discharge instructions, may lead to improved efficiency, fewer discrepancies, more effective communication, and an enhanced family experience. Moreover, the success of our improvement work was due to key stakeholders, including parents, being a part of the team and the notable buy-in from providers.

Our work was not without limitations. We excluded non-English speaking families from the study. We were unable to measure reading level of our population directly and instead based our goals on national estimates. Our primary measure was readability, which is only 1 piece contributing to quality discharge instructions. Understandability and actionability are also important considerations; 17,20,29,40 however, improvements in these areas were limited by our design options within the EHR. Our efforts focused on children with common general pediatric diagnoses, and it is unclear how our interventions would generalize to medically complex patients with more volume of information to communicate at discharge and with uncommon diagnoses that are less readily incorporated into standardized templates. Relatedly, our work occurred at the satellite campus of our tertiary care center and may not represent generalizable material or methods to implement templates at our main campus location or at other hospitals. To begin to better understand this, we have spread to HM patients at our main campus, including medically complex patients with technology dependence and/or neurological impairments. Standardized, disease-specific templates most relevant to this population as well as several patient specific templates, for those with frequent readmissions due to medical complexity, have been created and are actively being tested.

CONCLUSION

In conclusion, in using interventions targeted at standardization of discharge instructions and timely feedback to staff, we saw rapid, dramatic, and sustained improvement in the readability of discharge instructions. Next steps include adaptation and spread to other patient populations and care teams, collaborations with other centers, and assessing the impact of effectively written discharge instructions on patient outcomes, such as adverse drug events, readmission rates, and family experience.

Disclosure

No external funding was secured for this study. Dr. Brady is supported by a Patient-Centered Outcomes Research Mentored Clinical Investigator Award from the Agency for Healthcare Research and Quality, Award Number K08HS023827. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding organizations. The funding organization had no role in the design, preparation, review, or approval of this paper; nor the decision to submit the manuscript for publication. The authors have no financial relationships relevant to this article to disclose.

References

1. Solan LG, Beck AF, Brunswick SA, et al. The family perspective on hospital to
home transitions: a qualitative study. Pediatrics. 2015;136:e1539-e1549. PubMed
2. Engel KG, Buckley BA, Forth VE, et al. Patient understanding of emergency
department discharge instructions: where are knowledge deficits greatest? Acad
Emerg Med. 2012;19:E1035-E1044. PubMed
3. Ashbrook L, Mourad M, Sehgal N. Communicating discharge instructions to patients:
a survey of nurse, intern, and hospitalist practices. J Hosp Med. 2013;8:
36-41. PubMed
4. Kripalani S, Jacobson TA, Mugalla IC, Cawthon CR, Niesner KJ, Vaccarino V.
Health literacy and the quality of physician-patient communication during hospitalization.
J Hosp Med. 2010;5:269-275. PubMed
5. Institute of Medicine Committee on Health Literacy. Kindig D, Alfonso D, Chudler
E, et al, eds. Health Literacy: A Prescription to End Confusion. Washington,
DC: National Academies Press; 2004. 
6. Yin HS, Johnson M, Mendelsohn AL, Abrams MA, Sanders LM, Dreyer BP. The
health literacy of parents in the United States: a nationally representative study.
Pediatrics. 2009;124(suppl 3):S289-S298. PubMed
7. Rak EC, Hooper SR, Belsante MJ, et al. Caregiver word reading literacy and
health outcomes among children treated in a pediatric nephrology practice. Clin
Kid J. 2016;9:510-515. PubMed
8. Morrison AK, Schapira MM, Gorelick MH, Hoffmann RG, Brousseau DC. Low
caregiver health literacy is associated with higher pediatric emergency department
use and nonurgent visits. Acad Pediatr. 2014;14:309-314. PubMed
9. Howard-Anderson J, Busuttil A, Lonowski S, Vangala S, Afsar-Manesh N. From
discharge to readmission: Understanding the process from the patient perspective.
J Hosp Med. 2016;11:407-412. PubMed
10. Doak CC, Doak LG, Root JH. Teaching Patients with Low Literacy Skills. 2nd ed.
Philadelphia PA: J.B. Lippincott; 1996. PubMed
11. Berkman ND, Sheridan SL, Donahue KE, et al. Health literacy interventions and
outcomes: an updated systematic review. Evid Rep/Technol Assess. 2011;199:1-941. PubMed
12. Prevention CfDCa. Health Literacy for Public Health Professionals. In: Prevention
CfDCa, ed. Atlanta, GA2009. 
13. “What Did the Doctor Say?” Improving Health Literacy to Protect Patient Safety.
Oakbrook Terrace, IL: The Joint Commission, 2007. 
14. Desai AD, Burkhart Q, Parast L, et al. Development and pilot testing of caregiver-
reported pediatric quality measures for transitions between sites of care. Acad
Pediatr. 2016;16:760-769. PubMed
15. Leyenaar JK, Desai AD, Burkhart Q, et al. Quality measures to assess care transitions
for hospitalized children. Pediatrics. 2016;138(2). PubMed
16. Akinsola B, Cheng J, Zmitrovich A, Khan N, Jain S. Improving discharge instructions
in a pediatric emergency department: impact of a quality initiative. Pediatr
Emerg Care. 2017;33:10-13. PubMed
17. Unaka NI, Statile AM, Haney J, Beck AF, Brady PW, Jerardi K. Assessment of
the readability, understandability and completeness of pediatric hospital medicine
discharge instructions J Hosp Med. In press. PubMed
18. Stella SA, Allyn R, Keniston A, et al. Postdischarge problems identified by telephone
calls to an advice line. J Hosp Med. 2014;9:695-699. PubMed
19. Maloney M, Auffrey C. The social areas of Cincinnati.
20. The Patient Education Materials Assessment Tool (PEMAT) and User’s Guide:
An Instrument To Assess the Understandability and Actionability of Print and
Audiovisual Patient Education Materials. Available at: http://www.ahrq.gov/
professionals/prevention-chronic-care/improve/self-mgmt/pemat/index.html. Accessed
November 27, 2013.
21. Cohen MR, Senders J, Davis NM. Failure mode and effects analysis: a novel
approach to avoiding dangerous medication errors and accidents. Hosp Pharm.
1994;29:319-30. PubMed
22. Langley GJ, Moen R, Nolan KM, Nolan TW, Norman CL, Provost LP. The Improvement
Guide: A Practical Approach to Enhancing Organizational Performance.
San Franciso, CA: John Wiley & Sons; 2009. 
23. Flesch R. A new readability yardstick. J Appl Psychol. 1948;32:221-233. PubMed
24. McLaughlin GH. SMOG grading-a new readability formula. J Reading.
1969;12:639-646.
25. Coleman M, Liau TL. A computer readability formula designed for machine scoring.
J Appl Psych. 1975;60:283. 
26. Gunning R. {The Technique of Clear Writing}. 1952.
27. Smith EA, Senter R. Automated readability index. AMRL-TR Aerospace Medical
Research Laboratories (6570th) 1967:1. PubMed
28. How readable is your writing. 2011. https://readability-score.com. Accessed September
23, 2016.
An Official Publication of the Society of Hospital Medicine Journal of Hospital Medicine Vol 12 | No 7 | July 2017 557
Improving Readability of Discharge Instructions | Unaka et al
29. Yin HS, Gupta RS, Tomopoulos S, et al. Readability, suitability, and characteristics
of asthma action plans: examination of factors that may impair understanding.
Pediatrics. 2013;131:e116-E126. PubMed
30. Brigo F, Otte WM, Igwe SC, Tezzon F, Nardone R. Clearly written, easily comprehended?
The readability of websites providing information on epilepsy. Epilepsy
Behav. 2015;44:35-39. PubMed
31. Benneyan JC. Use and interpretation of statistical quality control charts. Int J
Qual Health Care. 1998;10:69-73. PubMed
32. Mueller SK, Giannelli K, Boxer R, Schnipper JL. Readability of patient discharge
instructions with and without the use of electronically available disease-specific
templates. J Am Med Inform Assoc. 2015;22:857-863. PubMed
33. Lauster CD, Gibson JM, DiNella JV, DiNardo M, Korytkowski MT, Donihi AC.
Implementation of standardized instructions for insulin at hospital discharge.
J Hosp Med. 2009;4:E41-E42. PubMed
34. Tuso P, Huynh DN, Garofalo L, et al. The readmission reduction program of
Kaiser Permanente Southern California-knowledge transfer and performance improvement.
Perm J. 2013;17:58-63. PubMed
35. White CM, Statile AM, White DL, et al. Using quality improvement to optimise
paediatric discharge efficiency. BMJ Qual Saf. 2014;23:428-436. PubMed
36. Mussman GM, Vossmeyer MT, Brady PW, Warrick DM, Simmons JM, White CM.
Improving the reliability of verbal communication between primary care physicians
and pediatric hospitalists at hospital discharge. J Hosp Med. 2015;10:574-
580. PubMed
37. Rothman RL, Yin HS, Mulvaney S, Co JP, Homer C, Lannon C. Health literacy
and quality: focus on chronic illness care and patient safety. Pediatrics
2009;124(suppl 3):S315-S326. PubMed
38. Turner T, Cull WL, Bayldon B, et al. Pediatricians and health literacy: descriptive
results from a national survey. Pediatrics. 2009;124(suppl 3):S299-S305. PubMed
39. Harrington KF, Haven KM, Bailey WC, Gerald LB. Provider perceptions of parent
health literacy and effect on asthma treatment: recommendations and instructions.
Pediatr Allergy immunol Pulmonol. 2013;26:69-75. PubMed
40. Yin HS, Parker RM, Wolf MS, et al. Health literacy assessment of labeling of
pediatric nonprescription medications: examination of characteristics that may
impair parent understanding. Acad Pediatr. 2012;12:288-296. PubMed

References

1. Solan LG, Beck AF, Brunswick SA, et al. The family perspective on hospital to
home transitions: a qualitative study. Pediatrics. 2015;136:e1539-e1549. PubMed
2. Engel KG, Buckley BA, Forth VE, et al. Patient understanding of emergency
department discharge instructions: where are knowledge deficits greatest? Acad
Emerg Med. 2012;19:E1035-E1044. PubMed
3. Ashbrook L, Mourad M, Sehgal N. Communicating discharge instructions to patients:
a survey of nurse, intern, and hospitalist practices. J Hosp Med. 2013;8:
36-41. PubMed
4. Kripalani S, Jacobson TA, Mugalla IC, Cawthon CR, Niesner KJ, Vaccarino V.
Health literacy and the quality of physician-patient communication during hospitalization.
J Hosp Med. 2010;5:269-275. PubMed
5. Institute of Medicine Committee on Health Literacy. Kindig D, Alfonso D, Chudler
E, et al, eds. Health Literacy: A Prescription to End Confusion. Washington,
DC: National Academies Press; 2004. 
6. Yin HS, Johnson M, Mendelsohn AL, Abrams MA, Sanders LM, Dreyer BP. The
health literacy of parents in the United States: a nationally representative study.
Pediatrics. 2009;124(suppl 3):S289-S298. PubMed
7. Rak EC, Hooper SR, Belsante MJ, et al. Caregiver word reading literacy and
health outcomes among children treated in a pediatric nephrology practice. Clin
Kid J. 2016;9:510-515. PubMed
8. Morrison AK, Schapira MM, Gorelick MH, Hoffmann RG, Brousseau DC. Low
caregiver health literacy is associated with higher pediatric emergency department
use and nonurgent visits. Acad Pediatr. 2014;14:309-314. PubMed
9. Howard-Anderson J, Busuttil A, Lonowski S, Vangala S, Afsar-Manesh N. From
discharge to readmission: Understanding the process from the patient perspective.
J Hosp Med. 2016;11:407-412. PubMed
10. Doak CC, Doak LG, Root JH. Teaching Patients with Low Literacy Skills. 2nd ed.
Philadelphia PA: J.B. Lippincott; 1996. PubMed
11. Berkman ND, Sheridan SL, Donahue KE, et al. Health literacy interventions and
outcomes: an updated systematic review. Evid Rep/Technol Assess. 2011;199:1-941. PubMed
12. Prevention CfDCa. Health Literacy for Public Health Professionals. In: Prevention
CfDCa, ed. Atlanta, GA2009. 
13. “What Did the Doctor Say?” Improving Health Literacy to Protect Patient Safety.
Oakbrook Terrace, IL: The Joint Commission, 2007. 
14. Desai AD, Burkhart Q, Parast L, et al. Development and pilot testing of caregiver-
reported pediatric quality measures for transitions between sites of care. Acad
Pediatr. 2016;16:760-769. PubMed
15. Leyenaar JK, Desai AD, Burkhart Q, et al. Quality measures to assess care transitions
for hospitalized children. Pediatrics. 2016;138(2). PubMed
16. Akinsola B, Cheng J, Zmitrovich A, Khan N, Jain S. Improving discharge instructions
in a pediatric emergency department: impact of a quality initiative. Pediatr
Emerg Care. 2017;33:10-13. PubMed
17. Unaka NI, Statile AM, Haney J, Beck AF, Brady PW, Jerardi K. Assessment of
the readability, understandability and completeness of pediatric hospital medicine
discharge instructions J Hosp Med. In press. PubMed
18. Stella SA, Allyn R, Keniston A, et al. Postdischarge problems identified by telephone
calls to an advice line. J Hosp Med. 2014;9:695-699. PubMed
19. Maloney M, Auffrey C. The social areas of Cincinnati.
20. The Patient Education Materials Assessment Tool (PEMAT) and User’s Guide:
An Instrument To Assess the Understandability and Actionability of Print and
Audiovisual Patient Education Materials. Available at: http://www.ahrq.gov/
professionals/prevention-chronic-care/improve/self-mgmt/pemat/index.html. Accessed
November 27, 2013.
21. Cohen MR, Senders J, Davis NM. Failure mode and effects analysis: a novel
approach to avoiding dangerous medication errors and accidents. Hosp Pharm.
1994;29:319-30. PubMed
22. Langley GJ, Moen R, Nolan KM, Nolan TW, Norman CL, Provost LP. The Improvement
Guide: A Practical Approach to Enhancing Organizational Performance.
San Franciso, CA: John Wiley & Sons; 2009. 
23. Flesch R. A new readability yardstick. J Appl Psychol. 1948;32:221-233. PubMed
24. McLaughlin GH. SMOG grading-a new readability formula. J Reading.
1969;12:639-646.
25. Coleman M, Liau TL. A computer readability formula designed for machine scoring.
J Appl Psych. 1975;60:283. 
26. Gunning R. {The Technique of Clear Writing}. 1952.
27. Smith EA, Senter R. Automated readability index. AMRL-TR Aerospace Medical
Research Laboratories (6570th) 1967:1. PubMed
28. How readable is your writing. 2011. https://readability-score.com. Accessed September
23, 2016.
An Official Publication of the Society of Hospital Medicine Journal of Hospital Medicine Vol 12 | No 7 | July 2017 557
Improving Readability of Discharge Instructions | Unaka et al
29. Yin HS, Gupta RS, Tomopoulos S, et al. Readability, suitability, and characteristics
of asthma action plans: examination of factors that may impair understanding.
Pediatrics. 2013;131:e116-E126. PubMed
30. Brigo F, Otte WM, Igwe SC, Tezzon F, Nardone R. Clearly written, easily comprehended?
The readability of websites providing information on epilepsy. Epilepsy
Behav. 2015;44:35-39. PubMed
31. Benneyan JC. Use and interpretation of statistical quality control charts. Int J
Qual Health Care. 1998;10:69-73. PubMed
32. Mueller SK, Giannelli K, Boxer R, Schnipper JL. Readability of patient discharge
instructions with and without the use of electronically available disease-specific
templates. J Am Med Inform Assoc. 2015;22:857-863. PubMed
33. Lauster CD, Gibson JM, DiNella JV, DiNardo M, Korytkowski MT, Donihi AC.
Implementation of standardized instructions for insulin at hospital discharge.
J Hosp Med. 2009;4:E41-E42. PubMed
34. Tuso P, Huynh DN, Garofalo L, et al. The readmission reduction program of
Kaiser Permanente Southern California-knowledge transfer and performance improvement.
Perm J. 2013;17:58-63. PubMed
35. White CM, Statile AM, White DL, et al. Using quality improvement to optimise
paediatric discharge efficiency. BMJ Qual Saf. 2014;23:428-436. PubMed
36. Mussman GM, Vossmeyer MT, Brady PW, Warrick DM, Simmons JM, White CM.
Improving the reliability of verbal communication between primary care physicians
and pediatric hospitalists at hospital discharge. J Hosp Med. 2015;10:574-
580. PubMed
37. Rothman RL, Yin HS, Mulvaney S, Co JP, Homer C, Lannon C. Health literacy
and quality: focus on chronic illness care and patient safety. Pediatrics
2009;124(suppl 3):S315-S326. PubMed
38. Turner T, Cull WL, Bayldon B, et al. Pediatricians and health literacy: descriptive
results from a national survey. Pediatrics. 2009;124(suppl 3):S299-S305. PubMed
39. Harrington KF, Haven KM, Bailey WC, Gerald LB. Provider perceptions of parent
health literacy and effect on asthma treatment: recommendations and instructions.
Pediatr Allergy immunol Pulmonol. 2013;26:69-75. PubMed
40. Yin HS, Parker RM, Wolf MS, et al. Health literacy assessment of labeling of
pediatric nonprescription medications: examination of characteristics that may
impair parent understanding. Acad Pediatr. 2012;12:288-296. PubMed

Issue
Journal of Hospital Medicine 12(7)
Issue
Journal of Hospital Medicine 12(7)
Page Number
551-557
Page Number
551-557
Topics
Article Type
Display Headline
Improving the readability of pediatric hospital medicine discharge instructions
Display Headline
Improving the readability of pediatric hospital medicine discharge instructions
Sections
Article Source

© 2017 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
*Address for correspondence and reprint requests: Ndidi I. Unaka, Division of Hospital Medicine, Cincinnati Children’s Hospital Medical Center, 3333 Burnet Ave., ML 5018, Cincinnati, OH 45229; Telephone: 513-636-8354; Fax: 513-636-7905; E-mail: [email protected]
Content Gating
Open Access (article Unlocked/Open Access)
Alternative CME
Disqus Comments
Default
Use ProPublica
Article PDF Media
Media Files

Dosing accuracy of direct oral anticoagulants in an academic medical center

Article Type
Changed
Display Headline
Dosing accuracy of direct oral anticoagulants in an academic medical center

Direct-acting oral anticoagulants (DOACs) have been introduced into clinical use for stroke prevention in patients with nonvalvular atrial fibrillation (NVAF), prevention of venous thrombosis after hip or knee surgery, and treatment of deep vein thrombosis (DVT) and pulmonary embolism (PE).1-7 Advantages of DOACs over warfarin are often stated as fixed dosing, minor drug and food interactions, wider therapeutic index, and no need for laboratory test monitoring.1,8 Yet, recommended DOAC dosages vary by renal function and therapeutic indications. Dosing recommendations for prevention of stroke in patients with NVAF are based on estimated creatinine clearance (dabigatran, rivaroxaban, edoxaban), age (apixaban), weight (apixaban, edoxaban), serum creatinine level (apixaban, edoxaban), and presence of cirrhosis by Child-Pugh class9,10 (apixaban, edoxaban).4-6,11,12 Dosing recommendations based on coadministration of strong CYP34A and P-glycoprotein inhibitors or inducers vary by DOAC. In addition, dabigatran cannot be crushed and must be stored in its original packaging, and rivaroxaban should be taken with food when the dose is over 10 mg.

We studied DOAC prescribing in adults admitted to a large academic medical center by comparing initial prescribed dosing with FDA-approved prescribing information. We hypothesized that the complexity of DOAC dosing may not be recognized by prescribers.

METHODS

Our study protocol was approved by the Committee on Human Research (Institutional Review Board) of the University of California San Francisco.

Data Collection

We used electronic medical records (EMRs) to identify adult inpatients who were prescribed a DOAC (apixaban, dabigatran, edoxaban, or rivaroxaban) at the University of California San Francisco Medical Center, a large academic hospital, between July 1, 2014 and June 30, 2015. Demographic and medical information related to therapeutic indications, contraindications, and indications for dose adjustments were collected and included diagnoses classified by International Classification of Diseases, Ninth Revision (ICD-9) and Tenth Revision (ICD-10) for venous thromboses; phlebitis or thrombophlebitis; PE or venous embolism; atrial arrhythmias; surgical procedures; cirrhosis and/or ascites or liver disease; coagulopathies; artificial heart valves or implanted devices; prior use of medications including parenteral anticoagulants; and laboratory data obtained before the first DOAC order (serum creatinine level, estimated glomerular filtration rate [eGFR] determined by Chronic Kidney Disease Epidemiology Collaboration,13 international normalized ratio, or, if available, activated partial thromboplastin time and bilirubin level). Creatinine clearance was calculated with the Cockcroft-Gault method14 using total body weight, per drug label recommendation. Child-Pugh class was calculated if cirrhosis was diagnosed.10 DOAC dose, frequency, dosing directions, and prescriber medical specialty were determined.

 

 

Accuracy of search results was confirmed by review of the first 200 patients’ records. Records were manually reviewed for encounters lacking ICD-9/10 codes and approved DOAC indications (30%) and encounters having multiple coded diagnostic indications (to identify the indication). ICD-9 codes for venous thrombosis were reviewed to differentiate acute from chronic events.

Data Analysis

The main outcome was concordance or discordance between the first DOAC prescribing order and the FDA-approved prescribing information at the time. Initial classification, performed by 2 independent reviewers (a pharmacist and a physician, or 2 pharmacists), was followed by adjudication and individual record review (by 2 independent reviewers) of all initial prescribing orders classified as discordant. A third reviewer adjudicated any disagreement. Records and notes were reviewed to identify stated or potential reasons for dosing variation and pre-admission prescriptions. Data are presented as means and standard deviations (SDs) and as raw numbers and percentages. Differences in patient characteristics by DOAC or therapeutic indication were determined by analysis of variance (ANOVA) with Bonferroni correction for post hoc comparisons. Dosing information was categorized as the same as recommended, lower than recommended, higher than recommended, or avoid drug use (drug–drug or drug–disease interaction), per FDA-approved prescribing information, and χ2 tests were used to determine whether variation in dosing occurred by individual DOAC, therapeutic indication, or prescriber specialty. Relationships between dosing variation and age or renal function were tested by ANOVA with Bonferroni correction for post hoc comparisons.

RESULTS

Table 1
There were 635 admissions with apixaban, dabigatran, or rivaroxaban prescribed for 508 patients (Table 1). Edoxaban was not on the formulary and not prescribed during the period studied. The therapeutic indication was prevention of embolic stroke in patients with atrial fibrillation/flutter or AF (465 admissions, or 73%, with valvular disease and/or tissue valve in 35), chronic DVT (67 admissions, or 11%, with active malignancy in 14), acute DVT (32 admissions, with malignancy in 2), chronic PE (23 admissions, with malignancy in 3), acute PE (19 admissions, with malignancy in 4), and DVT prevention after hip or knee surgery (19 admissions). DOACs were prescribed for unapproved indications in 10 admissions, and these were excluded from further analysis (mural thrombus in 3 admissions, low ejection fraction in 2, bedrest immobilization in 2, aortic aneurysm in 1, thrombocytosis in 1, and extensive superficial venous thrombosis in 1) (Table 2).

Table 2

Patients with AF were older with lower creatinine clearance compared to patients with other diagnoses. Mean (SD) patient age was 72.1 (12.7) years for AF, 53.1 (10.9) years for chronic PE, 55.5 (14) years for acute PE, 56.4 (15.9) years for chronic DVT, 57.9 (18.4) years for acute DVT, and 61.4 (11.6) years for DVT prevention after hip or knee surgery (P < 0.0001 for all comparisons). Mean (SD) estimated creatinine clearance was 76.8 (43.5) mL/min for AF, 92.4 (44.4) mL/min for DVT prevention after hip or knee surgery, 111 (53) mL/min for chronic DVT, 118 (55) mL/min for acute DVT, 126 (60) mL/min for chronic PE, and 127 (54) mL/min for acute PE (P < 0.0001 for all comparisons). Differences between patient groups by therapeutic indication were not detected for weight, body mass index, or serum creatinine level.

The most frequent deviation from prescribing recommendations was omission of directions to administer rivaroxaban with food—93% (248/268) of orders—but not for DVT prevention after hip or knee surgery, for which the 10-mg dose is appropriately administered without food. Doses were the same as recommended for 82% of apixaban orders, 84% of rivaroxaban orders, and 93% of initial dabigatran orders (P < 0.05 for differences; Table 3). Dosages not concordant with FDA recommendations were prescribed in 44 (18.1%) of 243 apixaban orders, 41 (14.3%) of 286 rivaroxaban orders, and 7 (7.2%) of 89 initial dabigatran orders. Lower than recommended doses were more common than higher than recommended doses (Table 3, Figure 1): 15.2% versus 2.1% of apixaban orders, 9.4% versus 3.5% of rivaroxaban orders, and 4.2% versus 1.0% of initial dabigatran orders (P < 0.05). Failure to avoid drug use (for potential drug–drug or drug–disease interactions) was uncommon (1%-2%). There were more deviations from recommended doses for patients with AF or DVT prevention after hip or knee surgery than for patients with acute or chronic PE or acute DVT (Table 3). No significant differences were detected between prescribed and recommended doses by prescriber specialty.

Table 3
In most cases, a reason for deviating from FDA dosing recommendations was not stated in the EMR. The exception was fluctuating renal function, which was cited in 8 cases.

Figure

For apixaban, patients who were prescribed lower than recommended doses were older than those prescribed recommended doses: mean (SD), 78.1 (12.2) years versus 71 (13.6) years (P = 0.003). Seventy-six percent of those prescribed lower than recommended doses were older than 75. Prescriptions for apixaban at lower than recommended doses were continuations of prior outpatient prescriptions in 20 of 37 cases (almost half), and in 12 cases (one-fourth) antiplatelet drugs were coprescribed (aspirin in 10 cases, clopidogrel in 1, prasugrel in 1). For rivaroxaban, older age was associated with both lower than recommended dosing (P = 0.003) and higher than recommended dosing (P < 0.001). Variations from prescribing recommendations were continuations of outpatient rivaroxaban doses in about two-thirds (26 of 41; 63.4 %) with 13 receiving antiplatelet drugs. For dabigatran, 6 of 7 orders not in agreement with recommendations were continuations of outpatient dosing.

The specific equation used to estimate renal function also had the potential to lead to dosing errors. Among the 41 rivaroxaban patients categorized as receiving doses discordant with recommendations, 8 would have had an inappropriate DOAC dose if eGFR were used instead of eCrCL as recommended. No relationships were detected for other patient variables/measures and dosing deviations from recommendations.

 

 

DISCUSSION

We examined initial hospital orders for DOACs in adults admitted to a single academic medical center during 2014-2015. Dabigatran, apixaban and rivaroxaban were prescribed for prevention of stroke in patients with atrial fibrillation/flutter (AF) in three quarters of the encounters similar to national patterns. (15) Prescribing departures from FDA-approved recommendations ranged from failure to prescribe rivaroxaban with food to failure to recognize drug-drug interactions in 1% to 2%. Unexpectedly, lower than recommended dosing was more common than higher than recommended dosing of the three DOACs.

Rivaroxaban bioavailability is dose dependent with the presence of food required to enhance absorption for doses over 10 mg that are used for prevention of stroke in patients with non-valvular AF or treatment of DVT or PE.5,16 Peak rivaroxaban concentrations are 75% higher and the total area under the concentration vs. time curve after dosing is 40% higher when rivaroxaban is administered with high fat high calorie meals compared to the fasting state.16 If rivaroxaban is not administered with food, drug concentrations and pharmacologic effects may be less than in clinical trials that specified co-administration with food.17-19 A small survey of outpatients receiving rivaroxaban found that 23% reported taking it without food.20 With electronic pharmacy systems in almost all hospitals and electronic prescriber order entry in most, automated addition of directions for rivaroxaban administration with food for doses over 10 mg to labels or dispensing instructions could easily correct this deviation from recommended practice.

Lower than recommended doses were prescribed in 9.4% of orders for rivaroxaban and 15.2% of orders for apixaban, with dose-deviations often appearing to be a continuation of outpatient doses. Patients 75 years or older were more likely to receive lower than recommended dosing of apixaban. Reductions in apixaban doses from 5 mg twice daily to 2.5 mg twice daily are recommended in patients with non-valvular AF with two of the following criteria: age ≥80 y, weight ≤60 kg, serum creatinine ≥1.5 mg/dL or co-administration of a strong PgP inhibitor to a patient without 2 of the 3 dose reduction criteria. Our study was not designed to determine reasons for under-dosing, but we speculate that clinicians may have considered patients aged 75-79 years to be similar to those 80 years of age or older, or, older and not as healthy as those enrolled in randomized trials.21-25 The median age of our patients with AF receiving apixaban was 75y (interquartile range of 16) vs 70y ( interquartile range 63-76) in the pivotal trial comparing warfarin to apixaban.21 Renal function was also lower with 37% having eCrCL below 50 mL/min compared to 17% in ARISTOTLE. (21). Twenty-six percent of our apixaban-treated AF patients qualified for the lower 2.5 mg twice daily compared to only 5% of ARISTOTLE participants,21 further suggesting differences between patients in our sample compared to randomized trial participants.

Concerns regarding bleeding or falls in older patients, may also have contributed to lower than recommended doses. Recent analyses of patients at risk for falls confirmed that increased risk of falling was associated with more bone fractures, bleeding and all-cause death but not stroke or systemic emboli, and with less severe bleeding with the DOAC edoxaban compared to warfarin.26 While a rationale for personalized or lower than recommended dosing of apixaban may exist in very old patients and those at risk of falls and bleeding, more data are needed to determine outcomes of lower than recommended doses of DOACs before such an approach can be endorsed. Monitoring of anticoagulant effect in patients who receive doses lower than those investigated in clinical trials could provide important information. The assays that measure DOAC effects are likely to be more available because of the use of reversal agents in the setting of bleeding with DOACs.27

We had anticipated higher than recommended dosing for rivaroxaban as recommendations are based on creatinine clearance while laboratories routinely report estimated glomerular filtration rate (eGFR) that can provide higher estimates of renal clearance and estimated DOAC doses in older and smaller individuals.28 Higher than recommended dosing was found in only 3.5% of our sample. In half, eGFR estimates were higher than creatinine clearance estimates. An international postmarketing registry of rivaroxaban use for the prevention of stroke in patients with NVAF, which included outpatients, found that 36% of those with creatinine clearances below 50 mL/min received a dose higher than recommended, and 15% received a dose lower than expected.29 A more recent outpatient registry report on patients with NVAF, in which apixaban, dabigatran, or rivaroxaban was administered, found that overall 9.4% received a dose lower than recommended, and 3.4% were overdosed, with a similar percentage (34%) of rivaroxaban patients with creatinine clearance of 15 to 50 mL/min receiving higher than recommended dosing.30 The lower rate of higher-than-recommended doses that we observed may have been related to the routine measurement of serum creatinine and attention to dosing adjustments for renal function in the inpatient setting compared to the outpatient setting. In addition, renal function data may not be available to outpatient pharmacies, limiting potential input on dosing recommendations. At least one cardiac society recommends monitoring of renal function in patients treated with DOACs, annually in patients with normal estimated creatinine clearance and more frequently (at intervals in months equal to the creatinine clearance divided by 10) in patients with abnormal creatinine clearance.11 A hospital encounter provides an opportunity to assess or reassess renal status to optimize DOAC dosing.

Dabigatran was the first DOAC introduced into use in the United States with the same dose recommended for prevention of stroke in patients with AF or venous thromboembolic disease with reductions for creatinine clearance below 30 mL/min or creatinine clearance between 30 and 50 mL/min and concomitant use of the potent P-glycoprotein inhibitor dronedarone or systemic ketoconazole. The relative simplicity of dosing may have been responsible for the lowest rate of prescribing outside of recommendations observed in this study, but the low dabigatran use limits analyses of contributing factors.

Failure to avoid drug use in combination with use of strong P-glycoprotein inducers or inhibitors was infrequent but should be preventable. Current prescribing recommendations refer to “strong” P-glycoprotein inhibitors and list different specific agents that interact with each DOAC without a standardized definition or classification. Standardized classifications or reference sources would be helpful.

Our primary goal in this study was to compare initial prescribed dosing of DOACs with FDA-approved prescribing directions. However, therapeutic indication data warrant discussion. In our sample, 7.5% of patients with AF had bioprosthetic valves or recent mitral valve repair or replacement. Using the NVAF definition found in the 2014 AHA/ACC/HRS (American Heart Association, American College of Cardiology, Heart Rhythm Society) AF guidelines1—“absence of rheumatic mitral valve disease, a prosthetic heart valve, or mitral valve repair”—these patients would not appear to be candidates for DOACs. However, arguments have been made that a bioprosthetic heart valve or native valve after valve repair does not have a risk profile for thromboembolism that differs from other forms of NVAF and would be equally responsive to DOAC therapy.31 Data are sparse, but retrospective subanalyses of limited numbers of patients with valvular disease (including bioprosthesis and mitral repair patients but excluding mechanical valve patients) enrolled in the pivotal DOAC studies support this conclusion.32 For the first months after biological valve replacement (including catheter-based valve replacement), recent European guidelines recommend vitamin K antagonists but also state, “NOACs probably deliver the same protection.”8 DOACs were also used for management of venous thromboembolic disease (both acute and chronic) in patients with active cancer. Our data predate the most recent American College of Chest Physician guidelines on treatment of venous thromboembolism in patients with cancer, which provide grade 2B recommendations for use of low-molecular-weight heparin (LMWH) over vitamin K antagonists and grade 2C recommendations for use of LMWH over dabigatran, rivaroxaban, apixaban, or edoxaban.33

Our study had several limitations. First, data were from a single US academic medical center, though similar rates of prescribing deviation from recommendations have been reported for rivaroxaban and dabigatran in NVAF patients in other countries.29,34 Second, therapeutic indications may have been misclassified because of errors, incomplete EMR data, or multiple indications. Third, we analyzed the first DOAC order and not dispensing information or subsequent corrections. Therefore, deviations from recommendations should not be interpreted as errors that reached patients. We evaluated dosing based on the measures used at the time of hospital admission, noting that, in a significant fraction of deviations from recommended doses, they represented continuations of outpatient doses when renal function or weight may have differed, and it is unknown whether patients were counseled to take rivaroxaban with food in the outpatient setting. Fourth, the number of patients with acute DVT was small, so firm conclusions cannot be drawn for this specific population. Fifth, our estimates of off-label dosing may have been underestimates, as data on cancer and cancer activity or cardiac valvular disease may not have been complete.

 

 

CONCLUSION

Healthcare professionals are prescribing DOACs in ways that differ from recommendations. These differences may reflect the older ages and reduced renal function of clinical populations relative to randomized clinical trial groups, but they could also potentially alter clinical efficacy. Our findings support the need to evaluate the appropriateness and dosing of DOACs at each encounter and to determine the outcomes of patients treated with lower than recommended doses of DOACs and the outcomes of DOAC-treated patients with bioprostheses or active malignancies.

Acknowledgment

The authors thank Tobias Schmelzinger for electronic data extraction and compilation and University of California San Francisco students Eduardo De La Torre Cruz (School of Pharmacy) and Carlos Mikell (School of Medicine) for assistance with data review.

Disclosure

Dr. Schwartz reports receiving personal fees from Bristol-Myers Squibb and Amgen and grants from Bristol-Myers Squibb and Pfizer, outside the submitted work. The other authors have nothing to report.

 

References

1. January CT, Wann LS, Alpert JS, et al. 2014 AHA/ACC/HRS guideline for the management of patients with atrial fibrillation: executive summary. A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the Heart Rhythm Society. J Am Coll Cardiol. 2014;64(21):2246-2280. PubMed
2. Saraf K, Morris PD, Garg P, Sheridan P, Storey R. Non–vitamin K antagonist oral anticoagulants (NOACs): clinical evidence and therapeutic considerations. Postgrad Med J. 2014;90(1067):520-528. PubMed
3. Yeh CH, Gross PL, Weitz JI. Evolving use of new oral anticoagulants for treatment of venous thromboembolism. Blood. 2014;124(7):1020-1028. PubMed
4. Pradaxa website. https://www.pradaxa.com. Accessed June 1, 2017.
5. Xarelto website. https://www.xarelto-us.com. Accessed June 1, 2017.
6. Eliquis website. http://www.eliquis.com. Accessed June 1, 2017.
7. Savaysa [prescribing information]. Tokyo, Japan: Daiichi Sankyo; 2015.
8. Kirchhof P, Benussi S, Kotecha D, et al. 2016 ESC guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. 2016;37(38):2893-2962. PubMed
9. Child C, Turcotte J. Surgery and portal hypertension. In: Child CG, ed. The Liver and Portal Hypertension. Philadelphia, PA: Saunders; 1964:50-64. PubMed
10. Pugh RN, Murray-Lyon IM, Dawson JL, Pietroni MC, Williams R. Transection of the oesophagus for bleeding oesophageal varices. Br J Surg. 1973;60(8):646-649. PubMed
11. Heidbuchel H, Verhamme P, Alings M, et al. Updated European Heart Rhythm Association practical guide on the use of non–vitamin K antagonist anticoagulants in patients with non-valvular atrial fibrillation. Europace. 2015;17(10):1467-1507. PubMed
12. Savaysa website. https://savaysahcp.com. Accessed June 1, 2017.
13. Levey AS, Stevens LA, Schmid CH, et al; CKD-EPI (Chronic Kidney Disease Epidemiology Collaboration). A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150(9):604-612. PubMed
14. Cockcroft DW, Gault MH. Prediction of creatinine clearance from serum creatinine. Nephron. 1976;16(1):31-41. PubMed
15. Rose AJ, Reisman JI, Allen AL, Miller DR. Potentially inappropriate prescribing of direct-acting oral anticoagulants in the Veterans Health Administration. Am J Pharm Benefits. 2016;4(4):e75-e80.
16. Stampfuss J, Kubitza D, Becka M, Mueck W. The effect of food on the absorption and pharmacokinetics of rivaroxaban. Int J Clin Pharmacol Ther. 2013;51(7):549-561. PubMed
17. Patel MR, Mahaffey KW, Garg J, et al; ROCKET AF Investigators. Rivaroxaban versus warfarin in nonvalvular atrial fibrillation. N Engl J Med. 2011;365(10):883-891. PubMed
18. EINSTEIN Investigators, Bauersachs R, Berkowitz SD, et al. Oral rivaroxaban for symptomatic venous thromboembolism. N Engl J Med. 2010;363(26):2499-2510. PubMed
19. EINSTEIN-PE Investigators, Büller HR, Prins MH, et al. Oral rivaroxaban for the treatment of symptomatic pulmonary embolism. N Engl J Med. 2012;366(14):1287-1297. PubMed
20. Simon J, Hawes E, Deyo Z, Bryant-Shilliday B. Evaluation of prescribing and patient use of target-specific oral anticoagulants in the outpatient setting. J Clin Pharm Ther. 2015;40(5):525-530. PubMed
21. Granger CB, Alexander JH, McMurray JJ, et al; ARISTOTLE Committees and Investigators. Apixaban versus warfarin in patients with atrial fibrillation. N Engl J Med. 2011;365(11):981-992. PubMed
22. Ruff CT, Giugliano RP, Braunwald E, et al. Comparison of the efficacy and safety of new oral anticoagulants with warfarin in patients with atrial fibrillation: a meta-analysis of randomised trials. Lancet. 2014;383(9921):955-962. PubMed
23. van der Hulle T, Kooiman J, den Exter PL, Dekkers OM, Klok FA, Huisman MV. Effectiveness and safety of novel oral anticoagulants as compared with vitamin K antagonists in the treatment of acute symptomatic venous thromboembolism: a systematic review and meta-analysis. J Thromb Haemost. 2014;12(3):320-328. PubMed
24. Schuh T, Reichardt B, Finsterer J, Stöllberger C. Age-dependency of prescribing patterns of oral anticoagulant drugs in Austria during 2011–2014. J Thromb Thrombolysis. 2016;42(3):447-451. PubMed
25. Stöllberger C, Brooks R, Finsterer J, Pachofszky T. Use of direct-acting oral anticoagulants in nonagenarians: a call for more data. Drugs Aging. 2016;33(5):315-320. PubMed
26. Steffel J, Giugliano RP, Braunwald E, et al. Edoxaban versus warfarin in atrial fibrillation patients at risk of falling: ENGAGE AF-TIMI 48 analysis. J Am Coll Cardiol. 2016;68(11):1169-1178. PubMed
27. Ruff CT, Giugliano RP, Antman EM. Management of bleeding with non–vitamin K antagonist oral anticoagulants in the era of specific reversal agents. Circulation. 2016;134(3):248-261. PubMed
28. Schwartz JB. Potential impact of substituting estimated glomerular filtration rate for estimated creatinine clearance for dosing of direct oral anticoagulants. J Am Geriatr Soc. 2016;64(10):1996-2002. PubMed
29. Camm AJ, Amarenco P, Haas S, et al; XANTUS Investigators. XANTUS: a real-world, prospective, observational study of patients treated with rivaroxaban for stroke prevention in atrial fibrillation. Eur Heart J. 2016;37(14):1145-1153. PubMed
30. Steinberg BA, Shrader P, Thomas L, et al; ORBIT-AF Investigators and Patients. Off-label dosing of non–vitamin K antagonist oral anticoagulants and adverse outcomes: the ORBIT-AF II Registry. J Am Coll Cardiol. 2016;68(24):2597-2604. PubMed
31. Fauchier L, Philippart R, Clementy N, et al. How to define valvular atrial fibrillation? Arch Cardiovasc Dis. 2015;108(10):530-539. PubMed
32. Di Biase L. Use of direct oral anticoagulants in patients with atrial fibrillation and valvular heart lesions. J Am Heart Assoc. 2016;5(2). PubMed

33. Kearon C, Akl EA, Ornelas J, et al. Antithrombotic therapy for VTE disease:
CHEST guideline and expert panel report. Chest. 2016;149(2):315-352. PubMed
34. Larock AS, Mullier F, Sennesael AL, et al. Appropriateness of prescribing dabigatran
etexilate and rivaroxaban in patients with nonvalvular atrial fibrillation: a prospective
study. Ann Pharmacother. 2014;48(10):1258-1268. PubMed

Article PDF
Issue
Journal of Hospital Medicine 12(7)
Topics
Page Number
544-550
Sections
Article PDF
Article PDF

Direct-acting oral anticoagulants (DOACs) have been introduced into clinical use for stroke prevention in patients with nonvalvular atrial fibrillation (NVAF), prevention of venous thrombosis after hip or knee surgery, and treatment of deep vein thrombosis (DVT) and pulmonary embolism (PE).1-7 Advantages of DOACs over warfarin are often stated as fixed dosing, minor drug and food interactions, wider therapeutic index, and no need for laboratory test monitoring.1,8 Yet, recommended DOAC dosages vary by renal function and therapeutic indications. Dosing recommendations for prevention of stroke in patients with NVAF are based on estimated creatinine clearance (dabigatran, rivaroxaban, edoxaban), age (apixaban), weight (apixaban, edoxaban), serum creatinine level (apixaban, edoxaban), and presence of cirrhosis by Child-Pugh class9,10 (apixaban, edoxaban).4-6,11,12 Dosing recommendations based on coadministration of strong CYP34A and P-glycoprotein inhibitors or inducers vary by DOAC. In addition, dabigatran cannot be crushed and must be stored in its original packaging, and rivaroxaban should be taken with food when the dose is over 10 mg.

We studied DOAC prescribing in adults admitted to a large academic medical center by comparing initial prescribed dosing with FDA-approved prescribing information. We hypothesized that the complexity of DOAC dosing may not be recognized by prescribers.

METHODS

Our study protocol was approved by the Committee on Human Research (Institutional Review Board) of the University of California San Francisco.

Data Collection

We used electronic medical records (EMRs) to identify adult inpatients who were prescribed a DOAC (apixaban, dabigatran, edoxaban, or rivaroxaban) at the University of California San Francisco Medical Center, a large academic hospital, between July 1, 2014 and June 30, 2015. Demographic and medical information related to therapeutic indications, contraindications, and indications for dose adjustments were collected and included diagnoses classified by International Classification of Diseases, Ninth Revision (ICD-9) and Tenth Revision (ICD-10) for venous thromboses; phlebitis or thrombophlebitis; PE or venous embolism; atrial arrhythmias; surgical procedures; cirrhosis and/or ascites or liver disease; coagulopathies; artificial heart valves or implanted devices; prior use of medications including parenteral anticoagulants; and laboratory data obtained before the first DOAC order (serum creatinine level, estimated glomerular filtration rate [eGFR] determined by Chronic Kidney Disease Epidemiology Collaboration,13 international normalized ratio, or, if available, activated partial thromboplastin time and bilirubin level). Creatinine clearance was calculated with the Cockcroft-Gault method14 using total body weight, per drug label recommendation. Child-Pugh class was calculated if cirrhosis was diagnosed.10 DOAC dose, frequency, dosing directions, and prescriber medical specialty were determined.

 

 

Accuracy of search results was confirmed by review of the first 200 patients’ records. Records were manually reviewed for encounters lacking ICD-9/10 codes and approved DOAC indications (30%) and encounters having multiple coded diagnostic indications (to identify the indication). ICD-9 codes for venous thrombosis were reviewed to differentiate acute from chronic events.

Data Analysis

The main outcome was concordance or discordance between the first DOAC prescribing order and the FDA-approved prescribing information at the time. Initial classification, performed by 2 independent reviewers (a pharmacist and a physician, or 2 pharmacists), was followed by adjudication and individual record review (by 2 independent reviewers) of all initial prescribing orders classified as discordant. A third reviewer adjudicated any disagreement. Records and notes were reviewed to identify stated or potential reasons for dosing variation and pre-admission prescriptions. Data are presented as means and standard deviations (SDs) and as raw numbers and percentages. Differences in patient characteristics by DOAC or therapeutic indication were determined by analysis of variance (ANOVA) with Bonferroni correction for post hoc comparisons. Dosing information was categorized as the same as recommended, lower than recommended, higher than recommended, or avoid drug use (drug–drug or drug–disease interaction), per FDA-approved prescribing information, and χ2 tests were used to determine whether variation in dosing occurred by individual DOAC, therapeutic indication, or prescriber specialty. Relationships between dosing variation and age or renal function were tested by ANOVA with Bonferroni correction for post hoc comparisons.

RESULTS

Table 1
There were 635 admissions with apixaban, dabigatran, or rivaroxaban prescribed for 508 patients (Table 1). Edoxaban was not on the formulary and not prescribed during the period studied. The therapeutic indication was prevention of embolic stroke in patients with atrial fibrillation/flutter or AF (465 admissions, or 73%, with valvular disease and/or tissue valve in 35), chronic DVT (67 admissions, or 11%, with active malignancy in 14), acute DVT (32 admissions, with malignancy in 2), chronic PE (23 admissions, with malignancy in 3), acute PE (19 admissions, with malignancy in 4), and DVT prevention after hip or knee surgery (19 admissions). DOACs were prescribed for unapproved indications in 10 admissions, and these were excluded from further analysis (mural thrombus in 3 admissions, low ejection fraction in 2, bedrest immobilization in 2, aortic aneurysm in 1, thrombocytosis in 1, and extensive superficial venous thrombosis in 1) (Table 2).

Table 2

Patients with AF were older with lower creatinine clearance compared to patients with other diagnoses. Mean (SD) patient age was 72.1 (12.7) years for AF, 53.1 (10.9) years for chronic PE, 55.5 (14) years for acute PE, 56.4 (15.9) years for chronic DVT, 57.9 (18.4) years for acute DVT, and 61.4 (11.6) years for DVT prevention after hip or knee surgery (P < 0.0001 for all comparisons). Mean (SD) estimated creatinine clearance was 76.8 (43.5) mL/min for AF, 92.4 (44.4) mL/min for DVT prevention after hip or knee surgery, 111 (53) mL/min for chronic DVT, 118 (55) mL/min for acute DVT, 126 (60) mL/min for chronic PE, and 127 (54) mL/min for acute PE (P < 0.0001 for all comparisons). Differences between patient groups by therapeutic indication were not detected for weight, body mass index, or serum creatinine level.

The most frequent deviation from prescribing recommendations was omission of directions to administer rivaroxaban with food—93% (248/268) of orders—but not for DVT prevention after hip or knee surgery, for which the 10-mg dose is appropriately administered without food. Doses were the same as recommended for 82% of apixaban orders, 84% of rivaroxaban orders, and 93% of initial dabigatran orders (P < 0.05 for differences; Table 3). Dosages not concordant with FDA recommendations were prescribed in 44 (18.1%) of 243 apixaban orders, 41 (14.3%) of 286 rivaroxaban orders, and 7 (7.2%) of 89 initial dabigatran orders. Lower than recommended doses were more common than higher than recommended doses (Table 3, Figure 1): 15.2% versus 2.1% of apixaban orders, 9.4% versus 3.5% of rivaroxaban orders, and 4.2% versus 1.0% of initial dabigatran orders (P < 0.05). Failure to avoid drug use (for potential drug–drug or drug–disease interactions) was uncommon (1%-2%). There were more deviations from recommended doses for patients with AF or DVT prevention after hip or knee surgery than for patients with acute or chronic PE or acute DVT (Table 3). No significant differences were detected between prescribed and recommended doses by prescriber specialty.

Table 3
In most cases, a reason for deviating from FDA dosing recommendations was not stated in the EMR. The exception was fluctuating renal function, which was cited in 8 cases.

Figure

For apixaban, patients who were prescribed lower than recommended doses were older than those prescribed recommended doses: mean (SD), 78.1 (12.2) years versus 71 (13.6) years (P = 0.003). Seventy-six percent of those prescribed lower than recommended doses were older than 75. Prescriptions for apixaban at lower than recommended doses were continuations of prior outpatient prescriptions in 20 of 37 cases (almost half), and in 12 cases (one-fourth) antiplatelet drugs were coprescribed (aspirin in 10 cases, clopidogrel in 1, prasugrel in 1). For rivaroxaban, older age was associated with both lower than recommended dosing (P = 0.003) and higher than recommended dosing (P < 0.001). Variations from prescribing recommendations were continuations of outpatient rivaroxaban doses in about two-thirds (26 of 41; 63.4 %) with 13 receiving antiplatelet drugs. For dabigatran, 6 of 7 orders not in agreement with recommendations were continuations of outpatient dosing.

The specific equation used to estimate renal function also had the potential to lead to dosing errors. Among the 41 rivaroxaban patients categorized as receiving doses discordant with recommendations, 8 would have had an inappropriate DOAC dose if eGFR were used instead of eCrCL as recommended. No relationships were detected for other patient variables/measures and dosing deviations from recommendations.

 

 

DISCUSSION

We examined initial hospital orders for DOACs in adults admitted to a single academic medical center during 2014-2015. Dabigatran, apixaban and rivaroxaban were prescribed for prevention of stroke in patients with atrial fibrillation/flutter (AF) in three quarters of the encounters similar to national patterns. (15) Prescribing departures from FDA-approved recommendations ranged from failure to prescribe rivaroxaban with food to failure to recognize drug-drug interactions in 1% to 2%. Unexpectedly, lower than recommended dosing was more common than higher than recommended dosing of the three DOACs.

Rivaroxaban bioavailability is dose dependent with the presence of food required to enhance absorption for doses over 10 mg that are used for prevention of stroke in patients with non-valvular AF or treatment of DVT or PE.5,16 Peak rivaroxaban concentrations are 75% higher and the total area under the concentration vs. time curve after dosing is 40% higher when rivaroxaban is administered with high fat high calorie meals compared to the fasting state.16 If rivaroxaban is not administered with food, drug concentrations and pharmacologic effects may be less than in clinical trials that specified co-administration with food.17-19 A small survey of outpatients receiving rivaroxaban found that 23% reported taking it without food.20 With electronic pharmacy systems in almost all hospitals and electronic prescriber order entry in most, automated addition of directions for rivaroxaban administration with food for doses over 10 mg to labels or dispensing instructions could easily correct this deviation from recommended practice.

Lower than recommended doses were prescribed in 9.4% of orders for rivaroxaban and 15.2% of orders for apixaban, with dose-deviations often appearing to be a continuation of outpatient doses. Patients 75 years or older were more likely to receive lower than recommended dosing of apixaban. Reductions in apixaban doses from 5 mg twice daily to 2.5 mg twice daily are recommended in patients with non-valvular AF with two of the following criteria: age ≥80 y, weight ≤60 kg, serum creatinine ≥1.5 mg/dL or co-administration of a strong PgP inhibitor to a patient without 2 of the 3 dose reduction criteria. Our study was not designed to determine reasons for under-dosing, but we speculate that clinicians may have considered patients aged 75-79 years to be similar to those 80 years of age or older, or, older and not as healthy as those enrolled in randomized trials.21-25 The median age of our patients with AF receiving apixaban was 75y (interquartile range of 16) vs 70y ( interquartile range 63-76) in the pivotal trial comparing warfarin to apixaban.21 Renal function was also lower with 37% having eCrCL below 50 mL/min compared to 17% in ARISTOTLE. (21). Twenty-six percent of our apixaban-treated AF patients qualified for the lower 2.5 mg twice daily compared to only 5% of ARISTOTLE participants,21 further suggesting differences between patients in our sample compared to randomized trial participants.

Concerns regarding bleeding or falls in older patients, may also have contributed to lower than recommended doses. Recent analyses of patients at risk for falls confirmed that increased risk of falling was associated with more bone fractures, bleeding and all-cause death but not stroke or systemic emboli, and with less severe bleeding with the DOAC edoxaban compared to warfarin.26 While a rationale for personalized or lower than recommended dosing of apixaban may exist in very old patients and those at risk of falls and bleeding, more data are needed to determine outcomes of lower than recommended doses of DOACs before such an approach can be endorsed. Monitoring of anticoagulant effect in patients who receive doses lower than those investigated in clinical trials could provide important information. The assays that measure DOAC effects are likely to be more available because of the use of reversal agents in the setting of bleeding with DOACs.27

We had anticipated higher than recommended dosing for rivaroxaban as recommendations are based on creatinine clearance while laboratories routinely report estimated glomerular filtration rate (eGFR) that can provide higher estimates of renal clearance and estimated DOAC doses in older and smaller individuals.28 Higher than recommended dosing was found in only 3.5% of our sample. In half, eGFR estimates were higher than creatinine clearance estimates. An international postmarketing registry of rivaroxaban use for the prevention of stroke in patients with NVAF, which included outpatients, found that 36% of those with creatinine clearances below 50 mL/min received a dose higher than recommended, and 15% received a dose lower than expected.29 A more recent outpatient registry report on patients with NVAF, in which apixaban, dabigatran, or rivaroxaban was administered, found that overall 9.4% received a dose lower than recommended, and 3.4% were overdosed, with a similar percentage (34%) of rivaroxaban patients with creatinine clearance of 15 to 50 mL/min receiving higher than recommended dosing.30 The lower rate of higher-than-recommended doses that we observed may have been related to the routine measurement of serum creatinine and attention to dosing adjustments for renal function in the inpatient setting compared to the outpatient setting. In addition, renal function data may not be available to outpatient pharmacies, limiting potential input on dosing recommendations. At least one cardiac society recommends monitoring of renal function in patients treated with DOACs, annually in patients with normal estimated creatinine clearance and more frequently (at intervals in months equal to the creatinine clearance divided by 10) in patients with abnormal creatinine clearance.11 A hospital encounter provides an opportunity to assess or reassess renal status to optimize DOAC dosing.

Dabigatran was the first DOAC introduced into use in the United States with the same dose recommended for prevention of stroke in patients with AF or venous thromboembolic disease with reductions for creatinine clearance below 30 mL/min or creatinine clearance between 30 and 50 mL/min and concomitant use of the potent P-glycoprotein inhibitor dronedarone or systemic ketoconazole. The relative simplicity of dosing may have been responsible for the lowest rate of prescribing outside of recommendations observed in this study, but the low dabigatran use limits analyses of contributing factors.

Failure to avoid drug use in combination with use of strong P-glycoprotein inducers or inhibitors was infrequent but should be preventable. Current prescribing recommendations refer to “strong” P-glycoprotein inhibitors and list different specific agents that interact with each DOAC without a standardized definition or classification. Standardized classifications or reference sources would be helpful.

Our primary goal in this study was to compare initial prescribed dosing of DOACs with FDA-approved prescribing directions. However, therapeutic indication data warrant discussion. In our sample, 7.5% of patients with AF had bioprosthetic valves or recent mitral valve repair or replacement. Using the NVAF definition found in the 2014 AHA/ACC/HRS (American Heart Association, American College of Cardiology, Heart Rhythm Society) AF guidelines1—“absence of rheumatic mitral valve disease, a prosthetic heart valve, or mitral valve repair”—these patients would not appear to be candidates for DOACs. However, arguments have been made that a bioprosthetic heart valve or native valve after valve repair does not have a risk profile for thromboembolism that differs from other forms of NVAF and would be equally responsive to DOAC therapy.31 Data are sparse, but retrospective subanalyses of limited numbers of patients with valvular disease (including bioprosthesis and mitral repair patients but excluding mechanical valve patients) enrolled in the pivotal DOAC studies support this conclusion.32 For the first months after biological valve replacement (including catheter-based valve replacement), recent European guidelines recommend vitamin K antagonists but also state, “NOACs probably deliver the same protection.”8 DOACs were also used for management of venous thromboembolic disease (both acute and chronic) in patients with active cancer. Our data predate the most recent American College of Chest Physician guidelines on treatment of venous thromboembolism in patients with cancer, which provide grade 2B recommendations for use of low-molecular-weight heparin (LMWH) over vitamin K antagonists and grade 2C recommendations for use of LMWH over dabigatran, rivaroxaban, apixaban, or edoxaban.33

Our study had several limitations. First, data were from a single US academic medical center, though similar rates of prescribing deviation from recommendations have been reported for rivaroxaban and dabigatran in NVAF patients in other countries.29,34 Second, therapeutic indications may have been misclassified because of errors, incomplete EMR data, or multiple indications. Third, we analyzed the first DOAC order and not dispensing information or subsequent corrections. Therefore, deviations from recommendations should not be interpreted as errors that reached patients. We evaluated dosing based on the measures used at the time of hospital admission, noting that, in a significant fraction of deviations from recommended doses, they represented continuations of outpatient doses when renal function or weight may have differed, and it is unknown whether patients were counseled to take rivaroxaban with food in the outpatient setting. Fourth, the number of patients with acute DVT was small, so firm conclusions cannot be drawn for this specific population. Fifth, our estimates of off-label dosing may have been underestimates, as data on cancer and cancer activity or cardiac valvular disease may not have been complete.

 

 

CONCLUSION

Healthcare professionals are prescribing DOACs in ways that differ from recommendations. These differences may reflect the older ages and reduced renal function of clinical populations relative to randomized clinical trial groups, but they could also potentially alter clinical efficacy. Our findings support the need to evaluate the appropriateness and dosing of DOACs at each encounter and to determine the outcomes of patients treated with lower than recommended doses of DOACs and the outcomes of DOAC-treated patients with bioprostheses or active malignancies.

Acknowledgment

The authors thank Tobias Schmelzinger for electronic data extraction and compilation and University of California San Francisco students Eduardo De La Torre Cruz (School of Pharmacy) and Carlos Mikell (School of Medicine) for assistance with data review.

Disclosure

Dr. Schwartz reports receiving personal fees from Bristol-Myers Squibb and Amgen and grants from Bristol-Myers Squibb and Pfizer, outside the submitted work. The other authors have nothing to report.

 

Direct-acting oral anticoagulants (DOACs) have been introduced into clinical use for stroke prevention in patients with nonvalvular atrial fibrillation (NVAF), prevention of venous thrombosis after hip or knee surgery, and treatment of deep vein thrombosis (DVT) and pulmonary embolism (PE).1-7 Advantages of DOACs over warfarin are often stated as fixed dosing, minor drug and food interactions, wider therapeutic index, and no need for laboratory test monitoring.1,8 Yet, recommended DOAC dosages vary by renal function and therapeutic indications. Dosing recommendations for prevention of stroke in patients with NVAF are based on estimated creatinine clearance (dabigatran, rivaroxaban, edoxaban), age (apixaban), weight (apixaban, edoxaban), serum creatinine level (apixaban, edoxaban), and presence of cirrhosis by Child-Pugh class9,10 (apixaban, edoxaban).4-6,11,12 Dosing recommendations based on coadministration of strong CYP34A and P-glycoprotein inhibitors or inducers vary by DOAC. In addition, dabigatran cannot be crushed and must be stored in its original packaging, and rivaroxaban should be taken with food when the dose is over 10 mg.

We studied DOAC prescribing in adults admitted to a large academic medical center by comparing initial prescribed dosing with FDA-approved prescribing information. We hypothesized that the complexity of DOAC dosing may not be recognized by prescribers.

METHODS

Our study protocol was approved by the Committee on Human Research (Institutional Review Board) of the University of California San Francisco.

Data Collection

We used electronic medical records (EMRs) to identify adult inpatients who were prescribed a DOAC (apixaban, dabigatran, edoxaban, or rivaroxaban) at the University of California San Francisco Medical Center, a large academic hospital, between July 1, 2014 and June 30, 2015. Demographic and medical information related to therapeutic indications, contraindications, and indications for dose adjustments were collected and included diagnoses classified by International Classification of Diseases, Ninth Revision (ICD-9) and Tenth Revision (ICD-10) for venous thromboses; phlebitis or thrombophlebitis; PE or venous embolism; atrial arrhythmias; surgical procedures; cirrhosis and/or ascites or liver disease; coagulopathies; artificial heart valves or implanted devices; prior use of medications including parenteral anticoagulants; and laboratory data obtained before the first DOAC order (serum creatinine level, estimated glomerular filtration rate [eGFR] determined by Chronic Kidney Disease Epidemiology Collaboration,13 international normalized ratio, or, if available, activated partial thromboplastin time and bilirubin level). Creatinine clearance was calculated with the Cockcroft-Gault method14 using total body weight, per drug label recommendation. Child-Pugh class was calculated if cirrhosis was diagnosed.10 DOAC dose, frequency, dosing directions, and prescriber medical specialty were determined.

 

 

Accuracy of search results was confirmed by review of the first 200 patients’ records. Records were manually reviewed for encounters lacking ICD-9/10 codes and approved DOAC indications (30%) and encounters having multiple coded diagnostic indications (to identify the indication). ICD-9 codes for venous thrombosis were reviewed to differentiate acute from chronic events.

Data Analysis

The main outcome was concordance or discordance between the first DOAC prescribing order and the FDA-approved prescribing information at the time. Initial classification, performed by 2 independent reviewers (a pharmacist and a physician, or 2 pharmacists), was followed by adjudication and individual record review (by 2 independent reviewers) of all initial prescribing orders classified as discordant. A third reviewer adjudicated any disagreement. Records and notes were reviewed to identify stated or potential reasons for dosing variation and pre-admission prescriptions. Data are presented as means and standard deviations (SDs) and as raw numbers and percentages. Differences in patient characteristics by DOAC or therapeutic indication were determined by analysis of variance (ANOVA) with Bonferroni correction for post hoc comparisons. Dosing information was categorized as the same as recommended, lower than recommended, higher than recommended, or avoid drug use (drug–drug or drug–disease interaction), per FDA-approved prescribing information, and χ2 tests were used to determine whether variation in dosing occurred by individual DOAC, therapeutic indication, or prescriber specialty. Relationships between dosing variation and age or renal function were tested by ANOVA with Bonferroni correction for post hoc comparisons.

RESULTS

Table 1
There were 635 admissions with apixaban, dabigatran, or rivaroxaban prescribed for 508 patients (Table 1). Edoxaban was not on the formulary and not prescribed during the period studied. The therapeutic indication was prevention of embolic stroke in patients with atrial fibrillation/flutter or AF (465 admissions, or 73%, with valvular disease and/or tissue valve in 35), chronic DVT (67 admissions, or 11%, with active malignancy in 14), acute DVT (32 admissions, with malignancy in 2), chronic PE (23 admissions, with malignancy in 3), acute PE (19 admissions, with malignancy in 4), and DVT prevention after hip or knee surgery (19 admissions). DOACs were prescribed for unapproved indications in 10 admissions, and these were excluded from further analysis (mural thrombus in 3 admissions, low ejection fraction in 2, bedrest immobilization in 2, aortic aneurysm in 1, thrombocytosis in 1, and extensive superficial venous thrombosis in 1) (Table 2).

Table 2

Patients with AF were older with lower creatinine clearance compared to patients with other diagnoses. Mean (SD) patient age was 72.1 (12.7) years for AF, 53.1 (10.9) years for chronic PE, 55.5 (14) years for acute PE, 56.4 (15.9) years for chronic DVT, 57.9 (18.4) years for acute DVT, and 61.4 (11.6) years for DVT prevention after hip or knee surgery (P < 0.0001 for all comparisons). Mean (SD) estimated creatinine clearance was 76.8 (43.5) mL/min for AF, 92.4 (44.4) mL/min for DVT prevention after hip or knee surgery, 111 (53) mL/min for chronic DVT, 118 (55) mL/min for acute DVT, 126 (60) mL/min for chronic PE, and 127 (54) mL/min for acute PE (P < 0.0001 for all comparisons). Differences between patient groups by therapeutic indication were not detected for weight, body mass index, or serum creatinine level.

The most frequent deviation from prescribing recommendations was omission of directions to administer rivaroxaban with food—93% (248/268) of orders—but not for DVT prevention after hip or knee surgery, for which the 10-mg dose is appropriately administered without food. Doses were the same as recommended for 82% of apixaban orders, 84% of rivaroxaban orders, and 93% of initial dabigatran orders (P < 0.05 for differences; Table 3). Dosages not concordant with FDA recommendations were prescribed in 44 (18.1%) of 243 apixaban orders, 41 (14.3%) of 286 rivaroxaban orders, and 7 (7.2%) of 89 initial dabigatran orders. Lower than recommended doses were more common than higher than recommended doses (Table 3, Figure 1): 15.2% versus 2.1% of apixaban orders, 9.4% versus 3.5% of rivaroxaban orders, and 4.2% versus 1.0% of initial dabigatran orders (P < 0.05). Failure to avoid drug use (for potential drug–drug or drug–disease interactions) was uncommon (1%-2%). There were more deviations from recommended doses for patients with AF or DVT prevention after hip or knee surgery than for patients with acute or chronic PE or acute DVT (Table 3). No significant differences were detected between prescribed and recommended doses by prescriber specialty.

Table 3
In most cases, a reason for deviating from FDA dosing recommendations was not stated in the EMR. The exception was fluctuating renal function, which was cited in 8 cases.

Figure

For apixaban, patients who were prescribed lower than recommended doses were older than those prescribed recommended doses: mean (SD), 78.1 (12.2) years versus 71 (13.6) years (P = 0.003). Seventy-six percent of those prescribed lower than recommended doses were older than 75. Prescriptions for apixaban at lower than recommended doses were continuations of prior outpatient prescriptions in 20 of 37 cases (almost half), and in 12 cases (one-fourth) antiplatelet drugs were coprescribed (aspirin in 10 cases, clopidogrel in 1, prasugrel in 1). For rivaroxaban, older age was associated with both lower than recommended dosing (P = 0.003) and higher than recommended dosing (P < 0.001). Variations from prescribing recommendations were continuations of outpatient rivaroxaban doses in about two-thirds (26 of 41; 63.4 %) with 13 receiving antiplatelet drugs. For dabigatran, 6 of 7 orders not in agreement with recommendations were continuations of outpatient dosing.

The specific equation used to estimate renal function also had the potential to lead to dosing errors. Among the 41 rivaroxaban patients categorized as receiving doses discordant with recommendations, 8 would have had an inappropriate DOAC dose if eGFR were used instead of eCrCL as recommended. No relationships were detected for other patient variables/measures and dosing deviations from recommendations.

 

 

DISCUSSION

We examined initial hospital orders for DOACs in adults admitted to a single academic medical center during 2014-2015. Dabigatran, apixaban and rivaroxaban were prescribed for prevention of stroke in patients with atrial fibrillation/flutter (AF) in three quarters of the encounters similar to national patterns. (15) Prescribing departures from FDA-approved recommendations ranged from failure to prescribe rivaroxaban with food to failure to recognize drug-drug interactions in 1% to 2%. Unexpectedly, lower than recommended dosing was more common than higher than recommended dosing of the three DOACs.

Rivaroxaban bioavailability is dose dependent with the presence of food required to enhance absorption for doses over 10 mg that are used for prevention of stroke in patients with non-valvular AF or treatment of DVT or PE.5,16 Peak rivaroxaban concentrations are 75% higher and the total area under the concentration vs. time curve after dosing is 40% higher when rivaroxaban is administered with high fat high calorie meals compared to the fasting state.16 If rivaroxaban is not administered with food, drug concentrations and pharmacologic effects may be less than in clinical trials that specified co-administration with food.17-19 A small survey of outpatients receiving rivaroxaban found that 23% reported taking it without food.20 With electronic pharmacy systems in almost all hospitals and electronic prescriber order entry in most, automated addition of directions for rivaroxaban administration with food for doses over 10 mg to labels or dispensing instructions could easily correct this deviation from recommended practice.

Lower than recommended doses were prescribed in 9.4% of orders for rivaroxaban and 15.2% of orders for apixaban, with dose-deviations often appearing to be a continuation of outpatient doses. Patients 75 years or older were more likely to receive lower than recommended dosing of apixaban. Reductions in apixaban doses from 5 mg twice daily to 2.5 mg twice daily are recommended in patients with non-valvular AF with two of the following criteria: age ≥80 y, weight ≤60 kg, serum creatinine ≥1.5 mg/dL or co-administration of a strong PgP inhibitor to a patient without 2 of the 3 dose reduction criteria. Our study was not designed to determine reasons for under-dosing, but we speculate that clinicians may have considered patients aged 75-79 years to be similar to those 80 years of age or older, or, older and not as healthy as those enrolled in randomized trials.21-25 The median age of our patients with AF receiving apixaban was 75y (interquartile range of 16) vs 70y ( interquartile range 63-76) in the pivotal trial comparing warfarin to apixaban.21 Renal function was also lower with 37% having eCrCL below 50 mL/min compared to 17% in ARISTOTLE. (21). Twenty-six percent of our apixaban-treated AF patients qualified for the lower 2.5 mg twice daily compared to only 5% of ARISTOTLE participants,21 further suggesting differences between patients in our sample compared to randomized trial participants.

Concerns regarding bleeding or falls in older patients, may also have contributed to lower than recommended doses. Recent analyses of patients at risk for falls confirmed that increased risk of falling was associated with more bone fractures, bleeding and all-cause death but not stroke or systemic emboli, and with less severe bleeding with the DOAC edoxaban compared to warfarin.26 While a rationale for personalized or lower than recommended dosing of apixaban may exist in very old patients and those at risk of falls and bleeding, more data are needed to determine outcomes of lower than recommended doses of DOACs before such an approach can be endorsed. Monitoring of anticoagulant effect in patients who receive doses lower than those investigated in clinical trials could provide important information. The assays that measure DOAC effects are likely to be more available because of the use of reversal agents in the setting of bleeding with DOACs.27

We had anticipated higher than recommended dosing for rivaroxaban as recommendations are based on creatinine clearance while laboratories routinely report estimated glomerular filtration rate (eGFR) that can provide higher estimates of renal clearance and estimated DOAC doses in older and smaller individuals.28 Higher than recommended dosing was found in only 3.5% of our sample. In half, eGFR estimates were higher than creatinine clearance estimates. An international postmarketing registry of rivaroxaban use for the prevention of stroke in patients with NVAF, which included outpatients, found that 36% of those with creatinine clearances below 50 mL/min received a dose higher than recommended, and 15% received a dose lower than expected.29 A more recent outpatient registry report on patients with NVAF, in which apixaban, dabigatran, or rivaroxaban was administered, found that overall 9.4% received a dose lower than recommended, and 3.4% were overdosed, with a similar percentage (34%) of rivaroxaban patients with creatinine clearance of 15 to 50 mL/min receiving higher than recommended dosing.30 The lower rate of higher-than-recommended doses that we observed may have been related to the routine measurement of serum creatinine and attention to dosing adjustments for renal function in the inpatient setting compared to the outpatient setting. In addition, renal function data may not be available to outpatient pharmacies, limiting potential input on dosing recommendations. At least one cardiac society recommends monitoring of renal function in patients treated with DOACs, annually in patients with normal estimated creatinine clearance and more frequently (at intervals in months equal to the creatinine clearance divided by 10) in patients with abnormal creatinine clearance.11 A hospital encounter provides an opportunity to assess or reassess renal status to optimize DOAC dosing.

Dabigatran was the first DOAC introduced into use in the United States with the same dose recommended for prevention of stroke in patients with AF or venous thromboembolic disease with reductions for creatinine clearance below 30 mL/min or creatinine clearance between 30 and 50 mL/min and concomitant use of the potent P-glycoprotein inhibitor dronedarone or systemic ketoconazole. The relative simplicity of dosing may have been responsible for the lowest rate of prescribing outside of recommendations observed in this study, but the low dabigatran use limits analyses of contributing factors.

Failure to avoid drug use in combination with use of strong P-glycoprotein inducers or inhibitors was infrequent but should be preventable. Current prescribing recommendations refer to “strong” P-glycoprotein inhibitors and list different specific agents that interact with each DOAC without a standardized definition or classification. Standardized classifications or reference sources would be helpful.

Our primary goal in this study was to compare initial prescribed dosing of DOACs with FDA-approved prescribing directions. However, therapeutic indication data warrant discussion. In our sample, 7.5% of patients with AF had bioprosthetic valves or recent mitral valve repair or replacement. Using the NVAF definition found in the 2014 AHA/ACC/HRS (American Heart Association, American College of Cardiology, Heart Rhythm Society) AF guidelines1—“absence of rheumatic mitral valve disease, a prosthetic heart valve, or mitral valve repair”—these patients would not appear to be candidates for DOACs. However, arguments have been made that a bioprosthetic heart valve or native valve after valve repair does not have a risk profile for thromboembolism that differs from other forms of NVAF and would be equally responsive to DOAC therapy.31 Data are sparse, but retrospective subanalyses of limited numbers of patients with valvular disease (including bioprosthesis and mitral repair patients but excluding mechanical valve patients) enrolled in the pivotal DOAC studies support this conclusion.32 For the first months after biological valve replacement (including catheter-based valve replacement), recent European guidelines recommend vitamin K antagonists but also state, “NOACs probably deliver the same protection.”8 DOACs were also used for management of venous thromboembolic disease (both acute and chronic) in patients with active cancer. Our data predate the most recent American College of Chest Physician guidelines on treatment of venous thromboembolism in patients with cancer, which provide grade 2B recommendations for use of low-molecular-weight heparin (LMWH) over vitamin K antagonists and grade 2C recommendations for use of LMWH over dabigatran, rivaroxaban, apixaban, or edoxaban.33

Our study had several limitations. First, data were from a single US academic medical center, though similar rates of prescribing deviation from recommendations have been reported for rivaroxaban and dabigatran in NVAF patients in other countries.29,34 Second, therapeutic indications may have been misclassified because of errors, incomplete EMR data, or multiple indications. Third, we analyzed the first DOAC order and not dispensing information or subsequent corrections. Therefore, deviations from recommendations should not be interpreted as errors that reached patients. We evaluated dosing based on the measures used at the time of hospital admission, noting that, in a significant fraction of deviations from recommended doses, they represented continuations of outpatient doses when renal function or weight may have differed, and it is unknown whether patients were counseled to take rivaroxaban with food in the outpatient setting. Fourth, the number of patients with acute DVT was small, so firm conclusions cannot be drawn for this specific population. Fifth, our estimates of off-label dosing may have been underestimates, as data on cancer and cancer activity or cardiac valvular disease may not have been complete.

 

 

CONCLUSION

Healthcare professionals are prescribing DOACs in ways that differ from recommendations. These differences may reflect the older ages and reduced renal function of clinical populations relative to randomized clinical trial groups, but they could also potentially alter clinical efficacy. Our findings support the need to evaluate the appropriateness and dosing of DOACs at each encounter and to determine the outcomes of patients treated with lower than recommended doses of DOACs and the outcomes of DOAC-treated patients with bioprostheses or active malignancies.

Acknowledgment

The authors thank Tobias Schmelzinger for electronic data extraction and compilation and University of California San Francisco students Eduardo De La Torre Cruz (School of Pharmacy) and Carlos Mikell (School of Medicine) for assistance with data review.

Disclosure

Dr. Schwartz reports receiving personal fees from Bristol-Myers Squibb and Amgen and grants from Bristol-Myers Squibb and Pfizer, outside the submitted work. The other authors have nothing to report.

 

References

1. January CT, Wann LS, Alpert JS, et al. 2014 AHA/ACC/HRS guideline for the management of patients with atrial fibrillation: executive summary. A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the Heart Rhythm Society. J Am Coll Cardiol. 2014;64(21):2246-2280. PubMed
2. Saraf K, Morris PD, Garg P, Sheridan P, Storey R. Non–vitamin K antagonist oral anticoagulants (NOACs): clinical evidence and therapeutic considerations. Postgrad Med J. 2014;90(1067):520-528. PubMed
3. Yeh CH, Gross PL, Weitz JI. Evolving use of new oral anticoagulants for treatment of venous thromboembolism. Blood. 2014;124(7):1020-1028. PubMed
4. Pradaxa website. https://www.pradaxa.com. Accessed June 1, 2017.
5. Xarelto website. https://www.xarelto-us.com. Accessed June 1, 2017.
6. Eliquis website. http://www.eliquis.com. Accessed June 1, 2017.
7. Savaysa [prescribing information]. Tokyo, Japan: Daiichi Sankyo; 2015.
8. Kirchhof P, Benussi S, Kotecha D, et al. 2016 ESC guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. 2016;37(38):2893-2962. PubMed
9. Child C, Turcotte J. Surgery and portal hypertension. In: Child CG, ed. The Liver and Portal Hypertension. Philadelphia, PA: Saunders; 1964:50-64. PubMed
10. Pugh RN, Murray-Lyon IM, Dawson JL, Pietroni MC, Williams R. Transection of the oesophagus for bleeding oesophageal varices. Br J Surg. 1973;60(8):646-649. PubMed
11. Heidbuchel H, Verhamme P, Alings M, et al. Updated European Heart Rhythm Association practical guide on the use of non–vitamin K antagonist anticoagulants in patients with non-valvular atrial fibrillation. Europace. 2015;17(10):1467-1507. PubMed
12. Savaysa website. https://savaysahcp.com. Accessed June 1, 2017.
13. Levey AS, Stevens LA, Schmid CH, et al; CKD-EPI (Chronic Kidney Disease Epidemiology Collaboration). A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150(9):604-612. PubMed
14. Cockcroft DW, Gault MH. Prediction of creatinine clearance from serum creatinine. Nephron. 1976;16(1):31-41. PubMed
15. Rose AJ, Reisman JI, Allen AL, Miller DR. Potentially inappropriate prescribing of direct-acting oral anticoagulants in the Veterans Health Administration. Am J Pharm Benefits. 2016;4(4):e75-e80.
16. Stampfuss J, Kubitza D, Becka M, Mueck W. The effect of food on the absorption and pharmacokinetics of rivaroxaban. Int J Clin Pharmacol Ther. 2013;51(7):549-561. PubMed
17. Patel MR, Mahaffey KW, Garg J, et al; ROCKET AF Investigators. Rivaroxaban versus warfarin in nonvalvular atrial fibrillation. N Engl J Med. 2011;365(10):883-891. PubMed
18. EINSTEIN Investigators, Bauersachs R, Berkowitz SD, et al. Oral rivaroxaban for symptomatic venous thromboembolism. N Engl J Med. 2010;363(26):2499-2510. PubMed
19. EINSTEIN-PE Investigators, Büller HR, Prins MH, et al. Oral rivaroxaban for the treatment of symptomatic pulmonary embolism. N Engl J Med. 2012;366(14):1287-1297. PubMed
20. Simon J, Hawes E, Deyo Z, Bryant-Shilliday B. Evaluation of prescribing and patient use of target-specific oral anticoagulants in the outpatient setting. J Clin Pharm Ther. 2015;40(5):525-530. PubMed
21. Granger CB, Alexander JH, McMurray JJ, et al; ARISTOTLE Committees and Investigators. Apixaban versus warfarin in patients with atrial fibrillation. N Engl J Med. 2011;365(11):981-992. PubMed
22. Ruff CT, Giugliano RP, Braunwald E, et al. Comparison of the efficacy and safety of new oral anticoagulants with warfarin in patients with atrial fibrillation: a meta-analysis of randomised trials. Lancet. 2014;383(9921):955-962. PubMed
23. van der Hulle T, Kooiman J, den Exter PL, Dekkers OM, Klok FA, Huisman MV. Effectiveness and safety of novel oral anticoagulants as compared with vitamin K antagonists in the treatment of acute symptomatic venous thromboembolism: a systematic review and meta-analysis. J Thromb Haemost. 2014;12(3):320-328. PubMed
24. Schuh T, Reichardt B, Finsterer J, Stöllberger C. Age-dependency of prescribing patterns of oral anticoagulant drugs in Austria during 2011–2014. J Thromb Thrombolysis. 2016;42(3):447-451. PubMed
25. Stöllberger C, Brooks R, Finsterer J, Pachofszky T. Use of direct-acting oral anticoagulants in nonagenarians: a call for more data. Drugs Aging. 2016;33(5):315-320. PubMed
26. Steffel J, Giugliano RP, Braunwald E, et al. Edoxaban versus warfarin in atrial fibrillation patients at risk of falling: ENGAGE AF-TIMI 48 analysis. J Am Coll Cardiol. 2016;68(11):1169-1178. PubMed
27. Ruff CT, Giugliano RP, Antman EM. Management of bleeding with non–vitamin K antagonist oral anticoagulants in the era of specific reversal agents. Circulation. 2016;134(3):248-261. PubMed
28. Schwartz JB. Potential impact of substituting estimated glomerular filtration rate for estimated creatinine clearance for dosing of direct oral anticoagulants. J Am Geriatr Soc. 2016;64(10):1996-2002. PubMed
29. Camm AJ, Amarenco P, Haas S, et al; XANTUS Investigators. XANTUS: a real-world, prospective, observational study of patients treated with rivaroxaban for stroke prevention in atrial fibrillation. Eur Heart J. 2016;37(14):1145-1153. PubMed
30. Steinberg BA, Shrader P, Thomas L, et al; ORBIT-AF Investigators and Patients. Off-label dosing of non–vitamin K antagonist oral anticoagulants and adverse outcomes: the ORBIT-AF II Registry. J Am Coll Cardiol. 2016;68(24):2597-2604. PubMed
31. Fauchier L, Philippart R, Clementy N, et al. How to define valvular atrial fibrillation? Arch Cardiovasc Dis. 2015;108(10):530-539. PubMed
32. Di Biase L. Use of direct oral anticoagulants in patients with atrial fibrillation and valvular heart lesions. J Am Heart Assoc. 2016;5(2). PubMed

33. Kearon C, Akl EA, Ornelas J, et al. Antithrombotic therapy for VTE disease:
CHEST guideline and expert panel report. Chest. 2016;149(2):315-352. PubMed
34. Larock AS, Mullier F, Sennesael AL, et al. Appropriateness of prescribing dabigatran
etexilate and rivaroxaban in patients with nonvalvular atrial fibrillation: a prospective
study. Ann Pharmacother. 2014;48(10):1258-1268. PubMed

References

1. January CT, Wann LS, Alpert JS, et al. 2014 AHA/ACC/HRS guideline for the management of patients with atrial fibrillation: executive summary. A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines and the Heart Rhythm Society. J Am Coll Cardiol. 2014;64(21):2246-2280. PubMed
2. Saraf K, Morris PD, Garg P, Sheridan P, Storey R. Non–vitamin K antagonist oral anticoagulants (NOACs): clinical evidence and therapeutic considerations. Postgrad Med J. 2014;90(1067):520-528. PubMed
3. Yeh CH, Gross PL, Weitz JI. Evolving use of new oral anticoagulants for treatment of venous thromboembolism. Blood. 2014;124(7):1020-1028. PubMed
4. Pradaxa website. https://www.pradaxa.com. Accessed June 1, 2017.
5. Xarelto website. https://www.xarelto-us.com. Accessed June 1, 2017.
6. Eliquis website. http://www.eliquis.com. Accessed June 1, 2017.
7. Savaysa [prescribing information]. Tokyo, Japan: Daiichi Sankyo; 2015.
8. Kirchhof P, Benussi S, Kotecha D, et al. 2016 ESC guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. 2016;37(38):2893-2962. PubMed
9. Child C, Turcotte J. Surgery and portal hypertension. In: Child CG, ed. The Liver and Portal Hypertension. Philadelphia, PA: Saunders; 1964:50-64. PubMed
10. Pugh RN, Murray-Lyon IM, Dawson JL, Pietroni MC, Williams R. Transection of the oesophagus for bleeding oesophageal varices. Br J Surg. 1973;60(8):646-649. PubMed
11. Heidbuchel H, Verhamme P, Alings M, et al. Updated European Heart Rhythm Association practical guide on the use of non–vitamin K antagonist anticoagulants in patients with non-valvular atrial fibrillation. Europace. 2015;17(10):1467-1507. PubMed
12. Savaysa website. https://savaysahcp.com. Accessed June 1, 2017.
13. Levey AS, Stevens LA, Schmid CH, et al; CKD-EPI (Chronic Kidney Disease Epidemiology Collaboration). A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150(9):604-612. PubMed
14. Cockcroft DW, Gault MH. Prediction of creatinine clearance from serum creatinine. Nephron. 1976;16(1):31-41. PubMed
15. Rose AJ, Reisman JI, Allen AL, Miller DR. Potentially inappropriate prescribing of direct-acting oral anticoagulants in the Veterans Health Administration. Am J Pharm Benefits. 2016;4(4):e75-e80.
16. Stampfuss J, Kubitza D, Becka M, Mueck W. The effect of food on the absorption and pharmacokinetics of rivaroxaban. Int J Clin Pharmacol Ther. 2013;51(7):549-561. PubMed
17. Patel MR, Mahaffey KW, Garg J, et al; ROCKET AF Investigators. Rivaroxaban versus warfarin in nonvalvular atrial fibrillation. N Engl J Med. 2011;365(10):883-891. PubMed
18. EINSTEIN Investigators, Bauersachs R, Berkowitz SD, et al. Oral rivaroxaban for symptomatic venous thromboembolism. N Engl J Med. 2010;363(26):2499-2510. PubMed
19. EINSTEIN-PE Investigators, Büller HR, Prins MH, et al. Oral rivaroxaban for the treatment of symptomatic pulmonary embolism. N Engl J Med. 2012;366(14):1287-1297. PubMed
20. Simon J, Hawes E, Deyo Z, Bryant-Shilliday B. Evaluation of prescribing and patient use of target-specific oral anticoagulants in the outpatient setting. J Clin Pharm Ther. 2015;40(5):525-530. PubMed
21. Granger CB, Alexander JH, McMurray JJ, et al; ARISTOTLE Committees and Investigators. Apixaban versus warfarin in patients with atrial fibrillation. N Engl J Med. 2011;365(11):981-992. PubMed
22. Ruff CT, Giugliano RP, Braunwald E, et al. Comparison of the efficacy and safety of new oral anticoagulants with warfarin in patients with atrial fibrillation: a meta-analysis of randomised trials. Lancet. 2014;383(9921):955-962. PubMed
23. van der Hulle T, Kooiman J, den Exter PL, Dekkers OM, Klok FA, Huisman MV. Effectiveness and safety of novel oral anticoagulants as compared with vitamin K antagonists in the treatment of acute symptomatic venous thromboembolism: a systematic review and meta-analysis. J Thromb Haemost. 2014;12(3):320-328. PubMed
24. Schuh T, Reichardt B, Finsterer J, Stöllberger C. Age-dependency of prescribing patterns of oral anticoagulant drugs in Austria during 2011–2014. J Thromb Thrombolysis. 2016;42(3):447-451. PubMed
25. Stöllberger C, Brooks R, Finsterer J, Pachofszky T. Use of direct-acting oral anticoagulants in nonagenarians: a call for more data. Drugs Aging. 2016;33(5):315-320. PubMed
26. Steffel J, Giugliano RP, Braunwald E, et al. Edoxaban versus warfarin in atrial fibrillation patients at risk of falling: ENGAGE AF-TIMI 48 analysis. J Am Coll Cardiol. 2016;68(11):1169-1178. PubMed
27. Ruff CT, Giugliano RP, Antman EM. Management of bleeding with non–vitamin K antagonist oral anticoagulants in the era of specific reversal agents. Circulation. 2016;134(3):248-261. PubMed
28. Schwartz JB. Potential impact of substituting estimated glomerular filtration rate for estimated creatinine clearance for dosing of direct oral anticoagulants. J Am Geriatr Soc. 2016;64(10):1996-2002. PubMed
29. Camm AJ, Amarenco P, Haas S, et al; XANTUS Investigators. XANTUS: a real-world, prospective, observational study of patients treated with rivaroxaban for stroke prevention in atrial fibrillation. Eur Heart J. 2016;37(14):1145-1153. PubMed
30. Steinberg BA, Shrader P, Thomas L, et al; ORBIT-AF Investigators and Patients. Off-label dosing of non–vitamin K antagonist oral anticoagulants and adverse outcomes: the ORBIT-AF II Registry. J Am Coll Cardiol. 2016;68(24):2597-2604. PubMed
31. Fauchier L, Philippart R, Clementy N, et al. How to define valvular atrial fibrillation? Arch Cardiovasc Dis. 2015;108(10):530-539. PubMed
32. Di Biase L. Use of direct oral anticoagulants in patients with atrial fibrillation and valvular heart lesions. J Am Heart Assoc. 2016;5(2). PubMed

33. Kearon C, Akl EA, Ornelas J, et al. Antithrombotic therapy for VTE disease:
CHEST guideline and expert panel report. Chest. 2016;149(2):315-352. PubMed
34. Larock AS, Mullier F, Sennesael AL, et al. Appropriateness of prescribing dabigatran
etexilate and rivaroxaban in patients with nonvalvular atrial fibrillation: a prospective
study. Ann Pharmacother. 2014;48(10):1258-1268. PubMed

Issue
Journal of Hospital Medicine 12(7)
Issue
Journal of Hospital Medicine 12(7)
Page Number
544-550
Page Number
544-550
Topics
Article Type
Display Headline
Dosing accuracy of direct oral anticoagulants in an academic medical center
Display Headline
Dosing accuracy of direct oral anticoagulants in an academic medical center
Sections
Article Source

© 2017 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
Janice B. Schwartz, MD, 302 Silver Ave, San Francisco, CA 94112; Telephone: 415-406-1573; Fax: 415-406-1577; E-mail: [email protected]
Content Gating
Gated (full article locked unless allowed per User)
Alternative CME
Disqus Comments
Default
Gating Strategy
First Peek Free
Article PDF Media

Hospital-level factors associated with pediatric emergency department return visits

Article Type
Changed
Display Headline
Hospital-level factors associated with pediatric emergency department return visits

Return visit (RV) rate is a quality measure commonly used in the emergency department (ED) setting. This metric may represent suboptimal care at the index ED visit.1-5 Although patient- and visit-level factors affecting ED RVs have been evaluated,1,3,4,6-9 hospital-level factors and factors of a hospital’s patient population that may play roles in ED RV rates have not been examined. Identifying the factors associated with increased RVs may allow resources to be designated to areas that improve emergent care for children.10

Hospital readmission rates are a closely followed quality measure and are linked to reimbursement by the federal government, but a recent study found the influence a hospital can have on this marker may be mitigated by the impact of the social determinates of health (SDHs) of the hospital’s patient population.11 That study and others have prompted an ongoing debate about adjusting quality measures for SDHs.12,13 A clearer understanding of these interactions may permit us to focus on factors that can truly lead to improvement in care instead of penalizing practitioners or hospitals that provide care to those most in need.

Prior work has identified several SDHs associated with higher ED RV rates in patient- or visit-level analyses.3,11,14 We conducted a study of hospital-level characteristics and characteristics of a hospital’s patient population to identify potentially mutable factors associated with increased ED RV rates that, once recognized, may allow for improvement in this quality measure.

PATIENTS AND METHODS

This study was not considered human subjects research in accordance with Common Rule 45 CFR§46.104(f) and was evaluated by the Ann and Robert H. Lurie Children’s Hospital and Northwestern University Feinberg School of Medicine Institutional Review Boards and deemed exempt from review.

Study Population and Protocol

Our study had 2 data sources (to be described in detail): the Pediatric Health Information System (PHIS) and a survey of ED medical directors of the hospitals represented within PHIS. Hospitals were eligible for inclusion in the study if their data (1) met PHIS quality control standards for ED patient visits as determined by internal data assurance processes incorporated in PHIS,3,14,15 (2) included data only from an identifiable single main ED, and (3) completed the ED medical director’s survey.

 

 

PHIS Database

PHIS, an administrative database managed by Truven Health Analytics, includes data from ED, ambulatory surgery, observation, and inpatient encounters across Children’s Hospital Association member children’s hospitals in North America. Data are subjected to validity checks before being included in the database.16 PHIS assigns unique patient identifiers to track individual patient visits within participating institutions over time.

Hospitals were described by percentages of ED patients in several groups: age (<1, 1-4, 5-9, 10-14, and 15-18 years)17; sex; race/ethnicity; insurance type (commercial, government, other); ED International Classification of Diseases, Ninth Edition (ICD-9) diagnosis code–based severity classification system score (1-2, low severity; 3-5, high severity)18; complex chronic condition presence at ED visits in prior year14,19-21; home postal (Zip) code median household income from 2010 US Census data compared with Federal Poverty Level (<1.5, 1.5-2, 2-3, and >3 × FPL)17; and primary care physician (PCP) density in Federal Health Service Area of patient’s home address as reported by Dartmouth Atlas of Health Care modeled by quartiles.22 Density of PCPs—general pediatricians, family practitioners, general practitioners, and general internists—is calculated as number of PCPs per 100,000 residents. We used PCP density to account for potential care provided by any of the PCPs mentioned. We also assessed, at hospital level, index visit arrival time (8:01 am to 4:00 pm; 4:01 pm to 12:00 am; 12:01 am to 8:00 am) and index visit season.23

ED Medical Director Survey

A web-based survey was constructed in an iterative process based on literature review and expert opinion to assess hospital-level factors that may impact ED RV rates.3,7,24-26 The survey was piloted at 3 institutions to refine its structure and content.

The survey included 15 close-ended or multiple-choice questions on ED environment and operations and 2 open-ended questions, “What is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?” and “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit ?” (questionnaire in Supplemental material). Hospital characteristics from the survey included total clinical time allotment, or full-time equivalent (FTE), among all physicians, pediatric emergency medicine (PEM) fellowship-trained physicians, and all other (non-PEM) physicians. The data were standardized across sites by calculating FTE-per-10,000-visits values for each hospital; median duration of ED visit for admitted and discharged patients; median time from arrival to ED physician evaluation; rate of leaving without being seen; discharge educational material authorship and age specificity; follow-up visit scheduling procedure; and percentage of ED patients for whom English was a second language.

Responses to the 2 open-ended questions were independently categorized by Drs. Pittsenbarger and Alpern. Responses could be placed in more than 1 category if multiple answers to the question were included in the response. Categorizations were compared for consistency, and any inconsistencies were resolved by the consensus of the study investigators.

Outcome Measures From PHIS Database

All ED visits within a 12-month period (July 1, 2013–June 30, 2014) by patients younger than 18 years at time of index ED visit were eligible for inclusion in the study. An index visit was defined as any ED visit without another ED visit within the preceding 72 hours. The 72-hour time frame was used because it is the most widely studied time frame for ED RVs.5 Index ED visits that led to admission, observation status, death, or transfer were excluded.

The 2 primary outcomes of interest were (1) RVs within 72 hours of index ED visit discharge and (2) RVs within 72 hours that resulted in hospital admission or observation status at the next ED visit (RVA).7,9,27-30 For patients with multiple ED revisits within 72 hours, only the first was assessed. There was a 72-hour minimum between index visits for the same patient.

Statistical Analyses

To determine hospital groups based on RV and RVA rates, we adjusted RV and RVA rates using generalized linear mixed-effects models, controlling for clustering and allowing for correlated data (within hospitals), nonconstant variability (across hospitals), and non-normally distributed data, as we did in a study of patient-level factors associated with ED RV and RVA.3 For each calculated rate (RV, RVA), the hospitals were then classified into 3 groups based on whether the hospital’s adjusted RV and RVA rates were outside 2 SDs from the mean, below the 5th or above the 95th percentile, or within that range. These groups were labeled lowest outliers, highest outliers, and average-performing hospitals.

After the groups of hospitals were determined, we returned to using unadjusted data to statistically analyze them. We summarized continuous variables using minimum and maximum values, medians, and interquartile ranges (IQRs). We present categorical variables using counts and percentages. To identify hospital characteristics with the most potential to gain from improvement, we also analyzed associations using 2 collapsed groups: hospitals with RV (or RVA) rates included in the average-performing and lowest outlier groups and hospitals within the highest outlier group. Hospital characteristics and hospital’s patient population characteristics from the surveys are summarized based on RV and RVA rate groups. Differences in distributions among continuous variables were assessed by Kruskal-Wallis 1-way analysis of variance. Chi-square tests were used to evaluate differences in proportions among categorical variables. All statistical analyses were performed with SAS Version 9.4 (SAS Institute); 2-sided P < 0.05 was considered statistically significant.

 

 

RESULTS

Return Visit Rates and Hospital ED Site Population Characteristics

Twenty-four of 35 (68%) eligible hospitals that met PHIS quality control standards for ED patient visits responded to the ED medical director survey. The included hospitals that both met quality control standards and completed the survey had a total of 1,456,377 patient visits during the study period. Individual sites had annual volumes ranging from 26,627 to 96,637 ED encounters. The mean RV rate across the institutions was 3.7% (range, 3.0%-4.8%), and the mean RVA rate across the hospitals was 0.7% (range, 0.5%-1.1%) (Figure).

Figure

There were 5 hospitals with RV rates less than 2 SDs of the mean rate, placing them in the lowest outlier group for RV; 13 hospitals with RV rates within 2 SDs of the mean RV rate, placing them in the average-performing group; and 6 hospitals with RV rates above 2 SDs of the mean, placing them in the highest outlier group. Table 1 lists the hospital ED site population characteristics among the 3 RV rate groups. Hospitals in the highest outlier group served populations with higher proportions of patients with insurance from a government payer, lower proportions of patients covered by a commercial insurance plan, and higher proportion of patients with lower median household incomes.

Table 1

In the RVA analysis, there were 6 hospitals with RVA rates less than 2 SDs of the mean RVA rate (lowest outliers); 14 hospitals with RVA rates within 2 SDs of the mean RVA rate (average performers); and 4 hospitals with RVA rates above 2 SDs of the mean RVA rate (highest outliers). When using these groups based on RVA rate, there were no statistically significant differences in hospital ED site population characteristics (Supplemental Table 1).

RV Rates and Hospital-Level Factors Survey Characteristics

Table 2 lists the ED medical director survey hospital-level data among the 3 RV rate groups. There were fewer FTEs by PEM fellowship-trained physicians per 10,000 patient visits at sites with higher RV rates (Table 2). Hospital-level characteristics assessed by the survey were not associated with RVA rates (Supplemental Table 2).

Table 2

Evaluating characteristics of hospitals with the most potential to gain from improvement, hospitals with the highest RV rates (highest outlier group), compared with hospitals in the lowest outlier and average-performing groups collapsed together, persisted in having fewer PEM fellowship-trained physician FTEs per patient visit (Table 3). A similar collapsed analysis of RVA rates demonstrated that hospitals in the highest outlier group had longer-wait-to-physician time (81 minutes; IQR, 51-105 minutes) compared with hospitals in the other 2 groups (30 minutes; IQR, 19-42.5 minutes) (Table 3).

Table 3

In response to the first qualitative question on the ED medial director survey, “In your opinion, what is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?”, 15 directors (62.5%) reported limited access to primary care, 4 (16.6%) reported inadequate discharge instructions and/or education provided, and 3 (12.5%) reported lack of access to specialist care. To the second question, “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit for the same condition?”, they responded that RVs could be reduced by innovations in scheduling primary care or specialty follow-up visits (19, 79%), improving discharge education and instructions (6, 25%), and identifying more case management or care coordination (4, 16.6%).

DISCUSSION

Other studies have identified patient- and visit-level characteristics associated with higher ED RV and RVA rates.3,8,9,31 However, as our goal was to identify possible modifiable institutional features, our study examined factors at hospital and population-served levels (instead of patient or visit level) that may impact ED RV and RVA rates. Interestingly, our sample of tertiary-care pediatric center EDs provided evidence of variability in RV and RVA rates. We identified factors associated with RV rates related to the SDHs of the populations served by the ED, which suggests these factors are not modifiable at an institution level. In addition, we found that the increased availability of PEM providers per patient visit correlated with fewer ED RVs.

Hospitals serving ED populations with more government-insured and fewer commercially insured patients had higher rates of return to the ED. Similarly, hospitals with larger proportions of patients from areas with lower median household incomes had higher RV rates. These factors may indicate that patients with limited resources may have more frequent ED RVs,3,6,32,33 and hospitals that serve them have higher ED RV rates. Our findings complement those of a recent study by Sills et al.,11 who evaluated hospital readmissions and proposed risk adjustment for performance reimbursement. This study found that hospital population-level race, ethnicity, insurance status, and household income were predictors of hospital readmission after discharge.

Of note, our data did not identify similar site-level attributes related to the population served that correlated with RVA rates. We postulate that the need for admission on RV may indicate an inherent clinical urgency or medical need associated with the return to the ED, whereas RV without admission may be related more to patient- or population-level sociodemographic factors than to quality of care and clinical course, which influence ED utilization.1,3,30 EDs treating higher proportions of patients of minority race or ethnicity, those with fewer financial resources, and those in more need of government health insurance are at higher risk for ED revisits.

We observed that increased PEM fellowship-trained physician staffing was associated with decreased RV rates. The availability of specialty-trained physicians in PEM may allow a larger proportion of patients treated by physicians with honed clinical skills for the patient population. Data from a single pediatric center showed PEM fellowship-trained physicians had admission rates lower than those of their counterparts without subspecialty fellowship training.34 The lower RV rate for this group in our study is especially interesting in light of previously reported lower admission rates at index visit in PEM trained physicians. With lower index admission rates, it may have been assumed that visits associated with PEM trained physician care would have an increased (rather than decreased) chance of RV. In addition, we noted the increased RVA rates were associated with longer waits to see a physician. These measures may indicate the effect of institutional access to robust resources (the ability to hire and support more specialty-trained physicians). These novel findings warrant further evaluation, particularly as our sample included only pediatric centers.

Our survey data demonstrated the impact that access to care has on ED RV rates. The ED medical directors indicated that limited access to outpatient appointments with PCPs and specialists was an important factor increasing ED RVs and a potential avenue for interventions. As the 2 open-ended questions addressed barriers and potential solutions, it is interesting that the respondents cited access to care and discharge instructions as the largest barriers and identified innovations in access to care and discharge education as important potential remedies.

This study demonstrated that, at the hospital level, ED RV quality measures are influenced by complex and varied SDHs that primarily reflect the characteristics of the patient populations served. Prior work has similarly highlighted the importance of gaining a rigorous understanding of other quality measures before widespread use, reporting, and dissemination of results.11,35-38 With this in mind, as quality measures are developed and implemented, care should be taken to ensure they accurately and appropriately reflect the quality of care provided to the patient and are not more representative of other factors not directly within institutional control. These findings call into question the usefulness of ED RVs as a quality measure for comparing institutions.

 

 

Study Limitations

This study had several limitations. The PHIS dataset tracks only patients within each institution and does not include RVs to other EDs, which may account for a proportion of RVs.39 Our survey response rate was 68% among medical directors, excluding 11 hospitals from analysis, which decreased the study’s power to detect differences that may be present between groups. In addition, the generalizability of our findings may be limited to tertiary-care children’s hospitals, as the PHIS dataset included only these types of healthcare facilities. We also included data only from the sites’ main EDs, and therefore cannot know if our results are applicable to satellite EDs. ED staffing of PEM physicians was analyzed using FTEs. However, number of clinical hours in 1 FTE may vary among sites, leading to imprecision in this hospital characteristic.

CONCLUSION

Hospitals with the highest RV rates served populations with a larger proportion of patients with government insurance and lower household income, and these hospitals had fewer PEM trained physicians. Variation in RV rates among hospitals may be indicative of the SDHs of their unique patient populations. ED revisit rates should be used cautiously in determining the quality of care of hospitals providing care to differing populations.

Disclosure

Nothing to report.

 

Files
References

1. Goldman RD, Kapoor A, Mehta S. Children admitted to the hospital after returning to the emergency department within 72 hours. Pediatr Emerg Care. 2011;27(9):808-811. PubMed
2. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
3. Akenroye AT, Thurm CW, Neuman MI, et al. Prevalence and predictors of return visits to pediatric emergency departments. J Hosp Med. 2014;9(12):779-787. PubMed
4. Gallagher RA, Porter S, Monuteaux MC, Stack AM. Unscheduled return visits to the emergency department: the impact of language. Pediatr Emerg Care. 2013;29(5):579-583. PubMed
5. Sørup CM, Jacobsen P, Forberg JL. Evaluation of emergency department performance—a systematic review on recommended performance and quality-in-care measures. Scand J Trauma Resusc Emerg Med. 2013;21:62. PubMed
6. Gabayan GZ, Asch SM, Hsia RY, et al. Factors associated with short-term bounce-back admissions after emergency department discharge. Ann Emerg Med. 2013;62(2):136-144.e1. PubMed
7. Ali AB, Place R, Howell J, Malubay SM. Early pediatric emergency department return visits: a prospective patient-centric assessment. Clin Pediatr (Phila). 2012;51(7):651-658. PubMed
8. Alessandrini EA, Lavelle JM, Grenfell SM, Jacobstein CR, Shaw KN. Return visits to a pediatric emergency department. Pediatr Emerg Care. 2004;20(3):166-171. PubMed
9. Goldman RD, Ong M, Macpherson A. Unscheduled return visits to the pediatric emergency department—one-year experience. Pediatr Emerg Care. 2006;22(8):545-549. PubMed
10. Berry JG, Toomey SL, Zaslavsky AM, et al. Pediatric readmission prevalence and variability across hospitals. JAMA. 2013;309(4):372-380. PubMed
11. Sills MR, Hall M, Colvin JD, et al. Association of social determinants with children’s hospitals’ preventable readmissions performance. JAMA Pediatr. 2016;170(4):350-358. PubMed
12. Fiscella K, Burstin HR, Nerenz DR. Quality measures and sociodemographic risk factors: to adjust or not to adjust. JAMA. 2014;312(24):2615-2616. PubMed
13. Lipstein SH, Dunagan WC. The risks of not adjusting performance measures for sociodemographic factors. Ann Intern Med. 2014;161(8):594-596. PubMed
14. Berry JG, Hall DE, Kuo DZ, et al. Hospital utilization and characteristics of patients experiencing recurrent readmissions within children’s hospitals. JAMA. 2011;305(7):682-690. PubMed
15. Bourgeois FT, Monuteaux MC, Stack AM, Neuman MI. Variation in emergency department admission rates in US children’s hospitals. Pediatrics. 2014;134(3):539-545. PubMed
16. Fletcher DM. Achieving data quality. How data from a pediatric health information system earns the trust of its users. J AHIMA. 2004;75(10):22-26. PubMed
17. US Census Bureau. US Census current estimates data. 2014. https://www.census.gov/programs-surveys/popest/data/data-sets.2014.html. Accessed June 2015.
18. Alessandrini EA, Alpern ER, Chamberlain JM, Shea JA, Gorelick MH. A new diagnosis grouping system for child emergency department visits. Acad Emerg Med. 2010;17(2):204-213. PubMed
19. Feudtner C, Levin JE, Srivastava R, et al. How well can hospital readmission be predicted in a cohort of hospitalized children? A retrospective, multicenter study. Pediatrics. 2009;123(1):286-293. PubMed
20. Feinstein JA, Feudtner C, Kempe A. Adverse drug event–related emergency department visits associated with complex chronic conditions. Pediatrics. 2014;133(6):e1575-e1585. PubMed
21. Simon TD, Berry J, Feudtner C, et al. Children with complex chronic conditions in inpatient hospital settings in the United States. Pediatrics. 2010;126(4):647-655. PubMed
22. Dartmouth Medical School, Center for Evaluative Clinical Sciences. The Dartmouth Atlas of Health Care. Chicago, IL: American Hospital Publishing; 2015. 
23. Kansagara D, Englander H, Salanitro A, et al. Risk prediction models for hospital readmission: a systematic review. JAMA. 2011;306(15):1688-1698. PubMed
24. Lawrence LM, Jenkins CA, Zhou C, Givens TG. The effect of diagnosis-specific computerized discharge instructions on 72-hour return visits to the pediatric emergency department. Pediatr Emerg Care. 2009;25(11):733-738. PubMed
25. National Quality Forum. National Quality Forum issue brief: strengthening pediatric quality measurement and reporting. J Healthc Qual. 2008;30(3):51-55. PubMed
26. Rising KL, Victor TW, Hollander JE, Carr BG. Patient returns to the emergency department: the time-to-return curve. Acad Emerg Med. 2014;21(8):864-871. PubMed
27. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
28. Adekoya N. Patients seen in emergency departments who had a prior visit within the previous 72 h—National Hospital Ambulatory Medical Care Survey, 2002. Public Health. 2005;119(10):914-918. PubMed
29. Mittal MK, Zorc JJ, Garcia-Espana JF, Shaw KN. An assessment of clinical performance measures for pediatric emergency physicians. Am J Med Qual. 2013;28(1):33-39. PubMed
30. Depiero AD, Ochsenschlager DW, Chamberlain JM. Analysis of pediatric hospitalizations after emergency department release as a quality improvement tool. Ann Emerg Med. 2002;39(2):159-163. PubMed
31. Sung SF, Liu KE, Chen SC, Lo CL, Lin KC, Hu YH. Predicting factors and risk stratification for return visits to the emergency department within 72 hours in pediatric patients. Pediatr Emerg Care. 2015;31(12):819-824. PubMed
32. Jacobstein CR, Alessandrini EA, Lavelle JM, Shaw KN. Unscheduled revisits to a pediatric emergency department: risk factors for children with fever or infection-related complaints. Pediatr Emerg Care. 2005;21(12):816-821. PubMed
33. Barnett ML, Hsu J, McWilliams J. Patient characteristics and differences in hospital readmission rates. JAMA Intern Med. 2015;175(11):1803-1812. PubMed
34. Gaucher N, Bailey B, Gravel J. Impact of physicians’ characteristics on the admission risk among children visiting a pediatric emergency department. Pediatr Emerg Care. 2012;28(2):120-124. PubMed
35. McHugh M, Neimeyer J, Powell E, Khare RK, Adams JG. An early look at performance on the emergency care measures included in Medicare’s hospital inpatient Value-Based Purchasing Program. Ann Emerg Med. 2013;61(6):616-623.e2. PubMed
36. Axon RN, Williams MV. Hospital readmission as an accountability measure. JAMA. 2011;305(5):504-505. PubMed
37. Adams JG. Ensuring the quality of quality metrics for emergency care. JAMA. 2016;315(7):659-660. PubMed
38. Payne NR, Flood A. Preventing pediatric readmissions: which ones and how? J Pediatr. 2015;166(3):519-520. PubMed
39. Khan A, Nakamura MM, Zaslavsky AM, et al. Same-hospital readmission rates as a measure of pediatric quality of care. JAMA Pediatr. 2015;169(10):905-912. PubMed

Article PDF
Issue
Journal of Hospital Medicine 12(7)
Topics
Page Number
536-543
Sections
Files
Files
Article PDF
Article PDF

Return visit (RV) rate is a quality measure commonly used in the emergency department (ED) setting. This metric may represent suboptimal care at the index ED visit.1-5 Although patient- and visit-level factors affecting ED RVs have been evaluated,1,3,4,6-9 hospital-level factors and factors of a hospital’s patient population that may play roles in ED RV rates have not been examined. Identifying the factors associated with increased RVs may allow resources to be designated to areas that improve emergent care for children.10

Hospital readmission rates are a closely followed quality measure and are linked to reimbursement by the federal government, but a recent study found the influence a hospital can have on this marker may be mitigated by the impact of the social determinates of health (SDHs) of the hospital’s patient population.11 That study and others have prompted an ongoing debate about adjusting quality measures for SDHs.12,13 A clearer understanding of these interactions may permit us to focus on factors that can truly lead to improvement in care instead of penalizing practitioners or hospitals that provide care to those most in need.

Prior work has identified several SDHs associated with higher ED RV rates in patient- or visit-level analyses.3,11,14 We conducted a study of hospital-level characteristics and characteristics of a hospital’s patient population to identify potentially mutable factors associated with increased ED RV rates that, once recognized, may allow for improvement in this quality measure.

PATIENTS AND METHODS

This study was not considered human subjects research in accordance with Common Rule 45 CFR§46.104(f) and was evaluated by the Ann and Robert H. Lurie Children’s Hospital and Northwestern University Feinberg School of Medicine Institutional Review Boards and deemed exempt from review.

Study Population and Protocol

Our study had 2 data sources (to be described in detail): the Pediatric Health Information System (PHIS) and a survey of ED medical directors of the hospitals represented within PHIS. Hospitals were eligible for inclusion in the study if their data (1) met PHIS quality control standards for ED patient visits as determined by internal data assurance processes incorporated in PHIS,3,14,15 (2) included data only from an identifiable single main ED, and (3) completed the ED medical director’s survey.

 

 

PHIS Database

PHIS, an administrative database managed by Truven Health Analytics, includes data from ED, ambulatory surgery, observation, and inpatient encounters across Children’s Hospital Association member children’s hospitals in North America. Data are subjected to validity checks before being included in the database.16 PHIS assigns unique patient identifiers to track individual patient visits within participating institutions over time.

Hospitals were described by percentages of ED patients in several groups: age (<1, 1-4, 5-9, 10-14, and 15-18 years)17; sex; race/ethnicity; insurance type (commercial, government, other); ED International Classification of Diseases, Ninth Edition (ICD-9) diagnosis code–based severity classification system score (1-2, low severity; 3-5, high severity)18; complex chronic condition presence at ED visits in prior year14,19-21; home postal (Zip) code median household income from 2010 US Census data compared with Federal Poverty Level (<1.5, 1.5-2, 2-3, and >3 × FPL)17; and primary care physician (PCP) density in Federal Health Service Area of patient’s home address as reported by Dartmouth Atlas of Health Care modeled by quartiles.22 Density of PCPs—general pediatricians, family practitioners, general practitioners, and general internists—is calculated as number of PCPs per 100,000 residents. We used PCP density to account for potential care provided by any of the PCPs mentioned. We also assessed, at hospital level, index visit arrival time (8:01 am to 4:00 pm; 4:01 pm to 12:00 am; 12:01 am to 8:00 am) and index visit season.23

ED Medical Director Survey

A web-based survey was constructed in an iterative process based on literature review and expert opinion to assess hospital-level factors that may impact ED RV rates.3,7,24-26 The survey was piloted at 3 institutions to refine its structure and content.

The survey included 15 close-ended or multiple-choice questions on ED environment and operations and 2 open-ended questions, “What is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?” and “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit ?” (questionnaire in Supplemental material). Hospital characteristics from the survey included total clinical time allotment, or full-time equivalent (FTE), among all physicians, pediatric emergency medicine (PEM) fellowship-trained physicians, and all other (non-PEM) physicians. The data were standardized across sites by calculating FTE-per-10,000-visits values for each hospital; median duration of ED visit for admitted and discharged patients; median time from arrival to ED physician evaluation; rate of leaving without being seen; discharge educational material authorship and age specificity; follow-up visit scheduling procedure; and percentage of ED patients for whom English was a second language.

Responses to the 2 open-ended questions were independently categorized by Drs. Pittsenbarger and Alpern. Responses could be placed in more than 1 category if multiple answers to the question were included in the response. Categorizations were compared for consistency, and any inconsistencies were resolved by the consensus of the study investigators.

Outcome Measures From PHIS Database

All ED visits within a 12-month period (July 1, 2013–June 30, 2014) by patients younger than 18 years at time of index ED visit were eligible for inclusion in the study. An index visit was defined as any ED visit without another ED visit within the preceding 72 hours. The 72-hour time frame was used because it is the most widely studied time frame for ED RVs.5 Index ED visits that led to admission, observation status, death, or transfer were excluded.

The 2 primary outcomes of interest were (1) RVs within 72 hours of index ED visit discharge and (2) RVs within 72 hours that resulted in hospital admission or observation status at the next ED visit (RVA).7,9,27-30 For patients with multiple ED revisits within 72 hours, only the first was assessed. There was a 72-hour minimum between index visits for the same patient.

Statistical Analyses

To determine hospital groups based on RV and RVA rates, we adjusted RV and RVA rates using generalized linear mixed-effects models, controlling for clustering and allowing for correlated data (within hospitals), nonconstant variability (across hospitals), and non-normally distributed data, as we did in a study of patient-level factors associated with ED RV and RVA.3 For each calculated rate (RV, RVA), the hospitals were then classified into 3 groups based on whether the hospital’s adjusted RV and RVA rates were outside 2 SDs from the mean, below the 5th or above the 95th percentile, or within that range. These groups were labeled lowest outliers, highest outliers, and average-performing hospitals.

After the groups of hospitals were determined, we returned to using unadjusted data to statistically analyze them. We summarized continuous variables using minimum and maximum values, medians, and interquartile ranges (IQRs). We present categorical variables using counts and percentages. To identify hospital characteristics with the most potential to gain from improvement, we also analyzed associations using 2 collapsed groups: hospitals with RV (or RVA) rates included in the average-performing and lowest outlier groups and hospitals within the highest outlier group. Hospital characteristics and hospital’s patient population characteristics from the surveys are summarized based on RV and RVA rate groups. Differences in distributions among continuous variables were assessed by Kruskal-Wallis 1-way analysis of variance. Chi-square tests were used to evaluate differences in proportions among categorical variables. All statistical analyses were performed with SAS Version 9.4 (SAS Institute); 2-sided P < 0.05 was considered statistically significant.

 

 

RESULTS

Return Visit Rates and Hospital ED Site Population Characteristics

Twenty-four of 35 (68%) eligible hospitals that met PHIS quality control standards for ED patient visits responded to the ED medical director survey. The included hospitals that both met quality control standards and completed the survey had a total of 1,456,377 patient visits during the study period. Individual sites had annual volumes ranging from 26,627 to 96,637 ED encounters. The mean RV rate across the institutions was 3.7% (range, 3.0%-4.8%), and the mean RVA rate across the hospitals was 0.7% (range, 0.5%-1.1%) (Figure).

Figure

There were 5 hospitals with RV rates less than 2 SDs of the mean rate, placing them in the lowest outlier group for RV; 13 hospitals with RV rates within 2 SDs of the mean RV rate, placing them in the average-performing group; and 6 hospitals with RV rates above 2 SDs of the mean, placing them in the highest outlier group. Table 1 lists the hospital ED site population characteristics among the 3 RV rate groups. Hospitals in the highest outlier group served populations with higher proportions of patients with insurance from a government payer, lower proportions of patients covered by a commercial insurance plan, and higher proportion of patients with lower median household incomes.

Table 1

In the RVA analysis, there were 6 hospitals with RVA rates less than 2 SDs of the mean RVA rate (lowest outliers); 14 hospitals with RVA rates within 2 SDs of the mean RVA rate (average performers); and 4 hospitals with RVA rates above 2 SDs of the mean RVA rate (highest outliers). When using these groups based on RVA rate, there were no statistically significant differences in hospital ED site population characteristics (Supplemental Table 1).

RV Rates and Hospital-Level Factors Survey Characteristics

Table 2 lists the ED medical director survey hospital-level data among the 3 RV rate groups. There were fewer FTEs by PEM fellowship-trained physicians per 10,000 patient visits at sites with higher RV rates (Table 2). Hospital-level characteristics assessed by the survey were not associated with RVA rates (Supplemental Table 2).

Table 2

Evaluating characteristics of hospitals with the most potential to gain from improvement, hospitals with the highest RV rates (highest outlier group), compared with hospitals in the lowest outlier and average-performing groups collapsed together, persisted in having fewer PEM fellowship-trained physician FTEs per patient visit (Table 3). A similar collapsed analysis of RVA rates demonstrated that hospitals in the highest outlier group had longer-wait-to-physician time (81 minutes; IQR, 51-105 minutes) compared with hospitals in the other 2 groups (30 minutes; IQR, 19-42.5 minutes) (Table 3).

Table 3

In response to the first qualitative question on the ED medial director survey, “In your opinion, what is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?”, 15 directors (62.5%) reported limited access to primary care, 4 (16.6%) reported inadequate discharge instructions and/or education provided, and 3 (12.5%) reported lack of access to specialist care. To the second question, “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit for the same condition?”, they responded that RVs could be reduced by innovations in scheduling primary care or specialty follow-up visits (19, 79%), improving discharge education and instructions (6, 25%), and identifying more case management or care coordination (4, 16.6%).

DISCUSSION

Other studies have identified patient- and visit-level characteristics associated with higher ED RV and RVA rates.3,8,9,31 However, as our goal was to identify possible modifiable institutional features, our study examined factors at hospital and population-served levels (instead of patient or visit level) that may impact ED RV and RVA rates. Interestingly, our sample of tertiary-care pediatric center EDs provided evidence of variability in RV and RVA rates. We identified factors associated with RV rates related to the SDHs of the populations served by the ED, which suggests these factors are not modifiable at an institution level. In addition, we found that the increased availability of PEM providers per patient visit correlated with fewer ED RVs.

Hospitals serving ED populations with more government-insured and fewer commercially insured patients had higher rates of return to the ED. Similarly, hospitals with larger proportions of patients from areas with lower median household incomes had higher RV rates. These factors may indicate that patients with limited resources may have more frequent ED RVs,3,6,32,33 and hospitals that serve them have higher ED RV rates. Our findings complement those of a recent study by Sills et al.,11 who evaluated hospital readmissions and proposed risk adjustment for performance reimbursement. This study found that hospital population-level race, ethnicity, insurance status, and household income were predictors of hospital readmission after discharge.

Of note, our data did not identify similar site-level attributes related to the population served that correlated with RVA rates. We postulate that the need for admission on RV may indicate an inherent clinical urgency or medical need associated with the return to the ED, whereas RV without admission may be related more to patient- or population-level sociodemographic factors than to quality of care and clinical course, which influence ED utilization.1,3,30 EDs treating higher proportions of patients of minority race or ethnicity, those with fewer financial resources, and those in more need of government health insurance are at higher risk for ED revisits.

We observed that increased PEM fellowship-trained physician staffing was associated with decreased RV rates. The availability of specialty-trained physicians in PEM may allow a larger proportion of patients treated by physicians with honed clinical skills for the patient population. Data from a single pediatric center showed PEM fellowship-trained physicians had admission rates lower than those of their counterparts without subspecialty fellowship training.34 The lower RV rate for this group in our study is especially interesting in light of previously reported lower admission rates at index visit in PEM trained physicians. With lower index admission rates, it may have been assumed that visits associated with PEM trained physician care would have an increased (rather than decreased) chance of RV. In addition, we noted the increased RVA rates were associated with longer waits to see a physician. These measures may indicate the effect of institutional access to robust resources (the ability to hire and support more specialty-trained physicians). These novel findings warrant further evaluation, particularly as our sample included only pediatric centers.

Our survey data demonstrated the impact that access to care has on ED RV rates. The ED medical directors indicated that limited access to outpatient appointments with PCPs and specialists was an important factor increasing ED RVs and a potential avenue for interventions. As the 2 open-ended questions addressed barriers and potential solutions, it is interesting that the respondents cited access to care and discharge instructions as the largest barriers and identified innovations in access to care and discharge education as important potential remedies.

This study demonstrated that, at the hospital level, ED RV quality measures are influenced by complex and varied SDHs that primarily reflect the characteristics of the patient populations served. Prior work has similarly highlighted the importance of gaining a rigorous understanding of other quality measures before widespread use, reporting, and dissemination of results.11,35-38 With this in mind, as quality measures are developed and implemented, care should be taken to ensure they accurately and appropriately reflect the quality of care provided to the patient and are not more representative of other factors not directly within institutional control. These findings call into question the usefulness of ED RVs as a quality measure for comparing institutions.

 

 

Study Limitations

This study had several limitations. The PHIS dataset tracks only patients within each institution and does not include RVs to other EDs, which may account for a proportion of RVs.39 Our survey response rate was 68% among medical directors, excluding 11 hospitals from analysis, which decreased the study’s power to detect differences that may be present between groups. In addition, the generalizability of our findings may be limited to tertiary-care children’s hospitals, as the PHIS dataset included only these types of healthcare facilities. We also included data only from the sites’ main EDs, and therefore cannot know if our results are applicable to satellite EDs. ED staffing of PEM physicians was analyzed using FTEs. However, number of clinical hours in 1 FTE may vary among sites, leading to imprecision in this hospital characteristic.

CONCLUSION

Hospitals with the highest RV rates served populations with a larger proportion of patients with government insurance and lower household income, and these hospitals had fewer PEM trained physicians. Variation in RV rates among hospitals may be indicative of the SDHs of their unique patient populations. ED revisit rates should be used cautiously in determining the quality of care of hospitals providing care to differing populations.

Disclosure

Nothing to report.

 

Return visit (RV) rate is a quality measure commonly used in the emergency department (ED) setting. This metric may represent suboptimal care at the index ED visit.1-5 Although patient- and visit-level factors affecting ED RVs have been evaluated,1,3,4,6-9 hospital-level factors and factors of a hospital’s patient population that may play roles in ED RV rates have not been examined. Identifying the factors associated with increased RVs may allow resources to be designated to areas that improve emergent care for children.10

Hospital readmission rates are a closely followed quality measure and are linked to reimbursement by the federal government, but a recent study found the influence a hospital can have on this marker may be mitigated by the impact of the social determinates of health (SDHs) of the hospital’s patient population.11 That study and others have prompted an ongoing debate about adjusting quality measures for SDHs.12,13 A clearer understanding of these interactions may permit us to focus on factors that can truly lead to improvement in care instead of penalizing practitioners or hospitals that provide care to those most in need.

Prior work has identified several SDHs associated with higher ED RV rates in patient- or visit-level analyses.3,11,14 We conducted a study of hospital-level characteristics and characteristics of a hospital’s patient population to identify potentially mutable factors associated with increased ED RV rates that, once recognized, may allow for improvement in this quality measure.

PATIENTS AND METHODS

This study was not considered human subjects research in accordance with Common Rule 45 CFR§46.104(f) and was evaluated by the Ann and Robert H. Lurie Children’s Hospital and Northwestern University Feinberg School of Medicine Institutional Review Boards and deemed exempt from review.

Study Population and Protocol

Our study had 2 data sources (to be described in detail): the Pediatric Health Information System (PHIS) and a survey of ED medical directors of the hospitals represented within PHIS. Hospitals were eligible for inclusion in the study if their data (1) met PHIS quality control standards for ED patient visits as determined by internal data assurance processes incorporated in PHIS,3,14,15 (2) included data only from an identifiable single main ED, and (3) completed the ED medical director’s survey.

 

 

PHIS Database

PHIS, an administrative database managed by Truven Health Analytics, includes data from ED, ambulatory surgery, observation, and inpatient encounters across Children’s Hospital Association member children’s hospitals in North America. Data are subjected to validity checks before being included in the database.16 PHIS assigns unique patient identifiers to track individual patient visits within participating institutions over time.

Hospitals were described by percentages of ED patients in several groups: age (<1, 1-4, 5-9, 10-14, and 15-18 years)17; sex; race/ethnicity; insurance type (commercial, government, other); ED International Classification of Diseases, Ninth Edition (ICD-9) diagnosis code–based severity classification system score (1-2, low severity; 3-5, high severity)18; complex chronic condition presence at ED visits in prior year14,19-21; home postal (Zip) code median household income from 2010 US Census data compared with Federal Poverty Level (<1.5, 1.5-2, 2-3, and >3 × FPL)17; and primary care physician (PCP) density in Federal Health Service Area of patient’s home address as reported by Dartmouth Atlas of Health Care modeled by quartiles.22 Density of PCPs—general pediatricians, family practitioners, general practitioners, and general internists—is calculated as number of PCPs per 100,000 residents. We used PCP density to account for potential care provided by any of the PCPs mentioned. We also assessed, at hospital level, index visit arrival time (8:01 am to 4:00 pm; 4:01 pm to 12:00 am; 12:01 am to 8:00 am) and index visit season.23

ED Medical Director Survey

A web-based survey was constructed in an iterative process based on literature review and expert opinion to assess hospital-level factors that may impact ED RV rates.3,7,24-26 The survey was piloted at 3 institutions to refine its structure and content.

The survey included 15 close-ended or multiple-choice questions on ED environment and operations and 2 open-ended questions, “What is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?” and “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit ?” (questionnaire in Supplemental material). Hospital characteristics from the survey included total clinical time allotment, or full-time equivalent (FTE), among all physicians, pediatric emergency medicine (PEM) fellowship-trained physicians, and all other (non-PEM) physicians. The data were standardized across sites by calculating FTE-per-10,000-visits values for each hospital; median duration of ED visit for admitted and discharged patients; median time from arrival to ED physician evaluation; rate of leaving without being seen; discharge educational material authorship and age specificity; follow-up visit scheduling procedure; and percentage of ED patients for whom English was a second language.

Responses to the 2 open-ended questions were independently categorized by Drs. Pittsenbarger and Alpern. Responses could be placed in more than 1 category if multiple answers to the question were included in the response. Categorizations were compared for consistency, and any inconsistencies were resolved by the consensus of the study investigators.

Outcome Measures From PHIS Database

All ED visits within a 12-month period (July 1, 2013–June 30, 2014) by patients younger than 18 years at time of index ED visit were eligible for inclusion in the study. An index visit was defined as any ED visit without another ED visit within the preceding 72 hours. The 72-hour time frame was used because it is the most widely studied time frame for ED RVs.5 Index ED visits that led to admission, observation status, death, or transfer were excluded.

The 2 primary outcomes of interest were (1) RVs within 72 hours of index ED visit discharge and (2) RVs within 72 hours that resulted in hospital admission or observation status at the next ED visit (RVA).7,9,27-30 For patients with multiple ED revisits within 72 hours, only the first was assessed. There was a 72-hour minimum between index visits for the same patient.

Statistical Analyses

To determine hospital groups based on RV and RVA rates, we adjusted RV and RVA rates using generalized linear mixed-effects models, controlling for clustering and allowing for correlated data (within hospitals), nonconstant variability (across hospitals), and non-normally distributed data, as we did in a study of patient-level factors associated with ED RV and RVA.3 For each calculated rate (RV, RVA), the hospitals were then classified into 3 groups based on whether the hospital’s adjusted RV and RVA rates were outside 2 SDs from the mean, below the 5th or above the 95th percentile, or within that range. These groups were labeled lowest outliers, highest outliers, and average-performing hospitals.

After the groups of hospitals were determined, we returned to using unadjusted data to statistically analyze them. We summarized continuous variables using minimum and maximum values, medians, and interquartile ranges (IQRs). We present categorical variables using counts and percentages. To identify hospital characteristics with the most potential to gain from improvement, we also analyzed associations using 2 collapsed groups: hospitals with RV (or RVA) rates included in the average-performing and lowest outlier groups and hospitals within the highest outlier group. Hospital characteristics and hospital’s patient population characteristics from the surveys are summarized based on RV and RVA rate groups. Differences in distributions among continuous variables were assessed by Kruskal-Wallis 1-way analysis of variance. Chi-square tests were used to evaluate differences in proportions among categorical variables. All statistical analyses were performed with SAS Version 9.4 (SAS Institute); 2-sided P < 0.05 was considered statistically significant.

 

 

RESULTS

Return Visit Rates and Hospital ED Site Population Characteristics

Twenty-four of 35 (68%) eligible hospitals that met PHIS quality control standards for ED patient visits responded to the ED medical director survey. The included hospitals that both met quality control standards and completed the survey had a total of 1,456,377 patient visits during the study period. Individual sites had annual volumes ranging from 26,627 to 96,637 ED encounters. The mean RV rate across the institutions was 3.7% (range, 3.0%-4.8%), and the mean RVA rate across the hospitals was 0.7% (range, 0.5%-1.1%) (Figure).

Figure

There were 5 hospitals with RV rates less than 2 SDs of the mean rate, placing them in the lowest outlier group for RV; 13 hospitals with RV rates within 2 SDs of the mean RV rate, placing them in the average-performing group; and 6 hospitals with RV rates above 2 SDs of the mean, placing them in the highest outlier group. Table 1 lists the hospital ED site population characteristics among the 3 RV rate groups. Hospitals in the highest outlier group served populations with higher proportions of patients with insurance from a government payer, lower proportions of patients covered by a commercial insurance plan, and higher proportion of patients with lower median household incomes.

Table 1

In the RVA analysis, there were 6 hospitals with RVA rates less than 2 SDs of the mean RVA rate (lowest outliers); 14 hospitals with RVA rates within 2 SDs of the mean RVA rate (average performers); and 4 hospitals with RVA rates above 2 SDs of the mean RVA rate (highest outliers). When using these groups based on RVA rate, there were no statistically significant differences in hospital ED site population characteristics (Supplemental Table 1).

RV Rates and Hospital-Level Factors Survey Characteristics

Table 2 lists the ED medical director survey hospital-level data among the 3 RV rate groups. There were fewer FTEs by PEM fellowship-trained physicians per 10,000 patient visits at sites with higher RV rates (Table 2). Hospital-level characteristics assessed by the survey were not associated with RVA rates (Supplemental Table 2).

Table 2

Evaluating characteristics of hospitals with the most potential to gain from improvement, hospitals with the highest RV rates (highest outlier group), compared with hospitals in the lowest outlier and average-performing groups collapsed together, persisted in having fewer PEM fellowship-trained physician FTEs per patient visit (Table 3). A similar collapsed analysis of RVA rates demonstrated that hospitals in the highest outlier group had longer-wait-to-physician time (81 minutes; IQR, 51-105 minutes) compared with hospitals in the other 2 groups (30 minutes; IQR, 19-42.5 minutes) (Table 3).

Table 3

In response to the first qualitative question on the ED medial director survey, “In your opinion, what is the largest barrier to reducing the number of return visits within 72 hours of discharge from a previous ED visit?”, 15 directors (62.5%) reported limited access to primary care, 4 (16.6%) reported inadequate discharge instructions and/or education provided, and 3 (12.5%) reported lack of access to specialist care. To the second question, “In your opinion, what is the best way of reducing the number of the return visits within 72 hours of previous ED visit for the same condition?”, they responded that RVs could be reduced by innovations in scheduling primary care or specialty follow-up visits (19, 79%), improving discharge education and instructions (6, 25%), and identifying more case management or care coordination (4, 16.6%).

DISCUSSION

Other studies have identified patient- and visit-level characteristics associated with higher ED RV and RVA rates.3,8,9,31 However, as our goal was to identify possible modifiable institutional features, our study examined factors at hospital and population-served levels (instead of patient or visit level) that may impact ED RV and RVA rates. Interestingly, our sample of tertiary-care pediatric center EDs provided evidence of variability in RV and RVA rates. We identified factors associated with RV rates related to the SDHs of the populations served by the ED, which suggests these factors are not modifiable at an institution level. In addition, we found that the increased availability of PEM providers per patient visit correlated with fewer ED RVs.

Hospitals serving ED populations with more government-insured and fewer commercially insured patients had higher rates of return to the ED. Similarly, hospitals with larger proportions of patients from areas with lower median household incomes had higher RV rates. These factors may indicate that patients with limited resources may have more frequent ED RVs,3,6,32,33 and hospitals that serve them have higher ED RV rates. Our findings complement those of a recent study by Sills et al.,11 who evaluated hospital readmissions and proposed risk adjustment for performance reimbursement. This study found that hospital population-level race, ethnicity, insurance status, and household income were predictors of hospital readmission after discharge.

Of note, our data did not identify similar site-level attributes related to the population served that correlated with RVA rates. We postulate that the need for admission on RV may indicate an inherent clinical urgency or medical need associated with the return to the ED, whereas RV without admission may be related more to patient- or population-level sociodemographic factors than to quality of care and clinical course, which influence ED utilization.1,3,30 EDs treating higher proportions of patients of minority race or ethnicity, those with fewer financial resources, and those in more need of government health insurance are at higher risk for ED revisits.

We observed that increased PEM fellowship-trained physician staffing was associated with decreased RV rates. The availability of specialty-trained physicians in PEM may allow a larger proportion of patients treated by physicians with honed clinical skills for the patient population. Data from a single pediatric center showed PEM fellowship-trained physicians had admission rates lower than those of their counterparts without subspecialty fellowship training.34 The lower RV rate for this group in our study is especially interesting in light of previously reported lower admission rates at index visit in PEM trained physicians. With lower index admission rates, it may have been assumed that visits associated with PEM trained physician care would have an increased (rather than decreased) chance of RV. In addition, we noted the increased RVA rates were associated with longer waits to see a physician. These measures may indicate the effect of institutional access to robust resources (the ability to hire and support more specialty-trained physicians). These novel findings warrant further evaluation, particularly as our sample included only pediatric centers.

Our survey data demonstrated the impact that access to care has on ED RV rates. The ED medical directors indicated that limited access to outpatient appointments with PCPs and specialists was an important factor increasing ED RVs and a potential avenue for interventions. As the 2 open-ended questions addressed barriers and potential solutions, it is interesting that the respondents cited access to care and discharge instructions as the largest barriers and identified innovations in access to care and discharge education as important potential remedies.

This study demonstrated that, at the hospital level, ED RV quality measures are influenced by complex and varied SDHs that primarily reflect the characteristics of the patient populations served. Prior work has similarly highlighted the importance of gaining a rigorous understanding of other quality measures before widespread use, reporting, and dissemination of results.11,35-38 With this in mind, as quality measures are developed and implemented, care should be taken to ensure they accurately and appropriately reflect the quality of care provided to the patient and are not more representative of other factors not directly within institutional control. These findings call into question the usefulness of ED RVs as a quality measure for comparing institutions.

 

 

Study Limitations

This study had several limitations. The PHIS dataset tracks only patients within each institution and does not include RVs to other EDs, which may account for a proportion of RVs.39 Our survey response rate was 68% among medical directors, excluding 11 hospitals from analysis, which decreased the study’s power to detect differences that may be present between groups. In addition, the generalizability of our findings may be limited to tertiary-care children’s hospitals, as the PHIS dataset included only these types of healthcare facilities. We also included data only from the sites’ main EDs, and therefore cannot know if our results are applicable to satellite EDs. ED staffing of PEM physicians was analyzed using FTEs. However, number of clinical hours in 1 FTE may vary among sites, leading to imprecision in this hospital characteristic.

CONCLUSION

Hospitals with the highest RV rates served populations with a larger proportion of patients with government insurance and lower household income, and these hospitals had fewer PEM trained physicians. Variation in RV rates among hospitals may be indicative of the SDHs of their unique patient populations. ED revisit rates should be used cautiously in determining the quality of care of hospitals providing care to differing populations.

Disclosure

Nothing to report.

 

References

1. Goldman RD, Kapoor A, Mehta S. Children admitted to the hospital after returning to the emergency department within 72 hours. Pediatr Emerg Care. 2011;27(9):808-811. PubMed
2. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
3. Akenroye AT, Thurm CW, Neuman MI, et al. Prevalence and predictors of return visits to pediatric emergency departments. J Hosp Med. 2014;9(12):779-787. PubMed
4. Gallagher RA, Porter S, Monuteaux MC, Stack AM. Unscheduled return visits to the emergency department: the impact of language. Pediatr Emerg Care. 2013;29(5):579-583. PubMed
5. Sørup CM, Jacobsen P, Forberg JL. Evaluation of emergency department performance—a systematic review on recommended performance and quality-in-care measures. Scand J Trauma Resusc Emerg Med. 2013;21:62. PubMed
6. Gabayan GZ, Asch SM, Hsia RY, et al. Factors associated with short-term bounce-back admissions after emergency department discharge. Ann Emerg Med. 2013;62(2):136-144.e1. PubMed
7. Ali AB, Place R, Howell J, Malubay SM. Early pediatric emergency department return visits: a prospective patient-centric assessment. Clin Pediatr (Phila). 2012;51(7):651-658. PubMed
8. Alessandrini EA, Lavelle JM, Grenfell SM, Jacobstein CR, Shaw KN. Return visits to a pediatric emergency department. Pediatr Emerg Care. 2004;20(3):166-171. PubMed
9. Goldman RD, Ong M, Macpherson A. Unscheduled return visits to the pediatric emergency department—one-year experience. Pediatr Emerg Care. 2006;22(8):545-549. PubMed
10. Berry JG, Toomey SL, Zaslavsky AM, et al. Pediatric readmission prevalence and variability across hospitals. JAMA. 2013;309(4):372-380. PubMed
11. Sills MR, Hall M, Colvin JD, et al. Association of social determinants with children’s hospitals’ preventable readmissions performance. JAMA Pediatr. 2016;170(4):350-358. PubMed
12. Fiscella K, Burstin HR, Nerenz DR. Quality measures and sociodemographic risk factors: to adjust or not to adjust. JAMA. 2014;312(24):2615-2616. PubMed
13. Lipstein SH, Dunagan WC. The risks of not adjusting performance measures for sociodemographic factors. Ann Intern Med. 2014;161(8):594-596. PubMed
14. Berry JG, Hall DE, Kuo DZ, et al. Hospital utilization and characteristics of patients experiencing recurrent readmissions within children’s hospitals. JAMA. 2011;305(7):682-690. PubMed
15. Bourgeois FT, Monuteaux MC, Stack AM, Neuman MI. Variation in emergency department admission rates in US children’s hospitals. Pediatrics. 2014;134(3):539-545. PubMed
16. Fletcher DM. Achieving data quality. How data from a pediatric health information system earns the trust of its users. J AHIMA. 2004;75(10):22-26. PubMed
17. US Census Bureau. US Census current estimates data. 2014. https://www.census.gov/programs-surveys/popest/data/data-sets.2014.html. Accessed June 2015.
18. Alessandrini EA, Alpern ER, Chamberlain JM, Shea JA, Gorelick MH. A new diagnosis grouping system for child emergency department visits. Acad Emerg Med. 2010;17(2):204-213. PubMed
19. Feudtner C, Levin JE, Srivastava R, et al. How well can hospital readmission be predicted in a cohort of hospitalized children? A retrospective, multicenter study. Pediatrics. 2009;123(1):286-293. PubMed
20. Feinstein JA, Feudtner C, Kempe A. Adverse drug event–related emergency department visits associated with complex chronic conditions. Pediatrics. 2014;133(6):e1575-e1585. PubMed
21. Simon TD, Berry J, Feudtner C, et al. Children with complex chronic conditions in inpatient hospital settings in the United States. Pediatrics. 2010;126(4):647-655. PubMed
22. Dartmouth Medical School, Center for Evaluative Clinical Sciences. The Dartmouth Atlas of Health Care. Chicago, IL: American Hospital Publishing; 2015. 
23. Kansagara D, Englander H, Salanitro A, et al. Risk prediction models for hospital readmission: a systematic review. JAMA. 2011;306(15):1688-1698. PubMed
24. Lawrence LM, Jenkins CA, Zhou C, Givens TG. The effect of diagnosis-specific computerized discharge instructions on 72-hour return visits to the pediatric emergency department. Pediatr Emerg Care. 2009;25(11):733-738. PubMed
25. National Quality Forum. National Quality Forum issue brief: strengthening pediatric quality measurement and reporting. J Healthc Qual. 2008;30(3):51-55. PubMed
26. Rising KL, Victor TW, Hollander JE, Carr BG. Patient returns to the emergency department: the time-to-return curve. Acad Emerg Med. 2014;21(8):864-871. PubMed
27. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
28. Adekoya N. Patients seen in emergency departments who had a prior visit within the previous 72 h—National Hospital Ambulatory Medical Care Survey, 2002. Public Health. 2005;119(10):914-918. PubMed
29. Mittal MK, Zorc JJ, Garcia-Espana JF, Shaw KN. An assessment of clinical performance measures for pediatric emergency physicians. Am J Med Qual. 2013;28(1):33-39. PubMed
30. Depiero AD, Ochsenschlager DW, Chamberlain JM. Analysis of pediatric hospitalizations after emergency department release as a quality improvement tool. Ann Emerg Med. 2002;39(2):159-163. PubMed
31. Sung SF, Liu KE, Chen SC, Lo CL, Lin KC, Hu YH. Predicting factors and risk stratification for return visits to the emergency department within 72 hours in pediatric patients. Pediatr Emerg Care. 2015;31(12):819-824. PubMed
32. Jacobstein CR, Alessandrini EA, Lavelle JM, Shaw KN. Unscheduled revisits to a pediatric emergency department: risk factors for children with fever or infection-related complaints. Pediatr Emerg Care. 2005;21(12):816-821. PubMed
33. Barnett ML, Hsu J, McWilliams J. Patient characteristics and differences in hospital readmission rates. JAMA Intern Med. 2015;175(11):1803-1812. PubMed
34. Gaucher N, Bailey B, Gravel J. Impact of physicians’ characteristics on the admission risk among children visiting a pediatric emergency department. Pediatr Emerg Care. 2012;28(2):120-124. PubMed
35. McHugh M, Neimeyer J, Powell E, Khare RK, Adams JG. An early look at performance on the emergency care measures included in Medicare’s hospital inpatient Value-Based Purchasing Program. Ann Emerg Med. 2013;61(6):616-623.e2. PubMed
36. Axon RN, Williams MV. Hospital readmission as an accountability measure. JAMA. 2011;305(5):504-505. PubMed
37. Adams JG. Ensuring the quality of quality metrics for emergency care. JAMA. 2016;315(7):659-660. PubMed
38. Payne NR, Flood A. Preventing pediatric readmissions: which ones and how? J Pediatr. 2015;166(3):519-520. PubMed
39. Khan A, Nakamura MM, Zaslavsky AM, et al. Same-hospital readmission rates as a measure of pediatric quality of care. JAMA Pediatr. 2015;169(10):905-912. PubMed

References

1. Goldman RD, Kapoor A, Mehta S. Children admitted to the hospital after returning to the emergency department within 72 hours. Pediatr Emerg Care. 2011;27(9):808-811. PubMed
2. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
3. Akenroye AT, Thurm CW, Neuman MI, et al. Prevalence and predictors of return visits to pediatric emergency departments. J Hosp Med. 2014;9(12):779-787. PubMed
4. Gallagher RA, Porter S, Monuteaux MC, Stack AM. Unscheduled return visits to the emergency department: the impact of language. Pediatr Emerg Care. 2013;29(5):579-583. PubMed
5. Sørup CM, Jacobsen P, Forberg JL. Evaluation of emergency department performance—a systematic review on recommended performance and quality-in-care measures. Scand J Trauma Resusc Emerg Med. 2013;21:62. PubMed
6. Gabayan GZ, Asch SM, Hsia RY, et al. Factors associated with short-term bounce-back admissions after emergency department discharge. Ann Emerg Med. 2013;62(2):136-144.e1. PubMed
7. Ali AB, Place R, Howell J, Malubay SM. Early pediatric emergency department return visits: a prospective patient-centric assessment. Clin Pediatr (Phila). 2012;51(7):651-658. PubMed
8. Alessandrini EA, Lavelle JM, Grenfell SM, Jacobstein CR, Shaw KN. Return visits to a pediatric emergency department. Pediatr Emerg Care. 2004;20(3):166-171. PubMed
9. Goldman RD, Ong M, Macpherson A. Unscheduled return visits to the pediatric emergency department—one-year experience. Pediatr Emerg Care. 2006;22(8):545-549. PubMed
10. Berry JG, Toomey SL, Zaslavsky AM, et al. Pediatric readmission prevalence and variability across hospitals. JAMA. 2013;309(4):372-380. PubMed
11. Sills MR, Hall M, Colvin JD, et al. Association of social determinants with children’s hospitals’ preventable readmissions performance. JAMA Pediatr. 2016;170(4):350-358. PubMed
12. Fiscella K, Burstin HR, Nerenz DR. Quality measures and sociodemographic risk factors: to adjust or not to adjust. JAMA. 2014;312(24):2615-2616. PubMed
13. Lipstein SH, Dunagan WC. The risks of not adjusting performance measures for sociodemographic factors. Ann Intern Med. 2014;161(8):594-596. PubMed
14. Berry JG, Hall DE, Kuo DZ, et al. Hospital utilization and characteristics of patients experiencing recurrent readmissions within children’s hospitals. JAMA. 2011;305(7):682-690. PubMed
15. Bourgeois FT, Monuteaux MC, Stack AM, Neuman MI. Variation in emergency department admission rates in US children’s hospitals. Pediatrics. 2014;134(3):539-545. PubMed
16. Fletcher DM. Achieving data quality. How data from a pediatric health information system earns the trust of its users. J AHIMA. 2004;75(10):22-26. PubMed
17. US Census Bureau. US Census current estimates data. 2014. https://www.census.gov/programs-surveys/popest/data/data-sets.2014.html. Accessed June 2015.
18. Alessandrini EA, Alpern ER, Chamberlain JM, Shea JA, Gorelick MH. A new diagnosis grouping system for child emergency department visits. Acad Emerg Med. 2010;17(2):204-213. PubMed
19. Feudtner C, Levin JE, Srivastava R, et al. How well can hospital readmission be predicted in a cohort of hospitalized children? A retrospective, multicenter study. Pediatrics. 2009;123(1):286-293. PubMed
20. Feinstein JA, Feudtner C, Kempe A. Adverse drug event–related emergency department visits associated with complex chronic conditions. Pediatrics. 2014;133(6):e1575-e1585. PubMed
21. Simon TD, Berry J, Feudtner C, et al. Children with complex chronic conditions in inpatient hospital settings in the United States. Pediatrics. 2010;126(4):647-655. PubMed
22. Dartmouth Medical School, Center for Evaluative Clinical Sciences. The Dartmouth Atlas of Health Care. Chicago, IL: American Hospital Publishing; 2015. 
23. Kansagara D, Englander H, Salanitro A, et al. Risk prediction models for hospital readmission: a systematic review. JAMA. 2011;306(15):1688-1698. PubMed
24. Lawrence LM, Jenkins CA, Zhou C, Givens TG. The effect of diagnosis-specific computerized discharge instructions on 72-hour return visits to the pediatric emergency department. Pediatr Emerg Care. 2009;25(11):733-738. PubMed
25. National Quality Forum. National Quality Forum issue brief: strengthening pediatric quality measurement and reporting. J Healthc Qual. 2008;30(3):51-55. PubMed
26. Rising KL, Victor TW, Hollander JE, Carr BG. Patient returns to the emergency department: the time-to-return curve. Acad Emerg Med. 2014;21(8):864-871. PubMed
27. Cho CS, Shapiro DJ, Cabana MD, Maselli JH, Hersh AL. A national depiction of children with return visits to the emergency department within 72 hours, 2001–2007. Pediatr Emerg Care. 2012;28(7):606-610. PubMed
28. Adekoya N. Patients seen in emergency departments who had a prior visit within the previous 72 h—National Hospital Ambulatory Medical Care Survey, 2002. Public Health. 2005;119(10):914-918. PubMed
29. Mittal MK, Zorc JJ, Garcia-Espana JF, Shaw KN. An assessment of clinical performance measures for pediatric emergency physicians. Am J Med Qual. 2013;28(1):33-39. PubMed
30. Depiero AD, Ochsenschlager DW, Chamberlain JM. Analysis of pediatric hospitalizations after emergency department release as a quality improvement tool. Ann Emerg Med. 2002;39(2):159-163. PubMed
31. Sung SF, Liu KE, Chen SC, Lo CL, Lin KC, Hu YH. Predicting factors and risk stratification for return visits to the emergency department within 72 hours in pediatric patients. Pediatr Emerg Care. 2015;31(12):819-824. PubMed
32. Jacobstein CR, Alessandrini EA, Lavelle JM, Shaw KN. Unscheduled revisits to a pediatric emergency department: risk factors for children with fever or infection-related complaints. Pediatr Emerg Care. 2005;21(12):816-821. PubMed
33. Barnett ML, Hsu J, McWilliams J. Patient characteristics and differences in hospital readmission rates. JAMA Intern Med. 2015;175(11):1803-1812. PubMed
34. Gaucher N, Bailey B, Gravel J. Impact of physicians’ characteristics on the admission risk among children visiting a pediatric emergency department. Pediatr Emerg Care. 2012;28(2):120-124. PubMed
35. McHugh M, Neimeyer J, Powell E, Khare RK, Adams JG. An early look at performance on the emergency care measures included in Medicare’s hospital inpatient Value-Based Purchasing Program. Ann Emerg Med. 2013;61(6):616-623.e2. PubMed
36. Axon RN, Williams MV. Hospital readmission as an accountability measure. JAMA. 2011;305(5):504-505. PubMed
37. Adams JG. Ensuring the quality of quality metrics for emergency care. JAMA. 2016;315(7):659-660. PubMed
38. Payne NR, Flood A. Preventing pediatric readmissions: which ones and how? J Pediatr. 2015;166(3):519-520. PubMed
39. Khan A, Nakamura MM, Zaslavsky AM, et al. Same-hospital readmission rates as a measure of pediatric quality of care. JAMA Pediatr. 2015;169(10):905-912. PubMed

Issue
Journal of Hospital Medicine 12(7)
Issue
Journal of Hospital Medicine 12(7)
Page Number
536-543
Page Number
536-543
Topics
Article Type
Display Headline
Hospital-level factors associated with pediatric emergency department return visits
Display Headline
Hospital-level factors associated with pediatric emergency department return visits
Sections
Article Source

© 2017 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
Ann & Robert H. Lurie Children’s Hospital of Chicago, Division of Emergency Medicine Box #62, 225 E. Chicago Ave, Chicago IL 60611-2991, Telephone: 312-227-6080; Fax: 312-227-9475; E-mail: [email protected]
Content Gating
Gated (full article locked unless allowed per User)
Alternative CME
Disqus Comments
Default
Gating Strategy
First Peek Free
Article PDF Media
Media Files