Affiliations
Department of Emergency Medicine, Vanderbilt University School of Medicine
Given name(s)
Amanda
Family name
Wilson
Degrees
MD

Focusing on Inattention: The Diagnostic Accuracy of Brief Measures of Inattention for Detecting Delirium

Article Type
Changed
Fri, 10/04/2019 - 15:53

Delirium is an acute neurocognitive disorder1 that affects up to 25% of older emergency department (ED) and hospitalized patients.2-4 The relationship between delirium and adverse outcomes is well documented.5-7 Delirium is a strong predictor of increased length of mechanical ventilation, longer intensive care unit and hospital stays, increased risk of falls, long-term cognitive impairment, and mortality.8-13 Delirium is frequently missed by healthcare professionals2,14-16 and goes undetected in up to 3 out of 4 patients by bedside nurses and medical practitioners in many hospital settings.14,17-22 A significant barrier to recognizing delirium is the absence of brief delirium assessments.

In an effort to improve delirium recognition in the acute care setting, there has been a concerted effort to develop and validate brief delirium assessments. To address this unmet need, 4 ‘A’s Test (4AT), the Brief Confusion Assessment Method (bCAM), and the 3-minute diagnostic assessment for CAM-defined delirium (3D-CAM) are 1- to 3-minute delirium assessments that were validated in acutely ill older patients.23 However, 1 to 3 minutes may still be too long in busy clinical environments, and briefer (<30 seconds) delirium assessments may be needed.

One potential more-rapid method to screen for delirium is to specifically test for the presence of inattention, which is a cardinal feature of delirium.24,25 Inattention can be ascertained by having the patient recite the months backwards, recite the days of the week backwards, or spell a word backwards.26 Recent studies have evaluated the diagnostic accuracy of reciting the months of the year backwards for delirium. O’Regan et al.27 evaluated the diagnostic accuracy of the month of the year backwards from December to July (MOTYB-6) and observed that this task was 84% sensitive and 90% specific for delirium in older patients. However, they performed the reference standard delirium assessments in patients who had a positive MOTYB-6, which can overestimate sensitivity and underestimate specificity (verification bias).28 Fick et al.29 examined the diagnostic accuracy of 20 individual elements of the 3D-CAM and observed that reciting the months of the year backwards from December to January (MOTYB-12) was 83% sensitive and 69% specific for delirium. However, this was an exploratory study that was designed to identify an element of the 3D-CAM that had the best diagnostic accuracy.

To address these limitations, we sought to evaluate the diagnostic performance of the MOTYB-6 and MOTYB-12 for delirium as diagnosed by a reference standard. We also explored other brief tests of inattention such as spelling a word (“LUNCH”) backwards, reciting the days of the week backwards, 10-letter vigilance “A” task, and 5 picture recognition task.

METHODS

Study Design and Setting

This was a preplanned secondary analysis of a prospective observational study that validated 3 delirium assessments.30,31 This study was conducted at a tertiary care, academic ED. The local institutional review board (IRB) reviewed and approved this study. Informed consent from the patient or an authorized surrogate was obtained whenever possible. Because this was an observational study and posed minimal risk to the patient, the IRB granted a waiver of consent for patients who were both unable to provide consent and were without an authorized surrogate available in the ED or by phone.

Selection of Participants

We enrolled a convenience sample of patients between June 2010 and February 2012 Monday through Friday from 8 am to 4 pm. This enrollment window was based upon the psychiatrist’s availability. Because of the extensiveness of the psychiatric evaluations, we limited enrollment to 1 patient per day. Patients who were 65 years or older, not in a hallway bed, and in the ED for less than 12 hours at the time of enrollment were included. We used a 12-hour cutoff so that patients who presented in the evening and early morning hours could be included. Patients were excluded if they were previously enrolled, non-English speaking, deaf or blind, comatose, suffered from end-stage dementia, or were unable to complete all the study assessments. The rationale for excluding patients with end-stage dementia was that diagnosing delirium in this patient population is challenging.

 

 

Research assistants approached patients who met inclusion criteria and determined if any exclusion criteria were present. If none of the exclusion criteria were present, then the research assistant reviewed the informed consent document with the patient or authorized surrogate if the patient was not capable of providing consent. If a patient was not capable of providing consent and no authorized surrogate was available, then the patient was enrolled (under the waiver of consent) as long as the patient assented to be a part of the study. Once the patient was enrolled, the research assistant contacted the physician rater and reference standard psychiatrists to approach the patient.

Measures of Inattention

An emergency physician (JHH) who had no formal training in the mental status assessment of elders administered a cognitive battery to the patient, including tests of inattention. The following inattention tasks were administered:

  • Spell the word “LUNCH” backwards.30 Patients were initially allowed to spell the word “LUNCH” forwards. Patients who were unable to perform the task were assigned 5 errors.
  • Recite the months of the year backwards from December to July.23,26,27,30,32 Patients who were unable to perform the task were assigned 6 errors.
  • Recite the days of the week backwards.23,26,33 Patients who were unable to perform the task were assigned 7 errors.
  • Ten-letter vigilance “A” task.34 The patient was given a series of 10 letters (“S-A-V-E-A-H-A-A-R-T”) every 3 seconds and was asked to squeeze the rater’s hand every time the patient heard the letter “A.” Patients who were unable to perform the task were assigned 10 errors.
  • Five picture recognition task.34 Patients were shown 5 objects on picture cards. Afterwards, patients were shown 10 pictures with the previously shown objects intermingled. The patient had to identify which objects were seen previously in the first 5 pictures. Patients who were unable to perform the task were assigned 10 errors.
  • Recite the months of the year backwards from December to January.29 Patients who were unable to perform the task were assigned 12 errors.

Reference Standard for Delirium

A comprehensive consultation-liaison psychiatrist assessment was the reference standard for delirium; the diagnosis of delirium was based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) criteria.35 Three psychiatrists who each had an average of 11 years of clinical experience and regularly diagnosed delirium as part of their daily clinical practice were available to perform these assessments. To arrive at the diagnosis of delirium, they interviewed those who best understood the patient’s mental status (eg, the patient’s family members or caregivers, physician, and nurses). They also reviewed the patient’s medical record and radiology and laboratory test results. They performed bedside cognitive testing that included, but was not limited to, the Mini-Mental State Examination, Clock Drawing Test, Luria hand sequencing task, and tests for verbal fluency. A focused neurological examination was also performed (ie, screening for paraphasic errors, tremors, tone, asterixis, frontal release signs, etc.), and they also evaluated the patient for affective lability, hallucinations, and level of alertness. If the presence of delirium was still questionable, then confrontational naming, proverb interpretation or similarities, and assessments for apraxias were performed at the discretion of the psychiatrist. The psychiatrists were blinded to the physician’s assessments, and the assessments were conducted within 3 hours of each other.

Additional Variables Collected

Using medical record review, comorbidity burden, severity of illness, and premorbid cognition were ascertained. The Charlson Comorbidity Index, a weighted index that takes into account the number and seriousness of 19 preexisting comorbid conditions, was used to quantify comorbidity burden; higher scores indicate higher comorbid burden.36,37 The Acute Physiology Score of the Acute Physiology and Chronic Health Evaluation II was used to quantify severity of illness.38 This score is based upon the initial values of 12 routine physiologic measurements such as vital sign and laboratory abnormalities; higher scores represent higher severities of illness.38 The medical record was reviewed to ascertain the presence of premorbid cognitive impairment; any documentation of dementia in the patient’s clinical problem list or physician history and physical examination from the outpatient or inpatient settings was considered positive. The medical record review was performed by a research assistant and was double-checked for accuracy by one of the investigators (JHH).

Data Analyses

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. Receiver operating characteristic curves were constructed for each inattention task. Area under the receiver operating characteristic curves (AUC) was reported to provide a global measure of diagnostic accuracy. Sensitivities, specificities, positive likelihood ratios (PLRs), and negative likelihood ratios (NLRs) with their 95% CIs were calculated using the psychiatrist’s assessment as the reference standard.39 Cut-points with PLRs greater than 10 (strongly increased the likelihood of delirium) or NLRs less than 0.1 (strongly decreased the likelihood of delirium) were preferentially reported whenever possible.

 

 

All statistical analyses were performed with open source R statistical software version 3.0.1 (http://www.r-project.org/), SAS 9.4 (SAS Institute, Cary, NC), and Microsoft Excel 2010 (Microsoft Inc., Redmond, WA).

RESULTS

A total of 542 patients were screened; 214 patients refused to participate, and 93 were excluded, leaving 235 patients. The patient characteristics can be seen in Table 1. Compared with all patients (N = 15,359) who presented to the ED during the study period, enrolled patients were similar in age but more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital. Of those enrolled, 25 (10.6%) were delirious. Delirious patients were older, more likely to be nonwhite, have a past history of dementia, have a graduate school degree, and have a chief complaint of altered mental status.

Making any error on the MOTYB-6 task had a sensitivity of 80.0% (95% CI, 60.9%-91.1%), specificity of 57.1% (95% CI, 50.4%-63.7%), PLR of 1.87 (95% CI, 1.45-2.40) and NLR of 0.35 (95% CI, 0.16-0.77) for delirium as diagnosed by a psychiatrist. Making any error on the MOTYB-12 task had a sensitivity of 84.0% (95% CI, 65.4%-93.6%), specificity of 51.9% (95% CI, 45.2%-58.5%), PLR of 1.75 (95% CI, 1.40-2.18), and NLR of 0.31 (95% CI, 0.12-0.76) for delirium. The AUCs for the MOTYB-6 and MOTYB-12 tasks were 0.77 and 0.78, respectively, indicating very good diagnostic performance.

The diagnostic performances of the other inattention tasks and additional cutoff values for the MOTYB-6 and MOTYB-12 tasks can be seen in Table 2. Increasing the MOTYB-6 cut-off to 2 or more errors and MOTYB-12 cut-off to 3 or more errors increased the specificity to 70.0% and 70.5%, respectively, without decreasing their sensitivity. The best combination of sensitivity and specificity was reciting the days of the week backwards task; if the patient made any error, this was 84.0% (95% CI, 65.4%-93.6%) sensitive and 81.9% (95% CI, 76.1%-86.5%) specific for delirium. The inattention tasks that strongly increased the likelihood of delirium (PLR > 10) were the vigilance “A” and picture recognition tasks. If the patient made 2 or more errors on the vigilance task or 3 or more errors on the picture recognition task, then the likelihood of delirium strongly increased, as evidenced by a PLR of 16.80 (95% CI, 8.01-35.23) and 23.10 (95% CI, 7.95-67.12), respectively. No other inattention tasks were able to achieve a PLR of greater than 10, regardless of what cutoff was used. No inattention task was able to achieve a NLR of less than 0.10, which would have strongly decreased the likelihood of delirium. The best NLRs were if the patient made no errors spelling the word “LUNCH” backwards (NLR, 0.16; 95% CI, 0.04-0.60), no errors on the vigilance “A” task (NLR, 0.18; 95% CI, 0.07-0.43), and no errors on the days of the week backwards task (NLR, 0.20; 95% CI, 0.08-0.48).

DISCUSSION

Delirium is frequently missed by healthcare providers because it is not routinely screened for in the acute care setting. To help address this deficiency of care, we evaluated several brief measures of inattention that take less than 30 seconds to complete. We observed that any errors made on the MOTYB-6 and MOTYB-12 tasks had very good sensitivities (80% and 84%) but were limited by their modest specificities (approximately 50%) for delirium. As a result, these assessments have limited clinical utility as standalone delirium screens. We also explored other commonly used brief measures of inattention and at a variety of error cutoffs. Reciting the days of the week backwards appeared to best balance sensitivity and specificity. None of the inattention measures could convincingly rule out delirium (NLR < 0.10), but the vigilance “A” and picture recognition tasks may have clinical utility in ruling in delirium (PLR > 10). Overall, all the inattention tasks, including MOTYB-6 and MOTYB-12, had very good diagnostic performances based upon their AUC. However, achieving a high sensitivity often had to be sacrificed for specificity or, alternatively, achieving a high specificity had to be sacrificed for sensitivity.

Inattention has been shown to be the cardinal feature for delirium,40 and its assessment using cognitive testing has been recommended to help identify the presence of delirium according to an expert consensus panel.26 The diagnostic performance of the MOTYB-12 observed in our study is similar to a study by Fick et al., who reported that MOTYB-12 had very good sensitivity (83%) but had modest specificity (69%) with a cutoff of 1 or more errors. Hendry et al. observed that the MOTYB-12 was 91% sensitive and 50% specific using a cutoff of 4 or more errors. With regard to the MOTYB-6, our reported specificity was different from what was observed by O’Regan et al.27 Using 1 or more errors as a cutoff, they observed a much higher specificity for delirium than we did (90% vs 57%). Discordant observations regarding the diagnostic accuracy for other inattention tasks also exist. We observed that making any error on the days of the week backwards task was 84% sensitive and 82% specific for delirium, whereas Fick et al. observed a sensitivity and specificity of 50% and 94%, respectively. For the vigilance “A” task, we observed that making 2 or more errors over a series of 10 letters was 64.0% sensitive and 91.4% specific for delirium, whereas Pompei et al.41 observed that making 2 or more errors over a series of 60 letters was 51% sensitive and 77% specific for delirium.

The abovementioned discordant findings may be driven by spectrum bias, wherein the sensitivities and specificities for each inattention task may differ in different subgroups. As a result, differences in the age distribution, proportion of college graduates, history of dementia, and susceptibility to delirium can influence overall sensitivity and specificity. Objective measures of delirium, including the inattention screens studied, are particularly prone to spectrum bias.31,34 However, the strength of this approach is that the assessment of inattention becomes less reliant upon clinical judgment and allows it to be used by raters from a wide range of clinical backgrounds. On the other hand, a subjective interpretation of these inattention tasks may allow the rater to capture the subtleties of inattention (ie, decreased speed of performance in a highly intelligent and well-educated patient without dementia). The disadvantage of this approach, however, is that it is more dependent on clinical judgment and may have decreased diagnostic accuracy in those with less clinical experience or with limited training.14,42,43 These factors must be carefully considered when determining which delirium assessment to use.

Additional research is required to determine the clinical utility of these brief inattention assessments. These findings need to be further validated in larger studies, and the optimal cutoff of each task for different subgroup of patients (eg, demented vs nondemented) needs to be further clarified. It is not completely clear whether these inattention tests can serve as standalone assessments. Depending on the cutoff used, some of these assessments may have unacceptable false negative or false positive rates that may lead to increased adverse patient outcomes or increased resource utilization, respectively. Additional components or assessments may be needed to improve the diagnostic accuracy of these assessments. In addition to understanding these inattention assessments’ diagnostic accuracies, their ability to predict adverse outcomes also needs to be investigated. While a previous study observed that making any error on the MOTYB-12 task was associated with increased physical restraint use and prolonged hospital length of stay,44 these assessments’ ability to prognosticate long-term outcomes such as mortality or long-term cognition or function need to be studied. Lastly, studies should also evaluate how easily implementable these assessments are and whether improved delirium recognition leads to improved patient outcomes.

This study has several notable limitations. Though planned a priori, this was a secondary analysis of a larger investigation designed to validate 3 delirium assessments. Our sample size was also relatively small, causing our 95% CIs to overlap in most cases and limiting the statistical power to truly determine whether one measure is better than the other. We also asked the patient to recite the months backwards from December to July as well as recite the months backwards from December to January. It is possible that the patient may have performed better at going from December to January because of learning effect. Our reference standard for delirium was based upon DSM-IV-TR criteria. The new DSM-V criteria may be more restrictive and may slightly change the sensitivities and specificities of the inattention tasks. We enrolled a convenience sample and enrolled patients who were more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital; as a result, selection bias may have been introduced. Lastly, this study was conducted in a single center and enrolled patients who were 65 years and older. Our findings may not be generalizable to other settings and in those who are less than 65 years of age.

 

 

CONCLUSIONS

The MOTYB-6 and MOTYB-12 tasks had very good sensitivities but modest specificities (approximately 50%) using any error made as a cutoff; increasing cutoff to 2 errors and 3 errors, respectively, improved their specificities (approximately 70%) with minimal impact to their sensitivities. Reciting the days of the week backwards, spelling the word “LUNCH” backwards, and the 10-letter vigilance “A” task appeared to perform the best in ruling out delirium but only moderately decreased the likelihood of delirium. The 10-letter Vigilance “A” and picture recognition task appeared to perform the best in ruling in delirium. Days of the week backwards appeared to have the best combination of sensitivity and specificity.

Disclosure

This study was funded by the Emergency Medicine Foundation Career Development Award, National Institutes of Health K23AG032355, and National Center for Research Resources, Grant UL1 RR024975-01. The authors report no financial conflicts of interest.

References

1. American Psychiatric Association. Diagnostic and statistical manual of mental disorders: DSM-5. Washington, DC: American Psychiatric Association; 2013.
2. Hustey FM, Meldon SW, Smith MD, Lex CK. The effect of mental status screening on the care of elderly emergency department patients. Ann Emerg Med. 2003;41(5):678-684.
3. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three-site epidemiologic study. J Gen Intern Med. 1998;13(4):234-242.
4. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19(2-3):158-163.
5. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56(3):244-252.
6. Gross AL, Jones RN, Habtemariam DA, et al. Delirium and long-term cognitive trajectory among persons with dementia. Arch Intern Med. 2012;172(17):1324-1331.
7. Davis DH, Muniz Terrera G, Keage H, et al. Delirium is a strong risk factor for dementia in the oldest-old: a population-based cohort study. Brain. 2012;135(Pt 9):2809-2816.
8. Ely EW, Baker AM, Dunagan DP, et al. Effect on the duration of mechanical ventilation of identifying patients capable of breathing spontaneously. N Engl J Med. 1996;335(25):1864-1869.
9. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291(14):1753-1762.
10. Lin SM, Liu CY, Wang CH, et al. The impact of delirium on the survival of mechanically ventilated patients. Crit Care Med. 2004;32(11):2254-2259.
11. Salluh JI, Wang H, Schneider EB, et al. Outcome of delirium in critically ill patients: systematic review and meta-analysis. BMJ. 2015;350:h2538.
12. Klein Klouwenberg PM, Zaal IJ, Spitoni C, et al. The attributable mortality of delirium in critically ill patients: prospective cohort study. BMJ. 2014;349:g6652.
13. Han JH, Vasilevskis EE, Chandrasekhar R, et al. Delirium in the Emergency Department and Its Extension into Hospitalization (DELINEATE) Study: Effect on 6-month Function and Cognition. J Am Geriatr Soc. 2017;65(6):1333-1338.
14. Inouye SK, Foreman MD, Mion LC, Katz KH, Cooney LM Jr. Nurses’ recognition of delirium and its symptoms: comparison of nurse and researcher ratings. Arch Intern Med. 2001;161(20):2467-2473.
15. Han JH, Zimmerman EE, Cutler N, et al. Delirium in older emergency department patients: recognition, risk factors, and psychomotor subtypes. Acad Emerg Med. 2009;16(3):193-200.
16. Elie M, Cole MG, Primeau FJ, Bellavance F. Delirium risk factors in elderly hospitalized patients. J Gen Intern Med. 1998;13(3):204-212.
17. Spronk PE, Riekerk B, Hofhuis J, Rommes JH. Occurrence of delirium is severely underestimated in the ICU during daily care. Intensive Care Med. 2009;35(7):1276-1280.
18. van Eijk MM, van Marum RJ, Klijn IA, de Wit N, Kesecioglu J, Slooter AJ. Comparison of delirium assessment tools in a mixed intensive care unit. Crit Care Med. 2009;37(6):1881-1885.
19. Devlin JW, Fong JJ, Schumaker G, O’Connor H, Ruthazer R, Garpestad E. Use of a validated delirium assessment tool improves the ability of physicians to identify delirium in medical intensive care unit patients. Crit Care Med. 2007;35(12):2721-2724.
20. Han JH, Eden S, Shintani A, et al. Delirium in Older Emergency Department Patients Is an Independent Predictor of Hospital Length of Stay. Acad Emerg Med. 2011;18(5):451-457.
21. Pun BT, Gordon SM, Peterson JF, et al. Large-scale implementation of sedation and delirium monitoring in the intensive care unit: a report from two medical centers. Crit Care Med. 2005;33(6):1199-1205.
22. Grossmann FF, Hasemann W, Graber A, Bingisser R, Kressig RW, Nickel CH. Screening, detection and management of delirium in the emergency department - a pilot study on the feasibility of a new algorithm for use in older emergency department patients: the modified Confusion Assessment Method for the Emergency Department (mCAM-ED). Scand J Trauma Resusc Emerg Med. 2014;22:19.
23. Marcantonio ER, Ngo LH, O’Connor M, et al. 3D-CAM: Derivation and Validation of a 3-Minute Diagnostic Interview for CAM-Defined Delirium: A Cross-sectional Diagnostic Test Study. Ann Intern Med. 2014;161(8):554-561.
24. Blazer DG, van Nieuwenhuizen AO. Evidence for the diagnostic criteria of delirium: an update. Curr Opin Psychiatry. 2012;25(3):239-243.
25. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65(3):207-214.
26. Huang LW, Inouye SK, Jones RN, et al. Identifying indicators of important diagnostic features of delirium. J Am Geriatr Soc. 2012;60(6):1044-1050.
27. O’Regan NA, Ryan DJ, Boland E, et al. Attention! A good bedside test for delirium? J Neurol Neurosurg Psychiatry. 2014;85(10):1122-1131.
28. Richardson ML, Petscavage JM. Verification bias: an under-recognized source of error in assessing the efficacy of MRI of the meniscii. Acad Radiol. 2011;18(11):1376-1381.
29. Fick DM, Inouye SK, Guess J, et al. Preliminary development of an ultrabrief two-item bedside test for delirium. J Hosp Med. 2015;10(10):645-650.
30. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the delirium triage screen and the brief confusion assessment method. Ann Emerg Med. 2013;62(5):457-465.
31. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method for the Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21(2):180-187.
32. Inouye SK. Delirium in hospitalized older patients. Clin Geriatr Med. 1998;14(4):745-764.

33. Hamrick I, Hafiz R, Cummings DM. Use of days of the week in a modified mini-mental state exam (M-MMSE) for detecting geriatric cognitive impairment. J Am Board Fam Med. 2013;26(4):429-435.

34. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM-ICU). JAMA. 2001;286(21):2703-2710.
35. American Psychiatric Association. Task Force on DSM-IV. Diagnostic and statistical manual of mental disorders: DSM-IV-TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
36. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one-year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13(5):530-536.
37. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383.
38. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13(10):818-829.
39. Simel DL, Samsa GP, Matchar DB. Likelihood ratios with confidence: sample size estimation for diagnostic test studies. J Clin Epidemiol. 1991;44(8):763-770.
40. Blazer DG, Wu LT. The epidemiology of substance use and disorders among middle aged and elderly community adults: national survey on drug use and health. Am J Geriatr Psychiatry. 2009;17(3):237-245.
41. Pompei P, Foreman M, Cassel CK, Alessi C, Cox D. Detecting delirium among hospitalized older patients. Arch Intern Med. 1995;155(3):301-307.
42. Lemiengre J, Nelis T, Joosten E, et al. Detection of delirium by bedside nurses using the confusion assessment method. J Am Geriatr Soc. 2006;54(4):685-689.
43. Ryan K, Leonard M, Guerin S, Donnelly S, Conroy M, Meagher D. Validation of the confusion assessment method in the palliative care setting. Palliat Med. 2009;23(1):40-45.
44. Yevchak AM, Doherty K, Archambault EG, Kelly B, Fonda JR, Rudolph JL. The association between an ultrabrief cognitive screening in older adults and hospital outcomes. J Hosp Med. 2015;10(10):651-657.

Article PDF
Issue
Journal of Hospital Medicine 13(8)
Publications
Topics
Page Number
551-557. Published online first March 26, 2018
Sections
Article PDF
Article PDF
Related Articles

Delirium is an acute neurocognitive disorder1 that affects up to 25% of older emergency department (ED) and hospitalized patients.2-4 The relationship between delirium and adverse outcomes is well documented.5-7 Delirium is a strong predictor of increased length of mechanical ventilation, longer intensive care unit and hospital stays, increased risk of falls, long-term cognitive impairment, and mortality.8-13 Delirium is frequently missed by healthcare professionals2,14-16 and goes undetected in up to 3 out of 4 patients by bedside nurses and medical practitioners in many hospital settings.14,17-22 A significant barrier to recognizing delirium is the absence of brief delirium assessments.

In an effort to improve delirium recognition in the acute care setting, there has been a concerted effort to develop and validate brief delirium assessments. To address this unmet need, 4 ‘A’s Test (4AT), the Brief Confusion Assessment Method (bCAM), and the 3-minute diagnostic assessment for CAM-defined delirium (3D-CAM) are 1- to 3-minute delirium assessments that were validated in acutely ill older patients.23 However, 1 to 3 minutes may still be too long in busy clinical environments, and briefer (<30 seconds) delirium assessments may be needed.

One potential more-rapid method to screen for delirium is to specifically test for the presence of inattention, which is a cardinal feature of delirium.24,25 Inattention can be ascertained by having the patient recite the months backwards, recite the days of the week backwards, or spell a word backwards.26 Recent studies have evaluated the diagnostic accuracy of reciting the months of the year backwards for delirium. O’Regan et al.27 evaluated the diagnostic accuracy of the month of the year backwards from December to July (MOTYB-6) and observed that this task was 84% sensitive and 90% specific for delirium in older patients. However, they performed the reference standard delirium assessments in patients who had a positive MOTYB-6, which can overestimate sensitivity and underestimate specificity (verification bias).28 Fick et al.29 examined the diagnostic accuracy of 20 individual elements of the 3D-CAM and observed that reciting the months of the year backwards from December to January (MOTYB-12) was 83% sensitive and 69% specific for delirium. However, this was an exploratory study that was designed to identify an element of the 3D-CAM that had the best diagnostic accuracy.

To address these limitations, we sought to evaluate the diagnostic performance of the MOTYB-6 and MOTYB-12 for delirium as diagnosed by a reference standard. We also explored other brief tests of inattention such as spelling a word (“LUNCH”) backwards, reciting the days of the week backwards, 10-letter vigilance “A” task, and 5 picture recognition task.

METHODS

Study Design and Setting

This was a preplanned secondary analysis of a prospective observational study that validated 3 delirium assessments.30,31 This study was conducted at a tertiary care, academic ED. The local institutional review board (IRB) reviewed and approved this study. Informed consent from the patient or an authorized surrogate was obtained whenever possible. Because this was an observational study and posed minimal risk to the patient, the IRB granted a waiver of consent for patients who were both unable to provide consent and were without an authorized surrogate available in the ED or by phone.

Selection of Participants

We enrolled a convenience sample of patients between June 2010 and February 2012 Monday through Friday from 8 am to 4 pm. This enrollment window was based upon the psychiatrist’s availability. Because of the extensiveness of the psychiatric evaluations, we limited enrollment to 1 patient per day. Patients who were 65 years or older, not in a hallway bed, and in the ED for less than 12 hours at the time of enrollment were included. We used a 12-hour cutoff so that patients who presented in the evening and early morning hours could be included. Patients were excluded if they were previously enrolled, non-English speaking, deaf or blind, comatose, suffered from end-stage dementia, or were unable to complete all the study assessments. The rationale for excluding patients with end-stage dementia was that diagnosing delirium in this patient population is challenging.

 

 

Research assistants approached patients who met inclusion criteria and determined if any exclusion criteria were present. If none of the exclusion criteria were present, then the research assistant reviewed the informed consent document with the patient or authorized surrogate if the patient was not capable of providing consent. If a patient was not capable of providing consent and no authorized surrogate was available, then the patient was enrolled (under the waiver of consent) as long as the patient assented to be a part of the study. Once the patient was enrolled, the research assistant contacted the physician rater and reference standard psychiatrists to approach the patient.

Measures of Inattention

An emergency physician (JHH) who had no formal training in the mental status assessment of elders administered a cognitive battery to the patient, including tests of inattention. The following inattention tasks were administered:

  • Spell the word “LUNCH” backwards.30 Patients were initially allowed to spell the word “LUNCH” forwards. Patients who were unable to perform the task were assigned 5 errors.
  • Recite the months of the year backwards from December to July.23,26,27,30,32 Patients who were unable to perform the task were assigned 6 errors.
  • Recite the days of the week backwards.23,26,33 Patients who were unable to perform the task were assigned 7 errors.
  • Ten-letter vigilance “A” task.34 The patient was given a series of 10 letters (“S-A-V-E-A-H-A-A-R-T”) every 3 seconds and was asked to squeeze the rater’s hand every time the patient heard the letter “A.” Patients who were unable to perform the task were assigned 10 errors.
  • Five picture recognition task.34 Patients were shown 5 objects on picture cards. Afterwards, patients were shown 10 pictures with the previously shown objects intermingled. The patient had to identify which objects were seen previously in the first 5 pictures. Patients who were unable to perform the task were assigned 10 errors.
  • Recite the months of the year backwards from December to January.29 Patients who were unable to perform the task were assigned 12 errors.

Reference Standard for Delirium

A comprehensive consultation-liaison psychiatrist assessment was the reference standard for delirium; the diagnosis of delirium was based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) criteria.35 Three psychiatrists who each had an average of 11 years of clinical experience and regularly diagnosed delirium as part of their daily clinical practice were available to perform these assessments. To arrive at the diagnosis of delirium, they interviewed those who best understood the patient’s mental status (eg, the patient’s family members or caregivers, physician, and nurses). They also reviewed the patient’s medical record and radiology and laboratory test results. They performed bedside cognitive testing that included, but was not limited to, the Mini-Mental State Examination, Clock Drawing Test, Luria hand sequencing task, and tests for verbal fluency. A focused neurological examination was also performed (ie, screening for paraphasic errors, tremors, tone, asterixis, frontal release signs, etc.), and they also evaluated the patient for affective lability, hallucinations, and level of alertness. If the presence of delirium was still questionable, then confrontational naming, proverb interpretation or similarities, and assessments for apraxias were performed at the discretion of the psychiatrist. The psychiatrists were blinded to the physician’s assessments, and the assessments were conducted within 3 hours of each other.

Additional Variables Collected

Using medical record review, comorbidity burden, severity of illness, and premorbid cognition were ascertained. The Charlson Comorbidity Index, a weighted index that takes into account the number and seriousness of 19 preexisting comorbid conditions, was used to quantify comorbidity burden; higher scores indicate higher comorbid burden.36,37 The Acute Physiology Score of the Acute Physiology and Chronic Health Evaluation II was used to quantify severity of illness.38 This score is based upon the initial values of 12 routine physiologic measurements such as vital sign and laboratory abnormalities; higher scores represent higher severities of illness.38 The medical record was reviewed to ascertain the presence of premorbid cognitive impairment; any documentation of dementia in the patient’s clinical problem list or physician history and physical examination from the outpatient or inpatient settings was considered positive. The medical record review was performed by a research assistant and was double-checked for accuracy by one of the investigators (JHH).

Data Analyses

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. Receiver operating characteristic curves were constructed for each inattention task. Area under the receiver operating characteristic curves (AUC) was reported to provide a global measure of diagnostic accuracy. Sensitivities, specificities, positive likelihood ratios (PLRs), and negative likelihood ratios (NLRs) with their 95% CIs were calculated using the psychiatrist’s assessment as the reference standard.39 Cut-points with PLRs greater than 10 (strongly increased the likelihood of delirium) or NLRs less than 0.1 (strongly decreased the likelihood of delirium) were preferentially reported whenever possible.

 

 

All statistical analyses were performed with open source R statistical software version 3.0.1 (http://www.r-project.org/), SAS 9.4 (SAS Institute, Cary, NC), and Microsoft Excel 2010 (Microsoft Inc., Redmond, WA).

RESULTS

A total of 542 patients were screened; 214 patients refused to participate, and 93 were excluded, leaving 235 patients. The patient characteristics can be seen in Table 1. Compared with all patients (N = 15,359) who presented to the ED during the study period, enrolled patients were similar in age but more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital. Of those enrolled, 25 (10.6%) were delirious. Delirious patients were older, more likely to be nonwhite, have a past history of dementia, have a graduate school degree, and have a chief complaint of altered mental status.

Making any error on the MOTYB-6 task had a sensitivity of 80.0% (95% CI, 60.9%-91.1%), specificity of 57.1% (95% CI, 50.4%-63.7%), PLR of 1.87 (95% CI, 1.45-2.40) and NLR of 0.35 (95% CI, 0.16-0.77) for delirium as diagnosed by a psychiatrist. Making any error on the MOTYB-12 task had a sensitivity of 84.0% (95% CI, 65.4%-93.6%), specificity of 51.9% (95% CI, 45.2%-58.5%), PLR of 1.75 (95% CI, 1.40-2.18), and NLR of 0.31 (95% CI, 0.12-0.76) for delirium. The AUCs for the MOTYB-6 and MOTYB-12 tasks were 0.77 and 0.78, respectively, indicating very good diagnostic performance.

The diagnostic performances of the other inattention tasks and additional cutoff values for the MOTYB-6 and MOTYB-12 tasks can be seen in Table 2. Increasing the MOTYB-6 cut-off to 2 or more errors and MOTYB-12 cut-off to 3 or more errors increased the specificity to 70.0% and 70.5%, respectively, without decreasing their sensitivity. The best combination of sensitivity and specificity was reciting the days of the week backwards task; if the patient made any error, this was 84.0% (95% CI, 65.4%-93.6%) sensitive and 81.9% (95% CI, 76.1%-86.5%) specific for delirium. The inattention tasks that strongly increased the likelihood of delirium (PLR > 10) were the vigilance “A” and picture recognition tasks. If the patient made 2 or more errors on the vigilance task or 3 or more errors on the picture recognition task, then the likelihood of delirium strongly increased, as evidenced by a PLR of 16.80 (95% CI, 8.01-35.23) and 23.10 (95% CI, 7.95-67.12), respectively. No other inattention tasks were able to achieve a PLR of greater than 10, regardless of what cutoff was used. No inattention task was able to achieve a NLR of less than 0.10, which would have strongly decreased the likelihood of delirium. The best NLRs were if the patient made no errors spelling the word “LUNCH” backwards (NLR, 0.16; 95% CI, 0.04-0.60), no errors on the vigilance “A” task (NLR, 0.18; 95% CI, 0.07-0.43), and no errors on the days of the week backwards task (NLR, 0.20; 95% CI, 0.08-0.48).

DISCUSSION

Delirium is frequently missed by healthcare providers because it is not routinely screened for in the acute care setting. To help address this deficiency of care, we evaluated several brief measures of inattention that take less than 30 seconds to complete. We observed that any errors made on the MOTYB-6 and MOTYB-12 tasks had very good sensitivities (80% and 84%) but were limited by their modest specificities (approximately 50%) for delirium. As a result, these assessments have limited clinical utility as standalone delirium screens. We also explored other commonly used brief measures of inattention and at a variety of error cutoffs. Reciting the days of the week backwards appeared to best balance sensitivity and specificity. None of the inattention measures could convincingly rule out delirium (NLR < 0.10), but the vigilance “A” and picture recognition tasks may have clinical utility in ruling in delirium (PLR > 10). Overall, all the inattention tasks, including MOTYB-6 and MOTYB-12, had very good diagnostic performances based upon their AUC. However, achieving a high sensitivity often had to be sacrificed for specificity or, alternatively, achieving a high specificity had to be sacrificed for sensitivity.

Inattention has been shown to be the cardinal feature for delirium,40 and its assessment using cognitive testing has been recommended to help identify the presence of delirium according to an expert consensus panel.26 The diagnostic performance of the MOTYB-12 observed in our study is similar to a study by Fick et al., who reported that MOTYB-12 had very good sensitivity (83%) but had modest specificity (69%) with a cutoff of 1 or more errors. Hendry et al. observed that the MOTYB-12 was 91% sensitive and 50% specific using a cutoff of 4 or more errors. With regard to the MOTYB-6, our reported specificity was different from what was observed by O’Regan et al.27 Using 1 or more errors as a cutoff, they observed a much higher specificity for delirium than we did (90% vs 57%). Discordant observations regarding the diagnostic accuracy for other inattention tasks also exist. We observed that making any error on the days of the week backwards task was 84% sensitive and 82% specific for delirium, whereas Fick et al. observed a sensitivity and specificity of 50% and 94%, respectively. For the vigilance “A” task, we observed that making 2 or more errors over a series of 10 letters was 64.0% sensitive and 91.4% specific for delirium, whereas Pompei et al.41 observed that making 2 or more errors over a series of 60 letters was 51% sensitive and 77% specific for delirium.

The abovementioned discordant findings may be driven by spectrum bias, wherein the sensitivities and specificities for each inattention task may differ in different subgroups. As a result, differences in the age distribution, proportion of college graduates, history of dementia, and susceptibility to delirium can influence overall sensitivity and specificity. Objective measures of delirium, including the inattention screens studied, are particularly prone to spectrum bias.31,34 However, the strength of this approach is that the assessment of inattention becomes less reliant upon clinical judgment and allows it to be used by raters from a wide range of clinical backgrounds. On the other hand, a subjective interpretation of these inattention tasks may allow the rater to capture the subtleties of inattention (ie, decreased speed of performance in a highly intelligent and well-educated patient without dementia). The disadvantage of this approach, however, is that it is more dependent on clinical judgment and may have decreased diagnostic accuracy in those with less clinical experience or with limited training.14,42,43 These factors must be carefully considered when determining which delirium assessment to use.

Additional research is required to determine the clinical utility of these brief inattention assessments. These findings need to be further validated in larger studies, and the optimal cutoff of each task for different subgroup of patients (eg, demented vs nondemented) needs to be further clarified. It is not completely clear whether these inattention tests can serve as standalone assessments. Depending on the cutoff used, some of these assessments may have unacceptable false negative or false positive rates that may lead to increased adverse patient outcomes or increased resource utilization, respectively. Additional components or assessments may be needed to improve the diagnostic accuracy of these assessments. In addition to understanding these inattention assessments’ diagnostic accuracies, their ability to predict adverse outcomes also needs to be investigated. While a previous study observed that making any error on the MOTYB-12 task was associated with increased physical restraint use and prolonged hospital length of stay,44 these assessments’ ability to prognosticate long-term outcomes such as mortality or long-term cognition or function need to be studied. Lastly, studies should also evaluate how easily implementable these assessments are and whether improved delirium recognition leads to improved patient outcomes.

This study has several notable limitations. Though planned a priori, this was a secondary analysis of a larger investigation designed to validate 3 delirium assessments. Our sample size was also relatively small, causing our 95% CIs to overlap in most cases and limiting the statistical power to truly determine whether one measure is better than the other. We also asked the patient to recite the months backwards from December to July as well as recite the months backwards from December to January. It is possible that the patient may have performed better at going from December to January because of learning effect. Our reference standard for delirium was based upon DSM-IV-TR criteria. The new DSM-V criteria may be more restrictive and may slightly change the sensitivities and specificities of the inattention tasks. We enrolled a convenience sample and enrolled patients who were more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital; as a result, selection bias may have been introduced. Lastly, this study was conducted in a single center and enrolled patients who were 65 years and older. Our findings may not be generalizable to other settings and in those who are less than 65 years of age.

 

 

CONCLUSIONS

The MOTYB-6 and MOTYB-12 tasks had very good sensitivities but modest specificities (approximately 50%) using any error made as a cutoff; increasing cutoff to 2 errors and 3 errors, respectively, improved their specificities (approximately 70%) with minimal impact to their sensitivities. Reciting the days of the week backwards, spelling the word “LUNCH” backwards, and the 10-letter vigilance “A” task appeared to perform the best in ruling out delirium but only moderately decreased the likelihood of delirium. The 10-letter Vigilance “A” and picture recognition task appeared to perform the best in ruling in delirium. Days of the week backwards appeared to have the best combination of sensitivity and specificity.

Disclosure

This study was funded by the Emergency Medicine Foundation Career Development Award, National Institutes of Health K23AG032355, and National Center for Research Resources, Grant UL1 RR024975-01. The authors report no financial conflicts of interest.

Delirium is an acute neurocognitive disorder1 that affects up to 25% of older emergency department (ED) and hospitalized patients.2-4 The relationship between delirium and adverse outcomes is well documented.5-7 Delirium is a strong predictor of increased length of mechanical ventilation, longer intensive care unit and hospital stays, increased risk of falls, long-term cognitive impairment, and mortality.8-13 Delirium is frequently missed by healthcare professionals2,14-16 and goes undetected in up to 3 out of 4 patients by bedside nurses and medical practitioners in many hospital settings.14,17-22 A significant barrier to recognizing delirium is the absence of brief delirium assessments.

In an effort to improve delirium recognition in the acute care setting, there has been a concerted effort to develop and validate brief delirium assessments. To address this unmet need, 4 ‘A’s Test (4AT), the Brief Confusion Assessment Method (bCAM), and the 3-minute diagnostic assessment for CAM-defined delirium (3D-CAM) are 1- to 3-minute delirium assessments that were validated in acutely ill older patients.23 However, 1 to 3 minutes may still be too long in busy clinical environments, and briefer (<30 seconds) delirium assessments may be needed.

One potential more-rapid method to screen for delirium is to specifically test for the presence of inattention, which is a cardinal feature of delirium.24,25 Inattention can be ascertained by having the patient recite the months backwards, recite the days of the week backwards, or spell a word backwards.26 Recent studies have evaluated the diagnostic accuracy of reciting the months of the year backwards for delirium. O’Regan et al.27 evaluated the diagnostic accuracy of the month of the year backwards from December to July (MOTYB-6) and observed that this task was 84% sensitive and 90% specific for delirium in older patients. However, they performed the reference standard delirium assessments in patients who had a positive MOTYB-6, which can overestimate sensitivity and underestimate specificity (verification bias).28 Fick et al.29 examined the diagnostic accuracy of 20 individual elements of the 3D-CAM and observed that reciting the months of the year backwards from December to January (MOTYB-12) was 83% sensitive and 69% specific for delirium. However, this was an exploratory study that was designed to identify an element of the 3D-CAM that had the best diagnostic accuracy.

To address these limitations, we sought to evaluate the diagnostic performance of the MOTYB-6 and MOTYB-12 for delirium as diagnosed by a reference standard. We also explored other brief tests of inattention such as spelling a word (“LUNCH”) backwards, reciting the days of the week backwards, 10-letter vigilance “A” task, and 5 picture recognition task.

METHODS

Study Design and Setting

This was a preplanned secondary analysis of a prospective observational study that validated 3 delirium assessments.30,31 This study was conducted at a tertiary care, academic ED. The local institutional review board (IRB) reviewed and approved this study. Informed consent from the patient or an authorized surrogate was obtained whenever possible. Because this was an observational study and posed minimal risk to the patient, the IRB granted a waiver of consent for patients who were both unable to provide consent and were without an authorized surrogate available in the ED or by phone.

Selection of Participants

We enrolled a convenience sample of patients between June 2010 and February 2012 Monday through Friday from 8 am to 4 pm. This enrollment window was based upon the psychiatrist’s availability. Because of the extensiveness of the psychiatric evaluations, we limited enrollment to 1 patient per day. Patients who were 65 years or older, not in a hallway bed, and in the ED for less than 12 hours at the time of enrollment were included. We used a 12-hour cutoff so that patients who presented in the evening and early morning hours could be included. Patients were excluded if they were previously enrolled, non-English speaking, deaf or blind, comatose, suffered from end-stage dementia, or were unable to complete all the study assessments. The rationale for excluding patients with end-stage dementia was that diagnosing delirium in this patient population is challenging.

 

 

Research assistants approached patients who met inclusion criteria and determined if any exclusion criteria were present. If none of the exclusion criteria were present, then the research assistant reviewed the informed consent document with the patient or authorized surrogate if the patient was not capable of providing consent. If a patient was not capable of providing consent and no authorized surrogate was available, then the patient was enrolled (under the waiver of consent) as long as the patient assented to be a part of the study. Once the patient was enrolled, the research assistant contacted the physician rater and reference standard psychiatrists to approach the patient.

Measures of Inattention

An emergency physician (JHH) who had no formal training in the mental status assessment of elders administered a cognitive battery to the patient, including tests of inattention. The following inattention tasks were administered:

  • Spell the word “LUNCH” backwards.30 Patients were initially allowed to spell the word “LUNCH” forwards. Patients who were unable to perform the task were assigned 5 errors.
  • Recite the months of the year backwards from December to July.23,26,27,30,32 Patients who were unable to perform the task were assigned 6 errors.
  • Recite the days of the week backwards.23,26,33 Patients who were unable to perform the task were assigned 7 errors.
  • Ten-letter vigilance “A” task.34 The patient was given a series of 10 letters (“S-A-V-E-A-H-A-A-R-T”) every 3 seconds and was asked to squeeze the rater’s hand every time the patient heard the letter “A.” Patients who were unable to perform the task were assigned 10 errors.
  • Five picture recognition task.34 Patients were shown 5 objects on picture cards. Afterwards, patients were shown 10 pictures with the previously shown objects intermingled. The patient had to identify which objects were seen previously in the first 5 pictures. Patients who were unable to perform the task were assigned 10 errors.
  • Recite the months of the year backwards from December to January.29 Patients who were unable to perform the task were assigned 12 errors.

Reference Standard for Delirium

A comprehensive consultation-liaison psychiatrist assessment was the reference standard for delirium; the diagnosis of delirium was based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM-IV-TR) criteria.35 Three psychiatrists who each had an average of 11 years of clinical experience and regularly diagnosed delirium as part of their daily clinical practice were available to perform these assessments. To arrive at the diagnosis of delirium, they interviewed those who best understood the patient’s mental status (eg, the patient’s family members or caregivers, physician, and nurses). They also reviewed the patient’s medical record and radiology and laboratory test results. They performed bedside cognitive testing that included, but was not limited to, the Mini-Mental State Examination, Clock Drawing Test, Luria hand sequencing task, and tests for verbal fluency. A focused neurological examination was also performed (ie, screening for paraphasic errors, tremors, tone, asterixis, frontal release signs, etc.), and they also evaluated the patient for affective lability, hallucinations, and level of alertness. If the presence of delirium was still questionable, then confrontational naming, proverb interpretation or similarities, and assessments for apraxias were performed at the discretion of the psychiatrist. The psychiatrists were blinded to the physician’s assessments, and the assessments were conducted within 3 hours of each other.

Additional Variables Collected

Using medical record review, comorbidity burden, severity of illness, and premorbid cognition were ascertained. The Charlson Comorbidity Index, a weighted index that takes into account the number and seriousness of 19 preexisting comorbid conditions, was used to quantify comorbidity burden; higher scores indicate higher comorbid burden.36,37 The Acute Physiology Score of the Acute Physiology and Chronic Health Evaluation II was used to quantify severity of illness.38 This score is based upon the initial values of 12 routine physiologic measurements such as vital sign and laboratory abnormalities; higher scores represent higher severities of illness.38 The medical record was reviewed to ascertain the presence of premorbid cognitive impairment; any documentation of dementia in the patient’s clinical problem list or physician history and physical examination from the outpatient or inpatient settings was considered positive. The medical record review was performed by a research assistant and was double-checked for accuracy by one of the investigators (JHH).

Data Analyses

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. Receiver operating characteristic curves were constructed for each inattention task. Area under the receiver operating characteristic curves (AUC) was reported to provide a global measure of diagnostic accuracy. Sensitivities, specificities, positive likelihood ratios (PLRs), and negative likelihood ratios (NLRs) with their 95% CIs were calculated using the psychiatrist’s assessment as the reference standard.39 Cut-points with PLRs greater than 10 (strongly increased the likelihood of delirium) or NLRs less than 0.1 (strongly decreased the likelihood of delirium) were preferentially reported whenever possible.

 

 

All statistical analyses were performed with open source R statistical software version 3.0.1 (http://www.r-project.org/), SAS 9.4 (SAS Institute, Cary, NC), and Microsoft Excel 2010 (Microsoft Inc., Redmond, WA).

RESULTS

A total of 542 patients were screened; 214 patients refused to participate, and 93 were excluded, leaving 235 patients. The patient characteristics can be seen in Table 1. Compared with all patients (N = 15,359) who presented to the ED during the study period, enrolled patients were similar in age but more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital. Of those enrolled, 25 (10.6%) were delirious. Delirious patients were older, more likely to be nonwhite, have a past history of dementia, have a graduate school degree, and have a chief complaint of altered mental status.

Making any error on the MOTYB-6 task had a sensitivity of 80.0% (95% CI, 60.9%-91.1%), specificity of 57.1% (95% CI, 50.4%-63.7%), PLR of 1.87 (95% CI, 1.45-2.40) and NLR of 0.35 (95% CI, 0.16-0.77) for delirium as diagnosed by a psychiatrist. Making any error on the MOTYB-12 task had a sensitivity of 84.0% (95% CI, 65.4%-93.6%), specificity of 51.9% (95% CI, 45.2%-58.5%), PLR of 1.75 (95% CI, 1.40-2.18), and NLR of 0.31 (95% CI, 0.12-0.76) for delirium. The AUCs for the MOTYB-6 and MOTYB-12 tasks were 0.77 and 0.78, respectively, indicating very good diagnostic performance.

The diagnostic performances of the other inattention tasks and additional cutoff values for the MOTYB-6 and MOTYB-12 tasks can be seen in Table 2. Increasing the MOTYB-6 cut-off to 2 or more errors and MOTYB-12 cut-off to 3 or more errors increased the specificity to 70.0% and 70.5%, respectively, without decreasing their sensitivity. The best combination of sensitivity and specificity was reciting the days of the week backwards task; if the patient made any error, this was 84.0% (95% CI, 65.4%-93.6%) sensitive and 81.9% (95% CI, 76.1%-86.5%) specific for delirium. The inattention tasks that strongly increased the likelihood of delirium (PLR > 10) were the vigilance “A” and picture recognition tasks. If the patient made 2 or more errors on the vigilance task or 3 or more errors on the picture recognition task, then the likelihood of delirium strongly increased, as evidenced by a PLR of 16.80 (95% CI, 8.01-35.23) and 23.10 (95% CI, 7.95-67.12), respectively. No other inattention tasks were able to achieve a PLR of greater than 10, regardless of what cutoff was used. No inattention task was able to achieve a NLR of less than 0.10, which would have strongly decreased the likelihood of delirium. The best NLRs were if the patient made no errors spelling the word “LUNCH” backwards (NLR, 0.16; 95% CI, 0.04-0.60), no errors on the vigilance “A” task (NLR, 0.18; 95% CI, 0.07-0.43), and no errors on the days of the week backwards task (NLR, 0.20; 95% CI, 0.08-0.48).

DISCUSSION

Delirium is frequently missed by healthcare providers because it is not routinely screened for in the acute care setting. To help address this deficiency of care, we evaluated several brief measures of inattention that take less than 30 seconds to complete. We observed that any errors made on the MOTYB-6 and MOTYB-12 tasks had very good sensitivities (80% and 84%) but were limited by their modest specificities (approximately 50%) for delirium. As a result, these assessments have limited clinical utility as standalone delirium screens. We also explored other commonly used brief measures of inattention and at a variety of error cutoffs. Reciting the days of the week backwards appeared to best balance sensitivity and specificity. None of the inattention measures could convincingly rule out delirium (NLR < 0.10), but the vigilance “A” and picture recognition tasks may have clinical utility in ruling in delirium (PLR > 10). Overall, all the inattention tasks, including MOTYB-6 and MOTYB-12, had very good diagnostic performances based upon their AUC. However, achieving a high sensitivity often had to be sacrificed for specificity or, alternatively, achieving a high specificity had to be sacrificed for sensitivity.

Inattention has been shown to be the cardinal feature for delirium,40 and its assessment using cognitive testing has been recommended to help identify the presence of delirium according to an expert consensus panel.26 The diagnostic performance of the MOTYB-12 observed in our study is similar to a study by Fick et al., who reported that MOTYB-12 had very good sensitivity (83%) but had modest specificity (69%) with a cutoff of 1 or more errors. Hendry et al. observed that the MOTYB-12 was 91% sensitive and 50% specific using a cutoff of 4 or more errors. With regard to the MOTYB-6, our reported specificity was different from what was observed by O’Regan et al.27 Using 1 or more errors as a cutoff, they observed a much higher specificity for delirium than we did (90% vs 57%). Discordant observations regarding the diagnostic accuracy for other inattention tasks also exist. We observed that making any error on the days of the week backwards task was 84% sensitive and 82% specific for delirium, whereas Fick et al. observed a sensitivity and specificity of 50% and 94%, respectively. For the vigilance “A” task, we observed that making 2 or more errors over a series of 10 letters was 64.0% sensitive and 91.4% specific for delirium, whereas Pompei et al.41 observed that making 2 or more errors over a series of 60 letters was 51% sensitive and 77% specific for delirium.

The abovementioned discordant findings may be driven by spectrum bias, wherein the sensitivities and specificities for each inattention task may differ in different subgroups. As a result, differences in the age distribution, proportion of college graduates, history of dementia, and susceptibility to delirium can influence overall sensitivity and specificity. Objective measures of delirium, including the inattention screens studied, are particularly prone to spectrum bias.31,34 However, the strength of this approach is that the assessment of inattention becomes less reliant upon clinical judgment and allows it to be used by raters from a wide range of clinical backgrounds. On the other hand, a subjective interpretation of these inattention tasks may allow the rater to capture the subtleties of inattention (ie, decreased speed of performance in a highly intelligent and well-educated patient without dementia). The disadvantage of this approach, however, is that it is more dependent on clinical judgment and may have decreased diagnostic accuracy in those with less clinical experience or with limited training.14,42,43 These factors must be carefully considered when determining which delirium assessment to use.

Additional research is required to determine the clinical utility of these brief inattention assessments. These findings need to be further validated in larger studies, and the optimal cutoff of each task for different subgroup of patients (eg, demented vs nondemented) needs to be further clarified. It is not completely clear whether these inattention tests can serve as standalone assessments. Depending on the cutoff used, some of these assessments may have unacceptable false negative or false positive rates that may lead to increased adverse patient outcomes or increased resource utilization, respectively. Additional components or assessments may be needed to improve the diagnostic accuracy of these assessments. In addition to understanding these inattention assessments’ diagnostic accuracies, their ability to predict adverse outcomes also needs to be investigated. While a previous study observed that making any error on the MOTYB-12 task was associated with increased physical restraint use and prolonged hospital length of stay,44 these assessments’ ability to prognosticate long-term outcomes such as mortality or long-term cognition or function need to be studied. Lastly, studies should also evaluate how easily implementable these assessments are and whether improved delirium recognition leads to improved patient outcomes.

This study has several notable limitations. Though planned a priori, this was a secondary analysis of a larger investigation designed to validate 3 delirium assessments. Our sample size was also relatively small, causing our 95% CIs to overlap in most cases and limiting the statistical power to truly determine whether one measure is better than the other. We also asked the patient to recite the months backwards from December to July as well as recite the months backwards from December to January. It is possible that the patient may have performed better at going from December to January because of learning effect. Our reference standard for delirium was based upon DSM-IV-TR criteria. The new DSM-V criteria may be more restrictive and may slightly change the sensitivities and specificities of the inattention tasks. We enrolled a convenience sample and enrolled patients who were more likely to be male, have cardiovascular chief complaints, and be admitted to the hospital; as a result, selection bias may have been introduced. Lastly, this study was conducted in a single center and enrolled patients who were 65 years and older. Our findings may not be generalizable to other settings and in those who are less than 65 years of age.

 

 

CONCLUSIONS

The MOTYB-6 and MOTYB-12 tasks had very good sensitivities but modest specificities (approximately 50%) using any error made as a cutoff; increasing cutoff to 2 errors and 3 errors, respectively, improved their specificities (approximately 70%) with minimal impact to their sensitivities. Reciting the days of the week backwards, spelling the word “LUNCH” backwards, and the 10-letter vigilance “A” task appeared to perform the best in ruling out delirium but only moderately decreased the likelihood of delirium. The 10-letter Vigilance “A” and picture recognition task appeared to perform the best in ruling in delirium. Days of the week backwards appeared to have the best combination of sensitivity and specificity.

Disclosure

This study was funded by the Emergency Medicine Foundation Career Development Award, National Institutes of Health K23AG032355, and National Center for Research Resources, Grant UL1 RR024975-01. The authors report no financial conflicts of interest.

References

1. American Psychiatric Association. Diagnostic and statistical manual of mental disorders: DSM-5. Washington, DC: American Psychiatric Association; 2013.
2. Hustey FM, Meldon SW, Smith MD, Lex CK. The effect of mental status screening on the care of elderly emergency department patients. Ann Emerg Med. 2003;41(5):678-684.
3. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three-site epidemiologic study. J Gen Intern Med. 1998;13(4):234-242.
4. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19(2-3):158-163.
5. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56(3):244-252.
6. Gross AL, Jones RN, Habtemariam DA, et al. Delirium and long-term cognitive trajectory among persons with dementia. Arch Intern Med. 2012;172(17):1324-1331.
7. Davis DH, Muniz Terrera G, Keage H, et al. Delirium is a strong risk factor for dementia in the oldest-old: a population-based cohort study. Brain. 2012;135(Pt 9):2809-2816.
8. Ely EW, Baker AM, Dunagan DP, et al. Effect on the duration of mechanical ventilation of identifying patients capable of breathing spontaneously. N Engl J Med. 1996;335(25):1864-1869.
9. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291(14):1753-1762.
10. Lin SM, Liu CY, Wang CH, et al. The impact of delirium on the survival of mechanically ventilated patients. Crit Care Med. 2004;32(11):2254-2259.
11. Salluh JI, Wang H, Schneider EB, et al. Outcome of delirium in critically ill patients: systematic review and meta-analysis. BMJ. 2015;350:h2538.
12. Klein Klouwenberg PM, Zaal IJ, Spitoni C, et al. The attributable mortality of delirium in critically ill patients: prospective cohort study. BMJ. 2014;349:g6652.
13. Han JH, Vasilevskis EE, Chandrasekhar R, et al. Delirium in the Emergency Department and Its Extension into Hospitalization (DELINEATE) Study: Effect on 6-month Function and Cognition. J Am Geriatr Soc. 2017;65(6):1333-1338.
14. Inouye SK, Foreman MD, Mion LC, Katz KH, Cooney LM Jr. Nurses’ recognition of delirium and its symptoms: comparison of nurse and researcher ratings. Arch Intern Med. 2001;161(20):2467-2473.
15. Han JH, Zimmerman EE, Cutler N, et al. Delirium in older emergency department patients: recognition, risk factors, and psychomotor subtypes. Acad Emerg Med. 2009;16(3):193-200.
16. Elie M, Cole MG, Primeau FJ, Bellavance F. Delirium risk factors in elderly hospitalized patients. J Gen Intern Med. 1998;13(3):204-212.
17. Spronk PE, Riekerk B, Hofhuis J, Rommes JH. Occurrence of delirium is severely underestimated in the ICU during daily care. Intensive Care Med. 2009;35(7):1276-1280.
18. van Eijk MM, van Marum RJ, Klijn IA, de Wit N, Kesecioglu J, Slooter AJ. Comparison of delirium assessment tools in a mixed intensive care unit. Crit Care Med. 2009;37(6):1881-1885.
19. Devlin JW, Fong JJ, Schumaker G, O’Connor H, Ruthazer R, Garpestad E. Use of a validated delirium assessment tool improves the ability of physicians to identify delirium in medical intensive care unit patients. Crit Care Med. 2007;35(12):2721-2724.
20. Han JH, Eden S, Shintani A, et al. Delirium in Older Emergency Department Patients Is an Independent Predictor of Hospital Length of Stay. Acad Emerg Med. 2011;18(5):451-457.
21. Pun BT, Gordon SM, Peterson JF, et al. Large-scale implementation of sedation and delirium monitoring in the intensive care unit: a report from two medical centers. Crit Care Med. 2005;33(6):1199-1205.
22. Grossmann FF, Hasemann W, Graber A, Bingisser R, Kressig RW, Nickel CH. Screening, detection and management of delirium in the emergency department - a pilot study on the feasibility of a new algorithm for use in older emergency department patients: the modified Confusion Assessment Method for the Emergency Department (mCAM-ED). Scand J Trauma Resusc Emerg Med. 2014;22:19.
23. Marcantonio ER, Ngo LH, O’Connor M, et al. 3D-CAM: Derivation and Validation of a 3-Minute Diagnostic Interview for CAM-Defined Delirium: A Cross-sectional Diagnostic Test Study. Ann Intern Med. 2014;161(8):554-561.
24. Blazer DG, van Nieuwenhuizen AO. Evidence for the diagnostic criteria of delirium: an update. Curr Opin Psychiatry. 2012;25(3):239-243.
25. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65(3):207-214.
26. Huang LW, Inouye SK, Jones RN, et al. Identifying indicators of important diagnostic features of delirium. J Am Geriatr Soc. 2012;60(6):1044-1050.
27. O’Regan NA, Ryan DJ, Boland E, et al. Attention! A good bedside test for delirium? J Neurol Neurosurg Psychiatry. 2014;85(10):1122-1131.
28. Richardson ML, Petscavage JM. Verification bias: an under-recognized source of error in assessing the efficacy of MRI of the meniscii. Acad Radiol. 2011;18(11):1376-1381.
29. Fick DM, Inouye SK, Guess J, et al. Preliminary development of an ultrabrief two-item bedside test for delirium. J Hosp Med. 2015;10(10):645-650.
30. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the delirium triage screen and the brief confusion assessment method. Ann Emerg Med. 2013;62(5):457-465.
31. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method for the Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21(2):180-187.
32. Inouye SK. Delirium in hospitalized older patients. Clin Geriatr Med. 1998;14(4):745-764.

33. Hamrick I, Hafiz R, Cummings DM. Use of days of the week in a modified mini-mental state exam (M-MMSE) for detecting geriatric cognitive impairment. J Am Board Fam Med. 2013;26(4):429-435.

34. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM-ICU). JAMA. 2001;286(21):2703-2710.
35. American Psychiatric Association. Task Force on DSM-IV. Diagnostic and statistical manual of mental disorders: DSM-IV-TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
36. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one-year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13(5):530-536.
37. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383.
38. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13(10):818-829.
39. Simel DL, Samsa GP, Matchar DB. Likelihood ratios with confidence: sample size estimation for diagnostic test studies. J Clin Epidemiol. 1991;44(8):763-770.
40. Blazer DG, Wu LT. The epidemiology of substance use and disorders among middle aged and elderly community adults: national survey on drug use and health. Am J Geriatr Psychiatry. 2009;17(3):237-245.
41. Pompei P, Foreman M, Cassel CK, Alessi C, Cox D. Detecting delirium among hospitalized older patients. Arch Intern Med. 1995;155(3):301-307.
42. Lemiengre J, Nelis T, Joosten E, et al. Detection of delirium by bedside nurses using the confusion assessment method. J Am Geriatr Soc. 2006;54(4):685-689.
43. Ryan K, Leonard M, Guerin S, Donnelly S, Conroy M, Meagher D. Validation of the confusion assessment method in the palliative care setting. Palliat Med. 2009;23(1):40-45.
44. Yevchak AM, Doherty K, Archambault EG, Kelly B, Fonda JR, Rudolph JL. The association between an ultrabrief cognitive screening in older adults and hospital outcomes. J Hosp Med. 2015;10(10):651-657.

References

1. American Psychiatric Association. Diagnostic and statistical manual of mental disorders: DSM-5. Washington, DC: American Psychiatric Association; 2013.
2. Hustey FM, Meldon SW, Smith MD, Lex CK. The effect of mental status screening on the care of elderly emergency department patients. Ann Emerg Med. 2003;41(5):678-684.
3. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three-site epidemiologic study. J Gen Intern Med. 1998;13(4):234-242.
4. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19(2-3):158-163.
5. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56(3):244-252.
6. Gross AL, Jones RN, Habtemariam DA, et al. Delirium and long-term cognitive trajectory among persons with dementia. Arch Intern Med. 2012;172(17):1324-1331.
7. Davis DH, Muniz Terrera G, Keage H, et al. Delirium is a strong risk factor for dementia in the oldest-old: a population-based cohort study. Brain. 2012;135(Pt 9):2809-2816.
8. Ely EW, Baker AM, Dunagan DP, et al. Effect on the duration of mechanical ventilation of identifying patients capable of breathing spontaneously. N Engl J Med. 1996;335(25):1864-1869.
9. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291(14):1753-1762.
10. Lin SM, Liu CY, Wang CH, et al. The impact of delirium on the survival of mechanically ventilated patients. Crit Care Med. 2004;32(11):2254-2259.
11. Salluh JI, Wang H, Schneider EB, et al. Outcome of delirium in critically ill patients: systematic review and meta-analysis. BMJ. 2015;350:h2538.
12. Klein Klouwenberg PM, Zaal IJ, Spitoni C, et al. The attributable mortality of delirium in critically ill patients: prospective cohort study. BMJ. 2014;349:g6652.
13. Han JH, Vasilevskis EE, Chandrasekhar R, et al. Delirium in the Emergency Department and Its Extension into Hospitalization (DELINEATE) Study: Effect on 6-month Function and Cognition. J Am Geriatr Soc. 2017;65(6):1333-1338.
14. Inouye SK, Foreman MD, Mion LC, Katz KH, Cooney LM Jr. Nurses’ recognition of delirium and its symptoms: comparison of nurse and researcher ratings. Arch Intern Med. 2001;161(20):2467-2473.
15. Han JH, Zimmerman EE, Cutler N, et al. Delirium in older emergency department patients: recognition, risk factors, and psychomotor subtypes. Acad Emerg Med. 2009;16(3):193-200.
16. Elie M, Cole MG, Primeau FJ, Bellavance F. Delirium risk factors in elderly hospitalized patients. J Gen Intern Med. 1998;13(3):204-212.
17. Spronk PE, Riekerk B, Hofhuis J, Rommes JH. Occurrence of delirium is severely underestimated in the ICU during daily care. Intensive Care Med. 2009;35(7):1276-1280.
18. van Eijk MM, van Marum RJ, Klijn IA, de Wit N, Kesecioglu J, Slooter AJ. Comparison of delirium assessment tools in a mixed intensive care unit. Crit Care Med. 2009;37(6):1881-1885.
19. Devlin JW, Fong JJ, Schumaker G, O’Connor H, Ruthazer R, Garpestad E. Use of a validated delirium assessment tool improves the ability of physicians to identify delirium in medical intensive care unit patients. Crit Care Med. 2007;35(12):2721-2724.
20. Han JH, Eden S, Shintani A, et al. Delirium in Older Emergency Department Patients Is an Independent Predictor of Hospital Length of Stay. Acad Emerg Med. 2011;18(5):451-457.
21. Pun BT, Gordon SM, Peterson JF, et al. Large-scale implementation of sedation and delirium monitoring in the intensive care unit: a report from two medical centers. Crit Care Med. 2005;33(6):1199-1205.
22. Grossmann FF, Hasemann W, Graber A, Bingisser R, Kressig RW, Nickel CH. Screening, detection and management of delirium in the emergency department - a pilot study on the feasibility of a new algorithm for use in older emergency department patients: the modified Confusion Assessment Method for the Emergency Department (mCAM-ED). Scand J Trauma Resusc Emerg Med. 2014;22:19.
23. Marcantonio ER, Ngo LH, O’Connor M, et al. 3D-CAM: Derivation and Validation of a 3-Minute Diagnostic Interview for CAM-Defined Delirium: A Cross-sectional Diagnostic Test Study. Ann Intern Med. 2014;161(8):554-561.
24. Blazer DG, van Nieuwenhuizen AO. Evidence for the diagnostic criteria of delirium: an update. Curr Opin Psychiatry. 2012;25(3):239-243.
25. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65(3):207-214.
26. Huang LW, Inouye SK, Jones RN, et al. Identifying indicators of important diagnostic features of delirium. J Am Geriatr Soc. 2012;60(6):1044-1050.
27. O’Regan NA, Ryan DJ, Boland E, et al. Attention! A good bedside test for delirium? J Neurol Neurosurg Psychiatry. 2014;85(10):1122-1131.
28. Richardson ML, Petscavage JM. Verification bias: an under-recognized source of error in assessing the efficacy of MRI of the meniscii. Acad Radiol. 2011;18(11):1376-1381.
29. Fick DM, Inouye SK, Guess J, et al. Preliminary development of an ultrabrief two-item bedside test for delirium. J Hosp Med. 2015;10(10):645-650.
30. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the delirium triage screen and the brief confusion assessment method. Ann Emerg Med. 2013;62(5):457-465.
31. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method for the Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21(2):180-187.
32. Inouye SK. Delirium in hospitalized older patients. Clin Geriatr Med. 1998;14(4):745-764.

33. Hamrick I, Hafiz R, Cummings DM. Use of days of the week in a modified mini-mental state exam (M-MMSE) for detecting geriatric cognitive impairment. J Am Board Fam Med. 2013;26(4):429-435.

34. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM-ICU). JAMA. 2001;286(21):2703-2710.
35. American Psychiatric Association. Task Force on DSM-IV. Diagnostic and statistical manual of mental disorders: DSM-IV-TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
36. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one-year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13(5):530-536.
37. Charlson ME, Pompei P, Ales KL, MacKenzie CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383.
38. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13(10):818-829.
39. Simel DL, Samsa GP, Matchar DB. Likelihood ratios with confidence: sample size estimation for diagnostic test studies. J Clin Epidemiol. 1991;44(8):763-770.
40. Blazer DG, Wu LT. The epidemiology of substance use and disorders among middle aged and elderly community adults: national survey on drug use and health. Am J Geriatr Psychiatry. 2009;17(3):237-245.
41. Pompei P, Foreman M, Cassel CK, Alessi C, Cox D. Detecting delirium among hospitalized older patients. Arch Intern Med. 1995;155(3):301-307.
42. Lemiengre J, Nelis T, Joosten E, et al. Detection of delirium by bedside nurses using the confusion assessment method. J Am Geriatr Soc. 2006;54(4):685-689.
43. Ryan K, Leonard M, Guerin S, Donnelly S, Conroy M, Meagher D. Validation of the confusion assessment method in the palliative care setting. Palliat Med. 2009;23(1):40-45.
44. Yevchak AM, Doherty K, Archambault EG, Kelly B, Fonda JR, Rudolph JL. The association between an ultrabrief cognitive screening in older adults and hospital outcomes. J Hosp Med. 2015;10(10):651-657.

Issue
Journal of Hospital Medicine 13(8)
Issue
Journal of Hospital Medicine 13(8)
Page Number
551-557. Published online first March 26, 2018
Page Number
551-557. Published online first March 26, 2018
Publications
Publications
Topics
Article Type
Sections
Article Source

© 2018 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
Jin H. Han, MD, MSc, Suite 312, 1313 21st Ave S, Nashville, TN 37232; Telephone: 615-322-0253; Fax: 615-936-1316 ; E-mail: jin.h.han@vanderbilt.edu
Content Gating
Open Access (article Unlocked/Open Access)
Alternative CME
Disqus Comments
Default
Use ProPublica
Hide sidebar & use full width
render the right sidebar.
Article PDF Media

Impaired Arousal and Mortality

Article Type
Changed
Sun, 05/21/2017 - 13:35
Display Headline
Impaired arousal at initial presentation predicts 6‐month mortality: An analysis of 1084 acutely ill older patients

Arousal is defined as the patient's overall level of responsiveness to the environment. Its assessment is standard of care in most intensive care units (ICUs) to monitor depth of sedation and underlying brain dysfunction. There has been recent interest in expanding the role of arousal assessment beyond the ICU. Specifically, the Veterans Affairs Delirium Working Group proposed that simple arousal assessment be a vital sign to quantify underlying brain dysfunction.[1] The rationale is that impaired arousal is closely linked with delirium,[2] and is an integral component of multiple delirium assessments.[3, 4, 5] Chester et al. observed that the presence of impaired arousal was 64% sensitive and 93% specific for delirium diagnosed by a psychiatrist.[2] Delirium is an under‐recognized public health problem that affects up to 25% of older hospitalized patients,[6, 7] is associated with a multitude of adverse outcomes such as death and accelerated cognitive decline,[8] and costs the US healthcare system an excess of $152 billion dollars.[9]

Most delirium assessments require the patient to undergo additional cognitive testing. The assessment of arousal, however, requires the rater to merely observe the patient during routine clinical care and can be easily integrated into the clinical workflow.[10] Because of its simplicity and brevity, assessing arousal alone using validated scales such as the Richmond Agitation‐Sedation Scale (RASS) may be a more appealing alternative to traditional, more complex delirium screening in the acute care setting. Its clinical utility would be further strengthened if impaired arousal was also associated with mortality, and conferred risk even in the absence of delirium. As a result, we sought to determine if impaired arousal at initial presentation in older acutely ill patients predicted 6‐month mortality and whether this relationship was present in the absence of delirium.

METHODS

Design Overview

We performed a planned secondary analysis of 2 prospective cohorts that enrolled patients from May 2007 to August 2008 between 8 am and 10 pm during the weekdays, and July 2009 to February 2012 between 8 am and 4 pm during the weekdays. The first cohort was designed to evaluate the relationship between delirium and patient outcomes.[11, 12] The second cohort was used to validate brief delirium assessments using a psychiatrist's assessment as the reference standard.[5, 13] The local institutional review board approved these studies.

Setting and Participants

These studies were conducted in an urban emergency department located within an academic, tertiary care hospital with over 57,000 visits annually. Patients were included if they were 65 years or older and in the emergency department for <12 hours at the time of enrollment. The 12‐hour cutoff was used to include patients who presented to the emergency department in the evening and early morning hours. Patients were excluded if they were previously enrolled, non‐English speaking, comatose, or were nonverbal and unable to follow simple commands prior to the acute illness. Because the July 2009 to February 2012 cohort was designed to validate delirium assessments with auditory and visual components, patients were also excluded if they were deaf or blind.

Measurement of Arousal

RASS is an arousal scale commonly used in ICUs to assess depth of sedation and ranges from 5 (unarousable) to +4 (combative); 0 represents normal arousal.[10, 14] The RASS simply requires the rater to observe the patient during their routine interactions and does not require any additional cognitive testing. The RASS terms sedation was modified to drowsy (Table 1), because we wanted to capture impaired arousal regardless of sedation administration. We did not use the modified RASS (mRASS) proposed by the Veteran's Affairs Delirium Working Group, because it was published after data collection began.[1] The mRASS is very similar to the RASS, except it also incorporates a very informal inattention assessment. The RASS was ascertained by research assistants who were college students and graduates, and emergency medical technician basics and paramedics. The principal investigator gave them a 5‐minute didactic lecture about the RASS and observed them perform the RASS in at least 5 patients prior to the start of the study. Inter‐rater reliability between trained research assistants and a physician was assessed for 456 (42.0%) patients of the study sample. The weighted kappa of the RASS was 0.61, indicating very good inter‐rater reliability. Because the 81.7% of patients with impaired arousal had a RASS of 1, the RASS dichotomized as normal (RASS=0) or impaired (RASS other than 0).

Richmond Agitation‐Sedation Scale
ScoreTermDescription
  • NOTE: The Richmond Agitation‐Sedation Scale (RASS) is a brief (<10 seconds) arousal scale that was developed by Sessler et al.[10] The RASS is traditionally used in the intensive care unit to monitor depth of sedation. The terms were modified to better reflect the patient's level of arousal rather than sedation. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS <0 indicates decreased arousal, and a RASS >0 indicates increased arousal.

+4CombativeOvertly combative, violent, immediate danger to staff
+3Very agitatedPulls or removes tube(s) or catheter(s), aggressive
+2AgitatedFrequent nonpurposeful movement
+1RestlessAnxious but movements not aggressive or vigorous
0Alert and calm 
1Slight drowsyNot fully alert, but has sustained awakening (eye opening/eye contact) to voice (>10 seconds)
2Moderately drowsyBriefly awakens with eye contact to voice (<10 seconds)
3Very drowsyMovement or eye opening to voice (but no eye contact)
4Awakens to pain onlyNo response to voice, but movement or eye opening to physical stimulation
5UnarousableNo response to voice or physical stimulation

Death Ascertainment

Death within 6 months was ascertained using the following algorithm: (1) The electronic medical record was searched to determine the patient's death status. (2) Patients who had a documented emergency department visit, outpatient clinic visit, or hospitalization after 6 months were considered to be alive at 6 months. (3) For the remaining patients, date of death was searched in the Social Security Death Index (SSDI). (4) Patients without a death recorded in the SSDI 1 year after the index visit was considered to be alive at 6 months. Nine hundred thirty‐one (85.9%) out of 1084 patients had a recorded death in the medical record or SSDI, or had an emergency department or hospital visit documented in their record 6 months after the index visit.

Additional Variables Collected

Patients were considered to have dementia if they had: (1) documented dementia in the medical record, (2) a short form Informant Questionnaire on Cognitive Decline in the Elderly score (IQCODE) greater than 3.38,[15] or (3) prescribed cholinesterase inhibitors prior to admission. The short form IQCODE is an informant questionnaire with 16 items; a cutoff of 3.38 out of 5.00 is 79% sensitive and 82% specific for dementia.[16] Premorbid functional status was determined by the Katz Activities of Daily Living (Katz ADL) and ranges from 0 (completely dependent) to 6 (completely independent).[17] Patients with a score <5 were considered to be functionally dependent. Both the IQCODE and Katz ADL were prospectively collected in the emergency department at the time of enrollment.

The Charlson Comorbidity Index was used to measure comorbid burden.[18] The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used to quantify severity of illness.[19] The Glasgow Coma Scale was not included in the APS because it was not collected. Intravenous, intramuscular, and oral benzodiazepine and opioids given in the prehospital and emergency department were also recorded. The Charlson Comorbidity Index, APS, and benzodiazepine and opioid administration were collected after patient enrollment using the electronic medical record.

Within 3 hours of the RASS, a subset of 406 patients was evaluated by a consultation‐liaison psychiatrist who determined the patient's delirium status using Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM‐IV‐TR) criteria.[20] Details of their comprehensive assessments have been described in a previous report.[5]

Statistical Analysis

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. For simple comparisons, Wilcoxon rank sum tests were performed for continuous data, and 2 analyses or Fisher exact test were performed for categorical data. To evaluate the predictive validity of impaired arousal on 6‐month mortality, the cumulative probability of survival was estimated within 6 months from the study enrollment date using the Kaplan‐Meier method. Cox proportional hazards regression was performed to assess if impaired arousal was independently associated with 6‐month mortality after adjusting for age, gender, nonwhite race, comorbidity burden (Charlson Comorbidity Index), severity of illness (APS), dementia, functional dependence (Katz ADL <5), nursing home residence, admission status, and benzodiazepine or opioid medication administration. Patients were censored at the end of 6 months. The selection of covariates was based upon expert opinion and literature review. The number of covariates used for the model was limited by the number of events to minimize overfitting; 1 df was allowed for every 10 to 15 events.[21] Because severity of illness, psychoactive medication administration, and admission status might modify the relationship between 6‐month mortality and impaired arousal, 2‐way interaction terms were incorporated. To maintain parsimony and minimize overfitting and collinearity, nonsignificant interaction terms (P>0.20) were removed in the final model.[22] Hazard ratios (HR) with their 95% confidence interval (95% CI) were reported.

To determine if arousal was associated with 6‐month mortality in the absence of delirium, we performed another Cox proportional hazard regression in a subset of 406 patients who received a psychiatrist assessment. Six‐month mortality was the dependent variable, and the independent variable was a 3‐level categorical variable of different arousal/delirium combinations: (1) impaired arousal/delirium positive, (2) impaired arousal/delirium negative, and (3) normal arousal (with or without delirium). Because there were only 8 patients who had normal arousal with delirium, this group was collapsed into the normal arousal without delirium group. Because there were 55 deaths, the number of covariates that could be entered into the Cox proportional hazard regression model was limited. We used the inverse weighted propensity score method to help minimize residual confounding.[23] Traditional propensity score adjustment could not be performed because there were 3 arousal/delirium categories. Similar to propensity score adjustment, inverse weighted propensity score method was used to help balance the distribution of patient characteristics among the exposure groups and also allow adjustment for multiple confounders while minimizing the degrees of freedom expended. A propensity score was the probability of having a particular arousal/delirium category based upon baseline patient characteristics. Multinomial logistic regression was performed to calculate the propensity score, and the baseline covariates used were age, gender, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, and nursing home residence. For the Cox proportional hazard regression model, each observation was weighted by the inverse of the propensity score for their given arousal/delirium category; propensity scores exceeding the 95th percentile were trimmed to avoid overly influential weighting. Benzodiazepine and opioid medications given in the emergency department and admission status were adjusted as covariates in the weighted Cox proportional hazard regression model.

Nineteen patients (1.8%) had missing Katz ADL; these missing values were imputed using multiple imputation. The reliability of the final regression models were internally validated using the bootstrap method.[21] Two thousand sets of bootstrap samples were generated by resampling the original data, and the optimism was estimated to determine the degree of overfitting.[21] An optimism value >0.85 indicated no evidence of substantial overfitting.[21] Variance inflation factors were used to check multicollinearity. Schoenfeld residuals were also analyzed to determine goodness‐of‐fit and assess for outliers. P values <0.05 were considered statistically significant. All statistical analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC) and open source R statistical software version 3.0.1 (http://www.r‐project.org/).

RESULTS

A total of 1903 patients were screened, and 1084 patients met enrollment criteria (Figure 1). Of these, 1051 (97.0%) were non‐ICU patients. Patient characteristics of this cohort can be seen in Table 2. Enrolled patients and potentially eligible patients who presented to the emergency department during the enrollment window were similar in age, gender, and severity of illness, but enrolled patients were slightly more likely to have a chief complaint of chest pain and syncope (unpublished data).

Figure 1
Enrollment flow diagram. RASS, Richmond Agitation‐Sedation Scale. Patients who were non‐verbal or unable to follow simple commands prior to their acute illness were considered to have end‐stage dementia.
Patient Characteristics
VariablesNormal Arousal, n=835Impaired Arousal, n=249P Value
  • NOTE: Patient characteristics and demographics of enrolled patients. Continuous and ordinal variables are expressed in medians and interquartile (IQR) ranges. Categorical variables are expressed in absolute numbers and percentages. *Patient was considered to have dementia if it was documented in the medical record, the patient was on home cholineresterase inhibitors, or had a short‐form Informant Questionnaire on Cognitive Decline in the Elderly >3.38. Patients with a Katz Activities of Daily Living of <5 were considered to be functionally dependent. There were 19 patients with missing Katz Activities of Daily Living scores. Charlson index is a weighted scale used to measure comorbidity burden. Higher scores indicate higher comorbidity burden. The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used quantify severity of illness. Glasgow Coma Scale was not incorporated in this score. Higher scores indicate higher severity of illness.

Median age, y (IQR)74 (6980)75 (7083)0.005
Female gender459 (55.0%)132 (53.0%)0.586
Nonwhite race122 (14.6%)51 (20.5%)0.027
Residence  <0.001
Home752 (90.1%)204 (81.9%) 
Assisted living29 (3.5%)13 (5.2%) 
Rehabilitation8 (1.0%)5 (2.0%) 
Nursing home42 (5.0%)27 (10.8%) 
Dementia*175 (21.0%)119 (47.8%)<0.001
Dependent120 (14.4%)99 (39.8%)<0.001
Median Charlson (IQR)2 (1, 4)3 (2, 5)<0.001
Median APS (IQR)2 (1, 4)2 (1, 5)<0.001
Primary complaint  <0.001
Abdominal pain45 (5.4%)13 (5.2%) 
Altered mental status12 (1.4%)36 (14.5%) 
Chest pain128 (15.3%)31 (12.5%) 
Disturbances of sensation17 (2.0%)2 (0.8%) 
Dizziness16 (1.9%)2 (0.8%) 
Fever11 (1.3%)7 (2.8%) 
General illness, malaise26 (3.1%)5 (2.0%) 
General weakness68 (8.1%)29 (11.7%) 
Nausea/vomiting29 (3.5%)4 (1.6%) 
Shortness of breath85 (10.2%)21 (8.4%) 
Syncope46 (5.5%)10 (4.0%) 
Trauma, multiple organs19 (2.3%)8 (3.2%) 
Other333 (39.9%)81 (32.5%) 
Benzodiazepines or opioid medications administration188 (22.5%)67 (26.9%)0.152
Admitted to the hospital478 (57.3%)191 (76.7%)0.002
Internal medicine411 (86.0%)153 (80.1%) 
Surgery38 (8.0%)21 (11.0%) 
Neurology19 (4.0%)13 (6.8%) 
Psychiatry1 (0.2%)2 (1.1%) 
Unknown/missing9 (1.9%)2 (1.1%) 
Death within 6 months81 (9.7%)59 (23.7%)<0.001

Of those enrolled, 249 (23.0%) had an abnormal RASS at initial presentation, and their distribution can be seen in Figure 2. Within 6 months, patients with an abnormal RASS were more likely to die compared with patients with a RASS of 0 (23.7% vs 9.7%, P<0.001). The Kaplan‐Meier survival curves for all enrolled patients with impaired and normal RASS can be seen in Figure 3; the survival curve declined more slowly in patients with a normal RASS compared with those with an abnormal RASS.

Figure 2
Richmond Agitation‐Sedation Scale (RASS) distribution among enrolled patients. Distribution of RASS at initial presentation among 1084 acutely ill older patients, and of these, 1051 patients (97.0%) were non–intensive care unit patients. The RASS is a widely used arousal scale that can be performed during routine clinical care and takes <10 seconds to perform. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS of <0 indicates decreased arousal and a RASS of >0 indicates increased arousal.
Figure 3
Kaplan‐Meier survival curves in acutely ill older patients with a normal and impaired arousal at initial presentation over a 6‐month period. Arousal was assessed for using the Richmond Agitation‐Sedation Scale (RASS). Patients with impaired arousal were more likely to die compared to patients with normal arousal (23.7% vs 9.7%) within 6 months. Using Cox proportional hazard regression, patients with an abnormal RASS were 73% more likely to die within 6 months after adjusting for age, dementia, functional dependence, comorbidity burden, severity of illness, hearing impairment, nursing home residence, admission status, and administration of benzodiazepines/opioids medications. Severity of illness (P = 0.52), benzodiazepine/opioid medication administration (P = 0.38), and admission status (P = 0.57) did not modify the relationship between impaired arousal and 6‐month mortality. Abbreviations: CI, confidence interval.

Using Cox proportional hazards regression, the relationship between an abnormal RASS at initial presentation and 6‐month mortality persisted (HR: 1.73, 95% CI: 1.21‐2.49) after adjusting for age, sex, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, nursing home residence, psychoactive medications given, and admission status. The interaction between an abnormal RASS and APS (severity of illness) had a P value of 0.52. The interaction between an abnormal RASS and benzodiazepine or opioid medication administration had a P value of 0.38. The interaction between an abnormal RASS and admission status had a P value of 0.57. This indicated that severity of illness, psychoactive medication administration, and admission status did not modify the relationship between an abnormal RASS and 6‐month mortality.

We analyzed a subset of 406 patients who received a psychiatrist's assessment to determine if an abnormal RASS was associated with 6‐month mortality regardless of delirium status using Cox proportional hazard regression weighted by the inverse of the propensity score. Patients with an abnormal RASS and no delirium were significantly associated with higher mortality compared to those with a normal RASS (HR: 2.20, 95% CI: 1.10‐4.41). Patients with an abnormal RASS with delirium also had an increased risk for 6‐month mortality (HR: 2.86, 95% CI: 1.29‐6.34).

All regression models were internally validated. There was no evidence of substantial overfitting or collinearity. The Schoenfeld residuals for each model were examined graphically and there was good model fit overall, and no significant outliers were observed.

DISCUSSION

Vital sign measurements are a fundamental component of patient care, and abnormalities can serve as an early warning signal of the patient's clinical deterioration. However, traditional vital signs do not include an assessment of the patient's brain function. Our chief finding is that impaired arousal at initial presentation, as determined by the nonphysician research staff, increased the risk of 6‐month mortality by 73% after adjusting for confounders in a diverse group of acutely ill older patients. This relationship existed regardless of severity of illness, administration of psychoactive medications, and admission status. Though impaired arousal is closely linked with delirium,[2, 24] which is another well‐known predictor of mortality,[11, 25, 26] the prognostic significance of impaired arousal appeared to extend beyond delirium. We observed that the relationship between 6‐month mortality and impaired arousal in the absence of delirium was remarkably similar to that observed with impaired arousal with delirium. Arousal can be assessed for by simply observing the patient during routine clinical care and can be performed by nonphysician and physician healthcare providers. Its assessment should be performed and communicated in conjunction with traditional vital sign measurements in the emergency department and inpatient settings.[1]

Most of the data linking impaired arousal to death have been collected in the ICU. Coma, which represents the most severe form of depressed arousal, has been shown to increase the likelihood of death regardless of underlying etiology.[27, 28, 29, 30, 31] This includes patients who have impaired arousal because they received sedative medications during mechanical ventilation.[32] Few studies have investigated the effect of impaired arousal in a non‐ICU patient population. Zuliani et al. observed that impaired arousal was associated with 30‐day mortality, but their study was conducted in 469 older stroke patients, limiting the study's external validity to a more general patient population.[33] Our data advance the current stage of knowledge; we observed a similar relationship between impaired arousal and 6‐month mortality in a much broader clinical population who were predominantly not critically ill regardless of delirium status. Additionally, most of our impaired arousal cohort had a RASS of 1, indicating that even subtle abnormalities portended adverse outcomes.

In addition to long‐term prognosis, the presence of impaired arousal has immediate clinical implications. Using arousal scales like the RASS can serve as a way for healthcare providers to succinctly communicate the patient's mental status in a standardized manner during transitions of care (eg, emergency physician to inpatient team). Regardless of which clinical setting they are in, older acutely ill patients with an impaired arousal may also require close monitoring, especially if the impairment is acute. Because of its close relationship with delirium, these patients likely have an underlying acute medical illness that precipitated their impaired arousal.

Understanding the true clinical significance of impaired arousal in the absence of delirium requires further study. Because of the fluctuating nature of delirium, it is possible that these patients may have initially been delirious and then became nondelirious during the psychiatrist's evaluation. Conversely, it is also possible that these patients may have eventually transitioned into delirium at later point in time; the presence of impaired arousal alone may be a precursor to delirium. Last, these patients may have had subsyndromal delirium, which is defined as having 1 or more delirium symptoms without ever meeting full DSM‐IV‐TR criteria for delirium.[34] Patients with subsyndromal delirium have poorer outcomes, such as prolonged hospitalizations, and higher mortality than patients without delirium symptoms.[34]

Additional studies are also needed to further clarify the impact of impaired arousal on nonmortality outcomes such as functional and cognitive decline. The prognostic significance of serial arousal measurements also requires further study. It is possible that patients whose impaired arousal rapidly resolves after an intervention may have better prognoses than those who have persistent impairment. The measurement of arousal may have additional clinical applications in disease prognosis models. The presence of altered mental status is incorporated in various disease‐specific risk scores such as the CURB‐65 or Pneumonia Severity Index for pneumonia,[35, 36] and the Pulmonary Embolism Severity Index for pulmonary embolism.[37] However, the definition of altered mental status is highly variable; it ranges from subjective impressions that can be unreliable to formal cognitive testing, which can be time consuming. Arousal scales such as the RASS may allow for more feasible, reliable, and standardized assessment of mental status. Future studies should investigate if incorporating the RASS would improve the discrimination of these disease‐severity indices.

This study has several notable limitations. We excluded patients with a RASS of 4 and 5, which represented comatose patients. This exclusion, however, likely biased our findings toward the null. We enrolled a convenience sample that may have introduced selection bias. However, our enrolled cohort was similar to all potentially eligible patients who presented to the emergency department during the study period. We also attempted to mitigate this selection bias by using multivariable regression and adjusting for factors that may have confounded the relationship between RASS and 6‐month mortality. This study was performed at a single, urban, academic hospital and enrolled patients who were aged 65 years and older. Our findings may not be generalizable to other settings and to those who are under 65 years of age. Because 406 patients received a psychiatric evaluation, this limited the number of covariates that could be incorporated into the multivariable model to evaluate if impaired arousal in the absence of delirium is associated with 6‐month mortality. To minimize residual confounding, we used the inverse weighted propensity score, but we acknowledge that this bias may still exist. Larger studies are needed to clarify the relationships between arousal, delirium, and mortality.

CONCLUSION

In conclusion, impaired arousal at initial presentation is an independent predictor for 6‐month mortality in a diverse group of acutely ill older patients, and this risk appears to be present even in the absence of delirium. Because of its ease of use and prognostic significance, it may be a useful vital sign for underlying brain dysfunction. Routine standardized assessment and communication of arousal during routine clinical care may be warranted.

Disclosures: Research reported in this publication was supported by the Vanderbilt Physician Scientist Development Award, Emergency Medicine Foundation, and National Institute on Aging of the National Institutes of Health under award number K23AG032355. This study was also supported by the National Center for Research Resources, grant UL1 RR024975‐01, and is now at the National Center for Advancing Translational Sciences, grant 2 UL1 TR000445‐06. Dr. Vasilevskis was supported in part by the National Institute on Aging of the National Institutes of Health under award number K23AG040157. Dr. Powers was supported by Health Resources and Services Administration Geriatric Education Centers, grant 1D31HP08823‐01‐00. Dr. Storrow was supported by the National Heart, Lung, and Blood Institute of the National Institutes of Health under award number K12HL1090 and the National Center for Advancing Translational Sciences under award number UL1TR000445. Dr. Ely was supported in part by the National Institute on Aging of the National Institutes of Health under award numbers R01AG027472 and R01AG035117, and a Veteran Affairs MERIT award. Drs. Vasilevskis, Schnelle, Dittus, Powers, and Ely were supported by the Veteran Affairs Geriatric Research, Education, and Clinical Center. The content is solely the responsibility of the authors and does not necessarily represent the official views of Vanderbilt University, Emergency Medicine Foundation, National Institutes of Health, and Veterans Affairs. The funding agencies did not have any role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.

J.H.H., E.W.E., J.F.S., A.B.S., and R.D.S. conceived the trial. J.H.H., E.W.E., A.B.S., J.F.S., R.D.S., A.S., and A.W. participated in the study design. J.H.H. and A.W. recruited patients and collected the data. J.H.H., A.J.G., and A.S. analyzed the data. All authors participated in the interpretation of results. J.H.H. drafted the manuscript, and all authors contributed to the critical review and revision of the manuscript.

The authors report no conflicts of interest.

Files
References
  1. Flaherty JH, Shay K, Weir C, et al. The development of a mental status vital sign for use across the spectrum of care. J Am Med Dir Assoc. 2009;10:379380.
  2. Chester JG, Beth Harrington M, Rudolph JL, Group VADW. Serial administration of a modified Richmond Agitation and Sedation Scale for delirium screening. J Hosp Med. 2012;7:450453.
  3. Inouye SK, Dyck CH, Alessi CA, Balkin S, Siegal AP, Horwitz RI. Clarifying confusion: the confusion assessment method. A new method for detection of delirium. Ann Intern Med. 1990;113:941948.
  4. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM‐ICU). JAMA. 2001;286:27032710.
  5. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the Delirium Triage Screen And The Brief Confusion Assessment Method. Ann Emerg Med. 2013;62:457465.
  6. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three‐site epidemiologic study. J Gen Intern Med. 1998;13:234242.
  7. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19:158163.
  8. Witlox J, Eurelings LS, Jonghe JF, Kalisvaart KJ, Eikelenboom P, Gool WA. Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta‐analysis. JAMA. 2010;304:443451.
  9. Leslie DL, Marcantonio ER, Zhang Y, Leo‐Summers L, Inouye SK. One‐year health care costs associated with delirium in the elderly population. Arch Intern Med. 2008;168:2732.
  10. Sessler CN, Gosnell MS, Grap MJ, et al. The Richmond Agitation‐Sedation Scale: validity and reliability in adult intensive care unit patients. Am J Respir Crit Care Med. 2002;166:13381344.
  11. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56:244252.
  12. Han JH, Eden S, Shintani A, et al. Delirium in older emergency department patients is an independent predictor of hospital length of stay. Acad Emerg Med. 2011;18:451457.
  13. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method For The Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21:180187.
  14. Ely EW, Truman B, Shintani A, et al. Monitoring sedation status over time in ICU patients: reliability and validity of the Richmond Agitation‐Sedation Scale (RASS). JAMA. 2003;289:29832991.
  15. Holsinger T, Deveau J, Boustani M, Williams JW. Does this patient have dementia? JAMA. 2007;297:23912404.
  16. Jorm AF. A short form of the Informant Questionnaire on Cognitive Decline in the Elderly (IQCODE): development and cross‐validation. Psychol Med. 1994;24:145153.
  17. Katz S. Assessing self‐maintenance: activities of daily living, mobility, and instrumental activities of daily living. J Am Geriatr Soc. 1983;31:721727.
  18. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one‐year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13:530536.
  19. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13:818829.
  20. American Psychiatric Association. Task Force on DSM‐IV. Diagnostic and Statistical Manual of Mental Disorders: DSM‐IV‐TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
  21. Harrell FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. New York, NY: Springer; 2001.
  22. Marshall SW. Power for tests of interaction: effect of raising the Type I error rate. Epidemiol Perspect Innov. 2007;4:4.
  23. Austin PC. An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies. Multivariate Behav Res. 2011;46:399424.
  24. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65:207214.
  25. McCusker J, Cole M, Abrahamowicz M, Primeau F, Belzile E. Delirium predicts 12‐month mortality. Arch Intern Med. 2002;162:457463.
  26. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291:17531762.
  27. Teres D, Brown RB, Lemeshow S. Predicting mortality of intensive care unit patients. The importance of coma. Crit Care Med. 1982;10:8695.
  28. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet. 1975;1:480484.
  29. Levy DE, Caronna JJ, Singer BH, Lapinski RH, Frydman H, Plum F. Predicting outcome from hypoxic‐ischemic coma. JAMA. 1985;253:14201426.
  30. Tuhrim S, Dambrosia JM, Price TR, et al. Prediction of intracerebral hemorrhage survival. Ann Neurol. 1988;24:258263.
  31. Booth CM, Boone RH, Tomlinson G, Detsky AS. Is this patient dead, vegetative, or severely neurologically impaired? Assessing outcome for comatose survivors of cardiac arrest. JAMA. 2004;291:870879.
  32. Shehabi Y, Bellomo R, Reade MC, et al. Early intensive care sedation predicts long‐term mortality in ventilated critically ill patients. Am J Respir Crit Care Med. 2012;186:724731.
  33. Zuliani G, Cherubini A, Ranzini M, Ruggiero C, Atti AR, Fellin R. Risk factors for short‐term mortality in older subjects with acute ischemic stroke. Gerontology. 2006;52:231236.
  34. Cole M, McCusker J, Dendukuri N, Han L. The prognostic significance of subsyndromal delirium in elderly medical inpatients. J Am Geriatr Soc. 2003;51:754760.
  35. Lim WS, der Eerden MM, Laing R, et al. Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study. Thorax. 2003;58:377382.
  36. Fine MJ, Auble TE, Yealy DM, et al. A prediction rule to identify low‐risk patients with community‐acquired pneumonia. N Engl J Med. 1997;336:243250.
  37. Aujesky D, Obrosky DS, Stone RA, et al. Derivation and validation of a prognostic model for pulmonary embolism. Am J Respir Crit Care Med. 2005;172:10411046.
Article PDF
Issue
Journal of Hospital Medicine - 9(12)
Publications
Page Number
772-778
Sections
Files
Files
Article PDF
Article PDF

Arousal is defined as the patient's overall level of responsiveness to the environment. Its assessment is standard of care in most intensive care units (ICUs) to monitor depth of sedation and underlying brain dysfunction. There has been recent interest in expanding the role of arousal assessment beyond the ICU. Specifically, the Veterans Affairs Delirium Working Group proposed that simple arousal assessment be a vital sign to quantify underlying brain dysfunction.[1] The rationale is that impaired arousal is closely linked with delirium,[2] and is an integral component of multiple delirium assessments.[3, 4, 5] Chester et al. observed that the presence of impaired arousal was 64% sensitive and 93% specific for delirium diagnosed by a psychiatrist.[2] Delirium is an under‐recognized public health problem that affects up to 25% of older hospitalized patients,[6, 7] is associated with a multitude of adverse outcomes such as death and accelerated cognitive decline,[8] and costs the US healthcare system an excess of $152 billion dollars.[9]

Most delirium assessments require the patient to undergo additional cognitive testing. The assessment of arousal, however, requires the rater to merely observe the patient during routine clinical care and can be easily integrated into the clinical workflow.[10] Because of its simplicity and brevity, assessing arousal alone using validated scales such as the Richmond Agitation‐Sedation Scale (RASS) may be a more appealing alternative to traditional, more complex delirium screening in the acute care setting. Its clinical utility would be further strengthened if impaired arousal was also associated with mortality, and conferred risk even in the absence of delirium. As a result, we sought to determine if impaired arousal at initial presentation in older acutely ill patients predicted 6‐month mortality and whether this relationship was present in the absence of delirium.

METHODS

Design Overview

We performed a planned secondary analysis of 2 prospective cohorts that enrolled patients from May 2007 to August 2008 between 8 am and 10 pm during the weekdays, and July 2009 to February 2012 between 8 am and 4 pm during the weekdays. The first cohort was designed to evaluate the relationship between delirium and patient outcomes.[11, 12] The second cohort was used to validate brief delirium assessments using a psychiatrist's assessment as the reference standard.[5, 13] The local institutional review board approved these studies.

Setting and Participants

These studies were conducted in an urban emergency department located within an academic, tertiary care hospital with over 57,000 visits annually. Patients were included if they were 65 years or older and in the emergency department for <12 hours at the time of enrollment. The 12‐hour cutoff was used to include patients who presented to the emergency department in the evening and early morning hours. Patients were excluded if they were previously enrolled, non‐English speaking, comatose, or were nonverbal and unable to follow simple commands prior to the acute illness. Because the July 2009 to February 2012 cohort was designed to validate delirium assessments with auditory and visual components, patients were also excluded if they were deaf or blind.

Measurement of Arousal

RASS is an arousal scale commonly used in ICUs to assess depth of sedation and ranges from 5 (unarousable) to +4 (combative); 0 represents normal arousal.[10, 14] The RASS simply requires the rater to observe the patient during their routine interactions and does not require any additional cognitive testing. The RASS terms sedation was modified to drowsy (Table 1), because we wanted to capture impaired arousal regardless of sedation administration. We did not use the modified RASS (mRASS) proposed by the Veteran's Affairs Delirium Working Group, because it was published after data collection began.[1] The mRASS is very similar to the RASS, except it also incorporates a very informal inattention assessment. The RASS was ascertained by research assistants who were college students and graduates, and emergency medical technician basics and paramedics. The principal investigator gave them a 5‐minute didactic lecture about the RASS and observed them perform the RASS in at least 5 patients prior to the start of the study. Inter‐rater reliability between trained research assistants and a physician was assessed for 456 (42.0%) patients of the study sample. The weighted kappa of the RASS was 0.61, indicating very good inter‐rater reliability. Because the 81.7% of patients with impaired arousal had a RASS of 1, the RASS dichotomized as normal (RASS=0) or impaired (RASS other than 0).

Richmond Agitation‐Sedation Scale
ScoreTermDescription
  • NOTE: The Richmond Agitation‐Sedation Scale (RASS) is a brief (<10 seconds) arousal scale that was developed by Sessler et al.[10] The RASS is traditionally used in the intensive care unit to monitor depth of sedation. The terms were modified to better reflect the patient's level of arousal rather than sedation. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS <0 indicates decreased arousal, and a RASS >0 indicates increased arousal.

+4CombativeOvertly combative, violent, immediate danger to staff
+3Very agitatedPulls or removes tube(s) or catheter(s), aggressive
+2AgitatedFrequent nonpurposeful movement
+1RestlessAnxious but movements not aggressive or vigorous
0Alert and calm 
1Slight drowsyNot fully alert, but has sustained awakening (eye opening/eye contact) to voice (>10 seconds)
2Moderately drowsyBriefly awakens with eye contact to voice (<10 seconds)
3Very drowsyMovement or eye opening to voice (but no eye contact)
4Awakens to pain onlyNo response to voice, but movement or eye opening to physical stimulation
5UnarousableNo response to voice or physical stimulation

Death Ascertainment

Death within 6 months was ascertained using the following algorithm: (1) The electronic medical record was searched to determine the patient's death status. (2) Patients who had a documented emergency department visit, outpatient clinic visit, or hospitalization after 6 months were considered to be alive at 6 months. (3) For the remaining patients, date of death was searched in the Social Security Death Index (SSDI). (4) Patients without a death recorded in the SSDI 1 year after the index visit was considered to be alive at 6 months. Nine hundred thirty‐one (85.9%) out of 1084 patients had a recorded death in the medical record or SSDI, or had an emergency department or hospital visit documented in their record 6 months after the index visit.

Additional Variables Collected

Patients were considered to have dementia if they had: (1) documented dementia in the medical record, (2) a short form Informant Questionnaire on Cognitive Decline in the Elderly score (IQCODE) greater than 3.38,[15] or (3) prescribed cholinesterase inhibitors prior to admission. The short form IQCODE is an informant questionnaire with 16 items; a cutoff of 3.38 out of 5.00 is 79% sensitive and 82% specific for dementia.[16] Premorbid functional status was determined by the Katz Activities of Daily Living (Katz ADL) and ranges from 0 (completely dependent) to 6 (completely independent).[17] Patients with a score <5 were considered to be functionally dependent. Both the IQCODE and Katz ADL were prospectively collected in the emergency department at the time of enrollment.

The Charlson Comorbidity Index was used to measure comorbid burden.[18] The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used to quantify severity of illness.[19] The Glasgow Coma Scale was not included in the APS because it was not collected. Intravenous, intramuscular, and oral benzodiazepine and opioids given in the prehospital and emergency department were also recorded. The Charlson Comorbidity Index, APS, and benzodiazepine and opioid administration were collected after patient enrollment using the electronic medical record.

Within 3 hours of the RASS, a subset of 406 patients was evaluated by a consultation‐liaison psychiatrist who determined the patient's delirium status using Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM‐IV‐TR) criteria.[20] Details of their comprehensive assessments have been described in a previous report.[5]

Statistical Analysis

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. For simple comparisons, Wilcoxon rank sum tests were performed for continuous data, and 2 analyses or Fisher exact test were performed for categorical data. To evaluate the predictive validity of impaired arousal on 6‐month mortality, the cumulative probability of survival was estimated within 6 months from the study enrollment date using the Kaplan‐Meier method. Cox proportional hazards regression was performed to assess if impaired arousal was independently associated with 6‐month mortality after adjusting for age, gender, nonwhite race, comorbidity burden (Charlson Comorbidity Index), severity of illness (APS), dementia, functional dependence (Katz ADL <5), nursing home residence, admission status, and benzodiazepine or opioid medication administration. Patients were censored at the end of 6 months. The selection of covariates was based upon expert opinion and literature review. The number of covariates used for the model was limited by the number of events to minimize overfitting; 1 df was allowed for every 10 to 15 events.[21] Because severity of illness, psychoactive medication administration, and admission status might modify the relationship between 6‐month mortality and impaired arousal, 2‐way interaction terms were incorporated. To maintain parsimony and minimize overfitting and collinearity, nonsignificant interaction terms (P>0.20) were removed in the final model.[22] Hazard ratios (HR) with their 95% confidence interval (95% CI) were reported.

To determine if arousal was associated with 6‐month mortality in the absence of delirium, we performed another Cox proportional hazard regression in a subset of 406 patients who received a psychiatrist assessment. Six‐month mortality was the dependent variable, and the independent variable was a 3‐level categorical variable of different arousal/delirium combinations: (1) impaired arousal/delirium positive, (2) impaired arousal/delirium negative, and (3) normal arousal (with or without delirium). Because there were only 8 patients who had normal arousal with delirium, this group was collapsed into the normal arousal without delirium group. Because there were 55 deaths, the number of covariates that could be entered into the Cox proportional hazard regression model was limited. We used the inverse weighted propensity score method to help minimize residual confounding.[23] Traditional propensity score adjustment could not be performed because there were 3 arousal/delirium categories. Similar to propensity score adjustment, inverse weighted propensity score method was used to help balance the distribution of patient characteristics among the exposure groups and also allow adjustment for multiple confounders while minimizing the degrees of freedom expended. A propensity score was the probability of having a particular arousal/delirium category based upon baseline patient characteristics. Multinomial logistic regression was performed to calculate the propensity score, and the baseline covariates used were age, gender, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, and nursing home residence. For the Cox proportional hazard regression model, each observation was weighted by the inverse of the propensity score for their given arousal/delirium category; propensity scores exceeding the 95th percentile were trimmed to avoid overly influential weighting. Benzodiazepine and opioid medications given in the emergency department and admission status were adjusted as covariates in the weighted Cox proportional hazard regression model.

Nineteen patients (1.8%) had missing Katz ADL; these missing values were imputed using multiple imputation. The reliability of the final regression models were internally validated using the bootstrap method.[21] Two thousand sets of bootstrap samples were generated by resampling the original data, and the optimism was estimated to determine the degree of overfitting.[21] An optimism value >0.85 indicated no evidence of substantial overfitting.[21] Variance inflation factors were used to check multicollinearity. Schoenfeld residuals were also analyzed to determine goodness‐of‐fit and assess for outliers. P values <0.05 were considered statistically significant. All statistical analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC) and open source R statistical software version 3.0.1 (http://www.r‐project.org/).

RESULTS

A total of 1903 patients were screened, and 1084 patients met enrollment criteria (Figure 1). Of these, 1051 (97.0%) were non‐ICU patients. Patient characteristics of this cohort can be seen in Table 2. Enrolled patients and potentially eligible patients who presented to the emergency department during the enrollment window were similar in age, gender, and severity of illness, but enrolled patients were slightly more likely to have a chief complaint of chest pain and syncope (unpublished data).

Figure 1
Enrollment flow diagram. RASS, Richmond Agitation‐Sedation Scale. Patients who were non‐verbal or unable to follow simple commands prior to their acute illness were considered to have end‐stage dementia.
Patient Characteristics
VariablesNormal Arousal, n=835Impaired Arousal, n=249P Value
  • NOTE: Patient characteristics and demographics of enrolled patients. Continuous and ordinal variables are expressed in medians and interquartile (IQR) ranges. Categorical variables are expressed in absolute numbers and percentages. *Patient was considered to have dementia if it was documented in the medical record, the patient was on home cholineresterase inhibitors, or had a short‐form Informant Questionnaire on Cognitive Decline in the Elderly >3.38. Patients with a Katz Activities of Daily Living of <5 were considered to be functionally dependent. There were 19 patients with missing Katz Activities of Daily Living scores. Charlson index is a weighted scale used to measure comorbidity burden. Higher scores indicate higher comorbidity burden. The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used quantify severity of illness. Glasgow Coma Scale was not incorporated in this score. Higher scores indicate higher severity of illness.

Median age, y (IQR)74 (6980)75 (7083)0.005
Female gender459 (55.0%)132 (53.0%)0.586
Nonwhite race122 (14.6%)51 (20.5%)0.027
Residence  <0.001
Home752 (90.1%)204 (81.9%) 
Assisted living29 (3.5%)13 (5.2%) 
Rehabilitation8 (1.0%)5 (2.0%) 
Nursing home42 (5.0%)27 (10.8%) 
Dementia*175 (21.0%)119 (47.8%)<0.001
Dependent120 (14.4%)99 (39.8%)<0.001
Median Charlson (IQR)2 (1, 4)3 (2, 5)<0.001
Median APS (IQR)2 (1, 4)2 (1, 5)<0.001
Primary complaint  <0.001
Abdominal pain45 (5.4%)13 (5.2%) 
Altered mental status12 (1.4%)36 (14.5%) 
Chest pain128 (15.3%)31 (12.5%) 
Disturbances of sensation17 (2.0%)2 (0.8%) 
Dizziness16 (1.9%)2 (0.8%) 
Fever11 (1.3%)7 (2.8%) 
General illness, malaise26 (3.1%)5 (2.0%) 
General weakness68 (8.1%)29 (11.7%) 
Nausea/vomiting29 (3.5%)4 (1.6%) 
Shortness of breath85 (10.2%)21 (8.4%) 
Syncope46 (5.5%)10 (4.0%) 
Trauma, multiple organs19 (2.3%)8 (3.2%) 
Other333 (39.9%)81 (32.5%) 
Benzodiazepines or opioid medications administration188 (22.5%)67 (26.9%)0.152
Admitted to the hospital478 (57.3%)191 (76.7%)0.002
Internal medicine411 (86.0%)153 (80.1%) 
Surgery38 (8.0%)21 (11.0%) 
Neurology19 (4.0%)13 (6.8%) 
Psychiatry1 (0.2%)2 (1.1%) 
Unknown/missing9 (1.9%)2 (1.1%) 
Death within 6 months81 (9.7%)59 (23.7%)<0.001

Of those enrolled, 249 (23.0%) had an abnormal RASS at initial presentation, and their distribution can be seen in Figure 2. Within 6 months, patients with an abnormal RASS were more likely to die compared with patients with a RASS of 0 (23.7% vs 9.7%, P<0.001). The Kaplan‐Meier survival curves for all enrolled patients with impaired and normal RASS can be seen in Figure 3; the survival curve declined more slowly in patients with a normal RASS compared with those with an abnormal RASS.

Figure 2
Richmond Agitation‐Sedation Scale (RASS) distribution among enrolled patients. Distribution of RASS at initial presentation among 1084 acutely ill older patients, and of these, 1051 patients (97.0%) were non–intensive care unit patients. The RASS is a widely used arousal scale that can be performed during routine clinical care and takes <10 seconds to perform. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS of <0 indicates decreased arousal and a RASS of >0 indicates increased arousal.
Figure 3
Kaplan‐Meier survival curves in acutely ill older patients with a normal and impaired arousal at initial presentation over a 6‐month period. Arousal was assessed for using the Richmond Agitation‐Sedation Scale (RASS). Patients with impaired arousal were more likely to die compared to patients with normal arousal (23.7% vs 9.7%) within 6 months. Using Cox proportional hazard regression, patients with an abnormal RASS were 73% more likely to die within 6 months after adjusting for age, dementia, functional dependence, comorbidity burden, severity of illness, hearing impairment, nursing home residence, admission status, and administration of benzodiazepines/opioids medications. Severity of illness (P = 0.52), benzodiazepine/opioid medication administration (P = 0.38), and admission status (P = 0.57) did not modify the relationship between impaired arousal and 6‐month mortality. Abbreviations: CI, confidence interval.

Using Cox proportional hazards regression, the relationship between an abnormal RASS at initial presentation and 6‐month mortality persisted (HR: 1.73, 95% CI: 1.21‐2.49) after adjusting for age, sex, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, nursing home residence, psychoactive medications given, and admission status. The interaction between an abnormal RASS and APS (severity of illness) had a P value of 0.52. The interaction between an abnormal RASS and benzodiazepine or opioid medication administration had a P value of 0.38. The interaction between an abnormal RASS and admission status had a P value of 0.57. This indicated that severity of illness, psychoactive medication administration, and admission status did not modify the relationship between an abnormal RASS and 6‐month mortality.

We analyzed a subset of 406 patients who received a psychiatrist's assessment to determine if an abnormal RASS was associated with 6‐month mortality regardless of delirium status using Cox proportional hazard regression weighted by the inverse of the propensity score. Patients with an abnormal RASS and no delirium were significantly associated with higher mortality compared to those with a normal RASS (HR: 2.20, 95% CI: 1.10‐4.41). Patients with an abnormal RASS with delirium also had an increased risk for 6‐month mortality (HR: 2.86, 95% CI: 1.29‐6.34).

All regression models were internally validated. There was no evidence of substantial overfitting or collinearity. The Schoenfeld residuals for each model were examined graphically and there was good model fit overall, and no significant outliers were observed.

DISCUSSION

Vital sign measurements are a fundamental component of patient care, and abnormalities can serve as an early warning signal of the patient's clinical deterioration. However, traditional vital signs do not include an assessment of the patient's brain function. Our chief finding is that impaired arousal at initial presentation, as determined by the nonphysician research staff, increased the risk of 6‐month mortality by 73% after adjusting for confounders in a diverse group of acutely ill older patients. This relationship existed regardless of severity of illness, administration of psychoactive medications, and admission status. Though impaired arousal is closely linked with delirium,[2, 24] which is another well‐known predictor of mortality,[11, 25, 26] the prognostic significance of impaired arousal appeared to extend beyond delirium. We observed that the relationship between 6‐month mortality and impaired arousal in the absence of delirium was remarkably similar to that observed with impaired arousal with delirium. Arousal can be assessed for by simply observing the patient during routine clinical care and can be performed by nonphysician and physician healthcare providers. Its assessment should be performed and communicated in conjunction with traditional vital sign measurements in the emergency department and inpatient settings.[1]

Most of the data linking impaired arousal to death have been collected in the ICU. Coma, which represents the most severe form of depressed arousal, has been shown to increase the likelihood of death regardless of underlying etiology.[27, 28, 29, 30, 31] This includes patients who have impaired arousal because they received sedative medications during mechanical ventilation.[32] Few studies have investigated the effect of impaired arousal in a non‐ICU patient population. Zuliani et al. observed that impaired arousal was associated with 30‐day mortality, but their study was conducted in 469 older stroke patients, limiting the study's external validity to a more general patient population.[33] Our data advance the current stage of knowledge; we observed a similar relationship between impaired arousal and 6‐month mortality in a much broader clinical population who were predominantly not critically ill regardless of delirium status. Additionally, most of our impaired arousal cohort had a RASS of 1, indicating that even subtle abnormalities portended adverse outcomes.

In addition to long‐term prognosis, the presence of impaired arousal has immediate clinical implications. Using arousal scales like the RASS can serve as a way for healthcare providers to succinctly communicate the patient's mental status in a standardized manner during transitions of care (eg, emergency physician to inpatient team). Regardless of which clinical setting they are in, older acutely ill patients with an impaired arousal may also require close monitoring, especially if the impairment is acute. Because of its close relationship with delirium, these patients likely have an underlying acute medical illness that precipitated their impaired arousal.

Understanding the true clinical significance of impaired arousal in the absence of delirium requires further study. Because of the fluctuating nature of delirium, it is possible that these patients may have initially been delirious and then became nondelirious during the psychiatrist's evaluation. Conversely, it is also possible that these patients may have eventually transitioned into delirium at later point in time; the presence of impaired arousal alone may be a precursor to delirium. Last, these patients may have had subsyndromal delirium, which is defined as having 1 or more delirium symptoms without ever meeting full DSM‐IV‐TR criteria for delirium.[34] Patients with subsyndromal delirium have poorer outcomes, such as prolonged hospitalizations, and higher mortality than patients without delirium symptoms.[34]

Additional studies are also needed to further clarify the impact of impaired arousal on nonmortality outcomes such as functional and cognitive decline. The prognostic significance of serial arousal measurements also requires further study. It is possible that patients whose impaired arousal rapidly resolves after an intervention may have better prognoses than those who have persistent impairment. The measurement of arousal may have additional clinical applications in disease prognosis models. The presence of altered mental status is incorporated in various disease‐specific risk scores such as the CURB‐65 or Pneumonia Severity Index for pneumonia,[35, 36] and the Pulmonary Embolism Severity Index for pulmonary embolism.[37] However, the definition of altered mental status is highly variable; it ranges from subjective impressions that can be unreliable to formal cognitive testing, which can be time consuming. Arousal scales such as the RASS may allow for more feasible, reliable, and standardized assessment of mental status. Future studies should investigate if incorporating the RASS would improve the discrimination of these disease‐severity indices.

This study has several notable limitations. We excluded patients with a RASS of 4 and 5, which represented comatose patients. This exclusion, however, likely biased our findings toward the null. We enrolled a convenience sample that may have introduced selection bias. However, our enrolled cohort was similar to all potentially eligible patients who presented to the emergency department during the study period. We also attempted to mitigate this selection bias by using multivariable regression and adjusting for factors that may have confounded the relationship between RASS and 6‐month mortality. This study was performed at a single, urban, academic hospital and enrolled patients who were aged 65 years and older. Our findings may not be generalizable to other settings and to those who are under 65 years of age. Because 406 patients received a psychiatric evaluation, this limited the number of covariates that could be incorporated into the multivariable model to evaluate if impaired arousal in the absence of delirium is associated with 6‐month mortality. To minimize residual confounding, we used the inverse weighted propensity score, but we acknowledge that this bias may still exist. Larger studies are needed to clarify the relationships between arousal, delirium, and mortality.

CONCLUSION

In conclusion, impaired arousal at initial presentation is an independent predictor for 6‐month mortality in a diverse group of acutely ill older patients, and this risk appears to be present even in the absence of delirium. Because of its ease of use and prognostic significance, it may be a useful vital sign for underlying brain dysfunction. Routine standardized assessment and communication of arousal during routine clinical care may be warranted.

Disclosures: Research reported in this publication was supported by the Vanderbilt Physician Scientist Development Award, Emergency Medicine Foundation, and National Institute on Aging of the National Institutes of Health under award number K23AG032355. This study was also supported by the National Center for Research Resources, grant UL1 RR024975‐01, and is now at the National Center for Advancing Translational Sciences, grant 2 UL1 TR000445‐06. Dr. Vasilevskis was supported in part by the National Institute on Aging of the National Institutes of Health under award number K23AG040157. Dr. Powers was supported by Health Resources and Services Administration Geriatric Education Centers, grant 1D31HP08823‐01‐00. Dr. Storrow was supported by the National Heart, Lung, and Blood Institute of the National Institutes of Health under award number K12HL1090 and the National Center for Advancing Translational Sciences under award number UL1TR000445. Dr. Ely was supported in part by the National Institute on Aging of the National Institutes of Health under award numbers R01AG027472 and R01AG035117, and a Veteran Affairs MERIT award. Drs. Vasilevskis, Schnelle, Dittus, Powers, and Ely were supported by the Veteran Affairs Geriatric Research, Education, and Clinical Center. The content is solely the responsibility of the authors and does not necessarily represent the official views of Vanderbilt University, Emergency Medicine Foundation, National Institutes of Health, and Veterans Affairs. The funding agencies did not have any role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.

J.H.H., E.W.E., J.F.S., A.B.S., and R.D.S. conceived the trial. J.H.H., E.W.E., A.B.S., J.F.S., R.D.S., A.S., and A.W. participated in the study design. J.H.H. and A.W. recruited patients and collected the data. J.H.H., A.J.G., and A.S. analyzed the data. All authors participated in the interpretation of results. J.H.H. drafted the manuscript, and all authors contributed to the critical review and revision of the manuscript.

The authors report no conflicts of interest.

Arousal is defined as the patient's overall level of responsiveness to the environment. Its assessment is standard of care in most intensive care units (ICUs) to monitor depth of sedation and underlying brain dysfunction. There has been recent interest in expanding the role of arousal assessment beyond the ICU. Specifically, the Veterans Affairs Delirium Working Group proposed that simple arousal assessment be a vital sign to quantify underlying brain dysfunction.[1] The rationale is that impaired arousal is closely linked with delirium,[2] and is an integral component of multiple delirium assessments.[3, 4, 5] Chester et al. observed that the presence of impaired arousal was 64% sensitive and 93% specific for delirium diagnosed by a psychiatrist.[2] Delirium is an under‐recognized public health problem that affects up to 25% of older hospitalized patients,[6, 7] is associated with a multitude of adverse outcomes such as death and accelerated cognitive decline,[8] and costs the US healthcare system an excess of $152 billion dollars.[9]

Most delirium assessments require the patient to undergo additional cognitive testing. The assessment of arousal, however, requires the rater to merely observe the patient during routine clinical care and can be easily integrated into the clinical workflow.[10] Because of its simplicity and brevity, assessing arousal alone using validated scales such as the Richmond Agitation‐Sedation Scale (RASS) may be a more appealing alternative to traditional, more complex delirium screening in the acute care setting. Its clinical utility would be further strengthened if impaired arousal was also associated with mortality, and conferred risk even in the absence of delirium. As a result, we sought to determine if impaired arousal at initial presentation in older acutely ill patients predicted 6‐month mortality and whether this relationship was present in the absence of delirium.

METHODS

Design Overview

We performed a planned secondary analysis of 2 prospective cohorts that enrolled patients from May 2007 to August 2008 between 8 am and 10 pm during the weekdays, and July 2009 to February 2012 between 8 am and 4 pm during the weekdays. The first cohort was designed to evaluate the relationship between delirium and patient outcomes.[11, 12] The second cohort was used to validate brief delirium assessments using a psychiatrist's assessment as the reference standard.[5, 13] The local institutional review board approved these studies.

Setting and Participants

These studies were conducted in an urban emergency department located within an academic, tertiary care hospital with over 57,000 visits annually. Patients were included if they were 65 years or older and in the emergency department for <12 hours at the time of enrollment. The 12‐hour cutoff was used to include patients who presented to the emergency department in the evening and early morning hours. Patients were excluded if they were previously enrolled, non‐English speaking, comatose, or were nonverbal and unable to follow simple commands prior to the acute illness. Because the July 2009 to February 2012 cohort was designed to validate delirium assessments with auditory and visual components, patients were also excluded if they were deaf or blind.

Measurement of Arousal

RASS is an arousal scale commonly used in ICUs to assess depth of sedation and ranges from 5 (unarousable) to +4 (combative); 0 represents normal arousal.[10, 14] The RASS simply requires the rater to observe the patient during their routine interactions and does not require any additional cognitive testing. The RASS terms sedation was modified to drowsy (Table 1), because we wanted to capture impaired arousal regardless of sedation administration. We did not use the modified RASS (mRASS) proposed by the Veteran's Affairs Delirium Working Group, because it was published after data collection began.[1] The mRASS is very similar to the RASS, except it also incorporates a very informal inattention assessment. The RASS was ascertained by research assistants who were college students and graduates, and emergency medical technician basics and paramedics. The principal investigator gave them a 5‐minute didactic lecture about the RASS and observed them perform the RASS in at least 5 patients prior to the start of the study. Inter‐rater reliability between trained research assistants and a physician was assessed for 456 (42.0%) patients of the study sample. The weighted kappa of the RASS was 0.61, indicating very good inter‐rater reliability. Because the 81.7% of patients with impaired arousal had a RASS of 1, the RASS dichotomized as normal (RASS=0) or impaired (RASS other than 0).

Richmond Agitation‐Sedation Scale
ScoreTermDescription
  • NOTE: The Richmond Agitation‐Sedation Scale (RASS) is a brief (<10 seconds) arousal scale that was developed by Sessler et al.[10] The RASS is traditionally used in the intensive care unit to monitor depth of sedation. The terms were modified to better reflect the patient's level of arousal rather than sedation. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS <0 indicates decreased arousal, and a RASS >0 indicates increased arousal.

+4CombativeOvertly combative, violent, immediate danger to staff
+3Very agitatedPulls or removes tube(s) or catheter(s), aggressive
+2AgitatedFrequent nonpurposeful movement
+1RestlessAnxious but movements not aggressive or vigorous
0Alert and calm 
1Slight drowsyNot fully alert, but has sustained awakening (eye opening/eye contact) to voice (>10 seconds)
2Moderately drowsyBriefly awakens with eye contact to voice (<10 seconds)
3Very drowsyMovement or eye opening to voice (but no eye contact)
4Awakens to pain onlyNo response to voice, but movement or eye opening to physical stimulation
5UnarousableNo response to voice or physical stimulation

Death Ascertainment

Death within 6 months was ascertained using the following algorithm: (1) The electronic medical record was searched to determine the patient's death status. (2) Patients who had a documented emergency department visit, outpatient clinic visit, or hospitalization after 6 months were considered to be alive at 6 months. (3) For the remaining patients, date of death was searched in the Social Security Death Index (SSDI). (4) Patients without a death recorded in the SSDI 1 year after the index visit was considered to be alive at 6 months. Nine hundred thirty‐one (85.9%) out of 1084 patients had a recorded death in the medical record or SSDI, or had an emergency department or hospital visit documented in their record 6 months after the index visit.

Additional Variables Collected

Patients were considered to have dementia if they had: (1) documented dementia in the medical record, (2) a short form Informant Questionnaire on Cognitive Decline in the Elderly score (IQCODE) greater than 3.38,[15] or (3) prescribed cholinesterase inhibitors prior to admission. The short form IQCODE is an informant questionnaire with 16 items; a cutoff of 3.38 out of 5.00 is 79% sensitive and 82% specific for dementia.[16] Premorbid functional status was determined by the Katz Activities of Daily Living (Katz ADL) and ranges from 0 (completely dependent) to 6 (completely independent).[17] Patients with a score <5 were considered to be functionally dependent. Both the IQCODE and Katz ADL were prospectively collected in the emergency department at the time of enrollment.

The Charlson Comorbidity Index was used to measure comorbid burden.[18] The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used to quantify severity of illness.[19] The Glasgow Coma Scale was not included in the APS because it was not collected. Intravenous, intramuscular, and oral benzodiazepine and opioids given in the prehospital and emergency department were also recorded. The Charlson Comorbidity Index, APS, and benzodiazepine and opioid administration were collected after patient enrollment using the electronic medical record.

Within 3 hours of the RASS, a subset of 406 patients was evaluated by a consultation‐liaison psychiatrist who determined the patient's delirium status using Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision (DSM‐IV‐TR) criteria.[20] Details of their comprehensive assessments have been described in a previous report.[5]

Statistical Analysis

Measures of central tendency and dispersion for continuous variables were reported as medians and interquartile ranges. Categorical variables were reported as proportions. For simple comparisons, Wilcoxon rank sum tests were performed for continuous data, and 2 analyses or Fisher exact test were performed for categorical data. To evaluate the predictive validity of impaired arousal on 6‐month mortality, the cumulative probability of survival was estimated within 6 months from the study enrollment date using the Kaplan‐Meier method. Cox proportional hazards regression was performed to assess if impaired arousal was independently associated with 6‐month mortality after adjusting for age, gender, nonwhite race, comorbidity burden (Charlson Comorbidity Index), severity of illness (APS), dementia, functional dependence (Katz ADL <5), nursing home residence, admission status, and benzodiazepine or opioid medication administration. Patients were censored at the end of 6 months. The selection of covariates was based upon expert opinion and literature review. The number of covariates used for the model was limited by the number of events to minimize overfitting; 1 df was allowed for every 10 to 15 events.[21] Because severity of illness, psychoactive medication administration, and admission status might modify the relationship between 6‐month mortality and impaired arousal, 2‐way interaction terms were incorporated. To maintain parsimony and minimize overfitting and collinearity, nonsignificant interaction terms (P>0.20) were removed in the final model.[22] Hazard ratios (HR) with their 95% confidence interval (95% CI) were reported.

To determine if arousal was associated with 6‐month mortality in the absence of delirium, we performed another Cox proportional hazard regression in a subset of 406 patients who received a psychiatrist assessment. Six‐month mortality was the dependent variable, and the independent variable was a 3‐level categorical variable of different arousal/delirium combinations: (1) impaired arousal/delirium positive, (2) impaired arousal/delirium negative, and (3) normal arousal (with or without delirium). Because there were only 8 patients who had normal arousal with delirium, this group was collapsed into the normal arousal without delirium group. Because there were 55 deaths, the number of covariates that could be entered into the Cox proportional hazard regression model was limited. We used the inverse weighted propensity score method to help minimize residual confounding.[23] Traditional propensity score adjustment could not be performed because there were 3 arousal/delirium categories. Similar to propensity score adjustment, inverse weighted propensity score method was used to help balance the distribution of patient characteristics among the exposure groups and also allow adjustment for multiple confounders while minimizing the degrees of freedom expended. A propensity score was the probability of having a particular arousal/delirium category based upon baseline patient characteristics. Multinomial logistic regression was performed to calculate the propensity score, and the baseline covariates used were age, gender, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, and nursing home residence. For the Cox proportional hazard regression model, each observation was weighted by the inverse of the propensity score for their given arousal/delirium category; propensity scores exceeding the 95th percentile were trimmed to avoid overly influential weighting. Benzodiazepine and opioid medications given in the emergency department and admission status were adjusted as covariates in the weighted Cox proportional hazard regression model.

Nineteen patients (1.8%) had missing Katz ADL; these missing values were imputed using multiple imputation. The reliability of the final regression models were internally validated using the bootstrap method.[21] Two thousand sets of bootstrap samples were generated by resampling the original data, and the optimism was estimated to determine the degree of overfitting.[21] An optimism value >0.85 indicated no evidence of substantial overfitting.[21] Variance inflation factors were used to check multicollinearity. Schoenfeld residuals were also analyzed to determine goodness‐of‐fit and assess for outliers. P values <0.05 were considered statistically significant. All statistical analyses were performed using SAS version 9.4 (SAS Institute, Cary, NC) and open source R statistical software version 3.0.1 (http://www.r‐project.org/).

RESULTS

A total of 1903 patients were screened, and 1084 patients met enrollment criteria (Figure 1). Of these, 1051 (97.0%) were non‐ICU patients. Patient characteristics of this cohort can be seen in Table 2. Enrolled patients and potentially eligible patients who presented to the emergency department during the enrollment window were similar in age, gender, and severity of illness, but enrolled patients were slightly more likely to have a chief complaint of chest pain and syncope (unpublished data).

Figure 1
Enrollment flow diagram. RASS, Richmond Agitation‐Sedation Scale. Patients who were non‐verbal or unable to follow simple commands prior to their acute illness were considered to have end‐stage dementia.
Patient Characteristics
VariablesNormal Arousal, n=835Impaired Arousal, n=249P Value
  • NOTE: Patient characteristics and demographics of enrolled patients. Continuous and ordinal variables are expressed in medians and interquartile (IQR) ranges. Categorical variables are expressed in absolute numbers and percentages. *Patient was considered to have dementia if it was documented in the medical record, the patient was on home cholineresterase inhibitors, or had a short‐form Informant Questionnaire on Cognitive Decline in the Elderly >3.38. Patients with a Katz Activities of Daily Living of <5 were considered to be functionally dependent. There were 19 patients with missing Katz Activities of Daily Living scores. Charlson index is a weighted scale used to measure comorbidity burden. Higher scores indicate higher comorbidity burden. The Acute Physiology Score (APS) of the Acute Physiology and Chronic Health Evaluation II score was used quantify severity of illness. Glasgow Coma Scale was not incorporated in this score. Higher scores indicate higher severity of illness.

Median age, y (IQR)74 (6980)75 (7083)0.005
Female gender459 (55.0%)132 (53.0%)0.586
Nonwhite race122 (14.6%)51 (20.5%)0.027
Residence  <0.001
Home752 (90.1%)204 (81.9%) 
Assisted living29 (3.5%)13 (5.2%) 
Rehabilitation8 (1.0%)5 (2.0%) 
Nursing home42 (5.0%)27 (10.8%) 
Dementia*175 (21.0%)119 (47.8%)<0.001
Dependent120 (14.4%)99 (39.8%)<0.001
Median Charlson (IQR)2 (1, 4)3 (2, 5)<0.001
Median APS (IQR)2 (1, 4)2 (1, 5)<0.001
Primary complaint  <0.001
Abdominal pain45 (5.4%)13 (5.2%) 
Altered mental status12 (1.4%)36 (14.5%) 
Chest pain128 (15.3%)31 (12.5%) 
Disturbances of sensation17 (2.0%)2 (0.8%) 
Dizziness16 (1.9%)2 (0.8%) 
Fever11 (1.3%)7 (2.8%) 
General illness, malaise26 (3.1%)5 (2.0%) 
General weakness68 (8.1%)29 (11.7%) 
Nausea/vomiting29 (3.5%)4 (1.6%) 
Shortness of breath85 (10.2%)21 (8.4%) 
Syncope46 (5.5%)10 (4.0%) 
Trauma, multiple organs19 (2.3%)8 (3.2%) 
Other333 (39.9%)81 (32.5%) 
Benzodiazepines or opioid medications administration188 (22.5%)67 (26.9%)0.152
Admitted to the hospital478 (57.3%)191 (76.7%)0.002
Internal medicine411 (86.0%)153 (80.1%) 
Surgery38 (8.0%)21 (11.0%) 
Neurology19 (4.0%)13 (6.8%) 
Psychiatry1 (0.2%)2 (1.1%) 
Unknown/missing9 (1.9%)2 (1.1%) 
Death within 6 months81 (9.7%)59 (23.7%)<0.001

Of those enrolled, 249 (23.0%) had an abnormal RASS at initial presentation, and their distribution can be seen in Figure 2. Within 6 months, patients with an abnormal RASS were more likely to die compared with patients with a RASS of 0 (23.7% vs 9.7%, P<0.001). The Kaplan‐Meier survival curves for all enrolled patients with impaired and normal RASS can be seen in Figure 3; the survival curve declined more slowly in patients with a normal RASS compared with those with an abnormal RASS.

Figure 2
Richmond Agitation‐Sedation Scale (RASS) distribution among enrolled patients. Distribution of RASS at initial presentation among 1084 acutely ill older patients, and of these, 1051 patients (97.0%) were non–intensive care unit patients. The RASS is a widely used arousal scale that can be performed during routine clinical care and takes <10 seconds to perform. A RASS of 0 indicates normal level of arousal (awake and alert), whereas a RASS of <0 indicates decreased arousal and a RASS of >0 indicates increased arousal.
Figure 3
Kaplan‐Meier survival curves in acutely ill older patients with a normal and impaired arousal at initial presentation over a 6‐month period. Arousal was assessed for using the Richmond Agitation‐Sedation Scale (RASS). Patients with impaired arousal were more likely to die compared to patients with normal arousal (23.7% vs 9.7%) within 6 months. Using Cox proportional hazard regression, patients with an abnormal RASS were 73% more likely to die within 6 months after adjusting for age, dementia, functional dependence, comorbidity burden, severity of illness, hearing impairment, nursing home residence, admission status, and administration of benzodiazepines/opioids medications. Severity of illness (P = 0.52), benzodiazepine/opioid medication administration (P = 0.38), and admission status (P = 0.57) did not modify the relationship between impaired arousal and 6‐month mortality. Abbreviations: CI, confidence interval.

Using Cox proportional hazards regression, the relationship between an abnormal RASS at initial presentation and 6‐month mortality persisted (HR: 1.73, 95% CI: 1.21‐2.49) after adjusting for age, sex, nonwhite race, comorbidity burden, severity of illness, dementia, functional dependence, nursing home residence, psychoactive medications given, and admission status. The interaction between an abnormal RASS and APS (severity of illness) had a P value of 0.52. The interaction between an abnormal RASS and benzodiazepine or opioid medication administration had a P value of 0.38. The interaction between an abnormal RASS and admission status had a P value of 0.57. This indicated that severity of illness, psychoactive medication administration, and admission status did not modify the relationship between an abnormal RASS and 6‐month mortality.

We analyzed a subset of 406 patients who received a psychiatrist's assessment to determine if an abnormal RASS was associated with 6‐month mortality regardless of delirium status using Cox proportional hazard regression weighted by the inverse of the propensity score. Patients with an abnormal RASS and no delirium were significantly associated with higher mortality compared to those with a normal RASS (HR: 2.20, 95% CI: 1.10‐4.41). Patients with an abnormal RASS with delirium also had an increased risk for 6‐month mortality (HR: 2.86, 95% CI: 1.29‐6.34).

All regression models were internally validated. There was no evidence of substantial overfitting or collinearity. The Schoenfeld residuals for each model were examined graphically and there was good model fit overall, and no significant outliers were observed.

DISCUSSION

Vital sign measurements are a fundamental component of patient care, and abnormalities can serve as an early warning signal of the patient's clinical deterioration. However, traditional vital signs do not include an assessment of the patient's brain function. Our chief finding is that impaired arousal at initial presentation, as determined by the nonphysician research staff, increased the risk of 6‐month mortality by 73% after adjusting for confounders in a diverse group of acutely ill older patients. This relationship existed regardless of severity of illness, administration of psychoactive medications, and admission status. Though impaired arousal is closely linked with delirium,[2, 24] which is another well‐known predictor of mortality,[11, 25, 26] the prognostic significance of impaired arousal appeared to extend beyond delirium. We observed that the relationship between 6‐month mortality and impaired arousal in the absence of delirium was remarkably similar to that observed with impaired arousal with delirium. Arousal can be assessed for by simply observing the patient during routine clinical care and can be performed by nonphysician and physician healthcare providers. Its assessment should be performed and communicated in conjunction with traditional vital sign measurements in the emergency department and inpatient settings.[1]

Most of the data linking impaired arousal to death have been collected in the ICU. Coma, which represents the most severe form of depressed arousal, has been shown to increase the likelihood of death regardless of underlying etiology.[27, 28, 29, 30, 31] This includes patients who have impaired arousal because they received sedative medications during mechanical ventilation.[32] Few studies have investigated the effect of impaired arousal in a non‐ICU patient population. Zuliani et al. observed that impaired arousal was associated with 30‐day mortality, but their study was conducted in 469 older stroke patients, limiting the study's external validity to a more general patient population.[33] Our data advance the current stage of knowledge; we observed a similar relationship between impaired arousal and 6‐month mortality in a much broader clinical population who were predominantly not critically ill regardless of delirium status. Additionally, most of our impaired arousal cohort had a RASS of 1, indicating that even subtle abnormalities portended adverse outcomes.

In addition to long‐term prognosis, the presence of impaired arousal has immediate clinical implications. Using arousal scales like the RASS can serve as a way for healthcare providers to succinctly communicate the patient's mental status in a standardized manner during transitions of care (eg, emergency physician to inpatient team). Regardless of which clinical setting they are in, older acutely ill patients with an impaired arousal may also require close monitoring, especially if the impairment is acute. Because of its close relationship with delirium, these patients likely have an underlying acute medical illness that precipitated their impaired arousal.

Understanding the true clinical significance of impaired arousal in the absence of delirium requires further study. Because of the fluctuating nature of delirium, it is possible that these patients may have initially been delirious and then became nondelirious during the psychiatrist's evaluation. Conversely, it is also possible that these patients may have eventually transitioned into delirium at later point in time; the presence of impaired arousal alone may be a precursor to delirium. Last, these patients may have had subsyndromal delirium, which is defined as having 1 or more delirium symptoms without ever meeting full DSM‐IV‐TR criteria for delirium.[34] Patients with subsyndromal delirium have poorer outcomes, such as prolonged hospitalizations, and higher mortality than patients without delirium symptoms.[34]

Additional studies are also needed to further clarify the impact of impaired arousal on nonmortality outcomes such as functional and cognitive decline. The prognostic significance of serial arousal measurements also requires further study. It is possible that patients whose impaired arousal rapidly resolves after an intervention may have better prognoses than those who have persistent impairment. The measurement of arousal may have additional clinical applications in disease prognosis models. The presence of altered mental status is incorporated in various disease‐specific risk scores such as the CURB‐65 or Pneumonia Severity Index for pneumonia,[35, 36] and the Pulmonary Embolism Severity Index for pulmonary embolism.[37] However, the definition of altered mental status is highly variable; it ranges from subjective impressions that can be unreliable to formal cognitive testing, which can be time consuming. Arousal scales such as the RASS may allow for more feasible, reliable, and standardized assessment of mental status. Future studies should investigate if incorporating the RASS would improve the discrimination of these disease‐severity indices.

This study has several notable limitations. We excluded patients with a RASS of 4 and 5, which represented comatose patients. This exclusion, however, likely biased our findings toward the null. We enrolled a convenience sample that may have introduced selection bias. However, our enrolled cohort was similar to all potentially eligible patients who presented to the emergency department during the study period. We also attempted to mitigate this selection bias by using multivariable regression and adjusting for factors that may have confounded the relationship between RASS and 6‐month mortality. This study was performed at a single, urban, academic hospital and enrolled patients who were aged 65 years and older. Our findings may not be generalizable to other settings and to those who are under 65 years of age. Because 406 patients received a psychiatric evaluation, this limited the number of covariates that could be incorporated into the multivariable model to evaluate if impaired arousal in the absence of delirium is associated with 6‐month mortality. To minimize residual confounding, we used the inverse weighted propensity score, but we acknowledge that this bias may still exist. Larger studies are needed to clarify the relationships between arousal, delirium, and mortality.

CONCLUSION

In conclusion, impaired arousal at initial presentation is an independent predictor for 6‐month mortality in a diverse group of acutely ill older patients, and this risk appears to be present even in the absence of delirium. Because of its ease of use and prognostic significance, it may be a useful vital sign for underlying brain dysfunction. Routine standardized assessment and communication of arousal during routine clinical care may be warranted.

Disclosures: Research reported in this publication was supported by the Vanderbilt Physician Scientist Development Award, Emergency Medicine Foundation, and National Institute on Aging of the National Institutes of Health under award number K23AG032355. This study was also supported by the National Center for Research Resources, grant UL1 RR024975‐01, and is now at the National Center for Advancing Translational Sciences, grant 2 UL1 TR000445‐06. Dr. Vasilevskis was supported in part by the National Institute on Aging of the National Institutes of Health under award number K23AG040157. Dr. Powers was supported by Health Resources and Services Administration Geriatric Education Centers, grant 1D31HP08823‐01‐00. Dr. Storrow was supported by the National Heart, Lung, and Blood Institute of the National Institutes of Health under award number K12HL1090 and the National Center for Advancing Translational Sciences under award number UL1TR000445. Dr. Ely was supported in part by the National Institute on Aging of the National Institutes of Health under award numbers R01AG027472 and R01AG035117, and a Veteran Affairs MERIT award. Drs. Vasilevskis, Schnelle, Dittus, Powers, and Ely were supported by the Veteran Affairs Geriatric Research, Education, and Clinical Center. The content is solely the responsibility of the authors and does not necessarily represent the official views of Vanderbilt University, Emergency Medicine Foundation, National Institutes of Health, and Veterans Affairs. The funding agencies did not have any role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.

J.H.H., E.W.E., J.F.S., A.B.S., and R.D.S. conceived the trial. J.H.H., E.W.E., A.B.S., J.F.S., R.D.S., A.S., and A.W. participated in the study design. J.H.H. and A.W. recruited patients and collected the data. J.H.H., A.J.G., and A.S. analyzed the data. All authors participated in the interpretation of results. J.H.H. drafted the manuscript, and all authors contributed to the critical review and revision of the manuscript.

The authors report no conflicts of interest.

References
  1. Flaherty JH, Shay K, Weir C, et al. The development of a mental status vital sign for use across the spectrum of care. J Am Med Dir Assoc. 2009;10:379380.
  2. Chester JG, Beth Harrington M, Rudolph JL, Group VADW. Serial administration of a modified Richmond Agitation and Sedation Scale for delirium screening. J Hosp Med. 2012;7:450453.
  3. Inouye SK, Dyck CH, Alessi CA, Balkin S, Siegal AP, Horwitz RI. Clarifying confusion: the confusion assessment method. A new method for detection of delirium. Ann Intern Med. 1990;113:941948.
  4. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM‐ICU). JAMA. 2001;286:27032710.
  5. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the Delirium Triage Screen And The Brief Confusion Assessment Method. Ann Emerg Med. 2013;62:457465.
  6. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three‐site epidemiologic study. J Gen Intern Med. 1998;13:234242.
  7. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19:158163.
  8. Witlox J, Eurelings LS, Jonghe JF, Kalisvaart KJ, Eikelenboom P, Gool WA. Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta‐analysis. JAMA. 2010;304:443451.
  9. Leslie DL, Marcantonio ER, Zhang Y, Leo‐Summers L, Inouye SK. One‐year health care costs associated with delirium in the elderly population. Arch Intern Med. 2008;168:2732.
  10. Sessler CN, Gosnell MS, Grap MJ, et al. The Richmond Agitation‐Sedation Scale: validity and reliability in adult intensive care unit patients. Am J Respir Crit Care Med. 2002;166:13381344.
  11. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56:244252.
  12. Han JH, Eden S, Shintani A, et al. Delirium in older emergency department patients is an independent predictor of hospital length of stay. Acad Emerg Med. 2011;18:451457.
  13. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method For The Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21:180187.
  14. Ely EW, Truman B, Shintani A, et al. Monitoring sedation status over time in ICU patients: reliability and validity of the Richmond Agitation‐Sedation Scale (RASS). JAMA. 2003;289:29832991.
  15. Holsinger T, Deveau J, Boustani M, Williams JW. Does this patient have dementia? JAMA. 2007;297:23912404.
  16. Jorm AF. A short form of the Informant Questionnaire on Cognitive Decline in the Elderly (IQCODE): development and cross‐validation. Psychol Med. 1994;24:145153.
  17. Katz S. Assessing self‐maintenance: activities of daily living, mobility, and instrumental activities of daily living. J Am Geriatr Soc. 1983;31:721727.
  18. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one‐year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13:530536.
  19. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13:818829.
  20. American Psychiatric Association. Task Force on DSM‐IV. Diagnostic and Statistical Manual of Mental Disorders: DSM‐IV‐TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
  21. Harrell FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. New York, NY: Springer; 2001.
  22. Marshall SW. Power for tests of interaction: effect of raising the Type I error rate. Epidemiol Perspect Innov. 2007;4:4.
  23. Austin PC. An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies. Multivariate Behav Res. 2011;46:399424.
  24. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65:207214.
  25. McCusker J, Cole M, Abrahamowicz M, Primeau F, Belzile E. Delirium predicts 12‐month mortality. Arch Intern Med. 2002;162:457463.
  26. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291:17531762.
  27. Teres D, Brown RB, Lemeshow S. Predicting mortality of intensive care unit patients. The importance of coma. Crit Care Med. 1982;10:8695.
  28. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet. 1975;1:480484.
  29. Levy DE, Caronna JJ, Singer BH, Lapinski RH, Frydman H, Plum F. Predicting outcome from hypoxic‐ischemic coma. JAMA. 1985;253:14201426.
  30. Tuhrim S, Dambrosia JM, Price TR, et al. Prediction of intracerebral hemorrhage survival. Ann Neurol. 1988;24:258263.
  31. Booth CM, Boone RH, Tomlinson G, Detsky AS. Is this patient dead, vegetative, or severely neurologically impaired? Assessing outcome for comatose survivors of cardiac arrest. JAMA. 2004;291:870879.
  32. Shehabi Y, Bellomo R, Reade MC, et al. Early intensive care sedation predicts long‐term mortality in ventilated critically ill patients. Am J Respir Crit Care Med. 2012;186:724731.
  33. Zuliani G, Cherubini A, Ranzini M, Ruggiero C, Atti AR, Fellin R. Risk factors for short‐term mortality in older subjects with acute ischemic stroke. Gerontology. 2006;52:231236.
  34. Cole M, McCusker J, Dendukuri N, Han L. The prognostic significance of subsyndromal delirium in elderly medical inpatients. J Am Geriatr Soc. 2003;51:754760.
  35. Lim WS, der Eerden MM, Laing R, et al. Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study. Thorax. 2003;58:377382.
  36. Fine MJ, Auble TE, Yealy DM, et al. A prediction rule to identify low‐risk patients with community‐acquired pneumonia. N Engl J Med. 1997;336:243250.
  37. Aujesky D, Obrosky DS, Stone RA, et al. Derivation and validation of a prognostic model for pulmonary embolism. Am J Respir Crit Care Med. 2005;172:10411046.
References
  1. Flaherty JH, Shay K, Weir C, et al. The development of a mental status vital sign for use across the spectrum of care. J Am Med Dir Assoc. 2009;10:379380.
  2. Chester JG, Beth Harrington M, Rudolph JL, Group VADW. Serial administration of a modified Richmond Agitation and Sedation Scale for delirium screening. J Hosp Med. 2012;7:450453.
  3. Inouye SK, Dyck CH, Alessi CA, Balkin S, Siegal AP, Horwitz RI. Clarifying confusion: the confusion assessment method. A new method for detection of delirium. Ann Intern Med. 1990;113:941948.
  4. Ely EW, Inouye SK, Bernard GR, et al. Delirium in mechanically ventilated patients: validity and reliability of the confusion assessment method for the intensive care unit (CAM‐ICU). JAMA. 2001;286:27032710.
  5. Han JH, Wilson A, Vasilevskis EE, et al. Diagnosing delirium in older emergency department patients: validity and reliability of the Delirium Triage Screen And The Brief Confusion Assessment Method. Ann Emerg Med. 2013;62:457465.
  6. Inouye SK, Rushing JT, Foreman MD, Palmer RM, Pompei P. Does delirium contribute to poor hospital outcomes? A three‐site epidemiologic study. J Gen Intern Med. 1998;13:234242.
  7. Pitkala KH, Laurila JV, Strandberg TE, Tilvis RS. Prognostic significance of delirium in frail older people. Dement Geriatr Cogn Disord. 2005;19:158163.
  8. Witlox J, Eurelings LS, Jonghe JF, Kalisvaart KJ, Eikelenboom P, Gool WA. Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta‐analysis. JAMA. 2010;304:443451.
  9. Leslie DL, Marcantonio ER, Zhang Y, Leo‐Summers L, Inouye SK. One‐year health care costs associated with delirium in the elderly population. Arch Intern Med. 2008;168:2732.
  10. Sessler CN, Gosnell MS, Grap MJ, et al. The Richmond Agitation‐Sedation Scale: validity and reliability in adult intensive care unit patients. Am J Respir Crit Care Med. 2002;166:13381344.
  11. Han JH, Shintani A, Eden S, et al. Delirium in the emergency department: an independent predictor of death within 6 months. Ann Emerg Med. 2010;56:244252.
  12. Han JH, Eden S, Shintani A, et al. Delirium in older emergency department patients is an independent predictor of hospital length of stay. Acad Emerg Med. 2011;18:451457.
  13. Han JH, Wilson A, Graves AJ, et al. Validation of the Confusion Assessment Method For The Intensive Care Unit in older emergency department patients. Acad Emerg Med. 2014;21:180187.
  14. Ely EW, Truman B, Shintani A, et al. Monitoring sedation status over time in ICU patients: reliability and validity of the Richmond Agitation‐Sedation Scale (RASS). JAMA. 2003;289:29832991.
  15. Holsinger T, Deveau J, Boustani M, Williams JW. Does this patient have dementia? JAMA. 2007;297:23912404.
  16. Jorm AF. A short form of the Informant Questionnaire on Cognitive Decline in the Elderly (IQCODE): development and cross‐validation. Psychol Med. 1994;24:145153.
  17. Katz S. Assessing self‐maintenance: activities of daily living, mobility, and instrumental activities of daily living. J Am Geriatr Soc. 1983;31:721727.
  18. Murray SB, Bates DW, Ngo L, Ufberg JW, Shapiro NI. Charlson Index is associated with one‐year mortality in emergency department patients with suspected infection. Acad Emerg Med. 2006;13:530536.
  19. Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13:818829.
  20. American Psychiatric Association. Task Force on DSM‐IV. Diagnostic and Statistical Manual of Mental Disorders: DSM‐IV‐TR. 4th ed. Washington, DC: American Psychiatric Association; 2000.
  21. Harrell FE. Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. New York, NY: Springer; 2001.
  22. Marshall SW. Power for tests of interaction: effect of raising the Type I error rate. Epidemiol Perspect Innov. 2007;4:4.
  23. Austin PC. An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies. Multivariate Behav Res. 2011;46:399424.
  24. Meagher DJ, Maclullich AM, Laurila JV. Defining delirium for the International Classification of Diseases, 11th Revision. J Psychosom Res. 2008;65:207214.
  25. McCusker J, Cole M, Abrahamowicz M, Primeau F, Belzile E. Delirium predicts 12‐month mortality. Arch Intern Med. 2002;162:457463.
  26. Ely EW, Shintani A, Truman B, et al. Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit. JAMA. 2004;291:17531762.
  27. Teres D, Brown RB, Lemeshow S. Predicting mortality of intensive care unit patients. The importance of coma. Crit Care Med. 1982;10:8695.
  28. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet. 1975;1:480484.
  29. Levy DE, Caronna JJ, Singer BH, Lapinski RH, Frydman H, Plum F. Predicting outcome from hypoxic‐ischemic coma. JAMA. 1985;253:14201426.
  30. Tuhrim S, Dambrosia JM, Price TR, et al. Prediction of intracerebral hemorrhage survival. Ann Neurol. 1988;24:258263.
  31. Booth CM, Boone RH, Tomlinson G, Detsky AS. Is this patient dead, vegetative, or severely neurologically impaired? Assessing outcome for comatose survivors of cardiac arrest. JAMA. 2004;291:870879.
  32. Shehabi Y, Bellomo R, Reade MC, et al. Early intensive care sedation predicts long‐term mortality in ventilated critically ill patients. Am J Respir Crit Care Med. 2012;186:724731.
  33. Zuliani G, Cherubini A, Ranzini M, Ruggiero C, Atti AR, Fellin R. Risk factors for short‐term mortality in older subjects with acute ischemic stroke. Gerontology. 2006;52:231236.
  34. Cole M, McCusker J, Dendukuri N, Han L. The prognostic significance of subsyndromal delirium in elderly medical inpatients. J Am Geriatr Soc. 2003;51:754760.
  35. Lim WS, der Eerden MM, Laing R, et al. Defining community acquired pneumonia severity on presentation to hospital: an international derivation and validation study. Thorax. 2003;58:377382.
  36. Fine MJ, Auble TE, Yealy DM, et al. A prediction rule to identify low‐risk patients with community‐acquired pneumonia. N Engl J Med. 1997;336:243250.
  37. Aujesky D, Obrosky DS, Stone RA, et al. Derivation and validation of a prognostic model for pulmonary embolism. Am J Respir Crit Care Med. 2005;172:10411046.
Issue
Journal of Hospital Medicine - 9(12)
Issue
Journal of Hospital Medicine - 9(12)
Page Number
772-778
Page Number
772-778
Publications
Publications
Article Type
Display Headline
Impaired arousal at initial presentation predicts 6‐month mortality: An analysis of 1084 acutely ill older patients
Display Headline
Impaired arousal at initial presentation predicts 6‐month mortality: An analysis of 1084 acutely ill older patients
Sections
Article Source

© 2014 Society of Hospital Medicine

Disallow All Ads
Correspondence Location
Address for correspondence and reprint requests: Jin H. Han, MD, Department of Emergency Medicine, Vanderbilt University Medical Center, 703 Oxford House, Nashville, TN 37232‐4700; Telephone: 615‐936‐0087; Fax: 615‐936‐1316; E‐mail: jin.h.han@vanderbilt.edu
Content Gating
No Gating (article Unlocked/Free)
Alternative CME
Article PDF Media
Media Files