© 2006 American Public Health Association DOI: 10.2105/AJPH.2005.062620
David W. Baker, Kenzie A. Cameron, Joseph Feinglass, Jason Thompson, and Patricia Georgas are with the Division of General Internal Medicine, Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, Ill. D.W. Baker is also with the Institute for Healthcare Studies, Feinberg School of Medicine. Shawn Foster, Deborah Pierce, and Romana Hasnain-Wynia are with the Health Research and Educational Trust, Chicago. Correspondence: Requests for reprints should be sent to David W. Baker, MD, MPH, 676 N St Clair St, Suite 200, Chicago, IL 60611 (e-mail: dwbaker{at}northwestern.edu).
Objectives. We assessed the feasibility of collecting race/ethnicity data from patients using their own preferred racial/ethnic terms. Methods. The 424 patients described their race/ethnicity using their own categories, and we compared their descriptions with their responses to the questions (1) "Do you consider yourself Latino or Hispanic?" and (2) "Which category best describes your race?" (7 response options in our computer interview). We also determined patients preferences between the 2 approaches.
Results.seconds. Rates of missing values and categorization as "other" race were lower than with the closed questions. Agreement between racial/ethnic categorization with open-ended and closed responses was 93% ( Conclusions. Collecting race/ethnicity data using patients own racial/ethnic categories is feasible with the use of computerized systems to capture verbatim responses and results in lower rates of missing and unusable data than do standard questions.
People of color and racial/ethnic minorities in the United States often receive lower-quality health care than Whites.1,2 The first step toward addressing this problem is for health care providers to routinely collect data on patients race, ethnicity, and language and link these data to measures of quality, safety, and utilization.15 With such information, provider organizations can target quality-improvement programs to reduce disparities at their own institutions. A recent survey found that only 78% of US hospitals systematically collected data on patients race or ethnicity.6 Among those that did, more than half relied on registration clerks impressions of patients race and ethnicity rather than directly asking patients. When compared with patients self-report, staff impressions of patients race/ethnicity are reasonably accurate for Whites and Blacks but much less accurate for other groups.711 In addition, relying on staff impressions results in high rates of patients having missing data or being classified as "unknown" or "other."4,7 Because of this, expert panels have recommended that patients race and ethnicity should be collected by self-report.15 However, even if there is widespread adoption of the recommendation to collect patients race/ethnicity by self-report, current data collection methods are suboptimal. Providers typically ask questions that comply with the Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity from the Office of Management and Budget (OMB) and are used by the US Census Bureau: (1) "Do you consider yourself Latino or Hispanic?" and (2) "Which category best describes your race: White, Black or African American, Asian, Native Hawaiian or Pacific Islander, American Indian or Alaska Native, multiracial, or another race I did not mention?"12 More-detailed information is needed to address health and health care disparities.5 For example, several studies have emphasized the heterogeneity within the Hispanic/Latino population1318; categorizing individuals of Mexican, Cuban, and Puerto Rican ancestry as simply Hispanic/Latino may obscure disparities in access, utilization, or quality of care.19 Important differences in risk factors for cardiovascular disease have been reported across Asian subgroups.20,21 The increasing heterogeneity of the population of the United States makes the deficiencies of the traditional OMB race/ethnicity questions even more salient. How should immigrants from Poland, Iraq, and India classify themselves? Classifying them as White blurs important distinctions in their health beliefs, behaviors, and treatment by health care providers. Alternatively, classifying people from Iraq and India as Asian combines them with individuals of Hmong or Japanese ancestry, which is equally inappropriate. The OMB standards were expressly intended as a minimum data requirement, but few health care providers ask more detailed questions. In part these questions are not asked because collecting more-detailed information is problematic if a clerk must read a long list of options from which patients are asked to choose. An alternative is to allow patients to self-describe themselves and record their verbatim responses with computerized data collection tools. This should allow providers to analyze their data at different levels of complexity, ranging from small-subgroup analyses using unique categories (e.g., Korean or Puerto Rican) up to more coarse, aggregated categories (e.g., Asian or Hispanic/Latino). We undertook this project to develop an instrument that could rapidly and accurately collect patients self-described race/ethnicity. The first phase tested the feasibility of allowing patients to self-identify themselves with whatever terms they wanted. The second phase tested a computerized tool for capturing this information and sought to examine the terms used by patients to self-identify themselves, rates of missing values and refusals, and time required for data collection.
Study Site and Population Our study was conducted in the General Internal Medicine Clinic of the Northwestern Medical Faculty Foundation in Chicago, Ill, from December 2003 to April 2004. Patients were eligible if they were aged 18 years and spoke English fluently. We approached patients after their visit and asked them to participate, stating: "The purpose of this research study is to understand what patients think about being asked to give information about their race or ethnic background." A parking pass was offered as an incentive, and we did not offer additional incentives to people who refused. After finishing the interview, a research assistant approached the next patient coming to the discharge area. Thus, the study population was a convenience sample and not a random sample. Patients who refused were asked their age, and the research assistant recorded the patients gender and probable race/ethnicity.
Initial Instrument Development and Testing (Phase 1) Next, we asked patients how comfortable they would feel stating their race/ethnicity if asked to do so by a clerk at a hospital or clinic. We tested 4 statements to determine if they increased patients level of comfort when providing their race/ethnicity. These results have been described in a previous paper22 and are not presented here. The preferred statement, for both Whites and non-Whites, was "We want to make sure that all our patients get the best care possible, regardless of their race or ethnic background. We would like you to tell us your race or ethnic background so that we can review the treatment that all patients receive and make sure that everyone gets the highest quality of care." A modified version of this statement was, therefore, chosen as the introduction to race/ethnicity question in phase 2.
Testing of the Final Instrument (Phase 2) During the interview, patients were first read this statement: "We want to make sure that all our patients get the best care possible, regardless of their race or ethnic background. We would like you to tell us your race or ethnic background so that we can review the treatment that all patients receive and make sure that everyone gets the highest quality of care. You can use general terms such as White, African American, Latino, or Asian or you can use more specific terms like Irish, Mexican, Hawaiian, or Navajo. You can use more than 1 term if you like." The results of pilot testing suggested that asking "Anything else?" after each response led some individuals to search for answers and give redundant responses (e.g., White and "Caucasian"). We therefore simply allowed respondents time to say other terms, but we did not solicit additional responses. The database had drop-down windows with terms listed alphabetically so that the research assistant only had to type the first few letters of what the patient said and then select the desired term by double-clicking on it to enter it into the database. The database automatically proceeded to the next field after entry to allow participants to say additional terms. Time stamps were automatically recorded during this process to measure the time required for completion.
Data Analysis
Differences in patients preferences for describing their race/ethnicity with their own terms compared with choosing from a list of options were assessed with pairwise 2 tests. We then used multinomial logistic regression to examine racial/ethnic differences in patients preferences with adjustment for age, gender, and study phase (1 or 2); patients who thought the 2 methods were "about the same" were used as the reference group in the model. One Native American patient, 2 patients who described themselves only as "American," and 2 patients who refused to answer the open-ended questions were excluded from the final multivariate model. Two-tailed tests were used for all analyses, and a final P value of .05 was used to determine statistical significance.
Finally, we analyzed differences between patients self-reported descriptions of their race/ethnicity and their responses to the OMB questions regarding Hispanic/Latino ethnicity and race. We used patients own terms to categorize them into groups that approximated the categories used in federal studies, as described earlier. Because the open-ended self-report method does not ask separate questions for Hispanic ethnicity and race, it was not possible to classify all individuals as non-Hispanic White, Hispanic White, non-Hispanic Black, Hispanic Black, and so forth. Therefore, to assess the agreement between the 2 systems, we compared the Latino/Hispanic category from the open-ended questions with the Hispanic White category derived from the OMB questions; the 3 participants who were classified as Black Hispanic on the basis of the OMB questions were considered to not have a matching category on the basis of the classification from self-report (i.e., 100% disagreement between the 2 methods). The percentage agreement between the 2 methods of collecting race/ethnicity data was calculated along with the
Participant Characteristics Of the 373 patients who were approached and asked to participate in the initial instrument development (phase 1), 220 (59%) gave informed consent and completed the interview. The mean age of participants was 44 years, and 67% were female (Table 1
Responses to Open-Ended Question About Race/Ethnicity When participants in the initial testing (phase 1) were asked to describe their race or ethnicity with any terms they wanted, 46% of participants described themselves with a single term, 33% used 2 terms, 14% used 3 terms, and 6% used 4 terms; 2 patients (1%) said only "American," and 1 patient (0.5%) refused to answer. Among those who specified 2 or more terms, many used redundant terms that mapped to a single composite race/ethnicity category (e.g., White and "Caucasian," or Black and African American). The distribution of participants responses is shown in Table 2 2 test comparing the number of terms used in phase 1 vs phase 2); no patients in phase 2 used the term "American," and no patients refused to answer.
Responses to the OMB Questions
Preferences for Using Own Terms vs Choosing from a List
Categorization With Patients Own Terms vs Choosing from OMB Categories There was excellent agreement between the categorization based on patients own words compared with their responses to their choices for the questions on Hispanic ethnicity and race (Table 4 statistic = 0.88).
Despite the overall high level of agreement between the 2 classification systems, the areas of disagreement were noteworthy and important. Among the 37 patients who described themselves as Latino, Hispanic, or being from a country in Central or South America, 17 (46.0%) chose Other as their race when asked to choose from a list. Another 8 (21.6%) patients refused to choose any race term. In addition, a "Spaniard from Spain" refused to answer the question about Hispanic ethnicity.
Rates of choosing the Other race option from the OMB list or refusing to answer the race question were lower among self-described Whites and African Americans but still problematic (3.1% and 8.3%, respectively). For example, 1 patient self-described herself as Jamaican and Barbasian (i.e., from Barbados) and chose Other as her race; a patient from "West Africa" did the same. Of the 6 patients who described themselves as being of Middle Eastern origin (i.e., Arab, Iranian, Palestinian, or "Middle Eastern"), 2 chose their race as Other and 1 refused to answer (50%). Disagreement between the 2 methods was worst among the 33 individuals who described themselves with terms from 2 or more categories or who described themselves as multiracial (Table 4
Time for Completion
This study shows it is possible to rapidly collect detailed information on patients race and ethnicity with open-ended questions that allow people to respond with terms of their choice. This allows maximal flexibility and adaptability. Health care providers can conduct analyses using the most specific categories possible, such as "How many Pakistani patients did we care for last year, and what proportion of Pakistani women received screening for cervical cancer?" This high level of detail will allow providers to identify vulnerable populations that should be targeted for quality-improvement efforts.24 If an organization needs to report data to meet federal or state reporting requirements, providers can still aggregate unique terms to correspond to the major categories used by federal agencies and achieve similar results to what they would have obtained if they collected race/ethnicity information using the OMB categories. Allowing patients to state their race/ethnicity with their own terms took an average of only 37 seconds, so providers should be reassured that collecting this level of detail does not pose a major time or cost burden. The most important advantage of using open-ended questions to collect information about patients race and ethnicity is improved accuracy. Many of the patients who identified themselves as Latino, Hispanic, or being from a country in Central or South America chose their race as Other or refused to answer the race question at all when they were presented with the standard OMB question. This result is consistent with previous reports that many Latinos do not view themselves as having a race separate from their ethnicity.25,26 This reticence probably explains why Latinos/Hispanics in our study were much more likely to prefer to use their own words to describe their race and ethnicity (adjusted odds ratio = 3.0, 95% CI = 1.2, 7.5). Similarly, individuals from the Middle East, the Indian subcontinent, Southeast Asia, and the Pacific Islands often do not describe themselves as White or Asian, and neither term is a meaningful indicator of their culture, their beliefs about health and health care, or their experiences within our health care system. As a consequence, many of the individuals from these groups in our study chose Other as their race or refused to answer the question. Using the OMB questions to collect race/ethnicity results in an unacceptably high rate of unusable data for individuals in these groups as well as Latinos/Hispanics, which could create bias in analyses of disparities in care and diminish the power to detect disparities among vulnerable groups. Collecting race/ethnicity by allowing people to describe themselves with their own terms is particularly important for individuals who identify with 2 or more racial or ethnic groups. First, this system allows this group to identify themselves more precisely. Second, a majority (51.5%) of multiracial/multiethnic study participants said they somewhat or strongly preferred to describe themselves with their own terms as opposed to choosing from a list. However, we did not allow people to choose multiple racial categories from the OMB list. It is possible that the preference among multiracial individuals for describing themselves with their own terms would have been less if we had allowed people to choose multiple options from the list of race choices. Despite these advantages, allowing patients to describe themselves with their own terms has limitations as well. Three patients described themselves only as Black or African American, but they said that they were Latino/Hispanic when asked this question directly. The fact that these patients did not describe themselves with the term Latino or Hispanic might mean that this aspect of their heritage was not a very important part of their identity, although we did not specifically ask patients about this. Regardless, our results suggest that a small percentage of people with Latino/Hispanic ancestry may not be captured if patients describe their race/ethnicity with their own terms without being asked specifically whether or not they are Latino/Hispanic.
Limitations We were also unable to interview people who did not speak English fluently. Approximately 3% of patients at our institution prefer to speak a language other than English, and approximately 1% have limited English proficiency. Most of these individuals are likely to have strong ethnic identities. Including them probably would have increased the number of people who used specific terms to describe their race/ethnicity and who chose Other or had missing data for the OMB race question. Although we did not interview patients with limited English proficiency or ask about language barriers as part of this study, routinely asking patients their preferred language and their ability to speak and understand English is probably just as important as collecting data on race/ethnicity. More research is needed to understand patients attitudes about collecting this information and the accuracy of their self-reported English proficiency. We also do not know the generalizability of our findings. The study sample was relatively small and had limited representation of Latinos, Asians, and other groups. The study was conducted at only 1 site, and results could vary across the country. Although the agreement between the OMB classification and patients self-reported race/ethnicity was high in this study, the discrepancies found suggest that our agreement rate would have been markedly lower among a study population with a higher proportion of Latinos/Hispanics, people from the Middle East or South Asia, and multiracial/multiethnic individuals. Finally, we do not know health care providers typical costs of implementing changes in data collection for patients race/ethnicity. At Northwestern Memorial Hospital, it was possible to implement a new system with only a few days effort by a computer programmer and several 1-hour sessions to train registration staff in new techniques. This likely varies significantly depending on the hardware and software used for registration. Training materials are now available over the Internet (available at: http://www.hretdisparities.org/hretdisparities/html/general/gtraining.html), so the cost of developing training materials is negligible.
Conclusions
This study was funded by The Commonwealth Fund and the Health Research and Educational Trust.
Human Participant Protection
Peer Reviewed
Contributors Accepted for publication June 1, 2005.
1. Committee on Understanding and Eliminating Racial and Ethnic Disparities in Health Care. Unequal Treatment: Confronting Racial and Ethnic Disparities in Health Care. Washington, DC: Institute of Medicine; 2003. 2. The Right to Equal Treatment. Washington, DC: Physicians for Human Rights; 2004. 3. Fiscella K, Franks P, Gold MR, et al. Inequality in quality: addressing socioeconomic, racial, and ethnic disparities in health care. JAMA. 2000;283: 25792584. 4. Bierman AS, Lurie N, Collins KS, et al. Addressing racial and ethnic barriers to effective health care: the need for better data. Health Aff (Millwood). 2002;21: 91102. 5. Panel on DHHS Collection of Race and Ethnicity Data. Eliminating Health Disparities: Measurement and Data Needs. Washington, DC: National Academies Press; 2004. 6. Hasnain-Wynia R, Pierce D, Pittman MA. Who, What, When, Where: The Current State of Data of Collection on Race and Ethnicity in Hospitals. New York, NY: Commonwealth Fund; 2004. 7. Boehmer U, Kressin NR, Berlowitz DR, et al. Self-reported vs administrative race/ethnicity data and study results. Am J Public Health. 2002;92:14711472. 8. Hahn RA, Mulinare J, Teutsch SM. Inconsistencies in coding of race and ethnicity between birth and death in US infants. A new look at infant mortality, 1983 through 1985. JAMA. 1992;267:259263. 9. Swallen KC, Glaser SL, Stewart SL, et al. Accuracy of racial classification of Vietnamese patients in a population-based cancer registry. Ethn Dis. 1998;8: 218227.[Medline] 10. Swallen KC, West DW, Stewart SL, et al. Predictors of misclassification of Hispanic ethnicity in a population-based cancer registry. Ann Epidemiol. 1997;7: 200206.[CrossRef][Web of Science][Medline] 11. Pan CX, Glynn RJ, Mogun H, et al. Definition of race and ethnicity in older people in Medicare and Medicaid. J Am Geriatr Soc. 1999;47:730733.[Web of Science][Medline] 12. Office of Management and Budget. Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity. Washington, DC: US Census Bureau. Available at: http://www.census.gov/population/www/socdemo/race/Ombdir15.html. Accessed January 16, 2005. 13. Weinick RM, Jacobs EA, Stone LC, et al. Hispanic healthcare disparities: challenging the myth of a monolithic Hispanic population. Med Care. 2004;42: 313320.[CrossRef][Web of Science][Medline] 14. Hispanic health in the United States. Council on Scientific Affairs. JAMA. 1991;265:248252. 15. Trevino FM, Moyer ME, Valdez RB, et al. Health insurance coverage and utilization of health services by Mexican Americans, mainland Puerto Ricans, and Cuban Americans. JAMA. 1991;265:233237. 16. Mendoza FS, Ventura SJ, Valdez RB, et al. Selected measures of health status for Mexican-American, mainland Puerto Rican, and Cuban-American children. JAMA. 1991;265:227232. 17. Stroup-Benham CA, Trevino FM. Reproductive characteristics of Mexican-American, mainland Puerto Rican, and Cuban-American women: Data from the Hispanic Health and Nutrition Examination Survey. JAMA. 1991;265:222226. 18. Becerra JE, Hogue CJ, Atrash HK, et al. Infant mortality among Hispanics. A portrait of heterogeneity. JAMA. 1991;265:217221. 19. Zambrana RE, Carter-Pokras O. Health data issues for Hispanics: implications for public health research. J Health Care Poor Underserved. 2001;12(1):2034.[Web of Science][Medline] 20. Gomez SL, Kelsey JL, Glaser SL, et al. Immigration and acculturation in relation to health and health-related risk factors among specific Asian subgroups in a health maintenance organization. Am J Public Health. 2004; 94:19771984. 21. Lee J, Heng D, Chia KS, et al. Risk factors and incident coronary heart disease in Chinese, Malay and Asian Indian males: the Singapore Cardiovascular Cohort Study. Int J Epidemiol. 2001; 30:983988. 22. Baker DW, Cameron KA, Feinglass J, et al. Patients attitudes toward health care providers collecting information about their race and ethnicity. J Gen Intern Med. 2005;20:895900.[CrossRef][Web of Science][Medline] 23. Landis JR, Koch GG. The measure of observer agreement for categorical data. Biometrics. 1977;33: 159174.[CrossRef][Web of Science][Medline] 24. Greenfield A. New hospital gown gives Muslim women comfort [press release]. Maine Medical Center; June 22, 2004. Available at: http://www.mmc.org/mmc_news/062204.htm. Accessed January 16, 2005. 25. Zubizarreta P. Is there a Hispanic race? Lancet. 1999;354:780.[Web of Science][Medline] 26. Navarro M. Going beyond Black and White, Hispanics choose "other." New York Times. November 9, 2003. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||