Objectives. We evaluated the racial misclassification of American Indians and Alaska Natives (AI/ANs) in cancer incidence and all-cause mortality data by Indian Health Service (IHS) Contract Health Service Delivery Area (CHSDA).

Methods. We evaluated data from 3 sources: IHS-National Vital Statistics System (NVSS), IHS-National Program of Cancer Registries (NPCR)/Surveillance, Epidemiology and End Results (SEER) program, and National Longitudinal Mortality Study (NLMS). We calculated, within each data source, the sensitivity and classification ratios by sex, IHS region, and urban–rural classification by CHSDA county.

Results. Sensitivity was significantly greater in CHSDA counties (IHS-NVSS: 83.6%; IHS-NPCR/SEER: 77.6%; NLMS: 68.8%) than non-CHSDA counties (IHS-NVSS: 54.8%; IHS-NPCR/SEER: 39.0%; NLMS: 28.3%). Classification ratios indicated less misclassification in CHSDA counties (IHS-NVSS: 1.20%; IHS-NPCR/SEER: 1.29%; NLMS: 1.18%) than non-CHSDA counties (IHS-NVSS: 1.82%; IHS-NPCR/SEER: 2.56%; NLMS: 1.81%). Race misclassification was less in rural counties and in regions with the greatest concentrations of AI/AN persons (Alaska, Southwest, and Northern Plains).

Conclusions. Limiting presentation and analysis to CHSDA counties helped mitigate the effects of race misclassification of AI/AN persons, although a portion of the population was excluded.

Accurate determinations of disease and mortality are a critical first step toward addressing disease burden and health disparities. American Indian/Alaska Native (AI/AN) populations experience some of the greatest health disparities in the country compared with other racial and ethnic groups.1–3 Health and mortality status assessments for AI/AN populations are often hindered by a lack of complete and accurate data on race and ethnicity in surveillance and vital statistics systems. AI/AN populations are more likely to be misclassified as another race than other racial groups in cancer registries, resulting in underestimates of cancer incidence.4–10 Similarly, misclassification of AI/AN race is a common problem on death certificates,11–18 on which ascertainment of race is usually provided by a funeral director. As a result, mortality estimates for the AI/AN population in the United States have been significantly underestimated.13

A study of racial/ethnic misclassification on US death certificates, which compared self-identified race from the US Census Bureau’s Current Population Survey (CPS) to the race recorded on death certificates for a sample of decedents in the National Longitudinal Mortality Study (NLMS) database, found markedly higher race misclassification of AI/AN persons (30%) compared with persons of other races that varied substantially by degree of geographic co-ethnic concentration.13 For example, AI/AN decedents who died in counties with high concentrations of AI/AN populations were significantly more likely to be classified correctly on death certificates than those who died outside of these counties.13 Similarly, a study comparing the National Cancer Institute’s (NCI) Surveillance, Epidemiology, and End Results (SEER) Program with NLMS found that SEER data considerably underreported AI/AN persons.19 A project matching Indian Health Service (IHS) patient registration records with the National Death Index (NDI) records of persons who died from 1986 to 1988 showed that the percentage of inconsistent classifications of AI/AN race varied from 1.2% in the Navajo IHS Area to 30.4% in the California IHS Area.20

The IHS provides primary health care to approximately 2.2 million enrolled members of federally recognized tribes, a number equivalent to approximately 64% of the United States estimated 3.4 million AI/AN population.21,22 Health care services for AI/AN individuals are provided in more than 670 IHS and tribal health care facilities, mostly in rural and isolated areas.23 Eligible AI/AN persons can receive health care at any IHS facility, but complex rules govern and restrict the delivery of contract health services for specialty medical care that is not available at IHS facilities.24 One eligibility requirement for contract health services is residence within the Contract Health Service Delivery Area (CHSDA) of the tribe in which the patient is enrolled. The geographic composition of the CHSDAs follows county boundaries and is established for each federally recognized tribe by the IHS.25 Details of the IHS regions (Northern Plains, Alaska, Southern Plains, Pacific Coast, East, and Southwest) and CHSDA areas are provided elsewhere26 and shown in Figure A (available as a supplement to the online version of this article at http://www.ajph.org).

Record linkages with IHS patient enrollment data are 1 method for addressing misclassification of AI/AN race in central cancer registries and in vital statistics mortality data; such linkages have been found to be both timely and cost effective.8,26–29 An additional method to reduce the impact of race misclassification that has been used in cancer and mortality reporting is that of restricting analysis to CHSDA counties.26,28,30–32 The proportion of AI/AN persons in the total population is higher in CHSDA counties than in non-CHSDA counties, and previous studies have shown lower levels of racial misclassification for AI/AN persons in CHSDA counties.13,33 The rationale for this approach is that there is likely to be more awareness of AI/AN race in theses counties.13

Our objective was to evaluate racial misclassification in both cancer registry incidence and all-cause mortality databases and to present evidence for using CHSDAs in future reports to address race misclassification of AI/AN individuals. To investigate this, we used data from the IHS linkages with mortality and cancer registries, with confirmation from an IHS-independent linkage in the form of the NLMS.

Detailed methods describing the mortality data are explained elsewhere in this supplement.34 Detailed methods describing incidence data are available in a previous publication.26

IHS–National Vital Statistics System Mortality Files

The IHS patient registration database was linked to the NDI to identify IHS AI/AN decedents.34 Following this linkage, IHS AI/AN records for persons identified as deceased, by matching with the NDI, were then linked to 1990 to 2009 annual National Vital Statistics System (NVSS) mortality files, and the linked records were flagged to identify AI/AN ancestry. The final numbers of decedents classified as being of AI/AN race in the amended NVSS mortality files included those that were already classified as such in the original NVSS mortality file and those we identified as AI/AN decedents through the linkage with IHS. The amended NVSS mortality files provided the ability to evaluate the racial classification on the death certificate by comparing IHS race classification to race classification on the death certificates of linked records.

Indian Health Service.

For a person to receive services they had to prove that they were eligible. IHS has a classification (beneficiary) code that can be used to identify an individual as AI/AN. We restricted the IHS patient registration file to only those individuals that IHS classified as AI/AN.

Death records.

The NDI is a central electronic repository maintained within the NCHS of death record information on file in individual state vital statistics offices.35 The NDI is a file of national death record information (beginning with 1979 deaths) containing personal identifiers that is compiled from electronic files submitted by individual state vital statistics offices.

National Vital Statistics System annual mortality files.

The NVSS is the product of a voluntary contractual agreement between individual vital statistics registration areas and the Centers for Disease Control and Prevention (CDC) National Center for Health Statistics (NCHS) to collect US birth and death information. Death certificate data are compiled by each state and sent to the NCHS, where the data are assessed and edited for consistency. The NCHS makes this information available to the research community as part of the NVSS, and includes underlying and multiple cause of death fields, state of residence, age, sex, race, and ethnicity.36 The NVSS covers more than 99% of all deaths occurring annually in the 50 states, the District of Columbia, New York City (a separate area from that of New York State), and the US territories.13 Because some states have adopted the 2003 Standard Death Certificate, requiring the collection of multiple races, but others continue to use the 1989 Standard Death Certificate, which requires race categories to be in single race, the NCHS uses the same algorithm to bridge multiple race responses on death certificates to single race, which the Census Bureau uses to attain uniformity and comparability until all states adopt the 2003 standard.37

Cancer Data

We used data from state and regional population-based cancer registries in the United States that collect information on newly diagnosed primary cancers. These registries participate in the CDC’s National Program of Cancer Registries (NPCR) or the NCI’s SEER program.38–40 Incident cancer cases diagnosed from 1999 to 2002 from 43 population-based state cancer registries that provided clinical and demographic characteristic data were included in this analysis (AK, AZ, AR, CT, DE, GA, HI, ID, IA, IN, KY, LA, ME, MD, MA, MI, MS, MO, MT, NE, NV, NM, NY, NC, ND, OH, OK, OR, PA, RI, SC, SD, TN, TX, UT, VT, VA, WA, WV, WI, WY).26 We used data from 1999 to 2002 because the first IHS linkage covered those years and provided the best estimates of racial misclassification in the cancer registries. Some cancer registries updated race in response to the IHS linkages, thereby affecting calculations of sensitivity and classification ratios for more recent years.

To examine the patterns of misclassification of AI/AN cases as non-AI/AN persons, all records from the NPCR and SEER population-based registries were linked with the IHS patient registration database. Files were prepared by the registries and sent to the IHS Division of Epidemiology and Disease Prevention in Albuquerque, New Mexico for linkage. Link Plus 2.0 (CDC, Atlanta, GA),41 a probabilistic linkage software program, was used to link the central cancer registry data with IHS using key patient identifiers.26

National Longitudinal Mortality Study

The NLMS is made up of a series of CPS Annual Social and Economic Supplements and a sample of the 1980 decennial census combined with NVSS death certificate information to identify mortality status and cause of death. The CPS is a multistage, stratified probability sample of the US noninstitutionalized civilian population with an approximate 95% response rate.42

Currently, the NLMS includes 30 files covering years 1973 and 1978 to 2002, for a total of approximately 2.7 million records. Through a linkage with the NCHS NDI for 1979 to 2002, more than 341 000 of these records were identified as deaths. We evaluated the degree of racial misclassification on death certificates from 1990 to 2002 for the AI/AN population using the NLMS by comparing race as reported on the CPS to race as reported on death certificates for the sample of NLMS decedents who self-identified or were identified as AI/AN by a household member on the CPS.

Population Estimates

We used county-level population estimates produced by the US Census Bureau as denominators in the rate calculations. To manage multiple race data collected since 2000, a technique of bridging race categories into single-race annual population estimates was developed by the NCHS in collaboration with the Census Bureau.43

The NCI made further refinements regarding race and county geographic codes and adjustments for population shifts because of Hurricanes Katrina and Rita in 2005, and provided public access to these estimates at the SEER website for calculation of incidence and death rates.44

Statistical Analyses

We evaluated race classification on death certificates for AI/AN decedents and in cancer registries for AI/AN cases by calculating 2 statistical measures (Table A, available as a supplement to the online version of this article at http://www.ajph.org). First, record-level agreement between the IHS patient registration or CPS databases and the death certificates or cancer registry records for individual decedents was estimated through a measure of sensitivity. Sensitivity is the percentage of individuals who were truly AI/AN as classified by IHS or self-identified in the CPS who were correctly classified as such on the death certificate or in cancer registry records. Second, a measure of the net difference in assignment of AI/AN race between 2 distinct data collection systems was ascertained through the estimation of “classification ratios.”13 Classification ratios were the ratios of the total number of AI/AN persons in the IHS or CPS to the total number of AI/AN persons on the death certificate or in cancer registry records. These 2 measures of racial misclassification were estimated for CHSDA and non-CHSDA counties, and by IHS region.34 We restricted the analyses to only those decedents or cases that linked to IHS in the IHS-dependent linkages and to only those decedents that linked to CPS in the IHS-independent NLMS linkage.

The calculation of sensitivity and classification ratios for each of the comparisons is described in Table A. The χ2 statistic was used to determine whether differences between CHSDAs and non-CHSDAs were statistically significant.45

All-Cause Death Rates

The amended NVSS mortality files were combined with corresponding annual bridged race intercensal population data files to create an analytical file in SEER*Stat version 8.0.4 (NCI, Bethesda, MD; AI/AN-US Mortality Database [AMD]). All-cause death rates, expressed per 100 000 population, were directly age adjusted, using SEER*Stat software,46 to the 2000 US standard population and using 11 age groups (less than 1, 1–4, 5–14, 15–24, 25–34, 35–44, 45–54, 55–64, 65–74, 75–84, and 85 years and older) in accordance with a 1998 US Department of Health and Human Services recommendation.47,48 These data are different from, and therefore, were not comparable with, published death rates adjusted using a different standard population.

During preliminary analyses, it was discovered that the updated bridged intercensal populations estimates significantly overestimated AI/AN persons of Hispanic origin.49 Therefore, to avoid underestimating all-cause mortality in AI/AN populations, rate analyses were limited to non-Hispanic AI/AN persons. Non-Hispanic White was chosen as the most homogeneous referent group. For conciseness, the term “non-Hispanic” is henceforth omitted when discussing both groups.

Using the age-adjusted, all-cause death rates, standardized rate ratios (RRs) were calculated for AI/AN populations using White rates for comparison. Ninety-five percent confidence intervals (CI) for age-adjusted rates and standardized RRs were calculated based on methods described by Tiwari et al. using SEER*Stat 8.0.4 and were presented as rounded to 2 decimal places.50

Table 1 shows counts of death, estimates of sensitivity and classification ratios by sex, IHS region, urban–rural classification, and CHSDA county status. The IHS linkage with the NDI yielded a total of 187 537 IHS decedents for 1990–2009. Of the 187 537 decedents, 151 880 were identified as an AI/AN person on the death certificate, and the remaining 35 657 were misclassified as another race (data not shown). Misclassification results varied considerably by IHS region: the lowest percentages of misclassified decedents were observed in the Southwest (6.3%) and Alaska (6.5%), whereas the highest percentages of misclassified decedents were observed in the East (35.2%) and the Southern Plains (36.6%; data not shown). The majority of IHS decedents were residents of CHSDA counties (170 743 vs 16 794 in non-CHSDA counties) and rural counties (109 228 vs 78 209 in urban counties).

Table

TABLE 1— Sensitivity and Classification Ratios for Death Certificates That Linked to IHS: IHS-NVSS Mortality Files, United States, 19902009

TABLE 1— Sensitivity and Classification Ratios for Death Certificates That Linked to IHS: IHS-NVSS Mortality Files, United States, 19902009

No. AI/AN CHSDA Deaths
No. AI/AN Non-CHSDA Deaths
Sensitivitya
Classification Ratiob
GroupIHSDCIHSDCCHSDANon-CHSDACHSDANon-CHSDA
Male and Femalec170 743142 67516 794920583.654.81.21.82
 Male94 53879 6909035505084.355.91.191.79
 Female76 20562 9857759415582.753.61.211.87
Northern Plainsc33 80431 0534960367891.974.21.091.35
Alaskad13 78212 888NANA93.5NA1.07NA
Southern Plainsc42 61527 8343223123765.338.41.532.61
Southwestc51 91048 953137399294.372.31.061.38
Pacific Coastc23 29217 5053454182875.252.91.331.89
Eastc534044423784147083.238.81.22.57
Urbanc65 55151 00112 658700877.855.41.291.81
Ruralc105 09291 5774136219787.153.11.151.88

Note. AI/AN = American Indian and Alaska Native; CHSDA = Contract Health Service Delivery Area; DC = death certificate; IHS = Indian Health Service; NA = not applicable; NVSS = National Vital Statistics System. Data from 50 states and the District of Columbia. Analyses include AI/AN persons of Hispanic origin and are limited to decedents who linked to IHS.

Source. IHS-NVSS Mortality Files.

a Percentage of IHS decedents also coded as AI/AN on the death certificate of all IHS decedents.

b Ratio of the total number of decedents classified as AI/AN in IHS to the total number of decedents classified as AI/AN on the death certificate.

c Chi-square test indicates statistically significant differences between the number of AI/AN deaths in CHSDA and non-CHSDA counties, P < .01.

d All counties are CHSDA counties.

Record-level agreement or sensitivity was 83.6% for decedents in CHSDA counties and 54.8% in non-CHSDA counties. Sensitivity varied by region and county type. Among CHSDA counties, sensitivity ranged from 65.3% in the Southern Plains to 94.3% in the Southwest. In non-CHSDA counties, sensitivity ranged from 38.4% in the Southern Plains to 74.2% in the Northern Plains. In urban areas, sensitivity was 77.8% in CHSDA counties and 55.4% in non-CHSDA counties. Rural CHSDA counties had a sensitivity of 87.1% compared with 53.1% in rural non-CHSDA counties. Sensitivity was higher in rural CHSDA counties than urban CHSDA counties.

All-Cause Death Rates

The effects of race misclassification were explored by examining age-adjusted, all-cause death rates and death rate ratios for AI/AN compared with White persons before and after the IHS linkage in CHSDA counties only (Table 2). The number of decedents identified as an AI/AN person on the death certificate increased from 105 552 before the linkage to 122 644 after the linkage, for a misclassification prevalence of 14%.

Table

TABLE 2— Death Rates for All Causes by IHS Region and Sex for American Indians/Alaska Natives Compared With Whites: CHSDA Counties, United States, 19902009

TABLE 2— Death Rates for All Causes by IHS Region and Sex for American Indians/Alaska Natives Compared With Whites: CHSDA Counties, United States, 19902009

Prelink
Postlink
AI/AN Differences
IHS Region/SexAI/AN CountAI/AN RateWhite CountWhite RateAI/AN:White RR (95% CI)AI/AN CountAI/AN RateWhite CountWhite RateAI/AN:White RR (95% CI)CountRateRR
Northern Plains
 Male and female21 5221337.3788 175772.51.73* (1.70, 1.76)23 3311461.8786 392770.61.90* (1.87, 1.93)1809124.50.17
 Male11 7821604.6387 075929.51.73* (1.69, 1.77)12 7091748.8386 164927.41.89* (1.84, 1.93)927144.20.16
 Female97401133.0401 100650.91.74* (1.70, 1.78)10 6221243.4400 228649.21.92* (1.88, 1.96)882110.40.17
Alaska
 Male and female80421142.424 162752.51.52* (1.48, 1.56)86161218.623 621738.21.65* (1.61, 1.70)57476.20.13
 Male44351333.913 912873.91.53* (1.47, 1.59)47711431.613 600856.81.67* (1.61, 1.74)33697.70.14
 Female3607982.110 250639.21.54* (1.48, 1.60)38451041.210 021627.31.66* (1.60, 1.73)23859.10.12
Southern Plains
 Male and female21 802931.8367 010951.00.98* (0.97, 0.99)30 4211313.1358 711928.71.41* (1.40, 1.43)8619381.30.43
 Male11 6001127.6179 9641127.91.00 (0.98, 1.02)15 9461568.7175 7781102.21.42* (1.40, 1.45)4346441.10.42
 Female10 202781.8187 046810.20.97* (0.95, 0.98)14 4751116.3182 933790.91.41* (1.39, 1.44)4273334.50.45
Southwest
 Male and female31 488960.9671 400791.91.21* (1.20, 1.23)33 3251017.8669 622789.71.29* (1.27, 1.30)183756.90.08
 Male17 8261183.4348 602928.91.27* (1.25, 1.30)18 8361251.4347 628926.21.35* (1.33, 1.37)1010680.08
 Female13 662780.3322 798672.21.16* (1.14, 1.18)14 489828.1321 994670.41.24* (1.21, 1.26)82747.80.07
Pacific
 Male and female17 088889.21 462 986798.01.11* (1.10, 1.13)20 7791091.51 459 406796.01.37* (1.35, 1.39)3691202.30.26
 Male90111016.3723 661936.01.09* (1.06, 1.11)10 8751238.3721 856933.71.33* (1.30, 1.36)18642220.24
 Female8077785.7739 325685.61.15* (1.12, 1.17)9904971.1737 550683.81.42* (1.39, 1.45)1827185.40.27
East
 Male and female5610753.41 559 827796.00.95* (0.92, 0.97)6172828.71 559 313795.71.04* (1.01, 1.07)56275.30.10
 Male2937854.7750 874958.00.89* (0.86, 0.93)3231939.1750 611957.70.98 (0.94, 1.02)29484.40.09
 Female2673668.1808 953671.31.00 (0.96, 1.04)2941735.4808 702671.01.10* (1.06, 1.14)26867.30.10
Total
 Male and female105 552994.04 873 560801.71.24* (1.23, 1.25)122 6441165.94 857 065798.81.46* (1.45, 1.47)17 092171.90.22
 Male57 5911186.92 404 088952.01.25* (1.23, 1.26)66 3681381.82 395 637948.81.46* (1.44, 1.47)8777194.90.21
 Female47 961839.02 469 472681.11.23* (1.22, 1.24)56 276991.52 461 428678.61.46* (1.45, 1.47)8315152.50.23

Note. AI/AN = American Indian and Alaska Native; CHSDA = Contract Health Service Delivery Area; CI = confidence interval; IHS = Indian Health Service; RR = rate ratio. Analysis includes only persons of non-Hispanic origin. States and years of data excluded because Hispanic origin was not collected on the death certificate: LA: 1990; NH: 1990–1992; OK: 1990–1996. Rates are per 100 000 persons and are age-adjusted to the 2000 US standard population (11 age groups - Census P25-1130). RRs are calculated in SEER*Stat before rounding of rates and may not equal RRs calculated from rates presented in table. Percent regional coverage of AI/AN persons in CHSDA counties to AI/AN persons in all counties: Northern Plains = 64.8%; Alaska = 100%; Southern Plains = 76.3%; Southwest = 91.3%; Pacific Coast = 71.3%; East = 18.2%; total US = 64.2%.

Source. AI/AN Mortality Database (AMD 1990-2009). Data are based on linked Indian Health Service-National Vital Statistics System mortality files.

* P < .05.

The US estimated age-adjusted, all-cause death rate for AI/AN persons rose from 994 per 100 000 (prelink) to 1166 per 100 000 (postlink). Relative to the rate among Whites, this represented an increase in RRs from 1.24 to 1.46. Age-adjusted, all-cause death rates varied by region, with RRs relative to White increasing as little as 8% in the Southwest and 9% in the East to as high as 43% in the Southern Plains (Figure B, available as a supplement to the online version of this article at http://www.ajph.org).

Cancer Cases in NPCR-SEER Data

The IHS linkage with the NPCR and SEER central cancer registries yielded a total of 12 553 matches for males and females in diagnosis years 1999 to 2002. Table 3 shows misclassification measures, sensitivity, and classification ratios by sex, IHS region, urban–rural classification, and CHSDA county status. The percent of cases that linked to IHS and were correctly identified by cancer registries was 77.6% in CHSDA counties and 39.0% in non-CHSDA counties. Sensitivity measures by IHS region in CHSDA counties ranged from 52.8% in the East to 99.4% in Alaska, and in non-CHSDA counties, sensitivity varied from 13.9% in the Southern Plains to 71.4% in Northern Plains. In CHSDA counties, the sensitivity measure in urban areas was 72.9%, and in rural areas, it was 80.8%, whereas in non-CHSDA counties, the sensitivity in urban areas was 41.6%, and in rural areas, it was 52.1%.

Table

TABLE 3— Sensitivity and Classification Ratios for Cases That Linked to IHS: NPCR-SEER Data, United States, 19902009

TABLE 3— Sensitivity and Classification Ratios for Cases That Linked to IHS: NPCR-SEER Data, United States, 19902009

No. AI/AN CHSDA Cases
No. AI/AN Non-CHSDA Cases
Sensitivitya
Classification Ratiob
GroupIHSRegistryIHSRegistryCHSDANon-CHSDACHSDANon-CHSDA
Male and Femalec11 3518811120246977.639.01.292.56
 Male5233404651220477.339.81.292.51
 Female6118476569026577.938.41.282.6
Northern Plainsc2387210641629788.271.41.131.4
Alaskad12601253NANA99.4NA1.01NA
Southern Plainsc381721442964156.213.91.787.22
Southwestc23142062573689.163.21.121.58
Pacific Coastc12281064704086.657.11.151.75
Eastc3451823635552.815.21.96.6
Urbanc4593334952621972.941.61.372.4
Ruralc6758546229215280.852.11.241.92

Note. AI/AN = American Indian and Alaska Native; CHSDA = Contract Health Service Delivery Areas; IHS = Indian Health Service; NA = not applicable; NPCR = National Program of Cancer Registries; SEER = Surveillance, Epidemiology and End Results. Analysis includes AI/AN persons of Hispanic origin. Registries used: (43 states) AK,z AZz, AR, CT,z DE, GA, HI, IDz, IA,z IN,z KY, LA,z ME,z MD, MA,z MI,z MS,z MO, MT,z NEz, NV,z NH, NJ, NM,z NY,z NC,z ND,z OH, OK,z OR,z PA,z RI,z SC,z SD,z TN, TX,z UT,z VT, VA, WA,z WV, WI,z WY,z (zindicate states with at least 1 county designated as CHSDA). Analyses limited to cases who linked to IHS.

Source. Cancer registries in the Centers for Disease Control and Prevention’s NPCR or the National Cancer Institute’s SEER.

a Percentage of IHS cases also coded as an AI/AN person in the cancer registry of all IHS cases.

b Ratio of the total number of cases classified as an AI/AN person in IHS to the total number of cases classified as an AI/AN person in the cancer registry.

c Indicates statisticially significant differences (P < .01) between the number of AI/AN cases in CHSDA and non-CHSDA.

d All counties are CHSDA counties.

The classification ratios also reflected significantly better agreement between IHS and cancer registry incidence data in CHSDA counties than non-CHSDA counties. The classification ratios for males and females in non-CHSDA counties was 2.6, meaning that the IHS linkage identified an additional 156% of AI/AN cases compared with the cancer registries alone, whereas in CHSDA counties only an additional 29% were identified. Classification ratios varied greatly by IHS region and county type. In CHSDA counties, the ratios ranged from 1.01 in Alaska to 1.90 in the East; in non-CHSDA counties, they ranged from 1.40 in the Northern Plains to 7.22 in the Southern Plains. Similarly, classification ratios were higher in urban than in rural counties, particularly in non-CHSDA areas.

National Longitudinal Mortality Study

Table 4 presents sensitivity measures and classification ratios by sex, IHS region, urban–rural classification, and CHSDA county status for the sample of NLMS decedents who self-identified as an AI/AN person on the CPS and died between 1990 and 2002. These findings also supported the hypothesis that AI/AN decedents were significantly more likely to be correctly classified on the death certificate in areas of greatest concentration of AI/AN persons. Nationally, AI/AN decedents were significantly more likely to be correctly classified on death certificates in CHSDA than in non-CHSDA counties. Sensitivity was 68.8% in CHSDA counties and 28.3% in non-CHSDA counties. Classification ratios were 1.18 in CHSDA counties and 1.81 in non-CHSDA counties. Classification varied by IHS region, with the highest levels of misclassification in the East (CHSDA = 3.34 vs non-CHSDA = 1.90) and the lowest in Alaska (CHSDA = 0.96) and the Northern Plains (CHSDA = 1.00 vs non-CHSDA = 1.20). In almost all instances, classification was best in CHSDA counties, with an exception in the East, which was likely because of the very small sample size. Similarly, racial classification was found to be better in rural CHSDA counties (1.08) versus urban CHSDA counties (1.36).

Table

TABLE 4— Sensitivity and Classification Ratios: National Longitudinal Mortality Study, United States, 19902002

TABLE 4— Sensitivity and Classification Ratios: National Longitudinal Mortality Study, United States, 19902002

No. AI/AN CHSDA Deaths
No. AI/AN Non-CHSDA Deaths
SensitivityaClassification Ratiob
GroupCPSDCCPSDCCHSDANon-CHSDACHSDANon-CHSDA
Male and female107399647933368.828.31.181.81
 Male57151623816369.730.21.201.77
 Female50248024117067.826.81.161.84
Northern Plains3703731169985.538.61.001.20
Alaskac208217NANA95.3NA0.96NA
Southern Plains9775745342.629.91.512.00
Southwest212189313179.92.81.141.18
Pacific Coast160132452164.910.71.163.34
East26101819812.328.93.341.90
Urban32127425215749.523.21.361.86
Rural75272222717681.436.91.081.74

Note. AI/AN = American Indian and Alaska Native; CHSDA = Contract Health Service Delivery Area; CPS = Current Population Survey; DC = death certificate; NA = not applicable. Includes AI/AN of Hispanic origin. Analyses are based on weighted data and are limited to decedents who linked to CPS.

a Percentage of CPS AI/AN decedents also coded as AI/AN on the death certificate of all CPS AI/AN decedents.

b Ratio of the total number of decedents classified as AI/AN on the CPS to the total number of decedents classified as AI/AN on the death certificate.

c All counties are CHSDA counties.

Our evaluation of racial misclassification in national cancer incidence and all-cause mortality data added to the evidence that racial misclassification is a widespread problem for the AI/AN population.11–18 Racial misclassification resulted in significant underestimations of all-cause death rates and cancer incidence among AI/AN populations.

We described racial misclassification of AI/AN persons in 2 IHS-related linkages and 1 IHS-independent linkage. In the IHS-related linkages, we observed less misclassification in CHSDA counties than in non-CHSDA counties as predicted, because a large majority of IHS registrants resided in IHS CHSDA counties. Self-reported race was often thought to be the “gold standard,” but the AI/AN sample size in the IHS-independent NLMS was very small and did not allow regional analyses. However, its inclusion confirmed that our results were applicable to other data sets. Again, it confirmed that there was less misclassification of AI/AN persons in CHSDA counties than non-CHSDA counties.

Estimates of aggregate-level agreement between the 2 data systems, classification ratios, further supported our hypothesis that racial classification on death certificates was better in areas with high co-ethnic concentration. In other words, there was significantly better agreement between IHS and death certificate race classification in areas with the greatest concentrations of AI/AN populations: CHSDA counties versus non-CHSDA counties, rural counties versus urban counties, and in Alaska and the Northern Plains versus other regions.

We found that racial misclassification was lowest in Alaska, followed by the Southwest and the Northern Plains consistently across all 3 data sets. These were the regions with the highest percentage of the AI/AN population in CHSDA counties.34 In the IHS-NVSS mortality files, racial misclassification was highest in the Southern Plains and Pacific Coast, whereas in the NPCR-SEER and NLMS data sets, racial misclassification was highest in the Southern Plains and the East. The percentage of the AI/AN population in CHSDA counties varied in these 3 regions, with an estimated 76.3% of the AI/AN population residing in CHSDA counties in the Southern Plains, 71.3% in the Pacific Coast, and 18.2% in the East. In the Southeast, the majority of AIs were not members of federally recognized tribes and were not served by IHS.51 The 2010 Census found that the geographic distribution of the multiple-race AI/AN population differed from the AI/AN-alone population. The percent distribution of multiple-race AI/ANs who lived in the Northeast (13%) was nearly twice as high as the percent distribution of the AI/AN-alone population.52

Several other factors might influence the differential misclassification observed in these regions. These included the availability and types of health services (e.g., some regions had no hospitals or specialty facilities operated by IHS, and certain regions had more health facilities that were operated by tribes by contract or compact instead of directly by IHS), thus making their patient data less likely to be included in the IHS registration database. In regions that did not have a large IHS presence, such as the East, it was reasonable to assume that our estimates were still inaccurate, and misclassification of AI/ANs was even higher than we found. It was also found that as percent blood quantum (degree of AI/AN ancestry) decreased, misclassification increased, suggesting that health care personnel might be completing race information largely based on appearance.15,53–55 Regional data for percent blood quantum is currently not available; therefore, multiple-race AI/AN persons were used as a proxy. Multiple-race AI/AN persons were less likely to live in AI/AN areas (i.e., federal reservation or off-reservation trust land, Oklahoma tribal statistical area, state reservation, or federal- or state-designated American Indian statistical area).56 In addition, the proportion of AI/AN persons residing in urban areas is continually increasing.57 This in turn diminished the likelihood that these individuals would be seen at IHS and included in the IHS registration database; thus, multiple-race AI/AN persons could still be misclassified as non-AI/AN persons. Some ongoing efforts are attempting to address these issues by including tribal enrollment and urban clinic populations in linkages with state surveillance data.51,58,59

Using record level data from the IHS-NVSS mortality files, we found that the effects of misclassification varied considerably by urban–rural classification, CHSDA county status, IHS region, and in some cases, sex. It was possible that other subpopulations (e.g., age groups) were also differentially affected by racial misclassification, further complicating the reporting of health statistics for this population.

Limitations

There were several limitations to consider when interpreting the results presented in this article. First, the linkage with the IHS patient registration database improved the race classification for many AI/AN decedents, but the issues were not completely resolved because AI/AN persons who were not members of federally recognized tribes were not eligible for IHS services and not represented in the IHS registration database. Additionally, some decedents might have been eligible for, but never used IHS services, and therefore, were not included in the IHS registration database. Second, the findings from CHSDA counties highlighted did not represent all AI/AN populations in the United States or in individual IHS regions. Third, race reported in the IHS patient registration database and in the CPS was used as the standard for comparison with the classification from the death certificate and cancer registries. IHS patient registration and CPS race classification were not without error; however, it was assumed that the information provided by an IHS registrant or survey respondent about racial identity was more valid than proxy reporting by a funeral director or medical record. Fourth, the individuals self-reporting AI/AN race in the CPS might not have been enrolled in a tribe or otherwise eligible for IHS services; thus, they likely represented a different population from the AI/AN persons captured in the IHS linkages. Fifth, self-report allowed individuals to identify their race differently at different times or in different settings; this limitation would apply to the population estimates collected by the Census and upon which all population health statistics are based. Finally, although the exclusion of Hispanic AI/AN persons from the rates presented in Table 2 reduced the overall AI/AN deaths by less than 5%, it might disproportionately affect some states.

Conclusions

The availability of accurate mortality and cancer data are essential for planning, implementation, and evaluation of public health strategies and programs to address the magnitude of health disparities in this population. The high rate of misclassification of AI/AN persons has resulted in significant underestimates of cancer incidence and mortality estimates in this population.4–18 The IHS-dependent and IHS-independent linkages that we analyzed indicated that racial misclassification of AI/AN individuals varied by region and was less likely in areas where AI/AN persons were a higher percentage of the population. Therefore, limiting analysis to CHSDA counties is crucial to improving the key health indicators (mortality and cancer incidence) and to improving the overall health status of AI/AN persons.

Human Participant Protection

Centers for Disease Control and Prevention and Indian Health Service determined this project to constitute public health practice and not research; therefore, no formal institutional review board approvals were required.

References

1. Denny CH, Holtzman D, Cobb N. Surveillance for health behaviors of American Indians and Alaska Natives. Findings from the Behavioral Risk Factor Surveillance System, 1997–2000. MMWR Surveill Summ. 2003;52(7):113. MedlineGoogle Scholar
2. Indian Health Service. Trends in Indian Health, 2002–2003 Edition. Washington, DC: Government Printing Office; 2009. Google Scholar
3. Steele CB, Cardinez CJ, Richardson LC, Tom-Orme L, Shaw KM. Surveillance for health behaviors of American Indians and Alaska Natives—findings from the behavioral risk factor surveillance system, 2000–2006. Cancer. 2008;113(5 suppl):11311141. Crossref, MedlineGoogle Scholar
4. Frost F, Taylor V, Fries E. Racial misclassification of Native Americans in a Surveillance, Epidemiology, and End Results cancer registry. J Natl Cancer Inst. 1992;84(12):957962. Crossref, MedlineGoogle Scholar
5. Kwong SL, Perkins CI, Snipes KP, Wright WE. Improving American Indian cancer data in the California Cancer Registry by linkage with the Indian Health Service. J Registry Manage. 1998;25(1):1720. Google Scholar
6. Partin MR, Rith-Najarian SJ, Slater JS, Korn JE, Cobb N, Soler JT. Improving cancer incidence estimates for American Indians in Minnesota. Am J Public Health. 1999;89(11):16731677. LinkGoogle Scholar
7. Swan J, Edwards BK. Cancer rates among American Indians and Alaska Natives: is there a national perspective? Cancer. 2003;98(6):12621272. Crossref, MedlineGoogle Scholar
8. Puukka E, Stehr-Green P, Becker TM. Measuring the health status gap for American Indians/Alaska Natives: getting closer to the truth. Am J Public Health. 2005;95(5):838843. LinkGoogle Scholar
9. Mettlin CJ, Menck HR, Winchester DP, Murphy GP. A comparison of breast, colorectal, lung, and prostate cancers reported to the National Cancer Data Base and the Surveillance, Epidemiology, and End Results Program. Cancer. 1997;79(10):20522061. Crossref, MedlineGoogle Scholar
10. Ioannou GN, Chapko MK, Dominitz JA. Predictors of colorectal cancer screening participation in the United States. Am J Gastroenterol. 2003;98(9):20822091. Crossref, MedlineGoogle Scholar
11. Noymer A, Penner AM, Saperstein A. Cause of death affects racial classification on death certificates. PLoS ONE. 2011;6(1):e15812. Crossref, MedlineGoogle Scholar
12. Rosenberg HM, Maurer JD, Sorlie PD, et al. Quality of death rates by race and Hispanic origin: a summary of current research, 1999. Vital Health Stat 2. 1999;(128):1–13. Google Scholar
13. Arias E, Schauman WS, Eschbach K, Sorlie PD, Backlund E. The validity of race and Hispanic origin reporting on death certificates in the United States. Vital Health Stat 2. 2008; (148):1–23. Google Scholar
14. Harwell TS, Hansen D, Moore KR, Jeanotte D, Gohdes D, Helgerson SD. Accuracy of race coding on American Indian death certificates, Montana 1996-1998. Public Health Rep. 2002;117(1):4449. Crossref, MedlineGoogle Scholar
15. Stehr-Green P, Bettles J, Robertson LD. Effect of racial/ethnic misclassification of American Indians and Alaskan Natives on Washington State death certificates, 1989–1997. Am J Public Health. 2002;92(3):443444. LinkGoogle Scholar
16. Hahn RA, Truman BI, Barker ND. Identifying ancestry: the reliability of ancestral identification in the United States by self, proxy, interviewer, and funeral director. Epidemiology. 1996;7(1):7580. Crossref, MedlineGoogle Scholar
17. Hahn RA, Wetterhall SF, Gay GA, et al. The recording of demographic information on death certificates: a national survey of funeral directors. Public Health Rep. 2002;117(1):3743. Crossref, MedlineGoogle Scholar
18. Poe GS, Powell-Griner E, McLaughlin JK, Placek PJ, Thompson GB, Robinson K. Comparability of the death certificate and the 1986 National Mortality Followback Survey. Vital Health Stat 2. 1993;(118):1–53. Google Scholar
19. Clegg LX, Reichman ME, Hankey BF, et al. Quality of race, Hispanic ethnicity, and immigrant status in population-based cancer registry data: implications for health disparity studies. Cancer Causes Control. 2007;18(2):177187. Crossref, MedlineGoogle Scholar
20. Division of Program Statistics, Indian Health Service. Methodology for Adjusting IHS Mortality Data for Inconsistent Classification of Race-Ethnicity of American Indians and Alaska Natives Between State Death Certificates and IHS Patient Registration Records. Rockville, MD: Indian Health Service; 1996. Google Scholar
21. Indian Health Service. Indian Health Service Fact Sheets - IHS Year 2014 Profile. 2014. Available at: http://www.ihs.gov/newsroom/factsheets/ihsyear2014profile. Accessed March 4, 2014. Google Scholar
22. Indian Health Service. Indian Health Service Fact Sheets - A Quick Look. 2014. Available at: http://www.ihs.gov/newsroom/factsheets/quicklook. Accessed March 4, 2014. Google Scholar
23. Indian Health Service. Indian Health Service Fact Sheets - Health Facilities Construction. 2013. Available at: http://www.ihs.gov/newsroom/factsheets/healthfacilitiesconstruction. Accessed March 4, 2014. Google Scholar
24. Indian Health Service. Indian Health Service Fact Sheets - Contract Health Services. 2013. Available at: http://www.ihs.gov/newsroom/factsheets/contracthealthservices. Accessed March 4, 2014. Google Scholar
25. Indian Health Service. Contract Health Services, Resources: Regulations - 136.22. Available at: http://www.ihs.gov/chs/index.cfm?module=chs_resources_regulations_136_22. Accessed July 8, 2013. Google Scholar
26. Espey DK, Wiggins C, Becker TM, Jim MA, Johnson C, Miller BA. Methods for improving cancer surveillance data in American Indian and Alaska Native populations. Cancer. 2008;113(5 suppl):11201130. Crossref, MedlineGoogle Scholar
27. Burhansstipanov L, Hampton JW, Wiggins CL. Issues in cancer data and surveillance for American Indian and Alaska Native populations. J Registry Manage. 1999;26:153157. Google Scholar
28. Espey DK, Wu X-C, Swan J, et al. Annual report to the nation on the status of cancer, 1975–2004, featuring cancer in American Indians and Alaska Natives. Cancer. 2007;110(10):21192152. Crossref, MedlineGoogle Scholar
29. Wiggins CL, Espey DK, Wingo PA, et al. Cancer among American Indians and Alaska Natives in the United States, 1999–2004. Cancer. 2008;113(5 suppl):11421152. Crossref, MedlineGoogle Scholar
30. Jemal A, Simard EP, Dorell C, et al. Annual report to the nation on the status of cancer, 1975–2009, featuring the burden and trends in human papillomavirus (HPV)-associated cancers and HPV vaccination coverage levels. J Natl Cancer Inst. 2013;105(3):175201. Crossref, MedlineGoogle Scholar
31. Copeland G, Lake A, Firth R, et al. Cancer in North America: 2006–2010. Volume One: Combined Cancer Incidence for the United States, Canada and North America. Springfield, IL: North American Association of Central Cancer Registries, Inc; 2013. Google Scholar
32. Howlader NNA, Krapcho M, Garshell J, et al. SEER cancer statistics review, 1975–2010, National Cancer Institute. Available at: http://seer.cancer.gov/csr/1975_2010. Accessed July 9, 2013. Google Scholar
33. Jim MA, Espey DK, Wiggins CL, Cobb N, Wingo PA. Racial misclassification of American Indians residing near IHS facilities. In: NAACCR 2006 Conference Final Program and Abstracts. Vancouver, BC, Canada: Utratech Printing Ltd; 2006. Google Scholar
34. Espey DK, Jim MA, Richards T, Begay C, Haverkamp D, Roberts D. Methods for improving the quality and completeness of mortality data for American Indians and Alaska Natives. Am J Public Health. 2014;104(6 suppl 3):S286S294. Google Scholar
35. National Center for Health Statistics. About the National Death Index. Available at: http://www.cdc.gov/nchs/data_access/ndi/about_ndi.htm. Accessed April 9, 2012. Google Scholar
36. National Center for Health Statistics. National Vital Statistics System. Available at: http://www.cdc.gov/nchs/nvss/about_nvss.htm. Accessed May 20, 2013. Google Scholar
37. National Center for Health Statistics. NCHS Procedures for Multiple-Race and Hispanic Origin Data: Collection, Coding, Editing, and Transmitting. Hyattsville, MD: National Center for Health Statistics; 2004. Available at: http://www.cdc.gov/nchs/data/dvs/Multiple_race_documentation_5-10-04.pdf. Accessed May 20, 2013. Google Scholar
38. Hankey BF, Ries LA, Edwards BK. The Surveillance, Epidemiology, and End Results program: a national resource. Cancer Epidemiol Biomarkers Prev. 1999;8(12):11171121. MedlineGoogle Scholar
39. Thoburn KK, German RR, Lewis M, Nichols PJ, Ahmed F, Jackson-Thompson J. Case completeness and data accuracy in the Centers for Disease Control and Prevention’s National Program of Cancer Registries. Cancer. 2007;109(8):16071616. Crossref, MedlineGoogle Scholar
40. US Cancer Statistics Working Group. United States Cancer Statistics: 1999–2009 Incidence and Mortality Web-based Report. Atlanta, GA: US Department of Health and Human Services, Centers for Disease Control and Prevention and National Cancer Institute; 2013. Available at: http://www.cdc.gov/uscs. Accessed July 9, 2013. Google Scholar
41. Division of Cancer Prevention and Control, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention. Link Plus: A Suite of Publicly Available Software Programs for Collecting and Processing Cancer Registry Data. Version 2.0. Available at: http://www.cdc.gov/cancer/npcr/tools/registryplus/lp.htm. Accessed July 9, 2013. Google Scholar
42. US Census Bureau. Current Population Survey. Design and Methodology. Technical paper (63RV). 2002. Available at: http://www.census.gov/prod/2002pubs/tp63rv.pdf. Accessed August 16, 2013. Google Scholar
43. Ingram DD, Parker JD, Schenker N, et al. United States Census 2000 population with bridged race categories. Vital Health Stat 2. 2003;(135):1–55. Google Scholar
44. National Cancer Institute. Surveillance, Epidemiology, and End Results (SEER) Program. Statistical Resources. US population data 1969–2011. Available at: http://seer.cancer.gov/popdata. Accessed June 10, 2013. Google Scholar
45. Aschengrau A, Seage G. Essentials of Epidemiology in Public Health. Sudbury, MA: Jones and Bartlett Publishers; 2003. Google Scholar
46. National Cancer Institute. Surveillance Research Program, National Cancer Institute SEER*Stat software. Version 8.1.2. Available at: http://seer.cancer.gov/seerstat. Accessed September 17, 2013. Google Scholar
47. Anderson RN, Rosenberg HM. Report of the second workshop on age adjustment. Vital Health Stat 4. 1998;(30)137. Google Scholar
48. Anderson RN, Rosenberg HM. Age Standardization of Death Rates: Implementation of the Year 2000 Standard. Atlanta, GA: National Center for Health Statistics; 1998. Google Scholar
49. Edwards BK, Noone AM, Mariotto AB, et al. Annual report to the nation on the status of cancer, 1975–2010, featuring prevalence of comorbidity and impact on survival among persons with lung, colorectal, breast, or prostate cancer. Cancer. 2013;Epub ahead of print. Google Scholar
50. Tiwari RC, Clegg LX, Zou Z. Efficient interval estimation for age-adjusted cancer rates. Stat Methods Med Res. 2006;15(6):547569. Crossref, MedlineGoogle Scholar
51. Yankaskas BC, Knight KL, Fleg A, Rao C. Misclassification of American Indian race in state cancer data among non-federally recognized Indians in North Carolina. J Registry Manag. 2009;36(1):711. MedlineGoogle Scholar
52. US Census Bureau. The American Indian and Alaska Native Population: 2010 (C2010BR-10). 2012. Available at: http://www.census.gov/prod/cen2010/briefs/c2010br-10.pdf. Accessed June 10, 2013. Google Scholar
53. Sugarman JR, Soderberg R, Gordon JE, Rivara FP. Racial misclassification of American Indians: its effect on injury rates in Oregon, 1989 through 1990. Am J Public Health. 1993;83(5):681684. LinkGoogle Scholar
54. Frost F, Tollestrup K, Ross A, Sabotta E, Kimball E. Correctness of racial coding of American Indians and Alaska Natives on the Washington State death certificate. Am J Prev Med. 1994;10(5):290294. Crossref, MedlineGoogle Scholar
55. Thoroughman DA, Frederickson D, Cameron HD, Shelby LK, Cheek JE. Racial misclassification of American Indians in Oklahoma State surveillance data for sexually transmitted diseases. Am J Epidemiol. 2002;155(12):11371141. Crossref, MedlineGoogle Scholar
56. US Census Bureau. 2010 Census Restricting Data (Public Law 94-171) Summary File - Technical Documentation (PL/10-2 (RV). 2011. Available at: http://www.census.gov/prod/cen2010/doc/pl94-171.pdf. Accessed June 10, 2013. Google Scholar
57. US Census Bureau. 2010 Census Briefs: The American Indian and Alaska Native Population: 2010. Available at: http://www.census.gov/prod/cen2010/briefs/c2010br-10.pdf. Accessed June 17, 2013. Google Scholar
58. Hoopes MJ, Taualii M, Weiser TM, Brucker R, Becker TM. Including self-reported race to improve cancer surveillance data for American Indians and Alaska Natives in Washington state. J Registry Manag. 2010;37(2):4348. MedlineGoogle Scholar
59. Johnson JC, Soliman AS, Tadgerson D, et al. Tribal linkage and race data quality for American Indians in a state cancer registry. Am J Prev Med. 2009;36(6):549554. Crossref, MedlineGoogle Scholar

Related

No related items

TOOLS

SHARE

ARTICLE CITATION

Melissa A. Jim, MPH, Elizabeth Arias, PhD, Dean S. Seneca, MPH, MCURP, Megan J. Hoopes, MPH, Cheyenne C. Jim, MS, Norman J. Johnson, PhD, and Charles L. Wiggins, PhDMelissa A. Jim is with the Division of Cancer Prevention and Control, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Albuquerque, NM. Elizabeth Arias is with the Division of Vital Statistics, National Center for Health Statistics, Centers for Disease Control and Prevention, Hyattsville, MD. Dean S. Seneca is with the Division of Public Health Capacity Development, Office for State, Tribal, Local and Territorial Support, Centers for Disease Control and Prevention, Atlanta, GA. Megan J. Hoopes is with the Northwest Tribal Epidemiology Center, Northwest Portland Area Indian Health Board, Portland, OR. Cheyenne C. Jim is with Immunization Services Division, National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Albuquerque, NM. Norman J. Johnson is with the National Longitudinal Mortality Study Branch, US Census Bureau, Suitland, MD. Charles L. Wiggins is with the New Mexico Tumor Registry, University of New Mexico Cancer Center, Albuquerque. “Racial Misclassification of American Indians and Alaska Natives by Indian Health Service Contract Health Service Delivery Area”, American Journal of Public Health 104, no. S3 (June 1, 2014): pp. S295-S302.

https://doi.org/10.2105/AJPH.2014.301933

PMID: 24754617