Using name lists to infer Asian racial/ethnic subgroups in the healthcare setting

Med Care. 2010 Jun;48(6):540-6. doi: 10.1097/MLR.0b013e3181d559e9.

Abstract

Background: Many clinical data sources used to assess health disparities lack Asian subgroup information, but do include patient names.

Objective: This project validates Asian surname and given name lists for identifying Asian subgroups (Asian Indian, Chinese, Filipino, Japanese, Korean, Vietnamese) in clinical records.

Subjects: We used 205,000 electronic medical records from the Palo Alto Medical Foundation, a multipayer, outpatient healthcare organization in Northern California, containing patient self-identified race/ethnicity information.

Research design: Name lists were used to infer racial/ethnic subgroup for patients with self-identified race/ethnicity data. Using self-identification as the "gold standard," sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of classification by name were calculated. Clinical outcomes (obesity and hypertension) were compared for name-identified versus self-identified racial/ethnic groups.

Results: With classification using surname and given name, the overall sensitivities ranged from 0.45 to 0.76 for the 6 racial/ethnic groups when no race data are available, and 0.40 to 0.79 when the broad racial classification of "Asian" is known. Specificities ranged from 0.99 to 1.00. PPV and NPV depended on the prevalence of Asians in the population. The lists performed better for men than women and better for persons aged 65 and older. Clinical outcomes were very similar for name-identified and self-identified racial/ethnic groups.

Conclusions: In a clinical setting with a high prevalence of Asian Americans, name-identified and self-identified racial/ethnic groups had similar clinical characteristics. Asian name lists may be a valid substitute for identifying Asian subgroups when self-identification is unavailable.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Age Factors
  • Aged
  • Asian / classification
  • Asian / statistics & numerical data*
  • California / epidemiology
  • Ethnicity / classification*
  • Ethnicity / statistics & numerical data*
  • Female
  • Hospital Information Systems / statistics & numerical data*
  • Humans
  • Male
  • Medical Records / statistics & numerical data*
  • Middle Aged
  • Names*
  • Patient Identification Systems
  • Sex Factors
  • Surveys and Questionnaires
  • Young Adult