Some of the names in the raw data set that were clearly Chinese had been coded as another ethnicity or lacked an
ethnicity code. The ethnicities of these cases were changed to Chinese.
The UB uses the Centers for Disease Control Race and
Ethnicity Code Set, which has a hierarchical structure that provides for detailed reporting of race and ethnicity but also for "rolled up" codes that are compatible with the current OMB standard.
Nonetheless, comparisons of the EDB race/ ethnicity codes with self-reported race/ ethnicity data from the Medicare Current Beneficiary Survey (MCBS) indicated that identification of Hispanics, Asians/Pacific Islanders, and American Indians/Alaska Natives was still quite incomplete and might result in biased analyses (Arday et al., 2000).
The self-reported race/ ethnicity codes from these data are the SELFRACE variable and constitute the gold standard.
According to the Pentagon's own website, the military began "overlaying
ethnicity codes and telephone numbers" in 2004.