Background: Geocoding is often used in epidemiologic studies to map residences with geographic information systems (GIS). The accuracy of the method is usually not determined.
Methods: We collected global positioning system (GPS) measurements at homes in a case-control study of non-Hodgkin lymphoma in Iowa. We geocoded the addresses by 2 methods: (1) in-house, using ArcView 3.2 software and the U.S. Census Bureau TIGER 2000 street database; and (2) automated geocoding by a commercial firm. We calculated the distance between the geocoded and GPS location (positional error) overall and separately for homes within towns and outside (rural). We evaluated the error in classifying homes with respect to their proximity to crop fields.
Results: Overall, the majority of homes were geocoded with positional errors of less than 100 m by both methods (ArcView/TIGER 2000, median = 62 m [interquartile range = 39-103]; commercial firm, median = 61 m [interquartile range = 35-137]). For town residences, the percent geocoded with errors of </=100 m was 81% for ArcView/TIGER 2000 and 84% for the commercial firm. For rural residences, a smaller percent of addresses were geocoded with this level of accuracy, especially by the commercial firm (ArcView/TIGER 2000, 56%; commercial firm, 28%). Geocoding errors affected our classification of homes according to their proximity to agricultural fields at 100 m, but not at greater distances (250-500 m).
Conclusions: Our results indicate greater positional errors for rural addresses compared with town addresses. Using a commercial firm did not improve accuracy compared with our in-house method. The effect of geocoding errors on exposure classification will depend on the spatial variation of the exposure being studied.