<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1476-069X-6-10</ui>
   <ji>1476-069X</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Quantifying geocode location error using GIS methods</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Strickland</snm>
               <mi>J</mi>
               <fnm>Matthew</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>MStrickland@cdc.gov</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Siffel</snm>
               <fnm>Csaba</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>CSiffel@cdc.gov</email>
            </au>
            <au id="A3" ce="yes">
               <snm>Gardner</snm>
               <mi>R</mi>
               <fnm>Bennett</fnm>
               <insr iid="I1"/>
               <email>BRGardner@cdc.gov</email>
            </au>
            <au id="A4" ce="yes">
               <snm>Berzen</snm>
               <mi>K</mi>
               <fnm>Alissa</fnm>
               <insr iid="I4"/>
               <email>ABerzen@cdc.gov</email>
            </au>
            <au id="A5" ce="yes">
               <snm>Correa</snm>
               <fnm>Adolfo</fnm>
               <insr iid="I1"/>
               <email>ACorrea@cdc.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, Georgia, USA</p>
            </ins>
            <ins id="I2">
               <p>Battelle Centers for Public Health Research and Evaluation, Atlanta, Georgia, USA</p>
            </ins>
            <ins id="I3">
               <p>Computer Sciences Corporation, Atlanta, Georgia, USA</p>
            </ins>
            <ins id="I4">
               <p>Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA</p>
            </ins>
         </insg>
         <source>Environmental Health</source>
         <issn>1476-069X</issn>
         <pubdate>2007</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>10</fpage>
         <url>http://www.ehjournal.net/content/6/1/10</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17408484</pubid>
               <pubid idtype="doi">10.1186/1476-069X-6-10</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>17</day>
               <month>5</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>04</day>
               <month>4</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>04</day>
               <month>4</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Strickland et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The Metropolitan Atlanta Congenital Defects Program (MACDP) collects maternal address information at the time of delivery for infants and fetuses with birth defects. These addresses have been geocoded by two independent agencies: (1) the Georgia Division of Public Health Office of Health Information and Policy (OHIP) and (2) a commercial vendor. Geographic information system (GIS) methods were used to quantify uncertainty in the two sets of geocodes using orthoimagery and tax parcel datasets.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>We sampled 599 infants and fetuses with birth defects delivered during 1994&#8211;2002 with maternal residence in either Fulton or Gwinnett County. Tax parcel datasets were obtained from the tax assessor's offices of Fulton and Gwinnett County. High-resolution orthoimagery for these counties was acquired from the U.S. Geological Survey. For each of the 599 addresses we attempted to locate the tax parcel corresponding to the maternal address. If the tax parcel was identified the distance and the angle between the geocode and the residence were calculated. We used simulated data to characterize the impact of geocode location error. In each county 5,000 geocodes were generated and assigned their corresponding Census 2000 tract. Each geocode was then displaced at a random angle by a random distance drawn from the distribution of observed geocode location errors. The census tract of the displaced geocode was determined. We repeated this process 5,000 times and report the percentage of geocodes that resolved into incorrect census tracts.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Median location error was less than 100 meters for both OHIP and commercial vendor geocodes; the distribution of angles appeared uniform. Median location error was approximately 35% larger in Gwinnett (a suburban county) relative to Fulton (a county with urban and suburban areas). Location error occasionally caused the simulated geocodes to be displaced into incorrect census tracts; the median percentage of geocodes resolving into incorrect census tracts ranged between 4.5% and 5.3%, depending upon the county and geocoding agency.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Geocode location uncertainty can be estimated using tax parcel databases in a GIS. This approach is a viable alternative to global positioning system field validation of geocodes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Federal, state, and local public health surveillance systems often collect residential address information as part of their surveillance activities. Prior to spatial statistical analyses, residential address information must be geocoded (e.g., latitude and longitude coordinates), a process typically accomplished through the use of electronic street databases <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Public health applications of geocoded data include defining a study population, linking health outcomes with environmental hazards, and investigating disease clusters. Although the hope is that all geocodes correctly reflect the true geographic location of the addresses, some geocodes are likely inaccurate due to errors in street databases, errors in residential address information, algorithms that permit imperfect address matches (i.e., the "match rate," or how similar the submitted address must be to the address in the database), and the distance geocodes are placed from the street centerline <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. There is generally a trade-off between the proportion of missing geocodes and geocode accuracy; lenient match rates tend to increase the proportion of successfully geocoded addresses at the expense of geocode accuracy <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. In addition to carefully collected residential address information, street databases that are current, free of errors, and spatially accurate should help reduce location error <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>Because geocode inaccuracies can affect spatial analyses, <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> understanding the magnitude of location error in geocoded data is desirable. One approach is to travel to the address location and verify coordinates using a global positioning system (GPS) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Although this approach is accurate it is also resource-intensive, particularly when the geographic area of interest is large. Whereas a GPS may be the only viable option for geocoding in remote settings [e.g., <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>], in the U.S. alternative options for geocode validation are generally available, and those overseeing surveillance systems may not wish to, or have the resources to, verify large numbers of addresses using a GPS. In this paper we describe an alternative computer-based method <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> to verify address locations for a sample of birth defect records in metropolitan Atlanta.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Population and sample</p>
            </st>
            <p>The Metropolitan Atlanta Congenital Defects Program (MACDP) is a population-based birth defects surveillance system operated by Centers for Disease Control and Prevention since 1968 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. MACDP actively ascertains infants and fetuses with birth defects born to mothers residing in one of five metropolitan Atlanta counties at delivery (Clayton, Cobb, DeKalb, Fulton, and Gwinnett). The address of the maternal residence is recorded for each case and is subsequently sent to a commercial vendor for geocoding. Independent of MACDP geocoding efforts, the Office of Health Information and Policy (OHIP), Georgia Division of Public Health, has geocoded the live birth cohort in Georgia since 1994.</p>
            <p>The initial phase of geocoding is similar for both the commercial vendor and OHIP. Although the tolerance for accepting imperfect matches may differ, both agencies begin by comparing (in batch mode) submitted addresses with addresses in a street database. The commercial vendor uses street databases distributed by Geographic Data Technology (now Tele Atlas), whereas OHIP uses street databases distributed by Group 1 software. Street databases contain many road segments; each segment has two address ranges (one side of the road has an even numbered address range and the other side has an odd numbered address range). If the submitted address falls within a range then a geocode is generated by interpolating between the two known addresses at opposite ends of the road segment. When a street-level match cannot be achieved the software assigns a geocode corresponding to a polygon centroid. The commercial vendor accepts centroid matches up to the 5-digit ZIP code level and OHIP accepts centroid matches up to the census tract level. After batch geocoding, the commercial vendor manually compares each address not successfully geocoded to a list of potential addresses. If a potential address is judged to be a reasonable match the record is manually geocoded. OHIP performs a spatial imputation on addresses that are not geocoded successfully <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Imputation begins by estimating the county for all remaining addresses. Using vital records data, the expected number of births by race and census tract is calculated for each county. Addresses are imputed into census tracts that have less than the expected number of geocoded records and are assigned the corresponding centroid.</p>
            <p>As part of ongoing surveillance activities, MACDP links its birth defects records with OHIP records using a deterministic approach based on several variables including names, dates, and addresses. As a result, each successfully linked record has two independently created geocodes &#8211; one OHIP geocode and one commercial vendor geocode. We defined the study population, based on MACDP records, as all infants with birth defects delivered during 1994&#8211;2002 with maternal residence at delivery in Fulton or Gwinnett County. From this study population, we randomly selected 665 records meeting the following criteria: 1) successful link with OHIP records, 2) address on the MACDP record matched address on the OHIP record, and 3) both OHIP and the commercial vendor attempted to geocode the address. This study was approved by the CDC institutional review board and was conducted in accordance with the Declaration of Helsinki of the World Medical Association.</p>
         </sec>
         <sec>
            <st>
               <p>Geographic data</p>
            </st>
            <p>A "shapefile" is a set of computer files used to store geographic information (e.g., census tract boundaries) and tables of attributes associated with the geographic information (e.g., census tract housing and demographic characteristics) <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Shapefiles can be manipulated using a geographic information system (GIS); ArcView 8.3 (ESRI, Redlands, CA) was used in this project. Tax parcel shapefiles were obtained from the Fulton and Gwinnett County tax assessor's offices. These shapefiles contain polygons corresponding to the location and dimensions of each taxable land parcel in the county. The address of each parcel is stored in its attribute table.</p>
            <p>We also obtained high-resolution (0.3 meter resolution per pixel) digital orthoimages from the U.S. Geological Survey (USGS). An orthoimage is a remotely sensed digital photograph of the earth's surface that has been mathematically manipulated to minimize distortion due to terrain relief and sensor orientation <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. USGS estimates that the design accuracy of its orthoimages does not exceed a root mean squared error of 3-meters in diagonal <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Location error assessment</p>
            </st>
            <p>For each of the 665 records we attempted to identify the tax parcel corresponding to the maternal address. When the parcel was identified, a point was placed on the residence located within the parcel. We elected not to place points when tax parcels contained many buildings (i.e., large apartment complexes) because there was no obvious location for point placement. During validation, we identified a subset of records that presumably had the incorrect county recorded in the MACDP database. We examined these addresses further using the U.S. Postal Service online lookup database to infer the correct county. After excluding records with incorrect county codes (n = 66), the final sample consisted of 599 records.</p>
            <p>The geographic coordinates of the placed points were determined using ArcView and represent the "gold standard." For each validated address, we calculated both the distance and the angle between the gold standard and each of the two geocodes applying a spherical earth model <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We assumed a constant elevation of 300 meters above sea level. Figure <figr fid="F1">1</figr> displays tax parcel data overlain on orthoimagery; we define location error as the distance between the geocode and the gold standard (residence). We report the empirical cumulative distribution of location errors for both the OHIP and commercial vendor geocodes. Rose plots were generated to inspect whether the distribution of angles appeared uniform, and Rayleigh tests were performed to evaluate the null hypothesis of a uniform circular distribution of angles. We created rose plots by stratifying addresses according to the angle of the location error. Each stratum, or "bin," correspond to a 15&#176; increment (i.e., 0&#176;-15&#176;, 15&#176;-30&#176;, etc.). Each bin has its own "petal," which varies in size according to the number of addresses within the bin.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Tax Parcel Data Overlain on Orthoimagery</p>
               </caption>
               <text>
                  <p><b>Tax Parcel Data Overlain on Orthoimagery</b>. The distance between the geocode and the residence (gold standard) is the "location error" for the address.</p>
               </text>
               <graphic file="1476-069X-6-10-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Census tract point-in-polygon simulations</p>
            </st>
            <p>To characterize the impact of geocode location error on census tract assignment we generated 5,000 random geocodes within each county and used a point-in-polygon routine to determine the Census 2000 tract for each geocode. Each geocode was then displaced at a random angle from a uniform (0, 2&#960;) distribution by a random distance drawn from an empirical distribution of geocode location errors (as reported in the Results). We then determined the census tract for each displaced geocode. We conducted 5,000 such simulations for each geocoding agency within each county and we report the percentage of geocodes that resolved into the incorrect census tract (median, 2.5%, and 97.5% of the 5,000 simulations). All simulations were performed using the Universal Transverse Mercator (UTM) Zone 16 North map projection with the software package R 2.4.0 (R Core Development Team).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>The commercial vendor and OHIP created address-level geocodes for 96.0% and 91.7% of the sample, respectively (Table <tblr tid="T1">1</tblr>). Although 435 addresses included in the sample were located (72.6%), gold standard points were placed for only 376 addresses (62.8%). Points were not placed for 59 addresses (9.8%) because the parcels contained large, multi-unit housing complexes.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Frequencies of Geocoding Success and Geocode Validation Outcomes for 599 Selected Addresses, by County.</p>
            </caption>
            <tblbdy cols="7">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Fulton County (n = 339)</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Gwinnett County (n = 260)</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Both Counties (n = 599)</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>n</p>
                  </c>
                  <c ca="center">
                     <p>%</p>
                  </c>
                  <c ca="center">
                     <p>n</p>
                  </c>
                  <c ca="center">
                     <p>%</p>
                  </c>
                  <c ca="center">
                     <p>n</p>
                  </c>
                  <c ca="center">
                     <p>%</p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Address-level geocodes</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>OHIP</p>
                  </c>
                  <c ca="center">
                     <p>321</p>
                  </c>
                  <c ca="center">
                     <p>94.7</p>
                  </c>
                  <c ca="center">
                     <p>228</p>
                  </c>
                  <c ca="center">
                     <p>87.7</p>
                  </c>
                  <c ca="center">
                     <p>549</p>
                  </c>
                  <c ca="center">
                     <p>91.7</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Commercial vendor</p>
                  </c>
                  <c ca="center">
                     <p>324</p>
                  </c>
                  <c ca="center">
                     <p>95.6</p>
                  </c>
                  <c ca="center">
                     <p>251</p>
                  </c>
                  <c ca="center">
                     <p>96.5</p>
                  </c>
                  <c ca="center">
                     <p>575</p>
                  </c>
                  <c ca="center">
                     <p>96.0</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Addresses located using GIS</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>House/small multi-unit complex</p>
                  </c>
                  <c ca="center">
                     <p>194</p>
                  </c>
                  <c ca="center">
                     <p>57.2</p>
                  </c>
                  <c ca="center">
                     <p>182</p>
                  </c>
                  <c ca="center">
                     <p>70.0</p>
                  </c>
                  <c ca="center">
                     <p>376</p>
                  </c>
                  <c ca="center">
                     <p>62.8</p>
                  </c>
               </r>
               <r>
                  <c indent="1" ca="left">
                     <p>Moderate/large multi-unit complex</p>
                  </c>
                  <c ca="center">
                     <p>41</p>
                  </c>
                  <c ca="center">
                     <p>12.1</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>6.9</p>
                  </c>
                  <c ca="center">
                     <p>59</p>
                  </c>
                  <c ca="center">
                     <p>9.8</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Unverified addresses</p>
                  </c>
                  <c ca="center">
                     <p>104</p>
                  </c>
                  <c ca="center">
                     <p>30.7</p>
                  </c>
                  <c ca="center">
                     <p>60</p>
                  </c>
                  <c ca="center">
                     <p>23.1</p>
                  </c>
                  <c ca="center">
                     <p>164</p>
                  </c>
                  <c ca="center">
                     <p>27.4</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Selected percentiles from the empirical cumulative distributions of location error, stratified by county, are presented for both OHIP and commercial vendor geocodes in Table <tblr tid="T2">2</tblr>. Median location error was 71 meters for the commercial vendor and 91 meters for OHIP geocodes. Median location error was approximately 35% greater in Gwinnett County than in Fulton County. This finding was anticipated, as Gwinnett County is predominantly suburban whereas Fulton County has a mix of urban and suburban areas. Rose plots (Figure <figr fid="F2">2</figr>) were constructed by placing each record into one of 24 15&#176; bins according to the angle of the location error. Inspection of the rose plots did not suggest systematic bias in the direction of the geocode relative to the gold standard. There was no strong evidence to reject the null hypothesis of a uniform circular distribution. Rayleigh tests, which were performed for each geocoding agency (data pooled over counties) as well as for each combination of county and geocoding agency, were not significant (all p-values > 0.2).</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Distributions of Geocode Location Error Angles</p>
            </caption>
            <text>
               <p><b>Distributions of Geocode Location Error Angles</b>. Rose plots portraying the distribution of angles between the geocode and the residence (using 15&#176; bins) for OHIP and commercial vendor geocodes.</p>
            </text>
            <graphic file="1476-069X-6-10-2"/>
         </fig>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Selected Percentiles From the Empirical Cumulative Distributions of Location Error. All distances reported in meters.</p>
            </caption>
            <tblbdy cols="7">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Fulton County</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Gwinnett County</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Both Counties</p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Percentile</p>
                  </c>
                  <c ca="center">
                     <p>Commercial vendor (n = 189)</p>
                  </c>
                  <c ca="center">
                     <p>OHIP (n = 189)</p>
                  </c>
                  <c ca="center">
                     <p>Commercial vendor (n = 178)</p>
                  </c>
                  <c ca="center">
                     <p>OHIP (n = 169)</p>
                  </c>
                  <c ca="center">
                     <p>Commercial vendor (n = 367)</p>
                  </c>
                  <c ca="center">
                     <p>OHIP (n = 358)</p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>25%</p>
                  </c>
                  <c ca="center">
                     <p>39</p>
                  </c>
                  <c ca="center">
                     <p>38</p>
                  </c>
                  <c ca="center">
                     <p>44</p>
                  </c>
                  <c ca="center">
                     <p>61</p>
                  </c>
                  <c ca="center">
                     <p>42</p>
                  </c>
                  <c ca="center">
                     <p>48</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>50%</p>
                  </c>
                  <c ca="center">
                     <p>61</p>
                  </c>
                  <c ca="center">
                     <p>77</p>
                  </c>
                  <c ca="center">
                     <p>84</p>
                  </c>
                  <c ca="center">
                     <p>104</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>91</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>75%</p>
                  </c>
                  <c ca="center">
                     <p>124</p>
                  </c>
                  <c ca="center">
                     <p>136</p>
                  </c>
                  <c ca="center">
                     <p>141</p>
                  </c>
                  <c ca="center">
                     <p>171</p>
                  </c>
                  <c ca="center">
                     <p>147</p>
                  </c>
                  <c ca="center">
                     <p>155</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>90%</p>
                  </c>
                  <c ca="center">
                     <p>242</p>
                  </c>
                  <c ca="center">
                     <p>281</p>
                  </c>
                  <c ca="center">
                     <p>311</p>
                  </c>
                  <c ca="center">
                     <p>311</p>
                  </c>
                  <c ca="center">
                     <p>281</p>
                  </c>
                  <c ca="center">
                     <p>301</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>95%</p>
                  </c>
                  <c ca="center">
                     <p>322</p>
                  </c>
                  <c ca="center">
                     <p>361</p>
                  </c>
                  <c ca="center">
                     <p>378</p>
                  </c>
                  <c ca="center">
                     <p>373</p>
                  </c>
                  <c ca="center">
                     <p>352</p>
                  </c>
                  <c ca="center">
                     <p>369</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>99%</p>
                  </c>
                  <c ca="center">
                     <p>573</p>
                  </c>
                  <c ca="center">
                     <p>693</p>
                  </c>
                  <c ca="center">
                     <p>747</p>
                  </c>
                  <c ca="center">
                     <p>738</p>
                  </c>
                  <c ca="center">
                     <p>664</p>
                  </c>
                  <c ca="center">
                     <p>774</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Max</p>
                  </c>
                  <c ca="center">
                     <p>1,389</p>
                  </c>
                  <c ca="center">
                     <p>20,677</p>
                  </c>
                  <c ca="center">
                     <p>1,500</p>
                  </c>
                  <c ca="center">
                     <p>2,324</p>
                  </c>
                  <c ca="center">
                     <p>1,500</p>
                  </c>
                  <c ca="center">
                     <p>20,677</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>The magnitude of location error reported in Table <tblr tid="T2">2</tblr> occasionally causes geocodes to be placed into incorrect census tracts. The point-in-polygon simulations for Fulton County using the commercial vendor location error caused 4.5% (4.0%, 5.0%) of the randomly generated geocodes to be placed into incorrect census tracts. OHIP location error caused incorrect census tract assignment for 5.3% (4.8%, 5.9%) of the geocodes in Fulton County. Results were similar for Gwinnett County; 4.8% (4.3%, 5.4%) of geocodes were placed into incorrect census tracts because of commercial vendor location error, and OHIP location error caused 5.2% (4.7%, 5.8%) of geocodes to be assigned the incorrect census tract.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Location error is intrinsic to both the OHIP and commercial vendor geocodes. Interpolation is likely a major component of this error, as interpolation is necessary whenever the submitted address is not demarcated on the street map. Because only a small proportion of addresses are demarcated, interpolation occurs frequently. Observed differences in geocoding success and in location error magnitude between the commercial vendor and OHIP geocodes may be due to a number of factors, including the quality of the street database, the correctness of the submitted addresses, the ability of the software to match submitted addresses with addresses in the database (e.g., recognize that "Cir" is short for "Circle"), the tolerance for geocoding imperfect matches (i.e., the match rate), and the methodology used to geocode addresses that were not geocoded in batch mode. Although we were unable to quantify the relative contribution of each of these factors, it is likely that much of the difference in the percentage of addresses successfully geocoded is attributable to the manual address matching performed by the commercial vendor.</p>
         <p>Although the aim of our study was to estimate the distributions of geocode location error, there are additional errors to consider when analyzing geocoded address data. The addresses unsuccessfully geocoded by the commercial vendor and/or OHIP (and therefore excluded from analyses) may result in selection bias. If the probability that a geocode is missing is differential across space then this can bias the relationship between spatially-varying covariates and disease incidence <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Furthermore, a high proportion of successfully geocoded addresses, although reassuring, does not preclude this selection bias. An additional source of error arises when spurious geocodes are created from fictitious addresses. This can occur when an imperfectly recorded address happens to fall within a range of viable addresses.</p>
         <p>Our study design excluded discordant addresses in the linked MACDP and OHIP database. This approach ensured that comparisons between the two sets of geocodes were fair and presumably reduced the number of low quality addresses that were validated (an address that is identical in both the MACDP and OHIP databases is likely to be correct). This design, however, may have underestimated the true distribution of geocode location error for both commercial vendor and OHIP geocodes. Many imperfect addresses, which were excluded from our study (because the MACDP and OHIP addresses were discordant), were nevertheless geocoded to the address-level by both agencies. The distribution of geocode location error for these addresses may be large relative to the distribution for the set of addresses selected for validation. Additionally, the 164 addresses selected for validation that we were unable to locate in the tax parcels (Table <tblr tid="T1">1</tblr>) frequently had address-level geocodes. Although it is probable that many of these addresses correspond to apartment complex roads not delineated in the shapefile, the true distribution of geocode location error for these geocodes may be larger than the distributions presented in Table <tblr tid="T2">2</tblr>.</p>
         <p>We used a simulation-based approach to evaluate the potential impact of location error on census tract assignment. The percentage of geocodes displaced into incorrect census tracts was similar for Fulton County and Gwinnett County, even though median location error was approximately 35% larger in Gwinnett County (Table <tblr tid="T2">2</tblr>). This finding is likely due to census tract size (tracts tend to be larger in the predominantly suburban Gwinnett County). A larger location error is needed to displace a geocode outside of its original census tract in Gwinnett.</p>
         <p>Presumably, the ramifications of geocode location error will vary depending upon the study design. For example, in air pollution epidemiology, designs using ambient air quality monitors to assign pollution levels to cohort members [e.g., <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>] may not be strongly impacted by the magnitude of geocode location error reported in Table <tblr tid="T2">2</tblr>, whereas fine-scale studies of traffic proximity [e.g., <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>] may be strongly impacted by this magnitude of location error. In both of these settings, however, an empirical estimate of geocode location error, specific to the local setting in which the study was conducted, can be used to formally evaluate the consequences of this source of measurement error on epidemiologic results. Tenuous speculations about measurement error can be replaced with inferences from more rigorous statistical approaches.</p>
         <p>Relative to street databases, tax parcel shapefiles offer two main advantages: 1) fictitious addresses that happen to fall within ranges of legitimate addresses are not geocoded, and 2) there is no need to interpolate between address ranges. Tax parcels, however, have certain disadvantages as well. They are created to assist the county tax assessor rather than to geocode addresses. Accordingly, an apartment complex encompassing numerous roads appears as one polygon because this is the taxable parcel. We were unable to locate 164 addresses (Table <tblr tid="T1">1</tblr>), and it is probable that many of these addresses correspond to apartment complex roads not delineated in the shapefile. An additional disadvantage is the limited availability of tax parcel shapefiles (as of June 2005 only two of the five counties covered by MACDP had tax parcel shapefiles). Some GIS software packages offer capabilities for batch parcel geocoding (e.g., the "One Field" style of locator in ArcGIS); as tax parcel shapefiles become increasingly available parcel-based geocoding may become more feasible. Building "footprints," where polygons in the shapefile correspond to building dimensions, also offer possibilities for geocoding and geocode validation.</p>
         <p>Cost is also an important consideration &#8211; whereas 25&#8211;30 addresses per hour can be validated using tax parcels, online commercial geocoding services offer near real-time geocoding for less than two cents per address. Tax parcels are therefore not a viable alternative to batch street database geocoding. Tax parcel validation, however, is more efficient than GPS field validation. Although the number of addresses per hour that can be field validated will vary greatly depending upon address proximity, it would be nearly impossible to field validate 25 addresses per hour in Atlanta. Past experiences at MACDP suggest rates of 5&#8211;10 addresses per hour are more typical.</p>
         <p>Applications of tax parcel datasets in environmental health extend beyond geocode generation and validation. For example, tax parcel datasets and housing characteristics have been combined to identify high priority regions of lead poisoning risk <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The 2004 Olympic and Para Olympic environmental health inspection program <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> utilized tax parcel data in their GIS applications. Tax parcel datasets are also routinely used in urban planning, and some investigators have used tax parcels to model the environmental impact of urban development <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Geocode uncertainty can be quantified using tax parcel datasets, high resolution orthoimagery, and GIS. In metropolitan Atlanta, the median geocode location error was less than 100 meters for both the OHIP and commercial vendor geocodes, and there was no evidence of systematic bias in the angle of the location error. Geocode location error caused approximately 5% of the randomly generated geocodes to be placed into the incorrect census tract. We contend that the motivation for understanding the distribution of geocode location error parallels the motivation for assessing disease misclassification or exposure measurement error in epidemiological studies. Geocodes have an important role in environmental health research and surveillance, as they are frequently used to define the study population and to link health data with environmental hazards. Furthermore, many spatial statistical methods use geocodes, and the validity of these approaches may be compromised by location error. Further work is needed to evaluate the impact of location error on statistical methods and surveillance applications.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author(s) declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MJS conducted the addresses validation and drafted the manuscript. CS helped design the study, draft the manuscript, and is responsible for the maintenance of MACDP geocoded data. BRG examined a subset of records to verify whether the addresses fell within the counties and assisted with the creation of Figures. AKB was responsible for obtaining the tax parcel datasets, working with the dataset owners to ensure appropriate use of the datasets, and assisting MJS and CS with GIS activities. AC helped design the study and participated in its coordination. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Don Gambrell of NCBDDD, CDC, for his assistance in linking the OHIP and commercial vendor databases. Thanks to Andy Dent and Steve Bullard of ATSDR, CDC, for their assistance in managing and accessing the orthoimagery and tax parcel datasets. We also thank Elaine Hallisey of OHIP for her helpful comments on this manuscript.</p>
            <p>This project received financial support from the Environmental Health Tracking Branch, NCEH, CDC. </p>
            <p>The findings and conclusions in  this report have not been formally disseminated by CDC and should not be  construed to represent any agency determination or policy.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Positional accuracy of two methods of geocoding</p>
            </title>
            <aug>
               <au>
                  <snm>Ward</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Nuckols</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Giglierano</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Wolter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Airola</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mix</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Colt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hartge</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2005</pubdate>
            <volume>16</volume>
            <fpage>542</fpage>
            <lpage>547</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.ede.0000165364.54925.f3</pubid>
                  <pubid idtype="pmpid" link="fulltext">15951673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evaluation of uncertainties associated with geocoding techniques</p>
            </title>
            <aug>
               <au>
                  <snm>Karimi</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Durcik</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rasdorf</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Comput-Aided Civ Inf</source>
            <pubdate>2004</pubdate>
            <volume>19</volume>
            <fpage>170</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1467-8667.2004.00346.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Comparison of residential geocoding methods in population-based study of air quality and birth defects</p>
            </title>
            <aug>
               <au>
                  <snm>Gilboa</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Mendola</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Olshan</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Harness</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Loomis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Langlois</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Savitz</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Herring</snm>
                  <fnm>AH</fnm>
               </au>
            </aug>
            <source>Environ Res</source>
            <pubdate>2006</pubdate>
            <volume>101</volume>
            <fpage>256</fpage>
            <lpage>262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.envres.2006.01.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">16483563</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Geocoding addresses from a large population-based study: lessons learned</p>
            </title>
            <aug>
               <au>
                  <snm>McElroy</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Remington</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Trentham-Dietz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Newcomb</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <fpage>399</fpage>
            <lpage>407</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843762</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Positional error in automated geocoding of residential addresses</p>
            </title>
            <aug>
               <au>
                  <snm>Cayo</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Talbot</snm>
                  <fnm>TO</fnm>
               </au>
            </aug>
            <source>Int J Health Geogr</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>10</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">324564</pubid>
                  <pubid idtype="pmpid" link="fulltext">14687425</pubid>
                  <pubid idtype="doi">10.1186/1476-072X-2-10</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Accuracy of commercial geocoding: assessment and implications</p>
            </title>
            <aug>
               <au>
                  <snm>Whitsel</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Quibrera</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Catellier</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Henley</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Heiss</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Epidemiol Perspect Innov</source>
            <pubdate>2006</pubdate>
            <volume>3</volume>
            <fpage>8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1557664</pubid>
                  <pubid idtype="pmpid" link="fulltext">16857050</pubid>
                  <pubid idtype="doi">10.1186/1742-5573-3-8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Conceptual and practical issues in the detection of local disease clusters: a study of mortality in Hamilton, Ontario</p>
            </title>
            <aug>
               <au>
                  <snm>Burra</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jerrett</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burnett</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Can Geogr</source>
            <pubdate>2002</pubdate>
            <volume>46</volume>
            <fpage>160</fpage>
            <lpage>171</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Positional accuracy of geocoded addresses in epidemiologic research</p>
            </title>
            <aug>
               <au>
                  <snm>Bonner</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nie</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rogerson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vena</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Freudenheim</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Epidemiology</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <fpage>408</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12843763</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Associations between self-reported and objective physical environmental factors and use of a community rail-trail</p>
            </title>
            <aug>
               <au>
                  <snm>Troped</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Pate</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Reininger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ureda</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Prev Med</source>
            <pubdate>2001</pubdate>
            <volume>32</volume>
            <fpage>191</fpage>
            <lpage>200</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/pmed.2000.0788</pubid>
                  <pubid idtype="pmpid" link="fulltext">11162346</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>On the wrong side of the tracts? Evaluating the accuracy of geocoding in public health research</p>
            </title>
            <aug>
               <au>
                  <snm>Krieger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zierler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hogan</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Am J Public Health</source>
            <pubdate>2001</pubdate>
            <volume>91</volume>
            <fpage>1114</fpage>
            <lpage>1116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1446703</pubid>
                  <pubid idtype="pmpid" link="fulltext">11441740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Community- and individual-level determinants of Wuchereria Bancrofti infection in Leogane Commune, Hati</p>
            </title>
            <aug>
               <au>
                  <snm>Boyd</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Waller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Flanders</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Beach</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sivilus</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lovince</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lammie</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Addiss</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>2004</pubdate>
            <volume>70</volume>
            <fpage>266</fpage>
            <lpage>272</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15031515</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The Metropolitan Atlanta Congenital Defects Program: 35 years of birth defects surveillance at the Centers for Disease Control and Prevention</p>
            </title>
            <aug>
               <au>
                  <snm>Correa-Villasenor</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cragan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kucik</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>O'Leary</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Siffel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Birth Defects Res A</source>
            <pubdate>2003</pubdate>
            <volume>67</volume>
            <fpage>617</fpage>
            <lpage>624</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/bdra.10111</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Office of Health Information and Policy, Georgia Division of Public Health, Spatial Imputation Algorithm</p>
            </title>
            <url>http://health.state.ga.us/pdfs/ohip/adgsi.60101.pdf</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>ESRI Shapefile Technical Description</p>
            </title>
            <url>http://www.esri.com/library/whitepapers/pdfs/shapefile.pdf</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>USGS. U.S. Geographic Survey High Resolution Orthoimagery metadata Atlanta, GA</p>
            </title>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Virtues of the Haversine</p>
            </title>
            <aug>
               <au>
                  <snm>Sinnott</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Sky Telescope</source>
            <pubdate>1984</pubdate>
            <volume>68</volume>
            <fpage>159</fpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Zip code caveat: bias due to spatiotemporal mismatches between zip codes and US census-defined geographic areas &#8211; the Public Health Disparities Geocoding Project</p>
            </title>
            <aug>
               <au>
                  <snm>Krieger</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Soobader</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Carson</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Am J Public Health</source>
            <pubdate>2002</pubdate>
            <volume>92</volume>
            <fpage>1100</fpage>
            <lpage>1102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1447194</pubid>
                  <pubid idtype="pmpid" link="fulltext">12084688</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Geographic bias related to geocoding in epidemiologic studies</p>
            </title>
            <aug>
               <au>
                  <snm>Oliver</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Siadaty</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hauck</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Pickle</snm>
                  <fnm>LW</fnm>
               </au>
            </aug>
            <source>Int J Health Geogr</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <fpage>29</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1298322</pubid>
                  <pubid idtype="pmpid" link="fulltext">16281976</pubid>
                  <pubid idtype="doi">10.1186/1476-072X-4-29</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Relation between ambient air quality and selected birth defects, Seven County Study, Texas, 1997&#8211;2000</p>
            </title>
            <aug>
               <au>
                  <snm>Gilboa</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Mendola</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Olshan</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Langlois</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Savitz</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Loomis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Herring</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Fixler</snm>
                  <fnm>DE</fnm>
               </au>
            </aug>
            <source>Am J Epidemiol</source>
            <pubdate>2005</pubdate>
            <volume>162</volume>
            <fpage>238</fpage>
            <lpage>252</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/aje/kwi189</pubid>
                  <pubid idtype="pmpid" link="fulltext">15987727</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Examining associations between childhood asthma and traffic flow using a geographic information system</p>
            </title>
            <aug>
               <au>
                  <snm>English</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Neutra</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Scalf</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Waller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Environ Health Perspect</source>
            <pubdate>1999</pubdate>
            <volume>107</volume>
            <fpage>761</fpage>
            <lpage>767</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1566466</pubid>
                  <pubid idtype="pmpid">10464078</pubid>
                  <pubid idtype="doi">10.2307/3434663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Mapping for prevention: GIS models for directing childhood lead poisoning prevention programs</p>
            </title>
            <aug>
               <au>
                  <snm>Miranda</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dolinoy</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Overstreet</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Environ Health Perspect</source>
            <pubdate>2002</pubdate>
            <volume>110</volume>
            <fpage>947</fpage>
            <lpage>953</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1240996</pubid>
                  <pubid idtype="pmpid" link="fulltext">12204831</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Methodological aspects of a GIS-based environmental health inspection program used in the Athens 2004 Olympic and Para Olympic Games</p>
            </title>
            <aug>
               <au>
                  <snm>Hadjichristodoulou</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Soteriades</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Kolonia</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Falagas</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Pantelopoulos</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Panagakos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mouchtouri</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kremastinou</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Public Health</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>93</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1232856</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138924</pubid>
                  <pubid idtype="doi">10.1186/1471-2458-5-93</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Urban form and thermal efficiency &#8211; how the design of cities influences the urban heat island effect</p>
            </title>
            <aug>
               <au>
                  <snm>Stone</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rodgers</snm>
                  <fnm>MO</fnm>
               </au>
            </aug>
            <source>J Am Plann Assoc</source>
            <pubdate>2001</pubdate>
            <volume>67</volume>
            <fpage>186</fpage>
            <lpage>198</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
