This annual study provides selected income and tax items classified by State, ZIP Code, and the size of adjusted gross income. These data include the number of returns, which approximates the number of households; the number of personal exemptions, which approximates the population; adjusted gross income; wages and salaries; dividends before exclusion; and interest received. Data are based who reported on U.S. Individual Income Tax Returns (Forms 1040) filed with the IRS. SOI collects these data as part of its Individual Income Tax Return (Form 1040) Statistics program, Data by Geographic Areas, ZIP Code Data.
This feature service is derived from the Esri "United States Zip Code Boundaries" layer, queried to only CA data.For the original data see: https://esri.maps.arcgis.com/home/item.html?id=5f31109b46d541da86119bd4cf213848Published by the California Department of Technology Geographic Information Services Team.The GIS Team can be reached at ODSdataservices@state.ca.gov.U.S. ZIP Code Boundaries represents five-digit ZIP Code areas used by the U.S. Postal Service to deliver mail more effectively. The first digit of a five-digit ZIP Code divides the United States into 10 large groups of states (or equivalent areas) numbered from 0 in the Northeast to 9 in the far West. Within these areas, each state is divided into an average of 10 smaller geographical areas, identified by the second and third digits. These digits, in conjunction with the first digit, represent a Sectional Center Facility (SCF) or a mail processing facility area. The fourth and fifth digits identify a post office, station, branch or local delivery area.As of the time this layer was published, in January 2025, Esri's boundaries are sourced from TomTom (June 2024) and the 2023 population estimates are from Esri Demographics. Esri updates its layer annually and those changes will immediately be reflected in this layer. Note that, because this layer passes through Esri's data, if you want to know the true date of the underlying data, click through to Esri's original source data and look at their metadata for more information on updates.Cautions about using Zip Code boundary dataZip code boundaries have three characteristics you should be aware of before using them:Zip code boundaries change, in ways small and large - these are not a stable analysis unit. Data you received keyed to zip codes may have used an earlier and very different boundary for your zip codes of interest.Historically, the United States Postal Service has not published zip code boundaries, and instead, boundary datasets are compiled by third party vendors from address data. That means that the boundary data are not authoritative, and any data you have keyed to zip codes may use a different, vendor-specific method for generating boundaries from the data here.Zip codes are designed to optimize mail delivery, not social, environmental, or demographic characteristics. Analysis using zip codes is subject to create issues with the Modifiable Areal Unit Problem that will bias any results because your units of analysis aren't designed for the data being studied.As of early 2025, USPS appears to be in the process of releasing boundaries, which will at least provide an authoritative source, but because of the other factors above, we do not recommend these boundaries for many use cases. If you are using these for anything other than mailing purposes, we recommend reconsideration. We provide the boundaries as a convenience, knowing people are looking for them, in order to ensure that up-to-date boundaries are available.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Census ZIP Code Tabulation AreasThis feature layer, utilizing National Geospatial Data Asset (NGDA) data from the U.S. Census Bureau, displays ZIP Code Tabulation Areas. Per the USCB, “ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. Data users should not use ZCTAs to identify the official USPS ZIP Code for mail delivery. The USPS makes periodic changes to ZIP Codes to support more efficient mail delivery.”Tabulation Area: 90069NGDAID: 58 (Series Information for 2020 Census 5-Digit ZIP Code Tabulation Area (ZCTA5) National TIGER/Line Shapefiles, Current)OGC API Features Link: (Census ZIP Code Tabulation Areas - OGC Features) copy this link to embed it in OGC Compliant viewersFor more information, please visit: ZIP Code Tabulation Areas (ZCTAs)For feedback please contact: Esri_US_Federal_Data@esri.comNGDA Data SetThis data set is part of the NGDA Governmental Units, and Administrative and Statistical Boundaries Theme Community. Per the Federal Geospatial Data Committee (FGDC), this theme is defined as the "boundaries that delineate geographic areas for uses such as governance and the general provision of services (e.g., states, American Indian reservations, counties, cities, towns, etc.), administration and/or for a specific purpose (e.g., congressional districts, school districts, fire districts, Alaska Native Regional Corporations, etc.), and/or provision of statistical data (census tracts, census blocks, metropolitan and micropolitan statistical areas, etc.). Boundaries for these various types of geographic areas are either defined through a documented legal description or through criteria and guidelines. Other boundaries may include international limits, those of federal land ownership, the extent of administrative regions for various federal agencies, as well as the jurisdictional offshore limits of U.S. sovereignty. Boundaries associated solely with natural resources and/or cultural entities are excluded from this theme and are included in the appropriate subject themes."For other NGDA Content: Esri Federal Datasets
This dataset contains model-based ZIP Code Tabulation Area (ZCTA) level estimates for the PLACES project by the Centers for Disease Control and Prevention (CDC), Division of Population Health, Epidemiology and Surveillance Branch. It represents a first-of-its kind effort to release information uniformly on this large scale. Data sources used to generate these model-based estimates include Behavioral Risk Factor Surveillance System (BRFSS) 2019 or 2018 data, Census Bureau 2010 population estimates, and American Community Survey (ACS) 2015–2019 or 2014–2018 estimates. The 2021 release uses 2019 BRFSS data for 22 measures and 2018 BRFSS data for 7 measures (all teeth lost, dental visits, mammograms, cervical cancer screening, colorectal cancer screening, core preventive services among older adults, and sleeping less than 7 hours a night). Seven measures are based on the 2018 BRFSS data because the relevant questions are only asked every other year in the BRFSS. This data only covers the health of adults (people 18 and over) in East Baton Rouge Parish. All estimates lie within a 95% confidence interval.
This is a series-level metadata record. The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) System (MTS). The MTS represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. The Census Bureau delineates ZCTA boundaries for the United States, Puerto Rico, American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands once each decade following the decennial census. Data users should not use ZCTAs to identify the official USPS ZIP Code for mail delivery. The USPS makes periodic changes to ZIP Codes to support more efficient mail delivery. The Census Bureau uses tabulation blocks as the basis for defining each ZCTA. Tabulation blocks are assigned to a ZCTA based on the most frequently occurring ZIP Code for the addresses contained within that block. The most frequently occurring ZIP Code also becomes the five-digit numeric code of the ZCTA. These codes may contain leading zeros. Blocks that do not contain addresses but are surrounded by a single ZCTA (enclaves) are assigned to the surrounding ZCTA. Because the Census Bureau only uses the most frequently occurring ZIP Code to assign blocks, a ZCTA may not exist for every USPS ZIP Code. Some ZIP Codes may not have a matching ZCTA because too few addresses were associated with the specific ZIP Code or the ZIP Code was not the most frequently occurring ZIP Code within any of the blocks where it exists. The ZCTA boundaries in this release are those delineated following the 2020 Census.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides a wealth-tier classification of U.S. ZIP codes for high income brackets using IRS income data and multivariate KMeans clustering. It can help with regional targeting, CRM enrichment, market analysis, or any data science task that benefits from understanding high income distribution across the U.S.
Each row represents a ZIP code with:
A00100
), Total Income (A00200
)Low
, Medium
, or High
The cluster assignments are refined using distance to cluster centroids in normalized feature space to improve accuracy.
Column | Description |
---|---|
zipcode | U.S. ZIP code |
STATEFIPS | Federal Information Processing Standard (FIPS) code for the state |
STATE | U.S. state abbreviation (e.g., AL, CA) |
agi_stub | Adjusted Gross Income bracket (1 = <$25K, ..., 6 = $200K+) |
A00100 | Adjusted Gross Income |
A02650 | Total income from all sources |
A10600 | Total tax payments |
A00200 | Wages and salaries |
MARS2 | Count of married joint returns |
N2 | Number of dependents |
A00900 | Business/professional net income |
mars1 | Count of single returns |
A26270 | Partnership and S-Corp income |
A09400 | Self-employment tax |
MARS4 | Head of household returns |
A85300 | Net investment income |
A00600 | Ordinary dividends |
A04475 | Qualified business income deduction |
A00650 | Qualified dividends |
A18500 | Real estate taxes paid |
Cluster | Numeric cluster ID (0 = High, 1 = Medium, 2 = Low) |
Wealth_Tier | Human-readable wealth tier label |
Created by Namrata Nyamagoudar(LinkedIn) for open-source analysis and enrichment use cases.
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
This dataset combines annual files from 2005 to 2017 published by the IRS. ZIP Code data show selected income and tax items classified by State, ZIP Code, and size of adjusted gross income. Data are based on individual income tax returns filed with the IRS. The data include items, such as:
Number of returns, which approximates the number of householdsNumber of personal exemptions, which approximates the populationAdjusted gross income (AGI)Wages and salariesDividends before exclusionInterest received Enrichment and notes:- the original data sheets (a column per variable, a line per year, zipcode and AGI group) have been transposed to get a record per year, zipcode, AGI group and variable- the data for Wyoming in 2006 was removed because AGI classes were not correctly defined, making the resulting data unfit for analysis.- the AGI groups have seen their definitions change: the variable "AGI Class" was used until 2008, with various intervals of AGI; "AGI Stub" replaced it in 2009. We provided the literal intervals (eg. "$50,000 under $75,000") as "AGI Group" in each case to help the analysis.- the codes for each tax item have been joined with a dataset of variables to provide full names.- some tax items are available since 2005, others since more recent years, depending on their introduction date (available in the dataset of variables); as a consequence, the time range of the plots or graphs may vary.- the unit for amounts and AGIs is a thousand dollars.
The TIGER/Line shapefiles and related database files (.dbf) are an extract of selected geographic and cartographic information from the U.S. Census Bureau's Master Address File / Topologically Integrated Geographic Encoding and Referencing (MAF/TIGER) Database (MTDB). The MTDB represents a seamless national file with no overlaps or gaps between parts, however, each TIGER/Line shapefile is designed to stand alone as an independent data set, or they can be combined to cover the entire nation. ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. The Census Bureau delineates ZCTA boundaries for the United States, Puerto Rico, American Samoa, Guam, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands once each decade following the decennial census. Data users should not use ZCTAs to identify the official USPS ZIP Code for mail delivery. The USPS makes periodic changes to ZIP Codes to support more efficient mail delivery. The Census Bureau uses tabulation blocks as the basis for defining each ZCTA. Tabulation blocks are assigned to a ZCTA based on the most frequently occurring ZIP Code for the addresses contained within that block. The most frequently occurring ZIP Code also becomes the five-digit numeric code of the ZCTA. These codes may contain leading zeros. Blocks that do not contain addresses but are surrounded by a single ZCTA (enclaves) are assigned to the surrounding ZCTA. Because the Census Bureau only uses the most frequently occurring ZIP Code to assign blocks, a ZCTA may not exist for every USPS ZIP Code. Some ZIP Codes may not have a matching ZCTA because too few addresses were associated with the specific ZIP Code or the ZIP Code was not the most frequently occurring ZIP Code within any of the blocks where it exists. The ZCTA boundaries in this release are those delineated following the 2020 Census.
Our zip code Database offers comprehensive postal code data for spatial analysis, including postal and administrative areas. This dataset contains accurate and up-to-date information on all administrative divisions, cities, and zip codes, making it an invaluable resource for various applications such as address capture and validation, map and visualization, reporting and business intelligence (BI), master data management, logistics and supply chain management, and sales and marketing. Our location data packages are available in various formats, including CSV, optimized for seamless integration with popular systems like Esri ArcGIS, Snowflake, QGIS, and more. Product features include fully and accurately geocoded data, multi-language support with address names in local and foreign languages, comprehensive city definitions, and the option to combine map data with UNLOCODE and IATA codes, time zones, and daylight saving times. Companies choose our location databases for their enterprise-grade service, reduction in integration time and cost by 30%, and weekly updates to ensure the highest quality.
https://www.icpsr.umich.edu/web/ICPSR/studies/59/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/59/terms
This data collection contains aggregate information from income tax returns for 5-digit ZIP-code areas for the entire United States. Data are provided for three income classes with adjusted gross income returns of under $3,000, $3,000 to $10,000, and over $10,000. Information is provided on gross income, taxes paid, personal exemptions, total number of joint returns filed by married couples, and aggregate number of returns filed by all taxpayers. These data, originally prepared by the Internal Revenue Service, were supplied to ICPSR in computer-readable form by Philip Lankford of the University of California at Los Angeles.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Context I am greatly inspired with this dataset containing geo spatial details for each zip code and contains the total wages for each area.This gave me opportunity to create a data visualisation in Tableau using HexBin chart which is added as a Kernel to this dataset.
Content
50 States + 361 AA Military
Americas 38 AE Military
Europe 164 AP Military
Pacific 1 AS American Samoa 290 DC Washinton DC 4 FM Federated States Micronesia 13 GU Guam 2 MH Marshall Islands 3 MP Northern Mariana Islands 176 PR Puerto Rico 2 PW Palau 16 VI Virgin Islands
Name Type Description
Zipcode Text 5 digit Zipcode or military postal code(FPO/APO)
ZipCodeType Text Standard, PO BOX Only, Unique, Military(implies APO or FPO)
City Text USPS offical city name(s)
State Text USPS offical state, territory, or quasi-state (AA, AE, AP) abbreviation code
LocationType Text Primary, Acceptable,Not Acceptable
Lat Double Decimal Latitude, if available
Long Double Decimal Longitude, if available
Location Text Standard Display (eg Phoenix, AZ ; Pago Pago, AS ; Melbourne, AU )
Decommissioned Text If Primary location, Yes implies historical Zipcode, No Implies current Zipcode; If not Primary, Yes implies Historical Placename
TaxReturnsFiled Long Integer Number of Individual Tax Returns Filed in 2008
EstimatedPopulation Long Integer Tax returns filed + Married filing jointly + Dependents
TotalWages Long Integer Total of Wages Salaries and Tips
Current zipcodes, placenames, zipcode type(Standard, PO, Unique, Military), placename type (Primary, Acceptable, Not Acceptable)
: USPS Military place names (base or ship name)
: MPSA 2008 Election Ballot information Tax returns filed, estimated population, total wages: IRS 2008 Latitude and Longitude; National Weather Service supplemented by Google Earth and Maps and occasionally other sources Decommissioned zip codes, Our old database--usually quality sources, but not verifiable.
Other Sources of zipcode information:
Placenames (Cities, towns, geographic features) can be found at US Geological Survey GNIS Dataset The IRS has additional data fields for 2008 and is reviewing their publication procedures for later years.
see http://www.irs.gov/taxstats/indtaxstats/article/0,,id=96947,00.html
The Census publishes data, but they use Zipcode Tabulation Areas (ZCTAs) which
1) have changed areas between the 2000 census and the 2010 census
2) do not map well to USPS zipcodes well. If needed http://www.census.gov/geo/ZCTA/zcta.html Social Security recipients by zipcode http://www.ssa.gov/policy/docs/statcomps/oasdi_zip/ For economic researchers and those who want tons of background on data sources by zipcode, University of Missouri OSEDA project
community developments where it needs immediate attention.
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
This dataset contains model-based ZIP Code Tabulation Area (ZCTA) level estimates. PLACES covers the entire United States—50 states and the District of Columbia—at county, place, census tract, and ZIP Code Tabulation Area levels. It provides information uniformly on this large scale for local areas at four geographic levels. Estimates were provided by the Centers for Disease Control and Prevention (CDC), Division of Population Health, Epidemiology and Surveillance Branch. PLACES was funded by the Robert Wood Johnson Foundation in conjunction with the CDC Foundation. The dataset includes estimates for 40 measures: 12 for health outcomes, 7 for preventive services use, 4 for chronic disease-related health risk behaviors, 7 for disabilities, 3 for health status, and 7 for health-related scocial needs. These estimates can be used to identify emerging health problems and to help develop and carry out effective, targeted public health prevention activities. Because the small area model cannot detect effects due to local interventions, users are cautioned against using these estimates for program or policy evaluations. Data sources used to generate these model-based estimates are Behavioral Risk Factor Surveillance System (BRFSS) 2022 or 2021 data, Census Bureau 2020 population data, and American Community Survey 2018–2022 estimates. The 2024 release uses 2022 BRFSS data for 36 measures and 2021 BRFSS data for 4 measures (high blood pressure, high cholesterol, cholesterol screening, and taking medicine for high blood pressure control among those with high blood pressure) that the survey collects data on every other year. More information about the methodology can be found at www.cdc.gov/places.
ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. The Census Bureau uses tabulation blocks as the basis for defining each ZCTA. Tabulation blocks are assigned to a ZCTA based on the most frequently occurring ZIP Code for the addresses contained within that block. The most frequently occurring ZIP Code also becomes the five-digit numeric code of the ZCTA. Blocks that do not contain addresses but are surrounded by a single ZCTA (enclaves) are assigned to the surrounding ZCTA. Because the Census Bureau only uses the most frequently occurring ZIP Code to assign blocks, a ZCTA may not exist for every USPS ZIP Code. Some ZIP Codes may not have a matching ZCTA because too few addresses were associated with the specific ZIP Code or the ZIP Code was not the most frequently occurring ZIP Code within any of the blocks where it exists.
Users are encouraged to refer to the U.S. Census website for more information on ZCTAs: https://www.census.gov/programs-surveys/geography/guidance/geo-areas/zctas.html
and to the U.S. Postal Service for more information on ZIP Codes: https://faq.usps.com/
California - Census ZIP Code Tabulation Areas (ZCTA)This data is a subset of the National ZCTA data from the US Census Bureau. This layer was created by using the Select by Layer tool in ArcGIS Pro. First, the polygon for the California was selected from the United State County Borders, then the features from the ZCTA layer within the CA polygon were selected to create a new California only ZCTA layer.Census ZIP Code Tabulation AreasThis feature layer, utilizing National Geospatial Data Asset (NGDA) data from the U.S. Census Bureau, displays ZIP Code Tabulation Areas. Per the USCB, “ZIP Code Tabulation Areas (ZCTAs) are approximate area representations of U.S. Postal Service (USPS) ZIP Code service areas that the Census Bureau creates to present statistical data for each decennial census. Data users should not use ZCTAs to identify the official USPS ZIP Code for mail delivery. The USPS makes periodic changes to ZIP Codes to support more efficient mail delivery.”Tabulation Area: 90069NGDAID: 58 (Series Information for 2020 Census 5-Digit ZIP Code Tabulation Area (ZCTA5) National TIGER/Line Shapefiles, Current)OGC API Features Link: (Census ZIP Code Tabulation Areas - OGC Features) copy this link to embed it in OGC Compliant viewersFor more information, please visit: ZIP Code Tabulation Areas (ZCTAs)For feedback please contact: Esri_US_Federal_Data@esri.comNGDA Data SetThis data set is part of the NGDA Governmental Units, and Administrative and Statistical Boundaries Theme Community. Per the Federal Geospatial Data Committee (FGDC), this theme is defined as the "boundaries that delineate geographic areas for uses such as governance and the general provision of services (e.g., states, American Indian reservations, counties, cities, towns, etc.), administration and/or for a specific purpose (e.g., congressional districts, school districts, fire districts, Alaska Native Regional Corporations, etc.), and/or provision of statistical data (census tracts, census blocks, metropolitan and micropolitan statistical areas, etc.). Boundaries for these various types of geographic areas are either defined through a documented legal description or through criteria and guidelines. Other boundaries may include international limits, those of federal land ownership, the extent of administrative regions for various federal agencies, as well as the jurisdictional offshore limits of U.S. sovereignty. Boundaries associated solely with natural resources and/or cultural entities are excluded from this theme and are included in the appropriate subject themes."For other NGDA Content: Esri Federal Datasets
MassGIS had received quarterly updates of these data as part of its license for the HERE (Navteq) core map release (streets and related data); however, that license has expired. These ZIP Code boundaries are aligned to the street centerlines of the Q2 2018 HERE product (with a release date of April 1, 2018) and use a then-recent USPS source file.In March 2024, MassGIS modified the boundaries for all ZIP Code areas in Boston based on the U.S. Postal Service's ZIP Code Look Up by Address website. MassGIS also added polygons for ZIP Codes 02199 and 02203.Five-digit ZIP Codes were developed by the USPS and first introduced in 1963 for efficient mail delivery (the term ZIP stands for Zone Improvement Plan) but are difficult to map with complete certainty. In most cases, addresses in close proximity to each other are grouped in the same ZIP Code, which gives the appearance that ZIP Codes are defined by a clear geographic boundary. However, even when ZIP Codes appear to be geographically grouped, a clear ZIP Code boundary cannot always be drawn because ZIP Codes are only assigned to a point of delivery and not the spaces between delivery points. In areas without a regular postal route or no mail delivery, ZIP Codes may not be defined or have unclear boundaries.The USPS does not maintain an official ZIP Code map. The Census Bureau and many other commercial services will interpolate the data to create polygons to represent the approximate area covered by a ZIP code, but none of these maps are official or entirely accurate. Please see this good discussion of the issues of mapping ZIP Codes.See full metadata.Feature service also available.
https://www.geopostcodes.com/privacy-policy/https://www.geopostcodes.com/privacy-policy/
Comprehensive, annually-updated population datasets at ZIP code and administrative levels for 247 countries, spanning from 1975 to 2030, including historical, current, and projected population figures, enriched with attributes like area size, multilingual support, UNLOCODEs, IATA codes, and time zones.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This dataset contains information on the ratio of family income to the federal poverty level at the zip code tabulation area (ZCTA) level. Each column beginning with a "T_" lists the total number of families that fall into each income category. In addition, the dataset contains information on margins of error and the reliability of each estimate, to help guide decisionmakers in more effectively using the data contained in this file. There are approximately 1,000 records in this dataset. ZCTA boundaries are designed to approximate actual zip code boundaries, but are fixed to allow for consistent data analysis (whereas regular zip code boundaries change frequently). Field description metadata is available for download. For more information on poverty data from the Census Bureau, please visit American Factfinder (www.factfinder2.census.gov).
A crosswalk dataset matching US ZIP codes to corresponding census tracts
The denominators used to calculate the address ratios are the ZIP code totals. When a ZIP is split by any of the other geographies, that ZIP code is duplicated in the crosswalk file.
**Example: **ZIP code 03870 is split by two different Census tracts, 33015066000 and 33015071000, which appear in the tract column. The ratio of residential addresses in the first ZIP-Tract record to the total number of residential addresses in the ZIP code is .0042 (.42%). The remaining residential addresses in that ZIP (99.58%) fall into the second ZIP-Tract record.
So, for example, if one wanted to allocate data from ZIP code 03870 to each Census tract located in that ZIP code, one would multiply the number of observations in the ZIP code by the residential ratio for each tract associated with that ZIP code.
https://redivis.com/fileUploads/4ecb405e-f533-4a5b-8286-11e56bb93368%3E" alt="">(Note that the sum of each ratio column for each distinct ZIP code may not always equal 1.00 (or 100%) due to rounding issues.)
Census tract definition
A census tract, census area, census district or meshblock is a geographic region defined for the purpose of taking a census. Sometimes these coincide with the limits of cities, towns or other administrative areas and several tracts commonly exist within a county. In unincorporated areas of the United States these are often arbitrary, except for coinciding with political lines.
Further reading
The following article demonstrates how to more effectively use the U.S. Department of Housing and Urban Development (HUD) United States Postal Service ZIP Code Crosswalk Files when working with disparate geographies.
Wilson, Ron and Din, Alexander, 2018. “Understanding and Enhancing the U.S. Department of Housing and Urban Development’s ZIP Code Crosswalk Files,” Cityscape: A Journal of Policy Development and Research, Volume 20 Number 2, 277 – 294. URL: https://www.huduser.gov/portal/periodicals/cityscpe/vol20num2/ch16.pdf
Contact information
Questions regarding these crosswalk files can be directed to Alex Din with the subject line HUD-Crosswalks.
Acknowledgement
This dataset is taken from the U.S. Department of Housing and Urban Development (HUD) office: https://www.huduser.gov/portal/datasets/usps_crosswalk.html#codebook
The Demographic Reports are produced by the Economic, Demographic and Statistical Research unit within the Countywide Service Integration and Planning Management (CSIPM) Division of the Fairfax County Department of Neighborhood and Community Services. Information produced by the Economic, Demographic and Statistical Research unit is used by every county department, board, authority and the Fairfax County Public Schools.
A crosswalk table from US postal ZIP codes to geo-points (latitude, longitude)
Data source: public.opendatasoft.
The ZIP code database contained in 'zipcode.csv' contains 43204 ZIP codes for the continental United States, Alaska, Hawaii, Puerto Rico, and American Samoa. The database is in comma separated value format, with columns for ZIP code, city, state, latitude, longitude, timezone (offset from GMT), and daylight savings time flag (1 if DST is observed in this ZIP code and 0 if not).
This database was composed using ZIP code gazetteers from the US Census Bureau from 1999 and 2000, augmented with additional ZIP code information The database is believed to contain over 98% of the ZIP Codes in current use in the United States. The remaining ZIP Codes absent from this database are entirely PO Box or Firm ZIP codes added in the last five years, which are no longer published by the Census Bureau, but in any event serve a very small minority of the population (probably on the order of .1% or less). Although every attempt has been made to filter them out, this data set may contain up to .5% false positives, that is, ZIP codes that do not exist or are no longer in use but are included due to erroneous data sources. The latitude and longitude given for each ZIP code is typically (though not always) the geographic centroid of the ZIP code; in any event, the location given can generally be expected to lie somewhere within the ZIP code's "boundaries".The ZIP code database contained in 'zipcode.csv' contains 43204 ZIP codes for the continental United States, Alaska, Hawaii, Puerto Rico, and American Samoa. The database is in comma separated value format, with columns for ZIP code, city, state, latitude, longitude, timezone (offset from GMT), and daylight savings time flag (1 if DST is observed in this ZIP code and 0 if not). This database was composed using ZIP code gazetteers from the US Census Bureau from 1999 and 2000, augmented with additional ZIP code information The database is believed to contain over 98% of the ZIP Codes in current use in the United States. The remaining ZIP Codes absent from this database are entirely PO Box or Firm ZIP codes added in the last five years, which are no longer published by the Census Bureau, but in any event serve a very small minority of the population (probably on the order of .1% or less). Although every attempt has been made to filter them out, this data set may contain up to .5% false positives, that is, ZIP codes that do not exist or are no longer in use but are included due to erroneous data sources. The latitude and longitude given for each ZIP code is typically (though not always) the geographic centroid of the ZIP code; in any event, the location given can generally be expected to lie somewhere within the ZIP code's "boundaries".
The database and this README are copyright 2004 CivicSpace Labs, Inc., and are published under a Creative Commons Attribution-ShareAlike license, which requires that all updates must be released under the same license. See http://creativecommons.org/licenses/by-sa/2.0/ for more details. Please contact schuyler@geocoder.us if you are interested in receiving updates to this database as they become available.The database and this README are copyright 2004 CivicSpace Labs, Inc., and are published under a Creative Commons Attribution-ShareAlike license, which requires that all updates must be released under the same license. See http://creativecommons.org/licenses/by-sa/2.0/ for more details. Please contact schuyler@geocoder.us if you are interested in receiving updates to this database as they become available.
This annual study provides selected income and tax items classified by State, ZIP Code, and the size of adjusted gross income. These data include the number of returns, which approximates the number of households; the number of personal exemptions, which approximates the population; adjusted gross income; wages and salaries; dividends before exclusion; and interest received. Data are based who reported on U.S. Individual Income Tax Returns (Forms 1040) filed with the IRS. SOI collects these data as part of its Individual Income Tax Return (Form 1040) Statistics program, Data by Geographic Areas, ZIP Code Data.