93 datasets found
  1. US Race and Ethnicity Codes

    • johnsnowlabs.com
    csv
    Updated Jan 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    John Snow Labs (2021). US Race and Ethnicity Codes [Dataset]. https://www.johnsnowlabs.com/marketplace/us-race-and-ethnicity-codes/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 20, 2021
    Dataset authored and provided by
    John Snow Labs
    Area covered
    N/A, United States
    Description

    This dataset contains Race/Ethinicty codes. It is used to enter in patient demographics information.

  2. d

    Race and Ethnicity - ACS 2018-2022 - Tempe Zip Code

    • catalog.data.gov
    • data-academy.tempe.gov
    • +7more
    Updated May 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Tempe (2025). Race and Ethnicity - ACS 2018-2022 - Tempe Zip Code [Dataset]. https://catalog.data.gov/dataset/race-and-ethnicity-acs-2018-2022-tempe-zip-code
    Explore at:
    Dataset updated
    May 10, 2025
    Dataset provided by
    City of Tempe
    Area covered
    Tempe
    Description

    This layer shows the population broken down by race and Hispanic origin. Data is from US Census American Community Survey (ACS) 5-year estimates.To see the full list of attributes available in this service, go to the "Data" tab, and choose "Fields" at the top right (in ArcGIS Online). A ‘Null’ entry in the estimate indicates that data for this geographic area cannot be displayed because the number of sample cases is too small (per the U.S. Census).Vintage: 2018-2022ACS Table(s): B03002 (Not all lines of this ACS table are available in this feature layer.)Data downloaded from: Census Bureau's API for American Community Survey Data Preparation: Data table was downloaded and joined with Zip Code boundaries in the City of Tempe.Date of Census update: December 15, 2023National Figures: data.census.gov

  3. f

    Code lists for ethnicity and the conditions considered in our study.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Nov 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Barrett, Jessica K.; Yau, Christopher; Griffin, Simon; Marshall, Tom; Nirantharakumar, Krish; Crowe, Francesca; Saunders, Catherine L.; Chen, Sida; Cooper, Jennifer; Kirk, Paul; Edwards, Duncan; Richardson, Sylvia; Jackson, Christopher (2023). Code lists for ethnicity and the conditions considered in our study. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000940558
    Explore at:
    Dataset updated
    Nov 3, 2023
    Authors
    Barrett, Jessica K.; Yau, Christopher; Griffin, Simon; Marshall, Tom; Nirantharakumar, Krish; Crowe, Francesca; Saunders, Catherine L.; Chen, Sida; Cooper, Jennifer; Kirk, Paul; Edwards, Duncan; Richardson, Sylvia; Jackson, Christopher
    Description

    Code lists for ethnicity and the conditions considered in our study.

  4. Race/Ethnicity of Newly Medi-Cal Eligible Individuals

    • data.chhs.ca.gov
    • healthdata.gov
    • +3more
    csv, zip
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Health Care Services (2025). Race/Ethnicity of Newly Medi-Cal Eligible Individuals [Dataset]. https://data.chhs.ca.gov/dataset/race-ethnicity-of-newly-medi-cal-eligible-individuals
    Explore at:
    csv(27548), zipAvailable download formats
    Dataset updated
    Nov 7, 2025
    Dataset provided by
    California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
    Authors
    Department of Health Care Services
    Description

    This dataset includes race/ethnicity of newly Medi-Cal eligible individuals who identified their race/ethnicity as Hispanic, White, Other Asian or Pacific Islander, Black, Chinese, Filipino, Vietnamese, Asian Indian, Korean, Alaskan Native or American Indian, Japanese, Cambodian, Samoan, Laotian, Hawaiian, Guamanian, Amerasian, or Other, by reporting period. The race/ethnicity data is from the Medi-Cal Eligibility Data System (MEDS) and includes eligible individuals without prior Medi-Cal Eligibility. This dataset is part of the public reporting requirements set forth in California Welfare and Institutions Code 14102.5.

  5. Ethnicity coding

    • zenodo.org
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Paola Galdi; Paola Galdi; Luna De Ferrari; Luna De Ferrari (2025). Ethnicity coding [Dataset]. http://doi.org/10.5281/zenodo.15044385
    Explore at:
    Dataset updated
    Mar 18, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Paola Galdi; Paola Galdi; Luna De Ferrari; Luna De Ferrari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This Zenodo entry details the methodology for extracting and reconciling ethnicity data from the Clinical Practice Research Datalink (CPRD), incorporating both General Practitioner (GP) and Hospital Episode Statistics (HES) sources. The approach aims to resolve discrepancies between these sources and provide a standardized single ethnicity value per patient, categorized into 6 and 12 levels according to NHS coding guidelines.

    Materials and Methods:

    Ethnicity data from the CPRD are recorded in multiple formats. This study harmonizes these data to achieve consistent ethnicity classification across patient records, following a hierarchal reconciliation protocol prioritizing hospital data over GP records.

    Ethnicity Levels: Ethnicity data are processed to conform to two levels of granularity:

    1. Six high-level categories: White, Black, Asian, Mixed, Other, Unknown
    2. Twelve detailed categories: Bangladeshi, Black African, Black Caribbean, Black Other, Chinese, Indian, Mixed, Other Asian, Other, Pakistani, Unknown, White

    Source Data Mapping:

    • CPRD Medcodes: Directly mapped to 490 SNOMED codes
    • SNOMED to NHS Codes: SNOMED codes are linked to 26 NHS ethnicity codes
    • NHS to HES Codes: These NHS codes further map into 12 HES hospital ethnicities, which then consolidate into the 6 broad categories mentioned above

    Algorithm (AIM-CISC):

    • Hospital Data Priority: Ethnicity records from hospital sources override those from GP records unless the hospital data is classified as "Unknown", null, or empty.
    • Conflict Resolution Within GP Data:
      • The frequency of recorded ethnicities determines the selection. The most frequently recorded ethnicity prevails.
      • If frequencies are tied, the most recent record is used.
      • In cases where records are equally recent, the first alphabetically listed ethnicity is selected.

    Unique Patient Identifiers: Each patient is represented once in hospital data, ensuring a single source of truth for hospital-based ethnicities. This simplifies reconciliation with GP data when discrepancies arise.

    Source Documentation and References:

    Notes on mapping:

    Instances were noted where multiple Medcodes map back to a single SNOMED code, highlighting the importance of careful data cross-referencing. For example, two different Medcodes represent the New Zealand European ethnicity, which both map back to the identical SNOMED code.

  6. Mapping detailed SNOMED ethnicity codes to harmonised Census 2021 ethnic...

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Nov 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2023). Mapping detailed SNOMED ethnicity codes to harmonised Census 2021 ethnic categories, England [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthinequalities/datasets/mappingdetailedsnomedethnicitycodestoharmonisedcensus2021ethniccategoriesengland
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Nov 6, 2023
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    England
    Description

    Comparing NHS England SNOMED code mapping with how individuals self-identified their ethnicity in Census 2021.

  7. d

    RACE ETHNICITY Percent Persons by Race COS 2000

    • catalog.data.gov
    • datasets.ai
    • +2more
    Updated Dec 2, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Department of Commerce, Bureau of the Census, Geography Division (Point of Contact) (2020). RACE ETHNICITY Percent Persons by Race COS 2000 [Dataset]. https://catalog.data.gov/dataset/race-ethnicity-percent-persons-by-race-cos-2000
    Explore at:
    Dataset updated
    Dec 2, 2020
    Dataset provided by
    U.S. Department of Commerce, Bureau of the Census, Geography Division (Point of Contact)
    Description

    TIGER, TIGER/Line, and Census TIGER are registered trademarks of the Bureau of the Census. The Redistricting Census 2000 TIGER/Line files are an extract of selected geographic and cartographic information from the Census TIGER data base. The geographic coverage for a single TIGER/Line file is a county or statistical equivalent entity, with the coverage area based on January 1, 2000 legal boundaries. A complete set of Redistricting Census 2000 TIGER/Line files includes all counties and statistically equivalent entities in the United States and Puerto Rico. The Redistricting Census 2000 TIGER/Line files will not include files for the Island Areas. The Census TIGER data base represents a seamless national file with no overlaps or gaps between parts. However, each county-based TIGER/Line file is designed to stand alone as an independent data set or the files can be combined to cover the whole Nation. The Redistricting Census 2000 TIGER/Line files consist of line segments representing physical features and governmental and statistical boundaries. The Redistricting Census 2000 TIGER/Line files do NOT contain the ZIP Code Tabulation Areas (ZCTAs) and the address ranges are of approximately the same vintage as those appearing in the 1999 TIGER/Line files. That is, the Census Bureau is producing the Redistricting Census 2000 TIGER/Line files in advance of the computer processing that will ensure that the address ranges in the TIGER/Line files agree with the final Master Address File (MAF) used for tabulating Census 2000. The files contain information distributed over a series of record types for the spatial objects of a county. There are 17 record types, including the basic data record, the shape coordinate points, and geographic codes that can be used with appropriate software to prepare maps. Other geographic information contained in the files includes attributes such as feature identifiers/census feature class codes (CFCC) used to differentiate feature types, address ranges and ZIP Codes, codes for legal and statistical entities, latitude/longitude coordinates of linear and point features, landmark point features, area landmarks, key geographic features, and area boundaries. The Redistricting Census 2000 TIGER/Line data dictionary contains a complete list of all the fields in the 17 record types.

  8. Demographics: Population, Race, Gender Data County

    • kaggle.com
    zip
    Updated Jan 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Mohamed (2025). Demographics: Population, Race, Gender Data County [Dataset]. https://www.kaggle.com/datasets/ahmedmohamed2003/county-level-demographic-population-race-gender
    Explore at:
    zip(93210 bytes)Available download formats
    Dataset updated
    Jan 14, 2025
    Authors
    Ahmed Mohamed
    Description

    """

    County-Level Demographic: Population, Race, Gender

    Overview

    This dataset provides a detailed breakdown of demographic information for counties across the United States, derived from the U.S. Census Bureau's 2023 American Community Survey (ACS). The data includes population counts by gender, race, and ethnicity, alongside unique identifiers for each county using State and County FIPS codes.

    Dataset Features

    The dataset includes the following columns: - County: Name of the county. - State: Name of the state the county belongs to. - State FIPS Code: Federal Information Processing Standard (FIPS) code for the state. - County FIPS Code: FIPS code for the county. - FIPS: Combined State and County FIPS codes, a unique identifier for each county. - Total Population: Total population in the county. - Male Population: Number of males in the county. - Female Population: Number of females in the county. - Total Race Responses: Total race-related responses recorded in the survey. - White Alone: Number of individuals identifying as White alone. - Black or African American Alone: Number of individuals identifying as Black or African American alone. - Hispanic or Latino: Number of individuals identifying as Hispanic or Latino.

    Processing Methodology

    1. Source:
    2. County-Level Aggregation:
      • Each county is uniquely identified using State FIPS Code and County FIPS Code.
      • These codes were concatenated to form the unified FIPS column.
    3. Data Cleaning:
      • All numeric columns were converted to appropriate data types.
      • County and state names were extracted from the raw NAME field for clarity.

    Why Use This Dataset?

    This dataset is highly versatile and suitable for: - Demographic Analysis: - Analyze population distribution by gender, race, and ethnicity. - Geographic Studies: - Use FIPS codes to map counties geographically. - Data Visualizations: - Create visual insights into demographic trends across counties.

    File Format

    • The dataset is available as a CSV file with 3,000+ rows (one for each county).

    Licensing

    • Source: Data is sourced from the U.S. Census Bureau's 2023 American Community Survey (ACS).
    • License: This dataset is in the public domain and provided under the U.S. Census Bureau’s terms of use. Attribution to the Census Bureau is appreciated.

    Acknowledgments

    Special thanks to the U.S. Census Bureau for making this data publicly available and to the Kaggle community for fostering a collaborative space for data analysis and exploration. """

  9. 👨‍👩‍👧 US Country Demographics

    • kaggle.com
    zip
    Updated Aug 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mexwell (2023). 👨‍👩‍👧 US Country Demographics [Dataset]. https://www.kaggle.com/datasets/mexwell/us-country-demographics
    Explore at:
    zip(343499 bytes)Available download formats
    Dataset updated
    Aug 14, 2023
    Authors
    mexwell
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Area covered
    United States
    Description

    The following data set is information obtained about counties in the United States from 2010 through 2019 through the United States Census Bureau. Information described in the data includes the age distributions, the education levels, employment statistics, ethnicity percents, houseold information, income, and other miscellneous statistics. (Values are denoted as -1, if the data is not available)

    Data Dictionary

    <...

    KeyList of...CommentExample Value
    CountyStringCounty name"Abbeville County"
    StateStringState name"SC"
    Age.Percent 65 and OlderFloatEstimated percentage of population whose ages are equal or greater than 65 years old are produced for the United States states and counties as well as for the Commonwealth of Puerto Rico and its municipios (county-equivalents for Puerto Rico).22.4
    Age.Percent Under 18 YearsFloatEstimated percentage of population whose ages are under 18 years old are produced for the United States states and counties as well as for the Commonwealth of Puerto Rico and its municipios (county-equivalents for Puerto Rico).19.8
    Age.Percent Under 5 YearsFloatEstimated percentage of population whose ages are under 5 years old are produced for the United States states and counties as well as for the Commonwealth of Puerto Rico and its municipios (county-equivalents for Puerto Rico).4.7
    Education.Bachelor's Degree or HigherFloatPercentage for the people who attended college but did not receive a degree and people who received an associate's bachelor's master's or professional or doctorate degree. These data include only persons 25 years old and over. The percentages are obtained by dividing the counts of graduates by the total number of persons 25 years old and over. Tha data is collected from 2015 to 2019.15.6
    Education.High School or HigherFloatPercentage of people whose highest degree was a high school diploma or its equivalent people who attended college but did not receive a degree and people who received an associate's bachelor's master's or professional or doctorate degree. These data include only persons 25 years old and over. The percentages are obtained by dividing the counts of graduates by the total number of persons 25 years old and over. Tha data is collected from 2015 to 201981.7
    Employment.Nonemployer EstablishmentsIntegerAn establishment is a single physical location at which business is conducted or where services or industrial operations are performed. It is not necessarily identical with a company or enterprise which may consist of one establishment or more. The data was collected from 2018.1416
    Ethnicities.American Indian and Alaska Native AloneFloatEstimated percentage of population having origins in any of the original peoples of North and South America (including Central America) and who maintains tribal affiliation or community attachment. This category includes people who indicate their race as "American Indian or Alaska Native" or report entries such as Navajo Blackfeet Inupiat Yup'ik or Central American Indian groups or South American Indian groups.0.3
    Ethnicities.Asian AloneFloatEstimated percentage of population having origins in any of the original peoples of the Far East Southeast Asia or the Indian subcontinent including for example Cambodia China India Japan Korea Malaysia Pakistan the Philippine Islands Thailand and Vietnam. This includes people who reported detailed Asian responses such as: "Asian Indian " "Chinese " "Filipino " "Korean " "Japanese " "Vietnamese " and "Other Asian" or provide other detailed Asian responses.0.4
    Ethnicities.Black AloneFloatEstimated percentage of population having origins in any of the Black racial groups of Africa. It includes people who indicate their race as "Black or African American " or report entries such as African American Kenyan Nigerian or Haitian.27.6
    Ethnicities.Hispanic or LatinoFloat
  10. t

    Race and Ethnicity - ACS 2016-2020 - Tempe Zip Codes

    • data.tempe.gov
    • data-academy.tempe.gov
    • +8more
    Updated May 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Tempe (2022). Race and Ethnicity - ACS 2016-2020 - Tempe Zip Codes [Dataset]. https://data.tempe.gov/datasets/tempegov::race-and-ethnicity-acs-2016-2020-tempe-zip-codes
    Explore at:
    Dataset updated
    May 2, 2022
    Dataset authored and provided by
    City of Tempe
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Description

    This layer shows population broken down by race and Hispanic origin. Data is from US Census American Community Survey (ACS) 5-year estimates.To see the full list of attributes available in this service, go to the "Data" tab, and choose "Fields" at the top right (in ArcGIS Online). A ‘Null’ entry in the estimate indicates that data for this geographic area cannot be displayed because the number of sample cases is too small (per the U.S. Census).Vintage: 2016-2020ACS Table(s): B03002 (Not all lines of this ACS table are available in this feature layer.)Data downloaded from: Census Bureau's API for American Community Survey Data Preparation: Data table downloaded and joined with Zip Code boundaries in the City of Tempe.Date of Census update: March 17, 2022National Figures: data.census.gov

  11. D

    ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time

    • data.sfgov.org
    • healthdata.gov
    • +1more
    csv, xlsx, xml
    Updated Jan 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Public Health - Population Health Division (2024). ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time [Dataset]. https://data.sfgov.org/Health-and-Social-Services/ARCHIVED-COVID-19-Testing-by-Race-Ethnicity-Over-T/kja3-qsky
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    Jan 12, 2024
    Dataset authored and provided by
    Department of Public Health - Population Health Division
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ethnicity and by date. This dataset represents the daily count of tests collected, and the breakdown of test results (positive, negative, or indeterminate). Tests in this dataset include all those collected from persons who listed San Francisco as their home address at the time of testing. It also includes tests that were collected by San Francisco providers for persons who were missing a locating address. This dataset does not include tests for residents listing a locating address outside of San Francisco, even if they were tested in San Francisco.

    The data were de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected). If a person tested multiple times on the same date, only one test is included from that date. When there are multiple tests on the same date, a positive result, if one exists, will always be selected as the record for the person. If a PCR and antigen test are taken on the same day, the PCR test will supersede. If a person tests multiple times on the same day and the results are all the same (e.g. all negative or all positive) then the first test done is selected as the record for the person.

    The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco.

    When a person gets tested for COVID-19, they may be asked to report information about themselves. One piece of information that might be requested is a person's race and ethnicity. These data are often incomplete in the laboratory and provider reports of the test results sent to the health department. The data can be missing or incomplete for several possible reasons:

    • The person was not asked about their race and ethnicity.
    • The person was asked, but refused to answer.
    • The person answered, but the testing provider did not include the person's answers in the reports.
    • The testing provider reported the person's answers in a format that could not be used by the health department.
    

    For any of these reasons, a person's race/ethnicity will be recorded in the dataset as “Unknown.”

    B. NOTE ON RACE/ETHNICITY The different values for Race/Ethnicity in this dataset are "Asian;" "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" "White;" "Multi-racial;" "Other;" and “Unknown."

    The Race/Ethnicity categorization increases data clarity by emulating the methodology used by the U.S. Census in the American Community Survey. Specifically, persons who identify as "Asian," "Black or African American," "American Indian or Alaska Native," "Native Hawaiian or Other Pacific Islander," "White," "Multi-racial," or "Other" do NOT include any person who identified as Hispanic/Latino at any time in their testing reports that either (1) identified them as SF residents or (2) as someone who tested without a locating address by an SF provider. All persons across all races who identify as Hispanic/Latino are recorded as “"Hispanic or Latino/a, all races." This categorization increases data accuracy by correcting the way “Other” persons were counted. Previously, when a person reported “Other” for Race/Ethnicity, they would be recorded “Unknown.” Under the new categorization, they are counted as “Other” and are distinct from “Unknown.”

    If a person records their race/ethnicity as “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other” for their first COVID-19 test, then this data will not change—even if a different race/ethnicity is reported for this person for any future COVID-19 test. There are two exceptions to this rule. The first exception is if a person’s race/ethnicity value is reported as “Unknown” on their first test and then on a subsequent test they report “Asian;” "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" or "White”, then this subsequent reported race/ethnicity will overwrite the previous recording of “Unknown”. If a person has only ever selected “Unknown” as their race/ethnicity, then it will be recorded as “Unknown.” This change provides more specific and actionable data on who is tested in San Francisco.

    The second exception is if a person ever marks “Hispanic or Latino/a, all races” for race/ethnicity then this choice will always overwrite any previous or future response. This is because it is an overarching category that can include any and all other races and is mutually exclusive with the other responses.

    A person's race/ethnicity will be recorded as “Multi-racial” if they select two or more values among the following choices: “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other.” If a person selects a combination of two or more race/ethnicity answers that includes “Hispanic or Latino/a, all races” then they will still be recorded as “Hispanic or Latino/a, all races”—not as “Multi-racial.”

    C. HOW THE DATASET IS CREATED COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information.

    D. UPDATE PROCESS Updates automatically at 5:00AM Pacific Time each day. Redundant runs are scheduled at 7:00AM and 9:00AM in case of pipeline failure.

    E. HOW TO USE THIS DATASET San Francisco population estimates for race/ethnicity can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).

    Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24, 2020 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.

    In order to track trends over time, a user can analyze this data by sorting or filtering by the "specimen_collection_date" field.

    Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. When there are fewer than 20 positives tests for a given race/ethnicity and time period, the positivity rate is not calculated for the public tracker because rates of small test counts are less reliable.

    Calculating Testing Rates: To calculate the testing rate per 10,000 residents, divide the total number of tests collected (positive, negative, and indeterminate results) for the specified race/ethnicity by the total number of residents who identify as that race/ethnicity (according to the 2016-2020 American Community Survey (ACS) population estimate), then multiply by 10,000. When there are fewer than 20 total tests for a given race/ethnicity and time period, the testing rate is not calculated for the public tracker because rates of small test counts are less reliable.

    Read more about how this data is updated and validated daily: https://sf.gov/information/covid-19-data-questions

    F. CHANGE LOG

    • 1/12/2024 - This dataset will stop updating as of 1/12/2024
    • 6/21/2023 - A small number of additional COVID-19 testing records were released as part of our ongoing data cleaning efforts. An update to the race or ethnicity designation among a subset of testing records was simultaneously released.
    • 1/31/2023 - updated “population_estimate” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
    • 1/31/2023 - renamed column “last_updated_at” to “data_as_of”.
    • 3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census.
    • 2/10/2022 - race/ethnicity categorization was changed. See section NOTE ON RACE/ETHNICITY for additional information.
    • 4/16/2021 - dataset updated to refresh with a five-day data lag.

  12. a

    Racial Ethnic Distribution GIS

    • hub.arcgis.com
    Updated Aug 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Santa Clara County Public Health (2022). Racial Ethnic Distribution GIS [Dataset]. https://hub.arcgis.com/maps/sccphd::racial-ethnic-distribution-gis
    Explore at:
    Dataset updated
    Aug 24, 2022
    Dataset authored and provided by
    Santa Clara County Public Health
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Table contains count and percent distribution of county residents by racial/ethnic categories. Data are summarized at county, city, zip code and census tract of residence. Data are presented for zip codes (ZCTAs) fully within the county. People of color category includes people who identify as Latino, African American, American Indian/Alaska Native, Asian, Pacific Islander, or multi-race. Source: U.S. Census Bureau, 2016-2020 American Community Survey 5-year estimates, Table B03002; data accessed on April 11, 2022 from https://api.census.gov. The 2020 Decennial geographies are used for data summarization.METADATA:notes (String): Lists table title, notes, sourcesgeolevel (String): Level of geographyGEOID (String): Geography IDNAME (String): Name of geographyt_pop (Numeric): Total populationAfrican_American_NH (Numeric): Number of non-Hispanic African Americansp_African_American_NH (Numeric): Percent of non-Hispanic African AmericansAsian_NH (Numeric): Number of non-Hispanic Asiansp_Asian_NH (Numeric): Percent of non-Hispanic AsiansLatino (Numeric): Number of Latinosp_Latino (Numeric): Percent of LatinosWhite_NH (Numeric): Number of non-Hispanic Whitep_White_NH (Numeric): Percent of non-Hispanic Whitepeople_of_color2 (Numeric): Number of people of colorp_poc2 (Numeric): Percent of people of color

  13. FiveThirtyEight Most Common Name Dataset

    • kaggle.com
    zip
    Updated Apr 26, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    FiveThirtyEight (2019). FiveThirtyEight Most Common Name Dataset [Dataset]. https://www.kaggle.com/datasets/fivethirtyeight/fivethirtyeight-most-common-name-dataset/discussion
    Explore at:
    zip(2485070 bytes)Available download formats
    Dataset updated
    Apr 26, 2019
    Dataset authored and provided by
    FiveThirtyEighthttps://abcnews.go.com/538
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    Most Common Name

    This directory contains the code and data behind the story Dear Mona, What’s The Most Common Name In America?

    The main script file is most-common-name.R

    There are four input files:

    And five output files:

    • adjusted-name-combinations-list.csv - Adjusted estimates for the most common full names.
    • adjusted-name-combinations-matrix.csv - The same data from the file adjusted-name-combinations-list.csv but in matrix form. These are the estimates presented in the second (and final) table of the article.
    • independent-name-combinations-by-pop.csv - Matrix of estimates for the top 100 most common first names by top 100 most common surnames. These were calculated using independent odds, and displayed in the first table presented in the article.
    • new-top-firstNames.csv - Final estimated ranking of top first names.
    • new-top-surnames.csv - Final estimated ranking of top surnames.

    Context

    This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using GitHub's API and Kaggle's API.

    This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.

  14. d

    Race and Hispanic Origin - ACS 2019-2023 - Tempe Zip Codes

    • catalog.data.gov
    • performance.tempe.gov
    • +9more
    Updated Aug 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Tempe (2025). Race and Hispanic Origin - ACS 2019-2023 - Tempe Zip Codes [Dataset]. https://catalog.data.gov/dataset/race-and-hispanic-origin-acs-2019-2023-tempe-zip-codes
    Explore at:
    Dataset updated
    Aug 23, 2025
    Dataset provided by
    City of Tempe
    Area covered
    Tempe
    Description

    This layer shows the population broken down by race and Hispanic origin. Data is from US Census American Community Survey (ACS) 5-year estimates.To see the full list of attributes available in this service, go to the "Data" tab, and choose "Fields" at the top right (in ArcGIS Online). A ‘Null’ entry in the estimate indicates that data for this geographic area cannot be displayed because the number of sample cases is too small (per the U.S. Census).Vintage: 2019-2023ACS Table(s): B03002 (Not all lines of this ACS table are available in this feature layer.)Data downloaded from: Census Bureau's API for American Community Survey Data Preparation: Data table was downloaded and joined with Zip Code boundaries in the City of Tempe.Date of Census update: December 12, 2024National Figures: data.census.gov

  15. Ethnic codes as defined by the Health Management Information System.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amrit Banstola; Ashik Banstola (2023). Ethnic codes as defined by the Health Management Information System. [Dataset]. http://doi.org/10.1371/journal.pone.0071311.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Amrit Banstola; Ashik Banstola
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Ethnic codes as defined by the Health Management Information System.

  16. Most Common Name in America

    • kaggle.com
    zip
    Updated Apr 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bojan Tunguz (2021). Most Common Name in America [Dataset]. https://www.kaggle.com/tunguz/most-common-name-in-america
    Explore at:
    zip(107213 bytes)Available download formats
    Dataset updated
    Apr 21, 2021
    Authors
    Bojan Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Most Common Name

    This directory contains the code and data behind the story Dear Mona, What’s The Most Common Name In America?

    The main script file is most-common-name.R

    There are four input files:

    And five output files:

    • adjusted-name-combinations-list.csv - Adjusted estimates for the most common full names.
    • adjusted-name-combinations-matrix.csv - The same data from the file adjusted-name-combinations-list.csv but in matrix form. These are the estimates presented in the second (and final) table of the article.
    • independent-name-combinations-by-pop.csv - Matrix of estimates for the top 100 most common first names by top 100 most common surnames. These were calculated using independent odds, and displayed in the first table presented in the article.
    • new-top-firstNames.csv - Final estimated ranking of top first names.
    • new-top-surnames.csv - Final estimated ranking of top surnames.
  17. 2022 American Community Survey: B01001B | Sex by Age (Black or African...

    • data.census.gov
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2022 American Community Survey: B01001B | Sex by Age (Black or African American Alone) (ACS 1-Year Estimates Detailed Tables) [Dataset]. https://data.census.gov/table/ACSDT1Y2022.B01001B?q=Race%20and%20Ethnicity
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2022
    Area covered
    United States
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, the decennial census is the official source of population totals for April 1st of each decennial year. In between censuses, the Census Bureau's Population Estimates Program produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units for states and counties..Information about the American Community Survey (ACS) can be found on the ACS website. Supporting documentation including code lists, subject definitions, data accuracy, and statistical testing, and a full list of ACS tables and table shells (without estimates) can be found on the Technical Documentation section of the ACS website.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2022 American Community Survey 1-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..The Hispanic origin and race codes were updated in 2020. For more information on the Hispanic origin and race code changes, please visit the American Community Survey Technical Documentation website..The 2022 American Community Survey (ACS) data generally reflect the March 2020 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineations due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on 2020 Census data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.

  18. 2020 American Community Survey: DP05 | ACS DEMOGRAPHIC AND HOUSING ESTIMATES...

    • data.census.gov
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2020 American Community Survey: DP05 | ACS DEMOGRAPHIC AND HOUSING ESTIMATES (ACS 5-Year Estimates Data Profiles) [Dataset]. https://data.census.gov/table/ACSDP5Y2020.DP05
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2020
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, for 2020, the 2020 Census provides the official counts of the population and housing units for the nation, states, counties, cities, and towns. For 2016 to 2019, the Population Estimates Program provides estimates of the population for the nation, states, counties, cities, and towns and intercensal housing unit estimates for the nation, states, and counties..Supporting documentation on code lists, subject definitions, data accuracy, and statistical testing can be found on the American Community Survey website in the Technical Documentation section.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2016-2020 American Community Survey 5-Year Estimates.Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..For more information on understanding race and Hispanic origin data, please see the Census 2010 Brief entitled, Overview of Race and Hispanic Origin: 2010, issued March 2011. (pdf format).The Hispanic origin and race codes were updated in 2020. For more information on the Hispanic origin and race code changes, please visit the American Community Survey Technical Documentation website..The 2016-2020 American Community Survey (ACS) data generally reflect the September 2018 Office of Management and Budget (OMB) delineations of metropolitan and micropolitan statistical areas. In certain instances, the names, codes, and boundaries of the principal cities shown in ACS tables may differ from the OMB delineation lists due to differences in the effective dates of the geographic entities..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on Census 2010 data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.

  19. D

    Race and Ethnicity - Seattle Neighborhoods

    • data.seattle.gov
    • catalog.data.gov
    • +1more
    csv, xlsx, xml
    Updated Oct 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Race and Ethnicity - Seattle Neighborhoods [Dataset]. https://data.seattle.gov/dataset/Race-and-Ethnicity-Seattle-Neighborhoods/r4ar-x7dx
    Explore at:
    xlsx, xml, csvAvailable download formats
    Dataset updated
    Oct 22, 2024
    Area covered
    Seattle
    Description

    Table from the American Community Survey (ACS) 5-year series on race and ethnicity related topics for City of Seattle Council Districts, Comprehensive Plan Growth Areas and Community Reporting Areas. Table includes B03002 Hispanic or Latino Origin by Race, B02008-B02013 Race Alone or in Combination with One or More. Data is pulled from block group tables for the most recent ACS vintage and summarized to the neighborhoods based on block group assignment.


    Table created for and used in the Neighborhood Profiles application.

    Vintages: 2023


    The United States Census Bureau's American Community Survey (ACS):
    This ready-to-use layer can be used within ArcGIS Pro, ArcGIS Online, its configurable apps, dashboards, Story Maps, custom apps, and mobile apps. Data can also be exported for offline workflows. Please cite the Census and ACS when using this data.

    Data Note from the Census:
    Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see Accuracy of the Data). The effect of nonsampling error is not represented in these tables.

    Data Processing Notes:
    • Boundaries come from the US Census TIGER geodatabases, specifically, the National Sub-State Geography Database (named tlgdb(year)a_us_substategeo.gdb). Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census. These are Census boundaries with water and/or coastlines erased for cartographic and mapping purposes. For census tracts, the water cutouts are derived from a subset of the 2020 Areal Hydrography boundaries offered by TIGER. Water bodies and rivers which are 50 million square meters or larger (mid to large sized water bodies) are erased from the tract level boundaries, as well as additional important features. For state and county boundaries, the water and coastlines are derived from the coastlines of the 2020 500k TIGER Cartographic Boundary Shapefiles. These are erased to more accurately portray the coastlines and Great Lakes. The original AWATER and ALAND fields are still available as attributes within the data table (units are square meters).
    • The States layer contains 52 records - all US states,

  20. 2023 American Community Survey: B08105G | Means of Transportation to Work...

    • data.census.gov
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ACS, 2023 American Community Survey: B08105G | Means of Transportation to Work (Two or More Races) (ACS 5-Year Estimates Detailed Tables) [Dataset]. https://data.census.gov/table/ACSDT5Y2023.B08105G?q=Otsego+County,+New+York+Employment&t=Race+and+Ethnicity
    Explore at:
    Dataset provided by
    United States Census Bureauhttp://census.gov/
    Authors
    ACS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2023
    Description

    Although the American Community Survey (ACS) produces population, demographic and housing unit estimates, the decennial census is the official source of population totals for April 1st of each decennial year. In between censuses, the Census Bureau's Population Estimates Program produces and disseminates the official estimates of the population for the nation, states, counties, cities, and towns and estimates of housing units and the group quarters population for states and counties..Information about the American Community Survey (ACS) can be found on the ACS website. Supporting documentation including code lists, subject definitions, data accuracy, and statistical testing, and a full list of ACS tables and table shells (without estimates) can be found on the Technical Documentation section of the ACS website.Sample size and data quality measures (including coverage rates, allocation rates, and response rates) can be found on the American Community Survey website in the Methodology section..Source: U.S. Census Bureau, 2019-2023 American Community Survey 5-Year Estimates.ACS data generally reflect the geographic boundaries of legal and statistical areas as of January 1 of the estimate year. For more information, see Geography Boundaries by Year..Users must consider potential differences in geographic boundaries, questionnaire content or coding, or other methodological issues when comparing ACS data from different years. Statistically significant differences shown in ACS Comparison Profiles, or in data users' own analysis, may be the result of these differences and thus might not necessarily reflect changes to the social, economic, housing, or demographic characteristics being compared. For more information, see Comparing ACS Data..Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see ACS Technical Documentation). The effect of nonsampling error is not represented in these tables..Workers include members of the Armed Forces and civilians who were at work last week..The Hispanic origin and race codes were updated in 2020. For more information on the Hispanic origin and race code changes, please visit the American Community Survey Technical Documentation website..Estimates of urban and rural populations, housing units, and characteristics reflect boundaries of urban areas defined based on 2020 Census data. As a result, data for urban and rural areas from the ACS do not necessarily reflect the results of ongoing urbanization..Explanation of Symbols:- The estimate could not be computed because there were an insufficient number of sample observations. For a ratio of medians estimate, one or both of the median estimates falls in the lowest interval or highest interval of an open-ended distribution. For a 5-year median estimate, the margin of error associated with a median was larger than the median itself.N The estimate or margin of error cannot be displayed because there were an insufficient number of sample cases in the selected geographic area. (X) The estimate or margin of error is not applicable or not available.median- The median falls in the lowest interval of an open-ended distribution (for example "2,500-")median+ The median falls in the highest interval of an open-ended distribution (for example "250,000+").** The margin of error could not be computed because there were an insufficient number of sample observations.*** The margin of error could not be computed because the median falls in the lowest interval or highest interval of an open-ended distribution.***** A margin of error is not appropriate because the corresponding estimate is controlled to an independent population or housing estimate. Effectively, the corresponding estimate has no sampling error and the margin of error may be treated as zero.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
John Snow Labs (2021). US Race and Ethnicity Codes [Dataset]. https://www.johnsnowlabs.com/marketplace/us-race-and-ethnicity-codes/
Organization logo

US Race and Ethnicity Codes

Explore at:
csvAvailable download formats
Dataset updated
Jan 20, 2021
Dataset authored and provided by
John Snow Labs
Area covered
N/A, United States
Description

This dataset contains Race/Ethinicty codes. It is used to enter in patient demographics information.

Search
Clear search
Close search
Google apps
Main menu