95 datasets found

D
ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time
data.sfgov.org
healthdata.gov
+1more
application/rdfxml +5
Updated Jan 12, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Public Health - Population Health Division (2024). ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time [Dataset]. https://data.sfgov.org/Health-and-Social-Services/ARCHIVED-COVID-19-Testing-by-Race-Ethnicity-Over-T/kja3-qsky
Explore at:
xml, csv, json, tsv, application/rssxml, application/rdfxmlAvailable download formats
Dataset updated
Jan 12, 2024
Dataset authored and provided by
Department of Public Health - Population Health Division
License
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Description
A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ethnicity and by date. This dataset represents the daily count of tests collected, and the breakdown of test results (positive, negative, or indeterminate). Tests in this dataset include all those collected from persons who listed San Francisco as their home address at the time of testing. It also includes tests that were collected by San Francisco providers for persons who were missing a locating address. This dataset does not include tests for residents listing a locating address outside of San Francisco, even if they were tested in San Francisco.

The data were de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected). If a person tested multiple times on the same date, only one test is included from that date. When there are multiple tests on the same date, a positive result, if one exists, will always be selected as the record for the person. If a PCR and antigen test are taken on the same day, the PCR test will supersede. If a person tests multiple times on the same day and the results are all the same (e.g. all negative or all positive) then the first test done is selected as the record for the person.

The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco.

When a person gets tested for COVID-19, they may be asked to report information about themselves. One piece of information that might be requested is a person's race and ethnicity. These data are often incomplete in the laboratory and provider reports of the test results sent to the health department. The data can be missing or incomplete for several possible reasons:

• The person was not asked about their race and ethnicity. • The person was asked, but refused to answer. • The person answered, but the testing provider did not include the person's answers in the reports. • The testing provider reported the person's answers in a format that could not be used by the health department.

For any of these reasons, a person's race/ethnicity will be recorded in the dataset as “Unknown.”

B. NOTE ON RACE/ETHNICITY The different values for Race/Ethnicity in this dataset are "Asian;" "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" "White;" "Multi-racial;" "Other;" and “Unknown."

The Race/Ethnicity categorization increases data clarity by emulating the methodology used by the U.S. Census in the American Community Survey. Specifically, persons who identify as "Asian," "Black or African American," "American Indian or Alaska Native," "Native Hawaiian or Other Pacific Islander," "White," "Multi-racial," or "Other" do NOT include any person who identified as Hispanic/Latino at any time in their testing reports that either (1) identified them as SF residents or (2) as someone who tested without a locating address by an SF provider. All persons across all races who identify as Hispanic/Latino are recorded as “"Hispanic or Latino/a, all races." This categorization increases data accuracy by correcting the way “Other” persons were counted. Previously, when a person reported “Other” for Race/Ethnicity, they would be recorded “Unknown.” Under the new categorization, they are counted as “Other” and are distinct from “Unknown.”

If a person records their race/ethnicity as “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other” for their first COVID-19 test, then this data will not change—even if a different race/ethnicity is reported for this person for any future COVID-19 test. There are two exceptions to this rule. The first exception is if a person’s race/ethnicity value is reported as “Unknown” on their first test and then on a subsequent test they report “Asian;” "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" or "White”, then this subsequent reported race/ethnicity will overwrite the previous recording of “Unknown”. If a person has only ever selected “Unknown” as their race/ethnicity, then it will be recorded as “Unknown.” This change provides more specific and actionable data on who is tested in San Francisco.

The second exception is if a person ever marks “Hispanic or Latino/a, all races” for race/ethnicity then this choice will always overwrite any previous or future response. This is because it is an overarching category that can include any and all other races and is mutually exclusive with the other responses.

A person's race/ethnicity will be recorded as “Multi-racial” if they select two or more values among the following choices: “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other.” If a person selects a combination of two or more race/ethnicity answers that includes “Hispanic or Latino/a, all races” then they will still be recorded as “Hispanic or Latino/a, all races”—not as “Multi-racial.”

C. HOW THE DATASET IS CREATED COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information.

D. UPDATE PROCESS Updates automatically at 5:00AM Pacific Time each day. Redundant runs are scheduled at 7:00AM and 9:00AM in case of pipeline failure.

E. HOW TO USE THIS DATASET San Francisco population estimates for race/ethnicity can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).

Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24, 2020 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.

In order to track trends over time, a user can analyze this data by sorting or filtering by the "specimen_collection_date" field.

Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. When there are fewer than 20 positives tests for a given race/ethnicity and time period, the positivity rate is not calculated for the public tracker because rates of small test counts are less reliable.

Calculating Testing Rates: To calculate the testing rate per 10,000 residents, divide the total number of tests collected (positive, negative, and indeterminate results) for the specified race/ethnicity by the total number of residents who identify as that race/ethnicity (according to the 2016-2020 American Community Survey (ACS) population estimate), then multiply by 10,000. When there are fewer than 20 total tests for a given race/ethnicity and time period, the testing rate is not calculated for the public tracker because rates of small test counts are less reliable.

Read more about how this data is updated and validated daily: https://sf.gov/information/covid-19-data-questions

F. CHANGE LOG
1/12/2024 - This dataset will stop updating as of 1/12/2024
6/21/2023 - A small number of additional COVID-19 testing records were released as part of our ongoing data cleaning efforts. An update to the race or ethnicity designation among a subset of testing records was simultaneously released.
1/31/2023 - updated “population_estimate” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
1/31/2023 - renamed column “last_updated_at” to “data_as_of”.
3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census.
2/10/2022 - race/ethnicity categorization was changed. See section NOTE ON RACE/ETHNICITY for additional information.
4/16/2021 - dataset updated to refresh with a five-day data lag.
o
ECIN Replication Package for "Adding Race and Ethnicity to Microeconomic...
openicpsr.org
delimited
Updated May 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Keith Ihlanfeldt; Luke Rodgers; Cynthia Yang (2025). ECIN Replication Package for "Adding Race and Ethnicity to Microeconomic Databases: An Assessment of Alternative Options" [Dataset]. http://doi.org/10.3886/E229541V1
Explore at:
delimitedAvailable download formats
Unique identifier
https://doi.org/10.3886/E229541V1
Dataset updated
May 13, 2025
Dataset provided by
Florida State University
Authors
Keith Ihlanfeldt; Luke Rodgers; Cynthia Yang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Estimating differences between racial/ethnic groups often requires merging demographic variables from one dataset to variables of interest in another. A common method merges Home Mortgage Disclosure Act data to property databases. One alternative is to acquire this information from voter registration files; another is to predict race with a name-based algorithm. Compared to Census data, which method is more representative varies by location and group. We explore the practical implications of each method by using the matched samples in two empirical applications. Researchers can arrive at different conclusions about racial/ethnic disparities depending on the method selected.
Race/Ethnicity of Newly Medi-Cal Eligible Individuals
data.chhs.ca.gov
data.ca.gov
+2more
csv, zip
Updated Mar 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Health Care Services (2025). Race/Ethnicity of Newly Medi-Cal Eligible Individuals [Dataset]. https://data.chhs.ca.gov/dataset/race-ethnicity-of-newly-medi-cal-eligible-individuals
Explore at:
zip, csv(24654)Available download formats
Dataset updated
Mar 19, 2025
Dataset provided by
California Department of Health Care Serviceshttp://www.dhcs.ca.gov/
Authors
Department of Health Care Services
Description
This dataset includes race/ethnicity of newly Medi-Cal eligible individuals who identified their race/ethnicity as Hispanic, White, Other Asian or Pacific Islander, Black, Chinese, Filipino, Vietnamese, Asian Indian, Korean, Alaskan Native or American Indian, Japanese, Cambodian, Samoan, Laotian, Hawaiian, Guamanian, Amerasian, or Other, by reporting period. The race/ethnicity data is from the Medi-Cal Eligibility Data System (MEDS) and includes eligible individuals without prior Medi-Cal Eligibility. This dataset is part of the public reporting requirements set forth in California Welfare and Institutions Code 14102.5.
n
Population by Race/Ethnicity (ACS)
linc.osbm.nc.gov
ncosbm.opendatasoft.com
csv, excel, geojson +1
Updated Nov 1, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2018). Population by Race/Ethnicity (ACS) [Dataset]. https://linc.osbm.nc.gov/explore/dataset/nc-count-by-ethnicity/
Explore at:
geojson, excel, csv, jsonAvailable download formats
Dataset updated
Nov 1, 2018
Description
Percent population by race and Hispanic Origin North Carolina and all counties from the 2012-2016 American Community Survey.
f
Data from: Using First Name Information to Improve Race and Ethnicity...
tandf.figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ioan Voicu (2023). Using First Name Information to Improve Race and Ethnicity Classification [Dataset]. http://doi.org/10.6084/m9.figshare.5813859.v2
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5813859.v2
Dataset updated
May 31, 2023
Dataset provided by
Taylor & Francis
Authors
Ioan Voicu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This article uses a recent first name list to develop an improvement to an existing Bayesian classifier, namely the Bayesian Improved Surname Geocoding (BISG) method, which combines surname and geography information to impute missing race/ethnicity. The new Bayesian Improved First Name Surname Geocoding (BIFSG) method is validated using a large sample of mortgage applicants who self-report their race/ethnicity. BIFSG outperforms BISG, in terms of accuracy and coverage, for all major racial/ethnic categories. Although the overall magnitude of improvement is somewhat small, the largest improvements occur for non-Hispanic Blacks, a group for which the BISG performance is weakest. When estimating the race/ethnicity effects on mortgage pricing and underwriting decisions with regression models, estimation biases from both BIFSG and BISG are very small, with BIFSG generally having smaller biases, and the maximum a posteriori classifier resulting in smaller biases than through use of estimated probabilities. Robustness checks using voter registration data confirm BIFSG's improved performance vis-a-vis BISG and illustrate BIFSG's applicability to areas other than mortgage lending. Finally, I demonstrate an application of the BIFSG to the imputation of missing race/ethnicity in the Home Mortgage Disclosure Act data, and in the process, offer novel evidence that the incidence of missing race/ethnicity information is correlated with race/ethnicity.
U.S. household income percentage distribution 2023, by race and ethnicity
statista.com
Updated Sep 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). U.S. household income percentage distribution 2023, by race and ethnicity [Dataset]. https://www.statista.com/statistics/203207/percentage-distribution-of-household-income-in-the-us-by-ethnic-group/
Explore at:
Dataset updated
Sep 16, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2023
Area covered
United States
Description
In 2023, about 26.9 percent of Asian private households in the U.S. had an annual income of 200,000 U.S. dollars and more. Comparatively, around 13.9 percent of Black households had an annual income under 15,000 U.S. dollars.
l
Census 2021 - Ethnic groups
data.leicester.gov.uk
csv, excel, json
Updated Jun 29, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Census 2021 - Ethnic groups [Dataset]. https://data.leicester.gov.uk/explore/dataset/census-2021-leicester-ethnic-groups/
Explore at:
csv, json, excelAvailable download formats
Dataset updated
Jun 29, 2023
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
The census is undertaken by the Office for National Statistics every 10 years and gives us a picture of all the people and households in England and Wales. The most recent census took place in March of 2021.The census asks every household questions about the people who live there and the type of home they live in. In doing so, it helps to build a detailed snapshot of society. Information from the census helps the government and local authorities to plan and fund local services, such as education, doctors' surgeries and roads.Key census statistics for Leicester are published on the open data platform to make information accessible to local services, voluntary and community groups, and residents. There is also a dashboard published showcasing various datasets from the census allowing users to view data for Leicester and compare this with national statistics.Further information about the census and full datasets can be found on the ONS website - https://www.ons.gov.uk/census/aboutcensus/censusproductsEthnicityThis dataset provides Census 2021 estimates that classify usual residents in England and Wales by ethnic group. The estimates are as at Census Day, 21 March 2021.Definition: The ethnic group that the person completing the census feels they belong to. This could be based on their culture, family background, identity or physical appearance.Respondents could choose one out of 19 tick-box response categories, including write-in response options.This dataset includes data relating to Leicester City and England overall.
N
Manns Choice, PA Non-Hispanic Population Breakdown By Race Dataset:...
neilsberg.com
csv, json
Updated Feb 21, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neilsberg Research (2025). Manns Choice, PA Non-Hispanic Population Breakdown By Race Dataset: Non-Hispanic Population Counts and Percentages for 7 Racial Categories as Identified by the US Census Bureau // 2025 Edition [Dataset]. https://www.neilsberg.com/research/datasets/99f29915-ef82-11ef-9e71-3860777c1fe6/
Explore at:
csv, jsonAvailable download formats
Dataset updated
Feb 21, 2025
Dataset authored and provided by
Neilsberg Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Pennsylvania, Manns Choice
Variables measured
Non-Hispanic Asian Population, Non-Hispanic Black Population, Non-Hispanic White Population, Non-Hispanic Some other race Population, Non-Hispanic Two or more races Population, Non-Hispanic American Indian and Alaska Native Population, Non-Hispanic Native Hawaiian and Other Pacific Islander Population, Non-Hispanic Asian Population as Percent of Total Non-Hispanic Population, Non-Hispanic Black Population as Percent of Total Non-Hispanic Population, Non-Hispanic White Population as Percent of Total Non-Hispanic Population, and 4 more
Measurement technique
The data presented in this dataset is derived from the latest U.S. Census Bureau American Community Survey (ACS) 2017-2021 5-Year Estimates. To measure the two variables, namely (a) Non-Hispanic population and (b) population as a percentage of the total Non-Hispanic population, we initially analyzed and categorized the data for each of the racial categories idetified by the US Census Bureau. It is ensured that the population estimates used in this dataset pertain exclusively to the identified racial categories, and are part of Non-Hispanic classification. For further information regarding these estimates, please feel free to reach out to us via email at research@neilsberg.com.
Dataset funded by
Neilsberg Research
Description
About this dataset

Context

The dataset tabulates the Non-Hispanic population of Manns Choice by race. It includes the distribution of the Non-Hispanic population of Manns Choice across various race categories as identified by the Census Bureau. The dataset can be utilized to understand the Non-Hispanic population distribution of Manns Choice across relevant racial categories.

Key observations

Of the Non-Hispanic population in Manns Choice, the largest racial group is White alone with a population of 265 (91.70% of the total Non-Hispanic population).

Content

When available, the data consists of estimates from the U.S. Census Bureau American Community Survey (ACS) 2019-2023 5-Year Estimates.

Racial categories include:

White

Black or African American

American Indian and Alaska Native

Asian

Native Hawaiian and Other Pacific Islander

Some other race

Two or more races (multiracial)

Variables / Data Columns

Race: This column displays the racial categories (for Non-Hispanic) for the Manns Choice

Population: The population of the racial category (for Non-Hispanic) in the Manns Choice is shown in this column.

% of Total Population: This column displays the percentage distribution of each race as a proportion of Manns Choice total Non-Hispanic population. Please note that the sum of all percentages may not equal one due to rounding of values.

Good to know

Margin of Error

Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

Custom data

If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

Inspiration

Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

Recommended for further research

This dataset is a part of the main dataset for Manns Choice Population by Race & Ethnicity. You can refer the same here
Total fertility rate by ethnicity U.S. 2022
statista.com
Updated Oct 16, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Total fertility rate by ethnicity U.S. 2022 [Dataset]. https://www.statista.com/statistics/226292/us-fertility-rates-by-race-and-ethnicity/
Explore at:
Dataset updated
Oct 16, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2022
Area covered
United States
Description
Native Hawaiian and Pacific Islander women had the highest fertility rate of any ethnicity in the United States in 2022, with about 2,237.5 births per 1,000 women. The fertility rate for all ethnicities in the U.S. was 1,656.5 births per 1,000 women. What is the total fertility rate? The total fertility rate is an estimation of the number of children who would theoretically be born per 1,000 women through their childbearing years (generally considered to be between the ages of 15 and 44) according to age-specific fertility rates. The fertility rate is different from the birth rate, in that the birth rate is the number of births in relation to the population over a specific period of time. Fertility rates around the world Fertility rates around the world differ on a country-by-country basis, and more industrialized countries tend to see lower fertility rates. For example, Niger topped the list of the countries with the highest fertility rates, and Taiwan had the lowest fertility rate.
V
Population of Virginia localities (total, by race, and Hispanic/Latino...
data.virginia.gov
csv
Updated Feb 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Other (2024). Population of Virginia localities (total, by race, and Hispanic/Latino ethnicity), 2010-2018 [Dataset]. https://data.virginia.gov/dataset/population-of-virginia-localities-total-by-race-and-hispanic-latino-ethnicity-2010-2018
Explore at:
csvAvailable download formats
Dataset updated
Feb 3, 2024
Dataset authored and provided by
Other
Description
This table lists the overall population of each Virginia locality, as well as a breakdown of each locality's population by race. Each column's description explains the race identification. In addition, for each locality, there is a column for those who identified their ethnicity as "Hispanic or Latino Origin."

Please see note from the Census Reporter regarding race in Census data: Census data about race is complicated. While casual language and even much reporting proceeds as if each person had exactly one race, the Census Bureau allows each person to select as many as six race options, one of which is simply "some other race." Furthermore, "hispanic/latino" is not a race, but a characteristic tracked independently. Note that hispanic respondents disproportionately choose "some other race alone": nationwide, more than 25% of hispanics make that choice, compared to a fraction of a percent of non-hispanics. (https://censusreporter.org/topics/race-hispanic/)
a
Healthcare Worker Migration, New Mexico, 2021
arc-gis-hub-home-arcgishub.hub.arcgis.com
Updated May 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
New Mexico Community Data Collaborative (2023). Healthcare Worker Migration, New Mexico, 2021 [Dataset]. https://arc-gis-hub-home-arcgishub.hub.arcgis.com/maps/NMCDC::healthcare-worker-migration-new-mexico-2021
Explore at:
Dataset updated
May 3, 2023
Dataset authored and provided by
New Mexico Community Data Collaborative
Area covered

Description
Dataset, GDB, and Online Map created by Renee Haley, NMCDC, May 2023 DATA ACQUISITION PROCESS

Scope and purpose of project: New Mexico is struggling to maintain its healthcare workforce, particularly in Rural areas. This project was undertaken with the intent of looking at flows of healthcare workers into and out of New Mexico at the most granular geographic level possible. This dataset, in combination with others (such as housing cost and availability data) may help us understand where our healthcare workforce is relocating and why.

The most relevant and detailed data on workforce indicators in the United States is housed by the Census Bureau's Longitudinal Employer-Household Dynamics, LEHD, System. Information on this system is available here:

https://lehd.ces.census.gov/

The Job-to-Job flows explorer within this system was used to download the data. Information on the J2J explorer can ve found here:

https://j2jexplorer.ces.census.gov/explore.html#1432012

The dataset was built from data queried with the LED Extraction Tool, which allows for the query of more intersectional and detailed data than the explorer. This is a link to the LED extraction tool:

https://ledextract.ces.census.gov/

The geographies used are US Metro areas as determined by the Census, (N=389). The shapefile is named lehd_shp_gb.zip, and can be downloaded under this section of the following webpage: 5.5. Job-to-Job Flow Geographies, 5.5.1. Metropolitan (Complete). A link to the download site is available below:

https://lehd.ces.census.gov/data/schema/j2j_latest/lehd_shapefiles.html

DATA CLEANING PROCESS

This dataset was built from 8 non intersectional datasets downloaded from the LED Extraction Tool.

Separate datasets were downloaded in order to obtain detailed information on the race, ethnicity, and educational attainment levels of healthcare workers and where they are migrating.

Datasets included information for the four separate quarters of 2021. It was not possible to download annual data, only quarterly. Quarterly data was summed in a later step to derive annual totals for 2021.

4 datasets for healthcare workers moving OUT OF New Mexico, with details on race, ethnicity, and educational attainment, were downloaded. 1 contained information on educational attainment, 2 contained information on 7 racial categories identifying as non- Hispanic, 3 contained information on those same 7 categories also identifying as Hispanic, and 4 contained information for workers identifying as white and Hispanic.

4 datasets for healthcare worker moving INTO New Mexico, with details on race, ethnicity, and educational attainment, were downloaded with the same details outlined above.

Each dataset was cleaned according to Data Template which kept key attributes and discarded excess information. Within each dataset, the J2J Indicators reflecting 6 different types of job migration were totaled in order to simplify analysis, as this information was not needed in detail.

After cleaning, each set of 4 datasets for workers moving INTO New Mexico were joined. The process was repeated for workers moving OUT OF New Mexico. This resulted 2 main datasets.

These 2 main datasets still listed all of the variables by each quarter of 2021. Because of this the data was split in JMP, so that attributes of educational attainment, race and ethnicity, of workers migrating by quarter were moved from rows to columns. After this, summary columns for the year of 2021 were derived. This resulted in totals columns for workers identifying as: 6 separate races and all ethnicities, all races and Hispanic, white-Hispanic, and workers of 6 different education levels, reflecting how many workers of each indicator migrated to and from metro areas in New Mexico in 2021.

The data split transposed duplicate rows reflecting differing worker attributes within the same metro area, resulting in one row for each metro area and reflecting the attributes in columns, thus resulting in a mappable dataset.

The 2 datasets were joined (on Metro Area) resulting in one master file containing information on healthcare workers entering and leaving New Mexico.

Rows (N=389) reflect all of the metro areas across the US, and each state. Rows include the 5 metro areas within New Mexico, and New Mexico State.

Columns (N=99) contain information on worker race, ethnicity and educational attainment, specific to each metro area in New Mexico.

78 of these rows reflect workers of specific attributes moving OUT OF the 5 specific Metro Areas in New Mexico and totals for NM State. This level of detail is intended for analyzing who is leaving what area of New Mexico, where they are going to, and why.

13 Columns reflect each worker attribute for healthcare workers moving INTO New Mexico by race, ethnicity and education level. Because all 5 metro areas and New Mexico state are contained in the rows, this information for incoming workers is available by metro area and at the state level - there is less possability for mapping these attributes since it was not realistic or possible to create a dataset reflecting all of these variables for every healthcare worker from every metro area in the US also coming into New Mexico (that dataset would have over 1,000 columns and be unmappable). Therefore this dataset is easier to utilize in looking at why workers are leaving the state but also includes detailed information on who is coming in.

The remaining 8 columns contain geographic information.

GIS AND MAPPING PROCESS

The master file was opened in Arc GIS Pro and the Shapefile of US Metro Areas was also imported

The excel file was joined to the shapefile by Metro Area Name as they matched exactly

The resulting layer was exported as a GDB in order to retain null values which would turn to zeros if exported as a shapefile.

This GDB was uploaded to Arc GIS Online, Aliases were inserted as column header names, and the layer was visualized as desired.

SYSTEMS USED

MS Excel was used for data cleaning, summing NM state totals, and summing quarterly to annual data.

JMP was used to transpose, join, and split data.

ARC GIS Desktop was used to create the shapefile uploaded to NMCDC's online platform.

VARIABLE AND RECODING NOTES

Summary of variables selected for datasets downloaded focused on educational attainment:

J2J Flows by Educational Attainment

Summary of variables selected for datasets downloaded focused on race and ethnicity:

J2J Flows by Race and Ethnicity

Note: Variables in Datasets 1 through 4 downloaded twice, once for workers coming into New Mexico and once for those leaving NM. VARIABLE: LEHD VARIABLE DEFINITION LEHD VARIABLE NOTES DETAILS OR URL FOR RAW DATA DOWNLOAD

Geography Type - State Origin and Destination State

Data downloaded for worker migration into and out of all US States

Geography Type - Metropolitan Areas Origin and Dest Metro Area

Data downloaded for worker migration into and out of all US Metro Areas

NAICS sectors North American Industry Classification System Under Firm Characteristics Only downloaded for Healthcare and Social Assistance Sectors

Other Firm Characteristics No Firm Age / Size Detail Under Firm Characteristics Downloaded data on all firm ages, sizes, and other details.

Worker Characteristics Education, Race, Ethnicity

Non Intersectional data aside from Race / Ethnicity data.

Sex Gender

0 - All Sexes Selected

Age Age

A00 All Ages (14-99)

Education Education Level E0, E1, E2, E3, 34, E5 E0 - All Education Categories, E1 - Less than high school, E2 - High school or equivalent, no college, E3 - Some college or Associate’s degree, E4 - Bachelor's degree or advanced degree, E5 - Educational attainment not available (workers aged 24 or younger)

Dataset 1 All Education Levels, E1, E2, E3, E4, and E5

RACE

A0, A1, A2, A3, A4, A5 OPTIONS: A0 All Races, A1 White Alone, A2 Black or African American Alone, A3 American Indian or Alaska Native Alone, A4 Asian Alone, A5 Native Hawaiian or Other Pacific Islander Alone, SDA7 Two or More Race Groups

ETHNICITY

A0, A1, A2 OPTIONS: A0 All Ethnicities, A1 Not Hispanic or Latino, A2 Hispanic or Latino

Dataset 2 All Races (A0) and All Ethnicities (A0)

Dataset 3 6 Races (A1 through A5) and All Ethnicities (A0)

Dataset 4 White (A1) and Hispanic or Latino (A1)

Quarter Quarter and Year

Data from all quarters of 2021 to sum into annual numbers; yearly data was not available

Employer type Sector: Private or Governmental

Query included all healthcare sector workflows from all employer types and firm sizes from every quarter of 2021

J2J indicator categories Detailed types of job migration

All options were selected for all datasets and totaled: AQHire, AQHireS, EE, EES, J2J, J2JS. Counts were selected vs. earnings, and data was not seasonally adjusted (unavailable).

NOTES AND RESOURCES

The following resources and documentation were used to navigate the LEHD and J2J Worker Flows system and to answer questions about variables:

https://lehd.ces.census.gov/data/schema/j2j_latest/lehd_public_use_schema.html

https://www.census.gov/history/www/programs/geography/metropolitan_areas.html

https://lehd.ces.census.gov/data/schema/j2j_latest/lehd_csv_naming.html

Statewide (New
England and Wales Census 2021 - Ethnic group by age and sex
statistics.ukdataservice.ac.uk
xlsx
Updated Jan 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics; National Records of Scotland; Northern Ireland Statistics and Research Agency; UK Data Service. (2023). England and Wales Census 2021 - Ethnic group by age and sex [Dataset]. https://statistics.ukdataservice.ac.uk/dataset/england-and-wales-census-2021-ethnic-group-by-age-and-sex
Explore at:
xlsxAvailable download formats
Dataset updated
Jan 24, 2023
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
Northern Ireland Statistics and Research Agency
UK Data Servicehttps://ukdataservice.ac.uk/
Authors
Office for National Statistics; National Records of Scotland; Northern Ireland Statistics and Research Agency; UK Data Service.
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Area covered
Wales, England
Description
These datasets provide a breakdown of ethnic group by age and sex, ethnic group by age and ethnic group by sex

Information from Census 2021 on the sex and age characteristics of ethnic groups and how this has changed since 2011 in England and Wales.

Since 1991, the census for England and Wales has included a question about ethnic group.

In 2021, the ethnic group question had two stages. Firstly, a person identified through one of the following five high-level ethnic groups:

"Asian, Asian British, Asian Welsh"

"Black, Black British, Black Welsh, Caribbean or African"

"Mixed or Multiple ethnic groups"

"White"

"Other ethnic group"

Secondly, a person identifies through 1 of the 19 available response options, which include categories with write-in response options.
D
ARCHIVED: Mpox Vaccinations Given to SF Residents by Demographics
data.sfgov.org
healthdata.gov
+2more
application/rdfxml +5
Updated Jan 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). ARCHIVED: Mpox Vaccinations Given to SF Residents by Demographics [Dataset]. https://data.sfgov.org/Health-and-Social-Services/ARCHIVED-Mpox-Vaccinations-Given-to-SF-Residents-b/fk8q-nu3s
Explore at:
csv, json, application/rdfxml, application/rssxml, tsv, xmlAvailable download formats
Dataset updated
Jan 1, 2023
Area covered
San Francisco
Description
In early February 2024, we will be retiring the Mpox Vaccinations Given to SF Residents by Demographics dataset. This dataset will be archived and no longer update. A historic record of this data will remain available.

A. SUMMARY This dataset represents doses of mpox vaccine (JYNNEOS) administered in California to residents of San Francisco ages 18 years or older. This dataset only includes doses of the JYNNEOS vaccine given on or after 5/1/2022. All vaccines given to people who live in San Francisco are included, no matter where the vaccination took place. The data are broken down by multiple demographic stratifications.

B. HOW THE DATASET IS CREATED Information on doses administered to those who live in San Francisco is from the California Immunization Registry (CAIR2), run by the California Department of Public Health (CDPH). Information on individuals’ city of residence, age, race, ethnicity, and sex are recorded in CAIR2 and are self-reported at the time of vaccine administration. Because CAIR2 does not include information on sexual orientation, we pull information from the San Francisco Department of Public Health’s Epic Electronic Health Record (EHR). The populations represented in our Epic data and the CAIR2 data are different. Epic data only include vaccinations administered at SFDPH managed sites to SF residents.

Data notes for population characteristic types are listed below.

Age * Data only include individuals who are 18 years of age or older.

Race/ethnicity * The response option "Other Race" is categorized by the data source system, and the response option "Unknown" refers to a lack of data.

Sex * The response option "Other" is categorized by the source system, and the response option "Unknown" refers to a lack of data.

Sexual orientation * The response option “Unknown/Declined” refers to a lack of data or individuals who reported multiple different sexual orientations during their most recent interaction with SFDPH.

For convenience, we provide the 2020 5-year American Community Survey population estimates.

C. UPDATE PROCESS Updated daily via automated process.

D. HOW TO USE THIS DATASET This dataset includes many different types of demographic groups. Filter the “demographic_group” column to explore a topic area. Then, the “demographic_subgroup” column shows each group or category within that topic area and the total count of doses administered to that population subgroup.

E. CHANGE LOG
UPDATE 1/3/2023: Due to low case numbers, this page will no longer include vaccinations after 12/31/2022.
Ethnic group by National identity (England and Wales) 2011
statistics.ukdataservice.ac.uk
csv, zip
Updated Sep 20, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Office for National Statistics; National Records of Scotland; Northern Ireland Statistics and Research Agency; UK Data Service. (2022). Ethnic group by National identity (England and Wales) 2011 [Dataset]. https://statistics.ukdataservice.ac.uk/dataset/ethnic-group-national-identity-england-and-wales-2011
Explore at:
csv, zipAvailable download formats
Dataset updated
Sep 20, 2022
Dataset provided by
Office for National Statisticshttp://www.ons.gov.uk/
Northern Ireland Statistics and Research Agency
UK Data Servicehttps://ukdataservice.ac.uk/
Authors
Office for National Statistics; National Records of Scotland; Northern Ireland Statistics and Research Agency; UK Data Service.
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Area covered
Wales, England
Description
Dataset population: Persons

Ethnic group

Ethnic group classifies people according to their own perceived ethnic group and cultural background.

This topic contains ethnic group write-in responses without reference to the five broad ethnic group categories, e.g. all Irish people, irrespective of whether they are White, Mixed/multiple ethnic groups, Asian/Asian British, Black/African/Caribbean/Black British or Other ethnic group, are in the 'Irish' response category. This topic was created as part of the commissioned table processing.

National identity

A person's national identity is a self-determined assessment of their own identity with respect to the country or countries with which they feel an affiliation. This assessment of identity is not dependent on legal nationality or ethnic group.

The national identity question included six tick box responses:

English

Welsh

Scottish

Northern Irish

British

Other

Where a person ticked 'Other' they were asked to write in the name of the country. People were asked to tick all options that they felt applied to them. This means that in results relating to national identity people may be classified with a single national identity or a combination of identities.
c
Poverty Status by Town - Datasets - CTData.org
data.ctdata.org
Updated Mar 16, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Poverty Status by Town - Datasets - CTData.org [Dataset]. http://data.ctdata.org/dataset/poverty-status-by-town
Explore at:
Dataset updated
Mar 16, 2016
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Census Bureau determines that a person is living in poverty when his or her total household income compared with the size and composition of the household is below the poverty threshold. The Census Bureau uses the federal government's official definition of poverty to determine the poverty threshold. Beginning in 2000, individuals were presented with the option to select one or more races. In addition, the Census asked individuals to identify their race separately from identifying their Hispanic origin. The Census has published individual tables for the races and ethnicities provided as supplemental information to the main table that does not dissaggregate by race or ethnicity. Race categories include the following - White, Black or African American, American Indian or Alaska Native, Asian, Native Hawaiian or Other Pacific Islander, Some other race, and Two or more races. We are not including specific combinations of two or more races as the counts of these combinations are small. Ethnic categories include - Hispanic or Latino and White Non-Hispanic. This data comes from the American Community Survey (ACS) 5-Year estimates, table B17001. The ACS collects these data from a sample of households on a rolling monthly basis. ACS aggregates samples into one-, three-, or five-year periods. CTdata.org generally carries the five-year datasets, as they are considered to be the most accurate, especially for geographic areas that are the size of a county or smaller.Poverty status determined is the denominator for the poverty rate. It is the population for which poverty status was determined so when poverty is calculated they exclude institutionalized people, people in military group quarters, people in college dormitories, and unrelated individuals under 15 years of age.Below poverty level are households as determined by the thresholds based on the criteria of looking at household size, Below poverty level are households as determined by the thresholds based on the criteria of looking at household size, number of children, and age of householder.number of children, and age of householder.
g
Gilbert Demographics
data.gilbertaz.gov
performance-management-tog.hub.arcgis.com
+1more
Updated Sep 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gilbert Demographics [Dataset]. https://data.gilbertaz.gov/datasets/gilbert-demographics
Explore at:
Dataset updated
Sep 21, 2020
Dataset authored and provided by
Gilbert, Arizona
License
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Area covered
Gilbert
Description
A data set of all employees that have previously worked for, currently work for, or who were offered employment by the Town of Gilbert highlighting demographics. This data set contains the following information for each individual. Names and any identifiable information have been removed from this data set.ID - A unique identifier for each record. This ID is not the employee ID of the individual.Department - The department that the individual works in and is assigned to in the organization.Division - The division within the main department in which the individual works and is assigned to.Organization - The internal organization or work group in which the individual works.Active Status Code - Whether the individual is currently active in the organization. An inactive employee may have previously been employed by Gilbert or may have been offered employment but never hired. Inactive employees are listed as "I" and active employees are listed as "A".Gilbert Resident - Whether the individual's primary residence is in Gilbert. Gilbert residents are listed as "Y" while all others are listed simply as "N".Employee Status - The type of position and status of the individual. Possible options for Employee Status include "Elected", "Full Time Sworn", "Full Time Non-Sworn", "Limited Term", "Part Time 0.5 Non-Benefited", "Part Time 0.75 Benefited", and "Seasonal".Degree Code - The highest level of educational degree attained by the individual. Options are "Associate", "Bachelor's", "Doctorate", "Elementary", "GED", "High School", "Juris Doctor", "Master's", "Master of Laws" or blank (if the individual chose not to respond)."Ethnicity" - The self-identified race or ethnicity of the individual. Possible choices are "Asian", "Black", "Hispanic", "Native American", "Other", "White", or "N/A" (if the individual was offered employment but never hired). Gilbert does not currently differentiate race from ethnicity when hiring.Age Group - The age group to which the individual belongs. Age groups include "Under 18", "18-24", "25-34", "35-44", "45-54", "55-64", "65+", and "N/A" (if the individual was offered employment but never hired).Gender - The self-identified gender of the individual. Genders in the data include "Female", "Male", and "N/A" (if the individual was offered employment but never hired).This data set is updated on the 15th of every month and the last day of every month.
s
Data from: Employment by occupation
ethnicity-facts-figures.service.gov.uk
csv
Updated Jul 27, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Employment by occupation [Dataset]. https://www.ethnicity-facts-figures.service.gov.uk/work-pay-and-benefits/employment/employment-by-occupation/latest
Explore at:
csv(309 KB)Available download formats
Dataset updated
Jul 27, 2022
Dataset authored and provided by
Race Disparity Unit
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Area covered
United Kingdom
Description
39.8% of workers from the Indian ethnic group were in 'professional' jobs in 2021 – the highest percentage out of all ethnic groups in this role.
t
Spanish TEDS Standard Demographic Questions
teds.tucsonaz.gov
Updated Mar 14, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
City of Tucson (2024). Spanish TEDS Standard Demographic Questions [Dataset]. https://teds.tucsonaz.gov/documents/6c12141f86494172b393c3de90348fcc
Explore at:
Dataset updated
Mar 14, 2024
Dataset authored and provided by
City of Tucson
Area covered

Description
Includes questions written in Spanish pertaining to: race & ethnicitygenderagetribal affiliationdisabilityincomelanguagelocation
Ethnic or cultural origin by gender and age: Canada, provinces and...
www150.statcan.gc.ca
open.canada.ca
Updated Oct 26, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Government of Canada, Statistics Canada (2022). Ethnic or cultural origin by gender and age: Canada, provinces and territories and census subdivisions with a population 5,000 or more [Dataset]. http://doi.org/10.25318/9810035801-eng
Explore at:
Unique identifier
https://doi.org/10.25318/9810035801-eng
Dataset updated
Oct 26, 2022
Dataset provided by
Statistics Canadahttps://statcan.gc.ca/en
Area covered
Canada
Description
Data on ethnic or cultural origin by gender and age for the population in private households in Canada, provinces and territories, and census subdivisions with 5,000-plus population.
s
Socioeconomic status
ethnicity-facts-figures.service.gov.uk
csv
Updated Jun 13, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Race Disparity Unit (2025). Socioeconomic status [Dataset]. https://www.ethnicity-facts-figures.service.gov.uk/uk-population-by-ethnicity/demographics/socioeconomic-status/latest
Explore at:
csv(638 KB)Available download formats
Dataset updated
Jun 13, 2025
Dataset authored and provided by
Race Disparity Unit
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Area covered
England and Wales
Description
In 2021, 20.1% of people from the Indian ethnic group were in higher managerial and professional occupations – the highest percentage out of all ethnic groups in this socioeconomic group.

Facebook

Twitter

Click to copy link

Link copied

Cite

Department of Public Health - Population Health Division (2024). ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time [Dataset]. https://data.sfgov.org/Health-and-Social-Services/ARCHIVED-COVID-19-Testing-by-Race-Ethnicity-Over-T/kja3-qsky

ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time

Explore at:

xml, csv, json, tsv, application/rssxml, application/rdfxmlAvailable download formats

Dataset updated

Jan 12, 2024

Dataset authored and provided by

Department of Public Health - Population Health Division

License

ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically

Description

A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ethnicity and by date. This dataset represents the daily count of tests collected, and the breakdown of test results (positive, negative, or indeterminate). Tests in this dataset include all those collected from persons who listed San Francisco as their home address at the time of testing. It also includes tests that were collected by San Francisco providers for persons who were missing a locating address. This dataset does not include tests for residents listing a locating address outside of San Francisco, even if they were tested in San Francisco.

The data were de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected). If a person tested multiple times on the same date, only one test is included from that date. When there are multiple tests on the same date, a positive result, if one exists, will always be selected as the record for the person. If a PCR and antigen test are taken on the same day, the PCR test will supersede. If a person tests multiple times on the same day and the results are all the same (e.g. all negative or all positive) then the first test done is selected as the record for the person.

The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco.

When a person gets tested for COVID-19, they may be asked to report information about themselves. One piece of information that might be requested is a person's race and ethnicity. These data are often incomplete in the laboratory and provider reports of the test results sent to the health department. The data can be missing or incomplete for several possible reasons:

• The person was not asked about their race and ethnicity.
• The person was asked, but refused to answer.
• The person answered, but the testing provider did not include the person's answers in the reports.
• The testing provider reported the person's answers in a format that could not be used by the health department.

For any of these reasons, a person's race/ethnicity will be recorded in the dataset as “Unknown.”

B. NOTE ON RACE/ETHNICITY The different values for Race/Ethnicity in this dataset are "Asian;" "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" "White;" "Multi-racial;" "Other;" and “Unknown."

The Race/Ethnicity categorization increases data clarity by emulating the methodology used by the U.S. Census in the American Community Survey. Specifically, persons who identify as "Asian," "Black or African American," "American Indian or Alaska Native," "Native Hawaiian or Other Pacific Islander," "White," "Multi-racial," or "Other" do NOT include any person who identified as Hispanic/Latino at any time in their testing reports that either (1) identified them as SF residents or (2) as someone who tested without a locating address by an SF provider. All persons across all races who identify as Hispanic/Latino are recorded as “"Hispanic or Latino/a, all races." This categorization increases data accuracy by correcting the way “Other” persons were counted. Previously, when a person reported “Other” for Race/Ethnicity, they would be recorded “Unknown.” Under the new categorization, they are counted as “Other” and are distinct from “Unknown.”

If a person records their race/ethnicity as “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other” for their first COVID-19 test, then this data will not change—even if a different race/ethnicity is reported for this person for any future COVID-19 test. There are two exceptions to this rule. The first exception is if a person’s race/ethnicity value is reported as “Unknown” on their first test and then on a subsequent test they report “Asian;” "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" or "White”, then this subsequent reported race/ethnicity will overwrite the previous recording of “Unknown”. If a person has only ever selected “Unknown” as their race/ethnicity, then it will be recorded as “Unknown.” This change provides more specific and actionable data on who is tested in San Francisco.

The second exception is if a person ever marks “Hispanic or Latino/a, all races” for race/ethnicity then this choice will always overwrite any previous or future response. This is because it is an overarching category that can include any and all other races and is mutually exclusive with the other responses.

A person's race/ethnicity will be recorded as “Multi-racial” if they select two or more values among the following choices: “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other.” If a person selects a combination of two or more race/ethnicity answers that includes “Hispanic or Latino/a, all races” then they will still be recorded as “Hispanic or Latino/a, all races”—not as “Multi-racial.”

C. HOW THE DATASET IS CREATED COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information.

D. UPDATE PROCESS Updates automatically at 5:00AM Pacific Time each day. Redundant runs are scheduled at 7:00AM and 9:00AM in case of pipeline failure.

E. HOW TO USE THIS DATASET San Francisco population estimates for race/ethnicity can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).

Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24, 2020 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.

In order to track trends over time, a user can analyze this data by sorting or filtering by the "specimen_collection_date" field.

Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. When there are fewer than 20 positives tests for a given race/ethnicity and time period, the positivity rate is not calculated for the public tracker because rates of small test counts are less reliable.

Calculating Testing Rates: To calculate the testing rate per 10,000 residents, divide the total number of tests collected (positive, negative, and indeterminate results) for the specified race/ethnicity by the total number of residents who identify as that race/ethnicity (according to the 2016-2020 American Community Survey (ACS) population estimate), then multiply by 10,000. When there are fewer than 20 total tests for a given race/ethnicity and time period, the testing rate is not calculated for the public tracker because rates of small test counts are less reliable.

Read more about how this data is updated and validated daily: https://sf.gov/information/covid-19-data-questions

F. CHANGE LOG

1/12/2024 - This dataset will stop updating as of 1/12/2024
6/21/2023 - A small number of additional COVID-19 testing records were released as part of our ongoing data cleaning efforts. An update to the race or ethnicity designation among a subset of testing records was simultaneously released.
1/31/2023 - updated “population_estimate” column to reflect the 2020 Census Bureau American Community Survey (ACS) San Francisco Population estimates.
1/31/2023 - renamed column “last_updated_at” to “data_as_of”.
3/23/2022 - ‘Native American’ changed to ‘American Indian or Alaska Native’ to align with the census.
2/10/2022 - race/ethnicity categorization was changed. See section NOTE ON RACE/ETHNICITY for additional information.
4/16/2021 - dataset updated to refresh with a five-day data lag.

Clear search

Close search

Google apps

Main menu

ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time

ECIN Replication Package for "Adding Race and Ethnicity to Microeconomic...

Race/Ethnicity of Newly Medi-Cal Eligible Individuals

Population by Race/Ethnicity (ACS)

Data from: Using First Name Information to Improve Race and Ethnicity...

U.S. household income percentage distribution 2023, by race and ethnicity

Census 2021 - Ethnic groups

Manns Choice, PA Non-Hispanic Population Breakdown By Race Dataset:...

About this dataset

Content

Inspiration

Recommended for further research

Total fertility rate by ethnicity U.S. 2022

Population of Virginia localities (total, by race, and Hispanic/Latino...

Healthcare Worker Migration, New Mexico, 2021

England and Wales Census 2021 - Ethnic group by age and sex

ARCHIVED: Mpox Vaccinations Given to SF Residents by Demographics

Ethnic group by National identity (England and Wales) 2011

Poverty Status by Town - Datasets - CTData.org

Gilbert Demographics

Data from: Employment by occupation

Spanish TEDS Standard Demographic Questions

Ethnic or cultural origin by gender and age: Canada, provinces and...

Socioeconomic status

ARCHIVED: COVID-19 Testing by Race/Ethnicity Over TimeSee More Versions

ARCHIVED: COVID-19 Testing by Race/Ethnicity Over Time