37 datasets found

COVID-19 Case Surveillance Public Use Data
data.cdc.gov
data.virginia.gov
+7more
csv, xlsx, xml
Updated Jul 9, 2024
Cite
CDC Data, Analytics and Visualization Task Force (2024). COVID-19 Case Surveillance Public Use Data [Dataset]. https://data.cdc.gov/widgets/vbim-akqf
Available download formats: xml, xlsx, csv
Dataset updated
Jul 9, 2024
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Authors
CDC Data, Analytics and Visualization Task Force
License
https://www.usa.gov/government-works
Description
Note: Reporting of new COVID-19 Case Surveillance data will be discontinued July 1, 2024, to align with the process of removing SARS-CoV-2 infections (COVID-19 cases) from the list of nationally notifiable diseases. Although these data will continue to be publicly available, the dataset will no longer be updated.

Authorizations to collect certain public health data expired at the end of the U.S. public health emergency declaration on May 11, 2023. The following jurisdictions discontinued COVID-19 case notifications to CDC: Iowa (11/8/21), Kansas (5/12/23), Kentucky (1/1/24), Louisiana (10/31/23), New Hampshire (5/23/23), and Oklahoma (5/2/23). Please note that these jurisdictions will not routinely send new case data after the dates indicated. As of 7/13/23, case notifications from Oregon will only include pediatric cases resulting in death.

This case surveillance public use dataset has 12 elements for all COVID-19 cases shared with CDC. It includes demographics, any exposure history, disease severity indicators and outcomes, and presence of any underlying medical conditions and risk behaviors. It contains no geographic data.

CDC has three COVID-19 case surveillance datasets:
COVID-19 Case Surveillance Public Use Data with Geography: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements)
COVID-19 Case Surveillance Public Use Data: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements)
COVID-19 Case Surveillance Restricted Access Detailed Data: Restricted access, patient-level dataset with clinical and symptom data, demographics, and state and county of residence. Access requires a registration process and a data use agreement. (33 data elements)
The following apply to all three datasets:
Data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf.
Data are considered provisional by CDC and are subject to change until the data are reconciled and verified with the state and territorial data providers.
Some data cells are suppressed to protect individual privacy.
The datasets will include all cases with the earliest date available in each record (date received by CDC or date related to illness/specimen collection) at least 14 days prior to the creation of the current datasets. This 14-day lag allows case reporting to be stabilized and ensures that time-dependent outcome data are accurately captured.
Datasets are updated monthly.
Datasets are created using CDC’s Policy on Public Health Research and Nonresearch Data Management and Access and include protections designed to protect individual privacy.
For more information about data collection and reporting, please see https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html.
For more information about the COVID-19 case surveillance data, please see https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html
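
The 14-day lag described in the list above can be read as a date filter: a record is included only if its earliest available date falls at least 14 days before the dataset was created. The sketch below illustrates that reading only. The field names (`received_by_cdc`, `illness_onset`, `specimen_collected`) are hypothetical, and dropping records that have no date at all is an assumption, not something the description states.

```python
from datetime import date, timedelta

LAG_DAYS = 14  # lag stated in the dataset description

# Hypothetical field names; the real column names may differ.
DATE_FIELDS = ("received_by_cdc", "illness_onset", "specimen_collected")


def earliest_date(record):
    """Return the earliest non-missing date in a record, or None."""
    present = [record.get(f) for f in DATE_FIELDS if record.get(f) is not None]
    return min(present) if present else None


def eligible_records(records, created_on):
    """Keep records whose earliest date is at least LAG_DAYS before creation."""
    cutoff = created_on - timedelta(days=LAG_DAYS)
    kept = []
    for record in records:
        first = earliest_date(record)
        # Assumption: records with no usable date are left out.
        if first is not None and first <= cutoff:
            kept.append(record)
    return kept
```

With a creation date of February 1, 2024, the cutoff is January 18, 2024, so a record whose earliest date is January 20 would wait for a later release.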

Overview

The COVID-19 case surveillance database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of Columbia (D.C.), as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the Nationally Notifiable Condition List and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (Interim-20-ID-01). CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification (Interim-20-ID-02). The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and reported voluntarily to CDC.

For more information: NNDSS Supports the COVID-19 Response | CDC.

The deidentified data in the “COVID-19 Case Surveillance Public Use Data” include demographic characteristics, any exposure history, disease severity indicators and outcomes, clinical data, laboratory diagnostic test results, and presence of any underlying medical conditions and risk behaviors. All data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf.

COVID-19 Case Reports

COVID-19 case reports have been routinely submitted using nationally standardized case reporting forms. On April 5, 2020, CSTE released an Interim Position Statement with national surveillance case definitions for COVID-19 included. Current versions of these case definitions are available here: https://ndc.services.cdc.gov/case-definitions/coronavirus-disease-2019-2021/.

All cases reported on or after were requested to be shared by public health departments to CDC using the standardized case definitions for laboratory-confirmed or probable cases. On May 5, 2020, the standardized case reporting form was revised. Case reporting using this new form is ongoing among U.S. states and territories.

Data are Considered Provisional

The COVID-19 case surveillance data are dynamic; case reports can be modified at any time by the jurisdictions sharing COVID-19 data with CDC. CDC may update prior cases shared with CDC based on any updated information from jurisdictions. For instance, as new information is gathered about previously reported cases, health departments provide updated data to CDC. As more information and data become available, analyses might find changes in surveillance data and trends during a previously reported time window. Data may also be shared late with CDC due to the volume of COVID-19 cases.
Annual finalized data: To create the final NNDSS data used in the annual tables, CDC works carefully with the reporting jurisdictions to reconcile the data received during the year until each state or territorial epidemiologist confirms that the data from their area are correct.
Access Addressing Gaps in Public Health Reporting of Race and Ethnicity for COVID-19, a report from the Council of State and Territorial Epidemiologists, to better understand the challenges in completing race and ethnicity data for COVID-19 and recommendations for improvement.

Data Limitations

To learn more about the limitations in using case surveillance data, visit FAQ: COVID-19 Data and Surveillance.

Data Quality Assurance Procedures

CDC’s Case Surveillance Section routinely performs data quality assurance procedures (i.e., ongoing corrections and logic checks to address data errors). To date, the following data cleaning steps have been implemented:
Questions that have been left unanswered (blank) on the case report form are reclassified to a Missing value, if applicable to the question. For example, in the question “Was the individual hospitalized?” where the possible answer choices include “Yes,” “No,” or “Unknown,” the blank value is recoded to Missing because the case report form did not include a response to the question.
Logic checks are performed for date data. If an illogical date has been provided, CDC reviews the data with the reporting jurisdiction. For example, if a symptom onset date in the future is reported to CDC, this value is set to null until the reporting jurisdiction updates the date appropriately.
Additional data quality processing to recode free text data is ongoing. Data on symptoms, race and ethnicity, and healthcare worker status have been prioritized.
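
The first two cleaning steps above are simple enough to sketch in code. This is a minimal illustration of the described rules, not CDC's actual pipeline; the function names and the use of `None` to represent a null date are assumptions.

```python
from datetime import date

MISSING = "Missing"  # value used for unanswered questions


def recode_blank(answer):
    """Recode an unanswered (blank) question to the Missing value."""
    if answer is None or answer.strip() == "":
        return MISSING
    return answer


def null_future_date(onset, today):
    """Set a symptom onset date in the future to None until it is corrected."""
    if onset is not None and onset > today:
        return None
    return onset
```

Note that an explicit "Unknown" answer is left as is: only a blank response becomes Missing, which preserves the distinction the description draws between the two.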

Data Suppression

To prevent release of data that could be used to identify people, data cells are suppressed for low frequency (<5) records and indirect identifiers (e.g., date of first positive specimen). Suppression includes rare combinations of demographic characteristics (sex, age group, race/ethnicity). Suppressed values are re-coded to the NA answer option; records with data suppression are never removed.
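
As a rough illustration of the low-frequency rule, the sketch below counts each combination of demographic characteristics and recodes those fields to NA when the combination appears fewer than five times. It is a simplified reading of the description: the field names are hypothetical, and CDC's actual procedure may apply additional rules that are not shown here.

```python
from collections import Counter

THRESHOLD = 5       # combinations seen fewer than 5 times are suppressed
SUPPRESSED = "NA"   # suppressed values are recoded, records are kept

# Hypothetical field names for the demographic characteristics.
KEYS = ("sex", "age_group", "race_ethnicity")


def suppress_rare_combinations(records):
    """Return copies of the records with rare demographic combinations set to NA."""
    counts = Counter(tuple(r[k] for k in KEYS) for r in records)
    result = []
    for r in records:
        combo = tuple(r[k] for k in KEYS)
        if counts[combo] < THRESHOLD:
            # Recode only the identifying fields; the record itself stays.
            r = {**r, **{k: SUPPRESSED for k in KEYS}}
        result.append(r)
    return result
```

The output always has the same number of records as the input, matching the statement that records with data suppression are never removed.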

For questions, please contact Ask SRRG (eocevent394@cdc.gov).

Additional COVID-19 Data

COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths by state and by county. These
Weekly United States COVID-19 Hospitalization Metrics by County (Historical)...
data.virginia.gov
healthdata.gov
+1more
csv, json, rdf, xsl
Updated Feb 23, 2025
Cite
Centers for Disease Control and Prevention (2025). Weekly United States COVID-19 Hospitalization Metrics by County (Historical) – ARCHIVED [Dataset]. https://data.virginia.gov/dataset/weekly-united-states-covid-19-hospitalization-metrics-by-county-historical-archived
Available download formats: rdf, json, xsl, csv
Dataset updated
Feb 23, 2025
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Area covered
United States
Description
Note: After May 3, 2024, this dataset will no longer be updated because hospitals are no longer required to report data on COVID-19 hospital admissions, hospital capacity, or occupancy data to HHS through CDC’s National Healthcare Safety Network (NHSN). The related CDC COVID Data Tracker site was revised or retired on May 10, 2023.

Note: May 3, 2024: Due to incomplete or missing hospital data received for the April 21, 2024 through April 27, 2024 reporting period, the COVID-19 Hospital Admissions Level could not be calculated for CNMI and will be reported as “NA” or “Not Available” in the COVID-19 Hospital Admissions Level data released on May 3, 2024.

This dataset represents COVID-19 hospitalization data and metrics aggregated to county or county-equivalent, for all counties or county-equivalents (including territories) in the United States as of the initial date of reporting for each weekly metric. COVID-19 hospitalization data are reported to CDC’s National Healthcare Safety Network, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN and included in this dataset represent aggregated counts and include metrics capturing information specific to COVID-19 hospital admissions, and inpatient and ICU bed capacity occupancy.

Reporting information:
As of December 15, 2022, COVID-19 hospital data are required to be reported to NHSN, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN represent aggregated counts and include metrics capturing information specific to hospital capacity, occupancy, hospitalizations, and admissions. Prior to December 15, 2022, hospitals reported data directly to the U.S. Department of Health and Human Services (HHS) or via a state submission for collection in the HHS Unified Hospital Data Surveillance System (UHDSS).
While CDC reviews these data for errors and corrects those found, some reporting errors might still exist within the data. To minimize errors and inconsistencies in data reported, CDC removes outliers before calculating the metrics. CDC and partners work with reporters to correct these errors and update the data in subsequent weeks.
Many hospital subtypes, including acute care and critical access hospitals, as well as Veterans Administration, Defense Health Agency, and Indian Health Service hospitals, are included in the metric calculations provided in this report. Psychiatric, rehabilitation, and religious non-medical hospital types are excluded from calculations.
Data are aggregated and displayed for hospitals with the same Centers for Medicare and Medicaid Services (CMS) Certification Number (CCN), which are assigned by CMS to counties based on the CMS Provider of Services files.

Full details on COVID-19 hospital data reporting guidance can be found here: https://www.hhs.gov/sites/default/files/covid-19-faqs-hospitals-hospital-laboratory-acute-care-facility-data-reporting.pdf
Calculation of county-level hospital metrics:
County-level hospital data are derived using calculations performed at the Health Service Area (HSA) level. An HSA is defined by CDC’s National Center for Health Statistics as a geographic area containing at least one county which is self-contained with respect to the population’s provision of routine hospital care. Every county in the United States is assigned to an HSA, and each HSA must contain at least one hospital. Therefore, use of HSAs in the calculation of local hospital metrics allows for more accurate characterization of the relationship between health care utilization and health status at the local level.
Data presented at the county-level represent admissions, hosp
cdc_clinical_trials
kaggle.com
zip
Updated Apr 2, 2020
Cite
Andy White (2020). cdc_clinical_trials [Dataset]. https://www.kaggle.com/ajrwhite/cdc-clinical-trials
Available download formats: zip (113426 bytes)
Dataset updated
Apr 2, 2020
Authors
Andy White
Description
Context

The CDC keeps a register of clinical trials. This dataset contains all clinical trials relating to Covid-19 that are stored in the CDC's register.

Content

Fields include study name, interventions being tested, the study type, and the status of the study. Please comment here or on the Notebook if you would like more fields added.

Acknowledgements

Thank you to @savannareid for pointing me towards the website for this data.
CDC - BRFSS Survey Data 2024
kaggle.com
zip
Updated Nov 5, 2025
Cite
Rudrita Rahman (2025). CDC - BRFSS Survey Data 2024 [Dataset]. https://www.kaggle.com/datasets/rudritarahman/cdc-brfss-survey-data-2024
Available download formats: zip (160243325 bytes)
Dataset updated
Nov 5, 2025
Authors
Rudrita Rahman
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
Behavioral Risk Factor Surveillance System (BRFSS) 2024

Overview

The Behavioral Risk Factor Surveillance System (BRFSS) is the nation's premier system of health-related telephone surveys that collect uniform, state-specific data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services.

The objective of the BRFSS is to gather consistent, state-level data on preventive health practices and risk behaviors associated with chronic diseases, injuries, and preventable infectious diseases among adults (aged 18 and older).

Established in 1984 with 15 states, the BRFSS now collects data in all 50 states, the District of Columbia, and three U.S. territories. The system completes more than 400,000 adult interviews each year, making it the largest continuously conducted health survey system in the world.

2024 Data Notes

The 2024 BRFSS dataset continues to use the raking weighting methodology (introduced in 2011) and includes both landline and cellphone-only respondents, ensuring more accurate representation of the U.S. adult population.

The aggregate dataset combines landline and cell phone data collected in 2024 from 49 states, the District of Columbia, Guam, Puerto Rico, and the U.S. Virgin Islands.

This original dataset contains responses from 457,670 individuals and has 301 features. These features are either questions directly asked of participants, or calculated variables based on individual participant responses.

⚠️ Note: Tennessee was unable to collect enough responses to meet inclusion requirements for 2024 and is not included in this public dataset.

Certain survey questions and responses have been modified or omitted to comply with federal data policies in effect during the 2024 collection period. As a result, some variables may contain missing values or appear inconsistent due to questions that were removed or restructured.

Data Collection

Data are collected from a random sample of adults (one per household) via telephone interviews.

Factors assessed include:
- Tobacco use
- Health care access and coverage
- Alcohol consumption
- Physical activity and diet
- HIV/AIDS knowledge and prevention
- Chronic health conditions
- Preventive health services and screenings

Content

The annual dataset contains 301 variables, covering both core questions and optional modules. Please refer to the official BRFSS 2024 Codebook for detailed variable definitions and coding.

This dataset contains 3 files:
1. brfss_survey_data_2024.csv # Dataset in .csv format (converted from SAS)
2. codebook_2024.HTML # CDC codebook for variable definitions
3. main_data_brfss_2024.XPT # Main dataset

⚙️ Note: The CSV file was converted from the original SAS format using pandas. Minor conversion artifacts may exist.

A complete description of each column in the CSV file can be found in the codebook.

Source & Acknowledgements

Data provided by the U.S. Centers for Disease Control and Prevention (CDC).

Original source and additional years of BRFSS data: CDC BRFSS Annual Data

Citation:

Centers for Disease Control and Prevention (CDC). Behavioral Risk Factor Surveillance System Survey Data. Atlanta, Georgia: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, 2024.

License: Public Domain (U.S. Government Work)

Suggested Citation (for Kaggle users)

If you use this dataset in your analysis or publication, please cite as:

Behavioral Risk Factor Surveillance System (BRFSS) 2024. U.S. Centers for Disease Control and Prevention (CDC). Public Domain.

Prepared for Kaggle public dataset publication. All data are in the public domain as U.S. Government works.
Weekly United States COVID-19 Hospitalization Metrics by Jurisdiction –...
data.cdc.gov
data.virginia.gov
+1more
csv, xlsx, xml
Updated Jan 17, 2025
Cite
CDC Division of Healthcare Quality Promotion (DHQP) Surveillance Branch, National Healthcare Safety Network (NHSN) (2025). Weekly United States COVID-19 Hospitalization Metrics by Jurisdiction – ARCHIVED [Dataset]. https://data.cdc.gov/Public-Health-Surveillance/Weekly-United-States-COVID-19-Hospitalization-Metr/7dk4-g6vg
Available download formats: xml, xlsx, csv
Dataset updated
Jan 17, 2025
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Authors
CDC Division of Healthcare Quality Promotion (DHQP) Surveillance Branch, National Healthcare Safety Network (NHSN)
License
https://www.usa.gov/government-works
Area covered
United States
Description
Note: After May 3, 2024, this dataset will no longer be updated because hospitals are no longer required to report data on COVID-19 hospital admissions, hospital capacity, or occupancy data to HHS through CDC’s National Healthcare Safety Network (NHSN). The related CDC COVID Data Tracker site was revised or retired on May 10, 2023.

This dataset represents weekly COVID-19 hospitalization data and metrics aggregated to national, state/territory, and regional levels. COVID-19 hospitalization data are reported to CDC’s National Healthcare Safety Network, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN and included in this dataset represent aggregated counts and include metrics capturing information specific to COVID-19 hospital admissions, and inpatient and ICU bed capacity occupancy.

Reporting information:
As of December 15, 2022, COVID-19 hospital data are required to be reported to NHSN, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN represent aggregated counts and include metrics capturing information specific to hospital capacity, occupancy, hospitalizations, and admissions. Prior to December 15, 2022, hospitals reported data directly to the U.S. Department of Health and Human Services (HHS) or via a state submission for collection in the HHS Unified Hospital Data Surveillance System (UHDSS).
While CDC reviews these data for errors and corrects those found, some reporting errors might still exist within the data. To minimize errors and inconsistencies in data reported, CDC removes outliers before calculating the metrics. CDC and partners work with reporters to correct these errors and update the data in subsequent weeks.
Many hospital subtypes, including acute care and critical access hospitals, as well as Veterans Administration, Defense Health Agency, and Indian Health Service hospitals, are included in the metric calculations provided in this report. Psychiatric, rehabilitation, and religious non-medical hospital types are excluded from calculations.
Data are aggregated and displayed for hospitals with the same Centers for Medicare and Medicaid Services (CMS) Certification Number (CCN), which are assigned by CMS to counties based on the CMS Provider of Services files.
Full details on COVID-19 hospital data reporting guidance can be found here: https://www.hhs.gov/sites/default/files/covid-19-faqs-hospitals-hospital-laboratory-acute-care-facility-data-reporting.pdf

Metric details:
Time Period: timeseries data will update weekly on Mondays as soon as they are reviewed and verified, usually before 8 pm ET. Updates will occur the following day when reporting coincides with a federal holiday. Note: Weekly updates might be delayed due to delays in reporting. All data are provisional. Because these provisional counts are subject to change, including updates to data reported previously, adjustments can occur. Data may be updated since original publication due to delays in reporting (to account for data received after a given Thursday publication) or data quality corrections.
New COVID-19 Hospital Admissions (count): Number of new admissions of patients with laboratory-confirmed COVID-19 in the previous week (including both adult and pediatric admissions) in the entire jurisdiction.
New COVID-19 Hospital Admissions (7-Day Average): 7-day average of new admissions of patients with laboratory-confirmed COVID-19 in the previous week (including both adult and pediatric admissions) in the entire jurisdiction.
Cumulative COVID-19 Hospital Admissions: Cumulative total number of admissions of patients with laboratory-confirmed COVID-19 (including both adult and pediatric admissions) in the entire jurisdiction since August 1, 2020.
Cumulative COVID-19 Hospital Admissions Rate: Cumulative total number of admissions of patients with laboratory-confirmed COVID-19 (including both adult and pediatric admissions) in the entire jurisdiction since August 1, 2020 divided by 2019 intercensal population estimate for that jurisdiction multiplied by 100,000.
New COVID-19 Hospital Admissions Rate (7-day average) percent change from prior week: Percent change in the 7-day average new admissions of patients with laboratory-confirmed COVID-19 per 100,000 population compared with the prior week.
New COVID-19 Hospital Admissions (7-Day Total): 7-day total number of new admissions of patients with laboratory-confirmed COVID-19 (including both adult and pediatric admissions) in the entire jurisdiction.
New COVID-19 Hospital Admissions Rate (7-Day Total): 7-day total number of new admissions of patients with laboratory-confirmed COVID-19 (including both adult and pediatric admissions) for the entire jurisdiction divided by 2019 intercensal population estimate for that jurisdiction multiplied by 100,000.
Total Hospitalized COVID-19 Patients: 7-day total number of patients currently hospitalized with laboratory-confirmed COVID-19 (including both adult and pediatric patients) for the entire jurisdiction.
Total Hospitalized COVID-19 Patients (7-Day Average): 7-day average of the number of patients currently hospitalized with laboratory-confirmed COVID-19 (including both adult and pediatric patients) for the entire jurisdiction.
COVID-19 Inpatient Bed Occupancy (7-Day Average): Percentage of all staffed inpatient beds occupied by patients with laboratory-confirmed COVID-19 (including both adult and pediatric patients) within the entire jurisdiction is calculated as an average of valid daily values within the past 7 days (e.g., if only three valid values, the average of those three is taken). Averages are separately calculated for the daily numerators (patients hospitalized with confirmed COVID-19) and denominators (staffed inpatient beds). The average percentage can then be taken as the ratio of these two values for the entire jurisdiction.
COVID-19 Inpatient Bed Occupancy absolute change from prior week: The absolute change in the percent of staffed inpatient beds occupied by patients with laboratory-confirmed COVID-19 represents the week-over-week absolute difference between the 7-day average occupancy of patients with confirmed COVID-19 in staffed inpatient beds in the past 7 days, compared with the prior week, in the entire jurisdiction.
COVID-19 ICU Bed Occupancy (7-Day Average): Percentage of all staffed adult ICU beds occupied by adult patients with confirmed COVID-19 within the entire jurisdiction, calculated as an average of valid daily values within the past 7 days (e.g., if only three valid values, the average of those three is taken). Averages are separately calculated for the daily numerators (adult patients hospitalized with confirmed COVID-19) and denominators (staffed adult ICU beds). The average percentage can then be taken as the ratio of these two values for the entire jurisdiction.
COVID-19 ICU Bed Occupancy absolute change from prior week: The absolute change in the percent of staffed ICU beds occupied by patients with laboratory-confirmed COVID-19 represents the week-over-week absolute difference between the average occupancy of patients with confirmed COVID-19 in staffed adult ICU beds for the past 7 days, compared with the prior week, in the entire jurisdiction.
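
Several of the metric definitions above reduce to short formulas: a rate per 100,000 population, a percent change from the prior week, and an occupancy percentage taken as the ratio of separately averaged numerators and denominators. The sketch below illustrates those formulas under stated assumptions. The function names are hypothetical, invalid daily values are represented as `None`, and returning `None` when there is nothing to divide by is a choice made here, not something the description specifies.

```python
def rate_per_100k(count, population):
    """Admissions divided by the population estimate, multiplied by 100,000."""
    return count / population * 100_000


def percent_change(current, prior):
    """Percent change of the current value compared with the prior week."""
    if prior == 0:
        return None  # assumption: undefined when the prior week is zero
    return (current - prior) / prior * 100


def occupancy_percent(daily_patients, daily_beds):
    """Occupancy as the ratio of two averages of valid daily values.

    Numerators and denominators are averaged separately over whichever
    days have valid (non-None) values, then divided.
    """
    valid_patients = [v for v in daily_patients if v is not None]
    valid_beds = [v for v in daily_beds if v is not None]
    if not valid_patients or not valid_beds:
        return None
    avg_patients = sum(valid_patients) / len(valid_patients)
    avg_beds = sum(valid_beds) / len(valid_beds)
    if avg_beds == 0:
        return None
    return avg_patients / avg_beds * 100
```

The absolute change metrics would then be the difference between two `occupancy_percent` results, one for the past 7 days and one for the prior week.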

Note: October 27, 2023: Due to a data processing error, reported values for avg_percent_inpatient_beds_occupied_covid_confirmed will appear lower than previously reported values by an average difference of less than 1%. Therefore, previously reported values for avg_percent_inpatient_beds_occupied_covid_confirmed may have been overestimated and should be interpreted with caution.

October 27, 2023: Due to a data processing error, reported values for abs_chg_avg_percent_inpatient_beds_occupied_covid_confirmed will differ from previously reported values by an average absolute difference of less than 1%. Therefore, previously reported values for abs_chg_avg_percent_inpatient_beds_occupied_covid_confirmed should be interpreted with caution.

December 29, 2023: Hospitalization data reported to CDC’s National Healthcare Safety Network (NHSN) through December 23, 2023, should be interpreted with caution due to potential reporting delays caused by the Christmas and New Year’s holidays. As a result, metrics including new hospital admissions for COVID-19 and influenza and hospital occupancy may be underestimated for the week ending December 23, 2023.

January 5, 2024: Hospitalization data reported to CDC’s National Healthcare Safety Network (NHSN) through December 30, 2023, should be interpreted with caution due to potential reporting delays caused by the Christmas and New Year’s holidays. As a result, metrics including new hospital admissions for COVID-19 and influenza and hospital occupancy may be underestimated for the week ending December 30, 2023.
COVID-19 Dashboard
catalog.data.gov
healthdata.gov
+2more
Updated Oct 23, 2025
Cite
California Department of Public Health (2025). COVID-19 Dashboard [Dataset]. https://catalog.data.gov/dataset/covid-19-dashboard
Dataset updated
Oct 23, 2025
Dataset provided by
California Department of Public Health (https://www.cdph.ca.gov/)
Description
The dashboard is updated each Friday.

Laboratory surveillance data: California laboratories report SARS-CoV-2 test results to CDPH through electronic laboratory reporting. Los Angeles County SARS-CoV-2 lab data has a 7-day reporting lag. Test positivity is calculated using SARS-CoV-2 lab tests that have a specimen collection date reported during a given week. Specimens for testing are collected from patients in healthcare settings and do not reflect all testing for COVID-19 in California. Test positivity for a given week is calculated by dividing the number of positive COVID-19 results by the total number of specimens tested for that virus. Weekly laboratory surveillance data are defined as Sunday through Saturday.

Hospitalization data: Data on COVID-19 and influenza hospital admissions are from Centers for Disease Control and Prevention’s (CDC) National Healthcare Safety Network (NHSN) Hospitalization dataset. The requirement to report COVID-19-associated hospitalizations was effective November 1, 2024. CDPH pulls NHSN data from the CDC on the Wednesday prior to the publication of the report. Results may differ depending on which day data are pulled. Admission rates are calculated using population estimates from the P-3: Complete State and County Projections Dataset (https://dof.ca.gov/forecasting/demographics/projections/) provided by the State of California Department of Finance. Reported weekly admission rates for the entire season use the population estimates for the year the season started. For more information on NHSN data including the protocol and data collection information, see the CDC NHSN webpage (https://www.cdc.gov/nhsn/index.html). Weekly hospitalization data are defined as Sunday through Saturday.

Death certificate data: CDPH receives weekly year-to-date dynamic data on deaths occurring in California from the CDPH Center for Health Statistics and Informatics. These data are limited to deaths occurring among California residents and are analyzed to identify COVID-19-coded deaths. These deaths are not necessarily laboratory-confirmed and are an underestimate of all COVID-19-associated deaths in California. Weekly death data are defined as Sunday through Saturday.
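
The test positivity calculation and the Sunday-through-Saturday week definition described above can be sketched as follows. This is an illustration only: the function names are hypothetical, and returning `None` for a week with no specimens tested is an assumption.

```python
from datetime import date, timedelta


def week_start(day):
    """Return the Sunday that begins the Sunday-through-Saturday week of `day`."""
    # date.weekday() is 0 for Monday and 6 for Sunday.
    days_since_sunday = (day.weekday() + 1) % 7
    return day - timedelta(days=days_since_sunday)


def positivity_percent(positive_results, specimens_tested):
    """Positive results divided by total specimens tested, as a percentage."""
    if specimens_tested == 0:
        return None  # assumption: undefined when nothing was tested
    return positive_results / specimens_tested * 100
```

Grouping specimens by `week_start` of their collection date, then applying `positivity_percent` to each group, would give one value per reporting week.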
Pregnancy Risk Data - PRAMS 2007 (CDC)
kaggle.com
zip
Updated Jan 31, 2024
Cite
Emily "Ernest" Pierron (2024). Pregnancy Risk Data - PRAMS 2007 (CDC) [Dataset]. https://www.kaggle.com/datasets/emilybekapierron/pregnancy-risk-data-prams-2007-cdc
Available download formats: zip (26849754 bytes)
Dataset updated
Jan 31, 2024
Authors
Emily "Ernest" Pierron
License
https://creativecommons.org/publicdomain/zero/1.0/
Description
This is the raw CSV file pulled from the CDC Data website. The original source lists the following description:

"2007. Centers for Disease Control and Prevention (CDC). PRAMS, the Pregnancy Risk Assessment Monitoring System, is a surveillance system collecting state-specific, population-based data on maternal attitudes and experiences before, during, and shortly after pregnancy. It is a collaborative project of the Centers for Disease Control and Prevention (CDC) and state health departments. PRAMS provides data for state health officials to use to improve the health of mothers and infants. PRAMS topics include abuse, alcohol use, contraception, breastfeeding, mental health, morbidity, obesity, preconception health, pregnancy history, prenatal-care, sleep behavior, smoke exposure, stress, tobacco use, WIC, Medicaid, infant health, and unintended pregnancy. Data will be updated annually as it becomes available."
CDC COVID-19 Community Levels by County
opendata.ramseycountymn.gov
csv, xlsx, xml
Updated Dec 2, 2025
+ more versions
Cite
Center for Disease Control and Prevention (2025). CDC COVID-19 Community Levels by County [Dataset]. https://opendata.ramseycountymn.gov/Public-Health/CDC-COVID-19-Community-Levels-by-County/uazb-iwdp
Explore at:
Available download formats: csv, xlsx, xml
Dataset updated
Dec 2, 2025
Dataset authored and provided by
Center for Disease Control and Prevention
License
https://www.usa.gov/government-works
Description
This public use dataset has 11 data elements reflecting United States COVID-19 community levels for all available counties. This dataset contains the same values used to display information available on the COVID Data Tracker at: https://covid.cdc.gov/covid-data-tracker/#county-view?list_select_state=all_states&list_select_county=all_counties&data-type=CommunityLevels The data are updated weekly.

CDC looks at the combination of three metrics (new COVID-19 admissions per 100,000 population in the past 7 days, the percent of staffed inpatient beds occupied by COVID-19 patients, and total new COVID-19 cases per 100,000 population in the past 7 days) to determine the COVID-19 community level. The COVID-19 community level is determined by the higher of the new admissions and inpatient beds metrics, based on the current level of new cases per 100,000 population in the past 7 days. New COVID-19 admissions and the percent of staffed inpatient beds occupied represent the current potential for strain on the health system. Data on new cases act as an early warning indicator of potential increases in health system strain in the event of a COVID-19 surge. Using these data, the COVID-19 community level is classified as low, medium, or high. COVID-19 Community Levels can help communities and individuals make decisions based on their local context and their unique needs. Community vaccination coverage and other local information, such as early alerts from wastewater surveillance or the number of emergency department visits for COVID-19, when available, can also inform decision making for health officials and individuals.
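The "higher of the two metrics, conditioned on the case rate" rule can be sketched as below. The description above does not state the numeric cut-offs, so every threshold in this sketch (200 cases per 100,000; 10 and 20 admissions per 100,000; 10% and 15% of beds) is an assumption recalled from CDC's published community-levels table and should be checked against the CDC page linked in this record before any use. The function name is hypothetical.

```python
LEVELS = ["low", "medium", "high"]


def community_level(cases_per_100k, admissions_per_100k, pct_beds_covid):
    """Classify a county as low/medium/high.

    All numeric thresholds are ASSUMED values (not given in the dataset
    description); verify them against CDC's community-levels documentation.
    """
    if cases_per_100k < 200:
        # Lower case burden: each indicator can be low, medium, or high.
        adm = 0 if admissions_per_100k < 10 else 1 if admissions_per_100k < 20 else 2
        beds = 0 if pct_beds_covid < 10 else 1 if pct_beds_covid < 15 else 2
    else:
        # Higher case burden: the lowest possible level is medium.
        adm = 1 if admissions_per_100k < 10 else 2
        beds = 1 if pct_beds_covid < 10 else 2
    # The community level is the higher of the two indicator levels.
    return LEVELS[max(adm, beds)]


example = community_level(cases_per_100k=150, admissions_per_100k=12, pct_beds_covid=4)
```

Taking `max` over the two indicator levels is what makes the case rate act as an early-warning modifier: it shifts which level each hospital indicator maps to, without being a level of its own.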

See https://www.cdc.gov/coronavirus/2019-ncov/science/community-levels.html for more information.

For the most accurate and up-to-date data for any county or state, visit the relevant health department website. COVID Data Tracker may display data that differ from state and local websites. This can be due to differences in how data were collected, how metrics were calculated, or the timing of web updates.

For more details on the Minnesota Department of Health COVID-19 thresholds, see COVID-19 Public Health Risk Measures: Data Notes (Updated 4/13/22). https://mn.gov/covid19/assets/phri_tcm1148-434773.pdf

Note: This dataset was renamed from "United States COVID-19 Community Levels by County as Originally Posted" to "United States COVID-19 Community Levels by County" on March 31, 2022.
March 31, 2022: Column name for county population was changed to “county_population”. No change was made to the data points previously released.
March 31, 2022: New column, “health_service_area_population”, was added to the dataset to denote the total population in the designated Health Service Area based on the 2019 Census estimate.
March 31, 2022: FIPS codes for territories American Samoa, Guam, Commonwealth of the Northern Mariana Islands, and United States Virgin Islands were re-formatted to 5-digit numeric for records released on 3/3/2022 to be consistent with other records in the dataset.
March 31, 2022: Changes were made to the text fields in variables “county”, “state”, and “health_service_area” so the formats are consistent across releases.
March 31, 2022: The “%” sign was removed from the text field in column “covid_inpatient_bed_utilization”. No change was made to the data. As indicated in the column description, values in this column represent the percentage of staffed inpatient beds occupied by COVID-19 patients (7-day average).
March 31, 2022: Data values for columns “county_population”, “health_service_area_number”, and “health_service_area” were backfilled for records released on 2/24/2022. These columns were added starting the week of 3/3/2022, so the values were previously missing for records released the week prior.
April 7, 2022: Updates made to data released on 3/24/2022 for Guam, Commonwealth of the Northern Mariana Islands, and United States Virgin Islands to correct a data mapping error.
Archive: COVID-19 LTC Program Vaccinations and Trends in the United States,...
healthdata.gov
datahub.hhs.gov
+2more
csv, xlsx, xml
Updated Oct 8, 2021
Cite
data.cdc.gov (2021). Archive: COVID-19 LTC Program Vaccinations and Trends in the United States, Jurisdiction [Dataset]. https://healthdata.gov/CDC/Archive-COVID-19-LTC-Program-Vaccinations-and-Tren/yazb-x9fi
Explore at:
Available download formats: xml, csv, xlsx
Dataset updated
Oct 8, 2021
Dataset provided by
data.cdc.gov
Area covered
United States
Description
The Federal Pharmacy Partnership for Long-Term Care (LTC) Program was a partnership between CDC and CVS, Walgreens, and Managed Health Care Associates, Inc. The program offered on-site COVID-19 vaccination services for residents of nursing homes and assisted living facilities. The Federal Pharmacy Partnership for LTC Program was in effect from the time vaccines became available until April 23, 2021. This is the historical archived data related to the LTC Program and represents data that were shown on COVID Data Tracker through September 30, 2021. Twelve variables that provided data on residents and staff vaccinated through the program were removed from the COVID-19 Vaccinations in the United States, Jurisdiction dataset. LTC was removed as an option from the location variable in the following datasets: COVID-19 Vaccinations in the United States, Jurisdiction and COVID-19 Vaccination Trends in the United States, National and Jurisdictional.
Participant demographics.
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jul 5, 2023
Cite
Leib Litman; Zohn Rosen; Rachel Hartman; Cheskie Rosenzweig; Sarah L. Weinberger-Litman; Aaron J. Moss; Jonathan Robinson (2023). Participant demographics. [Dataset]. http://doi.org/10.1371/journal.pone.0287837.t001
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0287837.t001
Dataset updated
Jul 5, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Leib Litman; Zohn Rosen; Rachel Hartman; Cheskie Rosenzweig; Sarah L. Weinberger-Litman; Aaron J. Moss; Jonathan Robinson
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Survey respondents who are non-attentive, respond randomly, or misrepresent who they are can impact the outcomes of surveys. Prior findings reported by the CDC have suggested that people engaged in highly dangerous cleaning practices during the COVID-19 pandemic, including ingesting household cleaners such as bleach. In our attempts to replicate the CDC’s results, we found that 100% of reported ingestions of household cleaners were made by problematic respondents. Once inattentive, acquiescent, and careless respondents are removed from the sample, we find no evidence that people ingested cleaning products to prevent a COVID-19 infection. These findings have important implications for public health and medical survey research, as well as for best practices for avoiding problematic respondents in all survey research conducted online.
Respiratory Virus Weekly Report
catalog.data.gov
data.chhs.ca.gov
+2more
Updated Sep 23, 2025
+ more versions
Cite
California Department of Public Health (2025). Respiratory Virus Weekly Report [Dataset]. https://catalog.data.gov/dataset/respiratory-virus-weekly-report-b5321
Explore at:
Dataset updated
Sep 23, 2025
Dataset provided by
California Department of Public Health (https://www.cdph.ca.gov/)
Description
Data are from the California Department of Public Health (CDPH) Respiratory Virus Weekly Report. The report is updated each Friday.

Laboratory surveillance data: California laboratories report SARS-CoV-2 test results to CDPH through electronic laboratory reporting. Los Angeles County SARS-CoV-2 lab data have a 7-day reporting lag. Test positivity is calculated using SARS-CoV-2 lab tests that have a specimen collection date reported during a given week. Laboratory surveillance for influenza, respiratory syncytial virus (RSV), and other respiratory viruses (parainfluenza types 1-4, human metapneumovirus, non-SARS-CoV-2 coronaviruses, adenovirus, enterovirus/rhinovirus) involves the use of data from clinical sentinel laboratories (hospital, academic or private) located throughout California. Specimens for testing are collected from patients in healthcare settings and do not reflect all testing for influenza, respiratory syncytial virus, and other respiratory viruses in California. These laboratories report the number of laboratory-confirmed influenza, respiratory syncytial virus, and other respiratory virus detections and isolations, and the total number of specimens tested by virus type on a weekly basis. Test positivity for a given week is calculated by dividing the number of positive COVID-19, influenza, RSV, or other respiratory virus results by the total number of specimens tested for that virus. Weekly laboratory surveillance data are defined as Sunday through Saturday.

Hospitalization data: Data on COVID-19 and influenza hospital admissions are from the Centers for Disease Control and Prevention’s (CDC) National Healthcare Safety Network (NHSN) Hospitalization dataset. The requirement to report COVID-19 and influenza-associated hospitalizations was effective November 1, 2024. CDPH pulls NHSN data from the CDC on the Wednesday prior to the publication of the report. Results may differ depending on which day data are pulled. Admission rates are calculated using population estimates from the P-3: Complete State and County Projections Dataset provided by the State of California Department of Finance (https://dof.ca.gov/forecasting/demographics/projections/). Reported weekly admission rates for the entire season use the population estimates for the year the season started. For more information on NHSN data, including the protocol and data collection information, see the CDC NHSN webpage (https://www.cdc.gov/nhsn/index.html). CDPH collaborates with Northern California Kaiser Permanente (NCKP) to monitor trends in RSV admissions. The percentage of RSV admissions is calculated by dividing the number of RSV-related admissions by the total number of admissions during the same period. Admissions for pregnancy, labor and delivery, birth, and outpatient procedures are not included in the total number of admissions. These admissions serve as a proxy for RSV activity and do not necessarily represent laboratory-confirmed hospitalizations for RSV infections; NCKP members are not representative of all Californians. Weekly hospitalization data are defined as Sunday through Saturday.

Death certificate data: CDPH receives weekly year-to-date dynamic data on deaths occurring in California from the CDPH Center for Health Statistics and Informatics. These data are limited to deaths occurring among California residents and are analyzed to identify influenza, respiratory syncytial virus, and COVID-19-coded deaths. These deaths are not necessarily laboratory-confirmed and are an underestimate of all influenza, respiratory syncytial virus, and COVID-19-associated deaths in California. Weekly death data are defined as Sunday through Saturday.

Wastewater data: This dataset represents statewide weekly SARS-CoV-2 wastewater summary values. SARS-CoV-2 wastewater concentrations from all sites in California are combined into a single, statewide, unit-less summary value for each week, using a method for data transformation and aggregation developed by the CDC National Wastewater Surveillance System (NWSS). Please see the CDC NWSS data methods page for a description of how these summary values are calculated. Weekly wastewater data are defined as Sunday through Saturday.
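The two hospitalization calculations in the description (an admission rate using the population of the year the season started, and the RSV share of admissions) might look something like the sketch below. The function names, the population lookup, and all numbers are hypothetical placeholders and are not Department of Finance or NCKP figures.

```python
def admission_rate_per_100k(admissions: int, population: int) -> float:
    """Weekly admissions per 100,000 population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return 100_000 * admissions / population


def rsv_admission_percent(rsv_admissions: int, total_admissions: int) -> float:
    """RSV-related admissions as a percentage of all admissions in the same period.

    total_admissions is assumed to already exclude pregnancy, labor and
    delivery, birth, and outpatient-procedure admissions, as the description requires.
    """
    if total_admissions <= 0:
        raise ValueError("total_admissions must be positive")
    return 100 * rsv_admissions / total_admissions


# Placeholder population estimates keyed by the year the season started (made up).
population_by_season_start = {2024: 39_000_000}

# Every week of the 2024-25 season uses the 2024 estimate, including weeks in 2025.
rate = admission_rate_per_100k(390, population_by_season_start[2024])
rsv_pct = rsv_admission_percent(25, 1_000)
```

Keying the population by season start year keeps rates comparable across a season that spans two calendar years, which appears to be the intent of the rule stated in the description.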
Centers for Disease Control and Prevention, Division of Healthcare Quality...
opendata.ramseycountymn.gov
csv, xlsx, xml
Updated Nov 20, 2025
Cite
CDC Division of Healthcare Quality Promotion (DHQP) Surveillance Branch, National Healthcare Safety Network (NHSN) (2025). Centers for Disease Control and Prevention, Division of Healthcare Quality Promotion, National Healthcare Safety Network, Weekly United States COVID-19 Hospitalization Metrics - Ramsey County [Dataset]. https://opendata.ramseycountymn.gov/w/5mvu-4mt4/cjij-g4h4?cur=wCPAmhgX7ip
Explore at:
Available download formats: csv, xml, xlsx
Dataset updated
Nov 20, 2025
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Authors
CDC Division of Healthcare Quality Promotion (DHQP) Surveillance Branch, National Healthcare Safety Network (NHSN)
License
U.S. Government Works (https://www.usa.gov/government-works)
License information was derived automatically
Area covered
Ramsey County, United States
Description
Note: This dataset has been limited to show metrics for Ramsey County, Minnesota.

This dataset represents COVID-19 hospitalization data and metrics aggregated to county or county-equivalent, for all counties or county-equivalents (including territories) in the United States. COVID-19 hospitalization data are reported to CDC’s National Healthcare Safety Network, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN and included in this dataset represent aggregated counts and include metrics capturing information specific to COVID-19 hospital admissions and to inpatient and ICU bed capacity and occupancy.

Reporting information: As of December 15, 2022, COVID-19 hospital data are required to be reported to NHSN, which monitors national and local trends in healthcare system stress, capacity, and community disease levels for approximately 6,000 hospitals in the United States. Data reported by hospitals to NHSN represent aggregated counts and include metrics capturing information specific to hospital capacity, occupancy, hospitalizations, and admissions. Prior to December 15, 2022, hospitals reported data directly to the U.S. Department of Health and Human Services (HHS) or via a state submission for collection in the HHS Unified Hospital Data Surveillance System (UHDSS). While CDC reviews these data for errors and corrects those found, some reporting errors might still exist within the data. To minimize errors and inconsistencies in data reported, CDC removes outliers before calculating the metrics. CDC and partners work with reporters to correct these errors and update the data in subsequent weeks. Many hospital subtypes, including acute care and critical access hospitals, as well as Veterans Administration, Defense Health Agency, and Indian Health Service hospitals, are included in the metric calculations provided in this report. Psychiatric, rehabilitation, and religious non-medical hospital types are excluded from calculations. Data are aggregated and displayed for hospitals with the same Centers for Medicare and Medicaid Services (CMS) Certification Number (CCN), which are assigned by CMS to counties based on the CMS Provider of Services files. Full details on COVID-19 hospital data reporting guidance can be found here: https://www.hhs.gov/sites/default/files/covid-19-faqs-hospitals-hospital-laboratory-acute-care-facility-data-reporting.pdf

Calculation of county-level hospital metrics: County-level hospital data are derived using calculations performed at the Health Service Area (HSA) level. An HSA is defined by CDC’s National Center for Health Statistics as a geographic area containing at least one county which is self-contained with respect to the population’s provision of routine hospital care. Every county in the United States is assigned to an HSA, and each HSA must contain at least one hospital. Therefore, use of HSAs in the calculation of local hospital metrics allows for more accurate characterization of the relationship between health care utilization and health status at the local level. Data presented at the county level represent admissions, hospital inpatient and ICU bed capacity, and occupancy among hospitals within the selected HSA. Therefore, admissions, capacity, and occupancy are not limited to residents of the selected HSA. For all county-level hospital metrics listed below, the values are calculated first for the entire HSA, and the HSA-level value is then applied to each county within the HSA.
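As a rough illustration of "calculate for the HSA, then apply to each county", the sketch below computes an admissions rate per HSA and copies it to every member county. The HSA identifiers, county names, and counts are invented, and the function name is hypothetical; the real assignment of counties to HSAs comes from NCHS.

```python
def county_admission_rates(hsa_admissions, hsa_population, county_to_hsa):
    """Compute admissions per 100,000 for each HSA, then assign it to member counties."""
    hsa_rate = {
        hsa: 100_000 * hsa_admissions[hsa] / hsa_population[hsa]
        for hsa in hsa_admissions
    }
    # Every county inherits the value of the HSA it belongs to.
    return {county: hsa_rate[hsa] for county, hsa in county_to_hsa.items()}


# Made-up inputs for illustration only.
hsa_admissions = {"HSA-1": 30, "HSA-2": 5}
hsa_population = {"HSA-1": 600_000, "HSA-2": 50_000}
county_to_hsa = {"County A": "HSA-1", "County B": "HSA-1", "County C": "HSA-2"}

rates = county_admission_rates(hsa_admissions, hsa_population, county_to_hsa)
```

One consequence worth keeping in mind when reading the data: counties in the same HSA always show identical values, so differences between neighboring counties inside one HSA cannot be inferred from these metrics.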

Metric details:
Time period: Data for the previous MMWR week (Sunday-Saturday) will update weekly on Thursdays as soon as they are reviewed and verified, usually before 8 pm ET. Updates will occur the following day when reporting coincides with a federal holiday. Note: Weekly updates might be delayed due to delays in reporting. All data are provisional. Because these provisional counts are subject to change, including updates to data reported previously, adjustments can occur. Data may be updated since original publication due to delays in reporting (to account for data received after a given Thursday publication) or data quality corrections.
New hospital admissions (count): Total number of admissions of patients with laboratory-confirmed COVID-19 in the previous week (including both adult and pediatric admissions) in the entire jurisdiction.
New Hospital Admissions Rate Value (Admissions per 100k): Total number of new admissions of patients with laboratory-confirmed COVID-19 in the past week (including both adult and pediatric admissions) for the entire jurisdiction, divided by the 2019 intercensal population estimate for that jurisdiction and multiplied by 100,000. (Note: This metric is used to determine each county’s COVID-19 Hospital Admissions Level for a given week.)
New COVID-19 Hospital Admissions Rate Level: Qualitative value of new COVID-19 hospital admissions rate level [Low, Medium, High, Insufficient Data].
New hospital admissions percent change from prior week: Percent change in the current weekly total new admissions of patients with laboratory-confirmed COVID-19 per 100,000 population compared with the prior week.
New hospital admissions percent change from prior week level: Qualitative value of percent change in hospital admissions rate from prior week [Substantial decrease, Moderate decrease, Stable, Moderate increase, Substantial increase, Insufficient data].
COVID-19 Inpatient Bed Occupancy Value: Percentage of all staffed inpatient beds occupied by patients with laboratory-confirmed COVID-19 (including both adult and pediatric patients) within the entire jurisdiction, calculated as an average of valid daily values within the past week (e.g., if only three valid values, the average of those three is taken). Averages are separately calculated for the daily numerators (patients hospitalized with confirmed COVID-19) and denominators (staffed inpatient beds). The average percentage can then be taken as the ratio of these two values for the entire jurisdiction.
COVID-19 Inpatient Bed Occupancy Level: Qualitative value of inpatient beds occupied by COVID-19 patients level [Minimal, Low, Moderate, Substantial, High, Insufficient data].
COVID-19 Inpatient Bed Occupancy percent change from prior week: The absolute change in the percent of staffed inpatient beds occupied by patients with laboratory-confirmed COVID-19, that is, the week-over-week absolute difference between the average occupancy of patients with confirmed COVID-19 in staffed inpatient beds in the past week and in the prior week, in the entire jurisdiction.
COVID-19 ICU Bed Occupancy Value: Percentage of all staffed adult ICU beds occupied by adult patients with confirmed COVID-19 within the entire jurisdiction, calculated as an average of valid daily values within the past week (e.g., if only three valid values, the average of those three is taken). Averages are separately calculated for the daily numerators (adult patients hospitalized with confirmed COVID-19) and denominators (staffed adult ICU beds). The average percentage can then be taken as the ratio of these two values for the entire jurisdiction.
COVID-19 ICU Bed Occupancy Level: Qualitative value of ICU beds occupied by COVID-19 patients level [Minimal, Low, Moderate, Substantial, High, Insufficient data].
COVID-19 ICU Bed Occupancy percent change from prior week: The absolute change in the percent of staffed ICU beds occupied by patients with laboratory-confirmed COVID-19, that is, the week-over-week absolute difference between the average occupancy of patients with confirmed COVID-19 in staffed adult ICU beds for the past week and for the prior week, in the entire jurisdiction.
For all metrics, if there are no data in the specified locality for a given week, the metric value is displayed as “insufficient data”.
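The occupancy metrics above are a ratio of two separately averaged series, which is not the same as averaging daily percentages. A minimal sketch follows; it assumes that an invalid daily value is represented as `None` (the description does not say how validity is encoded), and the function names and numbers are hypothetical.

```python
from statistics import mean


def weekly_occupancy_percent(daily_covid_patients, daily_staffed_beds):
    """Ratio of the average valid numerator to the average valid denominator, as a percent.

    Invalid days are assumed to be marked with None. Returns None when either
    series has no valid value, mirroring the "insufficient data" case.
    """
    numerators = [n for n in daily_covid_patients if n is not None]
    denominators = [d for d in daily_staffed_beds if d is not None]
    if not numerators or not denominators:
        return None
    avg_beds = mean(denominators)
    if avg_beds == 0:
        return None
    return 100.0 * mean(numerators) / avg_beds


def absolute_change(current_percent, prior_percent):
    """Week-over-week difference in percentage points, not a relative percent change."""
    return current_percent - prior_percent


# Made-up week: only three valid patient counts, six valid bed counts.
patients = [10, None, 20, 30, None, None, None]
beds = [100, 100, None, 100, 100, 100, 100]

occupancy = weekly_occupancy_percent(patients, beds)
change = absolute_change(occupancy, 15.0)
```

Because the numerator and denominator are averaged over their own valid days, a day with a missing patient count still contributes its bed count, which is one reading of "averages are separately calculated" in the description.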
MD COVID-19 - Total Vaccinations Statewide
opendata.maryland.gov
healthdata.gov
+1more
csv, xlsx, xml
Updated Apr 21, 2023
+ more versions
Cite
Maryland Department of Health Prevention and Health Promotion Administration, MDH PHPA (2023). MD COVID-19 - Total Vaccinations Statewide [Dataset]. https://opendata.maryland.gov/Health-and-Human-Services/MD-COVID-19-Total-Vaccinations-Statewide/6j26-vi5v
Explore at:
Available download formats: csv, xlsx, xml
Dataset updated
Apr 21, 2023
Dataset authored and provided by
Maryland Department of Health Prevention and Health Promotion Administration, MDH PHPA
License
U.S. Government Works (https://www.usa.gov/government-works)
License information was derived automatically
Area covered
Maryland
Description
Regarding all Vaccination Data: The date of last update is 4/21/2023. Additionally, on 4/27/2023 several COVID-19 datasets were retired and are no longer included in public COVID-19 data dissemination.

For more information, see https://imap.maryland.gov/pages/covid-data

Summary: The cumulative number of COVID-19 vaccinations in Maryland: first dose, second dose, single dose, total vaccinations.

Description: The MD COVID-19 - Total Vaccinations Statewide data layer is a collection of the statewide COVID-19 vaccinations that have been reported each day into ImmuNet. Doses administered also account for doses of vaccine provided to the District of Columbia to vaccinate Maryland residents who work in DC.

Terms of Use: The Spatial Data, and the information therein, (collectively the "Data") is provided "as is" without warranty of any kind, either expressed, implied, or statutory. The user assumes the entire risk as to quality and performance of the Data. No guarantee of accuracy is granted, nor is any responsibility for reliance thereon assumed. In no event shall the State of Maryland be liable for direct, indirect, incidental, consequential or special damages of any kind. The State of Maryland does not accept liability for any damages or misrepresentation caused by inaccuracies in the Data or as a result of changes to the Data, nor is there responsibility assumed to maintain the Data in any manner or form. The Data can be freely distributed as long as the metadata entry is not modified or deleted. Any data derived from the Data must acknowledge the State of Maryland in the metadata.
United States COVID-19 Community Levels by County
datalumos.org
healthdata.gov
+2more
delimited
Updated Oct 16, 2025
+ more versions
Cite
United States Department of Health and Human Services. Centers for Disease Control and Prevention (2025). United States COVID-19 Community Levels by County [Dataset]. http://doi.org/10.3886/E238954V1
Explore at:
Available download formats: delimited
Unique identifier
https://doi.org/10.3886/E238954V1
Dataset updated
Oct 16, 2025
Dataset provided by
United States Department of Health and Human Services (http://www.hhs.gov/)
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Authors
United States Department of Health and Human Services. Centers for Disease Control and Prevention
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Description
Reporting of Aggregate Case and Death Count data was discontinued May 11, 2023, with the expiration of the COVID-19 public health emergency declaration. Although these data will continue to be publicly available, this dataset will no longer be updated.

This archived public use dataset has 11 data elements reflecting United States COVID-19 community levels for all available counties.

The COVID-19 community levels were developed using a combination of three metrics: new COVID-19 admissions per 100,000 population in the past 7 days, the percent of staffed inpatient beds occupied by COVID-19 patients, and total new COVID-19 cases per 100,000 population in the past 7 days. The COVID-19 community level was determined by the higher of the new admissions and inpatient beds metrics, based on the current level of new cases per 100,000 population in the past 7 days. New COVID-19 admissions and the percent of staffed inpatient beds occupied represent the current potential for strain on the health system. Data on new cases act as an early warning indicator of potential increases in health system strain in the event of a COVID-19 surge.

Using these data, the COVID-19 community level was classified as low, medium, or high.

COVID-19 Community Levels were used to help communities and individuals make decisions based on their local context and their unique needs. Community vaccination coverage and other local information, such as early alerts from wastewater surveillance or the number of emergency department visits for COVID-19, when available, can also inform decision making for health officials and individuals.

For the most accurate and up-to-date data for any county or state, visit the relevant health department website. COVID Data Tracker may display data that differ from state and local websites. This can be due to differences in how data were collected, how metrics were calculated, or the timing of web updates.

Archived Data Notes:
This dataset was renamed from "United States COVID-19 Community Levels by County as Originally Posted" to "United States COVID-19 Community Levels by County" on March 31, 2022.
March 31, 2022: Column name for county population was changed to “county_population”. No change was made to the data points previously released.
March 31, 2022: New column, “health_service_area_population”, was added to the dataset to denote the total population in the designated Health Service Area based on the 2019 Census estimate.
March 31, 2022: FIPS codes for territories American Samoa, Guam, Commonwealth of the Northern Mariana Islands, and United States Virgin Islands were re-formatted to 5-digit numeric for records released on 3/3/2022 to be consistent with other records in the dataset.
March 31, 2022: Changes were made to the text fields in variables “county”, “state”, and “health_service_area” so the formats are consistent across releases.
March 31, 2022: The “%” sign was removed from the text field in column “covid_inpatient_bed_utilization”. No change was made to the data. As indicated in the column description, values in this column represent the percentage of staffed inpatient beds occupied by COVID-19 patients (7-day average).
March 31, 2022: Data values for columns “county_population”, “health_service_area_number”, and “health_service_area” were backfilled for records released on 2/24/2022. These columns were added starting the week of 3/3/2022, so the values were previously missing for records released the week prior.
April 7, 2022: Updates made to data released on 3/24/2022 for Guam, Commonwealth of the Northern Mariana Islands, and United States Virgin Islands to correct a data mapping error.
April 21, 2022: COVID-19 Community Level (CCL) data released for counties in Nebraska for the week of April 21, 2022 have 3 counties identified in the high category and 37 in the medium category. CDC has been working with state officials to verify the data submitted, as other data systems are not providing alerts for substantial increases in disease transmission or severity in the state.
May 26, 2022: COVID-19 Community Level (CCL) data released for McCracken County, KY for the week of May 5, 2022 have been updated to correct a data processing error. McCracken County, KY should have appeared in the low community level category during the week of May 5, 2022. This correction is reflect
NCHS - Teen Birth Rates for Age Group 15-19 in the United States by County
catalog.data.gov
healthdata.gov
+4more
Updated Mar 16, 2022
+ more versions
Cite
Centers for Disease Control and Prevention (2022). NCHS - Teen Birth Rates for Age Group 15-19 in the United States by County [Dataset]. https://catalog.data.gov/dataset/nchs-teen-birth-rates-for-age-group-15-19-in-the-united-states-by-county
Explore at:
Dataset updated
Mar 16, 2022
Dataset provided by
Centers for Disease Control and Prevention (http://www.cdc.gov/)
Area covered
United States
Description
This data set contains estimated teen birth rates for age group 15–19 (expressed per 1,000 females aged 15–19) by county and year.

DEFINITIONS

Estimated teen birth rate: Model-based estimates of teen birth rates for age group 15–19 (expressed per 1,000 females aged 15–19) for a specific county and year. Estimated county teen birth rates were obtained using the methods described elsewhere (1,2,3,4). These annual county-level teen birth estimates “borrow strength” across counties and years to generate accurate estimates where data are sparse due to small population size (1,2,3,4). The inferential method uses information, including the estimated teen birth rates from neighboring counties across years and the associated explanatory variables, to provide a stable estimate of the county teen birth rate.

Median teen birth rate: The middle value of the estimated teen birth rates for the age group 15–19 for counties in a state.

Bayesian credible intervals: A range of values within which there is a 95% probability that the actual teen birth rate will fall, based on the observed teen births data and the model.

NOTES

Data on the number of live births for women aged 15–19 years were extracted from the National Center for Health Statistics’ (NCHS) National Vital Statistics System birth data files for 2003–2015 (5). Population estimates were extracted from the files containing intercensal and postcensal bridged-race population estimates provided by NCHS. For each year, the July population estimates were used, with the exception of the year of the decennial census, 2010, for which the April estimates were used.

Hierarchical Bayesian space–time models were used to generate hierarchical Bayesian estimates of county teen birth rates for each year during 2003–2015 (1,2,3,4). The Bayesian analogue of the frequentist confidence interval is defined as the Bayesian credible interval. A $100(1-\alpha)\%$ Bayesian credible interval for an unknown parameter vector $\theta$ and observed data vector $y$ is a subset $C$ of parameter space $\Phi$ such that $1-\alpha \le P(C \mid y) = \int_C p(\theta \mid y)\,d\theta$, where integration is performed over the set $C$ and is replaced by summation for discrete components of $\theta$. The probability that $\theta$ lies in $C$ given the observed data $y$ is at least $1-\alpha$ (6).

County borders in Alaska changed, and new counties were formed and others were merged, during 2003–2015. These changes were reflected in the population files but not in the natality files. For this reason, two counties in Alaska were collapsed so that the birth and population counts were comparable. Additionally, Kalawao County, a remote island county in Hawaii, recorded no births, and census estimates indicated a denominator of 0 (i.e., no females between the ages of 15 and 19 years residing in the county from 2003 through 2015). For this reason, Kalawao County was removed from the analysis. Also, Bedford City, Virginia, was added to Bedford County in 2015 and no longer appears in the mortality file in 2015. For consistency, Bedford City was merged with Bedford County, Virginia, for the entire 2003–2015 period. Final analysis was conducted on 3,137 counties for each year from 2003 through 2015. County boundaries are consistent with the vintage 2005–2007 bridged-race population file geographies (7).

SOURCES

National Center for Health Statistics. Vital statistics data available online, Natality all-county files. Hyattsville, MD. Published annually. For details about file release and access policy, see NCHS data release and access policy for micro-data and compressed vital statistics files, available from: http://www.cdc.gov/nchs/nvss/dvs_data_release.htm. For natality public-use files, see vital statistics data available online, available from: https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm.

National Center for Health Statistics. U.S. Census populations with bridged race categories. Estimated population data available. Postcensal and intercensal files. Hyattsville, MD
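
As a rough illustration of the credible-interval definition in the notes, the sketch below computes an equal-tailed 95% interval from posterior draws. It assumes a simple conjugate Gamma-Poisson model for a single county, which is a stand-in and not the hierarchical space-time model the description refers to. The prior parameters, birth count, and population are hypothetical.

```python
import random


def equal_tailed_interval(samples, alpha=0.05):
    """Return the empirical (alpha/2, 1 - alpha/2) quantiles of the samples."""
    ordered = sorted(samples)
    n = len(ordered)
    lower = ordered[int((alpha / 2) * n)]
    upper = ordered[int((1 - alpha / 2) * n) - 1]
    return lower, upper


def posterior_rate_draws(births, population, prior_shape=1.0, prior_rate=0.01,
                         n_draws=20000, seed=0):
    """Draw from the posterior of a birth rate expressed per 1,000 females.

    Hypothetical single-county model (an assumption, not the source's method):
    births ~ Poisson(rate * population / 1000), rate ~ Gamma(shape, rate).
    Conjugacy gives a Gamma posterior with updated shape and rate.
    """
    rng = random.Random(seed)
    shape = prior_shape + births
    rate = prior_rate + population / 1000.0
    # random.gammavariate takes (shape, scale), so scale = 1 / rate.
    return [rng.gammavariate(shape, 1.0 / rate) for _ in range(n_draws)]


# Hypothetical county: 45 births among 1,800 females aged 15-19.
draws = posterior_rate_draws(births=45, population=1800)
low, high = equal_tailed_interval(draws)
posterior_mean = sum(draws) / len(draws)
```

Here `(low, high)` plays the role of the set C in the definition: about 95% of the posterior draws for the rate fall inside it.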
CDC SVI SOCIOECONOMIC TRANSPORTATION FACTORS, 2018
chi-phi-nmcdc.opendata.arcgis.com
Updated Apr 22, 2021
Cite
New Mexico Community Data Collaborative (2021). CDC SVI SOCIOECONOMIC TRANSPORTATION FACTORS, 2018 [Dataset]. https://chi-phi-nmcdc.opendata.arcgis.com/maps/246f529af9bb4b46ad6f042fc5ac5c14
Explore at:
Dataset updated
Apr 22, 2021
Dataset authored and provided by
New Mexico Community Data Collaborative
Area covered

Description
NMCDC Copy of Living Atlas map. Source: https://www.arcgis.com/home/item.html?id=23ab8028f1784de4b0810104cd5d1c8f
Illustration by Brian Breneman

This layer shows population broken down by race and Hispanic origin. This is shown by tract, county, and state boundaries. This service is updated annually to contain the most currently released American Community Survey (ACS) 5-year data, and contains estimates and margins of error. There are also additional calculated attributes related to this topic, which can be mapped or used within analysis. This layer is symbolized to show the predominant race living within an area. To see the full list of attributes available in this service, go to the "Data" tab, and choose "Fields" at the top right.

Current Vintage: 2013-2017
ACS Table(s): B03002 (Not all lines of this ACS table are available in this feature layer.)
Data downloaded from: Census Bureau's API for American Community Survey
Date of API call: December 7, 2018
National Figures: American Fact Finder

The United States Census Bureau's American Community Survey (ACS): About the Survey, Geography & ACS, Technical Documentation, News & Updates

This ready-to-use layer can be used within ArcGIS Pro, ArcGIS Online, its configurable apps, dashboards, Story Maps, custom apps, and mobile apps. Data can also be exported for offline workflows. Please cite the Census and ACS when using this data.

Data Note from the Census:
Data are based on a sample and are subject to sampling variability. The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value. In addition to sampling variability, the ACS estimates are subject to nonsampling error (for a discussion of nonsampling variability, see Accuracy of the Data). The effect of nonsampling error is not represented in these tables.

Data Processing Notes:
This dataset is updated automatically when the most current vintage of ACS data is released each year. The service contains the ACS data as of the current vintage listed. Tabular data is updated annually with the Census Bureau's release schedule. This may alter data values, fields, and boundaries. Click here to learn more about ACS data releases.
Boundaries come from the US Census TIGER geodatabases. Boundaries are updated at the same time as the data updates (annually), and the boundary vintage appropriately matches the data vintage as specified by the Census. These are Census boundaries with water and/or coastlines clipped for cartographic purposes. For census tracts, the water cutouts are derived from a subset of the 2010 AWATER (Area Water) boundaries offered by TIGER. For state and county boundaries, the water and coastlines are derived from the coastlines of the 500k TIGER Cartographic Boundary Shapefiles. The original AWATER and ALAND fields are still available as attributes within the data table (units are square meters).
The States layer contains 52 records: all US states, Washington D.C., and Puerto Rico.
Census tracts with no population that occur in areas of water, such as oceans, are removed from this data service (Census Tracts beginning with 99).
Percentages and derived counts, and associated margins of error, are calculated values (that can be identified by the "_calc_" stub in the field name), and abide by the specifications defined by the American Community Survey.
Field alias names were created based on the Table Shells file available from the American Community Survey Summary File Documentation page.
Negative values (e.g., -555555...) have been set to null. These negative values exist in the raw API data to indicate the following situations:
The margin of error column indicates that either no sample observations or too few sample observations were available to compute a standard error and thus the margin of error. A statistical test is not appropriate.
Either no sample observations or too few sample observations were available to compute an estimate, or a ratio of medians cannot be calculated because one or both of the median estimates falls in the lowest interval or upper interval of an open-ended distribution.
The median falls in the lowest interval of an open-ended distribution, or in the upper interval of an open-ended distribution. A statistical test is not appropriate.
The estimate is controlled. A statistical test for sampling variability is not appropriate.
The data for this geographic area cannot be displayed because the number of sample cases is too small.
NOTE: any calculated percentages or counts that contain estimates that have null margins of error yield null margins of error for the calculated fields.
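
The margin-of-error and null-handling notes above can be sketched in a few lines. This is an illustrative sketch only: the function names are hypothetical, and treating every negative value as a sentinel is an assumption that suits count fields like these, where a legitimate value cannot be negative. The factor 1.645 is the multiplier the Census Bureau uses for 90 percent margins of error.

```python
Z_90 = 1.645  # multiplier behind the ACS 90 percent margin of error


def null_if_sentinel(value):
    """Map negative sentinel codes (e.g. -555555555) to None.

    Assumption: the field is a count, so any negative value is a sentinel.
    """
    if value is not None and value < 0:
        return None
    return value


def confidence_bounds(estimate, moe):
    """Return (lower, upper) 90 percent bounds, or None if an input is null."""
    if estimate is None or moe is None:
        return None
    return estimate - moe, estimate + moe


def standard_error(moe):
    """Convert a 90 percent margin of error to a standard error."""
    return None if moe is None else moe / Z_90


# Hypothetical tract: estimate of 1,200 people with a margin of error of 150.
bounds = confidence_bounds(1200, 150)
# A sentinel margin of error becomes null, so the bounds are null as well.
no_bounds = confidence_bounds(1200, null_if_sentinel(-555555555))
```

Returning `None` when either input is null mirrors the note that calculated fields built from null margins of error are themselves null.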
Data from: A Trans-Governmental Collaboration to Independently Evaluate...
data.niaid.nih.gov
immport.org
+1more
url
Updated Jan 30, 2025
Cite
NIAID SAVE Program (2025). A Trans-Governmental Collaboration to Independently Evaluate SARS-CoV-2 Serology Assays [Dataset]. http://doi.org/10.21430/M3SK8ANYK1
Explore at:
urlAvailable download formats
Unique identifier
https://doi.org/10.21430/M3SK8ANYK1
Dataset updated
Jan 30, 2025
Dataset provided by
National Institute of Allergy and Infectious Diseases (http://www.niaid.nih.gov/)
License
https://www.immport.org/agreement
Description
The emergence of SARS-CoV-2 created a crucial need for serology assays to detect anti-SARS-CoV-2 antibodies, which led to many serology assays entering the market. A trans-government collaboration was created in April 2020 to independently evaluate the performance of commercial SARS-CoV-2 serology assays and help inform U.S. Food and Drug Administration (FDA) regulatory decisions. To assess assay performance, three evaluation panels with similar antibody titer distributions were assembled. Each panel consisted of 110 samples with positive (n = 30) serum samples with a wide range of anti-SARS-CoV-2 antibody titers and negative (n = 80) plasma and/or serum samples that were collected before the start of the COVID-19 pandemic. Each sample was characterized for anti-SARS-CoV-2 antibodies against the spike protein using enzyme-linked immunosorbent assays (ELISA). Samples were selected for the panel when there was agreement on seropositivity by laboratories at National Cancer Institute's Frederick National Laboratory for Cancer Research (NCI-FNLCR) and Centers for Disease Control and Prevention (CDC). The sensitivity and specificity of each assay were assessed to determine Emergency Use Authorization (EUA) suitability. As of January 8, 2021, results from 91 evaluations were made publicly available (https://open.fda.gov/apis/device/covid19serology/, and https://www.cdc.gov/coronavirus/2019-ncov/covid-data/serology-surveillance/serology-test-evaluation.html). Sensitivity ranged from 27% to 100% for IgG (n = 81), from 10% to 100% for IgM (n = 74), and from 73% to 100% for total or pan-immunoglobulins (n = 5). The combined specificity ranged from 58% to 100% (n = 91). Approximately one-third (n = 27) of the assays evaluated are now authorized by FDA for emergency use. This collaboration established a framework for assay performance evaluation that could be used for future outbreaks and could serve as a model for other technologies. 
IMPORTANCE The SARS-CoV-2 pandemic created a crucial need for accurate serology assays to evaluate seroprevalence and antiviral immune responses. The initial flood of serology assays entering the market with inadequate performance emphasized the need for independent evaluation of commercial SARS-CoV-2 antibody assays using performance evaluation panels to determine suitability for use under EUA. Through a government-wide collaborative network, 91 commercial SARS-CoV-2 serology assay evaluations were performed. Three evaluation panels with similar overall antibody titer distributions were assembled to evaluate performance. Nearly one-third of the assays evaluated met acceptable performance recommendations, and two assays had EUAs revoked and were removed from the U.S. market based on inadequate performance. Data for all serology assays evaluated are available at the FDA and CDC websites (https://open.fda.gov/apis/device/covid19serology/, and https://www.cdc.gov/coronavirus/2019-ncov/covid-data/serology-surveillance/serology-test-evaluation.html).
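
For readers unfamiliar with the two metrics, the sketch below shows how sensitivity and specificity would be computed for one assay on a panel of the size described (30 positive and 80 negative samples). The assay counts are hypothetical and are not taken from any evaluated assay.

```python
PANEL_POSITIVES = 30  # positive samples per panel, as stated in the description
PANEL_NEGATIVES = 80  # negative samples per panel, as stated in the description


def sensitivity(true_positives, total_positives):
    """Fraction of truly positive samples the assay reports as positive."""
    return true_positives / total_positives


def specificity(true_negatives, total_negatives):
    """Fraction of truly negative samples the assay reports as negative."""
    return true_negatives / total_negatives


# Hypothetical assay: detects 27 of the 30 positives and
# correctly calls 78 of the 80 negatives.
sens = sensitivity(27, PANEL_POSITIVES)
spec = specificity(78, PANEL_NEGATIVES)
```

With these made-up counts the assay would have 90% sensitivity and 97.5% specificity.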
COVID-19 Deaths and cases by state
figshare.com
xlsx
Updated Feb 28, 2021
Cite
Jennifer Cohen; Yana van der Meulen Rodgers (2021). COVID-19 Deaths and cases by state [Dataset]. http://doi.org/10.6084/m9.figshare.12751850.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12751850.v1
Dataset updated
Feb 28, 2021
Dataset provided by
Figshare (http://figshare.com/)
Authors
Jennifer Cohen; Yana van der Meulen Rodgers
License
Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
COVID-19 confirmed cases and deaths by state as of July 28, 2020, from https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/cases-in-us.html and https://usafacts.org/visualizations/coronavirus-covid-19-spread-map
The state numbers listed by the CDC are aggregated from the USAFacts county data.
The CDC reports healthcare personnel cases and deaths (120,467 and 587 as of August 1, 2020; accessed August 2, 2020) but does not disaggregate the numbers by state.
Healthcare worker deaths by state as of July 28, 2020, were pulled from https://www.medscape.com/viewarticle/927976#vp_1
Opioid Epidemic Analysis by US County
kaggle.com
zip
Updated Jan 22, 2020
Cite
Andrew Eckberg (2020). Opioid Epidemic Analysis by US County [Dataset]. https://www.kaggle.com/ryanandreweckberg/opioid-crisis-by-interpersonal-relationships
Explore at:
zip(9119404 bytes)Available download formats
Dataset updated
Jan 22, 2020
Authors
Andrew Eckberg
Area covered
United States
Description
Opioid Data Description

Land Area of County: factfinder.census.gov, 2010 Census Summary. 1,890 counties are taken into consideration.

Year: 2011- 2017

Population: https://www.census.gov/data/datasets/time-series/demo/popest/2010s-counties-total.html#par_textimage_70769902 Annual Estimates of the Resident Population for Counties: April 1, 2010 to July 1, 2018

Death by Opioid Type: https://wonder.cdc.gov/ The mortality data are based on information from all death certificates filed in the fifty states. All sub-national data representing zero to nine (0-9) deaths are suppressed.

601 counties had the minimum mortality rate to be represented for analysis and were pulled from the WONDER database. The codes listed below are the ones the CDC recommends for identifying opioid deaths.

Type of death (from the CDC WONDER database):
T40.0 (Opium): no county exceeded 9 deaths per year, so this specific cause is suppressed in every county
T40.1 (Heroin)
T40.2 (Other opioids)
T40.3 (Methadone)
T40.4 (Other synthetic narcotics)

Type of death by county will not add up to total mortality, because low death counts for a county were withheld from the data to protect the privacy of individuals.

Non-US Born: factfinder.census.gov, American Community Survey 5-Year Estimates. The total number of non-US-born citizens who reside in each county.

Education: factfinder.census.gov, American Community Survey 5-Year Estimates. Categories consist of: Less Than High School Degree; Some College or Associate’s Degree; Bachelor’s Degree; Graduate or Professional Degree.

Income by Household: factfinder.census.gov, American Community Survey 5-Year Estimates. Income is given as the mean household income in that county.

Transportation: Percentage of the county that uses each of these means of transportation to get to work. American Community Survey 5-Year Estimates. Categories consist of: Commute Alone to Work by Driving; Carpool; Walk; Public Transit; Bike.

Unemployment Rate by county collected from: https://catalog.data.gov/dataset?tags=unemployment-rate

GDP by county in regards to funds spent on healthcare, education, and social assistance as well as overall GDP collected from: https://www.bea.gov/data/gdp/gdp-county-metro-and-other-areas
🍎 US Nutrition & Obesity Data (BRFSS 2011–2023)
kaggle.com
zip
Updated Aug 28, 2025
Cite
Pinar Topuz (2025). 🍎 US Nutrition & Obesity Data (BRFSS 2011–2023) [Dataset]. https://www.kaggle.com/datasets/pinuto/us-nutrition-and-obesity-data-brfss-20112023
Explore at:
zip(2412636 bytes)Available download formats
Dataset updated
Aug 28, 2025
Authors
Pinar Topuz
Description
📖 About Dataset

🌎 Overview

This dataset provides cleaned and structured information from the Behavioral Risk Factor Surveillance System (BRFSS) conducted by the CDC. It focuses on nutrition, physical activity, and obesity trends across U.S. states and national averages from 2011 to 2023.

The data originates from the Division of Nutrition, Physical Activity, and Obesity (DNPAO) and has been pre-processed to remove missing values, redundant columns, and inconsistencies, making it ready for analysis.

📊 Contents

The dataset contains 29 columns and over 106,000 rows of observations, including:

Year: Start and end years of data collection (2011–2023)

Location: State abbreviation, state name, and geographic coordinates

Class & Topic: High-level categories such as Obesity/Weight Status, Physical Activity, Fruits and Vegetables

Question: Specific health behavior measured (e.g., % of adults with BMI ≥30)

Data_Value: The main metric (percentage or proportion)

Confidence Intervals: Statistical lower and upper bounds

Sample Size: Number of participants

Demographics: Age, sex, income, education, race/ethnicity

✅ Cleaning Process

Removed fully empty columns (e.g., Total, Data_Value_Unit)

Imputed missing numeric values using median replacement

Categorical variables (Age, Sex, Education, Income, Race/Ethnicity) filled with Unknown

Dropped non-essential ID columns (ClassID, TopicID, etc.) to simplify analysis

Final dataset contains no missing values
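
The cleaning steps listed above could look roughly like the following sketch. It is not the dataset author's actual script: it works on a list of dicts with the standard library, uses `None` to mark a missing value, and the sample rows are hypothetical (only the column names `Data_Value` and `Total` come from the description).

```python
from statistics import median


def clean_rows(rows, numeric_cols, categorical_cols):
    """Return cleaned copies of rows (a list of dicts); None marks missing."""
    all_cols = {col for row in rows for col in row}
    # Step 1: find columns that are empty in every row.
    empty_cols = {c for c in all_cols if all(r.get(c) is None for r in rows)}
    # Step 2: median of the observed values for each numeric column.
    medians = {
        c: median(r[c] for r in rows if r.get(c) is not None)
        for c in numeric_cols
        if c not in empty_cols
    }
    cleaned = []
    for row in rows:
        new_row = {c: v for c, v in row.items() if c not in empty_cols}
        for c, value in medians.items():
            if new_row.get(c) is None:
                new_row[c] = value  # median imputation
        for c in categorical_cols:
            if c not in empty_cols and new_row.get(c) is None:
                new_row[c] = "Unknown"  # placeholder category
        cleaned.append(new_row)
    return cleaned


# Hypothetical rows for illustration.
rows = [
    {"Data_Value": 30.1, "Sex": "Female", "Total": None},
    {"Data_Value": None, "Sex": None, "Total": None},
    {"Data_Value": 28.5, "Sex": "Male", "Total": None},
]
cleaned = clean_rows(rows, numeric_cols=["Data_Value"], categorical_cols=["Sex"])
```

Median imputation removes missing values but also pulls imputed rows toward the center of the distribution, which is worth keeping in mind when modeling on the cleaned file.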

🎯 Use Cases

This dataset is highly valuable for:

Public Health Research: Tracking obesity and physical activity trends

Policy Evaluation: Comparing state-level health initiatives

Data Science & ML: Building predictive models on obesity & lifestyle behaviors

Visualization Projects: Heatmaps, time series, and demographic comparisons

📌 Example Questions You Can Answer

How have obesity rates changed from 2011–2023 across U.S. states?

Which states report the highest vs lowest physical activity levels?

What is the relationship between income, education, and obesity?

How do dietary habits (fruit & vegetable intake) correlate with weight status?

📂 File Information

File Name: Nutrition_Physical_Activity_Obesity_Clean.csv

Rows: 106,260

Columns: 29

Format: CSV (comma-separated)

🏛 Source

Centers for Disease Control and Prevention (CDC)

Division of Nutrition, Physical Activity, and Obesity (DNPAO)

Original dataset: Data.gov – BRFSS Nutrition, Physical Activity, and Obesity

💡 Citation

If you use this dataset in your work, please cite: Centers for Disease Control and Prevention (CDC). Behavioral Risk Factor Surveillance System (BRFSS), 2011–2023.

✨ This cleaned version was prepared for easy exploration, analysis, and machine learning applications on Kaggle.

Cite

CDC Data, Analytics and Visualization Task Force (2024). COVID-19 Case Surveillance Public Use Data [Dataset]. https://data.cdc.gov/widgets/vbim-akqf

COVID-19 Case Surveillance Public Use Data

Explore at:

xml, xlsx, csvAvailable download formats

Dataset updated

Jul 9, 2024

Dataset provided by

Centers for Disease Control and Prevention (http://www.cdc.gov/)

Authors

CDC Data, Analytics and Visualization Task Force

License

https://www.usa.gov/government-works

Description

Note: Reporting of new COVID-19 Case Surveillance data will be discontinued July 1, 2024, to align with the process of removing SARS-CoV-2 infections (COVID-19 cases) from the list of nationally notifiable diseases. Although these data will continue to be publicly available, the dataset will no longer be updated.

Authorizations to collect certain public health data expired at the end of the U.S. public health emergency declaration on May 11, 2023. The following jurisdictions discontinued COVID-19 case notifications to CDC: Iowa (11/8/21), Kansas (5/12/23), Kentucky (1/1/24), Louisiana (10/31/23), New Hampshire (5/23/23), and Oklahoma (5/2/23). Please note that these jurisdictions will not routinely send new case data after the dates indicated. As of 7/13/23, case notifications from Oregon will only include pediatric cases resulting in death.

This case surveillance public use dataset has 12 elements for all COVID-19 cases shared with CDC and includes demographics, any exposure history, disease severity indicators and outcomes, presence of any underlying medical conditions and risk behaviors, and no geographic data.

CDC has three COVID-19 case surveillance datasets:

COVID-19 Case Surveillance Public Use Data with Geography: Public use, patient-level dataset with clinical data (including symptoms), demographics, and county and state of residence. (19 data elements)
COVID-19 Case Surveillance Public Use Data: Public use, patient-level dataset with clinical and symptom data and demographics, with no geographic data. (12 data elements)
COVID-19 Case Surveillance Restricted Access Detailed Data: Restricted access, patient-level dataset with clinical and symptom data, demographics, and state and county of residence. Access requires a registration process and a data use agreement. (33 data elements)

The following apply to all three datasets:

Data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf.
Data are considered provisional by CDC and are subject to change until the data are reconciled and verified with the state and territorial data providers.
Some data cells are suppressed to protect individual privacy.
The datasets will include all cases with the earliest date available in each record (date received by CDC or date related to illness/specimen collection) at least 14 days prior to the creation of the current datasets. This 14-day lag allows case reporting to be stabilized and ensures that time-dependent outcome data are accurately captured.
Datasets are updated monthly.
Datasets are created using CDC’s Policy on Public Health Research and Nonresearch Data Management and Access and include protections designed to protect individual privacy.
For more information about data collection and reporting, please see https://www.cdc.gov/coronavirus/2019-ncov/covid-data/about-us-cases-deaths.html.
For more information about the COVID-19 case surveillance data, please see https://www.cdc.gov/coronavirus/2019-ncov/covid-data/faq-surveillance.html
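
The 14-day lag rule in the list above can be read as a simple date filter. The sketch below is an interpretation under stated assumptions, not CDC's pipeline: the record layout and the field name `earliest_date` are hypothetical.

```python
from datetime import date, timedelta

LAG_DAYS = 14  # lag stated in the dataset description


def eligible_records(records, creation_date):
    """Keep records whose earliest date is at least 14 days before creation."""
    cutoff = creation_date - timedelta(days=LAG_DAYS)
    return [r for r in records if r["earliest_date"] <= cutoff]


# Hypothetical records and dataset creation date.
records = [
    {"id": 1, "earliest_date": date(2024, 6, 17)},
    {"id": 2, "earliest_date": date(2024, 6, 18)},
]
kept = eligible_records(records, creation_date=date(2024, 7, 1))
```

With a creation date of July 1, 2024, the cutoff is June 17, 2024, so only the first record is kept.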

Overview

The COVID-19 case surveillance database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of Columbia (D.C.), as well as U.S. territories and affiliates. On April 5, 2020, COVID-19 was added to the Nationally Notifiable Condition List and classified as “immediately notifiable, urgent (within 24 hours)” by a Council of State and Territorial Epidemiologists (CSTE) Interim Position Statement (Interim-20-ID-01). CSTE updated the position statement on August 5, 2020, to clarify the interpretation of antigen detection tests and serologic test results within the case classification (Interim-20-ID-02). The statement also recommended that all states and territories enact laws to make COVID-19 reportable in their jurisdiction, and that jurisdictions conducting surveillance should submit case notifications to CDC. COVID-19 case surveillance data are collected by jurisdictions and reported voluntarily to CDC.

For more information: NNDSS Supports the COVID-19 Response | CDC.

The deidentified data in the “COVID-19 Case Surveillance Public Use Data” include demographic characteristics, any exposure history, disease severity indicators and outcomes, clinical data, laboratory diagnostic test results, and presence of any underlying medical conditions and risk behaviors. All data elements can be found on the COVID-19 case report form located at www.cdc.gov/coronavirus/2019-ncov/downloads/pui-form.pdf.

COVID-19 Case Reports

COVID-19 case reports have been routinely submitted using nationally standardized case reporting forms. On April 5, 2020, CSTE released an Interim Position Statement with national surveillance case definitions for COVID-19 included. Current versions of these case definitions are available here: https://ndc.services.cdc.gov/case-definitions/coronavirus-disease-2019-2021/.

All cases reported on or after were requested to be shared by public health departments to CDC using the standardized case definitions for laboratory-confirmed or probable cases. On May 5, 2020, the standardized case reporting form was revised. Case reporting using this new form is ongoing among U.S. states and territories.

Data are Considered Provisional

The COVID-19 case surveillance data are dynamic; case reports can be modified at any time by the jurisdictions sharing COVID-19 data with CDC. CDC may update prior cases shared with CDC based on any updated information from jurisdictions. For instance, as new information is gathered about previously reported cases, health departments provide updated data to CDC. As more information and data become available, analyses might find changes in surveillance data and trends during a previously reported time window. Data may also be shared late with CDC due to the volume of COVID-19 cases.
Annual finalized data: To create the final NNDSS data used in the annual tables, CDC works carefully with the reporting jurisdictions to reconcile the data received during the year until each state or territorial epidemiologist confirms that the data from their area are correct.
Access Addressing Gaps in Public Health Reporting of Race and Ethnicity for COVID-19, a report from the Council of State and Territorial Epidemiologists, to better understand the challenges in completing race and ethnicity data for COVID-19 and recommendations for improvement.

Data Limitations

To learn more about the limitations in using case surveillance data, visit FAQ: COVID-19 Data and Surveillance.

Data Quality Assurance Procedures

CDC’s Case Surveillance Section routinely performs data quality assurance procedures (i.e., ongoing corrections and logic checks to address data errors). To date, the following data cleaning steps have been implemented:

Questions that have been left unanswered (blank) on the case report form are reclassified to a Missing value, if applicable to the question. For example, in the question “Was the individual hospitalized?” where the possible answer choices include “Yes,” “No,” or “Unknown,” the blank value is recoded to Missing because the case report form did not include a response to the question.
Logic checks are performed for date data. If an illogical date has been provided, CDC reviews the data with the reporting jurisdiction. For example, if a symptom onset date in the future is reported to CDC, this value is set to null until the reporting jurisdiction updates the date appropriately.
Additional data quality processing to recode free text data is ongoing. Data on symptoms, race and ethnicity, and healthcare worker status have been prioritized.
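
Two of the cleaning steps above, recoding blanks and nulling illogical dates, can be sketched as small functions. The function names are hypothetical and the logic is a simplified reading of the description, not CDC's actual procedure (which also involves review with the reporting jurisdiction).

```python
from datetime import date


def recode_blank(answer):
    """Recode an unanswered (blank) question to 'Missing'."""
    if answer is None or answer.strip() == "":
        return "Missing"
    return answer


def null_future_date(onset, today):
    """Set a symptom onset date in the future to None until it is corrected."""
    if onset is not None and onset > today:
        return None
    return onset


# Hypothetical values for illustration.
today = date(2024, 7, 1)
hospitalized = recode_blank("")            # blank on the form
answered = recode_blank("Unknown")         # a real answer choice is kept
bad_onset = null_future_date(date(2025, 1, 1), today)
good_onset = null_future_date(date(2024, 6, 1), today)
```

Note that "Unknown" is a valid answer choice and stays as reported; only a blank response becomes "Missing".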

Data Suppression

To prevent release of data that could be used to identify people, data cells are suppressed for low frequency (<5) records and indirect identifiers (e.g., date of first positive specimen). Suppression includes rare combinations of demographic characteristics (sex, age group, race/ethnicity). Suppressed values are re-coded to the NA answer option; records with data suppression are never removed.
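
A simplified sketch of the low-frequency suppression idea follows. The description does not spell out CDC's algorithm, so this is an assumption-laden illustration: the field names are hypothetical, and a combination of sex, age group, and race/ethnicity is treated as rare when fewer than 5 records share it.

```python
from collections import Counter

QUASI_IDENTIFIERS = ("sex", "age_group", "race_ethnicity")  # hypothetical names
THRESHOLD = 5  # combinations seen fewer than 5 times are suppressed


def suppress_rare_combinations(records):
    """Recode rare demographic combinations to 'NA' without dropping records."""
    def key(record):
        return tuple(record[f] for f in QUASI_IDENTIFIERS)

    counts = Counter(key(r) for r in records)
    result = []
    for record in records:
        new_record = dict(record)  # leave the input untouched
        if counts[key(record)] < THRESHOLD:
            for field in QUASI_IDENTIFIERS:
                new_record[field] = "NA"
        result.append(new_record)
    return result


# Hypothetical data: five records share one combination, one record is unique.
common = {"sex": "Female", "age_group": "20-29", "race_ethnicity": "Unknown"}
rare = {"sex": "Male", "age_group": "80+", "race_ethnicity": "Unknown"}
records = [dict(common) for _ in range(5)] + [dict(rare)]
suppressed = suppress_rare_combinations(records)
```

The output has the same number of records as the input, matching the statement that records with suppressed values are never removed.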

For questions, please contact Ask SRRG (eocevent394@cdc.gov).

Additional COVID-19 Data

COVID-19 data are available to the public as summary or aggregate count files, including total counts of cases and deaths by state and by county. These
