10 datasets found
  1. COVID-19 Variant Data

    • kaggle.com
    zip
    Updated Mar 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nidhi Sharma (2023). COVID-19 Variant Data [Dataset]. https://www.kaggle.com/datasets/nidzsharma/covid-19-variant-data
    Explore at:
    zip(68659 bytes)Available download formats
    Dataset updated
    Mar 12, 2023
    Authors
    Nidhi Sharma
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The California Department of Public Health (CDPH) is identifying ​the prevalence of circulating SARS-CoV-2 variants by analysing ​CDPH Genomic Surveillance Data and ​CalREDIE, CDPH's communicable disease reporting and surveillance system. Viruses mutate into new strains or variants over time. Some variants emerge and then disappear. Other variants become common and circulate for a long time. Several specialized laboratories state-wide sequence the genomes of a fraction of all positive COVID-19 tests to determine which variants are circulating. Sequencing and reporting of variant results takes several days after a test is identified as a positive for COVID-19. Not all ​viruses from positive COVID-19 tests are ​sequenced. Knowing what variants are circulating in California informs public health and clinical action.

  2. COVID-19 Variant Data (ARCHIVED)

    • data.ca.gov
    • healthdata.gov
    • +4more
    csv, xlsx, zip
    Updated Nov 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). COVID-19 Variant Data (ARCHIVED) [Dataset]. https://data.ca.gov/dataset/covid-19-variant-data-archived
    Explore at:
    xlsx, csv, zipAvailable download formats
    Dataset updated
    Nov 7, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Note: This dataset is no longer being updated due to the end of the COVID-19 Public Health Emergency.

    The California Department of Public Health (CDPH) is identifying ​the prevalence of circulating SARS-CoV-2 variants by analyzing ​CDPH Genomic Surveillance Data and ​CalREDIE, CDPH's communicable disease reporting and surveillance system. Viruses mutate into new strains or variants over time. Some variants emerge and then disappear. Other variants become common and circulate for a long time. Several specialized laboratories statewide sequence the genomes of a fraction of all positive COVID-19 tests to determine which variants are circulating. Sequencing and reporting of variant results takes several days after a test is identified as a positive for COVID-19. Not all ​viruses from positive COVID-19 tests are ​sequenced. Knowing what variants are circulating in California informs public health and clinical action.

    Note: There is a natural reporting lag in these data due to the time commitment to complete whole genome sequencing; therefore, a 14 day lag is applied to these datasets to allow for data completeness. Please note that more recent data should be used with caution.

    For more information, please see: https://www.cdph.ca.gov/Programs/CID/DCDC/Pages/COVID-19/COVID-Variants.aspx

  3. CDPH-CalCAT Modeling Nowcasts and Forecasts for COVID-19 and Influenza

    • data.chhs.ca.gov
    • data.ca.gov
    • +2more
    csv, parquet, zip
    Updated Nov 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2025). CDPH-CalCAT Modeling Nowcasts and Forecasts for COVID-19 and Influenza [Dataset]. https://data.chhs.ca.gov/dataset/calcat
    Explore at:
    zip, csv(14373623), csv(679), csv(702), csv(2051609), csv(7696178), csv(649), parquet(183476911), zip(14740419)Available download formats
    Dataset updated
    Nov 28, 2025
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    This dataset includes three tables with the model-based projections and estimates as shown on CalCAT in 2025 (http://calcat.cdph.ca.gov) for California state, regions, and counties.

    (1) COVID-19 Nowcasts includes the R-effective estimates for COVID-19 from the different models available for the past 80 days from the archive date and the median ensemble thereof.

    (2) CalCAT Forecasts includes hospital census and admissions forecasts for COVID-19 and Influenza, and the corresponding ensemble metrics for a 4 week horizon from the archive date.

    (3) Variant Proportion Nowcasts contains the Integrated Genomic Epidemiology Dataset (IGED)-based and Terra-based estimates of COVID-19 variants circulating over the past 3 months as well as model-based predictions for the proportions of the variants of concern for dates leading up to the archive date. Prediction intervals are included when available.

    This dataset provides CalCAT users with programmatic access to the downloadable datasets on CalCAT.

    This dataset also includes a zipped file with the historical archives of the COVID-19 Nowcasts, CalCAT Forecasts and Variant Proportion Nowcasts through 2023.

  4. I

    SARS-CoV-2 Delta variant genomic variation associated with breakthrough...

    • immport.org
    • data.niaid.nih.gov
    • +1more
    url
    Updated May 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). SARS-CoV-2 Delta variant genomic variation associated with breakthrough infection in Northern California: A retrospective cohort study [Dataset]. http://doi.org/10.21430/M3CTYNVL3V
    Explore at:
    urlAvailable download formats
    Dataset updated
    May 17, 2023
    License

    https://www.immport.org/agreementhttps://www.immport.org/agreement

    Description

    To characterize the genomic variation within a circulating variant and identifying potential mutations associated with breakthrough infection among persons with Delta variant SARS-CoV-2 infection

  5. g

    CDPH-CalCAT Modeling Nowcasts and Forecasts for COVID-19 and Influenza |...

    • gimi9.com
    Updated Dec 12, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). CDPH-CalCAT Modeling Nowcasts and Forecasts for COVID-19 and Influenza | gimi9.com [Dataset]. https://gimi9.com/dataset/california_cdph-calcat-modeling-nowcasts-and-forecasts-for-covid-19-and-influenza/
    Explore at:
    Dataset updated
    Dec 12, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset includes three tables with the model-based projections and estimates as shown on CalCAT in 2024 (http://calcat.cdph.ca.gov) for California state, regions, and counties. (1) COVID-19 Nowcasts includes the R-effective estimates for COVID-19 from the different models available for the past 80 days from the archive date and the median ensemble thereof. (2) CalCAT Forecasts includes hospital census and admissions forecasts for COVID-19 and Influenza, ICU census forecasts for COVID-19, and the corresponding ensemble metrics for a 4 week horizon from the archive date. (3) Variant Proportion Nowcasts contains the Integrated Genomic Epidemiology Dataset (IGED)-based estimates of COVID-19 variants circulating over the past 3 months as well as model-based predictions for the proportions of the variants of concern for dates leading up to the archive date. Prediction intervals are included when available. This dataset provides CalCAT users with programmatic access to the downloadable datasets on CalCAT. This dataset also includes a zipped file with the historical archives of the COVID-19 Nowcasts, CalCAT Forecasts and Variant Proportion Nowcasts through 2023.

  6. I

    Data from: Risk of severe clinical outcomes among persons with SARS-CoV-2...

    • dev.immport.org
    • data.niaid.nih.gov
    • +1more
    url
    Updated Apr 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Risk of severe clinical outcomes among persons with SARS-CoV-2 infection with differing levels of vaccination during widespread Omicron (B.1.1.529) and Delta (B.1.617.2) variant circulation in Northern California: A retrospective cohort study [Dataset]. http://doi.org/10.21430/M3FXU7B7MZ
    Explore at:
    urlAvailable download formats
    Dataset updated
    Apr 18, 2023
    License

    https://www.immport.org/agreementhttps://www.immport.org/agreement

    Description

    To identify risk factors for severe clinical outcomes among persons with SARS-CoV-2 infection and persons with varying vaccination status for COVID-19 during periods of Omicron versus Delta variant circulation

  7. o

    The U.S. COVID-19 County Policy Database

    • openicpsr.org
    delimited
    Updated Sep 22, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rita Hamad; Mark Pletcher; Thomas Carton (2022). The U.S. COVID-19 County Policy Database [Dataset]. http://doi.org/10.3886/E180482V1
    Explore at:
    delimitedAvailable download formats
    Dataset updated
    Sep 22, 2022
    Dataset provided by
    University of California San Francisco
    Louisiana Public Health Institute
    Authors
    Rita Hamad; Mark Pletcher; Thomas Carton
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    LA 43.De Soto, TX 132.Bosque, LA 47.St. Tammany, CA 9.Del Norte, UT 162.Sanpete, CA 33.Sonoma, NY 91.Richmond, LA 45.Lincoln, UT 146.Morgan, LA 64.Hancock, United States
    Description

    It is increasingly recognized that policies have played a role in both alleviating and exacerbating the health and economic consequences of the COVID-19 pandemic. Yet there has been limited work to systematically evaluate the substantial variation in local COVID-19-related policies in the U.S. The objective of the U.S. COVID-19 County Policy (UCCP) Database is to systematically gather, characterize, and assess variation in U.S. county-level COVID-19-related policies. The current data upload represents the first wave of data collection, which includes data on over 20 policies gathered across 171 counties in 7 states during January-March 2021. These include county-level COVID-19-related policies within 3 policy domains that are likely to affect a variety of health outcomes: (1) containment/closure, (2) economic support, and (3) public health. In ongoing work, we are conducting retrospective longitudinal weekly data collection for the period 2020-2021 from a larger swath of 300+ U.S. counties in all 50 states and Washington D.C., and the current database will be updated with new data as it becomes available.

  8. D

    Understanding shared variation in SARS-CoV-2 genomes

    • datasetcatalog.nlm.nih.gov
    • data.niaid.nih.gov
    • +1more
    Updated Aug 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wyman, Stacia (2022). Understanding shared variation in SARS-CoV-2 genomes [Dataset]. http://doi.org/10.6078/D1JQ5C
    Explore at:
    Dataset updated
    Aug 28, 2022
    Authors
    Wyman, Stacia
    Description

    The project is a collaborative effort of investigators from the University of California, Berkeley’s Innovative Genomics Institute (IGI) and School of Public Health (SPH); Kaiser Permanente Northern California (KPNC); and the California Department of Public Health (CDPH), with administrative and programmatic support provided by Heluna Health. Over the project period, the collaborating investigators will analyze approximately 35,000 genomes of SARS-CoV-2 specimens obtained from KPNC members and sequenced by the CDPH through its COVIDNet activities. By combining results from the genomic analysis of low-frequency alleles with clinical and epidemiologic data available in patient records, including demographic variables, COVID-19 vaccination status (dates of vaccination; number of doses; manufacturer), COVID-19 disease severity, and underlying medical conditions, we assessed which shared genomic variations are associated with a greater risk of symptomatic infection and severe clinical outcomes; COVID-19 vaccine effectiveness; and transmission of SARS-CoV-2 in the household. The project and its results can serve as a model for community-based monitoring of the evolution and spread of SARS-CoV-2 and use of the data to inform decisions about the formulation and use of COVID-19 vaccines, including booster doses and next-generation vaccines.

  9. f

    DataSheet_3_Epitope Classification and RBD Binding Properties of...

    • frontiersin.figshare.com
    pdf
    Updated May 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashlesha Deshpande; Bethany D. Harris; Luis Martinez-Sobrido; James J. Kobie; Mark R. Walter (2023). DataSheet_3_Epitope Classification and RBD Binding Properties of Neutralizing Antibodies Against SARS-CoV-2 Variants of Concern.pdf [Dataset]. http://doi.org/10.3389/fimmu.2021.691715.s003
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Frontiers
    Authors
    Ashlesha Deshpande; Bethany D. Harris; Luis Martinez-Sobrido; James J. Kobie; Mark R. Walter
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Severe acute respiratory syndrome coronavirus-2 (SAR-CoV-2) causes coronavirus disease 2019 (COVID19) that is responsible for short and long-term disease, as well as death, in susceptible hosts. The receptor binding domain (RBD) of the SARS-CoV-2 Spike (S) protein binds to cell surface angiotensin converting enzyme type-II (ACE2) to initiate viral attachment and ultimately viral pathogenesis. The SARS-CoV-2 S RBD is a major target of neutralizing antibodies (NAbs) that block RBD - ACE2 interactions. In this report, NAb-RBD binding epitopes in the protein databank were classified as C1, C1D, C2, C3, or C4, using a RBD binding profile (BP), based on NAb-specific RBD buried surface area and used to predict the binding epitopes of a series of uncharacterized NAbs. Naturally occurring SARS-CoV-2 RBD sequence variation was also quantified to predict NAb binding sensitivities to the RBD-variants. NAb and ACE2 binding studies confirmed the NAb classifications and determined whether the RBD variants enhanced ACE2 binding to promote viral infectivity, and/or disrupted NAb binding to evade the host immune response. Of 9 single RBD mutants evaluated, K417T, E484K, and N501Y disrupted binding of 65% of the NAbs evaluated, consistent with the assignment of the SARS-CoV-2 P.1 Japan/Brazil strain as a variant of concern (VoC). RBD variants E484K and N501Y exhibited ACE2 binding equivalent to a Wuhan-1 reference SARS-CoV-2 RBD. While slightly less disruptive to NAb binding, L452R enhanced ACE2 binding affinity. Thus, the L452R mutant, associated with the SARS-CoV-2 California VoC (B.1.427/B.1.429-California), has evolved to enhance ACE2 binding, while simultaneously disrupting C1 and C2 NAb classes. The analysis also identified a non-overlapping antibody pair (1213H7 and 1215D1) that bound to all SARS-CoV-2 RBD variants evaluated, representing an excellent therapeutic option for treatment of SARS-CoV-2 WT and VoC strains.

  10. t

    BIOGRID CURATED DATA FOR PUBLICATION: Epitope Classification and RBD Binding...

    • thebiogrid.org
    zip
    Updated Apr 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BioGRID Project (2021). BIOGRID CURATED DATA FOR PUBLICATION: Epitope Classification and RBD Binding Properties of Neutralizing Antibodies Against SARS-CoV-2 Variants of Concern. [Dataset]. https://thebiogrid.org/227928/publication/epitope-classification-and-rbd-binding-properties-of-neutralizing-antibodies-against-sars-cov-2-variants-of-concern.html
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 13, 2021
    Dataset authored and provided by
    BioGRID Project
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Protein-Protein, Genetic, and Chemical Interactions for Deshpande A (2021):Epitope Classification and RBD Binding Properties of Neutralizing Antibodies Against SARS-CoV-2 Variants of Concern. curated by BioGRID (https://thebiogrid.org); ABSTRACT: Severe acute respiratory syndrome coronavirus-2 (SAR-CoV-2) causes coronavirus disease 2019 (COVID19) that is responsible for short and long-term disease, as well as death, in susceptible hosts. The receptor binding domain (RBD) of the SARS-CoV-2 Spike (S) protein binds to cell surface angiotensin converting enzyme type-II (ACE2) to initiate viral attachment and ultimately viral pathogenesis. The SARS-CoV-2 S RBD is a major target of neutralizing antibodies (NAbs) that block RBD - ACE2 interactions. In this report, NAb-RBD binding epitopes in the protein databank were classified as C1, C1D, C2, C3, or C4, using a RBD binding profile (BP), based on NAb-specific RBD buried surface area and used to predict the binding epitopes of a series of uncharacterized NAbs. Naturally occurring SARS-CoV-2 RBD sequence variation was also quantified to predict NAb binding sensitivities to the RBD-variants. NAb and ACE2 binding studies confirmed the NAb classifications and determined whether the RBD variants enhanced ACE2 binding to promote viral infectivity, and/or disrupted NAb binding to evade the host immune response. Of 9 single RBD mutants evaluated, K417T, E484K, and N501Y disrupted binding of 65% of the NAbs evaluated, consistent with the assignment of the SARS-CoV-2 P.1 Japan/Brazil strain as a variant of concern (VoC). RBD variants E484K and N501Y exhibited ACE2 binding equivalent to a Wuhan-1 reference SARS-CoV-2 RBD. While slightly less disruptive to NAb binding, L452R enhanced ACE2 binding affinity. Thus, the L452R mutant, associated with the SARS-CoV-2 California VoC (B.1.427/B.1.429-California), has evolved to enhance ACE2 binding, while simultaneously disrupting C1 and C2 NAb classes. The analysis also identified a non-overlapping antibody pair (1213H7 and 1215D1) that bound to all SARS-CoV-2 RBD variants evaluated, representing an excellent therapeutic option for treatment of SARS-CoV-2 WT and VoC strains.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Nidhi Sharma (2023). COVID-19 Variant Data [Dataset]. https://www.kaggle.com/datasets/nidzsharma/covid-19-variant-data
Organization logo

COVID-19 Variant Data

COVID 19 Variant Data - CALIFORNIA

Explore at:
51 scholarly articles cite this dataset (View in Google Scholar)
zip(68659 bytes)Available download formats
Dataset updated
Mar 12, 2023
Authors
Nidhi Sharma
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

The California Department of Public Health (CDPH) is identifying ​the prevalence of circulating SARS-CoV-2 variants by analysing ​CDPH Genomic Surveillance Data and ​CalREDIE, CDPH's communicable disease reporting and surveillance system. Viruses mutate into new strains or variants over time. Some variants emerge and then disappear. Other variants become common and circulate for a long time. Several specialized laboratories state-wide sequence the genomes of a fraction of all positive COVID-19 tests to determine which variants are circulating. Sequencing and reporting of variant results takes several days after a test is identified as a positive for COVID-19. Not all ​viruses from positive COVID-19 tests are ​sequenced. Knowing what variants are circulating in California informs public health and clinical action.

Search
Clear search
Close search
Google apps
Main menu