A. SUMMARY Medical provider confirmed COVID-19 cases and confirmed COVID-19 related deaths in San Francisco, CA aggregated by several different geographic areas and normalized by 2016-2020 American Community Survey (ACS) 5-year estimates for population data to calculate rate per 10,000 residents. On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021. Cases and deaths are both mapped to the residence of the individual, not to where they were infected or died. For example, if one was infected in San Francisco at work but lives in the East Bay, those are not counted as SF Cases or if one dies in Zuckerberg San Francisco General but is from another county, that is also not counted in this dataset. Dataset is cumulative and covers cases going back to 3/2/2020 when testing began. Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas B. HOW THE DATASET IS CREATED Addresses from medical data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area. The 2016-2020 American Community Survey (ACS) population estimates provided by the Census are used to create a rate which is equal to ([count] / [acs_population]) * 10000) representing the number of cases per 10,000 residents. C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 7:30 Pacific Time. D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). Privacy rules in effect To protect privacy, certain rules are in effect: 1. Case counts greater than 0 and less than 10 are dropped - these will be null (blank) values 2. Death counts greater than 0 and less than 10 are dropped - these will be null (blank) values 3. Cases and deaths dropped altogether for areas where acs_population < 1000 Rate suppression in effect where counts lower than 20 Rates are not calculated unless the case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology. A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are areal representations of routes. Read how the Census develops ZCTAs on their website. Row included for Citywide case counts, incidence rate, and deaths A single row is included that has the Citywide case counts and incidence rate. This can be used for comparisons. Citywide will capture all cases regardless of address quality. While some cases cannot be mapped to sub-areas like Census Tracts, ongo
A. SUMMARY This dataset represents the COVID-19 vaccinations given to residents of San Francisco. All vaccines given to SF residents are included, no matter where the vaccination took place (the vaccine may have been administered in San Francisco or outside of San Francisco). The data are broken down by multiple demographic stratifications. This dataset also includes COVID-19 vaccinations given to SF residents by the San Francisco Department of Public Health (SFDPH).
Data provides counts for residents who have received at least one dose, residents who have completed a primary vaccine series, residents who have received one or two monovalent (not bivalent) booster doses, and residents who have received a bivalent booster dose. A primary vaccine series is complete after an individual has received all intended doses of the initial series. There are one, two, and three dose primary vaccine series.
B. HOW THE DATASET IS CREATED Information on doses administered to those who live in San Francisco is from the California Immunization Registry (CAIR2), run by the California Department of Public Health (CDPH). The information on individuals’ city of residence, age, race, and ethnicity are also recorded in CAIR and are self-reported at the time of vaccine administration.
In order to estimate the percent of San Franciscans vaccinated, we provide the 2016-2020 American Community Survey (ACS) population estimates for each demographic group.
C. UPDATE PROCESS Updated daily via automated process
D. HOW TO USE THIS DATASET San Francisco population estimates for race/ethnicity and age groups can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
Before analysis, you must filter the dataset to the desired stratification of data using the "overall_segment" column.
For example, filtering "overall_segment" to "All SF Residents by Age Bracket, Administered by All Providers" will filter the data to residents whose vaccinations were administered by any provider. You can then further segment the data and calculate percentages by Age Brackets.
If you filter "overall_segment" to "All SF Residents by Race/Ethnicity, Administered by DPH Only", you will see the race/ethnicity breakdown for residents who received vaccinations from the San Francisco Department of Public Health (SFDPH).
If you filter "overall_segment" to "All SF Residents by Age Group, Administered by All Providers" you will see vaccination counts of various age eligibility groups that were administered by any provider.
To count the number of individuals vaccinated (with any primary series dose), use the "total_recipients" column. To count the number of individuals who have completed their primary vaccine series, use the "total_series_completed" column. To count the number of primary series doses administered (1st, 2nd, 3rd, or single doses), use the "total_primary_series_doses" column.
To count the number of individuals who received one or two monovalent (not bivalent) booster doses, use the "total_booster_recipients" and "total_2nd_booster_recipients" columns. To count the number of individuals who received their first bivalent booster dose, use the "total_bivalent_booster_recipients" column. To count the number of monovalent (not including bivalent) or bivalent booster doses administered, use the "total_booster_doses" or "total_bivalent_booster_doses" columns.
E. ARCHIVED DATA A previous version of this dataset was archived on 10/27/2022. For historical purposes, you can access the archived dataset at the following link:
ARCHIVED: COVID-19 Vaccine Doses Given to San Franciscans by Demographics
F. CHA
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Robustness check with marginal probabilities estimated from logit and probit models for respondents who were extremely worried about COVID-19 after versus before the San Francisco Bay Area shelter-in-place announcement 1.
A. SUMMARY This dataset includes COVID-19 tests by resident neighborhood and specimen collection date (the day the test was collected). Specifically, this dataset includes tests of San Francisco residents who listed a San Francisco home address at the time of testing. These resident addresses were then geo-located and mapped to neighborhoods. The resident address associated with each test is hand-entered and susceptible to errors, therefore neighborhood data should be interpreted as an approximation, not a precise nor comprehensive total.
In recent months, about 5% of tests are missing addresses and therefore cannot be included in any neighborhood totals. In earlier months, more tests were missing address data. Because of this high percentage of tests missing resident address data, this neighborhood testing data for March, April, and May should be interpreted with caution (see below)
Percentage of tests missing address information, by month in 2020 Mar - 33.6% Apr - 25.9% May - 11.1% Jun - 7.2% Jul - 5.8% Aug - 5.4% Sep - 5.1% Oct (Oct 1-12) - 5.1%
To protect the privacy of residents, the City does not disclose the number of tests in neighborhoods with resident populations of fewer than 1,000 people. These neighborhoods are omitted from the data (they include Golden Gate Park, John McLaren Park, and Lands End).
Tests for residents that listed a Skilled Nursing Facility as their home address are not included in this neighborhood-level testing data. Skilled Nursing Facilities have required and repeated testing of residents, which would change neighborhood trends and not reflect the broader neighborhood's testing data.
This data was de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected).
The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco. During this investigation, some test results are found to be for persons living outside of San Francisco and some people in San Francisco may be tested multiple times (which is common). To see the number of new confirmed cases by neighborhood, reference this map: https://sf.gov/data/covid-19-case-maps#new-cases-maps
B. HOW THE DATASET IS CREATED COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information. All testing data is then geo-coded by resident address. Then data is aggregated by analysis neighborhood and specimen collection date.
Data are prepared by close of business Monday through Saturday for public display.
C. UPDATE PROCESS Updates automatically at 05:00 Pacific Time each day. Redundant runs are scheduled at 07:00 and 09:00 in case of pipeline failure.
D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.
In order to track trends over time, a data user can analyze this data by "specimen_collection_date".
Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of pe
COVID-19 data for San Francisco from data.sfgov.org
COVID-19 Hospitalizations
COVID-19 Cases Summarized by Date, Transmission and Case Disposition
COVID-19 Cases Summarized by Race and Ethnicity
COVID-19 Cases Summarized by Age Group and Gender
COVID-19 Tests
Data from https://data.sfgov.org/analytics and https://data.sfgov.org/COVID-19/COVID-19-Cases-Summarized-by-Age-Group-and-Gender/sunc-2t3k and https://data.sfgov.org/COVID-19/COVID-19-Tests/nfpa-mg4g and https://data.sfgov.org/COVID-19/COVID-19-Hospitalizations/nxjg-bhem and https://data.sfgov.org/COVID-19/COVID-19-Cases-Summarized-by-Date-Transmission-and/tvq9-ec9w and https://data.sfgov.org/COVID-19/COVID-19-Cases-Summarized-by-Race-and-Ethnicity/vqqm-nsqg
Dataset license: https://datasf.org/opendata/terms-of-use/
Banner Photo by Maarten van den Heuvel on Unsplash
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘COVID-19 Cases and Deaths Summarized by Geography’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/d2e381bb-f395-4b40-979e-920a79a3db88 on 11 February 2022.
--- Dataset description provided by original source is as follows ---
Note: On January 22, 2022, system updates to improve the timeliness and accuracy of San Francisco COVID-19 cases and deaths data were implemented. You might see some fluctuations in historic data as a result of this change. Due to the changes, starting on January 22, 2022, the number of new cases reported daily will be higher than under the old system as cases that would have taken longer to process will be reported earlier.
Note: As of April 16, 2021, this dataset will update daily with a five-day data lag.
A. SUMMARY Medical provider confirmed COVID-19 cases and confirmed COVID-19 related deaths in San Francisco, CA aggregated by several different geographic areas and normalized by 2019 American Community Survey (ACS) 5-year estimates for population data to calculate rate per 10,000 residents.
Cases and deaths are both mapped to the residence of the individual, not to where they were infected or died. For example, if one was infected in San Francisco at work but lives in the East Bay, those are not counted as SF Cases or if one dies in Zuckerberg San Francisco General but is from another county, that is also not counted in this dataset.
Dataset is cumulative and covers cases going back to March 2nd, 2020 when testing began.
Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas
B. HOW THE DATASET IS CREATED Addresses from medical data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area. The 2019 ACS estimates for population provided by the Census are used to create a rate which is equal to ([count] / [acs_population]) * 10000) representing the number of cases per 10,000 residents.
C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 7:30 Pacific Time.
D. HOW TO USE THIS DATASET Privacy rules in effect To protect privacy, certain rules are in effect: 1. Case counts greater than 0 and less than 10 are dropped - these will be null (blank) values 2. Death counts greater than 0 and less than 10 are dropped - these will be null (blank) values 3. Cases and deaths dropped altogether for areas where acs_population < 1000
Rate suppression in effect where counts lower than 20 Rates are not calculated unless the case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology.
A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are areal representations of routes. Read how the Census develops ZCTAs on their website.
Row included for Citywide case counts, incidence rate, and deaths A single row is included that has the Citywide case counts and incidence rate. This can be used for comparisons. Citywide will capture all cases regardless of address quality. While some cases cannot be mapped to sub-areas like Census Tracts, ongoing data quality efforts result in improved mapping on a rolling bases.
--- Original source retains full ownership of the source dataset ---
As of 9/12/2024, we have resumed reporting on COVID-19 hospitalization data using a San Francisco specific dataset. These new data differ slightly from previous hospitalization data sources but the overall patterns and trends in hospitalizations remain consistent. You can access the previous data here. A. SUMMARY This dataset includes information on COVID+ hospital admissions for San Francisco residents into San Francisco hospitals. Specifically, the dataset includes the count and rate of COVID+ hospital admissions per 100,000. The data are reported by week. B. HOW THE DATASET IS CREATED Hospital admission data is reported to the San Francisco Department of Public Health (SFDPH) via the COVID Hospital Data Repository (CHDR), a system created via health officer order C19-16. The data includes all San Francisco hospitals except for the San Francisco VA Medical Center. San Francisco population estimates are pulled from a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2018-2022 5-year American Community Survey (ACS). C. UPDATE PROCESS Data updates weekly on Wednesday with data for the past Wednesday-Tuesday (one week lag). Data may change as more current information becomes available. D. HOW TO USE THIS DATASET New admissions are the count of COVID+ hospital admissions among San Francisco residents to San Francisco hospitals by week. The admission rate per 100,000 is calculated by multiplying the count of admissions each week by 100,000 and dividing by the population estimate. E. CHANGE LOG
As of July 2nd, 2024 the COVID-19 Deaths by Population Characteristics Over Time dataset has been retired. This dataset is archived and will no longer update. We will be publishing a cumulative deaths by population characteristics dataset that will update moving forward.
A. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics and by date. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals for previous days may increase or decrease. More recent data is less reliable.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.
Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
To protect resident privacy, we summarize COVID-19 data by only one characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.
Data notes on each population characteristic type is listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.
Gender * The City collects information on gender identity using these guidelines.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
This dataset includes many different types of characteristics. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of deaths on each date.
New deaths are the count of deaths within that characteristic group on that specific date. Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.
This data may not be immediately available for more recent deaths. Data updates as more information becomes available.
To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.
E. CHANGE LOG
A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ethnicity and by date. This dataset represents the daily count of tests collected, and the breakdown of test results (positive, negative, or indeterminate). Tests in this dataset include all those collected from persons who listed San Francisco as their home address at the time of testing. It also includes tests that were collected by San Francisco providers for persons who were missing a locating address. This dataset does not include tests for residents listing a locating address outside of San Francisco, even if they were tested in San Francisco.
The data were de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected). If a person tested multiple times on the same date, only one test is included from that date. When there are multiple tests on the same date, a positive result, if one exists, will always be selected as the record for the person. If a PCR and antigen test are taken on the same day, the PCR test will supersede. If a person tests multiple times on the same day and the results are all the same (e.g. all negative or all positive) then the first test done is selected as the record for the person.
The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco.
When a person gets tested for COVID-19, they may be asked to report information about themselves. One piece of information that might be requested is a person's race and ethnicity. These data are often incomplete in the laboratory and provider reports of the test results sent to the health department. The data can be missing or incomplete for several possible reasons:
• The person was not asked about their race and ethnicity.
• The person was asked, but refused to answer.
• The person answered, but the testing provider did not include the person's answers in the reports.
• The testing provider reported the person's answers in a format that could not be used by the health department.
For any of these reasons, a person's race/ethnicity will be recorded in the dataset as “Unknown.”
B. NOTE ON RACE/ETHNICITY The different values for Race/Ethnicity in this dataset are "Asian;" "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" "White;" "Multi-racial;" "Other;" and “Unknown."
The Race/Ethnicity categorization increases data clarity by emulating the methodology used by the U.S. Census in the American Community Survey. Specifically, persons who identify as "Asian," "Black or African American," "American Indian or Alaska Native," "Native Hawaiian or Other Pacific Islander," "White," "Multi-racial," or "Other" do NOT include any person who identified as Hispanic/Latino at any time in their testing reports that either (1) identified them as SF residents or (2) as someone who tested without a locating address by an SF provider. All persons across all races who identify as Hispanic/Latino are recorded as “"Hispanic or Latino/a, all races." This categorization increases data accuracy by correcting the way “Other” persons were counted. Previously, when a person reported “Other” for Race/Ethnicity, they would be recorded “Unknown.” Under the new categorization, they are counted as “Other” and are distinct from “Unknown.”
If a person records their race/ethnicity as “Asian,” “Black or African American,” “American Indian or Alaska Native,” “Native Hawaiian or Other Pacific Islander,” “White,” or “Other” for their first COVID-19 test, then this data will not change—even if a different race/ethnicity is reported for this person for any future COVID-19 test. There are two exceptions to this rule. The first exception is if a person’s race/ethnicity value i
A. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals may increase or decrease.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.
Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
To protect resident privacy, we summarize COVID-19 data by only one population characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.
Data notes on select population characteristic types are listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.
Gender * The City collects information on gender identity using these guidelines.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a dataset based on the San Francisco Population and Demographic Census dataset.These population estimates are from the 2018-2022 5-year American Community Survey (ACS).
This dataset includes several characteristic types. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cumulative deaths.
Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.
To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.
E. CHANGE LOG
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
It is increasingly recognized that policies have played a role in both alleviating and exacerbating the health and economic consequences of the COVID-19 pandemic. Yet there has been limited work to systematically evaluate the substantial variation in local COVID-19-related policies in the U.S. The objective of the U.S. COVID-19 County Policy (UCCP) Database is to systematically gather, characterize, and assess variation in U.S. county-level COVID-19-related policies. The current data upload represents the first wave of data collection, which includes data on over 20 policies gathered across 171 counties in 7 states during January-March 2021. These include county-level COVID-19-related policies within 3 policy domains that are likely to affect a variety of health outcomes: (1) containment/closure, (2) economic support, and (3) public health. In ongoing work, we are conducting retrospective longitudinal weekly data collection for the period 2020-2021 from a larger swath of 300+ U.S. counties in all 50 states and Washington D.C., and the current database will be updated with new data as it becomes available.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Covid-19 Tests by Race Ethnicity and Date’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/68410b4b-052f-4ce3-8d0c-873b5664f1a4 on 27 January 2022.
--- Dataset description provided by original source is as follows ---
Note: As of April 16, 2021, this dataset will update daily with a five-day data lag.
A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ ethnicity and date. For each day, this dataset represents the daily count of tests collected by race/ethnicity, and how many of those were positive, negative, and indeterminate. Tests in this dataset include all tests collected from San Francisco residents who listed a San Francisco home address at the time of testing, and tests that were collected in San Francisco but had a missing home address. Data are based on information collected at the time of testing.
For recent data, about 25-30% of tests are missing race/ ethnicity information. Tests where the race/ ethnicity of the patient is unknown are included in the dataset under the "Unknown" category.
This data was de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected).
The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco. Each positive test result is investigated. During this investigation, some test results are found to be for persons living outside of San Francisco and some people in San Francisco may be tested multiple times. In both cases, these results are not included in San Francisco’s total COVID-19 case count. To track the number of cases by race/ ethnicity, see this dashboard: https://data.sfgov.org/stories/s/w6za-6st8
B. HOW THE DATASET IS CREATED COVID-19 laboratory test data is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information.
C. UPDATE PROCESS Updates automatically at 05:00 Pacific Time each day. Redundant runs are scheduled at 07:00 and 09:00 in case of pipeline failure.
D. HOW TO USE THIS DATASET Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments.
In order to track trends over time, a data user can analyze this data by "specimen_collection_date".
Calculating Percent Positivity: The positivity rate is the percentage of tests that return a positive result for COVID-19 (positive tests divided by the sum of positive and negative tests). Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. When there are fewer than 20 positives tests for a given race/ethnicity and time period, the positivity rate is not calculated for the public tracker because rates of small test counts are less reliable.
Calculating Testing Rates: To calculate the testing rate per 10,000 residents, divide the total number of tests collected (positive, negative, and indeterminate results) for the specified race/ ethnicity by the total number of residents who identify as that race/ ethnicity (according to the 2018 5-year estimates from the American Community Survey), then multiply by 10,000. When there are fewer than 20 total tests for a given race/ethnicity and time period, the testing rate is not calculated for the public tracker because rates of small test counts are less reliable.
Read more about how this data is updated and validated daily: https://data.sfgov.org/stories/s/nudz-9tg2
There are two other datasets related to tests: 1. COVID-19 Tests 2. <a href="https://data.sfgov.org/dataset/Covid-19-Testing-by
--- Original source retains full ownership of the source dataset ---
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY This dataset represents San Francisco COVID-19 related deaths by day. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals for previous days may increase or decrease. More recent data is less reliable.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death data is provided by the California Department of Public Health.
It takes time to process this data. Because of this, death totals may increase or decrease over time.
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET This dataset shows new deaths and cumulative deaths by date of death. New deaths are the count of deaths on that specific date. Cumulative deaths are the running total of all San Francisco COVID-19 deaths up to the date listed.
Use the Deaths by Population Characteristics Over Time dataset to see deaths by different subgroups including race/ethnicity, age, and gender.
E. CHANGE LOG
A. SUMMARY Geographic zones of the priority areas in the Tenderloin neighborhood used in the COVID-19 assessment and Tenderloin Neighborhood Plan. See more details on the plan here: https://sf.gov/news/san-francisco-releases-tenderloin-neighborhood-safety-assessment-and-plan-covid-19
B. HOW THE DATASET IS CREATED A team of representative City departments from the Healthy Streets Operation Center (Department of Emergency Management, Department of Public Health, Department of Homelessness and Supportive Housing, Human Rights Commission, San Francisco Police Department, San Francisco Fire Department, and Department of Public Works), SF Homeless Outreach Team, Felton Institute, and community groups and stakeholders was assembled to design and implement a robust Tenderloin Neighborhood Needs Assessment. This assessment was conducted on the morning of April 28, 2020 and consisted of multi-disciplinary teams walking each block of an area of the Tenderloin broken into six geographic zones. These zone locations are shown in the plan, and are mapped in this dataset.
C. UPDATE PROCESS This is a reference map that will not be updated.
D. HOW TO USE THIS DATASET These zones can be used with other datasets to track trends by zone. Note that these zones are the priority zones for the Tenderloin Plan and do not represent the entire Tenderloin Neighborhood boundary. For a boundary of the entire Tenderloin, use the analysis neighborhood boundary: https://data.sfgov.org/Geographic-Locations-and-Boundaries/Analysis-Neighborhoods/p5b7-5n3h
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘COVID-19 Deaths Over Time’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/55529260-9d00-464d-ad56-187b2ca7cd15 on 27 January 2022.
--- Dataset description provided by original source is as follows ---
Note: On January 22, 2022, system updates to improve the timeliness and accuracy of San Francisco COVID-19 cases and deaths data were implemented. You might see some fluctuations in historic data as a result of this change.
A. SUMMARY This dataset represents San Francisco COVID-19 related deaths by day. Deaths are included on the date the individual died.
Data is lagged by five days, meaning the most date included is 5 days prior to today. All data update daily as more information becomes available.
B. HOW THE DATASET IS CREATED COVID-19 deaths are suspected to be associated with COVID-19. This means COVID-19 is listed as a cause of death or significant condition on the death certificate.
Deaths may be reported by:
It takes time to process this data. Because of this, data is lagged by 5 days and death totals for previous days may increase or decrease. More recent data is less reliable.
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
C. UPDATE PROCESS Updates automatically at 05:00 AM Pacific Time each day. Redundant runs are scheduled at 07:00 AM and 09:00 AM in case of pipeline failure.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET This dataset shows new deaths and cumulative deaths by date of death. New deaths are the count of deaths on that specific date. Cumulative deaths are the running total of all San Francisco COVID-19 deaths up to the date listed.
Use the Deaths by Population Characteristics dataset to see deaths by different subgroups including race/ethnicity, age, gender, and homelessness.
--- Original source retains full ownership of the source dataset ---
A. SUMMARY This dataset contains COVID-19 positive confirmed cases aggregated by several different geographic areas and by day. COVID-19 cases are mapped to the residence of the individual and shown on the date the positive test was collected. In addition, 2016-2020 American Community Survey (ACS) population estimates are included to calculate the cumulative rate per 10,000 residents. Dataset covers cases going back to 3/2/2020 when testing began. This data may not be immediately available for recently reported cases and data will change to reflect as information becomes available. Data updated daily. Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas B. HOW THE DATASET IS CREATED Addresses from the COVID-19 case data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area for a given date. The 2016-2020 American Community Survey (ACS) population estimates provided by the Census are used to create a cumulative rate which is equal to ([cumulative count up to that date] / [acs_population]) * 10000) representing the number of total cases per 10,000 residents (as of the specified date). COVID-19 case data undergo quality assurance and other data verification processes and are continually updated to maximize completeness and accuracy of information. This means data may change for previous days as information is updated. C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 05:00 Pacific Time. D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). This dataset can be used to track the spread of COVID-19 throughout the city, in a variety of geographic areas. Note that the new cases column in the data represents the number of new cases confirmed in a certain area on the specified day, while the cumulative cases column is the cumulative total of cases in a certain area as of the specified date. Privacy rules in effect To protect privacy, certain rules are in effect: 1. Any area with a cumulative case count less than 10 are dropped for all days the cumulative count was less than 10. These will be null values. 2. Once an area has a cumulative case count of 10 or greater, that area will have a new row of case data every day following. 3. Cases are dropped altogether for areas where acs_population < 1000 4. Deaths data are not included in this dataset for privacy reasons. The low COVID-19 death rate in San Francisco, along with other publicly available information on deaths, means that deaths data by geography and day is too granular and potentially risky. Read more in our privacy guidelines Rate suppression in effect where counts lower than 20 Rates are not calculated unless the cumulative case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology. A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are spec
On 6/28/2023, data on cases by vaccination status will be archived and will no longer update.
A. SUMMARY This dataset represents San Francisco COVID-19 positive confirmed cases by vaccination status over time, starting January 1, 2021. Cases are included on the date the positive test was collected (the specimen collection date). Cases are counted in three categories: (1) all cases; (2) unvaccinated cases; and (3) completed primary series cases.
All cases: Includes cases among all San Francisco residents regardless of vaccination status.
Unvaccinated cases: Cases are considered unvaccinated if their positive COVID-19 test was before receiving any vaccine. Cases that are not matched to a COVID-19 vaccination record are considered unvaccinated.
Completed primary series cases: Cases are considered completed primary series if their positive COVID-19 test was 14 days or more after they received their 2nd dose in a 2-dose COVID-19 series or the single dose of a 1-dose vaccine. These are also called “breakthrough cases.”
On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021.
Data is lagged by eight days, meaning the most recent specimen collection date included is eight days prior to today. All data updates daily as more information becomes available.
B. HOW THE DATASET IS CREATED Case information is based on confirmed positive laboratory tests reported to the City. The City then completes quality assurance and other data verification processes. Vaccination data comes from the California Immunization Registry (CAIR2). The California Department of Public Health runs CAIR2. Individual-level case and vaccination data are matched to identify cases by vaccination status in this dataset. Case records are matched to vaccine records using first name, last name, date of birth, phone number, and email address.
We include vaccination records from all nine Bay Area counties in order to improve matching rates. This allows us to identify breakthrough cases among people who moved to the City from other Bay Area counties after completing their vaccine series. Only cases among San Francisco residents are included.
C. UPDATE PROCESS Updates automatically at 08:00 AM Pacific Time each day.
D. HOW TO USE THIS DATASET Total San Francisco population estimates can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). To identify total San Francisco population estimates, filter the view on “demographic_category_label” = “all ages”.
Population estimates by vaccination status are derived from our publicly reported vaccination counts, which can be found at COVID-19 Vaccinations Given to SF Residents Over Time.
The dataset includes new cases, 7-day average new cases, new case rates, 7-day average new case rates, percent of total cases, and 7-day average percent of total cases for each vaccination category.
New cases are the count of cases where the positive tests were collected on that specific specimen collection date. The 7-day rolling average shows the trend in new cases. The rolling average is calculated by averaging the new cases for a particular day with the prior 6 days.
New case rates are the count of new cases per 100,000 residents in each vaccination status group. The 7-day rolling average shows the trend in case rates. The rolling average is calculated by averaging the case rate for a part
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Alternative characterizations of DID groups for analysis of respondents who were sheltering-in-place all of the time after versus before the San Francisco Bay Area shelter-in-place announcement.
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Note: As of March 2022, the race/ethnicity label changed from Native American to American Indian or Alaska Native to align with the Census.
Note: As of April 16, 2021, this dataset will update daily with a five-day data lag.
Note: As of February 2022, the way race/ethnicity is categorized has been changed. See Section B for additional information.
A. SUMMARY This dataset includes San Francisco COVID-19 tests by race/ethnicity and by date. This dataset represents the daily count of tests collected, and the breakdown of test results (positive, negative, or indeterminate). Tests in this dataset include all those collected from persons who listed San Francisco as their home address at the time of testing. It also includes tests that were collected by San Francisco providers for persons who were missing a locating address. This dataset does not include tests for residents listing a locating address outside of San Francisco, even if they were tested in San Francisco.
The data were de-duplicated by individual and date, so if a person gets tested multiple times on different dates, all tests will be included in this dataset (on the day each test was collected). If a person tested multiple times on the same date, only one test is included from that date. When there are multiple tests on the same date, a positive result, if one exists, will always be selected as the record for the person. If a PCR and antigen test are taken on the same day, the PCR test will supersede. If a person tests multiple times on the same day and the results are all the same (e.g. all negative or all positive) then the first test done is selected as the record for the person.
The total number of positive test results is not equal to the total number of COVID-19 cases in San Francisco. Each positive test result is investigated by the health department. While the city tries to only report on tests for San Francisco residents (or tests in San Francisco for those with no locating address listed), some test results purported to be for San Francisco residents are actually for people living outside the city. This can be discovered during a case investigation or data quality assurance. In such an instance, the test would be counted as a positive test in the SF data but would not be counted as a COVID-19 case in San Francisco. If a person tests positive for COVID-19 on different dates, they would be included each of those times in the testing data but only one case. To track the number of cases by race/ethnicity, see this dashboard: https://sf.gov/data/covid-19-population-characteristics#race-or-ethnicity-
When a person gets tested for COVID-19, they may be asked to report information about themselves. One piece of information that might be requested is a person's race and ethnicity. These data are often incomplete in the laboratory and provider reports of the test results sent to the health department. The data can be missing or incomplete for several possible reasons:
• The person was not asked about their race and ethnicity.
• The person was asked, but refused to answer.
• The person answered, but the testing provider did not include the person's answers in the reports.
• The testing provider reported the person's answers in a format that could not be used by the health department.
For any of these reasons, a person's race/ethnicity will be recorded in the dataset as “Unknown.”
B. NOTE ON RACE/ETHNICITY The different values for Race/Ethnicity in this dataset are "Asian;" "Black or African American;" "Hispanic or Latino/a, all races;" "American Indian or Alaska Native;" "Native Hawaiian or Other Pacific Islander;" "White;" "Multi-racial;" "Other;" and “Unknown."
On February 10, 2022, the method for which race/ethnicity is categorized was updated for the sake of data accuracy, clarity, and stability. The new categorization increases data clarity by emulating the methodology used by the U.S. Census in the
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘COVID-19 Vaccine Doses Given to San Franciscans by Demographics Over Time’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/96b957b9-1892-41fb-a453-685a8658f385 on 11 February 2022.
--- Dataset description provided by original source is as follows ---
A. SUMMARY This dataset represents doses of COVID-19 vaccine administered in California to San Francisco residents over time. All vaccines given to people who live in San Francisco are included, no matter where the vaccination took place (the vaccine may have been administered in San Francisco or outside of San Francisco). The data are broken down by multiple demographic stratifications. As of December 21, 2021, four dose types are counted separately, i.e. (1) first doses administered as a part of a two-dose primary vaccination series, (2) second doses administered as part of a two-dose primary vaccination series, and (3) single-dose vaccines administered as part of a one-dose primary series, and (4) booster doses regardless of primary vaccine type.
B. HOW THE DATASET IS CREATED Information on doses administered to those who live in San Francisco is from the California Immunization Registry (CAIR), run by the California Department of Public Health (CDPH). The information on individuals’ city of residence, age, race, and ethnicity are also recorded in CAIR and are self-reported at the time of vaccine administration.
In order to estimate the percent of San Franciscans vaccinated, we provide the same 2019 five-year American Community Survey population estimates that are used in our public dashboards.
C. UPDATE PROCESS Updated daily via automated process
D. HOW TO USE THIS DATASET Before analysis, you must filter the dataset to the desired stratification of data using the OVERALL_SEGMENT column.
For example, filtering OVERALL_SEGMENT to "Ages 5+ by Age Bracket, Administered by All Providers" will filter the data to residents 5 and over whose vaccinations were administered by any provider. You can then further segment the data and calculate percentages by Age Brackets.
If you filter OVERALL_SEGMENT to "Ages 65+ by Race/Ethnicity, Administered by DPH Only", you will see the race/ethnicity breakdown for residents aged 65+ who received vaccinations from San Francisco’s Department of Public Health (DPH).
--- Original source retains full ownership of the source dataset ---
A. SUMMARY Medical provider confirmed COVID-19 cases and confirmed COVID-19 related deaths in San Francisco, CA aggregated by several different geographic areas and normalized by 2016-2020 American Community Survey (ACS) 5-year estimates for population data to calculate rate per 10,000 residents. On September 12, 2021, a new case definition of COVID-19 was introduced that includes criteria for enumerating new infections after previous probable or confirmed infections (also known as reinfections). A reinfection is defined as a confirmed positive PCR lab test more than 90 days after a positive PCR or antigen test. The first reinfection case was identified on December 7, 2021. Cases and deaths are both mapped to the residence of the individual, not to where they were infected or died. For example, if one was infected in San Francisco at work but lives in the East Bay, those are not counted as SF Cases or if one dies in Zuckerberg San Francisco General but is from another county, that is also not counted in this dataset. Dataset is cumulative and covers cases going back to 3/2/2020 when testing began. Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas B. HOW THE DATASET IS CREATED Addresses from medical data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area. The 2016-2020 American Community Survey (ACS) population estimates provided by the Census are used to create a rate which is equal to ([count] / [acs_population]) * 10000) representing the number of cases per 10,000 residents. C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 7:30 Pacific Time. D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS). Privacy rules in effect To protect privacy, certain rules are in effect: 1. Case counts greater than 0 and less than 10 are dropped - these will be null (blank) values 2. Death counts greater than 0 and less than 10 are dropped - these will be null (blank) values 3. Cases and deaths dropped altogether for areas where acs_population < 1000 Rate suppression in effect where counts lower than 20 Rates are not calculated unless the case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology. A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are areal representations of routes. Read how the Census develops ZCTAs on their website. Row included for Citywide case counts, incidence rate, and deaths A single row is included that has the Citywide case counts and incidence rate. This can be used for comparisons. Citywide will capture all cases regardless of address quality. While some cases cannot be mapped to sub-areas like Census Tracts, ongo