ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY Medical provider confirmed COVID-19 cases and confirmed COVID-19 related deaths in San Francisco, CA aggregated by Census ZIP Code Tabulation Areas and normalized by 2018 American Community Survey (ACS) 5-year estimates for population data to calculate rate per 10,000 residents.
Cases and deaths are both mapped to the residence of the individual, not to where they were infected or died. For example, if one was infected in San Francisco at work but lives in the East Bay, those are not counted as SF Cases or if one dies in Zuckerberg San Francisco General but is from another county, that is also not counted in this dataset.
Dataset is cumulative and covers cases going back to March 2nd, 2020 when testing began. It is updated daily.
B. HOW THE DATASET IS CREATED Addresses from medical data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area. The 2018 ACS estimates for population provided by the Census are used to create a rate which is equal to ([count] / [acs_population]) * 10000) representing the number of cases per 10,000 residents.
C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset each day.
D. HOW TO USE THIS DATASET Privacy rules in effect To protect privacy, certain rules are in effect: 1. Case counts greater than 0 and less than 10 are dropped - these will be null (blank) values 2. Cases dropped altogether for areas where acs_population < 1000
Rate suppression in effect where counts lower than 20 Rates are not calculated unless the case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology.
A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are polygonal representations of USPS ZIP Code service area routes. Read how the Census develops ZCTAs on their website.
This dataset is a filtered view of another dataset You can find a full dataset of cases and deaths summarized by this and other geographic areas.
E. CHANGE LOG
A. SUMMARY This dataset shows San Francisco COVID-19 deaths by population characteristics. This data may not be immediately available for recently reported deaths. Data updates as more information becomes available. Because of this, death totals may increase or decrease.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how deaths have been distributed among different subgroups. This information can reveal trends and disparities among groups.
B. HOW THE DATASET IS CREATED As of January 1, 2023, COVID-19 deaths are defined as persons who had COVID-19 listed as a cause of death or a significant condition contributing to their death on their death certificate. This definition is in alignment with the California Department of Public Health and the national https://preparedness.cste.org/wp-content/uploads/2022/12/CSTE-Revised-Classification-of-COVID-19-associated-Deaths.Final_.11.22.22.pdf">Council of State and Territorial Epidemiologists. Death certificates are maintained by the California Department of Public Health.
Data on the population characteristics of COVID-19 deaths are from: *Case reports *Medical records *Electronic lab reports *Death certificates
Data are continually updated to maximize completeness of information and reporting on San Francisco COVID-19 deaths.
To protect resident privacy, we summarize COVID-19 data by only one population characteristic at a time. Data are not shown until cumulative citywide deaths reach five or more.
Data notes on select population characteristic types are listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases.
Gender * The City collects information on gender identity using these guidelines.
C. UPDATE PROCESS Updates automatically at 06:30 and 07:30 AM Pacific Time on Wednesday each week.
Dataset will not update on the business day following any federal holiday.
D. HOW TO USE THIS DATASET Population estimates are only available for age groups and race/ethnicity categories. San Francisco population estimates for race/ethnicity and age groups can be found in a dataset based on the San Francisco Population and Demographic Census dataset.These population estimates are from the 2018-2022 5-year American Community Survey (ACS).
This dataset includes several characteristic types. Filter the “Characteristic Type” column to explore a topic area. Then, the “Characteristic Group” column shows each group or category within that topic area and the number of cumulative deaths.
Cumulative deaths are the running total of all San Francisco COVID-19 deaths in that characteristic group up to the date listed.
To explore data on the total number of deaths, use the COVID-19 Deaths Over Time dataset.
E. CHANGE LOG
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘COVID-19 Cases by Population Characteristics Over Time’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://catalog.data.gov/dataset/a3291d85-0076-43c5-a59c-df49480cdc6d on 13 February 2022.
--- Dataset description provided by original source is as follows ---
Note: On January 22, 2022, system updates to improve the timeliness and accuracy of San Francisco COVID-19 cases and deaths data were implemented. You might see some fluctuations in historic data as a result of this change. Due to the changes, starting on January 22, 2022, the number of new cases reported daily will be higher than under the old system as cases that would have taken longer to process will be reported earlier.
A. SUMMARY This dataset shows San Francisco COVID-19 cases by population characteristics and by specimen collection date. Cases are included on the date the positive test was collected.
Population characteristics are subgroups, or demographic cross-sections, like age, race, or gender. The City tracks how cases have been distributed among different subgroups. This information can reveal trends and disparities among groups.
Data is lagged by five days, meaning the most recent specimen collection date included is 5 days prior to today. Tests take time to process and report, so more recent data is less reliable.
B. HOW THE DATASET IS CREATED Data on the population characteristics of COVID-19 cases and deaths are from: * Case interviews * Laboratories * Medical providers
These multiple streams of data are merged, deduplicated, and undergo data verification processes. This data may not be immediately available for recently reported cases because of the time needed to process tests and validate cases. Daily case totals on previous days may increase or decrease. Learn more.
Data are continually updated to maximize completeness of information and reporting on San Francisco residents with COVID-19.
Data notes on each population characteristic type is listed below.
Race/ethnicity * We include all race/ethnicity categories that are collected for COVID-19 cases. * The population estimates for the "Other" or “Multi-racial” groups should be considered with caution. The Census definition is likely not exactly aligned with how the City collects this data. For that reason, we do not recommend calculating population rates for these groups.
Sexual orientation * Sexual orientation data is collected from individuals who are 18 years old or older. These individuals can choose whether to provide this information during case interviews. Learn more about our data collection guidelines. * The City began asking for this information on April 28, 2020.
Gender * The City collects information on gender identity using these guidelines.
Comorbidities * Underlying conditions are reported when a person has one or more underlying health conditions at the time of diagnosis or death.
Transmission type * Information on transmission of COVID-19 is based on case interviews with individuals who have a confirmed positive test. Individuals are asked if they have been in close contact with a known COVID-19 case. If they answer yes, transmission category is recorded as contact with a known case. If they report no contact with a known case, transmission category is recorded as community transmission. If the case is not interviewed or was not asked the question, they are counted as unknown.
Homelessness
Persons are identified as homeless based on several data sources:
* self-reported living situation
* the location at the time of testing
* Department of Public Health homelessness and health databases
* Residents in Single-Room Occupancy hotels are not included in these figures.
These methods serve as an estimate of persons experiencing homelessness. They may not meet other homelessness definitions.
Skilled Nursing Facility (SNF) occupancy * A Skilled Nursing
--- Original source retains full ownership of the source dataset ---
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY Case information on COVID-19 Laboratory testing. This data includes a daily count of test results reported, and how many of those were positive, negative, and indeterminate. Reported tests include tests with a positive, negative or indeterminate result. Indeterminate results, which could not conclusively determine whether COVID-19 virus was present, are not included in the calculation of percent positive. Testing for the novel coronavirus is available through commercial, clinical, and hospital laboratories, as well as the SFDPH Public Health Laboratory.
Tests are de-duplicated by an individual and date. This means that if a person gets tested multiple times on different dates in the last 30 days, all of those individual tests will be included in this data as individual tests (on each specimen collection date).
Total positive test results is not equal to the total number of COVID-19 cases in San Francisco.
B. HOW THE DATASET IS CREATED Laboratory test volume and positivity for COVID-19 is based on electronic laboratory test reports. Deduplication, quality assurance measures and other data verification processes maximize accuracy of laboratory test information.
C. UPDATE PROCESS Updates automatically at 05:00 Pacific Time each day. A redundant run is scheduled at 09:00 in case of pipeline failure.
D. HOW TO USE THIS DATASET Due to the high degree of variation in the time needed to complete tests by different labs there is a delay in this reporting. On March 24 the Health Officer ordered all labs in the City to report complete COVID-19 testing information to the local and state health departments. In order to track trends over time, a data user can analyze this data by "result_date" and see how the count of reported results and positivity rate have changed over time.
E. CHANGE LOG
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY This dataset contains COVID-19 positive confirmed cases aggregated by several different geographic areas and by day. COVID-19 cases are mapped to the residence of the individual and shown on the date the positive test was collected. In addition, 2016-2020 American Community Survey (ACS) population estimates are included to calculate the cumulative rate per 10,000 residents.
Dataset covers cases going back to 3/2/2020 when testing began. This data may not be immediately available for recently reported cases and data will change to reflect as information becomes available. Data updated daily.
Geographic areas summarized are: 1. Analysis Neighborhoods 2. Census Tracts 3. Census Zip Code Tabulation Areas
B. HOW THE DATASET IS CREATED Addresses from the COVID-19 case data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area for a given date.
The 2016-2020 American Community Survey (ACS) population estimates provided by the Census are used to create a cumulative rate which is equal to ([cumulative count up to that date] / [acs_population]) * 10000) representing the number of total cases per 10,000 residents (as of the specified date).
COVID-19 case data undergo quality assurance and other data verification processes and are continually updated to maximize completeness and accuracy of information. This means data may change for previous days as information is updated.
C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset daily at 05:00 Pacific Time.
D. HOW TO USE THIS DATASET San Francisco population estimates for geographic regions can be found in a view based on the San Francisco Population and Demographic Census dataset. These population estimates are from the 2016-2020 5-year American Community Survey (ACS).
This dataset can be used to track the spread of COVID-19 throughout the city, in a variety of geographic areas. Note that the new cases column in the data represents the number of new cases confirmed in a certain area on the specified day, while the cumulative cases column is the cumulative total of cases in a certain area as of the specified date.
Privacy rules in effect To protect privacy, certain rules are in effect: 1. Any area with a cumulative case count less than 10 are dropped for all days the cumulative count was less than 10. These will be null values. 2. Once an area has a cumulative case count of 10 or greater, that area will have a new row of case data every day following. 3. Cases are dropped altogether for areas where acs_population < 1000 4. Deaths data are not included in this dataset for privacy reasons. The low COVID-19 death rate in San Francisco, along with other publicly available information on deaths, means that deaths data by geography and day is too granular and potentially risky. Read more in our privacy guidelines
Rate suppression in effect where counts lower than 20 Rates are not calculated unless the cumulative case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology.
A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are areal representations of routes. Read how the Census develops ZCTAs on their website.
Rows included for Citywide case counts Rows are included for the Citywide case counts and incidence rate every day. These Citywide rows can be used for comparisons. Citywide will capture all cases regardless of address quality. While some cases cannot be mapped to sub-areas like Census Tracts, ongoing data quality efforts result in improved mapping on a rolling bases.
Related dataset See the dataset of the most recent cumulative counts for all geographic areas here: https://data.sfgov.org/COVID-19/COVID-19-Cases-and-Deaths-Summarized-by-Geography/tpyr-dvnc
E. CHANGE LOG
2019 Novel Coronavirus COVID-19 (2019-nCoV) Visual Dashboard and Map:
https://www.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6
Downloadable data:
https://github.com/CSSEGISandData/COVID-19
Additional Information about the Visual Dashboard:
https://systems.jhu.edu/research/public-health/ncov
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Replication material for Jackelyn Hwang & Bina Patel Shrimali (2022) Shared and Crowded Housing in the Bay Area: Where Gentrification and the Housing Crisis Meet COVID-19, Housing Policy Debate, DOI: 10.1080/10511482.2022.2099934
Paper Abstract: Amid the growing affordable housing crisis and widespread gentrification over the last decade, people have been moving less than before and increasingly live in shared and often crowded households across the U.S. Crowded housing has various negative health implications, including stress, sleep disorders, and infectious diseases. Difference-in- difference analysis of a unique, large-scale longitudinal consumer credit database of over 450,000 San Francisco Bay Area residents from 2002 to 2020 shows gentrification affects the probability of residents shifting to crowded households across the socioeconomic spectrum but in different ways than expected. Gentrification is negatively associated with low- socioeconomic status (SES) residents’ probability of entering crowded households, and this is largely explained by increased shifts to crowded households in neighborhoods outside of major cities showing early signs of gentrification. Conversely, gentrification is associated with increases in the probability that middle-SES residents enter crowded households, primarily in Silicon Valley. Lastly, crowding is positively associated with COVID-19 case rates, beyond density and socioeconomic and racial composition in neighborhoods, although the role of gentrification remains unclear. Housing policies that mitigate crowding can serve as early interventions in displacement prevention and reducing health inequities.
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Background Rural U.S. communities are at risk from COVID-19 due to advanced age and limited access to acute care. Recognizing this, the Vashon Medical Reserve Corps (VMRC) in King County, Washington, implemented an all-volunteer, community-based COVID-19 response program. This program integrated public engagement, SARS-CoV-2 testing, contact tracing, vaccination, and material community support, and was associated with the lowest cumulative COVID-19 case rate in King County. This study aimed to investigate the contributions of demographics, geography and public health interventions to Vashon’s low COVID-19 rates. Methods This observational cross-sectional study compares cumulative COVID-19 rates and success of public health interventions from February 2020 through November 2021 for Vashon Island with King County (including metropolitan Seattle) and Whidbey Island, located ~50 km north of Vashon. To evaluate the role of demography, we developed multiple linear regression models of COVID-19 rates using metrics of age, race/ethnicity, wealth and educational attainment across 77 King County zip codes. To investigate the role of remote geography we expanded the regression models to include North, Central and South Whidbey, similarly remote island communities with varying demographic features. To evaluate the effectiveness of VMRC’s community-based public health measures, we directly compared Vashon’s success of vaccination and contact tracing with that of King County and South Whidbey, the Whidbey community most similar to Vashon. Results Vashon’s cumulative COVID-19 case rate was 29% that of King County overall (22.2 vs 76.8 cases/K). A multiple linear regression model based on King County demographics found educational attainment to be a major correlate of COVID-19 rates, and Vashon’s cumulative case rate was just 38% of predicted (p<.05), so demographics alone do not explain Vashon’s low COVID-19 case rate. Inclusion of Whidbey communities in the model identified a major effect of remote geography (-49 cases/K, p<.001), such that observed COVID-19 rates for all remote communities fell within the model’s 95% prediction interval. VMRC’s vaccination effort was highly effective, reaching a vaccination rate of 1500 doses/K four months before South Whidbey and King County and maintaining a cumulative vaccination rate 200 doses/K higher throughout the latter half of 2021 (p<.001). Including vaccination rates in the model reduced the effect of remote geography to -41 cases/K (p<.001). VMRC case investigation was also highly effective, interviewing 96% of referred cases in an average of 1.7 days compared with 69% in 3.7 days for Washington Department of Health investigating South Whidbey cases and 80% in 3.4 days for Public Health–Seattle & King County (both p<0.001). VMRC’s public health interventions were associated with a 30% lower case rate (p<0.001) and 55% lower hospitalization rate (p=0.056) than South Whidbey. Conclusion While the overall magnitude of the pre-Omicron COVID-19 pandemic in rural and urban U.S. communities was similar, we show that island communities in the Puget Sound region were substantially protected from COVID-19 by their geography. We further show that a volunteer community-based COVID-19 response program was highly effective in the Vashon community, augmenting the protective effect of geography. We suggest that Medical Reserve Corps should be an important element of future pandemic planning. Methods The study period extended from the pandemic onset in February 2020 through November 2021. Daily COVID-19 cases, hospitalizations, deaths and test numbers for King County as a whole and by zip code were downloaded from the King County COVID-19 dashboard (Feb 22, 2022 update). Population data for King County and Vashon are from the April 2020 US Census. Zip code level population data are the average of two zip code tabulation area estimates from the WA Office of Financial Management and Cubit (a commercial data vendor providing access to US Census information). The Asset Limited, Income Constrained, and Employed (ALICE) metric, a measure of the working poor, was obtained from United Way.
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Multisystem inflammatory syndrome in children (MIS-C) is a severe, post-infectious sequela of SARS-CoV-2 infection, yet the pathophysiological mechanism connecting the infection to the broad inflammatory syndrome remains unknown. Here we leveraged a large set of MIS-C patient samples (n=199) to identify a distinct set of host proteins that are differentially targeted by patient autoantibodies relative to matched controls. We identified an autoreactive epitope within SNX8, a protein expressed primarily in immune cells that regulates an antiviral pathway associated with MIS-C pathogenesis. In parallel, we also probed the SARS-CoV-2 proteome-wide MIS-C patient antibody response and found it to be differentially reactive to a distinct domain of the SARS-CoV-2 nucleocapsid (N) protein relative to controls. This viral N region and the mapped SNX8 epitope bear remarkable biochemical similarity. Furthermore, we find that many children with anti-SNX8 autoantibodies also have T cells cross-reactive to both SNX8 and this distinct region of the SARS-CoV-2 N protein. Together, these findings suggest that MIS-C patients develop a characteristic immune response against the SARS-CoV-2 N protein that is associated with cross-reactivity to the self-protein SNX8, demonstrating a mechanistic link from the infection to the inflammatory syndrome with implications for better understanding a range of post-infectious autoinflammatory diseases. Methods Patients Patients were recruited through the prospectively enrolling multicenter Overcoming COVID-19 and Taking on COVID-19 Together study in the United States. The study was approved by the central Boston Children’s Hospital Institutional Review Board (IRB) and reviewed by IRBs of participating sites with CDC IRB reliance. A total of 292 patients were enrolled into 1 of the following independent cohorts between June 1, 2020 and September 9, 2021: 223 patients hospitalized with MIS-C (199 in the primary discovery cohort, 24 in a separate subsequent validation cohort), 29 patients hospitalized for COVID-19 in either an intensive care or step-down unit (referred to as severe acute COVID-19 in this study), and 45 outpatients (referred to as “at-risk controls” in this study) post-SARS-CoV-2 infections associated with mild or no symptoms. The demographic and clinical data are summarized in Table I, Extended Data Table 1, and Extended Data Table 2. The 2020 US Centers for Disease Control and Prevention case definition was used to define MIS-C(1). All patients with MIS-C had positive SARS-CoV-2 serology results and/or positive SARS-CoV-2 test results by reverse transcriptase quantitative PCR. All patients with severe COVID-19 or outpatient SARS-CoV-2 infections had a positive antigen test or nucleic acid amplification test for SARS-CoV-2. For outpatients, samples were collected from 36 to 190 days after the positive test (median, 70 days after the positive test; interquartile range, 56-81 days). For use as controls in the SARS-CoV-2 specific PhIP-Seq, plasma from 48 healthy, pre-COVID-19 controls was obtained as deidentified samples from the New York Blood Center. These samples were part of retention tubes collected at the time of blood donations from volunteer donors who provided informed consent for their samples to be used for research. Human proteome PhIP-Seq Human Proteome PhIP-Seq was performed following our previously published vacuum-based PhIP-Seq protocol (2) (https://www.protocols.io/view/scaled-high-throughput-vacuum-phip-protocol-ewov1459kvr2/v1). Our human peptidome library consists of a custom-designed phage library of 731,724 unique T7 bacteriophage each presenting a different 49 amino-acid peptide on its surface. Collectively these peptides tile the entire human proteome including all known isoforms (as of 2016) with 25 amino-acid overlaps. 1 milliliter of phage library was incubated with 1 microliter of human serum overnight at 4C and immunoprecipitated with 25 microliters of 1:1 mixed protein A and protein G magnetic beads (Thermo Fisher, Waltham, MA, #10008D and #10009D). These beads were then washed, and the remaining phage-antibody complexes were eluted in 1 milliliter of E.Coli (BLT5403, EMD Millipore, Burlington, MA) at 0.5-0.7 OD and amplified by growing in 37C incubator. This new phage library was then re-incubated with the same individual’s serum and the previously described protocol was repeated. DNA was then extracted from the final phage library, barcoded, and PCR-amplified, and Illumina adaptors were added. Next-generation sequencing was then performed using an Illumina sequencer (Illumina, San Diego, CA) to a read depth of approximately 1 million per sample. Human proteome PhIP-Seq analysis All human peptidome analysis (except when specifically stated otherwise) was performed at the gene level, in which all reads for all peptides mapping to the same gene were summed, and 0.5 reads were added to each gene to allow the inclusion of genes with zero reads in mathematical analyses. Within each individual sample, reads were normalized by converting to the percentage of total reads. To normalize each sample against background non-specific binding, a fold-change (FC) over mock-IP was calculated by dividing the sample read percentage for each gene by the mean read percentage of the same gene for the AG bead-only controls. This FC signal was then used for side-by-side comparison between samples and cohorts. FC values were also used to calculate z-scores for each MIS-C patient relative to controls and for each control sample by using all remaining controls. These z-scores were used for the logistic regression feature weighting. In instances of peptide-level analysis, raw reads were normalized by calculating the number of reads per 100,000 reads. SARS-CoV-2 proteome PhIP-Seq SARS-CoV-2 Proteome PhIP-Seq was performed as previously described(3). Briefly, 38 amino acid fragments tiling all open reading frames from SARS-CoV-2, SARS-CoV-1, and 7 other CoVs were expressed on T7 bacteriophage with 19 amino acid overlaps. 1 milliliter of phage library was incubated with 1 microliter of human serum overnight at 4C and immunoprecipitated with 25 microliters of 1:1 mixed protein A and protein G magnetic beads (Thermo Fisher, Waltham, MA, #10008D and #10009D). Beads were washed 5 times on a magnetic plate using a P1000 multichannel pipette. The remaining phage-antibody complexes were eluted in 1 milliliter of E.Coli (BLT5403, EMD Millipore, Burlington, MA) at 0.5-0.7 OD and amplified by growing in a 37°C incubator. This new phage library was then re-incubated with the same individual’s serum and the previously described protocol was repeated for a total of 3 rounds of immunoprecipitations. DNA was then extracted from the final phage library, barcoded, and PCR-amplified, and Illumina adaptors were added. Next-generation sequencing was then performed using an Illumina sequencer (Illumina, San Diego, CA) to a read depth of approximately 1 million per sample. Coronavirus proteome PhIP-Seq analysis To account for differing read depths between samples, the total number of reads for each peptide fragment was converted to the number of reads per 100k (RPK). To calculate normalized enrichment relative to pre-COVID controls (FC > Pre-COVID), the RPK for each peptide fragment within each sample was divided by the mean RPK of each peptide fragment among all pre-COVID controls. These FC > Pre-COVID values were used for all subsequent analyses as described in the text and figures.
HAN Archive - 00432 (2021). https://emergency.cdc.gov/han/2020/han00432.asp. S. E. Vazquez, S. A. Mann, A. Bodansky, A. F. Kung, Z. Quandt, E. M. N. Ferré, N. Landegren, D. Eriksson, P. Bastard, S.-Y. Zhang, J. Liu, A. Mitchell, I. Proekt, D. Yu, C. Mandel-Brehm, C.-Y. Wang, B. Miao, G. Sowa, K. Zorn, A. Y. Chan, V. M. Tagi, C. Shimizu, A. Tremoulet, K. Lynch, M. R. Wilson, O. Kämpe, K. Dobbs, O. M. Delmonte, R. Bacchetta, L. D. Notarangelo, J. C. Burns, J.-L. Casanova, M. S. Lionakis, T. R. Torgerson, M. S. Anderson, J. L. DeRisi, Autoantibody discovery across monogenic, acquired, and COVID-19-associated autoimmunity with scalable PhIP-seq. Elife 11 (2022). C. R. Zamecnik, J. V. Rajan, K. A. Yamauchi, S. A. Mann, R. P. Loudermilk, G. M. Sowa, K. C. Zorn, B. D. Alvarenga, C. Gaebler, M. Caskey, M. Stone, P. J. Norris, W. Gu, C. Y. Chiu, D. Ng, J. R. Byrnes, X. X. Zhou, J. A. Wells, D. F. Robbiani, M. C. Nussenzweig, J. L. DeRisi, M. R. Wilson, ReScan, a Multiplex Diagnostic Pipeline, Pans Human Sera for SARS-CoV-2 Antigens. Cell Rep Med 1, 100123 (2020).
Not seeing a result you expected?
Learn how you can add new datasets to our index.
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
A. SUMMARY Medical provider confirmed COVID-19 cases and confirmed COVID-19 related deaths in San Francisco, CA aggregated by Census ZIP Code Tabulation Areas and normalized by 2018 American Community Survey (ACS) 5-year estimates for population data to calculate rate per 10,000 residents.
Cases and deaths are both mapped to the residence of the individual, not to where they were infected or died. For example, if one was infected in San Francisco at work but lives in the East Bay, those are not counted as SF Cases or if one dies in Zuckerberg San Francisco General but is from another county, that is also not counted in this dataset.
Dataset is cumulative and covers cases going back to March 2nd, 2020 when testing began. It is updated daily.
B. HOW THE DATASET IS CREATED Addresses from medical data are geocoded by the San Francisco Department of Public Health (SFDPH). Those addresses are spatially joined to the geographic areas. Counts are generated based on the number of address points that match each geographic area. The 2018 ACS estimates for population provided by the Census are used to create a rate which is equal to ([count] / [acs_population]) * 10000) representing the number of cases per 10,000 residents.
C. UPDATE PROCESS Geographic analysis is scripted by SFDPH staff and synced to this dataset each day.
D. HOW TO USE THIS DATASET Privacy rules in effect To protect privacy, certain rules are in effect: 1. Case counts greater than 0 and less than 10 are dropped - these will be null (blank) values 2. Cases dropped altogether for areas where acs_population < 1000
Rate suppression in effect where counts lower than 20 Rates are not calculated unless the case count is greater than or equal to 20. Rates are generally unstable at small numbers, so we avoid calculating them directly. We advise you to apply the same approach as this is best practice in epidemiology.
A note on Census ZIP Code Tabulation Areas (ZCTAs) ZIP Code Tabulation Areas are special boundaries created by the U.S. Census based on ZIP Codes developed by the USPS. They are not, however, the same thing. ZCTAs are polygonal representations of USPS ZIP Code service area routes. Read how the Census develops ZCTAs on their website.
This dataset is a filtered view of another dataset You can find a full dataset of cases and deaths summarized by this and other geographic areas.
E. CHANGE LOG