Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Pre-existing conditions of people who died due to COVID-19, broken down by country, broad age group, and place of death occurrence, usual residents of England and Wales.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Provisional deaths registration data for single year of age and average age of death (median and mean) of persons whose death involved coronavirus (COVID-19), England and Wales. Includes deaths due to COVID-19 and breakdowns by sex.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
NHS UK - COVID-19 Daily Deaths
This section contains information on deaths of patients who have died in hospitals in England and had tested positive for COVID-19 at time of death. All deaths are recorded against the date of death rather than the date the deaths were announced. Interpretation of the figures should take into account the fact that totals by date of death, particularly for most recent days, are likely to be updated in future releases. For example as deaths are confirmed as testing positive for COVID-19, as more post-mortem tests are processed and data from them are validated. Any changes are made clear in the daily files.
These figures do not include deaths outside hospital, such as those in care homes. This approach makes it possible to compile deaths data on a daily basis using up to date figures.
Dataset Content
These figures will be updated at 2pm each day and include confirmed cases reported at 5pm the previous day. Confirmation of COVID-19 diagnosis, death notification and reporting in central figures can take up to several days and the hospitals providing the data are under significant operational pressure. This means that the totals reported at 5pm on each day may not include all deaths that occurred on that day or on recent prior days.
The original dataset is sourced directly from the NHS source site, this original dataset is then cleaned and converted to a csv format available for inclusion into a Kaggle notebook.
There are 3 files considered within the data :- 1. Fatalities_by_age_uk 2.Fatalities_by_region_uk 3.Fatalities_by_trust_uk
Data runs from March 1st up to the current day. Any discrepancies will be outlined. The first is cumulative for any previous days leading up to of relevance. The following days are not cumulative and represent the updated value for the date under consideration.
A start kernel is provided to demonstrate using the dataset.
Citations
This dataset is sourced from the NHS statistical work areas:- https://www.england.nhs.uk/statistics/statistical-work-areas/
This dataset has been sourced and provided to aid in the following competition:- https://www.kaggle.com/c/covid19-global-forecasting-week-4
Facebook
Twitterhttps://www.ons.gov.uk/aboutus/whatwedo/statistics/requestingstatistics/approvedresearcherschemehttps://www.ons.gov.uk/aboutus/whatwedo/statistics/requestingstatistics/approvedresearcherscheme
The Public Health Research Database (PHRD) is a linked asset which currently includes Census 2011 data; Mortality Data; Hospital Episode Statistics (HES); GP Extraction Service (GPES) Data for Pandemic Planning and Research data. Researchers may apply for these datasets individually or any combination of the current 4 datasets.
The purpose of this dataset is to enable analysis of deaths involving COVID-19 by multiple factors such as ethnicity, religion, disability and known comorbidities as well as age, sex, socioeconomic and marital status at subnational levels. 2011 Census data for usual residents of England and Wales, who were not known to have died by 1 January 2020, linked to death registrations for deaths registered between 1 January 2020 and 8 March 2021 on NHS number. The data exclude individuals who entered the UK in the year before the Census took place (due to their high propensity to have left the UK prior to the study period), and those over 100 years of age at the time of the Census, even if their death was not linked. The dataset contains all individuals who died (any cause) during the study period, and a 5% simple random sample of those still alive at the end of the study period. For usual residents of England, the dataset also contains comorbidity flags derived from linked Hospital Episode Statistics data from April 2017 to December 2019 and GP Extraction Service Data from 2015-2019.
Facebook
TwitterA List of UK Health Workers Who Have Died from COVID-19
Made machine-readable by hand from data from the UK newspaper "The Guardian", in this article: "Doctors, nurses, porters, volunteers: the UK health workers who have died from Covid-19" https://www.theguardian.com/world/2020/apr/16/doctors-nurses-porters-volunteers-the-uk-health-workers-who-have-died-from-covid-19
The Guardian is continuing to update the list day-by-day, as the COVID-19 pandemic continues. I do not plan to update this dataset, assuming, since the data collection biases are unknown, that nobody else will find it very interesting. I am not a copyright lawyer and do not know if this data is protected copyright, and if so, in which parts of the world.
Caveat: Creating this dataset from a newspaper article required a lot of hand work. I've done my best, but there may be mistakes.
Columns: Name age institution city: I have filled this in myself; I am ignorant of UK geography and there may well be mistakes date_of_death possible_ppe_issue: mostly blank, but I have filled in "yes" where the article mentions a person who had doubts about the adequacy of PPE (personal protective equipment) MED_SPEC: I have attempted to fill in a medical specialty from the values used on the Eurostat web site for Physicians by Medical Specialty" and "Nursing and caring professionals" tables. The idea is to be able to calculate a fraction of affected individuals by specialty.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The number of deaths registered in England and Wales due to and involving coronavirus (COVID-19). Breakdowns include age, sex, region, local authority, Middle-layer Super Output Area (MSOA), indices of deprivation and place of death. Includes age-specific and age-standardised mortality rates.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
11th January 2020 Change to vaccination data made available by UK gov - now just cumulative number of vaccines delivered are available for both first and second doses. For the devolved nations the cumulative totals are available for the dates from when given, however for the UK as a whole the total doses given is just on the last date of the index, regardless of when those vaccines were given.
4th January 2020 VACCINATION DATA ADDED - New and Cumulative First Dose Vaccination Data added to UK_National_Total_COVID_Dataset.csv and UK_Devolved_Nations_COVID_Dataset.csv
2nd December 2020:
NEW population, land area and population density data added in file NEW_Official_Population_Data_ONS_mid-2019.csv. This data is scraped from the Office for National Statistics and covers the UK, devolved UK nations, regions and local authorities (boroughs).
20th November 2020:
With European governments struggling with a 'second-wave' of rising cases, hospitalisations and deaths resulting from the SARS-CoV-2 virus (COVID-19), I wanted to make a comparative analysis between the data coming out of major European nations since the start of the pandemic.
I started by creating a Sweden COVID-19 dataset and now I'm looking at my own country, the United Kingdom.
The data comes from https://coronavirus.data.gov.uk/ and I used the Developer's Guide to scrape the data, so it was a fairly simple process. The notebook that scapes the data is public and can be found here. Further information about data collection methodologies and definitions can be found here.
The data includes the overall numbers for the UK as a whole, the numbers for each of the devolved UK nations (Eng, Sco, Wal & NI), English Regions and Upper Tier Local Authorities (UTLA) for all of the UK (what we call Boroughs). I have also included a small table with the populations of the 4 devolved UK nations, used to calculate the death rates per 100,000 population.
As I've said for before - I am not an Epidemiologist, Sociologist or even a Data Scientist. I am actually a Mechanical Engineer! The objective here is to improve my data science skills and maybe provide some useful data to the wider community.
Any questions, comments or suggestions are most welcome! I am open to requests and collaborations! Stay Safe!
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Provisional counts of the number of deaths and age-standardised mortality rates involving the coronavirus (COVID-19), by occupational groups, for deaths registered between 9 March and 28 December 2020 in England and Wales. Figures are provided for males and females.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Provisional counts of the number of deaths registered in England and Wales, including deaths involving coronavirus (COVID-19), by local authority, health board and place of death in the latest weeks for which data are available. The occurrence tabs in the 2021 edition of this dataset were updated for the last time on 25 October 2022.
Facebook
TwitterCOVID-19 is a Pandemic which was spread worldwide in the early months of 2020, Which has had a major impact on the United Kingdom. As the UK has recently carried out wide spread vaccination and ended Lockdown I am providing the recent COVID-19 figures.
Several Datasets are provided, focusing on Deaths, Cases, Hospitalisation and Vaccination. Files often protray the same information but from a different reference point. For example for Deaths there is one displaying figures from people who died using there positive date as a reference point, whereas the other is using the date of death.
These datasets was scrapped off the UK Gov website in regards to COVID-19. For those looking to build a more complex project using a constant data flow, they do provide an API which may assist.
Possible area to explore are: What was the Impact of Vaccines on the COVID-19 Pandemic? What was the Impact of a Lockdown on the COVID-19 Pandemic? Which Nation managed the spread of COVID-19 the best?
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
The number of deaths registered in Leicester including deaths involving coronavirus (Covid-19).The data will be updated weekly.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The features in the order shown under “Feature name” are: GDP, inter-state distance based on lat-long coordinates, gender, ethnicity, quality of health care facility, number of homeless people, total infected and death, population density, airport passenger traffic, age group, days for infection and death to peak, number of people tested for COVID-19, days elapsed between first reported infection and the imposition of lockdown measures at a given state.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Why did I create this dataset? This is my first time creating a notebook in Kaggle and I am interested in learning more about COVID-19 and how different countries are affected by it and why. It might be useful to compare different metrics between different countries. And I also wanted to participate in a challenge, and I've decided to join the COVID-19 datasets challenge. While looking through the projects, I noticed https://www.kaggle.com/koryto/countryinfo and it inspired me to start this project.
My approach is to scour the Internet and Kaggle looking for country data that can potentially have an impact on how the COVID-19 pandemic spreads. In the end, I ended up with the following for each country:
See covid19_data - data_sources.csv for data source details.
Notebook: https://www.kaggle.com/bitsnpieces/covid19-data
Since I did not personally collect each datapoint, and because each datasource is different with different objectives, collected at different times, measured in different ways, any inferences from this dataset will need further investigation.
I want to acknowledge the authors of the datasets that made their data publicly available which has made this project possible. Banner image is by Brian.
I hope that the community finds this dataset useful. Feel free to recommend other datasets that you think will be useful / relevant! Thanks for looking.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Age-standardised mortality rates for deaths involving coronavirus (COVID-19), non-COVID-19 deaths and all deaths by vaccination status, broken down by age group.
Facebook
Twitterhttps://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
OMOP dataset: Hospital COVID patients: severity, acuity, therapies, outcomes Dataset number 2.0
Coronavirus disease 2019 (COVID-19) was identified in January 2020. Currently, there have been more than 6 million cases & more than 1.5 million deaths worldwide. Some individuals experience severe manifestations of infection, including viral pneumonia, adult respiratory distress syndrome (ARDS) & death. There is a pressing need for tools to stratify patients, to identify those at greatest risk. Acuity scores are composite scores which help identify patients who are more unwell to support & prioritise clinical care. There are no validated acuity scores for COVID-19 & it is unclear whether standard tools are accurate enough to provide this support. This secondary care COVID OMOP dataset contains granular demographic, morbidity, serial acuity and outcome data to inform risk prediction tools in COVID-19.
PIONEER geography The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix. There is a higher than average percentage of minority ethnic groups. WM has a large number of elderly residents but is the youngest population in the UK. Each day >100,000 people are treated in hospital, see their GP or are cared for by the NHS. The West Midlands was one of the hardest hit regions for COVID admissions in both wave 1 & 2.
EHR. University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & 100 ITU beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”. UHB has cared for >5000 COVID admissions to date. This is a subset of data in OMOP format.
Scope: All COVID swab confirmed hospitalised patients to UHB from January – August 2020. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes. Serial, structured data pertaining to care process (timings, staff grades, specialty review, wards), presenting complaint, acuity, all physiology readings (pulse, blood pressure, respiratory rate, oxygen saturations), all blood results, microbiology, all prescribed & administered treatments (fluids, antibiotics, inotropes, vasopressors, organ support), all outcomes.
Available supplementary data: Health data preceding & following admission event. Matched “non-COVID” controls; ambulance, 111, 999 data, synthetic data. Further OMOP data available as an additional service.
Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
A new coronavirus designated 2019-nCoV was first identified in Wuhan, the capital of China's Hubei province. People developed pneumonia without a clear cause and for which existing vaccines or treatments were not effective. The virus has shown evidence of human-to-human transmission. You can use this data for Analysis
City Province Country LastUpdate keyID Confirmed Deaths
https://rapidapi.com/KishCom/api/covid-19-coronavirus-statistics
https://c.files.bbci.co.uk/D505/production/_115033545_gettyimages-1226314512.jpg
You can use this dataset for analyzing the Covid19 cases in different countries...
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the strain of coronavirus that causes coronavirus disease 2019 (COVID-19), the respiratory illness responsible for the COVID-19 pandemic.
Since its first identification in December 2019 in Wuhan, China, this virus has taken the world by storm. Some people prefer to look at the positive side of things and how this pandemic has brought forward several positive changes. However, the collateral damages produced by this pandemic cannot be overlooked. From the Economic impact to Mental Health impacts, this pandemic period will arguably be one of the hardest periods we'll encounter in our lives. That being said, we always have to arm ourselves with hope. With the new advancements in the vaccine studies, let's hope to wake up from this nightmare as soon as possible.
“Hope is being able to see that there is light despite all of the darkness.” – Desmond Tutu
As for the reason for me building this dataset, it's because I couldn't get my hands on an easily digestible and up-to-date dataset of Covid-19, so, I decided to build my own using Python and web scraping techniques. I will also update this dataset as frequently as possible!
This data was scraped from woldometers.info on 2022-05-14 by Joseph Assaker.
225 countries are represented in this data.
All of countries have records dating from 2020-2-15 until 2022-05-14 (820 days per country). That's with the exception of China, which has records dating from 2020-1-22 until 2022-05-14 (844 days per country), and Palau which has records dating from 2021-8-25 until 2022-05-14 (263 days per country)..
As previously mentioned, all the data present in this dataset is scraped from worldometers.info.
Going through this data, Kagglers can visualize various trends in their own country, or compare several countries. One can also combine this dataset with other news and key points in time (lockdowns, new UK mutation, Holidays, etc.) in order to study the effects of these events on the progression of Covid-19 in a multitude of countries. Implementing time series analysis on this dataset would also be an amazing idea! Getting a deep learning algorithm to learn from this sea of data and try to predict the future turn of events could be quite interesting!
Facebook
Twitterhttps://bhfdatasciencecentre.org/areas/cvd-covid-uk-covid-impact/https://bhfdatasciencecentre.org/areas/cvd-covid-uk-covid-impact/
CVD-COVID-UK, co-ordinated by the British Heart Foundation (BHF) Data Science Centre (https://bhfdatasciencecentre.org/), is one of the NIHR-BHF Cardiovascular Partnership’s National Flagship Projects.
CVD-COVID-UK aims to understand the relationship between COVID-19 and cardiovascular diseases through analyses of de-identified, pseudonymised, linked, nationally collated health datasets across the four nations of the UK. The consortium has over 400 members across more than 50 institutions including data custodians, data scientists and clinicians, all of whom have signed up to an agreed set of principles with an inclusive, open and transparent ethos.
Approved researchers access data within secure trusted/secure research environments (TREs/SDEs) provided by NHS England (England), the National Safe Haven (Scotland), the Secure Anonymised Information Linkage (SAIL) Databank (Wales) and the Honest Broker Service (Northern Ireland). A dashboard of datasets available in each nation’s TRE can be found here: https://bhfdatasciencecentre.org/areas/cvd-covid-uk-covid-impact/
This dataset represents the linked datasets in SAIL Databank’s TRE for Wales and contains the following datasets: • Welsh Longitudinal GP Dataset - Welsh Primary Care (Daily COVID codes only) (GPCD) • Welsh Longitudinal General Practice Dataset (WLGP) - Welsh Primary Care • Critical Care Dataset (CCDS) • Emergency Department Dataset Daily (EDDD) • Emergency Department Dataset (EDDS) • Outpatient Database for Wales (OPDW) • Outpatient Referral (OPRD) • Patient Episode Dataset for Wales (PEDW) • COVID-19 Test Results (PATD) • COVID-19 Test Trace and Protect (CTTP) - Legacy • COVID-19 Shielded People List (CVSP) • SARS-CoV-2 viral sequencing data (COG-UK data)-Lineage/Variant Data-Wales (CVSD) • Covid Vaccination Dataset (CVVD) • Annual District Death Daily (ADDD) • Annual District Death Extract (ADDE) • COVID-19 Consolidated Deaths (CDDS) • Intensive Care National Audit and Research Centre (ICCD) - Legacy - COVID only • Intensive Care National Audit and Research Centre (ICNC) • Welsh Dispensing Dataset (WDDS) - Legacy • Annual District Birth Extract (ADBE) • Maternity Indicators Dataset (MIDS) • National Community Child Health Database (NCCHD) • Care Home Dataset (CARE) • Congenital Anomaly Register and Information Service (CARS) • Referral to Treatment Times (RTTD) • SAIL Dementia e-Cohort (SDEC) • Welsh Ambulance Services NHS Trust (WASD) • Welsh Demographic Service Dataset (WDSD) • Welsh Results Reports Service (WRRS) • ONS 2011 Census Wales (CENW)
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Provisional counts of the number of deaths registered in England and Wales, by age, sex, region and Index of Multiple Deprivation (IMD), in the latest weeks for which data are available.
Facebook
Twitterhttps://www.pioneerdatahub.co.uk/data/data-request-process/https://www.pioneerdatahub.co.uk/data/data-request-process/
PIONEER: Deeply-phenotyped hospital COVID patients: severity, acuity, therapies, outcomes Dataset number 4.0
Coronavirus disease 2019 (COVID-19) was identified in January 2020. Currently, there have been more than 6 million cases& more than 1.5 million deaths worldwide. Some individuals experience severe manifestations of infection, including viral pneumonia, adult respiratory distress syndrome (ARDS)& death. There is a pressing need for tools to stratify patients, to identify those at greatest risk. Acuity scores are composite scores which help identify patients who are more unwell to support & prioritise clinical care. There are no validated acuity scores for COVID-19 & it is unclear whether standard tools are accurate enough to provide this support. This secondary care COVID dataset contains granular demographic, morbidity, serial acuity and outcome data to inform risk prediction tools in COVID-19.
PIONEER geography The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix. There is a higher than average percentage of minority ethnic groups. WM has a large number of elderly residents but is the youngest population in the UK. Each day >100,000 people are treated in hospital, see their GP or are cared for by the NHS. The West Midlands was one of the hardest hit regions for COVID admissions in both wave 1 & 2.
EHR. University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & 100 ITU beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”. UHB has cared for >5000 COVID admissions to date.
Scope: All COVID swab confirmed hospitalised patients to UHB from January – May 2020. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes but also primary care records& clinic letters. Serial, structured data pertaining to care process (timings, staff grades, specialty review, wards), presenting complaint, acuity, all physiology readings (pulse, blood pressure, respiratory rate, oxygen saturations), all blood results, microbiology, all prescribed & administered treatments (fluids, antibiotics, inotropes, vasopressors, organ support), all outcomes. Linked images available (radiographs, CT, MRI, ultrasound).
Available supplementary data: Health data preceding & following admission event. Matched “non-COVID” controls; ambulance, 111, 999 data, synthetic data.
Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Pre-existing conditions of people who died due to COVID-19, broken down by country, broad age group, and place of death occurrence, usual residents of England and Wales.