14 datasets found
  1. COVID-19 Dataset

    • kaggle.com
    zip
    Updated Nov 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meir Nizri (2022). COVID-19 Dataset [Dataset]. https://www.kaggle.com/datasets/meirnizri/covid19-dataset
    Explore at:
    zip(4890659 bytes)Available download formats
    Dataset updated
    Nov 13, 2022
    Authors
    Meir Nizri
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus. Most people infected with COVID-19 virus will experience mild to moderate respiratory illness and recover without requiring special treatment. Older people, and those with underlying medical problems like cardiovascular disease, diabetes, chronic respiratory disease, and cancer are more likely to develop serious illness. During the entire course of the pandemic, one of the main problems that healthcare providers have faced is the shortage of medical resources and a proper plan to efficiently distribute them. In these tough times, being able to predict what kind of resource an individual might require at the time of being tested positive or even before that will be of immense help to the authorities as they would be able to procure and arrange for the resources necessary to save the life of that patient.

    The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

    content

    The dataset was provided by the Mexican government (link). This dataset contains an enormous number of anonymized patient-related information including pre-conditions. The raw dataset consists of 21 unique features and 1,048,576 unique patients. In the Boolean features, 1 means "yes" and 2 means "no". values as 97 and 99 are missing data.

    • sex: 1 for female and 2 for male.
    • age: of the patient.
    • classification: covid test findings. Values 1-3 mean that the patient was diagnosed with covid in different degrees. 4 or higher means that the patient is not a carrier of covid or that the test is inconclusive.
    • patient type: type of care the patient received in the unit. 1 for returned home and 2 for hospitalization.
    • pneumonia: whether the patient already have air sacs inflammation or not.
    • pregnancy: whether the patient is pregnant or not.
    • diabetes: whether the patient has diabetes or not.
    • copd: Indicates whether the patient has Chronic obstructive pulmonary disease or not.
    • asthma: whether the patient has asthma or not.
    • inmsupr: whether the patient is immunosuppressed or not.
    • hypertension: whether the patient has hypertension or not.
    • cardiovascular: whether the patient has heart or blood vessels related disease.
    • renal chronic: whether the patient has chronic renal disease or not.
    • other disease: whether the patient has other disease or not.
    • obesity: whether the patient is obese or not.
    • tobacco: whether the patient is a tobacco user.
    • usmr: Indicates whether the patient treated medical units of the first, second or third level.
    • medical unit: type of institution of the National Health System that provided the care.
    • intubed: whether the patient was connected to the ventilator.
    • icu: Indicates whether the patient had been admitted to an Intensive Care Unit.
    • date died: If the patient died indicate the date of death, and 9999-99-99 otherwise.
  2. Data_Sheet_1_The effect of diabetes on COVID-19 incidence and mortality:...

    • frontiersin.figshare.com
    • datasetcatalog.nlm.nih.gov
    pdf
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marta Ottone; Letizia Bartolini; Laura Bonvicini; Paolo Giorgi Rossi; Reggio Emilia COVID-19 working group (2023). Data_Sheet_1_The effect of diabetes on COVID-19 incidence and mortality: Differences between highly-developed-country and high-migratory-pressure-country populations.pdf [Dataset]. http://doi.org/10.3389/fpubh.2023.969143.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Marta Ottone; Letizia Bartolini; Laura Bonvicini; Paolo Giorgi Rossi; Reggio Emilia COVID-19 working group
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The objective of this study was to compare the effect of diabetes and pathologies potentially related to diabetes on the risk of infection and death from COVID-19 among people from Highly-Developed-Country (HDC), including Italians, and immigrants from the High-Migratory-Pressure-Countries (HMPC). Among the population with diabetes, whose prevalence is known to be higher among immigrants, we compared the effect of body mass index among HDC and HMPC populations. A population-based cohort study was conducted, using population registries and routinely collected surveillance data. The population was stratified into HDC and HMPC, according to the place of birth; moreover, a focus was set on the South Asiatic population. Analyses restricted to the population with type-2 diabetes were performed. We reported incidence (IRR) and mortality rate ratios (MRR) and hazard ratios (HR) with 95% confidence interval (CI) to estimate the effect of diabetes on SARS-CoV-2 infection and COVID-19 mortality. Overall, IRR of infection and MRR from COVID-19 comparing HMPC with HDC group were 0.84 (95% CI 0.82–0.87) and 0.67 (95% CI 0.46–0.99), respectively. The effect of diabetes on the risk of infection and death from COVID-19 was slightly higher in the HMPC population than in the HDC population (HRs for infection: 1.37 95% CI 1.22–1.53 vs. 1.20 95% CI 1.14–1.25; HRs for mortality: 3.96 95% CI 1.82–8.60 vs. 1.71 95% CI 1.50–1.95, respectively). No substantial difference in the strength of the association was observed between obesity or other comorbidities and SARS-CoV-2 infection. Similarly for COVID-19 mortality, HRs for obesity (HRs: 18.92 95% CI 4.48–79.87 vs. 3.91 95% CI 2.69–5.69) were larger in HMPC than in the HDC population, but differences could be due to chance. Among the population with diabetes, the HMPC group showed similar incidence (IRR: 0.99 95% CI: 0.88–1.12) and mortality (MRR: 0.89 95% CI: 0.49–1.61) to that of HDC individuals. The effect of obesity on incidence was similar in both HDC and HMPC populations (HRs: 1.73 95% CI 1.41–2.11 among HDC vs. 1.41 95% CI 0.63–3.17 among HMPC), although the estimates were very imprecise. Despite a higher prevalence of diabetes and a stronger effect of diabetes on COVID-19 mortality in HMPC than in the HDC population, our cohort did not show an overall excess risk of COVID-19 mortality in immigrants.

  3. Table_1_Diabetes as a risk factor of death in hospitalized COVID-19 patients...

    • frontiersin.figshare.com
    • figshare.com
    docx
    Updated Jul 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Kania; Beata Koń; Konrad Kamiński; Jerzy Hohendorff; Przemysław Witek; Tomasz Klupa; Maciej T. Malecki (2023). Table_1_Diabetes as a risk factor of death in hospitalized COVID-19 patients – an analysis of a National Hospitalization Database from Poland, 2020.docx [Dataset]. http://doi.org/10.3389/fendo.2023.1161637.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jul 3, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    Michal Kania; Beata Koń; Konrad Kamiński; Jerzy Hohendorff; Przemysław Witek; Tomasz Klupa; Maciej T. Malecki
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    IntroductionDiabetes is one of the comorbidities associated with poor prognosis in hospitalized COVID-19 patients. In this nationwide retrospective study, we evaluated the risk of in-hospital death attributed to diabetes.MethodsWe analyzed data from discharge reports of patients hospitalized with COVID-19 in 2020 as submitted to the Polish National Health Fund. Several multivariate logistic regression models were used. In each model, in-hospital death was estimated with explanatory variables. Models were built either on the whole cohorts or cohorts matched with propensity score matching (PSM). The models examined either the main effects of diabetes itself or the interaction of diabetes with other variables.ResultsWe included 174,621 patients with COVID-19 who were hospitalized in the year 2020. Among them, there were 40,168 diabetic patients (DPs), and the proportion of DPs in this group was higher than in the general population (23.0% vs. 9.5%, p

  4. NCHS - Weekly Counts of Deaths by State 2020-2022

    • kaggle.com
    zip
    Updated Nov 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmed Eltom (2025). NCHS - Weekly Counts of Deaths by State 2020-2022 [Dataset]. https://www.kaggle.com/datasets/ahmedeltom/nchs-weekly-counts-of-deaths-by-state-20202022
    Explore at:
    zip(348312 bytes)Available download formats
    Dataset updated
    Nov 18, 2025
    Authors
    Ahmed Eltom
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Cover image reference

    Provisional counts of deaths by the week the deaths occurred, by state of occurrence, and by select underlying causes of death for 2020-2022. The dataset also includes weekly provisional counts of death for COVID-19, coded to ICD-10 code U07.1 as an underlying or multiple cause of death.

    NOTE: death counts are presented with a one week lag.

    This dataset to be updated weekly with the notebook run. The coverage period is between 2020-2022. This can used in conjunction with other datasets to plot the bigger picture. ex. 2014-2018

    The dataset highlights select causes of death. Some prominent causes are not listed in specifics.

    Column NameDescription
    Data As OfDate of analysis
    Jurisdiction of OccurrenceJurisdiction of Occurrence
    MMWR YearMMWR Year
    MMWR WeekMMWR Week
    Week Ending DateWeek Ending Date
    All CauseAll Cause
    Natural CauseNatural Cause (A00-R99, U07)
    Septicemia (A40-A41)Septicemia (A40-A41)
    Malignant neoplasms (C00-C97)Malignant neoplasms (C00-C97)
    Diabetes mellitus (E10-E14)Diabetes mellitus (E10-E14)
    Alzheimer disease (G30)Alzheimer disease (G30)
    Influenza and pneumonia (J09-J18)Influenza and pneumonia (J09-J18)
    Chronic lower respiratory diseases (J40-J47)Chronic lower respiratory diseases (J40-J47)
    Other diseases of respiratory system (J00-J06,J30-J39,J67,J70-J98)Other diseases of respiratory system (J00-J06,J30-J39,J67,J70-J98)
    Nephritis, nephrotic syndrome and nephrosis (N00-N07,N17-N19,N25-N27)Nephritis, nephrotic syndrome and nephrosis (N00-N07,N17-N19,N25-N27)
    Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified (R00-R99)Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified (R00-R99)
    Diseases of heart (I00-I09,I11,I13,I20-I51)Diseases of heart (I00-I09,I11,I13,I20-I51)
    Cerebrovascular diseases (I60-I69)Cerebrovascular diseases (I60-I69)
    COVID-19 (U071, Multiple Cause of Death)COVID-19 (U071, Multiple Cause of Death)
    COVID-19 (U071, Underlying Cause of Death)COVID-19 (U071, Underlying Cause of Death)
    flag_allcauseSuppressed (counts 1-9) for All causes of death
    flag_natcauseSuppressed (counts 1-9) for Natural causes of death
    flag_septSuppressed (counts 1-9) for Septicemia
    flag_neoplSuppressed (counts 1-9) for Malignant eoplasms
    flag_diabSuppressed (counts 1-9) for Diabetes mellitis
    flag_alzSuppressed (counts 1-9) for Alzheimer disease
    flag_inflpnSuppressed (counts 1-9) for Influenza and pneumonia
    flag_clrdSuppressed (counts 1-9) for Chronic lower respiratory diseases
    flag_otherrespSuppressed (counts 1-9) for Other diseases of respiratory system
    flag_nephrSuppressed (counts 1-9) for Nephritis, nephrotic syndrome and nephrosis
    flag_otherunkSuppressed (counts 1-9) for Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified
    flag_hdSuppressed (counts 1-9) for Diseases of heart
    flag_strokeSuppressed (counts 1-9) for Cerebrovascular diseases
    flag_cov19mcodSuppressed (counts 1-9) for COVID-19 (U071, Multiple Cause of Death)
    flag_cov19ucodSuppressed (counts 1-9) for COVID-19 (U071, Underlying Cause of Death)
  5. Leading causes of death, total population, by age group

    • www150.statcan.gc.ca
    • ouvert.canada.ca
    • +1more
    Updated Feb 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Canada, Statistics Canada (2025). Leading causes of death, total population, by age group [Dataset]. http://doi.org/10.25318/1310039401-eng
    Explore at:
    Dataset updated
    Feb 19, 2025
    Dataset provided by
    Statistics Canadahttps://statcan.gc.ca/en
    Area covered
    Canada
    Description

    Rank, number of deaths, percentage of deaths, and age-specific mortality rates for the leading causes of death, by age group and sex, 2000 to most recent year.

  6. f

    S1 Data -

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated May 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kakande, Elijah; Owaraganise, Asiphas; Okiring, Jaffer; Nangendo, Joan; Mwangwa, Florence; Beesiga, Brian; Kabami, Jane; Akatukwasa, Cecilia; Lee, Jordan John; Roh, Michelle E.; Semitala, Fred C.; Kamya, Moses R. (2024). S1 Data - [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001311478
    Explore at:
    Dataset updated
    May 23, 2024
    Authors
    Kakande, Elijah; Owaraganise, Asiphas; Okiring, Jaffer; Nangendo, Joan; Mwangwa, Florence; Beesiga, Brian; Kabami, Jane; Akatukwasa, Cecilia; Lee, Jordan John; Roh, Michelle E.; Semitala, Fred C.; Kamya, Moses R.
    Description

    Chronic diseases such as HIV, hypertension, and diabetes increase the risk of severe coronavirus disease 2019 (COVID-19) and death. Thus, COVID-19 vaccine uptake data among these priority populations are needed to inform immunization programs. We assessed COVID-19 vaccine uptake among people living with HIV (PLWH) and those with hypertension/diabetes without HIV (PWoH) in Southwestern and Southcentral Uganda and determined factors influencing vaccination. We conducted a cross-sectional study from January to April 2023. We enrolled a random sample of participants aged 18 years and older seeking HIV, hypertension, or diabetes care at two regional referral hospitals (RRHs) in Mbarara and Masaka in Uganda. Using vaccination records abstraction and interviewer-administered questionnaires, we collected data on COVID-19 vaccine uptake, sociodemographic data, and reasons for non-uptake in unvaccinated persons. We compared COVID-19 vaccination uptake between PLWH and PWoH and applied modified Poisson regression to determine sociodemographic factors associated with vaccine uptake. The reasons for non-vaccine uptake were presented as percentages. Of the 1,376 enrolled participants, 65.6% were fully vaccinated against COVID-19. Vaccination coverage was 65% among PWLH versus 67% among PWoH. Higher education attainment and older age were associated with COVID vaccination. Participants with secondary education and those aged ≥50 years achieved >70% coverage. Fear of side effects was the most cited reason (67%) for non-vaccination among 330 unvaccinated participants, followed by vaccine mistrust (24.5%). People with chronic diseases in Southwestern Uganda had slightly lower than 70% COVID-19 vaccine coverage as recommended by WHO. Higher educational attainment and older age were linked to increased vaccine uptake. However, mistrust and fear of vaccine side effects were the main reasons for non-vaccination. To increase COVID-19 vaccine uptake, programs must reach those with lower educational attainment and younger age groups, and address the fear of vaccine side effects and mistrust among persons with underlying diseases in Uganda.

  7. Table_1_Risk factors for COVID-19 case fatality rate in people with type 1...

    • frontiersin.figshare.com
    docx
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M. V. Shestakova; O. K. Vikulova; A. R. Elfimova; A. A. Deviatkin; I. I. Dedov; N. G. Mokrysheva (2023). Table_1_Risk factors for COVID-19 case fatality rate in people with type 1 and type 2 diabetes mellitus: A nationwide retrospective cohort study of 235,248 patients in the Russian Federation.docx [Dataset]. http://doi.org/10.3389/fendo.2022.909874.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    Frontiers Mediahttp://www.frontiersin.org/
    Authors
    M. V. Shestakova; O. K. Vikulova; A. R. Elfimova; A. A. Deviatkin; I. I. Dedov; N. G. Mokrysheva
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The aimTo study the association of demographic, clinical, and laboratory factors and the use of glucose-lowering drugs and anti-coronavirus disease (COVID-19) vaccination with the COVID-19-related case fatality rate (CFR) in diabetes mellitus (DM) patients.MethodsThis study is a nationwide observational cohort study based on the data from the National Diabetes Register (NDR) that is the database containing online clinical information about the population with DM. The outcomes (death or recovery) for COVID-19 were registered in 235,248 patients with DM [type 1 diabetes mellitus (T1DM), n = 11,058; type 2 diabetes mellitus (T2DM), n = 224,190] from March 20, 2020, until November 25, 2021. The unadjusted odds ratio (OR) and 95% confidence interval (CI) were used to estimate the risk factors for CFR. Then the ranging of significant factors was performed and the most vulnerable groups of factors for the lethal outcome were chosen.ResultsThe CFR due to COVID-19 was 8.1% in T1DM and 15.3% in T2DM. Increased CFR was associated with the male population [OR = 1.25 (95% CI: 1.09–1.44) in T1DM and 1.18 (95% CI: 1.15–1.21) in T2DM], age ≥65 years [OR = 4.44 (95% CI: 3.75–5.24) in T1DM and 3.18 (95% CI: 3.09–3.26) in T2DM], DM duration ≥10 years [OR = 2.46 (95% CI: 2.06–2.95) in T1DM and 2.11 (95% CI: 2.06–2.16) in T2DM], body mass index (BMI) ≥30 kg/m2 [OR = 1.95 (95% CI: 1.52–2.50)] in T1DM, HbA1c ≥7% [OR = 1.35 (95% CI: 1.29–1.43)] in T2DM. The atherosclerotic cardiovascular disease (ASCVD) and chronic kidney disease (CKD) were associated with higher CFR in T1DM but not in T2DM. The pre-COVID-19 glucose-lowering therapy in T2DM was differently associated with CFR (OR): 0.61 (95% CI: 0.59–0.62) for metformin, 0.59 (95% CI: 0.57–0.61) for dipeptidyl peptidase-4 inhibitors (DPP-4 inhibitors), 0.46 (95% CI: 0.44–0.49) for sodium-glucose co-transporter-2 (SGLT2) inhibitors, 0.38 (95% CI: 0.29–0.51) for glucagon-like peptide-1 receptor agonists (arGLP-1), 1.34 (95% CI: 1.31–1.37) for sulfonylurea (SU), and 1.47 (95% CI: 1.43–1.51) for insulin. Anti-COVID-19 vaccination was associated with a lower fatality risk in both DM types: OR = 0.07 (95% CI: 0.03–0.20) in T1DM and OR = 0.19 (95% CI: 0.17–0.22) in T2DM.ConclusionsThe results of our study suggest that increased COVID-19-related fatality risk in both T1DM and T2DM patients associated with the male population, older age, longer DM duration, and absence of anti-COVID-19 vaccination. In T2DM, pre-COVID-19 glucose-lowering therapy with metformin, DPP-4 inhibitors, SGLT2 inhibitors, and arGLP-1 had a positive effect on the risk of death. The most vulnerable combination of risk factors for lethal outcome in both DM types was vaccine absence + age ≥65 years + DM duration ≥10 years.

  8. Frequency distribution of socio-demographic and bio-clinical characteristics...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Frequency distribution of socio-demographic and bio-clinical characteristics of COVID-19 patients admitted in the ICU. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Frequency distribution of socio-demographic and bio-clinical characteristics of COVID-19 patients admitted in the ICU.

  9. Medication use associated with hazard ratio among COVID-19 patients admitted...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Medication use associated with hazard ratio among COVID-19 patients admitted in ICU. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 19, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Medication use associated with hazard ratio among COVID-19 patients admitted in ICU.

  10. Laboratory results at initial measurements associated with mortality among...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Laboratory results at initial measurements associated with mortality among COVID-19 patients admitted to ICU at Tygerberg Hospital. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 19, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Laboratory results at initial measurements associated with mortality among COVID-19 patients admitted to ICU at Tygerberg Hospital.

  11. Socio-demographic and clinical characteristics associated with mortality...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Socio-demographic and clinical characteristics associated with mortality among COVID-19 patients admitted to ICU. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 20, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Socio-demographic and clinical characteristics associated with mortality among COVID-19 patients admitted to ICU.

  12. f

    Demographic characteristics based on life status.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Mar 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariam Joseph; Qiwei Li; Sunyoung Shin (2025). Demographic characteristics based on life status. [Dataset]. http://doi.org/10.1371/journal.pone.0319585.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Mar 31, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Mariam Joseph; Qiwei Li; Sunyoung Shin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Background The United States has experienced high surge in COVID-19 cases since the dawn of 2020. Identifying the types of diagnoses that pose a risk in leading COVID-19 death casualties will enable our community to obtain a better perspective in identifying the most vulnerable populations and enable these populations to implement better precautionary measures. Objective To identify demographic factors and health diagnosis codes that pose a high or a low risk to COVID-19 death from individual health record data sourced from the United States. Methods We used logistic regression models to analyze the top 500 health diagnosis codes and demographics that have been identified as being associated with COVID-19 death. Results Among 223,286 patients tested positive at least once, 218,831 (98%) patients were alive and 4,455 (2%) patients died during the duration of the study period. Through our logistic regression analysis, four demographic characteristics of patients; age, gender, race and region, were deemed to be associated with COVID-19 mortality. Patients from the West region of the United States: Alaska, Arizona, California, Colorado, Hawaii, Idaho, Montana, Nevada, New Mexico, Oregon, Utah, Washington, and Wyoming had the highest odds ratio of COVID-19 mortality across the United States. In terms of diagnoses, Complications mainly related to pregnancy (Adjusted Odds Ratio, OR:2.95; 95% Confidence Interval, CI:1.4 - 6.23) hold the highest odds ratio in influencing COVID-19 death followed by Other diseases of the respiratory system (OR:2.0; CI:1.84 – 2.18), Renal failure (OR:1.76; CI:1.61 – 1.93), Influenza and pneumonia (OR:1.53; CI:1.41 – 1.67), Other bacterial diseases (OR:1.45; CI:1.31 – 1.61), Coagulation defects, purpura and other hemorrhagic conditions(OR:1.37; CI:1.22 – 1.54), Injuries to the head (OR:1.27; CI:1.1 - 1.46), Mood [affective] disorders (OR:1.24; CI:1.12 – 1.36), Aplastic and other anemias (OR:1.22; CI:1.12 – 1.34), Chronic obstructive pulmonary disease and allied conditions (OR:1.18; CI:1.06 – 1.32), Other forms of heart disease (OR:1.18; CI:1.09 – 1.28), Infections of the skin and subcutaneous tissue (OR: 1.15; CI:1.04 – 1.27), Diabetes mellitus (OR:1.14; CI:1.03 – 1.26), and Other diseases of the urinary system (OR:1.12; CI:1.03 – 1.21). Conclusion We found demographic factors and medical conditions, including some novel ones which are associated with COVID-19 death. These findings can be used for clinical and public awareness and for future research purposes.

  13. Cox proportional hazards model identifying factors associated with the risk...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Cox proportional hazards model identifying factors associated with the risk of hazard ratio among COVID-19 patients admitted to the ICU. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 19, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cox proportional hazards model identifying factors associated with the risk of hazard ratio among COVID-19 patients admitted to the ICU.

  14. Laboratory parameters associated with hazard ratio among COVID-19 patients...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood (2023). Laboratory parameters associated with hazard ratio among COVID-19 patients admitted to ICU. [Dataset]. http://doi.org/10.1371/journal.pone.0279565.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Peter S. Nyasulu; Birhanu T. Ayele; Coenraad F. Koegelenberg; Elvis Irusen; Usha Lalla; Razeen Davids; Yazied Chothia; Francois Retief; Marianne Johnson; Stephen Venter; Renilda Pillay; Hans Prozesky; Jantjie Taljaard; Arifa Parker; Eric H. Decloedt; Portia Jordan; Sa’ad Lahri; M Rafique Moosa; Muhammad Saadiq Moolla; Anteneh Yalew; Nicola Baines; Padi Maud; Elizabeth Louw; Andre Nortje; Rory Dunbar; Lovemore N. Sigwadhi; Veranyuy D. Ngah; Jacques L. Tamuzi; Annalise Zemlin; Zivanai Chapanduka; René English; Brian W. Allwood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Laboratory parameters associated with hazard ratio among COVID-19 patients admitted to ICU.

  15. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Meir Nizri (2022). COVID-19 Dataset [Dataset]. https://www.kaggle.com/datasets/meirnizri/covid19-dataset
Organization logo

COVID-19 Dataset

COVID-19 patient's symptoms, status, and medical history.

Explore at:
28 scholarly articles cite this dataset (View in Google Scholar)
zip(4890659 bytes)Available download formats
Dataset updated
Nov 13, 2022
Authors
Meir Nizri
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus. Most people infected with COVID-19 virus will experience mild to moderate respiratory illness and recover without requiring special treatment. Older people, and those with underlying medical problems like cardiovascular disease, diabetes, chronic respiratory disease, and cancer are more likely to develop serious illness. During the entire course of the pandemic, one of the main problems that healthcare providers have faced is the shortage of medical resources and a proper plan to efficiently distribute them. In these tough times, being able to predict what kind of resource an individual might require at the time of being tested positive or even before that will be of immense help to the authorities as they would be able to procure and arrange for the resources necessary to save the life of that patient.

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

content

The dataset was provided by the Mexican government (link). This dataset contains an enormous number of anonymized patient-related information including pre-conditions. The raw dataset consists of 21 unique features and 1,048,576 unique patients. In the Boolean features, 1 means "yes" and 2 means "no". values as 97 and 99 are missing data.

  • sex: 1 for female and 2 for male.
  • age: of the patient.
  • classification: covid test findings. Values 1-3 mean that the patient was diagnosed with covid in different degrees. 4 or higher means that the patient is not a carrier of covid or that the test is inconclusive.
  • patient type: type of care the patient received in the unit. 1 for returned home and 2 for hospitalization.
  • pneumonia: whether the patient already have air sacs inflammation or not.
  • pregnancy: whether the patient is pregnant or not.
  • diabetes: whether the patient has diabetes or not.
  • copd: Indicates whether the patient has Chronic obstructive pulmonary disease or not.
  • asthma: whether the patient has asthma or not.
  • inmsupr: whether the patient is immunosuppressed or not.
  • hypertension: whether the patient has hypertension or not.
  • cardiovascular: whether the patient has heart or blood vessels related disease.
  • renal chronic: whether the patient has chronic renal disease or not.
  • other disease: whether the patient has other disease or not.
  • obesity: whether the patient is obese or not.
  • tobacco: whether the patient is a tobacco user.
  • usmr: Indicates whether the patient treated medical units of the first, second or third level.
  • medical unit: type of institution of the National Health System that provided the care.
  • intubed: whether the patient was connected to the ventilator.
  • icu: Indicates whether the patient had been admitted to an Intensive Care Unit.
  • date died: If the patient died indicate the date of death, and 9999-99-99 otherwise.
Search
Clear search
Close search
Google apps
Main menu