Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States recorded 16306656 Coronavirus Recovered since the epidemic began, according to the World Health Organization (WHO). In addition, United States reported 797346 Coronavirus Deaths. This dataset includes a chart with historical data for the United States Coronavirus Recovered.
Facebook
TwitterAs of May 2, 2023, there were roughly 687 million global cases of COVID-19. Around 660 million people had recovered from the disease, while there had been almost 6.87 million deaths. The United States, India, and Brazil have been among the countries hardest hit by the pandemic.
The various types of human coronavirus The SARS-CoV-2 virus is the seventh known coronavirus to infect humans. Its emergence makes it the third in recent years to cause widespread infectious disease following the viruses responsible for SARS and MERS. A continual problem is that viruses naturally mutate as they attempt to survive. Notable new variants of SARS-CoV-2 were first identified in the UK, South Africa, and Brazil. Variants are of particular interest because they are associated with increased transmission.
Vaccination campaigns Common human coronaviruses typically cause mild symptoms such as a cough or a cold, but the novel coronavirus SARS-CoV-2 has led to more severe respiratory illnesses and deaths worldwide. Several COVID-19 vaccines have now been approved and are being used around the world.
Facebook
TwitterAs of May 2, 2023, the coronavirus disease (COVID-19) had been confirmed in almost every country and territory around the world. There had been roughly 687 million cases and 6.86 million deaths.
Vaccine approval in the United States The United States has recorded more coronavirus infections and deaths than any other country in the world. The regulatory agency in the country authorized three COVID-19 vaccines for emergency use. Both the Pfizer-BioNTech and Moderna vaccines were approved in December 2020, while the Johnson & Johnson vaccine was approved in February 2021. As of April 26, 2023, the number of COVID-19 vaccine doses administered in the U.S. had reached 675 million.
The difference between vaccines and antivirals Medications can help with the symptoms of viruses, but it is the role of the immune system to take care of them over time. However, the use of vaccines and antivirals can help the immune system in doing its job. The most tried and tested vaccine method is to inject an inactive or weakened form of a virus, encouraging the immune system to produce protective antibodies. The immune system keeps the virus in its memory, and if the real one appears, the body will recognize it and attack it more efficiently. Antivirals are designed to help target viruses, limiting their ability to reproduce and spread to other cells. They are used by patients who are already infected by a virus and can make the infection less severe.
Facebook
TwitterAttribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Overview
The COVID-19 Patient Recovery Dataset is a synthetic collection of anonymized records for around 70,000 COVID-19 patients. It aims to assist with classification tasks in machine learning and epidemiological research. The dataset includes detailed clinical and demographic information, such as symptoms, existing health issues, vaccination status, COVID-19 variants, treatment details, and outcomes related to recovery or mortality. This dataset is great for predicting patient recovery (recovered), mortality (death), disease severity (severity), or the need for intensive care (icu_admission) using algorithms like Logistic Regression, Random Forest, XGBoost, or Neural Networks. It also allows for exploratory data analysis (EDA), statistical modeling, and time-series studies to find patterns in COVID-19 outcomes.
The data is synthetic and reflects realistic trends found in public health data, based on sources like WHO reports. It ensures privacy and follows ethical guidelines. Dates are provided in Excel serial format, meaning 44447 corresponds to September 8, 2021, and can be converted to standard dates using Python’s datetime or Excel. With 70,000 records and 28 columns, this dataset serves as a valuable resource for data scientists, researchers, and students interested in health-related machine learning or pandemic trends.
Data Source and Collection
Source: Synthetic data based on public health patterns from sources like the World Health Organization (WHO). It includes placeholder URLs.
Collection Period: Simulated from early 2020 to mid-2022, covering the Alpha, Delta, and Omicron waves.
Number of Records: 70,000.
File Format: CSV, which works with Pandas, R, Excel, and more.
Data Quality Notes:
About 5% of the values are missing in fields like symptoms_2, symptoms_3, treatment_given_2, and date.
There are rare inconsistencies, such as between recovery/death flags and dates, which may need some preprocessing.
Unique, anonymized patient IDs.
| Column Name | Data Type |
|---|---|
| patient_id | String |
| country | String |
| region/state | String |
| date_reported | Integer |
| age | Integer |
| gender | String |
| comorbidities | String |
| symptoms_1 | String |
| symptoms_2 | String |
| symptoms_3 | String |
| severity | String |
| hospitalized | Integer |
| icu_admission | Integer |
| ventilator_support | Integer |
| vaccination_status | String |
| variant | String |
| treatment_given_1 | String |
| treatment_given_2 | String |
| days_to_recovery | Integer |
| recovered | Integer |
| death | Integer |
| date_of_recovery | Integer |
| date_of_death | Integer |
| tests_conducted | Integer |
| test_type | String |
| hospital_name | String |
| doctor_assigned | String |
| source_url | String |
Key Column Details
patient_id: Unique identifier (e.g., P000001).
country: Reporting country (e.g., India, USA, Brazil, Germany, China, Pakistan, South Africa, UK).
region/state: Sub-national region (e.g., Sindh, California, São Paulo, Beijing).
date_reported, date_of_recovery, date_of_death: Excel serial dates (convert using datetime(1899,12,30) + timedelta(days=value)).
age: Patient age (1–100 years).
gender: Male or Female.
comorbidities: Pre-existing conditions (e.g., Diabetes, Hypertension, Cancer, Heart Disease, Asthma, None).
symptoms_1, symptoms_2, symptoms_3: Reported symptoms (e.g., Cough, Fever, Fatigue, Loss of Smell, Sore Throat, or empty).
severity: Case severity (Mild, Moderate, Severe, Critical).
hospitalized, icu_admission, ventilator_support: Binary (1 = Yes, 0 = No).
vaccination_status: None, Partial, Full, or Booster.
variant: COVID-19 variant (Omicron, Delta, Alpha).
treatment_given_1, treatment_given_2: Treatments administered (e.g., Antibiotics, Remdesivir, Oxygen, Steroids, Paracetamol, or empty).
days_to_recovery: Days from report to recovery (5–30, or empty if not recovered).
recovered, death: Binary outcomes (1 = Yes, 0 = No; generally mutually exclusive).
tests_conducted: Number of tests (1–5).
test_type: PCR or Antigen.
hospital_name: Fictional hospital (e.g., Aga Khan, Mayo Clinic, NHS Trust).
doctor_assigned: Fictional doctor name (e.g., Dr. Smith, Dr. Müller).
source_url: Placeholder.
Summary Statistics
Total Patients: 70,000.
Age: Mean ~50 years, Min 1, Max 100, evenly distributed.
Gender: ~50% Male, ~50% Female.
Top Countries: USA (20%), India (18%), Brazil (15%), China (12%), Germany (10%).
Comorbidities: Diabetes (25%), Hypertension (20%), Cancer (15%), Heart Disease (15%), Asthma (10%), None (15%).
Severity: Mild (60%), Moderate (25%), Severe (10%), Critical (5%).
Recovery Rate: ~60% recovered (recovered=1), ~30% deceased (death=1), ~10% unresolved (both 0).
Vaccination: None (40%), Full (30%), Partial (15%), Booster (15%).
Variants: Omicron (50%), Delt...
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This raised concern because when a virus is new, we do not know how it affects people.
So daily level information on the affected people can give some interesting insights when it is made available to the broader data science community.
Johns Hopkins University has made an excellent dashboard using the affected cases data. Data is extracted from the google sheets associated and made available here.
Now data is available as csv files in the Johns Hopkins Github repository. Please refer to the github repository for the Terms of Use details. Uploading it here for using it in Kaggle kernels and getting insights from the broader DS community.
2019 Novel Coronavirus (2019-nCoV) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China. Early on, many of the patients in the outbreak in Wuhan, China reportedly had some link to a large seafood and animal market, suggesting animal-to-person spread. However, a growing number of patients reportedly have not had exposure to animal markets, indicating person-to-person spread is occurring. At this time, it’s unclear how easily or sustainably this virus is spreading between people - CDC
This dataset has daily level information on the number of affected cases, deaths and recovery from 2019 novel coronavirus. Please note that this is a time series data and so the number of cases on any given day is the cumulative number.
The data is available from 22 Jan, 2020.
Here’s a polished version suitable for a professional Kaggle dataset description:
This dataset contains time-series and case-level records of the COVID-19 pandemic. The primary file is covid_19_data.csv, with supporting files for earlier records and individual-level line list data.
This is the primary dataset and contains aggregated COVID-19 statistics by location and date.
This file contains earlier COVID-19 records. It is no longer updated and is provided only for historical reference. For current analysis, please use covid_19_data.csv.
This file provides individual-level case information, obtained from an open data source. It includes patient demographics, travel history, and case outcomes.
Another individual-level case dataset, also obtained from public sources, with detailed patient-level information useful for micro-level epidemiological analysis.
✅ Use covid_19_data.csv for up-to-date aggregated global trends.
✅ Use the line list datasets for detailed, individual-level case analysis.
If you are interested in knowing country level data, please refer to the following Kaggle datasets:
India - https://www.kaggle.com/sudalairajkumar/covid19-in-india
South Korea - https://www.kaggle.com/kimjihoo/coronavirusdataset
Italy - https://www.kaggle.com/sudalairajkumar/covid19-in-italy
Brazil - https://www.kaggle.com/unanimad/corona-virus-brazil
USA - https://www.kaggle.com/sudalairajkumar/covid19-in-usa
Switzerland - https://www.kaggle.com/daenuprobst/covid19-cases-switzerland
Indonesia - https://www.kaggle.com/ardisragen/indonesia-coronavirus-cases
Johns Hopkins University for making the data available for educational and academic research purposes
MoBS lab - https://www.mobs-lab.org/2019ncov.html
World Health Organization (WHO): https://www.who.int/
DXY.cn. Pneumonia. 2020. http://3g.dxy.cn/newh5/view/pneumonia.
BNO News: https://bnonews.com/index.php/2020/02/the-latest-coronavirus-cases/
National Health Commission of the People’s Republic of China (NHC): http://www.nhc.gov.cn/xcs/yqtb/list_gzbd.shtml
China CDC (CCDC): http://weekly.chinacdc.cn/news/TrackingtheEpidemic.htm
Hong Kong Department of Health: https://www.chp.gov.hk/en/features/102465.html
Macau Government: https://www.ssm.gov.mo/portal/
Taiwan CDC: https://sites.google....
Facebook
TwitterBy Liz Friedman [source]
Welcome to the Opportunity Insights Economic Tracker! Our goal is to provide a comprehensive, real-time look into how COVID-19 and stabilization policies are affecting the US economy. To do this, we have compiled a wide array of data points on spending and employment, gathered from several sources.
This dataset includes daily/weekly/monthly information at the state/county/city level for eight types of data: Google Mobility; Low-Income Employment and Earnings; UI Claims; Womply Merchants and Revenue; as well as weekly Math Learning from Zearn. Additionally, three files- Accounting for Geoids-State/County/City provide crosswalks between geographic areas that can be merged with other files having shared geographical levels.
Our goal here is to enable data users around the world to follow economic conditions in the US during this tumultuous period with maximum clarity and precision. We make all our datasets freely available so if you use them we kindly ask you attribute our work by linking or citing both our accompanying paper as well as this Economic Tracker at https://tracktherecoveryorg By doing so you are also agreeing to uphold our privacy & integrity standards which commit us both to individual & business confidentiality without compromising on independent nonpartisan research & policy analysis!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides US COVID-19 case and death data, as well as Google Community Mobility Reports, on the state/county level. Here is how to use this dataset:
- Understand the file structure: This dataset consists of three main files: 1) US Cases & Deaths by State/County, 2) Google Community Mobility Reports, and 3) Data from third-parties providing small business openings & revenue information and unemployment insurance claim data (Low Inc Earnings & Employment, UI Claims and Womply Merchants & Revenue).
- Select your Subset: If you are interested in particular types of data (e.g., mobility or employment), select the corresponding files from within each section based on your geographic area of interest – national, state or county level – as indicated in each filename.
- Review metadata variables: Become familiar with the provided variables so that you can select which ones you need to explore further in your analysis. For example, if analyzing mobility trends at a city level look for columns such as ‘Retailer_and_recreation_percent_change’ or ‘Transit Stations Percent Change’; if focusing on employment decline look for columns such pay or emp figures that align with industries of interest to you such as low-income earners (emp_{inclow},pay_{inclow}).
- Unify dateformatting across row values : Convert date formats into one common unit so that all entries have consistent formatting if necessary; for exampe some entries may display dates using YYYY/MM/DD notation while others may use MM//DD//YY format depending on their source datasets; make sure to review column labels carefully before converting units where needed..
Merge datasets where applicable : Utilize GeoID crosswalks to combine multiple sets with same geographical coverageregionally covering ; example might be combining low income earnings figures with specific county settings by reference geo codes found in related documents like GeoIDs-County .
6 . Visualise Data : Now that all the different measures have been reviewed can begin generating charts visualize findings . This process may include cleaning up raw figures normalizing across currency formats , mapping geospatial locations others ; once ready create bar graphs line charts maps other visual according aggregate output desired Insightful representations at this stage will help inform concrete policy decisions during outbreak recovery period..Remember to cite
- Estimating the Impact of the COVID-19 Pandemic on Small Businesses - By comparing county-level Womply revenue and employment data with pre-COVID data, policymakers can gain an understanding of the economic impact that COVID has had on local small businesses.
- Analyzing Effects of Mobility Restrictions - The Google Mobility data provides insight into geographic areas where...
Facebook
TwitterThe COVID-19 Vulnerability and Recovery Index uses Tract and ZIP Code-level data* to identify California communities most in need of immediate and long-term pandemic and economic relief. Specifically, the Index is comprised of three components — Risk, Severity, and Recovery Need with the last scoring the ability to recover from the health, economic, and social costs of the pandemic. Communities with higher Index scores face a higher risk of COVID-19 infection and death and a longer uphill economic recovery. Conversely, those with lower scores are less vulnerable.
The Index includes one overarching Index score as well as a score for each of the individual components. Each component includes a set of indicators we found to be associated with COVID-19 risk, severity, or recovery in our review of existing indices and independent analysis. The Risk component includes indicators related to the risk of COVID-19 infection. The Severity component includes indicators designed to measure the risk of severe illness or death from COVID-19. The Recovery Need component includes indicators that measure community needs related to economic and social recovery. The overarching Index score is designed to show level of need from Highest to Lowest with ZIP Codes in the Highest or High need categories, or top 20th or 40th percentiles of the Index, having the greatest need for support.
The Index was originally developed as a statewide tool but has been adapted to LA County for the purposes of the Board motion. To distinguish between the LA County Index and the original Statewide Index, we refer to the revised Index for LA County as the LA County ARPA Index.
*Zip Code data has been crosswalked to Census Tract using HUD methodology
Indicators within each component of the LA County ARPA Index are:Risk: Individuals without U.S. citizenship; Population Below 200% of the Federal Poverty Level (FPL); Overcrowded Housing Units; Essential Workers Severity: Asthma Hospitalizations (per 10,000); Population Below 200% FPL; Seniors 75 and over in Poverty; Uninsured Population; Heart Disease Hospitalizations (per 10,000); Diabetes Hospitalizations (per 10,000)Recovery Need: Single-Parent Households; Gun Injuries (per 10,000); Population Below 200% FPL; Essential Workers; Unemployment; Uninsured PopulationData are sourced from US Census American Communities Survey (ACS) and the OSHPD Patient Discharge Database. For ACS indicators, the tables and variables used are as follows:
Indicator
ACS Table/Years
Numerator
Denominator
Non-US Citizen
B05001, 2019-2023
b05001_006e
b05001_001e
Below 200% FPL
S1701, 2019-2023
s1701_c01_042e
s1701_c01_001e
Overcrowded Housing Units
B25014, 2019-2023
b25014_006e + b25014_007e + b25014_012e + b25014_013e
b25014_001e
Essential Workers
S2401, 2019-2023
s2401_c01_005e + s2401_c01_011e + s2401_c01_013e + s2401_c01_015e + s2401_c01_019e + s2401_c01_020e + s2401_c01_023e + s2401_c01_024e + s2401_c01_029e + s2401_c01_033e
s2401_c01_001
Seniors 75+ in Poverty
B17020, 2019-2023
b17020_008e + b17020_009e
b17020_008e + b17020_009e + b17020_016e + b17020_017e
Uninsured
S2701, 2019-2023
s2701_c05_001e
NA, rate published in source table
Single-Parent Households
S1101, 2019-2023
s1101_c03_005e + s1101_c04_005e
s1101_c01_001e
Unemployment
S2301, 2019-2023
s2301_c04_001e
NA, rate published in source table
The remaining indicators are based data requested and received by Advancement Project CA from the OSHPD Patient Discharge database. Data are based on records aggregated at the ZIP Code level:
Indicator
Years
Definition
Denominator
Asthma Hospitalizations
2017-2019
All ICD 10 codes under J45 (under Principal Diagnosis)
American Community Survey, 2015-2019, 5-Year Estimates, Table DP05
Gun Injuries
2017-2019
Principal/Other External Cause Code "Gun Injury" with a Disposition not "Died/Expired". ICD 10 Code Y38.4 and all codes under X94, W32, W33, W34, X72, X73, X74, X93, X95, Y22, Y23, Y35 [All listed codes with 7th digit "A" for initial encounter]
American Community Survey, 2015-2019, 5-Year Estimates, Table DP05
Heart Disease Hospitalizations
2017-2019
ICD 10 Code I46.2 and all ICD 10 codes under I21, I22, I24, I25, I42, I50 (under Principal Diagnosis)
American Community Survey, 2015-2019, 5-Year Estimates, Table DP05
Diabetes (Type 2) Hospitalizations
2017-2019
All ICD 10 codes under E11 (under Principal Diagnosis)
American Community Survey, 2015-2019, 5-Year Estimates, Table DP05
For more information about this dataset, please contact egis@isd.lacounty.gov.
Facebook
TwitterOf the *** major companies in the United States and Europe that saw a fall in sales between 2019 and 2020, ** percent had reached revenue levels in 2021 equal to or higher than those recorded in 2019. However, there are significant differences between industries; Whereas companies within healthcare and consumer goods had reached more than 100 percent of the revenue recorded in 2019, less than ** percent of the companies within hospitality and leisure had achieved the same.
Facebook
TwitterAccording to WHO Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus. Most people infected with the COVID-19 virus will experience mild to moderate respiratory illness and recover without requiring special treatment. Older people and those with underlying medical problems like cardiovascular disease, diabetes, chronic respiratory disease, and cancer are more likely to develop serious illnesses.
Johns Hopkins University has made an excellent dashboard for tracking the spread of COVID-19. Data is extracted from the Johns Hopkins Github repository associated and made available here.
This dataset has daily level information on the number of confirmed cases, deaths and recovery cases from 2019 novel coronavirus. Please note that this is a time series data and so the number of cases on any given day is the cumulative number. The data is available from 22 Jan, 2020 and updated regularly. Github repository of this clean dataset is here
Filename is covid-19_cleaned_data.csv(updated) - Province/State- Province/State of the observations - Country/Region-Country of observations - Date- Last update - Confirmed - Cumulative number of confirmed cases till that date - Recovered - Cumulative number of recovered till that date - Deaths- Cumulative number of deaths till that date - Lat and Long - Coordinates
Some insights could be 1. Mortality rate over time 2. Exponential growth 3. Changes in the number of affected cases over time 4. The latest number of affected cases
Facebook
TwitterThis dataset is daily time series data of all the COVID 19 confirmed, recovered and death cases across different nations in the world. It consists of columns:
Date <- From 22/01/2020 to the current date Country/Region <- Countries affected by Covid-19 Province/State <- Specific states in the Country Lat <- Latitude of the Country Long <- Longitude of the Country Confirmed <- Total Number of Confirmed Cases per day Recovered <- Total Number of Recovery Cases per day Deaths <- Total Number of Death Cases per day
Original Data Source <- https://raw.githubusercontent.com/datasets/covid-19/master/data/time-series-19-covid-combined.csv
Facebook
TwitterThis is a Covid 19 data set for India. The data set is updated frequently and is analysed using tableau. Click on the link to visit the tableau story. Click each of the caption in the story to unveil its content.
https://public.tableau.com/profile/ambili.nair#!/vizhome/COVID19Indiastory/Indiastory?publish=yes
The first Covid 19 case in India was reported on 30th January 2020 in South Indian state of Kerala on a medical student who was pursuing the studies at Wuhan University, China. Two more students were found to be infected in Kerala in the consecutive days. The Kerala government was successful in containing the disease with its proactive measures back then. The second outbreak of Covid 19 in India started in the first week of March from various parts of India in various people who visited the foreign countries and in some of the tourists from different countries.
The tableau story consists of the following data analysis : 1. State-wise number of infected and number of death count in India map. Hover the mouse on each state in the India map to know the count. 2. Click on the next caption to know the state-wise number of confirmed, active, recovered and deceased cases in the form of bar chart. 3. The next caption takes you to the bar chart which shows the number of cases getting confirmed in India each day starting from January 30, 2020. 4. Next caption takes us to an analysis of the Mortality rate and the Recovery rate (in percentage) of each of the Indian state. We get an idea how hard each of the state is hit by the pandemic. 5. Next caption gives a detailed analysis of the state Kerala which has the mortality rate of 0.806% and the recovery rate of 74.4% as of now. Hover the mouse to know the count in each district. Don't forget to have a look at the line graph of 'number of active cases' in Kerala. It looks almost flattened ! As everyday we hear the increasing number of cases and deaths across the country, this graph may make you feel better...! 6. Finally the caption takes you to the statistics from the topmost district of Kerala - Kasaragod. The total number of cases reported is 179 at Kasaragod. The active number of cases is just 12 as of now... !!! Have a look at the statistics from Kasaragod and the story of 'Kasaragod model' as some of the national media in India call it !!!
This data set consists of the following data: 1. state-wise statistics - Confirmed, Active, Recovered, Deceased cases 2. day-wise count of infected and deceased from various states 3. Statistics from Kerala - day-wise count of confirmed, Active, Recovered, Deceased cases 4. Statistics from Kasaragod district, Kerala - day-wise count of confirmed, Active, Recovered, Deceased cases 5. Count of confirmed cases from various districts of India
Ministry of Health and Family Welfare - India covid19india.org Wikipedia page - Covid 19 Pandemic India Govt. of Kerala dashboard - official Kerala Covid 19 statistics
Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Facebook
TwitterThe labor force participation rate of prime-age individuals (age 25 to 54) in the United States declined dramatically during and after the Great Recession. While the rate remains below its pre-recession level, it has been increasing steadily since 2015. We examine how different demographic groups have contributed to this rebound and find that college-educated women have made the largest contribution to the recent recovery in the prime-age labor force participation rate.
Facebook
TwitterBased on the number of deaths of despair in 2018 and projected levels of unemployment from 2020 to 2029, it is estimated that the additional number of deaths in 2023 could range from 2,017 to 21,457 depending on the rate of economic recovery after the COVID-19 recession. This statistic shows the possible additional deaths of despair following the COVID-19 recession for select economic scenarios, given a 1.6 percent increase in unemployment, in the United States from 2020 to 2029.
Facebook
TwitterAs global communities responded to COVID-19, we heard from public health officials that the same type of aggregated, anonymized insights we use in products such as Google Maps would be helpful as they made critical decisions to combat COVID-19. These Community Mobility Reports aimed to provide insights into what changed in response to policies aimed at combating COVID-19. The reports charted movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential.
Facebook
TwitterIn a 2020 online survey, ** percent of small business owners in the United States said they expected small businesses like theirs to not recover from the impacts of COVID-19 until beyond 2021. Only ***** percent of respondents believed that businesses like theirs would be able to recover within few more weeks.
Facebook
TwitterAt the onset of COVID-19, U.S. labor-force participation dropped by about 3 percentage points and remained below pre-pandemic levels three years later. Recovery varied across states, with slower rebounds in those more affected by the pre-pandemic opioid crisis, as measured by age-adjusted opioid overdose death rates. An event study shows that a one-standard-deviation increase in pre-COVID opioid death rates corresponds to a 0.9 percentage point decline in post-COVID labor participation. The result is not driven by differences in overall health between states. The effect of prior opioid exposure had a more significant impact on individuals without a college degree. The slow recovery in states with more opioid exposure was characterized by an increase in individuals who are not in the labor force due to disability.
Facebook
TwitterThis is a time-series trend data collection with a series of json files primarily focused on countries most impacted by Covid-19. The tree formatted time series data should be able to enable various different kinds of analysis to answer questions about what may make a country's health system vulnerable to Covid-19 and what health demographics may help reducing the impact.
| Confirmed_cases(by 4/3/2020) | Country Name |
|---|---|
| 245,559 | US |
| 115,242 | Italy |
| 112,065 | Spain |
| 84,794 | Germany |
| 82,464 | China |
| 59,929 | France |
| 34,173 | United Kingdom |
| 18,827 | Switzerland |
| 18,135 | Turkey |
| 15,348 | Belgium |
| 14,788 | Netherlands |
| 11,284 | Canada |
| 11,129 | Austria |
| 10,062 | Korea, South |
Healthcare GDP Expenditure
Healthcare Employment
Hospital Bed Capacity
Air Pollution and Death Rate
Chronic illnesses and DALYs(Disability-Adjusted Life Years)
Body Weight
Elderly(Aged 65+) Population
CT Scanner Density
Tobacco Consumption(Smoker population %)
More metrics can be added upon request.
The raw CSV includes many different types of measurements such as number, percentage and per 1 million population. This data normalizes the time_series data by selecting data that is more about density, and number per capita data rather than absolute numbers. This could help doing comparison among nations since they may vary significantly on population.
Most of the JSON files contain time_series data. For people who want to use the data as country metadata, the most-recent data attribute is collected in top_countries_latest_fact_summary.json
The JSON data focuses on the above mentioned demographic areas in a simple tree schema
{
Country_name:
{
metric_name:[
List of {year, value, unit}
]
}
}
The data is sourced from OECD(https://stats.oecd.org/) and GDHX(http://ghdx.healthdata.org/). The json files with prefix "gbd_" are from GDHX
Following citation is needed for using GDHX data:
GBD Results tool: Use the following to cite data included in this download: Global Burden of Disease Collaborative Network. Global Burden of Disease Study 2017 (GBD 2017) Results. Seattle, United States: Institute for Health Metrics and Evaluation (IHME), 2018. Available from http://ghdx.healthdata.org/gbd-results-tool.
Where does US rank in term of Healthcare/Preventive spending in GDP, hospital bed/ICU bed/physician density and long-term illness? In which areas can US do more to prevent future Cov-19 crisis?
Is there correlation in a nation's medical preparedness and the rate of growth in confirmation, death rate and recovery rate? From GBD data graphs, it seems that Dalys(DALYs (Disability-Adjusted Life Years), rate per 100k) can divided nations into different camps.
How does death rate from Cov-19 correlate with Death rate related to Cardiovascular diseases and Chronic respiratory diseases?
What trends can we discover in various nation's health demographics over time? Are some areas getting better while others getting worse?
With time span from 2010 to 2018, this dataset can also correlate with data related to recent outbreaks such as seasonal flus, Avian influenza, etc.
With some quick analysis, it shows that the US actually ranks higher than China for DALYs(Disability-adjusted life years) caused by Chronic Respiratory conditions, which could be due to seasonal allergies. It seems counter-intuitive that this may suggest that countries with cleaner air may have higher burden of people with Chronic Respiratory conditions that may have made them more vulnerable in the Covid-19 crisis.
Example Kernel: https://www.kaggle.com/timxia/bar-chart-comparison-of-countries
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4802460%2F2fce05195108856422b437316f34e837%2FTobacco.png?generation=1585936274243838&alt=media" alt="">
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4802460%2Fe8db14764a47a8bce48fa79bdfdfb0f1%2FChronicDisease.png?generation=1585936274372639&alt=media" alt="">
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F4802460%2Fc534d40af042b9a503325f41c49b83cb%2FAirPollution.png?generation=1585936274337626&alt=media" alt="">
Facebook
Twitter
According to our latest research, the global debt recovery services market size reached USD 22.5 billion in 2024, reflecting strong demand across various end-user industries. The market is expected to grow at a CAGR of 8.1% during the forecast period, reaching a projected value of USD 44.1 billion by 2033. This sustained expansion is primarily driven by rising levels of consumer and corporate debt, increased regulatory scrutiny, and the growing need for efficient recovery solutions in both developed and emerging economies. The market is also benefiting from technological advancements and the adoption of digital recovery platforms, which are streamlining collection processes and improving recovery rates worldwide.
A primary growth factor for the debt recovery services market is the surge in non-performing loans (NPLs) across global financial institutions. The aftermath of economic disruptions, such as the COVID-19 pandemic and subsequent inflationary pressures, has led to a significant uptick in loan defaults. Financial institutions, particularly banks and non-banking financial companies, are increasingly outsourcing their debt recovery functions to specialized agencies to optimize resource allocation and focus on core operations. This trend is further accentuated by tightening regulatory requirements, which mandate transparent and compliant collection practices, compelling organizations to partner with reputable debt recovery service providers that adhere to industry standards and ethical guidelines.
Another significant driver is the rapid digital transformation within the debt recovery services sector. The adoption of advanced analytics, artificial intelligence, and machine learning algorithms has revolutionized traditional recovery methods, enabling service providers to segment debtors more effectively, personalize communication strategies, and prioritize high-probability recoveries. Cloud-based platforms are gaining traction, offering scalability, real-time monitoring, and enhanced data security. These technological innovations not only improve operational efficiency but also ensure compliance with data privacy regulations, making them highly attractive to both large enterprises and small and medium-sized businesses seeking to enhance their recovery rates and minimize write-offs.
The expansion of the debt recovery services market is also fueled by the diversification of end-user industries. While the BFSI sector remains the largest contributor, sectors such as healthcare, telecom, utilities, and government are increasingly leveraging third-party recovery services to manage overdue accounts and optimize cash flow. The proliferation of credit-based transactions in retail and the rising volume of unpaid medical bills in healthcare are notable trends driving demand. Additionally, global regulatory harmonization and the cross-border nature of debt recovery are encouraging multinational corporations to engage specialized agencies with international expertise, further boosting market growth.
Regionally, North America continues to dominate the debt recovery services market, accounting for a substantial share in 2024, followed closely by Europe and Asia Pacific. The United States, in particular, benefits from a mature financial ecosystem and stringent regulatory frameworks, while EuropeÂ’s market is driven by the growing volume of non-performing loans in Southern and Eastern Europe. Asia Pacific is emerging as the fastest-growing region, propelled by rapid urbanization, expanding credit markets, and the digitalization of financial services. Latin America and the Middle East & Africa are also witnessing steady growth, supported by increasing financial inclusion and the adoption of modern recovery practices.
In the context of debt recovery services, Revenue Recovery for Payments is becoming an increasingly vital component. As businesses across various sectors strive to maintain healthy cash flows, the ability to efficiently recover outstanding payments is crucial. This aspect of revenue recovery not only helps in stabilizing financial operations but also ensures that companies can reinvest in growth opportunities. The integration of advanced technologies, such as automated billing systems and predictive analytics, is enhancing the effectiveness of revenue recovery s
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data across all counties in five states (Arizona, Colorado, New Mexico, Oklahoma, and Texas) in the U.S. were collected for the study on the impact of the socio-economic and political status on the county-level COVID-19 vaccination rates. Variables were obtained from various data sources; the Bureau of Labor Statistics, Bureau of Economic Analysis, 2010 US Census, Politico, and Centers for Disease Control and Prevention (CDC). It was found that county-level vaccination rates were significantly associated with the percentage of Democrat votes, the elderly population, and per capita income of the county. In addition, the results revealed racial and ethnic disparities in COVID-19 vaccination. The manuscript entitled “Impact of Socio-economic Status on the COVID-19 Vaccination: What Can We Learn for the Path of Recovery?” was submitted for publication.
Facebook
TwitterLike the rest of the world, Sudan has been experiencing the unprecedented social and economic impact of the COVID-19 pandemic. From restrictions on movement to school closures and lockdowns, the economic situation worsened, and commodity prices soared across the country. Results from the first six rounds of the High-Frequency Phone survey indicated that household welfare was negatively affected. The situation led to the loss of employment and income, decreased access to essential commodities and services, and food insecurity, particularly among the poor and vulnerable Sudanese. Moreover, the inability to access food and medicine degraded in July/August 2021 despite a slight amelioration in February/April 2021.
After COVID-19 in 2020, Sudan experienced situations that are more likely to compromise the recovery process. Political instability, unrest, and protests occurred before and after the military takeover in October 2021. Meanwhile, Sudan Central Bank devalued the currency, which may increase the already high commodities price. Besides, Sudan encountered historic flooding since the onset of the rainy season between May and June 2022. To monitor and assess the dynamics of the impacts of the country's economic and political situation (high inflation, social unrest, food shortages, asset loss, displacement, etc.) on households' welfare, another round of the Sudan High-Frequency Phone survey took place in June to August 2022.
Similar to the six previous rounds, the survey was conducted using mobile phones and covered all 18 states of Sudan. Round 7 sample is composed of 2816 Households from both urban and rural areas of Sudan. This sample allows us to draw statistical inferences about the Sudanese population at the national and rural/urban levels. The risk of nonresponse was a concern, so efforts were made to minimize this risk, including follow-up with respondents who failed to respond and keep the interviews short (15–20 minutes) to reduce respondent fatigue.
The questions are similar to the previous six rounds of the High-Frequency Phone survey but with added context. Households are asked about the key channels through which individuals and households are expected to be affected by the exchange rate distortions, country political instability, or flooding that occurred in May/June 2022, as well as how they have recovered from the COVID-19 pandemic impacts. Furthermore, questions cover a range of topics/themes including, but not limited to, health conditions, access to health facilities, access to other social services, availability of common food and non-food items (including medicines), nutrition and food security, employment/labor, income, assets, coping strategies, remittances, subjective welfare, climate/weather events, and the safety nets assistance.
National
The sampling methodology adopted for the implementation of this survey is probabilistic. Each of the units in the targeted population of the study must have a nonzero and known probability of selection. The sample was stratified by rural/urban for all 18 states. The distribution of the sub-sample between states and rural/urban is proportional to the size of the individuals owning mobile phones, i.e., not equal allocation. The selection of the individual phones (the households) is random, i.e., with equal probability, using a systematic sample procedure in the list (frame) of phones. This allows for extrapolating the results of the sample to the target population and estimating the precision of the results obtained. However, the implementation of this approach requires the availability of an adequate sampling frame containing all the units of the population without omissions or duplications.
In this survey, the sampling frame is provided by the phone lists. Considerable efforts were made to compile the frame using multiple lists of phone numbers collected during the implementation of various projects/surveys during the last few years at the household level across the country. This reduces the chances of having more than one phone number per household. Moreover, the interviewers double-checked during data collection that only one number was called for each selected surveyed household. Therefore, selecting individual phone numbers is the same as selecting households. It is worth noting that for West Kordofan and Central Darfur, the proportionality of rural/urban cannot be done according to the size of phones since there are no details for rural/urban. So, the size of the rural and urban populations (projection 2020) was used instead.
In Sudan, under the present federal system, the state is considered a semiautonomous entity mandated to take care of the affairs of the citizen, provide governance, and be responsible for planning, policy formulation, and implementation of the annual program. Consequently, the sample needed to cover all 18 states of the country. The sample is conceived to provide reliable estimates for the country (urban and rural) and to give statistically meaningful results at the national level.
Computer Assisted Telephone Interview [cati]
BASELINE (ROUND 1): One questionnaire, the Household Questionnaire, was administered to all households in the sample. The Household Questionnaire provides information on: - Demographics - Knowledge regarding the spread of COVID-19 - Behavior and social distancing - Access to basic goods and services (medicines, staple food, health, education, financial services) - Employment - Income loss - Food insecurity experience - Welfare - Shocks and Coping strategies - Social safety nets
ROUND 2: One questionnaire, the Household Questionnaire, was administered to all households in the sample. The Household Questionnaire provides information on: - Demographics - Knowledge regarding the spread of COVID-19 - Behavior and social distancing - Access to basic goods and services (medicines, staple food, health, education, financial services, water, transportation, housing, internet, energy) - Employment - Income loss - Food insecurity experience - Welfare - Shocks and Coping strategies - Social safety nets ROUND 3: One questionnaire, the Household Questionnaire, was administered to all households in the sample. The Household Questionnaire provides information on: - Demographics - Behavior and social distancing - Access to basic goods and services (medicines, staple food, health, education, financial services) - Employment - Income loss - Food insecurity experience - Welfare - Shocks and Coping strategies - Social safety nets ROUND 4: One questionnaire, the Household Questionnaire, was administered to all households in the sample. The Household Questionnaire provides information on: - Demographics - Youth module screening - Behavior and social distancing - Access to basic goods and services (medicines, staple food, health, education, transportation, fuel) - Employment - Income loss - Food insecurity experience - Welfare - Shocks and Coping strategies - Social safety nets ROUND 5: One questionnaire, the Household Questionnaire, was administered to all households in the sample. Respondent were asked to think about each child in their household for the education question. The Household Questionnaire provides information on: - Demographics - Mental health of the respondent - Children education.
ROUND 6: One questionnaire, the Household Questionnaire, was administered to all households in the sample. One youth per household is interviewed in the youth section of the questionnaire. The Questionnaire provides information on: - Demographics - Access to basic goods (medicines, staple food) - Youth employment - Youth job search - Youth aspirations and expectations - Youth skills and mental health.
ROUND 7: One questionnaire, the Household Questionnaire, was administered to all households in the sample. The Household Questionnaire provides information on: - Geography - Access to basic goods and services (medicines, staple food, health, education, water, housing, electricity) - Employment - Income loss - Food insecurity experience - Welfare - Experience of Climate/Weather events - Shocks and Coping strategies
BASELINE (ROUND 1): A total of 4,032 households were successfully interviewed during the first round of data collection (conducted during June 16–July 5, 2020). Selected households from each state include both rural and urban households, with the representation of each state in the final sample being proportional to the state’s population relative to the overall population. Households who refused to tell their location (mode of living and state) were dropped to minimize bias. The final sample size accounts 4,027 households.
ROUND 2: Interviewers attempted to contact and interview all 4,032 households that were successfully interviewed in the baseline of the Sudan HFS on COVID-19. 2,989 households were successfully interviewed in the second round. However, households who refused to tell their location (mode of living and state) were dropped to minimize bias. The final sample size accounts 2,987 households.
ROUND 3: Interviewers attempted to contact and interview all 4,032 households that were successfully interviewed in the Baseline of the Sudan HFS on COVID-19. 2,990 households were successfully interviewed in the third round. Households who refused to tell their location (mode of living and state) were dropped to minimize bias. The final sample size accounts 2,987 households.
ROUND 4: Interviewers attempted to contact and interview all 4,032 households that were successfully interviewed in the Baseline of the Sudan
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United States recorded 16306656 Coronavirus Recovered since the epidemic began, according to the World Health Organization (WHO). In addition, United States reported 797346 Coronavirus Deaths. This dataset includes a chart with historical data for the United States Coronavirus Recovered.