2 datasets found
  1. Covid-19 Czech Republic

    • kaggle.com
    zip
    Updated Jul 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Brezak (2020). Covid-19 Czech Republic [Dataset]. https://www.kaggle.com/michalbrezk/covid19-czech-republic
    Explore at:
    zip(116897 bytes)Available download formats
    Dataset updated
    Jul 3, 2020
    Authors
    Michal Brezak
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Czechia
    Description

    Context

    This dataset has been collected from multiple sources provided by MVCR on their websites and contains daily summarized statistics as well as details statistics up to age & sex level.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Columns description

    Date - Calendar date when data were collected Daily tested - Sum of tests performed Daily infected - Sum of confirmed cases those were positive Daily cured - Sum of cured people that does not have Covid-19 anymore Daily deaths - Sum of people those died on Covid-19 Daily cum tested - Cumulative sum of tests performed Daily infected - Cumulative sum of confirmed cases those were positive Daily cured - Cumulative sum of cured people that does not have Covid-19 anymore Daily deaths - Cumulative sum of people those died on Covid-19 Region - Region of Czech republic Sub-Region - Sub-Region of Czech republic Region accessories qty - Quantity of health care accessories delivered to region for all the time Age - Age of person Sex - Sex of person Infected - Sum of infected people for specific date, region, sub-region, age and sex Cured - Sum of cured people for specific date, region, sub-region, age and sex Death - Sum of people those dies on Covid-19 for specific date, region, sub-region, age and sex

    Data granularity

    Dataset contains data on different level of granularities. Make sure you do not mix different granularities. Let's suppose you have loaded data into pandas dataframe called df.

    Day level

    df_daily = df.groupby(['date']).max()[['daily_tested','daily_infected','daily_cured','daily_deaths','daily_cum_tested','daily_cum_infected','daily_cum_cured','daily_cum_deaths']].reset_index()
    

    Region level

    df_region = df[df['region'] != ''].groupby(['region']).agg(
      region_accessories_qty=pd.NamedAgg(column='region_accessories_qty', aggfunc='max'), 
      infected=pd.NamedAgg(column='infected', aggfunc='sum'),
      cured=pd.NamedAgg(column='cured', aggfunc='sum'),
      death=pd.NamedAgg(column='death', aggfunc='sum')
    ).reset_index()
    

    Detail level

    df_detail = df[['date','region','sub_region','age','sex','infected','cured','death']].reset_index(drop=True)
    

    Acknowledgements

    Thanks to websites of MVCR for sharing such great information.

    Inspiration

    Can you see relation between health care accessories delivered to region and number of cured/infected in that region? Why Czech Republic belongs to pretty safe countries when talking about Covid-19 Pandemic? Can you find out what is difference of pandemic evolution in Czech Republic comparing to other surrounding coutries, like Germany or Slovakia?

  2. Covid-19 Czech Republic

    • kaggle.com
    Updated Jul 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michal Brezak (2020). Covid-19 Czech Republic [Dataset]. https://www.kaggle.com/michalbrezk/covid19-czech-republic/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 3, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Michal Brezak
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Czechia
    Description

    Context

    This dataset has been collected from multiple sources provided by MVCR on their websites and contains daily summarized statistics as well as details statistics up to age & sex level.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Columns description

    Date - Calendar date when data were collected Daily tested - Sum of tests performed Daily infected - Sum of confirmed cases those were positive Daily cured - Sum of cured people that does not have Covid-19 anymore Daily deaths - Sum of people those died on Covid-19 Daily cum tested - Cumulative sum of tests performed Daily infected - Cumulative sum of confirmed cases those were positive Daily cured - Cumulative sum of cured people that does not have Covid-19 anymore Daily deaths - Cumulative sum of people those died on Covid-19 Region - Region of Czech republic Sub-Region - Sub-Region of Czech republic Region accessories qty - Quantity of health care accessories delivered to region for all the time Age - Age of person Sex - Sex of person Infected - Sum of infected people for specific date, region, sub-region, age and sex Cured - Sum of cured people for specific date, region, sub-region, age and sex Death - Sum of people those dies on Covid-19 for specific date, region, sub-region, age and sex Infected abroad - Identifies if person was infected by Covid-19 in Czech republic or abroad Infected in country - code of country from where person came (origin country of Covid-19)

    Data granularity

    Dataset contains data on different level of granularities. Make sure you do not mix different granularities. Let's suppose you have loaded data into pandas dataframe called df.

    Day level

    df_daily = df.groupby(['date']).max()[['daily_tested','daily_infected','daily_cured','daily_deaths','daily_cum_tested','daily_cum_infected','daily_cum_cured','daily_cum_deaths']].reset_index()
    

    Region level

    df_region = df[df['region'] != ''].groupby(['region']).agg(
      region_accessories_qty=pd.NamedAgg(column='region_accessories_qty', aggfunc='max'), 
      infected=pd.NamedAgg(column='infected', aggfunc='sum'),
      cured=pd.NamedAgg(column='cured', aggfunc='sum'),
      death=pd.NamedAgg(column='death', aggfunc='sum')
    ).reset_index()
    

    Detail level

    df_detail = df[['date','region','sub_region','age','sex','infected','cured','death','infected_abroad','infected_in_country']].reset_index(drop=True)
    

    Acknowledgements

    Thanks to websites of MVCR for sharing such great information.

    Inspiration

    Can you see relation between health care accessories delivered to region and number of cured/infected in that region? Why Czech Republic belongs to pretty safe countries when talking about Covid-19 Pandemic? Can you find out what is difference of pandemic evolution in Czech Republic comparing to other surrounding coutries, like Germany or Slovakia?

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Michal Brezak (2020). Covid-19 Czech Republic [Dataset]. https://www.kaggle.com/michalbrezk/covid19-czech-republic
Organization logo

Covid-19 Czech Republic

Daily & regional level, including tested, infected, cured and death people

Explore at:
69 scholarly articles cite this dataset (View in Google Scholar)
zip(116897 bytes)Available download formats
Dataset updated
Jul 3, 2020
Authors
Michal Brezak
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Area covered
Czechia
Description

Context

This dataset has been collected from multiple sources provided by MVCR on their websites and contains daily summarized statistics as well as details statistics up to age & sex level.

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Columns description

Date - Calendar date when data were collected Daily tested - Sum of tests performed Daily infected - Sum of confirmed cases those were positive Daily cured - Sum of cured people that does not have Covid-19 anymore Daily deaths - Sum of people those died on Covid-19 Daily cum tested - Cumulative sum of tests performed Daily infected - Cumulative sum of confirmed cases those were positive Daily cured - Cumulative sum of cured people that does not have Covid-19 anymore Daily deaths - Cumulative sum of people those died on Covid-19 Region - Region of Czech republic Sub-Region - Sub-Region of Czech republic Region accessories qty - Quantity of health care accessories delivered to region for all the time Age - Age of person Sex - Sex of person Infected - Sum of infected people for specific date, region, sub-region, age and sex Cured - Sum of cured people for specific date, region, sub-region, age and sex Death - Sum of people those dies on Covid-19 for specific date, region, sub-region, age and sex

Data granularity

Dataset contains data on different level of granularities. Make sure you do not mix different granularities. Let's suppose you have loaded data into pandas dataframe called df.

Day level

df_daily = df.groupby(['date']).max()[['daily_tested','daily_infected','daily_cured','daily_deaths','daily_cum_tested','daily_cum_infected','daily_cum_cured','daily_cum_deaths']].reset_index()

Region level

df_region = df[df['region'] != ''].groupby(['region']).agg(
  region_accessories_qty=pd.NamedAgg(column='region_accessories_qty', aggfunc='max'), 
  infected=pd.NamedAgg(column='infected', aggfunc='sum'),
  cured=pd.NamedAgg(column='cured', aggfunc='sum'),
  death=pd.NamedAgg(column='death', aggfunc='sum')
).reset_index()

Detail level

df_detail = df[['date','region','sub_region','age','sex','infected','cured','death']].reset_index(drop=True)

Acknowledgements

Thanks to websites of MVCR for sharing such great information.

Inspiration

Can you see relation between health care accessories delivered to region and number of cured/infected in that region? Why Czech Republic belongs to pretty safe countries when talking about Covid-19 Pandemic? Can you find out what is difference of pandemic evolution in Czech Republic comparing to other surrounding coutries, like Germany or Slovakia?

Search
Clear search
Close search
Google apps
Main menu