2 datasets found
  1. COVID-19. Novel coronavirus dataset Jan-Feb 2020

    • kaggle.com
    Updated Feb 10, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mukharbek Organokov (2020). COVID-19. Novel coronavirus dataset Jan-Feb 2020 [Dataset]. https://www.kaggle.com/muhakabartay/novel-coronavirus-2019ncov/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 10, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mukharbek Organokov
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Coronaviruses are a large family of viruses found in both animals and humans. Some infect people and are known to cause illness ranging from the common cold to more severe diseases such as Middle East Respiratory Syndrome (MERS) and Severe Acute Respiratory Syndrome (SARS).

    A novel coronavirus (CoV) is a new strain of coronavirus that has not been previously identified in humans. The new, or “novel” coronavirus, now called 2019-nCoV, had not previously detected before the outbreak was reported in Wuhan, China in December 2019.

    Short story: On December 31, 2019, the WHO was informed of an outbreak of “pneumonia of unknown cause” detected in Wuhan City, Hubei Province, China – the seventh-largest city in China with 11 million residents. As of January 23, there are over 800 cases of 2019-nCoV confirmed globally, including cases in at least 20 regions in China and nine countries/territories. The first reported infected individuals, some of whom showed symptoms as early as December 8, were discovered to be among stallholders from the Wuhan South China Seafood Market. Subsequently, the wet market was closed on Jan 1. The virus causing the outbreak was quickly determined to be a novel coronavirus. On January 10, gene sequencing further determined it to be the new Wuhan coronavirus, namely 2019-nCoV, a betacoronavirus, related to the Middle Eastern Respiratory Syndrome virus (MERS-CoV) and the Severe Acute Respiratory Syndrome virus (SARS-CoV). However, the mortality and transmissibility of 2019-nCoV are still unknown, and likely to vary from those of the prior referenced coronaviruses.

    See more information on the webpage of World Health Organization

    Content

    John Hopkins University Google Sheet of time series confirmed|recovered|death case numbers converted to CSV format.

    The data operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE).

    See also GitHub.

    Track virus here.

    Acknowledgements

    Thanks to
    - JHU CCSE
    - WHO
    - Centers for Disease Control and Prevention (CDC)
    - European Centre for Disease Prevention and Control (ECDC)
    - DXY
    - National Health Commission of the People's Republic of China (NHC)

  2. f

    Building-level functional maps of 109 Chinese cities

    • figshare.com
    zip
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhuohong Li; Linxin Li; Ting Hu; Mofan Cheng; Wei He; Tong Qiu; Liangpei Zhang; Hongyan Zhang (2025). Building-level functional maps of 109 Chinese cities [Dataset]. http://doi.org/10.6084/m9.figshare.29262584.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 13, 2025
    Dataset provided by
    figshare
    Authors
    Zhuohong Li; Linxin Li; Ting Hu; Mofan Cheng; Wei He; Tong Qiu; Liangpei Zhang; Hongyan Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    China
    Description

    BackgroundAs the world’s most rapidly urbanizing country, China now faces mounting challenges from growing inequalities in the built environment, including disparities in access to essential infrastructure and diverse functional facilities. Yet these urban inequalities have remained unclear due to coarse observation scales and limited analytical scopes. In this study, we present the first building-level functional map of China, covering 110 million individual buildings across 109 cities using 69 terabytes of 1-meter resolution multi-modal satellite imagery. The national-scale map is validated by government reports and 5,280,695 observation points, showing strong agreement with external benchmarks. This enables the first nationwide, multi-dimensional assessment of inequality in the built environment across city tiers, geographical regions, and intra-city zones.About dataBased on the Paraformer framework that we proposed previously, we produced the first nationwide building-level functional map of urban China, processing over 69 TB of satellite data, including 1-meter Google Earth optical imagery (https://earth.google.com), 10-meter nighttime lights (SGDSAT-1) (https://sdg.casearth.cn/en), and building height data (CNBH-10m) (https://zenodo.org/records/7827315). Labels were derived from: (1) Building footprint data, including the CN-OpenData (https://doi.org/10.11888/Geogra.tpdc.271702) and the East Asia Building Dataset (https://zenodo.org/records/8174931); and (2) Land use and AOI data used for constructing urban functional annotation are retrieved from OpenStreetMap (https://www.openstreetmap.org) and EULUC-China dataset (https://doi.org/10.1016/j.scib.2019.12.007). The first 1-meter resolution national-scale land-cover map used to conduct the accessibility analysis is available in our previous study: SinoLC-1 (https://doi.org/10.5281/zenodo.7707461). The housing inequality and infrastructure allocation analysis was conducted based on the 100-meter gridded population dataset from China's seventh census (https://figshare.com/s/d9dd5f9bb1a7f4fd3734?file=43847643).This version of the data includes (1) Building-level functional maps of 109 Chinese cities, and (2) In-situ validation point sets. The building-level functional maps of 109 Chinese cities are organized in the ESRI Shapefile format, which includes five components: “.cpg”, “.dbf”, “.shx”, “.shp”, and “.prj” files. These components are stored in “.zip” files. Each city is named “G_P_C.zip,” where “G” explains the geographical region (south, central, east, north, northeast, northwest, and southwest of China) information, “P” explains the provincial administrative region information, and “C” explains the city name. For example, the building functional map for Wuhan City, Hubei Province is named “Central_Hubei_Wuhan.zip”.Furthermore, each shapefile of a city contains the building functional types from 1 to 8, where the corresponding relationship between the values and the building functions is shown below:Residential buildingCommercial buildingIndustrial buildingHealthcare buildingSport and art buildingEducational buildingPublic service buildingAdministrative buildingAbout validationGiven the importance of accurate mapping for downstream analysis, we conducted a comprehensive evaluation using government reports and in situ validation data outlined in the Data Section. This evaluation comprised two parts. First, a statistical-level evaluation was performed for each city based on official reports from the China Urban-Rural Construction Statistical Yearbook (https://www.mohurd.gov.cn/gongkai/fdzdgknr/sjfb/tjxx/jstjnj/index.html) and China Statistical Yearbook (https://www.stats.gov.cn/sj/ndsj/2023/indexch.htm). Second, a building-level geospatial evaluation was conducted by using 5.28 million field-observed points from Amap Inc. (provided in this data version of "Validation_in-situ_points.zip"), and a confusion matrix was calculated to compare the in situ points with the mapped buildings at the same location. The "Validation_in-situ_points.zip" includes the original point sets of each city, named as the city name (e.g., Wuhan.shp and corresponding “.cpg”, “.dbf”, “.shx”, and “.prj” files).

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Mukharbek Organokov (2020). COVID-19. Novel coronavirus dataset Jan-Feb 2020 [Dataset]. https://www.kaggle.com/muhakabartay/novel-coronavirus-2019ncov/code
Organization logo

COVID-19. Novel coronavirus dataset Jan-Feb 2020

Time series of confirmed cases

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 10, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Mukharbek Organokov
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

Coronaviruses are a large family of viruses found in both animals and humans. Some infect people and are known to cause illness ranging from the common cold to more severe diseases such as Middle East Respiratory Syndrome (MERS) and Severe Acute Respiratory Syndrome (SARS).

A novel coronavirus (CoV) is a new strain of coronavirus that has not been previously identified in humans. The new, or “novel” coronavirus, now called 2019-nCoV, had not previously detected before the outbreak was reported in Wuhan, China in December 2019.

Short story: On December 31, 2019, the WHO was informed of an outbreak of “pneumonia of unknown cause” detected in Wuhan City, Hubei Province, China – the seventh-largest city in China with 11 million residents. As of January 23, there are over 800 cases of 2019-nCoV confirmed globally, including cases in at least 20 regions in China and nine countries/territories. The first reported infected individuals, some of whom showed symptoms as early as December 8, were discovered to be among stallholders from the Wuhan South China Seafood Market. Subsequently, the wet market was closed on Jan 1. The virus causing the outbreak was quickly determined to be a novel coronavirus. On January 10, gene sequencing further determined it to be the new Wuhan coronavirus, namely 2019-nCoV, a betacoronavirus, related to the Middle Eastern Respiratory Syndrome virus (MERS-CoV) and the Severe Acute Respiratory Syndrome virus (SARS-CoV). However, the mortality and transmissibility of 2019-nCoV are still unknown, and likely to vary from those of the prior referenced coronaviruses.

See more information on the webpage of World Health Organization

Content

John Hopkins University Google Sheet of time series confirmed|recovered|death case numbers converted to CSV format.

The data operated by the Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE).

See also GitHub.

Track virus here.

Acknowledgements

Thanks to
- JHU CCSE
- WHO
- Centers for Disease Control and Prevention (CDC)
- European Centre for Disease Prevention and Control (ECDC)
- DXY
- National Health Commission of the People's Republic of China (NHC)

Search
Clear search
Close search
Google apps
Main menu