100+ datasets found
  1. h

    Agriculture-Plan-Diseases-QA-Pairs-Dataset

    • huggingface.co
    Updated Jun 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuvraj Singh (2024). Agriculture-Plan-Diseases-QA-Pairs-Dataset [Dataset]. https://huggingface.co/datasets/YuvrajSingh9886/Agriculture-Plan-Diseases-QA-Pairs-Dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 30, 2024
    Authors
    Yuvraj Singh
    Description

    YuvrajSingh9886/Agriculture-Plan-Diseases-QA-Pairs-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. Boolean DataSet

    • kaggle.com
    zip
    Updated Feb 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Singh Prince Rinku (2024). Boolean DataSet [Dataset]. https://www.kaggle.com/datasets/singhprincerinku/boolean-dataset
    Explore at:
    zip(7000 bytes)Available download formats
    Dataset updated
    Feb 22, 2024
    Authors
    Singh Prince Rinku
    Description

    Dataset

    This dataset was created by Singh Prince Rinku

    Released under Other (specified in description)

    Contents

  3. Manufacturing Dataset

    • kaggle.com
    zip
    Updated Aug 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shreshth Vashisht (2024). Manufacturing Dataset [Dataset]. https://www.kaggle.com/datasets/shreshthvashisht/manufacturing-dataset
    Explore at:
    zip(108377 bytes)Available download formats
    Dataset updated
    Aug 23, 2024
    Authors
    Shreshth Vashisht
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Shreshth Vashisht

    Released under Apache 2.0

    Contents

  4. m

    Dataset for Crop Pest and Disease Detection

    • data.mendeley.com
    Updated Apr 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick Mensah Kwabena (2023). Dataset for Crop Pest and Disease Detection [Dataset]. http://doi.org/10.17632/bwh3zbpkpv.1
    Explore at:
    Dataset updated
    Apr 26, 2023
    Authors
    Patrick Mensah Kwabena
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The application of Artificial Intelligence (AI) has been evident in the agricultural sector recently. The main goal of AI in agriculture is to improve crop yield, control crop pests/diseases, and reduce cost. The agricultural sector in developing countries faces severe in the form of disease and pest infestation, the knowledge gap between farmers and technology, and a lack of storage facilities, among others. To help address some of these challenges, this work presents crop pests/disease datasets sourced from local farms in Ghana. The dataset is presented in two folds; the raw images which consists of 24,881 images ( 6,549-Cashew, 7,508-Cassava, 5,389-Maize, and 5,435-Tomato) and augmented images which is further split into train and test set consists of 102,976 images (25,811-Cashew, 26,330-Cassava, 23,657-Maize, and 27,178-Tomato), categorized into 22 classes. All images are de-identified, validated by expert plant virologists, and freely available for use by the research community.

  5. h

    dataset

    • huggingface.co
    Updated Jul 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dmitrii Aspisov (2025). dataset [Dataset]. https://huggingface.co/datasets/aspisov/dataset
    Explore at:
    Dataset updated
    Jul 27, 2025
    Authors
    Dmitrii Aspisov
    Description

    aspisov/dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  6. Pills dataset

    • kaggle.com
    zip
    Updated Oct 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JAYAPRAKASHPONDY (2023). Pills dataset [Dataset]. https://www.kaggle.com/datasets/jayaprakashpondy/pills-dataset
    Explore at:
    zip(141223154 bytes)Available download formats
    Dataset updated
    Oct 17, 2023
    Authors
    JAYAPRAKASHPONDY
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by JAYAPRAKASHPONDY

    Released under CC0: Public Domain

    Contents

  7. R

    Dataset Ow Dataset

    • universe.roboflow.com
    zip
    Updated Jan 8, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Overwatch (2024). Dataset Ow Dataset [Dataset]. https://universe.roboflow.com/overwatch-4wpfl/dataset-ow
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 8, 2024
    Dataset authored and provided by
    Overwatch
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Player Bounding Boxes
    Description

    Dataset Ow

    ## Overview
    
    Dataset Ow is a dataset for object detection tasks - it contains Player annotations for 10,000 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  8. CSIRO Sentinel-1 SAR image dataset of oil- and non-oil features for machine...

    • data.csiro.au
    • researchdata.edu.au
    Updated Dec 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Blondeau-Patissier; Thomas Schroeder; Foivos Diakogiannis; Zhibin Li (2022). CSIRO Sentinel-1 SAR image dataset of oil- and non-oil features for machine learning ( Deep Learning ) [Dataset]. http://doi.org/10.25919/4v55-dn16
    Explore at:
    Dataset updated
    Dec 15, 2022
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    David Blondeau-Patissier; Thomas Schroeder; Foivos Diakogiannis; Zhibin Li
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Time period covered
    May 1, 2015 - Aug 31, 2022
    Area covered
    Dataset funded by
    CSIROhttp://www.csiro.au/
    ESA
    Description

    What this collection is: A curated, binary-classified image dataset of grayscale (1 band) 400 x 400-pixel size, or image chips, in a JPEG format extracted from processed Sentinel-1 Synthetic Aperture Radar (SAR) satellite scenes acquired over various regions of the world, and featuring clear open ocean chips, look-alikes (wind or biogenic features) and oil slick chips.

    This binary dataset contains chips labelled as: - "0" for chips not containing any oil features (look-alikes or clean seas)
    - "1" for those containing oil features.

    This binary dataset is imbalanced, and biased towards "0" labelled chips (i.e., no oil features), which correspond to 66% of the dataset. Chips containing oil features, labelled "1", correspond to 34% of the dataset.

    Why: This dataset can be used for training, validation and/or testing of machine learning, including deep learning, algorithms for the detection of oil features in SAR imagery. Directly applicable for algorithm development for the European Space Agency Sentinel-1 SAR mission (https://sentinel.esa.int/web/sentinel/missions/sentinel-1 ), it may be suitable for the development of detection algorithms for other SAR satellite sensors.

    Overview of this dataset: Total number of chips (both classes) is N=5,630 Class 0 1 Total 3,725 1,905

    Further information and description is found in the ReadMe file provided (ReadMe_Sentinel1_SAR_OilNoOil_20221215.txt)

  9. HWID12 (Highway Incidents Detection Dataset)

    • kaggle.com
    zip
    Updated Mar 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Landry KEZEBOU (2022). HWID12 (Highway Incidents Detection Dataset) [Dataset]. https://www.kaggle.com/datasets/landrykezebou/hwid12-highway-incidents-detection-dataset
    Explore at:
    zip(12018619931 bytes)Available download formats
    Dataset updated
    Mar 17, 2022
    Authors
    Landry KEZEBOU
    Description

    Context

    Action Recognition in video is known to be more challenging than image recognition problems. Unlike image recognition models which use 2D convolutional neural blocks, action classification models require additional dimensionality to capture the spatio-temporal information in video sequences. This intrinsically makes video action recognition models computationally intensive and significantly more data-hungry than image recognition counterparts. Unequivocally, existing video datasets such as Kinetics, AVA, Charades, Something-Something, HMDB51, and UFC101 have had tremendous impact on the recently evolving video recognition technologies. Artificial Intelligence models trained on these datasets have largely benefited applications such as behavior monitoring in elderly people, video summarization, and content-based retrieval. However, this growing concept of action recognition has yet to be explored in Intelligent Transportation System (ITS), particularly in vital applications such as incidents detection. This is partly due to the lack of availability of annotated dataset adequate for training models suitable for such direct ITS use cases. In this paper, the concept of video action recognition is explored to tackle the problem of highway incident detection and classification from live surveillance footage. First, a novel dataset - HWID12 (Highway Incidents Detection) dataset is introduced. The HWAD12 consists of 11 distinct highway incidents categories, and one additional category for negative samples representing normal traffic. The proposed dataset also includes 2780+ video segments of 3 to 8 seconds on average each, and 500k+ temporal frames. Next, the baseline for highway accident detection and classification is established with a state-of-the-art action recognition model trained on the proposed HWID12 dataset. Performance benchmarking for 12-class (normal traffic vs 11 accident categories), and 2-class (incident vs normal traffic) settings is performed. This benchmarking reveals a recognition accuracy of up to 88% and 98% for 12-class and 2-class recognition setting, respectively.

    Data Acquisition

    The Proposed Highway Incidents Detection Dataset (HWID12) is the first of its kind dataset aimed at fostering experimentation of video action recognition technologies to solve the practical problem of real-time highway incident detections which currently challenges intelligent transportation systems. The lack of such dataset has limited the expansion of the recent breakthroughs in video action classification for practical uses cases in intelligent transportation systems.. The proposed dataset contains more than 2780 video clips of length varying between 3 to 8 seconds. These video clips capture moments leading to, up until right after an incident occurred. The clips were manually segmented from accident compilations videos sourced from YouTube and other videos data platforms.

    Content

    There is one main zip file available for download. The zip file contains 2780+ video clips. 1) 12 folders
    2) each folder represents an incident category. One of the classes represent the negative sample class which simulates normal traffic.

    Terms and Conditions

    • Videos provided in this dataset are freely available for research and education purposes only. Please be sure to properly credit the authors by citing the article below.
    • Be sure to upvote this dataset if you find it useful by scrolling up and clicking the up-Arrow ^ sign at the top banner of the page, next to "New Notebook" button.
    • Be sure to blur out all plate numbers before publishing any of the contents available in this dataset.

    Acknowledgements

    Any publication using this database must reference to the following journal manuscript:

    • Landry Kezebou, Victor Oludare, Karen Panetta, James Intriligator, and Sos Agaian "Highway accident detection and classification from live traffic surveillance cameras: a comprehensive dataset and video action recognition benchmarking", Proc. SPIE 12100, Multimodal Image Exploitation and Learning 2022, 121000M (27 May 2022); https://doi.org/10.1117/12.2618943

    Note: if the link is broken, please use http instead of https.

    In Chrome, use the steps recommended in the following website to view the webpage if it appears to be broken https://www.technipages.com/chrome-enabledisable-not-secure-warning

    Other relevant datasets VCoR dataset: https://www.kaggle.com/landrykezebou/vcor-vehicle-color-recognition-dataset VRiV dataset: https://www.kaggle.com/landrykezebou/vriv-vehicle-recognition-in-videos-dataset

    For any enquires regarding the HWID12 dataset, contact: landrykezebou@gmail.com

  10. h

    Habi-Dataset

    • huggingface.co
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Telmo Robredo (2025). Habi-Dataset [Dataset]. https://huggingface.co/datasets/TelmoRobredo/Habi-Dataset
    Explore at:
    Dataset updated
    May 28, 2025
    Authors
    Telmo Robredo
    License

    https://choosealicense.com/licenses/llama4/https://choosealicense.com/licenses/llama4/

    Description

    TelmoRobredo/Habi-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. R

    鱼分类存储器 Dataset

    • universe.roboflow.com
    zip
    Updated Nov 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wihuin (2024). 鱼分类存储器 Dataset [Dataset]. https://universe.roboflow.com/wihuin/-ymmmo/dataset/4
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 4, 2024
    Dataset authored and provided by
    wihuin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Fish Bounding Boxes
    Description

    鱼分类存储器

    ## Overview
    
    鱼分类存储器 is a dataset for object detection tasks - it contains Fish annotations for 7,828 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  12. R

    Old+new Dataset

    • universe.roboflow.com
    zip
    Updated May 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hslupren (2025). Old+new Dataset [Dataset]. https://universe.roboflow.com/hslupren/old-new/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 22, 2025
    Dataset authored and provided by
    hslupren
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Graph Node Cone Obstacle ChE7 Bounding Boxes
    Description

    Old+new

    ## Overview
    
    Old+new is a dataset for object detection tasks - it contains Graph Node Cone Obstacle ChE7 annotations for 656 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  13. Restaurant Cost and Sales Dataset

    • kaggle.com
    zip
    Updated Oct 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Virtual_School (2023). Restaurant Cost and Sales Dataset [Dataset]. https://www.kaggle.com/datasets/virtualschool/restaurant-cost-and-sales-dataset
    Explore at:
    zip(2240359 bytes)Available download formats
    Dataset updated
    Oct 26, 2023
    Authors
    Virtual_School
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    The dataset includes following fields: (Food) Item, Category, Sub Category, Item Name, Price, Cost. The purpose of this dataset is to practice data visualization in tools like power bi and python.

  14. R

    Taco Official Dataset

    • universe.roboflow.com
    zip
    Updated Nov 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GGEEML (2022). Taco Official Dataset [Dataset]. https://universe.roboflow.com/ggeeml/taco-official/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 7, 2022
    Dataset authored and provided by
    GGEEML
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Trash Bounding Boxes
    Description

    Taco Official

    ## Overview
    
    Taco Official is a dataset for object detection tasks - it contains Trash annotations for 7,456 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  15. R

    Cotssss Dataset

    • universe.roboflow.com
    zip
    Updated Jan 15, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter Oropeza (2022). Cotssss Dataset [Dataset]. https://universe.roboflow.com/peter-oropeza/cotssss/dataset/7
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 15, 2022
    Dataset authored and provided by
    Peter Oropeza
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Starfish Bounding Boxes
    Description

    COTSSSS

    ## Overview
    
    COTSSSS is a dataset for object detection tasks - it contains Starfish annotations for 5,923 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  16. R

    Dumb Dataset

    • universe.roboflow.com
    zip
    Updated Mar 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hello (2023). Dumb Dataset [Dataset]. https://universe.roboflow.com/hello-7gpxt/dumb-tabxk/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 25, 2023
    Dataset authored and provided by
    hello
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Pole Polygons
    Description

    Dumb

    ## Overview
    
    Dumb is a dataset for instance segmentation tasks - it contains Pole annotations for 300 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  17. Orange dataset table

    • figshare.com
    xlsx
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 4, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Rui Simões
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

    Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

  18. Pre-Crime-Dataset (Early Stages of Crimes)

    • kaggle.com
    zip
    Updated Jul 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shenal Pelpola (2022). Pre-Crime-Dataset (Early Stages of Crimes) [Dataset]. https://www.kaggle.com/datasets/shenalpelpola123/precrimedataset-early-stages-of-crimes
    Explore at:
    zip(895736697 bytes)Available download formats
    Dataset updated
    Jul 26, 2022
    Authors
    Shenal Pelpola
    Description

    The Dataset used for creating the pre-crime dataset

    UCF Crime – The dataset introduced in the research by (Sultani, Chen and Shah, 2018) contains 128 hours of untrimmed videos containing 13 different anomalies which includes abuse, arrest, arson, assault, road accident, explosions, fighting, robbery, shooting, stealing, shoplifting and vandalism. Shoplifting, stealing, robbery and normal video folders were selected to create the dataset

  19. Oracle Database metrics

    • kaggle.com
    zip
    Updated Aug 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Timerkhanov Yuriy (2020). Oracle Database metrics [Dataset]. https://www.kaggle.com/datasets/timerkhanovyuriy/oracle-database-metrics
    Explore at:
    zip(2792848 bytes)Available download formats
    Dataset updated
    Aug 20, 2020
    Authors
    Timerkhanov Yuriy
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Timerkhanov Yuriy

    Released under CC0: Public Domain

    Contents

  20. Covid-19 variants survival data

    • kaggle.com
    zip
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Massock Batalong Maurice Blaise (2025). Covid-19 variants survival data [Dataset]. https://www.kaggle.com/datasets/lumierebatalong/covid-19-variants-survival-data
    Explore at:
    zip(216589 bytes)Available download formats
    Dataset updated
    Jan 2, 2025
    Authors
    Massock Batalong Maurice Blaise
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Overview:

    This dataset provides a unique resource for researchers and data scientists interested in the global dynamics of the COVID-19 pandemic. It focuses on the impact of different SARS-CoV-2 variants and mutations on the duration of local epidemics. By combining variant information with epidemiological data, this dataset allows for a comprehensive analysis of factors influencing the trajectory of the pandemic.

    Key Features:

    • Global Coverage: Includes data from multiple countries.
    • Variant-Specific Information: Detailed records for various SARS-CoV-2 variants.
    • Epidemic Duration: Data on the duration of local epidemics, accounting for right-censoring.
    • Epidemiological Variables: Includes mortality rates, a proxy for R0, transmission proxies, and other pertinent variables.
    • Geographical characteristics: Include a continent variable for exploring geographical patterns
    • Time varying variables: Include the number of waves and the number of variants in the different countries for more in-depth exploration.

    Data Source: The data combines information from the Johns Hopkins University COVID-19 dataset (confirmed_cases.csv and deaths_cases.csv) and the covariants.org dataset (variants.csv). The dataset you see here is the combination of two datasets from Johns Hopkins University and covariants.org.

    Questions to Inspire Users:

    This dataset is designed for a diverse set of analytical questions. Here are some ideas to inspire the Kaggle community:

    Survival Analysis:

    1. How do different SARS-CoV-2 variants influence the duration of local epidemics?
    2. Which factors (mortality, R0, etc.) are most strongly associated with shorter or longer epidemic durations?
    3. Does the type of variant/mutation (mutation,S, Omicron, Delta, Other) have a significant impact on epidemic duration?
    4. Is there a geographical pattern to the duration of epidemics?

    Epidemiological Analysis:

    1. How do local transmission rates (represented by our proxy of R0) affect the duration of an epidemic?
    2. Do countries with higher mortality rates have different patterns of epidemic progression?
    3. How can we predict the duration of an epidemic based on its initial characteristics?
    4. How does the number of epidemic waves impact the duration of an epidemic?
    5. Does the number of variants in a country affect the duration of an épidémie?

    Data Science/Machine Learning:

    1. Can we develop a machine learning model to predict the duration of an epidemic?
    2. What features have the best predictive power ?
    3. Can we identify clusters of variants/regions with similar epidemic patterns?
    4. Are there interactions between variables that can explain the non-linearities that we have identified ?
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Yuvraj Singh (2024). Agriculture-Plan-Diseases-QA-Pairs-Dataset [Dataset]. https://huggingface.co/datasets/YuvrajSingh9886/Agriculture-Plan-Diseases-QA-Pairs-Dataset

Agriculture-Plan-Diseases-QA-Pairs-Dataset

YuvrajSingh9886/Agriculture-Plan-Diseases-QA-Pairs-Dataset

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 30, 2024
Authors
Yuvraj Singh
Description

YuvrajSingh9886/Agriculture-Plan-Diseases-QA-Pairs-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

Search
Clear search
Close search
Google apps
Main menu