100+ datasets found
  1. Manufacturing Dataset

    • kaggle.com
    zip
    Updated Aug 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shreshth Vashisht (2024). Manufacturing Dataset [Dataset]. https://www.kaggle.com/datasets/shreshthvashisht/manufacturing-dataset
    Explore at:
    zip(108377 bytes)Available download formats
    Dataset updated
    Aug 23, 2024
    Authors
    Shreshth Vashisht
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Shreshth Vashisht

    Released under Apache 2.0

    Contents

  2. Orange dataset table

    • figshare.com
    xlsx
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rui Simões (2022). Orange dataset table [Dataset]. http://doi.org/10.6084/m9.figshare.19146410.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 4, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Rui Simões
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, it does not contain any missing values and data was standardized across features. The small number of samples prevented a full and strong statistical analysis of the results. Nevertheless, it allowed the identification of relevant hidden patterns and trends.

    Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) with instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using the Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments were performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.

  3. Car data sets(UCI)

    • kaggle.com
    zip
    Updated Mar 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nikhil Sharma (2025). Car data sets(UCI) [Dataset]. https://www.kaggle.com/datasets/nikhil1sharma/car-data-setsuci
    Explore at:
    zip(7766 bytes)Available download formats
    Dataset updated
    Mar 22, 2025
    Authors
    Nikhil Sharma
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Nikhil Sharma

    Released under CC0: Public Domain

    Contents

  4. a

    Dataset Log

    • data-uvalibrary.opendata.arcgis.com
    • opendata.charlottesville.org
    • +1more
    Updated Oct 26, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    City of Charlottesville (2017). Dataset Log [Dataset]. https://data-uvalibrary.opendata.arcgis.com/datasets/charlottesville::dataset-log
    Explore at:
    Dataset updated
    Oct 26, 2017
    Dataset authored and provided by
    City of Charlottesville
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Description

    Added new dataset OpenDataLog. The dataset stores detailed information regarding issues with the open data portal, new or changes to datasets on the portal as well as other information related to the City's Open Data Portal

  5. d

    Original Vector Datasets for Hawaii StreamStats

    • catalog.data.gov
    • search.dataone.org
    • +2more
    Updated Oct 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Original Vector Datasets for Hawaii StreamStats [Dataset]. https://catalog.data.gov/dataset/original-vector-datasets-for-hawaii-streamstats
    Explore at:
    Dataset updated
    Oct 22, 2025
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Hawaii
    Description

    These datasets each consist of a folder containing a personal geodatabase of the NHD, and shapefiles used in the HydroDEM process. These files are provided as a means to document exactly which lines were used to develop the HydroDEMs. Each folder contains a line shapefile named for the 8-digit HUC code, containing the NHD flowlines that comprise the coastline for that island. The “hydrolines.shp” shapefile contains the lines that were burned into the DEM. These lines were selected from the NHD flowlines, with some minor editing in places. The “wbpolys.shp” shapefile contains the water-body polygons that were selected from the NHD and used in the bathymetric gradient process. The folders for HUCs 20010000 (Hawaii) and 20020000 (Maui) also contain a “walls.shp” shapefile, which contains the lines that were superimposed on the surface as “walls.”

  6. N

    Dataset for Kiawah Island, SC Census Bureau Demographics and Population...

    • neilsberg.com
    Updated Jul 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neilsberg Research (2024). Dataset for Kiawah Island, SC Census Bureau Demographics and Population Distribution Across Age // 2024 Edition [Dataset]. https://www.neilsberg.com/research/datasets/b79be6a5-5460-11ee-804b-3860777c1fe6/
    Explore at:
    Dataset updated
    Jul 24, 2024
    Dataset authored and provided by
    Neilsberg Research
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Kiawah Island, South Carolina
    Dataset funded by
    Neilsberg Research
    Description
    About this dataset

    Context

    The dataset tabulates the Kiawah Island population by age. The dataset can be utilized to understand the age distribution and demographics of Kiawah Island.

    Content

    The dataset constitues the following three datasets

    • Kiawah Island, SC Age Group Population Dataset: A complete breakdown of Kiawah Island age demographics from 0 to 85 years, distributed across 18 age groups
    • Kiawah Island, SC Age Cohorts Dataset: Children, Working Adults, and Seniors in Kiawah Island - Population and Percentage Analysis
    • Kiawah Island, SC Population Pyramid Dataset: Age Groups, Male and Female Population, and Total Population for Demographics Analysis

    Good to know

    Margin of Error

    Data in the dataset are based on the estimates and are subject to sampling variability and thus a margin of error. Neilsberg Research recommends using caution when presening these estimates in your research.

    Custom data

    If you do need custom data for any of your research project, report or presentation, you can contact our research staff at research@neilsberg.com for a feasibility of a custom tabulation on a fee-for-service basis.

    Inspiration

    Neilsberg Research Team curates, analyze and publishes demographics and economic data from a variety of public and proprietary sources, each of which often includes multiple surveys and programs. The large majority of Neilsberg Research aggregated datasets and insights is made available for free download at https://www.neilsberg.com/research/.

  7. m

    Datasets for HGS paper

    • data.mendeley.com
    Updated Aug 16, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    He Zhang (2019). Datasets for HGS paper [Dataset]. http://doi.org/10.17632/bymz6hdsfh.1
    Explore at:
    Dataset updated
    Aug 16, 2019
    Authors
    He Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Here are the 143 datasets in "arff" format used in the HGS paper.

  8. Data sets

    • figshare.com
    xlsx
    Updated Aug 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    McKay Cavanaugh (2020). Data sets [Dataset]. http://doi.org/10.6084/m9.figshare.12783944.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 21, 2020
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    McKay Cavanaugh
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    All raw data sets

  9. Boolean DataSet

    • kaggle.com
    zip
    Updated Feb 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Singh Prince Rinku (2024). Boolean DataSet [Dataset]. https://www.kaggle.com/datasets/singhprincerinku/boolean-dataset
    Explore at:
    zip(7000 bytes)Available download formats
    Dataset updated
    Feb 22, 2024
    Authors
    Singh Prince Rinku
    Description

    Dataset

    This dataset was created by Singh Prince Rinku

    Released under Other (specified in description)

    Contents

  10. X-Ray microtomography for ant taxonomy: An exploration and case study with...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    • +2more
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francisco Hita Garcia; Georg Fischer; Cong Liu; Tracy L. Audisio; Gary D. Alpert; Brian L. Fisher; Evan P. Economo (2023). X-Ray microtomography for ant taxonomy: An exploration and case study with two new Terataner (Hymenoptera, Formicidae, Myrmicinae) species from Madagascar [Dataset]. http://doi.org/10.1371/journal.pone.0172641
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Francisco Hita Garcia; Georg Fischer; Cong Liu; Tracy L. Audisio; Gary D. Alpert; Brian L. Fisher; Evan P. Economo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We explore the potential of x-ray micro computed tomography (μCT) for the field of ant taxonomy by using it to enhance the descriptions of two remarkable new species of the ant genus Terataner: T. balrog sp. n. and T. nymeria sp. n.. We provide an illustrated worker-based species identification key for all species found on Madagascar, as well as detailed taxonomic descriptions, which include diagnoses, discussions, measurements, natural history data, high-quality montage images and distribution maps for both new species. In addition to conventional morphological examination, we have used virtual reconstructions based on volumetric μCT scanning data for the species descriptions. We also include 3D PDFs, still images of virtual reconstructions, and 3D rotation videos for both holotype workers and one paratype queen. The complete μCT datasets have been made available online (Dryad, https://datadryad.org) and represent the first cybertypes in ants (and insects). We discuss the potential of μCT scanning and critically assess the usefulness of cybertypes for ant taxonomy.

  11. H

    TED: Original and Replication Datasets

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Mar 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Irfan Nooruddin; Thomas Edward Flores; Gabriella Lloyd (2022). TED: Original and Replication Datasets [Dataset]. http://doi.org/10.7910/DVN/UXFY88
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 30, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Irfan Nooruddin; Thomas Edward Flores; Gabriella Lloyd
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    1945 - 2015
    Description

    TED leader-level data for 1,733 heads of government from 1945-2015

  12. r

    concept_relationship

    • redivis.com
    • stanford.redivis.com
    Updated Jan 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shah Lab (2025). concept_relationship [Dataset]. https://redivis.com/datasets/48nr-frxd97exb
    Explore at:
    Dataset updated
    Jan 9, 2025
    Dataset authored and provided by
    Shah Lab
    Time period covered
    1970 - 2099
    Description

    The table concept_relationship is part of the dataset MedAlign, available at https://stanford.redivis.com/datasets/48nr-frxd97exb. It contains 58831134 rows across 8 variables.

  13. h

    AI-Generated-vs-Real-Images-Datasets

    • huggingface.co
    Updated Aug 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hem Bahadur Gurung (2025). AI-Generated-vs-Real-Images-Datasets [Dataset]. https://huggingface.co/datasets/Hemg/AI-Generated-vs-Real-Images-Datasets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 19, 2025
    Authors
    Hem Bahadur Gurung
    Description

    Dataset Card for "AI-Generated-vs-Real-Images-Datasets"

    More Information needed

  14. h

    fmb

    • huggingface.co
    • tensorflow.org
    Updated Aug 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    LeRobot (2024). fmb [Dataset]. https://huggingface.co/datasets/lerobot/fmb
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 6, 2024
    Dataset authored and provided by
    LeRobot
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset was created using LeRobot.

      Dataset Structure
    

    meta/info.json: { "codebase_version": "v2.0", "robot_type": "unknown", "total_episodes": 1804, "total_frames": 338188, "total_tasks":24, "total_videos": 7216, "total_chunks": 2, "chunks_size": 1000, "fps": 10, "splits": { "train": "0:1804" }, "data_path": "data/chunk-{episode_chunk:03d}/episode_{episode_index:06d}.parquet", "video_path":… See the full description on the dataset page: https://huggingface.co/datasets/lerobot/fmb.

  15. m

    Dataset for Crop Pest and Disease Detection

    • data.mendeley.com
    Updated Apr 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick Mensah Kwabena (2023). Dataset for Crop Pest and Disease Detection [Dataset]. http://doi.org/10.17632/bwh3zbpkpv.1
    Explore at:
    Dataset updated
    Apr 26, 2023
    Authors
    Patrick Mensah Kwabena
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The application of Artificial Intelligence (AI) has been evident in the agricultural sector recently. The main goal of AI in agriculture is to improve crop yield, control crop pests/diseases, and reduce cost. The agricultural sector in developing countries faces severe in the form of disease and pest infestation, the knowledge gap between farmers and technology, and a lack of storage facilities, among others. To help address some of these challenges, this work presents crop pests/disease datasets sourced from local farms in Ghana. The dataset is presented in two folds; the raw images which consists of 24,881 images ( 6,549-Cashew, 7,508-Cassava, 5,389-Maize, and 5,435-Tomato) and augmented images which is further split into train and test set consists of 102,976 images (25,811-Cashew, 26,330-Cassava, 23,657-Maize, and 27,178-Tomato), categorized into 22 classes. All images are de-identified, validated by expert plant virologists, and freely available for use by the research community.

  16. d

    Biodiversity by County - Distribution of Animals, Plants and Natural...

    • catalog.data.gov
    • datasets.ai
    • +2more
    Updated Jul 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State of New York (2025). Biodiversity by County - Distribution of Animals, Plants and Natural Communities [Dataset]. https://catalog.data.gov/dataset/biodiversity-by-county-distribution-of-animals-plants-and-natural-communities
    Explore at:
    Dataset updated
    Jul 12, 2025
    Dataset provided by
    State of New York
    Description

    The NYS Department of Environmental Conservation (DEC) collects and maintains several datasets on the locations, distribution and status of species of plants and animals. Information on distribution by county from the following three databases was extracted and compiled into this dataset. First, the New York Natural Heritage Program biodiversity database: Rare animals, rare plants, and significant natural communities. Significant natural communities are rare or high-quality wetlands, forests, grasslands, ponds, streams, and other types of habitats. Next, the 2nd NYS Breeding Bird Atlas Project database: Birds documented as breeding during the atlas project from 2000-2005. And last, DEC’s NYS Reptile and Amphibian Database: Reptiles and amphibians; most records are from the NYS Amphibian & Reptile Atlas Project (Herp Atlas) from 1990-1999.

  17. FIRM Panels

    • gstore.unm.edu
    • datasets.ai
    • +2more
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Federal Emergency Management Agency, FIRM Panels [Dataset]. https://gstore.unm.edu/apps/rgis/datasets/e2b82ae8-5d62-4bdf-8f4a-04a03107294e/metadata/ISO-19115:2003.html
    Explore at:
    Dataset provided by
    Federal Emergency Management Agencyhttp://www.fema.gov/
    Time period covered
    Jan 16, 2009
    Area covered
    West Bound -108.312507806935 East Bound -103.0000000149 North Bound 36.2500001138827 South Bound 31.9999999822822
    Description

    The National Flood Hazard Layer (NFHL) data incorporates all Digital Flood Insurance Rate Map(DFIRM) databases published by FEMA, and any Letters Of Map Revision (LOMRs) that have been issued against those databases since their publication date. The DFIRM Database is the digital, geospatial version of the flood hazard information shown on the published paper Flood Insurance Rate Maps(FIRMs). The primary risk classifications used are the 1-percent-annual-chance flood event, the 0.2-percent-annual-chance flood event, and areas of minimal flood risk. The NFHL data are derived from Flood Insurance Studies (FISs), previously published Flood Insurance Rate Maps (FIRMs), flood hazard analyses performed in support of the FISs and FIRMs, and new mapping data where available. The FISs and FIRMs are published by the Federal Emergency Management Agency (FEMA). The specifications for the horizontal control of DFIRM data are consistent with those required for mapping at a scale of 1:12,000. The NFHL data contain layers in the Standard DFIRM datasets except for S_Label_Pt and S_Label_Ld. The NFHL is available as State or US Territory data sets. Each State or Territory data set consists of all DFIRMs and corresponding LOMRs available on the publication date of the data set.

  18. w

    County-level Data Sets

    • data.wu.ac.at
    • agdatacommons.nal.usda.gov
    • +2more
    html, xls
    Updated Mar 19, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Agriculture (2014). County-level Data Sets [Dataset]. https://data.wu.ac.at/schema/data_gov/NmZkYWQ5MzQtNzVhNC00NGQzLWFjZWQtMmE2OWEyODkzNTZk
    Explore at:
    html, xlsAvailable download formats
    Dataset updated
    Mar 19, 2014
    Dataset provided by
    Department of Agriculture
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    c056e41d4875571d6a50c66832f696d7914fa6ae
    Description

    Socioeconomic indicators like the poverty rate, population change, unemployment rate, and education levels vary across the nation. ERS has compiled the latest data on these measures into a mapping and data display/download application that allows users to identify and compare States and counties on these indicators.

  19. d

    Basic raw materials 1:200 000 (DMIRS-043) - Datasets - data.wa.gov.au

    • catalogue.data.wa.gov.au
    Updated Nov 15, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Basic raw materials 1:200 000 (DMIRS-043) - Datasets - data.wa.gov.au [Dataset]. https://catalogue.data.wa.gov.au/dataset/basic-raw-materials-1-200-000
    Explore at:
    Dataset updated
    Nov 15, 2019
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Western Australia
    Description

    Basic raw materials 1:200 000 contains geoscientific data relating to basic raw material resources including clay, limesand, limestone, hard rock aggregate, sand and gravel. Show full description

  20. Pills dataset

    • kaggle.com
    zip
    Updated Oct 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JAYAPRAKASHPONDY (2023). Pills dataset [Dataset]. https://www.kaggle.com/datasets/jayaprakashpondy/pills-dataset
    Explore at:
    zip(141223154 bytes)Available download formats
    Dataset updated
    Oct 17, 2023
    Authors
    JAYAPRAKASHPONDY
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by JAYAPRAKASHPONDY

    Released under CC0: Public Domain

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Shreshth Vashisht (2024). Manufacturing Dataset [Dataset]. https://www.kaggle.com/datasets/shreshthvashisht/manufacturing-dataset
Organization logo

Manufacturing Dataset

Explore at:
zip(108377 bytes)Available download formats
Dataset updated
Aug 23, 2024
Authors
Shreshth Vashisht
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset

This dataset was created by Shreshth Vashisht

Released under Apache 2.0

Contents

Search
Clear search
Close search
Google apps
Main menu