100+ datasets found
  1. deep learning image

    • kaggle.com
    zip
    Updated Aug 3, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rishabh (2019). deep learning image [Dataset]. https://www.kaggle.com/datasets/rishabh084/deep-learning-image
    Explore at:
    zip(1900170110 bytes)Available download formats
    Dataset updated
    Aug 3, 2019
    Authors
    Rishabh
    Description

    Dataset

    This dataset was created by Rishabh

    Contents

  2. Skin diseases image dataset

    • kaggle.com
    zip
    Updated Aug 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ismail Hossain (2021). Skin diseases image dataset [Dataset]. https://www.kaggle.com/datasets/ismailpromus/skin-diseases-image-dataset
    Explore at:
    zip(5568507391 bytes)Available download formats
    Dataset updated
    Aug 16, 2021
    Authors
    Ismail Hossain
    Description

    Dataset

    This dataset was created by Ismail Hossain

    Released under Data files © Original Authors

    Contents

  3. dl deep learning

    • kaggle.com
    zip
    Updated Dec 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    byvi19 (2024). dl deep learning [Dataset]. https://www.kaggle.com/datasets/byvi19/dl-deep-learning
    Explore at:
    zip(2948127439 bytes)Available download formats
    Dataset updated
    Dec 7, 2024
    Authors
    byvi19
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by byvi19

    Released under Apache 2.0

    Contents

  4. Deep Learning | Fall 24

    • kaggle.com
    zip
    Updated Dec 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Khaled Taha (2024). Deep Learning | Fall 24 [Dataset]. https://www.kaggle.com/datasets/khaledtahaa/deep-learning-fall-24/code
    Explore at:
    zip(413568 bytes)Available download formats
    Dataset updated
    Dec 25, 2024
    Authors
    Khaled Taha
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Khaled Taha

    Released under Apache 2.0

    Contents

  5. Data from: Cyberbullying detection

    • kaggle.com
    Updated Jul 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaurav Bhattacharya (2023). Cyberbullying detection [Dataset]. https://www.kaggle.com/datasets/gbiamgaurav/cyberbullying-detection
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 12, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gaurav Bhattacharya
    Description

    Context This dataset is a collection of datasets from different sources related to the automatic detection of cyber-bullying. The data is from different social media platforms like Kaggle, Twitter, Wikipedia Talk pages and YouTube. The data contain text and labeled as bullying or not. The data contains different types of cyber-bullying like hate speech, aggression, insults and toxicity.

    Content The data is from different social media platforms like Kaggle, Twitter, Wikipedia Talk pages and YouTube. The data contain text and labeled as bullying or not. The data contains different types of cyber-bullying like hate speech, aggression, insults and toxicity.

    Acknowledgements Elsafoury, Fatma (2020), “Cyberbullying datasets”, Mendeley Data, V1, doi: 10.17632/jf4pzyvnpj.1

  6. deep-learning-metricsp

    • kaggle.com
    zip
    Updated Feb 7, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shirley Ontaneda (2024). deep-learning-metricsp [Dataset]. https://www.kaggle.com/datasets/shirleyontaneda/deep-learning-metricsp
    Explore at:
    zip(629194 bytes)Available download formats
    Dataset updated
    Feb 7, 2024
    Authors
    Shirley Ontaneda
    Description

    Dataset

    This dataset was created by Shirley Ontaneda

    Contents

  7. Machine Learning Projects

    • kaggle.com
    zip
    Updated Apr 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deeksha3@ (2024). Machine Learning Projects [Dataset]. https://www.kaggle.com/datasets/deekshaa1/machine-learning-projects
    Explore at:
    zip(43093 bytes)Available download formats
    Dataset updated
    Apr 18, 2024
    Authors
    Deeksha3@
    Description

    Dataset

    This dataset was created by Deeksha3@

    Contents

  8. Meta Kaggle Code

    • kaggle.com
    zip
    Updated Mar 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaggle (2025). Meta Kaggle Code [Dataset]. https://www.kaggle.com/datasets/kaggle/meta-kaggle-code/code
    Explore at:
    zip(133186454988 bytes)Available download formats
    Dataset updated
    Mar 20, 2025
    Dataset authored and provided by
    Kagglehttp://kaggle.com/
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Explore our public notebook content!

    Meta Kaggle Code is an extension to our popular Meta Kaggle dataset. This extension contains all the raw source code from hundreds of thousands of public, Apache 2.0 licensed Python and R notebooks versions on Kaggle used to analyze Datasets, make submissions to Competitions, and more. This represents nearly a decade of data spanning a period of tremendous evolution in the ways ML work is done.

    Why we’re releasing this dataset

    By collecting all of this code created by Kaggle’s community in one dataset, we hope to make it easier for the world to research and share insights about trends in our industry. With the growing significance of AI-assisted development, we expect this data can also be used to fine-tune models for ML-specific code generation tasks.

    Meta Kaggle for Code is also a continuation of our commitment to open data and research. This new dataset is a companion to Meta Kaggle which we originally released in 2016. On top of Meta Kaggle, our community has shared nearly 1,000 public code examples. Research papers written using Meta Kaggle have examined how data scientists collaboratively solve problems, analyzed overfitting in machine learning competitions, compared discussions between Kaggle and Stack Overflow communities, and more.

    The best part is Meta Kaggle enriches Meta Kaggle for Code. By joining the datasets together, you can easily understand which competitions code was run against, the progression tier of the code’s author, how many votes a notebook had, what kinds of comments it received, and much, much more. We hope the new potential for uncovering deep insights into how ML code is written feels just as limitless to you as it does to us!

    Sensitive data

    While we have made an attempt to filter out notebooks containing potentially sensitive information published by Kaggle users, the dataset may still contain such information. Research, publications, applications, etc. relying on this data should only use or report on publicly available, non-sensitive information.

    Joining with Meta Kaggle

    The files contained here are a subset of the KernelVersions in Meta Kaggle. The file names match the ids in the KernelVersions csv file. Whereas Meta Kaggle contains data for all interactive and commit sessions, Meta Kaggle Code contains only data for commit sessions.

    File organization

    The files are organized into a two-level directory structure. Each top level folder contains up to 1 million files, e.g. - folder 123 contains all versions from 123,000,000 to 123,999,999. Each sub folder contains up to 1 thousand files, e.g. - 123/456 contains all versions from 123,456,000 to 123,456,999. In practice, each folder will have many fewer than 1 thousand files due to private and interactive sessions.

    The ipynb files in this dataset hosted on Kaggle do not contain the output cells. If the outputs are required, the full set of ipynbs with the outputs embedded can be obtained from this public GCS bucket: kaggle-meta-kaggle-code-downloads. Note that this is a "requester pays" bucket. This means you will need a GCP account with billing enabled to download. Learn more here: https://cloud.google.com/storage/docs/requester-pays

    Questions / Comments

    We love feedback! Let us know in the Discussion tab.

    Happy Kaggling!

  9. Santiago Deep Learning Assignment 2

    • kaggle.com
    zip
    Updated Jan 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Madhushree Sannigrahi (2024). Santiago Deep Learning Assignment 2 [Dataset]. https://www.kaggle.com/datasets/madhushreesannigrahi/santiago-deep-learning-assignment-2/code
    Explore at:
    zip(2591752731 bytes)Available download formats
    Dataset updated
    Jan 16, 2024
    Authors
    Madhushree Sannigrahi
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Madhushree Sannigrahi

    Released under MIT

    Contents

  10. INPE: Deep Learning - Dataset of final project

    • kaggle.com
    zip
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dener Mendes (2024). INPE: Deep Learning - Dataset of final project [Dataset]. https://www.kaggle.com/datasets/mendesdener/inpe-deep-learning-dataset-of-final-project/suggestions?status=pending&yourSuggestions=true
    Explore at:
    zip(97715472 bytes)Available download formats
    Dataset updated
    Dec 12, 2024
    Authors
    Dener Mendes
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Dener Mendes

    Released under Apache 2.0

    Contents

  11. 2019 Kaggle Machine Learning & Data Science Survey

    • kaggle.com
    Updated Dec 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    EK (2020). 2019 Kaggle Machine Learning & Data Science Survey [Dataset]. https://www.kaggle.com/eswarankrishnasamy/2019-kaggle-machine-learning-data-science-survey/notebooks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 22, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    EK
    Description

    Overview Welcome to Kaggle's third annual Machine Learning and Data Science Survey ― and our second-ever survey data challenge. You can read our executive summary here.

    This year, as in 2017 and 2018, we set out to conduct an industry-wide survey that presents a truly comprehensive view of the state of data science and machine learning. The survey was live for three weeks in October, and after cleaning the data we finished with 19,717 responses!

    There's a lot to explore here. The results include raw numbers about who is working with data, what’s happening with machine learning in different industries, and the best ways for new data scientists to break into the field. We've published the data in as raw a format as possible without compromising anonymization, which makes it an unusual example of a survey dataset.

    Challenge This year Kaggle is launching the second annual Data Science Survey Challenge, where we will be awarding a prize pool of $30,000 to notebook authors who tell a rich story about a subset of the data science and machine learning community.

    In our third year running this survey, we were once again awed by the global, diverse, and dynamic nature of the data science and machine learning industry. This survey data EDA provides an overview of the industry on an aggregate scale, but it also leaves us wanting to know more about the many specific communities comprised within the survey. For that reason, we’re inviting the Kaggle community to dive deep into the survey datasets and help us tell the diverse stories of data scientists from around the world.

    The challenge objective: tell a data story about a subset of the data science community represented in this survey, through a combination of both narrative text and data exploration. A “story” could be defined any number of ways, and that’s deliberate. The challenge is to deeply explore (through data) the impact, priorities, or concerns of a specific group of data science and machine learning practitioners. That group can be defined in the macro (for example: anyone who does most of their coding in Python) or the micro (for example: female data science students studying machine learning in masters programs). This is an opportunity to be creative and tell the story of a community you identify with or are passionate about!

    Submissions will be evaluated on the following:

    Composition - Is there a clear narrative thread to the story that’s articulated and supported by data? The subject should be well defined, well researched, and well supported through the use of data and visualizations. Originality - Does the reader learn something new through this submission? Or is the reader challenged to think about something in a new way? A great entry will be informative, thought provoking, and fresh all at the same time. Documentation - Are your code, and notebook, and additional data sources well documented so a reader can understand what you did? Are your sources clearly cited? A high quality analysis should be concise and clear at each step so the rationale is easy to follow and the process is reproducible To be valid, a submission must be contained in one notebook, made public on or before the submission deadline. Participants are free to use any datasets in addition to the Kaggle Data Science survey, but those datasets must also be publicly available on Kaggle by the deadline for a submission to be valid.

    How to Participate To make a submission, complete the submission form. Only one submission will be judged per participant, so if you make multiple submissions we will review the last (most recent) entry.

    No submission is necessary for the Weekly Notebook Award. To be eligible, a notebook must be public and use the 2019 Data Science Survey as a data source.

    Submission deadline: 11:59PM UTC, December 2nd, 2019.

    Survey Methodology This survey received 19,717 usable respondents from 171 countries and territories. If a country or territory received less than 50 respondents, we grouped them into a group named “Other” for anonymity.

    We excluded respondents who were flagged by our survey system as “Spam”.

    Most of our respondents were found primarily through Kaggle channels, like our email list, discussion forums and social media channels.

    The survey was live from October 8th to October 28th. We allowed respondents to complete the survey at any time during that window. The median response time for those who participated in the survey was approximately 10 minutes.

    Not every question was shown to every respondent. You can learn more about the different segments we used in the survey_schema.csv file. In general, respondents with more experience were asked more questions and respondents with less experience were asked less questions.

    To protect the respondents’ identity, the answers to multiple choice questions have been separated into a separate data file from the open-ended responses. We do not provide a key to match up the multiple choice and free form responses. Further, the free form responses have been randomized column-wise such that the responses that appear on the same row did not necessarily come from the same survey-taker.

    Multiple choice single response questions fit into individual columns whereas multiple choice multiple response questions were split into multiple columns. Text responses were encoded to protect user privacy and countries with fewer than 50 respondents were grouped into the category "other".

    Data has been released under a CC 2.0 license: https://creativecommons.org/licenses/by/2.0/

  12. Deep Learning

    • kaggle.com
    Updated May 10, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kiran Sangamnere (2023). Deep Learning [Dataset]. https://www.kaggle.com/datasets/kiransangamnere/deep-learning/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 10, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Kiran Sangamnere
    Description

    Dataset

    This dataset was created by Kiran Sangamnere

    Contents

  13. notMNIST

    • kaggle.com
    • opendatalab.com
    • +3more
    Updated Feb 14, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    jwjohnson314 (2018). notMNIST [Dataset]. https://www.kaggle.com/datasets/jwjohnson314/notmnist/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 14, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    jwjohnson314
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The MNIST dataset is one of the best known image classification problems out there, and a veritable classic of the field of machine learning. This dataset is more challenging version of the same root problem: classifying letters from images. This is a multiclass classification dataset of glyphs of English letters A - J.

    This dataset is used extensively in the Udacity Deep Learning course, and is available in the Tensorflow Github repo (under Examples). I'm not aware of any license governing the use of this data, so I'm posting it here so that the community can use it with Kaggle kernels.

    Content

    notMNIST _large.zip is a large but dirty version of the dataset with 529,119 images, and notMNIST_small.zip is a small hand-cleaned version of the dataset, with 18726 images. The dataset was assembled by Yaroslav Bulatov, and can be obtained on his blog. According to this blog entry there is about a 6.5% label error rate on the large uncleaned dataset, and a 0.5% label error rate on the small hand-cleaned dataset.

    The two files each containing 28x28 grayscale images of letters A - J, organized into directories by letter. notMNIST_large.zip contains 529,119 images and notMNIST_small.zip contains 18726 images.

    Acknowledgements

    Thanks to Yaroslav Bulatov for putting together the dataset.

  14. Deep Learning 11 Frameworks

    • kaggle.com
    zip
    Updated Sep 21, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeff Hale (2018). Deep Learning 11 Frameworks [Dataset]. https://www.kaggle.com/datasets/discdiver/deep-learning-11-frameworks/suggestions
    Explore at:
    zip(2039 bytes)Available download formats
    Dataset updated
    Sep 21, 2018
    Authors
    Jeff Hale
    Description

    Dataset

    This dataset was created by Jeff Hale

    Contents

  15. Railway Track Fault Detection

    • kaggle.com
    Updated Jan 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Salman Ibne Eunus (2021). Railway Track Fault Detection [Dataset]. http://doi.org/10.34740/kaggle/dsv/1884733
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 27, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Salman Ibne Eunus
    Description

    Accidents due to defective railway lines and derailments are common disasters that are observed frequently in Southeast Asian countries. It is imperative to run proper diagnosis over the detection of such faults to prevent such accidents. However, manual detection of such faults periodically can be both time-consuming and costly. In this paper, we have proposed a Deep Learning (DL)-based algorithm for automatic fault detection in railway tracks, which we termed an Ensembled Convolutional Autoencoder ResNet-based Recurrent Neural Network (ECARRNet). We compared its output with existing DL techniques in the form of several pre-trained DL models to investigate railway tracks and determine whether they are defective or not while considering commonly prevalent faults such as—defects in rails and fasteners. Moreover, we manually collected the images from different railway tracks situated in Bangladesh and made our dataset. After comparing our proposed model with the existing models, we found that our proposed architecture has produced the highest accuracy among all the previously existing state-of-the-art (SOTA) architecture, with an accuracy of 93.28% on the full dataset.

    If you are using our dataset, please also cite our paper -

    MDPI and ACS Style Eunus, S.I.; Hossain, S.; Ridwan, A.E.M.; Adnan, A.; Islam, M.S.; Karim, D.Z.; Alam, G.R.; Uddin, J. ECARRNet: An Efficient LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection. AI 2024, 5, 482-503. https://doi.org/10.3390/ai5020024

    AMA Style Eunus SI, Hossain S, Ridwan AEM, Adnan A, Islam MS, Karim DZ, Alam GR, Uddin J. ECARRNet: An Efficient LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection. AI. 2024; 5(2):482-503. https://doi.org/10.3390/ai5020024

    Chicago/Turabian Style Eunus, Salman Ibne, Shahriar Hossain, A. E. M. Ridwan, Ashik Adnan, Md. Saiful Islam, Dewan Ziaul Karim, Golam Rabiul Alam, and Jia Uddin. 2024. "ECARRNet: An Efficient LSTM-Based Ensembled Deep Neural Network Architecture for Railway Fault Detection" AI 5, no. 2: 482-503. https://doi.org/10.3390/ai5020024

  16. Machine_Learning_Tutorial_Parte_1

    • kaggle.com
    zip
    Updated Feb 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nelson InCube (2020). Machine_Learning_Tutorial_Parte_1 [Dataset]. https://kaggle.com/nelsonpereira/machine-learning-tutorial-parte-1
    Explore at:
    zip(291852 bytes)Available download formats
    Dataset updated
    Feb 23, 2020
    Authors
    Nelson InCube
    Description

    Dataset

    This dataset was created by Nelson InCube

    Contents

  17. machine-learning

    • kaggle.com
    zip
    Updated Mar 9, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mishalle Ni (2022). machine-learning [Dataset]. https://www.kaggle.com/datasets/mishalleni/machinelearning/suggestions
    Explore at:
    zip(920141 bytes)Available download formats
    Dataset updated
    Mar 9, 2022
    Authors
    Mishalle Ni
    Description

    Dataset

    This dataset was created by Mishalle Ni

    Contents

  18. DoS_UDP_1

    • kaggle.com
    zip
    Updated Oct 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M. Raza Siddique (2023). DoS_UDP_1 [Dataset]. https://www.kaggle.com/razasiddique/dos-udp-1
    Explore at:
    zip(136330391 bytes)Available download formats
    Dataset updated
    Oct 1, 2023
    Authors
    M. Raza Siddique
    Description

    Dataset

    This dataset was created by M. Raza Siddique

    Contents

  19. machine learning

    • kaggle.com
    zip
    Updated Jun 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    like (2020). machine learning [Dataset]. https://www.kaggle.com/likewy/machine-learning
    Explore at:
    zip(31805 bytes)Available download formats
    Dataset updated
    Jun 25, 2020
    Authors
    like
    Description

    Dataset

    This dataset was created by like

    Contents

  20. PLOS Machine Learning Articles

    • kaggle.com
    zip
    Updated Apr 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nathan Karasch (2022). PLOS Machine Learning Articles [Dataset]. https://www.kaggle.com/nkarasch/plos-machine-learning-articles
    Explore at:
    zip(5812 bytes)Available download formats
    Dataset updated
    Apr 19, 2022
    Authors
    Nathan Karasch
    Description

    Dataset

    This dataset was created by Nathan Karasch

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Rishabh (2019). deep learning image [Dataset]. https://www.kaggle.com/datasets/rishabh084/deep-learning-image
Organization logo

deep learning image

Explore at:
zip(1900170110 bytes)Available download formats
Dataset updated
Aug 3, 2019
Authors
Rishabh
Description

Dataset

This dataset was created by Rishabh

Contents

Search
Clear search
Close search
Google apps
Main menu