16 datasets found
  1. Titanic-FM

    • kaggle.com
    Updated Jul 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fernando Meneses (2024). Titanic-FM [Dataset]. https://www.kaggle.com/datasets/fertmeneses/titanic-fm
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 30, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Fernando Meneses
    Description

    Titanic - Machine Learning from Disaster (Kaggle competition). This dataset contains datafiles for the Notebook Titanic/Kaggle -Full analysis šŸ•µšŸ½, by Fernando Meneses. It includes: training and testing datasets, the solution file, Leaderboard statistics and pre-trained results.

  2. Titanic Dataset

    • kaggle.com
    Updated Apr 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Mudasar Sabir (2025). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/mudasarsabir/titanic-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 25, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhammad Mudasar Sabir
    Description

    Description šŸ‘‹šŸ›³ļø Ahoy, welcome to Kaggle! You’re in the right place. This is the legendary Titanic ML competition – the best, first challenge for you to dive into ML competitions and familiarize yourself with how the Kaggle platform works.

    If you want to talk with other users about this competition, come join our Discord! We've got channels for competitions, job postings and career discussions, resources, and socializing with your fellow data scientists. Follow the link here: https://discord.gg/kaggle

    The competition is simple: use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

    Read on or watch the video below to explore more details. Once you’re ready to start competing, click on the "Join Competition button to create an account and gain access to the competition data. Then check out Alexis Cook’s Titanic Tutorial that walks you through step by step how to make your first submission!

    The Challenge The sinking of the Titanic is one of the most infamous shipwrecks in history.

    On April 15, 1912, during her maiden voyage, the widely considered ā€œunsinkableā€ RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew.

    While there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others.

    In this challenge, we ask you to build a predictive model that answers the question: ā€œwhat sorts of people were more likely to survive?ā€ using passenger data (ie name, age, gender, socio-economic class, etc).

    Recommended Tutorial We highly recommend Alexis Cook’s Titanic Tutorial that walks you through making your very first submission step by step and this starter notebook to get started.

    How Kaggle’s Competitions Work Join the Competition Read about the challenge description, accept the Competition Rules and gain access to the competition dataset. Get to Work Download the data, build models on it locally or on Kaggle Notebooks (our no-setup, customizable Jupyter Notebooks environment with free GPUs) and generate a prediction file. Make a Submission Upload your prediction as a submission on Kaggle and receive an accuracy score. Check the Leaderboard See how your model ranks against other Kagglers on our leaderboard. Improve Your Score Check out the discussion forum to find lots of tutorials and insights from other competitors. Kaggle Lingo Video You may run into unfamiliar lingo as you dig into the Kaggle discussion forums and public notebooks. Check out Dr. Rachael Tatman’s video on Kaggle Lingo to get up to speed!

    What Data Will I Use in This Competition? In this competition, you’ll gain access to two similar datasets that include passenger information like name, age, gender, socio-economic class, etc. One dataset is titled train.csv and the other is titled test.csv.

    Train.csv will contain the details of a subset of the passengers on board (891 to be exact) and importantly, will reveal whether they survived or not, also known as the ā€œground truthā€.

    The test.csv dataset contains similar information but does not disclose the ā€œground truthā€ for each passenger. It’s your job to predict these outcomes.

    Using the patterns you find in the train.csv data, predict whether the other 418 passengers on board (found in test.csv) survived.

    Check out the ā€œDataā€ tab to explore the datasets even further. Once you feel you’ve created a competitive model, submit it to Kaggle to see where your model stands on our leaderboard against other Kagglers.

    How to Submit your Prediction to Kaggle Once you’re ready to make a submission and get on the leaderboard:

    Click on the ā€œSubmit Predictionsā€ button

    Upload a CSV file in the submission file format. You’re able to submit 10 submissions a day.

    Submission File Format: You should submit a csv file with exactly 418 entries plus a header row. Your submission will show an error if you have extra columns (beyond PassengerId and Survived) or rows.

    The file should have exactly 2 columns:

    PassengerId (sorted in any order) Survived (contains your binary predictions: 1 for survived, 0 for deceased) Got it! I’m ready to get started. Where do I get help if I need it? For Competition Help: Titanic Discussion Forum Kaggle doesn’t have a dedicated team to help troubleshoot your code so you’ll typically find that you receive a response more quickly by asking your question in the appropriate forum. The forums are full of useful information on the data, metric, and different approaches. We encourage you to use the forums often. If you share your knowledge, you'll find that others will share a lot in turn!

    A Last Word on Kaggle Notebooks As we mentioned before, Kaggle Notebooks is our no-setup, customizable, Jupyter Notebooks environment with free GPUs and a huge repository ...

  3. A

    ā€˜Titanic: cleaned data’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ā€˜Titanic: cleaned data’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-cleaned-data-cbf4/dc9cd7ff/?iid=055-046&v=presentation
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ā€˜Titanic: cleaned data’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/jamesleslie/titanic-cleaned-data on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    Introduction

    This dataset was created in this notebook as part of a three-part series. The data is in machine-learning-ready format, with all missing values for the Age, Fare and Embarked columns having been imputed.

    Data imputation

    • Age: this column was imputed by using the median age for the passenger's title (Mr, Mrs, Dr etc).
    • Fare: the single missing value in this column was imputed using the median value for that passenger's class.
    • Embarked: the two missing values here were imputed using the Pandas backfill method.

    Usage

    This data is used in both the second and third parts of the series.

    --- Original source retains full ownership of the source dataset ---

  4. Preprocessed Titanic Survived Prediction Data

    • kaggle.com
    Updated Feb 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fethiye (2021). Preprocessed Titanic Survived Prediction Data [Dataset]. https://www.kaggle.com/fethiye/titanic-preprocessed-train-data/notebooks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 6, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Fethiye
    Description

    Context

    Data set was created by preprocessing (filling lost data, extracting new features) of Titanic - Machine Learning Disaster data set.

    Using this processed data set, the machine learning models can be applied directly.

    You can see preprocessing step in notebook: https://www.kaggle.com/fethiye/titanic-predict-survival-prediction

  5. Titanic

    • kaggle.com
    zip
    Updated Nov 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MHouellemont (2017). Titanic [Dataset]. https://www.kaggle.com/mhouellemont/titanic
    Explore at:
    zip(34877 bytes)Available download formats
    Dataset updated
    Nov 1, 2017
    Authors
    MHouellemont
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by MHouellemont

    Released under CC0: Public Domain

    Contents

  6. Titanic Leaderboard March 2023

    • kaggle.com
    Updated Apr 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lucas Antoine (2023). Titanic Leaderboard March 2023 [Dataset]. http://doi.org/10.34740/kaggle/dsv/5281032
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 3, 2023
    Dataset provided by
    Kaggle
    Authors
    Lucas Antoine
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset used in my šŸ›³ļø Titanic - Top 1% with KNN [0.81818] notebook. It contains all the leaderboard's entries from the Titanic - Machine Learning from Disaster competition in March 2023.

  7. Titanic Data Set

    • kaggle.com
    zip
    Updated Feb 23, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    diogo22santos (2018). Titanic Data Set [Dataset]. https://www.kaggle.com/diogo22santos/titanic-data-set
    Explore at:
    zip(22544 bytes)Available download formats
    Dataset updated
    Feb 23, 2018
    Authors
    diogo22santos
    Description

    Dataset

    This dataset was created by diogo22santos

    Released under Other (specified in description)

    Contents

  8. titanic

    • kaggle.com
    zip
    Updated Mar 12, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patil (2018). titanic [Dataset]. https://www.kaggle.com/patil4444/titanic
    Explore at:
    zip(33847 bytes)Available download formats
    Dataset updated
    Mar 12, 2018
    Authors
    Patil
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Patil

    Released under CC0: Public Domain

    Contents

    It contains the following files:

  9. Titanic

    • kaggle.com
    zip
    Updated Dec 5, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    karthik (2017). Titanic [Dataset]. https://www.kaggle.com/karthik10111/titanic
    Explore at:
    zip(9980 bytes)Available download formats
    Dataset updated
    Dec 5, 2017
    Authors
    karthik
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by karthik

    Released under Database: Open Database, Contents: Database Contents

    Contents

  10. Titanic Case Study

    • kaggle.com
    Updated Sep 6, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aishwarya Sharma (2019). Titanic Case Study [Dataset]. https://www.kaggle.com/aishadroit/titanic-case-study/notebooks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 6, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aishwarya Sharma
    Description

    Context

    Titanic Dataset for checking the chance of survivals among passengers travelling during the same time.

    Content

    How priorities are set for the people saved on board ? What factors responsible for the ship to sink ?

  11. O

    notebookcdg

    • opendatalab.com
    zip
    Updated Apr 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Waterloo (2023). notebookcdg [Dataset]. https://opendatalab.com/OpenDataLab/notebookcdg
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 2, 2023
    Dataset provided by
    IBM Research
    University of Waterloo
    University of Michigan
    JD.COM Silicon Valley Research Center
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Inspired by Wang et al. 2021, we decided to utilize the top-voted and well-documented Kaggle notebooks to construct the notebookCDGdataset We collected the top 10% highly-voted notebooks from the top 20 popular competitions on Kaggle (e.g. Titanic). We checked the data policy of each of the 20 competitions, none of them has copyright issues. We also contacted the Kaggle administrators to make sure our data collection complies with the platform’s policy. In total, we collected 3,944 notebooks as raw data. After data preprocessing, the final dataset contains 2,476 notebooks out of the 3,944 notebooks from the raw data. It has 28,625 code–documentation pairs. The overall code-to-markdown ratio is 2.2195

  12. Titanic: pickle save

    • kaggle.com
    zip
    Updated Nov 22, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carl McBride Ellis (2020). Titanic: pickle save [Dataset]. https://www.kaggle.com/carlmcbrideellis/titanic-pickle-save
    Explore at:
    zip(1178 bytes)Available download formats
    Dataset updated
    Nov 22, 2020
    Authors
    Carl McBride Ellis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a pickle file of the model used in the notebook "Titanic: some sex, a bit of class, and a tree..." for use in the notebook All in a pickle: Saving the Titanic.

  13. my-titanic

    • kaggle.com
    zip
    Updated Oct 21, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DavidS (2019). my-titanic [Dataset]. https://www.kaggle.com/davids1992/mytitanic
    Explore at:
    zip(22544 bytes)Available download formats
    Dataset updated
    Oct 21, 2019
    Authors
    DavidS
    Description

    Dataset

    This dataset was created by DavidS

    Contents

    It contains the following files:

  14. Titanic EDA Data

    • kaggle.com
    Updated Jul 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pranjal Yadav (2025). Titanic EDA Data [Dataset]. https://www.kaggle.com/datasets/pranjalyadav92905/titanic-eda-data/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pranjal Yadav
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset contains cleaned Titanic passenger data for EDA and machine learning tasks. Includes features like age, sex, class, fare, and family details. Ideal for survival prediction and beginner ML projects.

    šŸš€ Great for:

    Feature engineering

    Data visualization

    Classification modeling

    šŸ”„ Both train and test sets included.

    šŸ’¬ If you find this dataset helpful, please upvote and share your notebook!

  15. Titanic survivors

    • kaggle.com
    Updated Jul 6, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sawan Kumar Yadav (2020). Titanic survivors [Dataset]. https://www.kaggle.com/sawankumar1/titanic-survivors/notebooks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 6, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sawan Kumar Yadav
    Description

    Dataset

    This dataset was created by Sawan Kumar Yadav

    Contents

  16. EDA and STATISTCS on Titanic

    • kaggle.com
    zip
    Updated Dec 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ASAD KAREL (2020). EDA and STATISTCS on Titanic [Dataset]. https://www.kaggle.com/alahazrat/eda-and-statistcs-on-titanic
    Explore at:
    zip(2133571 bytes)Available download formats
    Dataset updated
    Dec 20, 2020
    Authors
    ASAD KAREL
    Description

    Dataset

    This dataset was created by ASAD KAREL

    Contents

    It contains the following files:

  17. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Fernando Meneses (2024). Titanic-FM [Dataset]. https://www.kaggle.com/datasets/fertmeneses/titanic-fm
Organization logo

Titanic-FM

Files for the Notebook Titanic/Kaggle -Full analysis šŸ•µšŸ½, by Fernando Meneses

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 30, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Fernando Meneses
Description

Titanic - Machine Learning from Disaster (Kaggle competition). This dataset contains datafiles for the Notebook Titanic/Kaggle -Full analysis šŸ•µšŸ½, by Fernando Meneses. It includes: training and testing datasets, the solution file, Leaderboard statistics and pre-trained results.

Search
Clear search
Close search
Google apps
Main menu