16 datasets found

Titanic-FM
kaggle.com
Updated Jul 30, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fernando Meneses (2024). Titanic-FM [Dataset]. https://www.kaggle.com/datasets/fertmeneses/titanic-fm
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 30, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Fernando Meneses
Description
Titanic - Machine Learning from Disaster (Kaggle competition). This dataset contains datafiles for the Notebook Titanic/Kaggle -Full analysis 🕵🏽, by Fernando Meneses. It includes: training and testing datasets, the solution file, Leaderboard statistics and pre-trained results.
Titanic Dataset
kaggle.com
Updated Apr 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad Mudasar Sabir (2025). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/mudasarsabir/titanic-dataset/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 25, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Muhammad Mudasar Sabir
Description
Description 👋🛳️ Ahoy, welcome to Kaggle! You’re in the right place. This is the legendary Titanic ML competition – the best, first challenge for you to dive into ML competitions and familiarize yourself with how the Kaggle platform works.

If you want to talk with other users about this competition, come join our Discord! We've got channels for competitions, job postings and career discussions, resources, and socializing with your fellow data scientists. Follow the link here: https://discord.gg/kaggle

The competition is simple: use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

Read on or watch the video below to explore more details. Once you’re ready to start competing, click on the "Join Competition button to create an account and gain access to the competition data. Then check out Alexis Cook’s Titanic Tutorial that walks you through step by step how to make your first submission!

The Challenge The sinking of the Titanic is one of the most infamous shipwrecks in history.

On April 15, 1912, during her maiden voyage, the widely considered “unsinkable” RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew.

While there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others.

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

Recommended Tutorial We highly recommend Alexis Cook’s Titanic Tutorial that walks you through making your very first submission step by step and this starter notebook to get started.

How Kaggle’s Competitions Work Join the Competition Read about the challenge description, accept the Competition Rules and gain access to the competition dataset. Get to Work Download the data, build models on it locally or on Kaggle Notebooks (our no-setup, customizable Jupyter Notebooks environment with free GPUs) and generate a prediction file. Make a Submission Upload your prediction as a submission on Kaggle and receive an accuracy score. Check the Leaderboard See how your model ranks against other Kagglers on our leaderboard. Improve Your Score Check out the discussion forum to find lots of tutorials and insights from other competitors. Kaggle Lingo Video You may run into unfamiliar lingo as you dig into the Kaggle discussion forums and public notebooks. Check out Dr. Rachael Tatman’s video on Kaggle Lingo to get up to speed!

What Data Will I Use in This Competition? In this competition, you’ll gain access to two similar datasets that include passenger information like name, age, gender, socio-economic class, etc. One dataset is titled train.csv and the other is titled test.csv.

Train.csv will contain the details of a subset of the passengers on board (891 to be exact) and importantly, will reveal whether they survived or not, also known as the “ground truth”.

The test.csv dataset contains similar information but does not disclose the “ground truth” for each passenger. It’s your job to predict these outcomes.

Using the patterns you find in the train.csv data, predict whether the other 418 passengers on board (found in test.csv) survived.

Check out the “Data” tab to explore the datasets even further. Once you feel you’ve created a competitive model, submit it to Kaggle to see where your model stands on our leaderboard against other Kagglers.

How to Submit your Prediction to Kaggle Once you’re ready to make a submission and get on the leaderboard:

Click on the “Submit Predictions” button

Upload a CSV file in the submission file format. You’re able to submit 10 submissions a day.

Submission File Format: You should submit a csv file with exactly 418 entries plus a header row. Your submission will show an error if you have extra columns (beyond PassengerId and Survived) or rows.

The file should have exactly 2 columns:

PassengerId (sorted in any order) Survived (contains your binary predictions: 1 for survived, 0 for deceased) Got it! I’m ready to get started. Where do I get help if I need it? For Competition Help: Titanic Discussion Forum Kaggle doesn’t have a dedicated team to help troubleshoot your code so you’ll typically find that you receive a response more quickly by asking your question in the appropriate forum. The forums are full of useful information on the data, metric, and different approaches. We encourage you to use the forums often. If you share your knowledge, you'll find that others will share a lot in turn!

A Last Word on Kaggle Notebooks As we mentioned before, Kaggle Notebooks is our no-setup, customizable, Jupyter Notebooks environment with free GPUs and a huge repository ...
A
‘Titanic: cleaned data’ analyzed by Analyst-2
analyst-2.ai
Updated Sep 30, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Titanic: cleaned data’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-cleaned-data-cbf4/dc9cd7ff/?iid=055-046&v=presentation
Explore at:
Dataset updated
Sep 30, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Titanic: cleaned data’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/jamesleslie/titanic-cleaned-data on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Introduction

This dataset was created in this notebook as part of a three-part series. The data is in machine-learning-ready format, with all missing values for the Age, Fare and Embarked columns having been imputed.

Data imputation

Age: this column was imputed by using the median age for the passenger's title (Mr, Mrs, Dr etc).

Fare: the single missing value in this column was imputed using the median value for that passenger's class.

Embarked: the two missing values here were imputed using the Pandas backfill method.

Usage

This data is used in both the second and third parts of the series.

--- Original source retains full ownership of the source dataset ---
Preprocessed Titanic Survived Prediction Data
kaggle.com
Updated Feb 6, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fethiye (2021). Preprocessed Titanic Survived Prediction Data [Dataset]. https://www.kaggle.com/fethiye/titanic-preprocessed-train-data/notebooks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 6, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Fethiye
Description
Context

Data set was created by preprocessing (filling lost data, extracting new features) of Titanic - Machine Learning Disaster data set.

Using this processed data set, the machine learning models can be applied directly.

You can see preprocessing step in notebook: https://www.kaggle.com/fethiye/titanic-predict-survival-prediction
Titanic
kaggle.com
zip
Updated Nov 1, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MHouellemont (2017). Titanic [Dataset]. https://www.kaggle.com/mhouellemont/titanic
Explore at:
zip(34877 bytes)Available download formats
Dataset updated
Nov 1, 2017
Authors
MHouellemont
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by MHouellemont

Released under CC0: Public Domain

Contents
Titanic Leaderboard March 2023
kaggle.com
Updated Apr 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lucas Antoine (2023). Titanic Leaderboard March 2023 [Dataset]. http://doi.org/10.34740/kaggle/dsv/5281032
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/5281032
Dataset updated
Apr 3, 2023
Dataset provided by
Kaggle
Authors
Lucas Antoine
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset used in my 🛳️ Titanic - Top 1% with KNN [0.81818] notebook. It contains all the leaderboard's entries from the Titanic - Machine Learning from Disaster competition in March 2023.
Titanic Data Set
kaggle.com
zip
Updated Feb 23, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
diogo22santos (2018). Titanic Data Set [Dataset]. https://www.kaggle.com/diogo22santos/titanic-data-set
Explore at:
zip(22544 bytes)Available download formats
Dataset updated
Feb 23, 2018
Authors
diogo22santos
Description
Dataset

This dataset was created by diogo22santos

Released under Other (specified in description)

Contents
titanic
kaggle.com
zip
Updated Mar 12, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Patil (2018). titanic [Dataset]. https://www.kaggle.com/patil4444/titanic
Explore at:
zip(33847 bytes)Available download formats
Dataset updated
Mar 12, 2018
Authors
Patil
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Patil

Released under CC0: Public Domain

Contents

It contains the following files:
Titanic
kaggle.com
zip
Updated Dec 5, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
karthik (2017). Titanic [Dataset]. https://www.kaggle.com/karthik10111/titanic
Explore at:
zip(9980 bytes)Available download formats
Dataset updated
Dec 5, 2017
Authors
karthik
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Dataset

This dataset was created by karthik

Released under Database: Open Database, Contents: Database Contents

Contents
Titanic Case Study
kaggle.com
Updated Sep 6, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aishwarya Sharma (2019). Titanic Case Study [Dataset]. https://www.kaggle.com/aishadroit/titanic-case-study/notebooks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 6, 2019
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aishwarya Sharma
Description
Context

Titanic Dataset for checking the chance of survivals among passengers travelling during the same time.

Content

How priorities are set for the people saved on board ? What factors responsible for the ship to sink ?
O
notebookcdg
opendatalab.com
zip
Updated Apr 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University of Waterloo (2023). notebookcdg [Dataset]. https://opendatalab.com/OpenDataLab/notebookcdg
Explore at:
zipAvailable download formats
Dataset updated
Apr 2, 2023
Dataset provided by
IBM Research
University of Waterloo
University of Michigan
JD.COM Silicon Valley Research Center
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Inspired by Wang et al. 2021, we decided to utilize the top-voted and well-documented Kaggle notebooks to construct the notebookCDGdataset We collected the top 10% highly-voted notebooks from the top 20 popular competitions on Kaggle (e.g. Titanic). We checked the data policy of each of the 20 competitions, none of them has copyright issues. We also contacted the Kaggle administrators to make sure our data collection complies with the platform’s policy. In total, we collected 3,944 notebooks as raw data. After data preprocessing, the final dataset contains 2,476 notebooks out of the 3,944 notebooks from the raw data. It has 28,625 code–documentation pairs. The overall code-to-markdown ratio is 2.2195
Titanic: pickle save
kaggle.com
zip
Updated Nov 22, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carl McBride Ellis (2020). Titanic: pickle save [Dataset]. https://www.kaggle.com/carlmcbrideellis/titanic-pickle-save
Explore at:
zip(1178 bytes)Available download formats
Dataset updated
Nov 22, 2020
Authors
Carl McBride Ellis
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This is a pickle file of the model used in the notebook "Titanic: some sex, a bit of class, and a tree..." for use in the notebook All in a pickle: Saving the Titanic.
my-titanic
kaggle.com
zip
Updated Oct 21, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DavidS (2019). my-titanic [Dataset]. https://www.kaggle.com/davids1992/mytitanic
Explore at:
zip(22544 bytes)Available download formats
Dataset updated
Oct 21, 2019
Authors
DavidS
Description
Dataset

This dataset was created by DavidS

Contents

It contains the following files:
Titanic EDA Data
kaggle.com
Updated Jul 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pranjal Yadav (2025). Titanic EDA Data [Dataset]. https://www.kaggle.com/datasets/pranjalyadav92905/titanic-eda-data/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 4, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Pranjal Yadav
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
This dataset contains cleaned Titanic passenger data for EDA and machine learning tasks. Includes features like age, sex, class, fare, and family details. Ideal for survival prediction and beginner ML projects.

🚀 Great for:

Feature engineering

Data visualization

Classification modeling

🔄 Both train and test sets included.

💬 If you find this dataset helpful, please upvote and share your notebook!
Titanic survivors
kaggle.com
Updated Jul 6, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sawan Kumar Yadav (2020). Titanic survivors [Dataset]. https://www.kaggle.com/sawankumar1/titanic-survivors/notebooks
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 6, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sawan Kumar Yadav
Description
Dataset

This dataset was created by Sawan Kumar Yadav

Contents
EDA and STATISTCS on Titanic
kaggle.com
zip
Updated Dec 20, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ASAD KAREL (2020). EDA and STATISTCS on Titanic [Dataset]. https://www.kaggle.com/alahazrat/eda-and-statistcs-on-titanic
Explore at:
zip(2133571 bytes)Available download formats
Dataset updated
Dec 20, 2020
Authors
ASAD KAREL
Description
Dataset

This dataset was created by ASAD KAREL

Contents

It contains the following files:
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Fernando Meneses (2024). Titanic-FM [Dataset]. https://www.kaggle.com/datasets/fertmeneses/titanic-fm

Titanic-FM

Files for the Notebook Titanic/Kaggle -Full analysis 🕵🏽, by Fernando Meneses

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Jul 30, 2024

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Fernando Meneses

Description

Titanic - Machine Learning from Disaster (Kaggle competition). This dataset contains datafiles for the Notebook Titanic/Kaggle -Full analysis 🕵🏽, by Fernando Meneses. It includes: training and testing datasets, the solution file, Leaderboard statistics and pre-trained results.

Clear search

Close search

Google apps

Main menu

Titanic-FM

Titanic Dataset

‘Titanic: cleaned data’ analyzed by Analyst-2

Introduction

Data imputation

Usage

Preprocessed Titanic Survived Prediction Data

Context

Titanic

Dataset

Contents

Titanic Leaderboard March 2023

Titanic Data Set

Dataset

Contents

titanic

Dataset

Contents

Titanic

Dataset

Contents

Titanic Case Study

Context

Content

notebookcdg

Titanic: pickle save

my-titanic

Dataset

Contents

Titanic EDA Data

Titanic survivors

Dataset

Contents

EDA and STATISTCS on Titanic

Dataset

Contents

Titanic-FM

Files for the Notebook Titanic/Kaggle -Full analysis 🕵🏽, by Fernando Meneses