100+ datasets found
  1. Titanic Dataset

    • kaggle.com
    Updated Apr 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sakshi Satre (2024). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/sakshisatre/titanic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 30, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sakshi Satre
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The dataset containing information about passengers aboard the Titanic is one of the most famous datasets used in data science and machine learning. It was created to analyze and understand the factors that influenced survival rates among passengers during the tragic sinking of the RMS Titanic on April 15, 1912.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F19517213%2Fd4016c159f1ad17cb30d8905192fe9d7%2Ftitanic-ship_1027017-11.avif?generation=1711562371875068&alt=media" alt="">

    Data Description :-

    The dataset is often used for predictive modeling and statistical analysis to determine which factors (such as socio-economic status, age, gender, etc.) were associated with a higher likelihood of survival. It contains 1309 rows and 14 columns.

    Columns : -

    • Pclass: Ticket class indicating the socio-economic status of the passenger. It is categorized into three classes: 1 = Upper, 2 = Middle, 3 = Lower.

    • Survived: A binary indicator that shows whether the passenger survived (1) or not (0) during the Titanic disaster. This is the target variable for analysis.

    • Name: The full name of the passenger, including title (e.g., Mr., Mrs., etc.).

    • Sex: The gender of the passenger, denoted as either male or female.

    • Age: The age of the passenger in years.

    • SibSp: The number of siblings or spouses aboard the Titanic for the respective passenger.

    • Parch: The number of parents or children aboard the Titanic for the respective passenger.

    • Ticket: The ticket number assigned to the passenger.

    • Fare: The fare paid by the passenger for the ticket.

    • Cabin: The cabin number assigned to the passenger, if available.

    • Embarked: The port of embarkation for the passenger. It can take one of three values: C = Cherbourg, Q = Queenstown, S = Southampton.

    • Boat: If the passenger survived, this column contains the identifier of the lifeboat they were rescued in.

    • Body: If the passenger did not survive, this column contains the identification number of their recovered body, if applicable.

    • Home.dest: The destination or place of residence of the passenger.

    These descriptions provide a detailed understanding of each column in the Titanic dataset subset, offering insights into the demographic, travel, and survival-related information recorded for each passenger.

  2. Titanic Dataset

    • kaggle.com
    Updated Apr 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sajid (2023). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/dbdmobile/tita111
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 20, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sajid
    Description

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F11299784%2F6530245ff6b6d097af8cb56c86b79943%2Fpxfuel.jpg?generation=1682007437079315&alt=media" alt="">The Titanic dataset is a widely used dataset that contains information on the passengers who were aboard the Titanic when it sank on its maiden voyage in 1912. The dataset includes features such as age, sex, passenger class, and fare paid, as well as whether or not the passenger survived the sinking. The dataset is often used for machine learning and data analysis tasks, such as predicting survival based on passenger characteristics or exploring patterns in the data. The Titanic dataset is a classic example of data analysis and is a great starting point for those new to data science.

    The Titanic dataset is available in CSV format and contains two files, one for training and one for testing. The training file is used to build the machine learning model, while the testing file is used to test the performance of the model.

    Column Description

    PassengerId: unique identifier for each passenger Survived: whether the passenger survived (1) or not (0) Pclass: passenger class (1 = 1st class, 2 = 2nd class, 3 = 3rd class) Name: name of the passenger Sex: gender of the passenger Age: age of the passenger (in years) SibSp: number of siblings or spouses aboard the Titanic Parch: number of parents or children aboard the Titanic Ticket: ticket number Fare: passenger fare Cabin: cabin number Embarked: port of embarkation (C = Cherbourg, Q = Queenstown, S = Southampton)

    MIT License

    Copyright (c) [2023] [Md Kazi Sajiduddin]

    Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

    The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

    THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

  3. h

    Kaggle-Titanic

    • huggingface.co
    Updated Nov 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ensalada (2023). Kaggle-Titanic [Dataset]. https://huggingface.co/datasets/Tomate/Kaggle-Titanic
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 2, 2023
    Authors
    Ensalada
    Description

    Tomate/Kaggle-Titanic dataset hosted on Hugging Face and contributed by the HF Datasets community

  4. Titanic Data with Missing Values Removed

    • kaggle.com
    Updated Sep 19, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Lomuscio (2019). Titanic Data with Missing Values Removed [Dataset]. https://www.kaggle.com/mlomuscio/titanic-data-with-some-missing-values-removed/tasks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 19, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Michael Lomuscio
    Description

    This dataset is a copy of the Titanic train.csv dataset used in the Kaggle Titanic competition. I removed all of the missing values from the Age and Embarked columns so that the dataset could be used by high school students that I teach. The following description is taken from the Kaggle competition dataset which can be found here.

    Data Dictionary

    • Survival = Whether or not the passenger survived. 0=No, 1=Yes
    • Pclass = Ticket class. 1=1st Class, 2=2nd Class, 3=3rd Class.
    • Sex = The gender of the passenger.
    • Age = Age in years. If the passenger was less than 1, then the age is a decimal.
    • Sibsp = The number of siblings/spouses aboard the Titanic.
    • Parch = The number of parents/children aboard the Titanic.
    • Ticket = The passenger's ticket number.
    • Fare = The cost of the passenger's ticket.
    • Cabin = The passenger's cabin number.
    • Embarked = The port that the passenger embarked from. C=Cherbourg, Q=Queenstown, S=Southampton.
  5. A

    ā€˜Titanic-Dataset (train.csv)’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Nov 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ā€˜Titanic-Dataset (train.csv)’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-dataset-train-csv-d701/832937de/?iid=019-246&v=presentation
    Explore at:
    Dataset updated
    Nov 12, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ā€˜Titanic-Dataset (train.csv)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/hesh97/titanicdataset-traincsv on 12 November 2021.

    --- No further description of dataset provided by original source ---

    --- Original source retains full ownership of the source dataset ---

  6. Projeto Titanic Kaggle

    • kaggle.com
    Updated Jun 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VictorSantos23 (2025). Projeto Titanic Kaggle [Dataset]. https://www.kaggle.com/datasets/victorsantos23/projeto-titanic-kaggle
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 19, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    VictorSantos23
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by VictorSantos23

    Released under MIT

    Contents

  7. A

    ā€˜Titanic Solution for Beginner's Guide’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ā€˜Titanic Solution for Beginner's Guide’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-solution-for-beginner-s-guide-78b3/db683166/?iid=041-447&v=presentation
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ā€˜Titanic Solution for Beginner's Guide’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/harunshimanto/titanic-solution-for-beginners-guide on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    Overview

    The data has been split into two groups:

    training set (train.csv)
    test set (test.csv)
    

    The training set should be used to build your machine learning models. For the training set, we provide the outcome (also known as the ā€œground truthā€) for each passenger. Your model will be based on ā€œfeaturesā€ like passengers’ gender and class. You can also use feature engineering to create new features.

    The test set should be used to see how well your model performs on unseen data. For the test set, we do not provide the ground truth for each passenger. It is your job to predict these outcomes. For each passenger in the test set, use the model you trained to predict whether or not they survived the sinking of the Titanic.

    We also include gender_submission.csv, a set of predictions that assume all and only female passengers survive, as an example of what a submission file should look like.

    Data Dictionary

    Variable Definition Key survival Survival 0 = No, 1 = Yes pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd sex Sex
    Age Age in years
    sibsp # of siblings / spouses aboard the Titanic
    parch # of parents / children aboard the Titanic
    ticket Ticket number
    fare Passenger fare
    cabin Cabin number
    embarked Port of Embarkation C = Cherbourg, Q = Queenstown, S = Southampton

    Variable Notes

    pclass: A proxy for socio-economic status (SES) 1st = Upper 2nd = Middle 3rd = Lower

    age: Age is fractional if less than 1. If the age is estimated, is it in the form of xx.5

    sibsp: The dataset defines family relations in this way... Sibling = brother, sister, stepbrother, stepsister Spouse = husband, wife (mistresses and fiancƩs were ignored)

    parch: The dataset defines family relations in this way... Parent = mother, father Child = daughter, son, stepdaughter, stepson Some children travelled only with a nanny, therefore parch=0 for them.

    --- Original source retains full ownership of the source dataset ---

  8. A

    ā€˜Titanic.csv’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ā€˜Titanic.csv’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-csv-ab29/5cede3f9/?iid=000-877&v=presentation
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ā€˜Titanic.csv’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/garrettrlynch/titaniccsv on 30 September 2021.

    --- No further description of dataset provided by original source ---

    --- Original source retains full ownership of the source dataset ---

  9. A

    ā€˜Titanic’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Nov 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ā€˜Titanic’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-titanic-f363/c09b2d16/?iid=020-855&v=presentation
    Explore at:
    Dataset updated
    Nov 12, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ā€˜Titanic’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/heptapod/titanic on 12 November 2021.

    --- Dataset description provided by original source is as follows ---

    Overview

    This is the original data from Titanic competition plus some changes that I applied to it to be better suited for binary logistic regression:

    1. Merged the train and test data.

    2. Removed the 'ticket' and 'cabin' attributes.

    3. Moved the 'Survived' attribute to the last column.

    4. Added extra zero columns for categorical inputs to be better suited for One-Hot-Encoding.

    5. Substituted the values of 'Sex' and 'Embarked' attributes with binary and categorical values respectively.

    6. Filled the missing values in 'Age' and 'Fare' attributes with the median of the data.

    --- Original source retains full ownership of the source dataset ---

  10. titanic dataset

    • kaggle.com
    Updated Jun 15, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pushpraj Namdev (2022). titanic dataset [Dataset]. https://www.kaggle.com/datasets/pushprajnamdev/titanic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 15, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pushpraj Namdev
    Description

    Dataset

    This dataset was created by Pushpraj Namdev

    Contents

  11. Titanic dataset

    • kaggle.com
    Updated Aug 8, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pavithra Naidu (2018). Titanic dataset [Dataset]. https://www.kaggle.com/datasets/pavithranaidu/titanic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pavithra Naidu
    Description

    Dataset

    This dataset was created by Pavithra Naidu

    Contents

  12. Titanic Dataset

    • kaggle.com
    zip
    Updated Apr 29, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stephen Andrew Lynch (2021). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/stephenandrewlynch/titanic-dataset
    Explore at:
    zip(1081 bytes)Available download formats
    Dataset updated
    Apr 29, 2021
    Authors
    Stephen Andrew Lynch
    Description

    Dataset

    This dataset was created by Stephen Andrew Lynch

    Contents

  13. Titanic Dataset

    • kaggle.com
    Updated Aug 8, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Hassaan (2024). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/mhassaan1122/titanic-dataset/discussion?sort=undefined
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 8, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Muhammad Hassaan
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Muhammad Hassaan

    Released under CC0: Public Domain

    Contents

  14. Titanic Edited (for my k-NN tips)

    • kaggle.com
    Updated Jun 24, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aleks Mashanski (2021). Titanic Edited (for my k-NN tips) [Dataset]. https://www.kaggle.com/datasets/alexmaszanski/titanic-edited-for-my-knn-tips
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 24, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aleks Mashanski
    Description

    Dataset

    This dataset was created by Aleks Mashanski

    Contents

  15. Titanic Dataset Complete Analysis

    • kaggle.com
    Updated Aug 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ankita Nain (2023). Titanic Dataset Complete Analysis [Dataset]. https://www.kaggle.com/datasets/ankitanain/titanic-dataset-complete-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 17, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ankita Nain
    Description

    Dataset

    This dataset was created by Ankita Nain

    Contents

  16. Titanic dataset

    • kaggle.com
    Updated Oct 18, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chia Yi (2017). Titanic dataset [Dataset]. https://www.kaggle.com/chiayii/titanic-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 18, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Chia Yi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Chia Yi

    Released under CC0: Public Domain

    Contents

  17. Competition_Titanic_machine learning from disaster

    • kaggle.com
    Updated Jan 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mukti shukla (2023). Competition_Titanic_machine learning from disaster [Dataset]. https://www.kaggle.com/datasets/muktishukla/titanic-servival
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 20, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    mukti shukla
    License

    http://www.gnu.org/licenses/lgpl-3.0.htmlhttp://www.gnu.org/licenses/lgpl-3.0.html

    Description

    Dataset

    This dataset was created by mukti shukla

    Released under GNU Lesser General Public License 3.0

    Contents

  18. Titanic Dataset

    • kaggle.com
    Updated Sep 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShaikSakeena (2025). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/bibisakeena/titanic-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 4, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ShaikSakeena
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ShaikSakeena

    Released under Attribution 4.0 International (CC BY 4.0)

    Contents

  19. Titanic Dataset for GDSC - AI Model

    • kaggle.com
    Updated Apr 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    alirizaercan (2024). Titanic Dataset for GDSC - AI Model [Dataset]. https://www.kaggle.com/datasets/alirizaercan/titanic-dataset-for-gdsc-ai-model
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    alirizaercan
    Description

    The Titanic Dataset for GDSC - AI Model

    This dataset contains information about the passengers and crew members who were on board the RMS Titanic, a British passenger liner that sank in the North Atlantic Ocean in the early hours of April 15, 1912, after striking an iceberg during her maiden voyage from Southampton to New York City. The sinking of the Titanic resulted in a large loss of life and remains one of the deadliest commercial peacetime maritime disasters in modern history.

    The dataset includes a variety of features about the passengers and crew members, such as:

    Passenger Class: Indicates the class (1st, 2nd, 3rd) that the passenger traveled in. Name: The passenger's name. Sex: The passenger's sex. Age: The passenger's age. SibSp: The number of siblings or spouses aboard the Titanic with the passenger. Parch: The number of parents or children aboard the Titanic with the passenger. Ticket: The passenger's ticket number. Fare: The passenger's fare. Cabin: The passenger's cabin number. Embarked: The port where the passenger embarked the Titanic. Survived: Whether the passenger survived the sinking of the Titanic (1 = survived, 0 = did not survive). What You Can Do With The Dataset

    The Titanic Dataset is a valuable resource for anyone interested in machine learning, data science, or the history of the Titanic. Here are some examples of what you can do with this dataset:

    Predict passenger survival: You can use the dataset to train a machine learning model to predict whether a passenger was more likely to survive the sinking of the Titanic based on features such as their class, sex, age, and number of relatives on board. Analyze factors that influenced survival rates: You can use the dataset to analyze the factors that influenced passenger survival rates. For example, you could look at how factors such as class, sex, and age affected a passenger's chances of survival. Build a classification model to identify passengers who were more likely to survive: You can use the dataset to build a classification model that can identify passengers who were more likely to survive the sinking of the Titanic. This model could be used to help us understand the factors that influenced survival rates and could also be used to improve the safety of passengers in future maritime disasters. Overall, the Titanic Dataset is a rich and informative dataset that can be used for a variety of purposes. If you are interested in machine learning, data science, or the history of the Titanic, then this dataset is a great resource to explore.

  20. titanic dataset

    • kaggle.com
    Updated Sep 22, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priyanka (2018). titanic dataset [Dataset]. https://www.kaggle.com/priyanka2018/titanic-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 22, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Priyanka
    Description

    Dataset

    This dataset was created by Priyanka

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sakshi Satre (2024). Titanic Dataset [Dataset]. https://www.kaggle.com/datasets/sakshisatre/titanic-dataset
Organization logo

Titanic Dataset

"Tragedy at Sea : The Titanic Disaster !!"

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 30, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sakshi Satre
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

The dataset containing information about passengers aboard the Titanic is one of the most famous datasets used in data science and machine learning. It was created to analyze and understand the factors that influenced survival rates among passengers during the tragic sinking of the RMS Titanic on April 15, 1912.

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F19517213%2Fd4016c159f1ad17cb30d8905192fe9d7%2Ftitanic-ship_1027017-11.avif?generation=1711562371875068&alt=media" alt="">

Data Description :-

The dataset is often used for predictive modeling and statistical analysis to determine which factors (such as socio-economic status, age, gender, etc.) were associated with a higher likelihood of survival. It contains 1309 rows and 14 columns.

Columns : -

  • Pclass: Ticket class indicating the socio-economic status of the passenger. It is categorized into three classes: 1 = Upper, 2 = Middle, 3 = Lower.

  • Survived: A binary indicator that shows whether the passenger survived (1) or not (0) during the Titanic disaster. This is the target variable for analysis.

  • Name: The full name of the passenger, including title (e.g., Mr., Mrs., etc.).

  • Sex: The gender of the passenger, denoted as either male or female.

  • Age: The age of the passenger in years.

  • SibSp: The number of siblings or spouses aboard the Titanic for the respective passenger.

  • Parch: The number of parents or children aboard the Titanic for the respective passenger.

  • Ticket: The ticket number assigned to the passenger.

  • Fare: The fare paid by the passenger for the ticket.

  • Cabin: The cabin number assigned to the passenger, if available.

  • Embarked: The port of embarkation for the passenger. It can take one of three values: C = Cherbourg, Q = Queenstown, S = Southampton.

  • Boat: If the passenger survived, this column contains the identifier of the lifeboat they were rescued in.

  • Body: If the passenger did not survive, this column contains the identification number of their recovered body, if applicable.

  • Home.dest: The destination or place of residence of the passenger.

These descriptions provide a detailed understanding of each column in the Titanic dataset subset, offering insights into the demographic, travel, and survival-related information recorded for each passenger.

Search
Clear search
Close search
Google apps
Main menu