100+ datasets found
  1. Patient Dataset for Clustering (Raw Data)

    • kaggle.com
    Updated Aug 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arjunn Sharma (2023). Patient Dataset for Clustering (Raw Data) [Dataset]. https://www.kaggle.com/datasets/arjunnsharma/patient-dataset-for-clustering-raw-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 10, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Arjunn Sharma
    Description

    About Dataset ● Based on patient symptoms, identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate. ● Three individual datasets used for three urgent illness/injury, each dataset has its own features and symptoms for each patient and we merged them to know what are the most severe symptoms for each illness and give them priority of treatment.

    PROJECT SUMMARY Triage refers to the sorting of injured or sick people according to their need for emergency medical attention. It is a method of determining priority for who gets care first. BACKGROUND Triage is the prioritization of patient care (or victims during a disaster) based on illness/injury, symptoms, severity, prognosis, and resource availability. The purpose of triage is to identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate. BUSINESS CHALLENGE Based on patient symptoms, identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate.

  2. Customer Segmentation : Clustering

    • kaggle.com
    zip
    Updated Jan 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vishakh Patel (2024). Customer Segmentation : Clustering [Dataset]. https://www.kaggle.com/datasets/vishakhdapat/customer-segmentation-clustering
    Explore at:
    zip(63448 bytes)Available download formats
    Dataset updated
    Jan 13, 2024
    Authors
    Vishakh Patel
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Customer Personality Analysis involves a thorough examination of a company's optimal customer profiles. This analysis facilitates a deeper understanding of customers, enabling businesses to tailor products to meet the distinct needs, behaviors, and concerns of various customer types.

    By conducting a Customer Personality Analysis, businesses can refine their products based on the preferences of specific customer segments. Rather than allocating resources to market a new product to the entire customer database, companies can identify the segments most likely to be interested in the product. Subsequently, targeted marketing efforts can be directed toward those particular segments, optimizing resource utilization and increasing the likelihood of successful product adoption.

    Details of Features are as below:

    • Id: Unique identifier for each individual in the dataset.
    • Year_Birth: The birth year of the individual.
    • Education: The highest level of education attained by the individual.
    • Marital_Status: The marital status of the individual.
    • Income: The annual income of the individual.
    • Kidhome: The number of young children in the household.
    • Teenhome: The number of teenagers in the household.
    • Dt_Customer: The date when the customer was first enrolled or became a part of the company's database.
    • Recency: The number of days since the last purchase or interaction.
    • MntWines: The amount spent on wines.
    • MntFruits: The amount spent on fruits.
    • MntMeatProducts: The amount spent on meat products.
    • MntFishProducts: The amount spent on fish products.
    • MntSweetProducts: The amount spent on sweet products.
    • MntGoldProds: The amount spent on gold products.
    • NumDealsPurchases: The number of purchases made with a discount or as part of a deal.
    • NumWebPurchases: The number of purchases made through the company's website.
    • NumCatalogPurchases: The number of purchases made through catalogs.
    • NumStorePurchases: The number of purchases made in physical stores.
    • NumWebVisitsMonth: The number of visits to the company's website in a month.
    • AcceptedCmp3: Binary indicator (1 or 0) whether the individual accepted the third marketing campaign.
    • AcceptedCmp4: Binary indicator (1 or 0) whether the individual accepted the fourth marketing campaign.
    • AcceptedCmp5: Binary indicator (1 or 0) whether the individual accepted the fifth marketing campaign.
    • AcceptedCmp1: Binary indicator (1 or 0) whether the individual accepted the first marketing campaign.
    • AcceptedCmp2: Binary indicator (1 or 0) whether the individual accepted the second marketing campaign.
    • Complain: Binary indicator (1 or 0) whether the individual has made a complaint.
    • Z_CostContact: A constant cost associated with contacting a customer.
    • Z_Revenue: A constant revenue associated with a successful campaign response.
    • Response: Binary indicator (1 or 0) whether the individual responded to the marketing campaign.
  3. Clustering Exercises

    • kaggle.com
    zip
    Updated Apr 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joonas (2022). Clustering Exercises [Dataset]. https://www.kaggle.com/datasets/joonasyoon/clustering-exercises
    Explore at:
    zip(3602272 bytes)Available download formats
    Dataset updated
    Apr 29, 2022
    Authors
    Joonas
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Overview

    https://i.imgur.com/ZUX61cD.png" alt="Overview">

    Context

    The method of disuniting similar data is called clustering. you can create dummy data for classifying clusters by method from sklearn package but it needs to put your effort into job.

    For users who making hard test cases for example of clustering, I think this dataset helps them.

    Try out to select a meaningful number of clusters, and dividing the data into clusters. Here are exercises for you.

    Dataset

    All csv files contain a lots of x, y and color, and you can see above figures.

    If you want to use position as type of integer, scale it and round off to integer as like x = round(x * 100).

    Furthermore, here is GUI Tool to generate 2D points for clustering. you can make your dataset with this tool. https://www.joonas.io/cluster-paint

    Stay tuned for further updates! also if any idea, you can comment me.

  4. 2D Clustering Dataset Collection

    • kaggle.com
    zip
    Updated Jan 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SAMOILOV MIKHAIL (2025). 2D Clustering Dataset Collection [Dataset]. https://www.kaggle.com/datasets/samoilovmikhail/2d-clustering-dataset-collection
    Explore at:
    zip(136543 bytes)Available download formats
    Dataset updated
    Jan 21, 2025
    Authors
    SAMOILOV MIKHAIL
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset collection comprises 15 diverse two-dimensional datasets specifically designed for clustering analysis. Each dataset contains three columns: x, y, and target, where x and y represent the coordinates of the data points, and target indicates the cluster label.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F20292402%2F3cc81328beabc815fe500973fee1f7ac%2Fdescription.png?generation=1737484616903723&alt=media" alt="Visualisation of data">

  5. Benchmarks datasets for cluster analysis

    • kaggle.com
    zip
    Updated Nov 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Onthada Preedasawakul (2023). Benchmarks datasets for cluster analysis [Dataset]. https://www.kaggle.com/datasets/onthada/benchmarks-datasets-for-clustering
    Explore at:
    zip(608532 bytes)Available download formats
    Dataset updated
    Nov 15, 2023
    Authors
    Onthada Preedasawakul
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    25 Artificial Datasets

    The datasets are generated using either Gaussian or Uniform distributions. Each dataset contains several known sub-groups intended for testing centroid-based clustering results and cluster validity indices.

    Cluster analysis is a popular machine learning used for segmenting datasets with similar data points in the same group. For those who are familiar with R, there is a new R package called "UniversalCVI" https://CRAN.R-project.org/package=UniversalCVI used for cluster evaluation. This package provides algorithms for checking the accuracy of a clustering result with known classes, computing cluster validity indices, and generating plots for comparing them. The package is compatible with K-means, fuzzy C means, EM clustering, and hierarchical clustering (single, average, and complete linkage). To use the "UniversalCVI" package, one can follow the instructions provided in the R documentation.

    For more in-depth details of the package and cluster evaluation, please see the papers https://doi.org/10.1016/j.patcog.2023.109910 and https://arxiv.org/abs/2308.14785

    All the datasets are also available on GitHub at

    https://github.com/O-PREEDASAWAKUL/FuzzyDatasets.git .

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F17645646%2Fa2f87fbad212a908718535589681a703%2Frealplot.jpeg?generation=1700111724010268&alt=media" alt="">

  6. Small image dataset for unsupervised clustering

    • kaggle.com
    zip
    Updated Oct 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Won-Du Chang (2022). Small image dataset for unsupervised clustering [Dataset]. https://www.kaggle.com/datasets/heavensky/image-dataset-for-unsupervised-clustering
    Explore at:
    zip(6947440 bytes)Available download formats
    Dataset updated
    Oct 29, 2022
    Authors
    Won-Du Chang
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Is it possible to cluster all the photos in your phone automatically without labeling?

    This small dataset includes 80 photos of dogs (10), cats (10), family (20), alone (20), and food (20). There is no labeling info, but you will see it clearly.

    All the photos were from pixabay(https://pixabay.com/). They are free under some restrictions. please see the license page of pixabay (https://pixabay.com/ko/service/license/).

  7. student clustering

    • kaggle.com
    zip
    Updated Aug 31, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepansh Saxena1 (2022). student clustering [Dataset]. https://www.kaggle.com/datasets/deepanshsaxena1/student-clusteringg
    Explore at:
    zip(875 bytes)Available download formats
    Dataset updated
    Aug 31, 2022
    Authors
    Deepansh Saxena1
    Description

    Dataset

    This dataset was created by Deepansh Saxena1

    Contents

  8. Hierarchical Clustering Practice Dataset (small)

    • kaggle.com
    zip
    Updated Dec 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jainil Patel (2024). Hierarchical Clustering Practice Dataset (small) [Dataset]. https://www.kaggle.com/datasets/jainilspatel/cluster
    Explore at:
    zip(388 bytes)Available download formats
    Dataset updated
    Dec 31, 2024
    Authors
    Jainil Patel
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset Description

    This dataset explores the relationship between age, mobile usage hours, and income levels. It contains four columns:

    1. NAME: The individual's name (the first 8 names are from the creator's friend circle).
    2. HOUR: Daily mobile usage in hours.
    3. INCOME: Monthly income in a standardized format.
    4. AGE: Age of the individual.

    The dataset is small and ideal for beginners to practice hierarchical clustering techniques. It provides insights into how mobile usage and income vary across different age groups, making it suitable for educational and learning purposes.

  9. fake dataset for clustering

    • kaggle.com
    zip
    Updated Feb 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ridlo Wahyudi Wibowo (2022). fake dataset for clustering [Dataset]. https://www.kaggle.com/datasets/ridloww/fake-dataset-untuk-clustering
    Explore at:
    zip(24078 bytes)Available download formats
    Dataset updated
    Feb 26, 2022
    Authors
    Ridlo Wahyudi Wibowo
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Ridlo Wahyudi Wibowo

    Released under CC0: Public Domain

    Contents

  10. Data from: Galaxy clustering

    • kaggle.com
    zip
    Updated Jan 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Galaxy clustering [Dataset]. https://www.kaggle.com/datasets/thedevastator/clustering-polygons-utilizing-iris-moon-and-circ
    Explore at:
    zip(6339 bytes)Available download formats
    Dataset updated
    Jan 3, 2023
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Galaxy clustering

    Iris, Moon, and Circles datasets for Galaxy clustering tutorial

    By [source]

    About this dataset

    This dataset contains a wealth of information that can be used to explore the effectiveness of various clustering algorithms. With its inclusion of numerical measurements (X, Y, Sepal.Length, and Petal.Length) and categorical values (Species), it is possible to investigate the relationship between different types of variables and clustering performance. Additionally, by comparing results for the 3 datasets provided - moon.csv (which contains x and y coordinates), iris.csv (which contains measurements for sepal and petal lengths),and circles.csv - we can gain insights into how different data distributions affect clustering techniques such as K-Means or Hierarchical Clustering among others!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset can also be a great starting point to further explore more complex clusters by using higher dimensional space variables such as color or texture that may be present in other datasets not included here but which can help to form more accurate groups when using cluster-analysis algorithms. Additionally, it could also assist in visualization projects where clusters may need to be generated such as plotting mapped data points or examining relationships between two different variables within a certain region drawn on a chart.

    To use this dataset effectively it is important to understand how exactly your chosen algorithm works since some require specifying parameters beforehand while others take care of those details automatically; otherwise the interpretation may be invalid depending on the methods used alongside clustering you intend for your project. Furthermore, familiarize yourself with concepts like silhouette score and rand index - these are commonly used metrics that measure your cluster’s performance against other clusterings models so you know if what you have done so far satisfies an acceptable level of accuracy or not yet! Good luck!

    Research Ideas

    • Utilizing the sepal and petal lengths and widths to perform flower recognition or part of a larger image recognition pipeline.
    • Classifying the data points in each dataset by the X-Y coordinates using clustering algorithms to analyze galaxy locations or overall formation patterns for stars, planets, or galaxies.
    • Exploring correlations between species of flowers in terms of sepal/petal lengths by performing supervised learning tasks such as classification with this dataset

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    File: moon.csv | Column name | Description | |:--------------|:------------------------------------------| | X | X coordinate of the data point. (Numeric) | | Y | Y coordinate of the data point. (Numeric) |

    File: iris.csv | Column name | Description | |:-----------------|:---------------------------------------------| | Sepal.Length | Length of the sepal of the flower. (Numeric) | | Petal.Length | Length of the petal of the flower. (Numeric) | | Species | Species of the flower. (Categorical) |

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit .

  11. CLUSTERING: K-MEANS

    • kaggle.com
    zip
    Updated Apr 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Syed Touqeer (2020). CLUSTERING: K-MEANS [Dataset]. https://www.kaggle.com/datasets/syedtouqeer/clustering-kmeans
    Explore at:
    zip(1599 bytes)Available download formats
    Dataset updated
    Apr 24, 2020
    Authors
    Syed Touqeer
    Description

    Dataset

    This dataset was created by Syed Touqeer

    Contents

  12. Text Document Classification Dataset

    • kaggle.com
    zip
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sunil thite (2023). Text Document Classification Dataset [Dataset]. https://www.kaggle.com/datasets/sunilthite/text-document-classification-dataset
    Explore at:
    zip(1941393 bytes)Available download formats
    Dataset updated
    Dec 4, 2023
    Authors
    sunil thite
    Description

    This is text document classification dataset which contains 2225 text data and five categories of documents. Five categories are politics, sport, tech, entertainment and business. We can use this dataset for documents classification and document clustering.

    About Dataset - Dataset contains two features text and label. - No. of Rows : 2225 - No. of Columns : 2

    Text: It contains different categories of text data Label: It contains labels for five different categories : 0,1,2,3,4

    1. Politics = 0
    2. Sport = 1
    3. Technology = 2
    4. Entertainment =3
    5. Business = 4
  13. The Kaggle Dataset

    • kaggle.com
    zip
    Updated Jul 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    DG (2023). The Kaggle Dataset [Dataset]. https://www.kaggle.com/datasets/davidgauthier/the-kaggle-dataset
    Explore at:
    zip(5167093 bytes)Available download formats
    Dataset updated
    Jul 4, 2023
    Authors
    DG
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    There are close to 200 000 datasets on Kaggle.

    This dataset brings information on all of them to help navigate all the data. The file contains upvotes, links usability, and so on for every dataset hosted on Kaggle.

    An example of usage is to study the ties between the dataset's features and its popularity.

  14. Clustering Dataset

    • kaggle.com
    zip
    Updated May 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nakshatra Goswami (2024). Clustering Dataset [Dataset]. https://www.kaggle.com/datasets/nakshatragoswami/clustering-dataset/code
    Explore at:
    zip(74494 bytes)Available download formats
    Dataset updated
    May 16, 2024
    Authors
    Nakshatra Goswami
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Nakshatra Goswami

    Released under Apache 2.0

    Contents

  15. Customer Segmentation for Targeted Campaigns

    • kaggle.com
    zip
    Updated May 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mani Devesh (2024). Customer Segmentation for Targeted Campaigns [Dataset]. https://www.kaggle.com/datasets/manidevesh/customer-sales-data
    Explore at:
    zip(914292 bytes)Available download formats
    Dataset updated
    May 21, 2024
    Authors
    Mani Devesh
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Project Overview: Customer Segmentation Using K-Means Clustering

    Introduction In this project, I analysed customer data from a retail store to identify distinct customer segments. The dataset includes key attributes such as age, city, and total sales of the customers. By leveraging K-Means clustering, an unsupervised machine learning technique, I aim to group customers based on their age and sales metrics. These insights will enable the creation of targeted marketing campaigns tailored to the specific needs and behaviours of each customer segment.

    Objectives - Cluster Customers: Use K-Means clustering to group customers based on age and total sales. - Analyse Segments: Examine the characteristics of each customer segment. - Targeted Marketing: Develop strategies for personalized marketing campaigns targeting each identified customer group.

    Data Description The dataset comprises:

    • Age: The age of the customers.
    • City: The city where the customers reside.
    • Total Sales: The total sales generated by each customer.

    Methodology - Data Preprocessing: Clean and preprocess the data to handle any missing or inconsistent entries. - Feature Selection: Focus on age and total sales as primary features for clustering. - K-Means Clustering: Apply the K-Means algorithm to identify distinct customer segments. - Cluster Analysis: Analyse the resulting clusters to understand the demographic and sales characteristics of each group. - Marketing Strategy Development: Create targeted marketing strategies for each customer segment to enhance engagement and sales.

    Expected Outcomes - Customer Segments: Clear identification of customer groups based on age and purchasing behaviour. - Insights for Marketing: Detailed understanding of each segment to inform targeted marketing efforts. - Business Impact: Enhanced ability to tailor marketing campaigns, potentially leading to increased customer satisfaction and sales.

    By clustering customers based on age and total sales, this project aims to provide actionable insights for personalized marketing, ultimately driving better customer engagement and higher sales for the retail store.

  16. Clustering of 3D coordinates

    • kaggle.com
    zip
    Updated Oct 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MEGHANAparappa (2023). Clustering of 3D coordinates [Dataset]. https://www.kaggle.com/datasets/meghanaparappa/clustering-of-3d-coordinates
    Explore at:
    zip(344 bytes)Available download formats
    Dataset updated
    Oct 9, 2023
    Authors
    MEGHANAparappa
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides 50 numbers of cartesian coordinates having 3 dimensions. The goal is to group these coordinates effectively into clusters. Application of any suitable Machine learning model is welcomed. Happy Learning!

  17. 2D clustering data

    • kaggle.com
    zip
    Updated Sep 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Cortinhas (2022). 2D clustering data [Dataset]. https://www.kaggle.com/datasets/samuelcortinhas/2d-clustering-data/versions/2
    Explore at:
    zip(6686 bytes)Available download formats
    Dataset updated
    Sep 11, 2022
    Authors
    Samuel Cortinhas
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Small 2 dimensional clustering dataset for examples and case studies.

    Created using https://www.joonas.io/cluster-paint/

    I used this in my introduction to k-Means clustering notebook here: https://www.kaggle.com/samuelcortinhas/k-means-from-scratch

  18. Clustering dataset for weka software

    • kaggle.com
    zip
    Updated Sep 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Click Mintaka (2024). Clustering dataset for weka software [Dataset]. https://www.kaggle.com/datasets/muhammadismailo/clustering-dataset-for-weka-software
    Explore at:
    zip(5441 bytes)Available download formats
    Dataset updated
    Sep 20, 2024
    Authors
    Click Mintaka
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Click Mintaka

    Released under CC0: Public Domain

    Contents

  19. Customer Dataset for clustering

    • kaggle.com
    zip
    Updated Sep 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhash Rai (2024). Customer Dataset for clustering [Dataset]. https://www.kaggle.com/datasets/abhashrai/customer-dataset-for-clustering
    Explore at:
    zip(20870 bytes)Available download formats
    Dataset updated
    Sep 3, 2024
    Authors
    Abhash Rai
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Beginner friendly dataset for clustering.

    You can train a model to cluster customers in segments (High, Medium, Low) based on 'Avg_Order_Value' and 'Total_Spending'.

    Actual segment is also provided.

  20. uber clustering

    • kaggle.com
    zip
    Updated Apr 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tek Bahadur Kshetri (2023). uber clustering [Dataset]. https://www.kaggle.com/datasets/tekbahadurkshetri/uber-clustering
    Explore at:
    zip(5012595 bytes)Available download formats
    Dataset updated
    Apr 16, 2023
    Authors
    Tek Bahadur Kshetri
    Description

    Dataset

    This dataset was created by Tek Bahadur Kshetri

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Arjunn Sharma (2023). Patient Dataset for Clustering (Raw Data) [Dataset]. https://www.kaggle.com/datasets/arjunnsharma/patient-dataset-for-clustering-raw-data
Organization logo

Patient Dataset for Clustering (Raw Data)

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 10, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Arjunn Sharma
Description

About Dataset ● Based on patient symptoms, identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate. ● Three individual datasets used for three urgent illness/injury, each dataset has its own features and symptoms for each patient and we merged them to know what are the most severe symptoms for each illness and give them priority of treatment.

PROJECT SUMMARY Triage refers to the sorting of injured or sick people according to their need for emergency medical attention. It is a method of determining priority for who gets care first. BACKGROUND Triage is the prioritization of patient care (or victims during a disaster) based on illness/injury, symptoms, severity, prognosis, and resource availability. The purpose of triage is to identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate. BUSINESS CHALLENGE Based on patient symptoms, identify patients needing immediate resuscitation; to assign patients to a predesignated patient care area, thereby prioritizing their care; and to initiate diagnostic/therapeutic measures as appropriate.

Search
Clear search
Close search
Google apps
Main menu