100+ datasets found
  1. Bank Customer Churn

    • kaggle.com
    Updated Mar 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CAT Reloaded || Data Science circle (2025). Bank Customer Churn [Dataset]. https://www.kaggle.com/datasets/cat-reloaded-data-science/bank-customer-churn/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 14, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    CAT Reloaded || Data Science circle
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Bank Customer Churn Dataset is a collection of data related to customers of a bank who have either left (churned) or stayed with the bank. This dataset is typically used for predictive modeling to identify patterns and factors that lead to customer churn, enabling banks to take proactive measures to retain customers.

    • id: Unique identifier for each customer.

    • CustomerId: Unique identifier for the customer account.

    • Surname: Last name of the customer.

    • CreditScore: Numeric representation of the customer's creditworthiness.

    • Geography:str, Gender:str:Country or region where the customer resides ,Gender of the customer (e.g., Male, Female).

    • Age: Age of the customer.

    • Tenure: Number of years the customer has been with the bank.

    • Balance: Current balance in the customer's account.

    • NumOfProducts: Number of bank products the customer uses.

    • HasCrCard: Binary indicator (0 or 1) for whether the customer has a credit card.

    • IsActiveMember: Binary indicator (0 or 1) for whether the customer is an active member.

    • EstimatedSalary: Estimated salary of the customer.

    • Exited: Binary indicator (0 or 1) for whether the customer has churned (the target).

  2. c

    Data from: Customer Churn Dataset

    • cubig.ai
    Updated May 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Customer Churn Dataset [Dataset]. https://cubig.ai/store/products/256/customer-churn-dataset
    Explore at:
    Dataset updated
    May 20, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Customer Churn Dataset is a dataset that collects various customer characteristics and service usage information to predict whether or not communication service customers will turn.

    2) Data Utilization (1) Customer Churn Dataset has characteristics that: • The dataset consists of several categorical and numerical variables, including customer demographics, service types, contract information, charges, usage patterns, and Turn. (2) Customer Churn Dataset can be used to: • Development of customer churn prediction model : Machine learning and deep learning techniques can be used to develop classification models that predict churn based on customer characteristics and service use data. • Segmenting customers and developing marketing strategies : It can be used to analyze customer groups at high risk of departure and to design custom retention strategies or targeted marketing campaigns.

  3. i

    Data from: Customer Churn Dataset

    • ieee-dataport.org
    Updated Jun 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Usman JOY (2024). Customer Churn Dataset [Dataset]. https://ieee-dataport.org/documents/customer-churn-dataset
    Explore at:
    Dataset updated
    Jun 4, 2024
    Authors
    Usman JOY
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    259

  4. Customer Churn - Decision Tree & Random Forest

    • kaggle.com
    Updated Jul 6, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vikram amin (2023). Customer Churn - Decision Tree & Random Forest [Dataset]. https://www.kaggle.com/datasets/vikramamin/customer-churn-decision-tree-and-random-forest
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 6, 2023
    Dataset provided by
    Kaggle
    Authors
    vikram amin
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description
    • Main objective: Find out customers who will churn and who will not.
    • Methodology: It is a classification problem. We will use decision tree and random forest to predict the outcome.
    • Steps Involved
    • Read the data
    • Check for data types https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F1ffb600d8a4b4b36bc25e957524a3524%2FPicture1.png?generation=1688638600831386&alt=media" alt="">
    1. Change character vector to factor vector as this is as classification problem
    2. Drop the variable which is not significant for the analysis. We drop "customerID".
    3. Check for missing values. None are found.
    4. Split the data into train and test so we can use the train data for building the model and use test data for prediction. We split this into 80-20 ratio (train/test) using the sample function.
    5. Install and run libraries (rpart, rpart.plot, rattle, RColorBrewer, caret)
    6. Run decision tree using rpart function. The dependent variable is Churn and 19 other independent variables

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F8d3442e6c82d8026c6a448e4780ab38c%2FPicture2.png?generation=1688638685268853&alt=media" alt=""> 9. Plot the decision tree

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F9ab0591e323dc30fe116c79f6d014d06%2FPicture3.png?generation=1688638747644320&alt=media" alt="">

    Average customer churn is 27%. The churn can take place if the tenure is more than >=7.5 and there is no internet service

    1. Tuning the model
    2. Define the search grid using the expand.grid function
    3. Set up the control parameters through 5 fold cross validation
    4. When we print the model we get the best CP = 0.01 and an accuracy of 79.00%

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F16080ac04d3743ec238227e1ef2c8269%2FPicture4.png?generation=1688639197455166&alt=media" alt="">

    1. Predict the model
    2. Find out the variables which are most and least significant. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F61beb4224e9351cfc772147c43800502%2FPicture5.png?generation=1688639468638950&alt=media" alt="">

    Significant variables are Internet Service, Tenure and the least significant are Streaming Movies, Tech Support.

    USE RANDOM FOREST

    1. Run library(randomForest). Here we are using the default ntree (500) and mtry (p/3) where p is the number of independent variables. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2Fc27fe7e83f0b53b7e067371b69c7f4a7%2FPicture6.png?generation=1688640478682685&alt=media" alt="">

      Through confusion matrix, accuracy is coming 79.27%. The accuracy is marginally higher than that of decision tree i.e 79.00%. The error rate is pretty low when predicting "No" and much higher when predicting "Yes".

    2. Plot the model showing which variables reduce the gini impunity the most and least. Total charges and tenure reduce the gini impunity the most while phone service has the least impact.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2Fec25fc3ba74ab9cef1a81188209512b1%2FPicture7.png?generation=1688640726235724&alt=media" alt="">

    1. Predict the model and create a new data frame showing the actuals vs predicted values

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F50aa40e5dd676c8285020fd2fe627bf1%2FPicture8.png?generation=1688640896763066&alt=media" alt="">

    1. Plot the model so as to find out where the OOB (out of bag ) error stops decreasing or becoming constant. As we can see that the error stops decreasing between 100 to 200 trees. So we decide to take ntree = 200 when we tune the model.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F87211e1b218c595911fbe6ea2806e27a%2FPicture9.png?generation=1688641103367564&alt=media" alt="">

    Tune the model mtry=2 has the lowest OOB error rate

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F6057af5bb0719b16f1a97a58c3d4aa1d%2FPicture10.png?generation=1688641391027971&alt=media" alt="">

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2Fc7045eba4ee298c58f1bd0230c24c00d%2FPicture11.png?generation=1688641605829830&alt=media" alt="">

    Use random forest with mtry = 2 and ntree = 200

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F10868729%2F01541eff1f9c6303591aa50dd707b5f5%2FPicture12.png?generation=1688641634979403&alt=media" alt="">

    Through confusion matrix, accuracy is coming 79.71%. The accuracy is marginally higher than that of default (when ntree was 500 and mtry was 4) i.e 79.27% and of decision tree i.e 79.00%. The error rate is pretty low when predicting "No" and m...

  5. c

    Data from: Telco Customer Churn Dataset

    • cubig.ai
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG, Telco Customer Churn Dataset [Dataset]. https://cubig.ai/store/products/312/telco-customer-churn-dataset
    Explore at:
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Telco Customer Churn Dataset includes carrier customer service usage, account information, demographics and churn, which can be used to predict and analyze customer churn.

    2) Data Utilization (1) Telco Customer Churn Dataset has characteristics that: • This dataset includes a variety of customer and service characteristics, including gender, age group, partner and dependents, service subscription status (telephone, Internet, security, backup, device protection, technical support, streaming, etc.), contract type, payment method, monthly fee, total fee, and departure. (2) Telco Customer Churn Dataset can be used to: • Development of customer churn prediction model: Using customer service usage patterns and account information, we can build a machine learning-based churn prediction model to proactively identify customers at risk of churn.

  6. C

    Churn Prediction Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Churn Prediction Software Report [Dataset]. https://www.datainsightsmarket.com/reports/churn-prediction-software-502488
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    May 11, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Churn Prediction Software market is experiencing robust growth, driven by the increasing need for businesses across diverse sectors to proactively manage customer retention. The market's expansion is fueled by the rising adoption of cloud-based solutions, offering scalability and cost-effectiveness. Key applications include telecommunications, banking and finance, retail, e-commerce, and healthcare, where minimizing customer churn is crucial for profitability. The market is witnessing a shift towards sophisticated predictive analytics and machine learning algorithms that provide more accurate churn predictions, allowing businesses to implement targeted retention strategies. This includes personalized offers, proactive customer support, and improved product/service offerings. Furthermore, the integration of churn prediction software with CRM systems enhances data analysis and facilitates more effective customer relationship management. Competition is intensifying with established players like SAP, Salesforce, and Oracle competing alongside agile startups offering specialized solutions. The market's growth, while positive, also faces certain restraints, such as the high initial investment costs for implementing these sophisticated solutions and the need for skilled data scientists to interpret and leverage the insights derived from the analyses. Despite these challenges, the market's future remains promising. The increasing availability of large datasets, coupled with advancements in artificial intelligence and machine learning, is expected to drive innovation and further enhance the accuracy and effectiveness of churn prediction software. Regional growth will vary, with North America and Europe likely leading the market initially, driven by higher technology adoption rates and established business practices. However, growth in Asia-Pacific is anticipated to accelerate significantly in the coming years as businesses in developing economies prioritize customer retention strategies. The continued development of user-friendly interfaces and the increasing integration of these tools into existing business workflows will further contribute to the overall market expansion and wider adoption across various industries.

  7. E-commerce Customer Churn

    • kaggle.com
    Updated Aug 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Semaya (2024). E-commerce Customer Churn [Dataset]. https://www.kaggle.com/datasets/samuelsemaya/e-commerce-customer-churn
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 6, 2024
    Dataset provided by
    Kaggle
    Authors
    Samuel Semaya
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    E-commerce Customer Churn Dataset

    Context

    This dataset belongs to a leading online E-commerce company. The company wants to identify customers who are likely to churn, so they can proactively approach these customers with promotional offers.

    Content

    The dataset contains various features related to customer behavior and characteristics, which can be used to predict customer churn.

    Features

    1. Tenure: Tenure of a customer in the company (numeric)
    2. WarehouseToHome: Distance between the warehouse to the customer's home (numeric)
    3. NumberOfDeviceRegistered: Total number of devices registered to a particular customer (numeric)
    4. PreferedOrderCat: Preferred order category of a customer in the last month (categorical)
    5. SatisfactionScore: Satisfactory score of a customer on service (numeric)
    6. MaritalStatus: Marital status of a customer (categorical)
    7. NumberOfAddress: Total number of addresses added for a particular customer (numeric)
    8. Complaint: Whether any complaint has been raised in the last month (binary)
    9. DaySinceLastOrder: Days since last order by customer (numeric)
    10. CashbackAmount: Average cashback in last month (numeric)
    11. Churn: Churn flag (target variable, binary)

    Task

    The main task is to predict customer churn based on the given features. This is a binary classification problem where the target variable is 'Churn'.

    Potential Applications

    1. Customer Retention: Identify at-risk customers and take proactive measures to retain them.
    2. Targeted Marketing: Design specific marketing campaigns for customers likely to churn.
    3. Service Improvement: Analyze features contributing to churn and improve those aspects of the service.

    Acknowledgements

    This dataset is provided for educational purposes. While it represents a real-world scenario, the data itself may be simulated or anonymized.

  8. D

    Customer Churn Software Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Customer Churn Software Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/customer-churn-software-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Oct 5, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Customer Churn Software Market Outlook



    The global customer churn software market size was valued at approximately USD 1.5 billion in 2023 and is projected to reach USD 4.8 billion by 2032, growing at a CAGR of 13.7% during the forecast period. This robust growth is driven by several factors, including the increasing importance of customer retention in competitive markets, advancements in AI and machine learning technologies, and the growing adoption of digital transformation initiatives across industries.



    One of the primary growth factors propelling the customer churn software market is the increasing emphasis on customer satisfaction and retention. In today's highly competitive business environment, retaining existing customers is more cost-effective than acquiring new ones. Companies are realizing the value of customer loyalty, and as a result, they are investing heavily in tools that can help predict and mitigate churn. Customer churn software offers advanced analytics and predictive capabilities, enabling organizations to identify at-risk customers and take proactive measures to retain them.



    Another significant driver is the advancement in artificial intelligence (AI) and machine learning technologies. These technologies have revolutionized the way customer data is analyzed and interpreted. AI-powered customer churn software can process vast amounts of data from multiple sources, identify patterns, and generate actionable insights. This ability to leverage big data and predictive analytics is crucial for businesses aiming to stay ahead of the competition. As AI and machine learning continue to evolve, the effectiveness and efficiency of customer churn software are expected to improve further.



    The increasing adoption of digital transformation initiatives across various industries is also contributing to the market growth. As businesses undergo digital transformation, they generate enormous amounts of data related to customer behavior, preferences, and interactions. Customer churn software helps organizations make sense of this data, enabling them to develop personalized strategies to enhance customer experience and loyalty. The shift towards data-driven decision-making is compelling companies to invest in advanced analytics solutions, thereby driving the demand for customer churn software.



    From a regional perspective, North America holds a significant share of the customer churn software market, driven by the presence of major technology companies and the early adoption of advanced analytics solutions. However, the Asia Pacific region is expected to witness the highest growth rate during the forecast period. Factors such as the rapid digitalization of economies, increasing investments in AI and machine learning, and the growing focus on customer-centric strategies in emerging markets are fueling the demand for customer churn software in this region.



    Component Analysis



    The customer churn software market is segmented into two primary components: software and services. The software segment includes the actual customer churn solutions, while the services segment encompasses implementation, training, support, and consulting services. The software segment is expected to dominate the market due to the high demand for advanced analytics and predictive tools. Companies across various industries are increasingly adopting software solutions to gain insights into customer behavior and predict churn. The software segment's growth is further supported by continuous advancements in AI and machine learning technologies, which enhance the capabilities of customer churn solutions.



    The services segment, although smaller in comparison to the software segment, plays a crucial role in the market. Services such as implementation and training ensure that organizations can effectively deploy and utilize customer churn software. Support and consulting services are equally important, as they help companies optimize their software usage and develop customized strategies to address specific churn-related challenges. The demand for these services is expected to grow in tandem with the adoption of customer churn software, as businesses seek to maximize their return on investment and achieve better customer retention outcomes.



    Moreover, the integration of customer churn software with existing CRM systems and other business applications is becoming increasingly important. This integration enables a seamless flow of data and enhances the overall efficiency of customer retention efforts. As a result, solutions that offer robust integration capa

  9. t

    Telco_Customer_churn_Data

    • test.researchdata.tuwien.at
    bin, csv, png
    Updated Apr 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erum Naz; Erum Naz; Erum Naz; Erum Naz (2025). Telco_Customer_churn_Data [Dataset]. http://doi.org/10.82556/b0ch-cn44
    Explore at:
    png, csv, binAvailable download formats
    Dataset updated
    Apr 28, 2025
    Dataset provided by
    TU Wien
    Authors
    Erum Naz; Erum Naz; Erum Naz; Erum Naz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 28, 2025
    Description

    Context and Methodology

    The dataset originates from the research domain of Customer Churn Prediction in the Telecom Industry. It was created as part of the project "Data-Driven Churn Prediction: ML Solutions for the Telecom Industry," completed within the Data Stewardship course (Master programme Data Science, TU Wien).

    The primary purpose of this dataset is to support machine learning model development for predicting customer churn based on customer demographics, service usage, and account information.
    The dataset enables the training, testing, and evaluation of classification algorithms, allowing researchers and practitioners to explore techniques for customer retention optimization.

    The dataset was originally obtained from the IBM Accelerator Catalog and adapted for academic use. It was uploaded to TU Wien’s DBRepo test system and accessed via SQLAlchemy connections to the MariaDB environment.

    Technical Details

    The dataset has a tabular structure and was initially stored in CSV format. It contains:

    • Rows: 7,043 customer records

    • Columns: 21 features including customer attributes (gender, senior citizen status, partner status), account information (tenure, contract type, payment method), service usage (internet service, streaming TV, tech support), and the target variable (Churn: Yes/No).

    Naming Convention:

    • The table in the database is named telco_customer_churn_data.

    Software Requirements:

    • To open and work with the dataset, any standard database client or programming language supporting MariaDB connections can be used (e.g., Python etc).

    • For machine learning applications, libraries such as pandas, scikit-learn, and joblib are typically used.

    Additional Resources:

    Further Details

    When reusing the dataset, users should be aware:

    • Licensing: The dataset is shared under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

    • Use Case Suitability: The dataset is best suited for classification tasks, particularly binary classification (churn vs. no churn).

    • Metadata Standards: Metadata describing the dataset adheres to FAIR principles and is supplemented by CodeMeta and Croissant standards for improved interoperability.

  10. f

    Details of feature variables of the data set.

    • plos.figshare.com
    xls
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ke Peng; Yan Peng; Wenguang Li (2023). Details of feature variables of the data set. [Dataset]. http://doi.org/10.1371/journal.pone.0289724.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ke Peng; Yan Peng; Wenguang Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, with the continuous improvement of the financial system and the rapid development of the banking industry, the competition of the banking industry itself has intensified. At the same time, with the rapid development of information technology and Internet technology, customers’ choice of financial products is becoming more and more diversified, and customers’ dependence and loyalty to banking institutions is becoming less and less, and the problem of customer churn in commercial banks is becoming more and more prominent. How to predict customer behavior and retain existing customers has become a major challenge for banks to solve. Therefore, this study takes a bank’s business data on Kaggle platform as the research object, uses multiple sampling methods to compare the data for balancing, constructs a bank customer churn prediction model for churn identification by GA-XGBoost, and conducts interpretability analysis on the GA-XGBoost model to provide decision support and suggestions for the banking industry to prevent customer churn. The results show that: (1) The applied SMOTEENN is more effective than SMOTE and ADASYN in dealing with the imbalance of banking data. (2) The F1 and AUC values of the model improved and optimized by XGBoost using genetic algorithm can reach 90% and 99%, respectively, which are optimal compared to other six machine learning models. The GA-XGBoost classifier was identified as the best solution for the customer churn problem. (3) Using Shapley values, we explain how each feature affects the model results, and analyze the features that have a high impact on the model prediction, such as the total number of transactions in the past year, the amount of transactions in the past year, the number of products owned by customers, and the total sales balance. The contribution of this paper is mainly in two aspects: (1) this study can provide useful information from the black box model based on the accurate identification of churned customers, which can provide reference for commercial banks to improve their service quality and retain customers; (2) it can provide reference for customer churn early warning models of other related industries, which can help the banking industry to maintain customer stability, maintain market position and reduce corporate losses.

  11. C

    Customer Churn Analysis Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jul 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Customer Churn Analysis Software Report [Dataset]. https://www.datainsightsmarket.com/reports/customer-churn-analysis-software-1390551
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Jul 19, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Customer Churn Analysis Software market is experiencing robust growth, driven by the increasing need for businesses to understand and mitigate customer attrition. The market's expansion is fueled by several factors, including the rising adoption of cloud-based solutions, the proliferation of big data analytics, and the growing demand for predictive analytics capabilities to proactively identify at-risk customers. Businesses across diverse sectors, including SaaS, e-commerce, and telecommunications, are increasingly leveraging these sophisticated tools to gain actionable insights into customer behavior, personalize their offerings, and improve customer retention strategies. This market is characterized by a competitive landscape with both established players like Adobe and Google, and specialized niche providers such as Infer and Churnly Technologies Limited. The integration of AI and machine learning capabilities within these platforms is a prominent trend, enabling more accurate prediction models and automated interventions to reduce churn. While the initial investment in such software can be a restraint for some smaller businesses, the long-term return on investment, in terms of improved customer retention and reduced acquisition costs, is a compelling driver for market growth. The forecast period (2025-2033) is expected to witness significant expansion, building upon the historical growth from 2019-2024. Assuming a conservative CAGR (let's estimate it at 15% based on industry trends), and a 2025 market size of $5 billion (a reasonable estimate given the presence of major players and the importance of the sector), the market is projected to reach approximately $17 billion by 2033. This expansion will be propelled by continuous technological advancements, the growing adoption of subscription-based business models, and a heightened focus on customer experience management across industries. Regional variations will likely exist, with North America and Europe leading the market initially due to higher adoption rates and technological infrastructure, but emerging markets in Asia-Pacific are expected to show significant growth in the later years of the forecast period. The competitive landscape will remain dynamic, with mergers, acquisitions, and the emergence of innovative solutions shaping the future of customer churn analysis software.

  12. Bank Customer Churn Dataset

    • kaggle.com
    Updated Aug 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaurav Topre (2022). Bank Customer Churn Dataset [Dataset]. https://www.kaggle.com/datasets/gauravtopre/bank-customer-churn-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 30, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gaurav Topre
    Description

    This dataset is for ABC Multistate bank with following columns:

    1. customer_id, unused variable.
    2. credit_score, used as input.
    3. country, used as input.
    4. gender, used as input.
    5. age, used as input.
    6. tenure, used as input.
    7. balance, used as input.
    8. products_number, used as input.
    9. credit_card, used as input.
    10. active_member, used as input.
    11. estimated_salary, used as input.
    12. churn, used as the target. 1 if the client has left the bank during some period or 0 if he/she has not.

    Aim is to Predict the Customer Churn for ABC Bank.

    https://miro.medium.com/max/737/1*Xap6OxaZvD7C7eMQKkaHYQ.jpeg" alt="">

  13. w

    Global Customer Churn Software Market Research Report: By Deployment Mode...

    • wiseguyreports.com
    Updated Jul 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wWiseguy Research Consultants Pvt Ltd (2024). Global Customer Churn Software Market Research Report: By Deployment Mode (On-Premise, Cloud-Based), By Organization Size (Small and Medium-Sized Enterprises (SMEs), Large Enterprises), By Industry Vertical (Retail and Consumer Goods, Manufacturing, Healthcare, Financial Services, Information Technology, Telecommunications, Transportation and Logistics), By Functionality (Churn Prediction, Customer Segmentation, Customer Journey Analysis, Real-Time Monitoring), By Data Integration (Customer Relationship Management (CRM) Systems, Marketing Automation Platforms, Data Warehouses, Other Data Sources) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2032. [Dataset]. https://www.wiseguyreports.com/reports/customer-churn-software-market
    Explore at:
    Dataset updated
    Jul 23, 2024
    Dataset authored and provided by
    wWiseguy Research Consultants Pvt Ltd
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Jan 7, 2024
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2024
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20233.24(USD Billion)
    MARKET SIZE 20243.75(USD Billion)
    MARKET SIZE 203212.1(USD Billion)
    SEGMENTS COVEREDDeployment Mode ,Organization Size ,Industry Vertical ,Functionality ,Data Integration ,Regional
    COUNTRIES COVEREDNorth America, Europe, APAC, South America, MEA
    KEY MARKET DYNAMICSAIpowered churn prediction Realtime customer insights Predictive analytics Cloudbased deployment Integration with CRM systems
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDHubSpot ,Oracle ,Zoho ,Freshworks ,Pegasystems ,Mixpanel ,Zendesk ,Medallia ,Adobe ,IBM ,Salesforce ,Amplitude ,SAP ,Qualtrics ,Microsoft
    MARKET FORECAST PERIOD2024 - 2032
    KEY MARKET OPPORTUNITIESAIpowered churn prediction Personalized churn prevention strategies Predictive analytics for proactive customer retention Selfservice churn management tools Integration with CRM and other business systems
    COMPOUND ANNUAL GROWTH RATE (CAGR) 15.79% (2024 - 2032)
  14. D

    AI-Powered Customer Churn Prediction Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Jun 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). AI-Powered Customer Churn Prediction Market Research Report 2033 [Dataset]. https://dataintelo.com/report/ai-powered-customer-churn-prediction-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Jun 28, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    AI-Powered Customer Churn Prediction Market Outlook




    According to our latest research, the AI-powered customer churn prediction market size reached USD 1.58 billion globally in 2024, with a robust CAGR of 19.7% expected from 2025 to 2033. Driven by rapid digital transformation and the increasing need for predictive analytics across sectors, the market is forecasted to attain a value of USD 7.57 billion by 2033. The growth of this market is primarily attributed to the escalating adoption of AI and machine learning technologies by enterprises seeking to reduce customer attrition, optimize retention strategies, and enhance overall customer lifetime value, as per the latest industry research.




    One of the fundamental growth drivers for the AI-powered customer churn prediction market is the proliferation of customer data and the imperative need for businesses to leverage this data to drive actionable insights. With the advent of digital touchpoints, organizations are now able to collect vast amounts of structured and unstructured data from various customer interactions. This data, when processed using advanced AI and machine learning algorithms, empowers companies to predict potential churn with high accuracy. As a result, businesses across industries such as telecommunications, BFSI, retail, and healthcare are increasingly investing in AI-powered churn prediction solutions to proactively identify at-risk customers and implement targeted retention strategies, thereby reducing revenue loss and improving profitability.




    Another significant factor fueling market expansion is the growing emphasis on customer experience and personalization. In today's hyper-competitive landscape, retaining existing customers has become more cost-effective than acquiring new ones. AI-powered churn prediction tools enable organizations to segment their customer base, understand behavior patterns, and tailor interventions for individual customers. This level of personalization not only helps in reducing churn rates but also enhances customer satisfaction and loyalty. The integration of AI-driven insights into CRM systems and marketing automation platforms further streamlines the process, making it easier for businesses to act on predictions in real time. Moreover, the rising adoption of cloud-based solutions has made these technologies more accessible to small and medium enterprises (SMEs), broadening the market’s reach.




    The surge in demand for scalable, real-time analytics platforms is also contributing to market growth. Enterprises are increasingly seeking AI-powered solutions that can integrate seamlessly with their existing IT infrastructure, deliver instant insights, and scale as their data grows. The shift towards cloud deployment models has accelerated this trend, offering cost-effective, flexible, and easily deployable churn prediction solutions. Additionally, advancements in natural language processing (NLP), deep learning, and big data analytics are further enhancing the accuracy and reliability of churn prediction models. As organizations strive to stay ahead of the competition by minimizing customer attrition, the demand for sophisticated, AI-driven predictive analytics tools continues to rise.




    Regionally, North America holds the largest market share, followed by Europe and Asia Pacific. The dominance of North America can be attributed to the early adoption of AI technologies, presence of major technology vendors, and a strong focus on customer-centric strategies among enterprises in the region. Europe is also witnessing significant growth, driven by stringent regulations around data protection and a growing emphasis on customer retention in industries like BFSI and retail. The Asia Pacific region is expected to exhibit the highest CAGR during the forecast period, fueled by rapid digitalization, increasing investments in AI, and the expansion of e-commerce and telecommunications sectors. Latin America and the Middle East & Africa are also experiencing gradual adoption, primarily in financial services and telecommunications.



    Component Analysis




    The component segment of the AI-powered customer churn prediction market is categorized into software and services. The software segment dominates the market, accounting for the largest share in 2024, owing to the widespread deployment of advanced AI and machine learning platforms

  15. A dataset for Customer Churn Prediction for Video Websites Incorporating...

    • zenodo.org
    Updated May 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chen shuofeng; Asif Mahbub Karim; Wenbin Ma; Guoen Xia; Chen shuofeng; Asif Mahbub Karim; Wenbin Ma; Guoen Xia (2023). A dataset for Customer Churn Prediction for Video Websites Incorporating Behavioral Sequence Features [Dataset]. http://doi.org/10.5281/zenodo.7886937
    Explore at:
    Dataset updated
    May 3, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Chen shuofeng; Asif Mahbub Karim; Wenbin Ma; Guoen Xia; Chen shuofeng; Asif Mahbub Karim; Wenbin Ma; Guoen Xia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In order to study the issue of network customer churn, the iQiyi customer dataset was collected. Behavioral sequence features were extracted from it to build a deep learning model and experiments were conducted.Here, we provide the corresponding raw dataset, including all the data we used.

  16. m

    Customer Churn Analysis Software Market Size, Share & Trends Analysis 2033

    • marketresearchintellect.com
    Updated Jun 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2024). Customer Churn Analysis Software Market Size, Share & Trends Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/customer-churn-analysis-software-market/
    Explore at:
    Dataset updated
    Jun 17, 2024
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Dive into Market Research Intellect's Customer Churn Analysis Software Market Report, valued at USD 2.1 billion in 2024, and forecast to reach USD 4.8 billion by 2033, growing at a CAGR of 10.2% from 2026 to 2033.

  17. f

    Confusion matrix.

    • plos.figshare.com
    xls
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ke Peng; Yan Peng; Wenguang Li (2023). Confusion matrix. [Dataset]. http://doi.org/10.1371/journal.pone.0289724.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ke Peng; Yan Peng; Wenguang Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, with the continuous improvement of the financial system and the rapid development of the banking industry, the competition of the banking industry itself has intensified. At the same time, with the rapid development of information technology and Internet technology, customers’ choice of financial products is becoming more and more diversified, and customers’ dependence and loyalty to banking institutions is becoming less and less, and the problem of customer churn in commercial banks is becoming more and more prominent. How to predict customer behavior and retain existing customers has become a major challenge for banks to solve. Therefore, this study takes a bank’s business data on Kaggle platform as the research object, uses multiple sampling methods to compare the data for balancing, constructs a bank customer churn prediction model for churn identification by GA-XGBoost, and conducts interpretability analysis on the GA-XGBoost model to provide decision support and suggestions for the banking industry to prevent customer churn. The results show that: (1) The applied SMOTEENN is more effective than SMOTE and ADASYN in dealing with the imbalance of banking data. (2) The F1 and AUC values of the model improved and optimized by XGBoost using genetic algorithm can reach 90% and 99%, respectively, which are optimal compared to other six machine learning models. The GA-XGBoost classifier was identified as the best solution for the customer churn problem. (3) Using Shapley values, we explain how each feature affects the model results, and analyze the features that have a high impact on the model prediction, such as the total number of transactions in the past year, the amount of transactions in the past year, the number of products owned by customers, and the total sales balance. The contribution of this paper is mainly in two aspects: (1) this study can provide useful information from the black box model based on the accurate identification of churned customers, which can provide reference for commercial banks to improve their service quality and retain customers; (2) it can provide reference for customer churn early warning models of other related industries, which can help the banking industry to maintain customer stability, maintain market position and reduce corporate losses.

  18. f

    Comparison of GA-XGBoost with XGBoost and LightGBM test results.

    • figshare.com
    xls
    Updated Dec 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ke Peng; Yan Peng; Wenguang Li (2023). Comparison of GA-XGBoost with XGBoost and LightGBM test results. [Dataset]. http://doi.org/10.1371/journal.pone.0289724.t008
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 8, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Ke Peng; Yan Peng; Wenguang Li
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of GA-XGBoost with XGBoost and LightGBM test results.

  19. f

    S1 Data -

    • plos.figshare.com
    zip
    Updated Oct 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yancong Zhou; Wenyue Chen; Xiaochen Sun; Dandan Yang (2023). S1 Data - [Dataset]. http://doi.org/10.1371/journal.pone.0292466.s001
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 11, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Yancong Zhou; Wenyue Chen; Xiaochen Sun; Dandan Yang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analyzing customers’ characteristics and giving the early warning of customer churn based on machine learning algorithms, can help enterprises provide targeted marketing strategies and personalized services, and save a lot of operating costs. Data cleaning, oversampling, data standardization and other preprocessing operations are done on 900,000 telecom customer personal characteristics and historical behavior data set based on Python language. Appropriate model parameters were selected to build BPNN (Back Propagation Neural Network). Random Forest (RF) and Adaboost, the two classic ensemble learning models were introduced, and the Adaboost dual-ensemble learning model with RF as the base learner was put forward. The four models and the other four classical machine learning models-decision tree, naive Bayes, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) were utilized respectively to analyze the customer churn data. The results show that the four models have better performance in terms of recall rate, precision rate, F1 score and other indicators, and the RF-Adaboost dual-ensemble model has the best performance. Among them, the recall rates of BPNN, RF, Adaboost and RF-Adaboost dual-ensemble model on positive samples are respectively 79%, 90%, 89%,93%, the precision rates are 97%, 99%, 98%, 99%, and the F1 scores are 87%, 95%, 94%, 96%. The RF-Adaboost dual-ensemble model has the best performance, and the three indicators are 10%, 1%, and 6% higher than the reference. The prediction results of customer churn provide strong data support for telecom companies to adopt appropriate retention strategies for pre-churn customers and reduce customer churn.

  20. C

    Customer Churn Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Apr 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Customer Churn Software Report [Dataset]. https://www.datainsightsmarket.com/reports/customer-churn-software-1409294
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Apr 17, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Customer Churn Software market is experiencing robust growth, driven by increasing customer acquisition costs and the rising need for businesses to retain their existing clientele. The market, estimated at $5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching approximately $15 billion by 2033. This expansion is fueled by several key factors: the escalating adoption of cloud-based solutions offering scalability and cost-effectiveness; the growing importance of data analytics in predicting and mitigating churn; and the increasing demand for personalized customer experiences to enhance loyalty. Key industry players, including established giants like IBM, Salesforce, and Microsoft, along with agile startups specializing in AI-powered churn prediction, are vying for market share, driving innovation and competition. Segmentation analysis reveals that the cloud-based segment currently dominates, owing to its flexibility and accessibility, while the telecommunications, banking, and e-commerce sectors represent the largest application segments, reflecting their high customer volumes and intense competition. Significant regional variations exist. North America currently holds a leading position, due to the high technological adoption rate and the presence of numerous technology companies. However, the Asia-Pacific region is anticipated to witness the fastest growth rate in the coming years, fueled by increasing digitalization and expanding internet penetration across emerging markets like India and China. Despite these positive trends, challenges remain. Data privacy concerns and the complexity of implementing and integrating churn prediction software can impede adoption. Furthermore, the need for continuous investment in software updates and skilled personnel to manage the systems presents an ongoing hurdle for businesses. The market's future hinges on the continued development of sophisticated AI algorithms, the enhancement of user-friendly interfaces, and the provision of robust customer support to address implementation complexities.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
CAT Reloaded || Data Science circle (2025). Bank Customer Churn [Dataset]. https://www.kaggle.com/datasets/cat-reloaded-data-science/bank-customer-churn/versions/1
Organization logo

Bank Customer Churn

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 14, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
CAT Reloaded || Data Science circle
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Bank Customer Churn Dataset is a collection of data related to customers of a bank who have either left (churned) or stayed with the bank. This dataset is typically used for predictive modeling to identify patterns and factors that lead to customer churn, enabling banks to take proactive measures to retain customers.

  • id: Unique identifier for each customer.

  • CustomerId: Unique identifier for the customer account.

  • Surname: Last name of the customer.

  • CreditScore: Numeric representation of the customer's creditworthiness.

  • Geography:str, Gender:str:Country or region where the customer resides ,Gender of the customer (e.g., Male, Female).

  • Age: Age of the customer.

  • Tenure: Number of years the customer has been with the bank.

  • Balance: Current balance in the customer's account.

  • NumOfProducts: Number of bank products the customer uses.

  • HasCrCard: Binary indicator (0 or 1) for whether the customer has a credit card.

  • IsActiveMember: Binary indicator (0 or 1) for whether the customer is an active member.

  • EstimatedSalary: Estimated salary of the customer.

  • Exited: Binary indicator (0 or 1) for whether the customer has churned (the target).

Search
Clear search
Close search
Google apps
Main menu