43 datasets found

E-Commerce Customer Satisfaction Survey
kaggle.com
Updated Sep 19, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rachit Khandelwal (2020). E-Commerce Customer Satisfaction Survey [Dataset]. https://www.kaggle.com/datasets/tihcarkhandelwal/ecommerce-customer-satisfaction-survey
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 19, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Rachit Khandelwal
Description
Dataset

This dataset was created by Rachit Khandelwal

Contents
US Airline passenger satisfaction survey
kaggle.com
Updated Oct 28, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
najib mrh (2019). US Airline passenger satisfaction survey [Dataset]. https://www.kaggle.com/datasets/najibmh/us-airline-passenger-satisfaction-survey
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 28, 2019
Dataset provided by
Kaggle
Authors
najib mrh
Description
Dataset

This dataset was created by najib mrh

Contents

US Airline passenger satisfaction survey
Airline Reviews Dataset
kaggle.com
Updated Mar 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sujal Suthar (2024). Airline Reviews Dataset [Dataset]. https://www.kaggle.com/datasets/sujalsuthar/airlines-reviews
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 6, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sujal Suthar
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset contains reviews of the top 10 rated airlines in 2023 sourced from the Airline Quality (https://www.airlinequality.com) website. The reviews cover various aspects of the flight experience, including seat comfort, staff service, food and beverages, inflight entertainment, value for money, and overall rating. The dataset is suitable for sentiment analysis, customer satisfaction analysis, and other similar tasks.

Usage - Download the dataset file airlines_reviews.csv. - Use the dataset for analysis, visualization, and machine learning tasks.

List of Airlines 1. Singapore Airlines 2. Qatar Airways 3. All Nippon Airways 4. Emirates 5. Japan Airlines 6. Turkish Airlines 7. Air France 8. Cathay Pacific Airways 9. EVA Air 10.Korean Air

This dataset is provided under the MIT License.
customer satisfaction prediction
kaggle.com
Updated Mar 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AmitVerma2030 (2024). customer satisfaction prediction [Dataset]. https://www.kaggle.com/datasets/amitverma2030/customer-satisfaction-prediction
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 2, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
AmitVerma2030
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by AmitVerma2030

Released under MIT

Contents
Customer Satisfaction
kaggle.com
Updated Mar 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Subodh Kumar (2025). Customer Satisfaction [Dataset]. https://www.kaggle.com/datasets/subodh3111/customer-satisfaction/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 25, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Subodh Kumar
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Subodh Kumar

Released under MIT

Contents
Airlines Customer satisfaction
kaggle.com
Updated May 1, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ahmad Noor (2020). Airlines Customer satisfaction [Dataset]. https://www.kaggle.com/datasets/ahmednour/mmmmmmm/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 1, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ahmad Noor
Description
Dataset

This dataset was created by Ahmad Noor

Contents
Customer360Insights
kaggle.com
Updated Jun 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dave Darshan (2024). Customer360Insights [Dataset]. https://www.kaggle.com/datasets/davedarshan/customer360insights
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 9, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Dave Darshan
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Customer360Insights

The Customer360Insights dataset is a synthetic collection meticulously designed to mirror the multifaceted nature of customer interactions within an e-commerce platform. It encompasses a wide array of variables, each serving as a pillar to support various analytical explorations. Here’s a breakdown of the dataset and the potential analyses it enables:

Dataset Description

Customer Demographics: Includes FullName, Gender, Age, CreditScore, and MonthlyIncome. These variables provide a demographic snapshot of the customer base, allowing for segmentation and targeted marketing analysis.

Geographical Data: Comprising Country, State, and City, this section facilitates location-based analytics, market penetration studies, and regional sales performance.

Product Information: Details like Category, Product, Cost, and Price enable product trend analysis, profitability assessment, and inventory optimization.

Transactional Data: Captures the customer journey through SessionStart, CartAdditionTime, OrderConfirmation, OrderConfirmationTime, PaymentMethod, and SessionEnd. This rich temporal data can be used for funnel analysis, conversion rate optimization, and customer behavior modeling.

Post-Purchase Details: With OrderReturn and ReturnReason, analysts can delve into return rate calculations, post-purchase satisfaction, and quality control.

Types of Analysis

Descriptive Analytics: Understand basic metrics like average monthly income, most common product categories, and typical credit scores.

Predictive Analytics: Use machine learning to predict credit risk or the likelihood of a purchase based on demographics and session activity.

Customer Segmentation: Group customers by demographics or purchasing behavior to tailor marketing strategies.

Geospatial Analysis: Examine sales distribution across different regions and optimize logistics. Time Series Analysis: Study the seasonality of purchases and session activities over time.

Funnel Analysis: Evaluate the customer journey from session start to order confirmation and identify drop-off points.

Cohort Analysis: Track customer cohorts over time to understand retention and repeat purchase patterns.

Market Basket Analysis: Discover product affinities and develop cross-selling strategies.

This dataset is a playground for data enthusiasts to practice cleaning, transforming, visualizing, and modeling data. Whether you’re conducting A/B testing for marketing campaigns, forecasting sales, or building customer profiles, Customer360Insights offers a rich, realistic dataset for honing your data science skills.

Curious about how I created the data? Feel free to click here and take a peek! 😉

📊🔍 Good Luck and Happy Analysing 🔍📊
US Airlines Twitter (Over time)
kaggle.com
Updated Nov 18, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2022). US Airlines Twitter (Over time) [Dataset]. https://www.kaggle.com/datasets/thedevastator/sentiment-analysis-of-us-airline-twitter-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 18, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
The Devastator
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
US Airlines Twitter (Over time)

Study the trend customer satisfaction over time

About this dataset

The columns in the dataset include index, unit id, golden, unit state, trusted judgments, last judgment at, airline sentiment, airline sentiment confidence, negative reason, negative reason confidence, airline_sentiment_gold and retweet count. There is also text included for each tweet as well as tweet location and user timezone.

Using this dataset, you can get a feel for how customers of various airlines feel about their service. You can use the data to analyze trends over time or compare different airlines. Some research ideas include using airline sentiment to predict the stock market or using the negativereason data to help airlines improve their customer service

How to use the dataset

Looking at this dataset, you can get a feel for how customers of various airlines feel about their service. The data includes the airline, the tweet text, the date of the tweet, and various other information. You can use this to analyze trends over time or compare different airlines

Research Ideas

Using airline sentiment to predict the stock market - is there a correlation between how the public perceives an airline and how that airline's stock performs?

Using negativereason data to help airlines improve their customer service - which negative reasons are mentioned most often? Are there certain airlines that are consistently mentioned for specific reasons?

Use the tweet data to map out airline hot spots - where do people tend to tweet about certain airlines the most? Is there a geographic pattern to sentiment about specific airlines?

Acknowledgements

If you use this dataset in your research, please credit Social Media Data

License

License: Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) - You are free to: - Share - copy and redistribute the material in any medium or format for non-commercial purposes only. - Adapt - remix, transform, and build upon the material for non-commercial purposes only. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - You may not: - Use the material for commercial purposes.

Columns

File: Airline-Sentiment-2-w-AA.csv | Column name | Description | |:---------------------------|:-----------------------------------------------------------------------------| | _golden | This column is the gold standard column. (Boolean) | | _unit_state | This column is the state of the unit. (String) | | _trusted_judgments | This column is the number of trusted judgments. (Numeric) | | _last_judgment_at | This column is the timestamp of the last judgment. (String) | | airline_sentiment | This column is the sentiment of the tweet. (String) | | negativereason | This column is the negative reason for the sentiment. (String) | | airline_sentiment_gold | This column is the gold standard sentiment of the tweet. (String) | | name | This column is the name of the airline. (String) | | negativereason_gold | This column is the gold standard negative reason for the sentiment. (String) | | retweet_count | This column is the number of retweets. (Numeric) | | text | This column is the text of the tweet. (String) | | tweet_coord | This column is the coordinates of the tweet. (String) | | tweet_created | This column is the timestamp of the tweet. (String) | | tweet_location | This column is the location of the tweet. (String) | | user_timezone | This column is the timezone of the user. (String) |
A
‘846 Companies Ranked! (2021)’ analyzed by Analyst-2
analyst-2.ai
Updated Aug 4, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2020). ‘846 Companies Ranked! (2021)’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-846-companies-ranked-2021-c252/a66d32ff/?iid=002-053&v=presentation
Explore at:
Dataset updated
Aug 4, 2020
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘846 Companies Ranked! (2021)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/axeltorbenson/846-companies-ranked on 28 January 2022.

--- Dataset description provided by original source is as follows ---

This file contains data on 846 companies and their ranking based on 6 characteristics: customer satisfaction, employee engagement and development, innovation, social responsibility, financial strength, and effectiveness. These rank were made by the Drucker Institute. Obviously these rankings are not and can not be accurate, but are the opinion of and are influenced wholly by the ranking criterium of the Drucker Institute.

Source: https://www.drucker.institute/2021-drucker-institute-company-ranking/

--- Original source retains full ownership of the source dataset ---
A
‘TourPackagePrediction’ analyzed by Analyst-2
analyst-2.ai
Updated Jan 28, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2022). ‘TourPackagePrediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-tourpackageprediction-ec5e/latest
Explore at:
Dataset updated
Jan 28, 2022
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘TourPackagePrediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/sanamps/tourpackageprediction on 28 January 2022.

--- Dataset description provided by original source is as follows ---

Context

You are a Data Scientist for a tourism company named "Lets Travel". The Policy Maker of the company wants to enable and establish a viable business model to expand the customer base. A viable business model is a central concept that helps you to understand the existing ways of doing the business and how to change the ways for the benefit of the tourism sector. One of the ways to expand the customer base is to introduce a new offering of packages. Currently, there are 5 types of packages the company is offering - Basic, Standard, Deluxe, Super Deluxe, King. Looking at the data of the last year, we observed that 18% of the customers purchased the packages. However, the marketing cost was quite high because customers were contacted at random without looking at the available information. The company is now planning to launch a new product i.e. Wellness Tourism Package. Wellness Tourism is defined as Travel that allows the traveler to maintain, enhance or kick-start a healthy lifestyle, and support or increase one's sense of well-being. However, this time company wants to harness the available data of existing and potential customers to make the marketing expenditure more efficient. You as a Data Scientist at "Visit with us" travel company have to analyze the customers' data and information to provide recommendations to the Policy Maker and Marketing Team and also build a model to predict the potential customer who is going to purchase the newly introduced travel package.

Objective

To predict which customer is more likely to purchase the newly introduced travel package

About the Data

Customer details: 1. CustomerID: Unique customer ID 2. ProdTaken: Whether the customer has purchased a package or not (0: No, 1: Yes) 3. Age: Age of customer 4. TypeofContact: How customer was contacted (Company Invited or Self Inquiry) 5. CityTier: City tier depends on the development of a city, population, facilities, and living standards. The categories are ordered i.e. Tier 1 > Tier 2 > Tier 3 6. Occupation: Occupation of customer 7. Gender: Gender of customer 8. NumberOfPersonVisiting: Total number of persons planning to take the trip with the customer 9. PreferredPropertyStar: Preferred hotel property rating by customer 10. MaritalStatus: Marital status of customer 11. NumberOfTrips: Average number of trips in a year by customer 12. Passport: The customer has a passport or not (0: No, 1: Yes) 13. OwnCar: Whether the customers own a car or not (0: No, 1: Yes) 14. NumberOfChildrenVisiting: Total number of children with age less than 5 planning to take the trip with the customer 15. Designation: Designation of the customer in the current organization 16. MonthlyIncome: Gross monthly income of the customer

Customer interaction data: 1. PitchSatisfactionScore: Sales pitch satisfaction score 2. ProductPitched: Product pitched by the salesperson 3. NumberOfFollowups: Total number of follow-ups has been done by the salesperson after the sales pitch 4. DurationOfPitch: Duration of the pitch by a salesperson to the customer

--- Original source retains full ownership of the source dataset ---
Software Customer Service Satisfaction
kaggle.com
Updated Jun 20, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Luke Marcus (2021). Software Customer Service Satisfaction [Dataset]. https://www.kaggle.com/lukemarcus/software-customer-service-satisfaction/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 20, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Luke Marcus
Description
Dataset

This dataset was created by Luke Marcus

Contents
c
USA hotels dataset from booking
crawlfeeds.com
csv, zip
Updated Jun 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Crawl Feeds (2025). USA hotels dataset from booking [Dataset]. https://crawlfeeds.com/datasets/usa-hotels-dataset-from-booking
Explore at:
csv, zipAvailable download formats
Dataset updated
Jun 15, 2025
Dataset authored and provided by
Crawl Feeds
License
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Area covered
United States
Description
The USA Hotels Dataset from Booking.com is a rich collection of data related to hotels across the United States, extracted from Booking.com. This dataset includes essential information about hotel listings, such as hotel names, locations, prices, star ratings, customer reviews, and amenities offered. It's an ideal resource for researchers, data analysts, and businesses looking to explore the hospitality industry, analyze customer preferences, and understand pricing patterns in the U.S. hotel market.

Access 3 million+ US hotel reviews — submit your request today.

Key Features:

Hotel Information: Includes hotel names, addresses, star ratings, and descriptions.

Pricing Data: Nightly rates, discounts, and price variations by room type and season.

Customer Reviews: Aggregated ratings and detailed user feedback from verified guests.

Amenities: Detailed list of amenities provided by each hotel (e.g., Wi-Fi, parking, spa, swimming pool).

Geographical Information: Hotel locations including city, state, and proximity to major landmarks.

Use Cases:

Sentiment Analysis: Analyze customer reviews to gauge hotel service quality and guest satisfaction.

Price Analysis: Compare pricing across different hotels, locations, and time periods to identify trends.

Recommendation Systems: Build recommendation engines based on customer ratings, reviews, and preferences.

Tourism and Hospitality Research: Understand patterns in hotel demand and services across various U.S. cities.
F
British English Call Center Data for Telecom AI
futurebeeai.com
wav
Updated Aug 1, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FutureBee AI (2022). British English Call Center Data for Telecom AI [Dataset]. https://www.futurebeeai.com/dataset/speech-dataset/telecom-call-center-conversation-english-uk
Explore at:
wavAvailable download formats
Dataset updated
Aug 1, 2022
Dataset provided by
FutureBeeAI
Authors
FutureBee AI
License
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
Area covered
United Kingdom
Dataset funded by
FutureBeeAI
Description
Introduction
This UK English Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for English-speaking telecom customers. Featuring over 30 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native UK English speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.
•Participant Diversity:
•
Speakers: 60 native UK English speakers from our verified contributor pool.

•
Regions: Representing multiple provinces across United Kingdom to ensure coverage of various accents and dialects.

•
Participant Profile: Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.

•Recording Details:
•
Conversation Nature: Naturally flowing, unscripted interactions between agents and customers.

•
Call Duration: Ranges from 5 to 15 minutes.

•
Audio Format: Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.

•
Recording Environment: Captured in clean conditions with no echo or background noise.

Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral ensuring broad scenario coverage for telecom AI development.
•Inbound Calls:
•Phone Number Porting
•Network Connectivity Issues
•Billing and Payments
•Technical Support
•Service Activation
•International Roaming Enquiry
•Refund Requests and Billing Adjustments
•Emergency Service Access, and others
•Outbound Calls:
•Welcome Calls & Onboarding
•Payment Reminders
•Customer Satisfaction Surveys
•Technical Updates
•Service Usage Reviews
•Network Complaint Status Calls, and more
This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Transcription Includes:
•Speaker-Segmented Dialogues
•Time-coded Segments
•Non-speech Tags (e.g., pauses, coughs)
•High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.
These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.
Metadata
Rich metadata is available for each participant and conversation:
•
Participant Metadata: ID, age, gender, accent, dialect, and location.

<div style="margin-top:10px; margin-bottom: 10px; padding-left: 30px; display: flex; gap: 16px; align-items:
Customer_support_data
kaggle.com
Updated Jun 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Akash Bommidi (2025). Customer_support_data [Dataset]. https://www.kaggle.com/datasets/akashbommidi/customer-support-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 2, 2025
Dataset provided by
Kaggle
Authors
Akash Bommidi
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset contains detailed records of customer interactions handled by a customer service team through various communication channels such as inbound calls, outbound calls, and digital touchpoints. It includes over 85,000 entries with information related to the nature of the issue, product categories, agent details, and customer satisfaction scores (CSAT).

Key features include:

Issue Metadata: Timestamps for when the issue was reported and responded to.

Categorization: High-level and sub-level issue categories for better analysis.

Agent Information: Names, supervisors, managers, shift, and tenure bucket.

Customer Feedback: CSAT scores and free-text customer remarks.

Transactional Data:Order IDs, product categories, item prices, and customer city.

This dataset is ideal for exploratory data analysis (EDA), natural language processing (NLP), time-to-resolution analysis, customer satisfaction prediction, and performance benchmarking of service agents.

Feature-wise Explanation

Unique id: A unique identifier for each customer support ticket. Used for tracking, not used in modeling.

channel_name: The communication channel used by the customer (e.g., Email, Chat, Phone), which influences response quality and time.

category: Broad classification of the support issue (e.g., Technical, Billing, Account), useful in understanding issue trends.

Sub-category: More specific issue label under each category (e.g., "Login Failure" under Technical) to capture granular insights.

Customer Remarks: Free-text input from customers about their issue; useful for sentiment analysis or NLP-based features.

Order_id: The ID of the order associated with the issue; may not be directly useful unless joined with order metadata.

order_date_time: Timestamp of the order; can be used to derive delays or time gaps relative to issue date.

Issue_reported at: Time when the customer reported the issue; helps calculate response and resolution delays.

issue_responded: Time when the support agent responded; combined with report time to calculate response duration.

Survey_response_Date: Date when customer gave the CSAT feedback; useful to understand follow-up timing, but not always predictive.

Customer_City: The city where the customer resides; can identify location-based trends or systemic issues.

Product_category: The type of product involved in the support ticket; some product types may result in higher or lower CSAT.

Item_price: Price of the item involved; higher prices might lead to higher customer expectations and affect satisfaction.

connected_handling_time: Total time spent by the agent resolving the issue; excessive durations may signal complexity or inefficiency.

Agent_name: Name of the support agent handling the ticket; can be encoded to understand individual performance impact.

Supervisor: The agent’s supervisor; useful to analyze team-level trends in CSAT.

Manager: The manager overseeing the support process; can help identify management-level influence on support quality.

Tenure Bucket: Agent experience group (e.g., 0–6 months, 6–12 months); more experienced agents might resolve issues better.

Agent Shift: Time shift during which the case was handled (e.g., Day, Night); night shifts might see different trends in CSAT.

CSAT Score (Target Variable): Customer satisfaction score (1 to 5); the main variable we aim to classify using other features.
On-Time Delivery
kaggle.com
Updated Feb 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
willian oliveira gibin (2024). On-Time Delivery [Dataset]. http://doi.org/10.34740/kaggle/dsv/7626529
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/7626529
Dataset updated
Feb 14, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
willian oliveira gibin
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F16731800%2Ff07289aba24685fac1a582143c2f1595%2FIA%20na%20Moda%20A%20Revoluo%20da%20Personalizao%20e%20Recomendao%20de%20Produtos.png?generation=1707941820950377&alt=media" alt="">

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F16731800%2F5108af937119a9b311d93039684db884%2FIA%20na%20Moda%20A%20Revoluo%20da%20Personalizao%20e%20Recomendao%20de%20Produtos%20(1).png?generation=1707941829090831&alt=media" alt="">

an era where e-commerce is booming, the ability to understand and optimize customer experience is paramount for businesses aiming to thrive. An international e-commerce company, specializing in electronic products, has embarked on an ambitious project to delve deep into their customer database to uncover vital insights that could revolutionize their operations. Leveraging advanced machine learning techniques, the company aims to dissect the complex dynamics of customer interactions and product shipments to enhance satisfaction and efficiency.

The foundation of this analytical venture is a robust dataset comprising 10,999 observations across 12 meticulously curated variables. These variables provide a comprehensive overview of the customer journey, from the initial purchase to the final delivery. Key data points include:

ID: A unique identifier for each customer, ensuring precise tracking and personalized insights. Warehouse Block: With the company's expansive warehouse segmented into blocks A through E, this variable helps in logistics optimization and inventory management. Mode of Shipment: Understanding the impact of different shipment methods (Ship, Flight, Road) on customer satisfaction and delivery efficiency. Customer Care Calls: The frequency of customer inquiries serves as an indicator of service quality and customer engagement. Customer Rating: A direct measure of customer satisfaction, with ratings ranging from 1 (lowest) to 5 (highest). Cost of the Product: This financial metric is crucial for pricing strategies and profitability analysis. Prior Purchases: Tracking customers' purchase history aids in predicting future buying behavior and personalizing marketing efforts. Product Importance: Categorizing products based on their importance (low, medium, high) enables tailored handling and prioritization. Gender: Analyzing shopping patterns and preferences across genders. Discount Offered: Examining the impact of discounts on sales volume and customer acquisition. Weight in Grams: The logistical aspect of shipping, influencing costs and delivery methods. Reached on Time: The critical outcome variable indicating whether a product was delivered within the expected timeframe, serving as a benchmark for operational efficiency. The company acknowledges the contribution of the broader data science community by making this dataset publicly available on GitHub, fostering collaborative research and innovation in customer analytics. This initiative is not just about understanding past performances but is aimed at inspiring data-driven strategies that can address pressing questions such as the correlation between customer ratings and on-time deliveries, the effectiveness of customer support, and the influence of product importance on customer satisfaction and delivery success.

This exploratory journey through data is poised to offer actionable insights that could lead to enhanced product shipment tracking, improved customer satisfaction, and ultimately, a competitive edge in the fast-paced world of e-commerce.
Data from: Santander Customer Satisfaction
kaggle.com
Updated May 20, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nika Ioramishvili (2018). Santander Customer Satisfaction [Dataset]. https://www.kaggle.com/ioramishvili/santaner/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 20, 2018
Dataset provided by
Kaggle
Authors
Nika Ioramishvili
Description
Context

There's a story behind every dataset and here's your opportunity to share yours.

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?
oyo-reviews-dataset
kaggle.com
zip
Updated Jun 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deepkumar patel (2023). oyo-reviews-dataset [Dataset]. https://www.kaggle.com/datasets/deeppatel9095/oyo-reviews-dataset
Explore at:
zip(32300432 bytes)Available download formats
Dataset updated
Jun 24, 2023
Authors
Deepkumar patel
Description
The inspiration behind creating the OYO Review Dataset for sentiment analysis was to explore the sentiment and opinions expressed in hotel reviews on the OYO Hotels platform. Analyzing the sentiment of customer reviews can provide valuable insights into the overall satisfaction of guests, identify areas for improvement, and assist in making data-driven decisions to enhance the hotel experience. By collecting and curating this dataset, Deep Patel, Nikki Patel, and Nimil aimed to contribute to the field of sentiment analysis in the context of the hospitality industry. Sentiment analysis allows us to classify the sentiment expressed in textual data, such as reviews, into positive, negative, or neutral categories. This analysis can help hotel management and stakeholders understand customer sentiments, identify common patterns, and address concerns or issues that may affect the reputation and customer satisfaction of OYO Hotels. The dataset provides a valuable resource for training and evaluating sentiment analysis models specifically tailored to the hospitality domain. Researchers, data scientists, and practitioners can utilize this dataset to develop and test various machine learning and natural language processing techniques for sentiment analysis, such as classification algorithms, sentiment lexicons, or deep learning models. Overall, the goal of creating the OYO Review Dataset for sentiment analysis was to facilitate research and analysis in the area of customer sentiments and opinions in the hotel industry. By understanding the sentiment of hotel reviews, businesses can strive to improve their services, enhance customer satisfaction, and make data-driven decisions to elevate the overall guest experience.

Deep Patel: https://www.linkedin.com/in/deep-patel-55ab48199/ Nikki Patel: https://www.linkedin.com/in/nikipatel9/ Nimil lathiya: https://www.linkedin.com/in/nimil-lathiya-059a281b1/
Data from: Bank Customer Churn Prediction
kaggle.com
Updated Mar 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Murilo Zangari (2024). Bank Customer Churn Prediction [Dataset]. https://www.kaggle.com/datasets/murilozangari/customer-churn-from-a-bank/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 21, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Murilo Zangari
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The data will be used to predict whether a customer of the bank will churn. If a customer churns, it means they left the bank and took their business elsewhere. If you can predict which customers are likely to churn, you can take measures to retain them before they do. These measures could be promotions, discounts, or other incentives to boost customer satisfaction and, therefore, retention.

The dataset contains:

10,000 rows – each row is a unique customer of the bank

14 columns:

RowNumber: Row numbers from 1 to 10,000

CustomerId: Customer’s unique ID assigned by bank

Surname: Customer’s last name

CreditScore: Customer’s credit score. This number can range from 300 to 850.

Geography: Customer’s country of residence

Gender: Categorical indicator

Age: Customer’s age (years)

Tenure: Number of years customer has been with bank

Balance: Customer’s bank balance (Euros)

NumOfProducts: Number of products the customer has with the bank

HasCrCard: Indicates whether the customer has a credit card with the bank

IsActiveMember: Indicates whether the customer is considered active

EstimatedSalary: Customer’s estimated annual salary (Euros)

Exited: Indicates whether the customer churned (left the bank)
Food Delivery Data
kaggle.com
Updated Mar 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ADtech_1234 (2024). Food Delivery Data [Dataset]. https://www.kaggle.com/datasets/adtech1234/food-delivery-data/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 19, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
ADtech_1234
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
The dataset titled "Online Delivery Data" comprises 388 entries, each representing an individual's response to a survey concerning their preferences and experiences with online food delivery services in Australia. The dataset is structured into 53 columns, encompassing a wide range of information from demographic details to specific preferences and feedback on online food delivery services. Below is an in-depth description of its structure and the types of information it contains.

Dataset Overview Entries: 388 Attributes: 53 Core Attributes Description Demographic and Background Information

Age: The respondent's age. Gender: The gender of the respondent. Marital Status: Marital status of the respondent (e.g., Single, Married). Occupation: The respondent's occupation. Monthly Income: Monthly income category of the respondent. Educational Qualifications: Educational level achieved by the respondent. City: The city in Australia where the respondent resides. Family size: Number of members in the respondent's family. Service Utilization Preferences

Medium of ordering (P1 and P2): Primary and secondary preferences for ordering mediums, such as food delivery apps or direct calls. Meal preference (P1 and P2): Primary and secondary meal preferences. Preference reasons (P1 and P2): Primary and secondary reasons for their preferences. Perceptions and Attitudes

Various columns capture the respondent's attitudes towards ease and convenience, time-saving aspects, variety of choices, payment options, discounts and offers, food quality, tracking system, and several other factors related to online food delivery. Health and Hygiene Concerns

Specific concerns regarding health, delivery punctuality, hygiene, and past negative experiences with online food delivery services. Service Quality and Feedback

Attributes covering delivery time importance, packaging quality, customer service aspects (such as the number of calls to service and politeness), food freshness, temperature, taste, and quantity. Output: Likely a binary response (e.g., Yes or No) to a specific survey question, which could pertain to the respondent's overall satisfaction or willingness to recommend the service. Reviews: Open-ended feedback from respondents, providing qualitative insights into their experiences. Summary This dataset provides a comprehensive view of consumer preferences, behaviors, and satisfaction levels regarding online food delivery services in Australia. It encompasses a broad spectrum of variables from basic demographic information to detailed opinions on service quality, making it an invaluable resource for analyzing consumer trends, identifying areas for improvement in service delivery, and understanding the factors that influence customer satisfaction and loyalty in the online food delivery industry.
Airline Passenger Reviews
kaggle.com
Updated Jul 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Malhar khatu (2023). Airline Passenger Reviews [Dataset]. https://www.kaggle.com/datasets/malharkhatu/airline-passenger-reviews
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 14, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Malhar khatu
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
The dataset comprises a comprehensive collection of passenger reviews, accompanied by corresponding review categories, offering a holistic view of airline travel experiences. These reviews provide valuable insights into customer satisfaction, service quality, and sentiment analysis, enabling in-depth analysis and informed decision-making within the airline industry. With the combination of review content and categories, this dataset serves as a valuable resource for understanding and enhancing the passenger journey.

Facebook

Twitter

Click to copy link

Link copied

Cite

Rachit Khandelwal (2020). E-Commerce Customer Satisfaction Survey [Dataset]. https://www.kaggle.com/datasets/tihcarkhandelwal/ecommerce-customer-satisfaction-survey

E-Commerce Customer Satisfaction Survey

Explore at:

2 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Sep 19, 2020

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Rachit Khandelwal

Description

Dataset

This dataset was created by Rachit Khandelwal

Clear search

Close search

Google apps

Main menu

E-Commerce Customer Satisfaction Survey

Dataset

Contents

US Airline passenger satisfaction survey

Dataset

Contents

Airline Reviews Dataset

customer satisfaction prediction

Dataset

Contents

Customer Satisfaction

Dataset

Contents

Airlines Customer satisfaction

Dataset

Contents

Customer360Insights

Customer360Insights

Dataset Description

Types of Analysis

US Airlines Twitter (Over time)

US Airlines Twitter (Over time)

Study the trend customer satisfaction over time

About this dataset

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

‘846 Companies Ranked! (2021)’ analyzed by Analyst-2

‘TourPackagePrediction’ analyzed by Analyst-2

Context

Objective

About the Data

Software Customer Service Satisfaction

Dataset

Contents

USA hotels dataset from booking

Use Cases:

British English Call Center Data for Telecom AI

Introduction

Speech Data

Topic Diversity

Transcription

Metadata

Customer_support_data

Feature-wise Explanation

On-Time Delivery

Data from: Santander Customer Satisfaction

Context

Content

Acknowledgements

Inspiration

oyo-reviews-dataset

Data from: Bank Customer Churn Prediction

Food Delivery Data

Airline Passenger Reviews

E-Commerce Customer Satisfaction Survey

Dataset

Contents