100+ datasets found
  1. Market Basket Analysis

    • kaggle.com
    Updated Dec 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 9, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aslan Ahmedov
    Description

    Market Basket Analysis

    Market basket analysis with Apriori algorithm

    The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

    Introduction

    Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

    An Example of Association Rules

    Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

    Strategy

    • Data Import
    • Data Understanding and Exploration
    • Transformation of the data – so that is ready to be consumed by the association rules algorithm
    • Running association rules
    • Exploring the rules generated
    • Filtering the generated rules
    • Visualization of Rule

    Dataset Description

    • File name: Assignment-1_Data
    • List name: retaildata
    • File format: . xlsx
    • Number of Row: 522065
    • Number of Attributes: 7

      • BillNo: 6-digit number assigned to each transaction. Nominal.
      • Itemname: Product name. Nominal.
      • Quantity: The quantities of each product per transaction. Numeric.
      • Date: The day and time when each transaction was generated. Numeric.
      • Price: Product price. Numeric.
      • CustomerID: 5-digit number assigned to each customer. Nominal.
      • Country: Name of the country where each customer resides. Nominal.

    imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

    Libraries in R

    First, we need to load required libraries. Shortly I describe all libraries.

    • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
    • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
    • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
    • readxl - Read Excel Files in R.
    • plyr - Tools for Splitting, Applying and Combining Data.
    • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
    • knitr - Dynamic Report generation in R.
    • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
    • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
    • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

    imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

    Data Pre-processing

    Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

    imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

    After we will clear our data frame, will remove missing values.

    imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

    To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

  2. d

    Consumer Marketing Data | Comprehensive Data of Consumer Marketing Insights...

    • datarade.ai
    .csv, .xls, .txt
    Updated Nov 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VisitIQ™ (2024). Consumer Marketing Data | Comprehensive Data of Consumer Marketing Insights | Database & Dataset [Dataset]. https://datarade.ai/data-products/visitiq-consumer-marketing-data-comprehensive-data-of-co-visitiq
    Explore at:
    .csv, .xls, .txtAvailable download formats
    Dataset updated
    Nov 15, 2024
    Dataset authored and provided by
    VisitIQ™
    Area covered
    United States of America
    Description

    At VisitIQ™, we provide a wealth of consumer marketing data to help businesses unlock deeper insights and optimize their B2C strategies. Our extensive and meticulously curated datasets are designed to provide a 360-degree view of your target consumers, combining a wide range of behavioral, demographic, and psychographic data points to deliver actionable insights that drive measurable results.

    Our comprehensive consumer marketing database is built to fuel data-driven marketing strategies. With our rich behavioral insights, you can understand not just who your customers are, but also how they interact with your brand, what they are looking for, and what motivates their purchasing decisions. By tracking online and offline behaviors, preferences, purchase history, and engagement patterns, VisitIQ™ enables you to segment your audience more effectively and craft personalized marketing messages that resonate with your ideal customer profiles.

    In addition to behavioral insights, our datasets provide detailed demographic information, including age, gender, location, income level, education, and household characteristics. This allows you to pinpoint your marketing efforts with incredible precision, reaching the right audience with the right message at the right time. Our data also includes psychographic attributes, such as lifestyle preferences, interests, and values, providing a deeper understanding of what drives consumer behavior and helping you create more compelling and relevant content.

    VisitIQ's™ platform integrates seamlessly with your existing marketing stack, enabling you to utilize our consumer marketing data across multiple channels, from digital and social media to email and direct mail. With our data, you can improve targeting, increase engagement, reduce customer acquisition costs, and ultimately achieve a higher return on your marketing investment.

    Whether you’re looking to attract new customers, retain existing ones, or re-engage lapsed consumers, VisitIQ™ provides the data you need to build effective, data-driven B2C marketing strategies. Our comprehensive datasets empower you to make informed decisions, optimize your marketing campaigns in real-time, and drive successful outcomes.

    Unlock the full potential of your consumer marketing efforts with VisitIQ™. Transform your approach with powerful insights, sharpen your competitive edge, and achieve unparalleled marketing success.

  3. Customer360Insights

    • kaggle.com
    Updated Jun 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dave Darshan (2024). Customer360Insights [Dataset]. https://www.kaggle.com/datasets/davedarshan/customer360insights
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 9, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dave Darshan
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Customer360Insights

    The Customer360Insights dataset is a synthetic collection meticulously designed to mirror the multifaceted nature of customer interactions within an e-commerce platform. It encompasses a wide array of variables, each serving as a pillar to support various analytical explorations. Here’s a breakdown of the dataset and the potential analyses it enables:

    Dataset Description

    • Customer Demographics: Includes FullName, Gender, Age, CreditScore, and MonthlyIncome. These variables provide a demographic snapshot of the customer base, allowing for segmentation and targeted marketing analysis.
    • Geographical Data: Comprising Country, State, and City, this section facilitates location-based analytics, market penetration studies, and regional sales performance.
    • Product Information: Details like Category, Product, Cost, and Price enable product trend analysis, profitability assessment, and inventory optimization.
    • Transactional Data: Captures the customer journey through SessionStart, CartAdditionTime, OrderConfirmation, OrderConfirmationTime, PaymentMethod, and SessionEnd. This rich temporal data can be used for funnel analysis, conversion rate optimization, and customer behavior modeling.
    • Post-Purchase Details: With OrderReturn and ReturnReason, analysts can delve into return rate calculations, post-purchase satisfaction, and quality control.

    Types of Analysis

    • Descriptive Analytics: Understand basic metrics like average monthly income, most common product categories, and typical credit scores.
    • Predictive Analytics: Use machine learning to predict credit risk or the likelihood of a purchase based on demographics and session activity.
    • Customer Segmentation: Group customers by demographics or purchasing behavior to tailor marketing strategies.
    • Geospatial Analysis: Examine sales distribution across different regions and optimize logistics. Time Series Analysis: Study the seasonality of purchases and session activities over time.
    • Funnel Analysis: Evaluate the customer journey from session start to order confirmation and identify drop-off points.
    • Cohort Analysis: Track customer cohorts over time to understand retention and repeat purchase patterns.
    • Market Basket Analysis: Discover product affinities and develop cross-selling strategies.

    This dataset is a playground for data enthusiasts to practice cleaning, transforming, visualizing, and modeling data. Whether you’re conducting A/B testing for marketing campaigns, forecasting sales, or building customer profiles, Customer360Insights offers a rich, realistic dataset for honing your data science skills.

    Curious about how I created the data? Feel free to click here and take a peek! 😉

    📊🔍 Good Luck and Happy Analysing 🔍📊

  4. c

    Bank Marketing Set Dataset

    • cubig.ai
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Bank Marketing Set Dataset [Dataset]. https://cubig.ai/store/products/534/bank-marketing-set-dataset
    Explore at:
    Dataset updated
    Jun 30, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Bank Marketing Data Set contains actual customer response data collected from a telephone marketing campaign conducted by a Portuguese bank. It provides various variables including basic customer information, loan status, communication method and frequency, results of previous campaigns, and economic indicators, all of which can be used to predict a customer’s intention to subscribe to a financial product.

    2) Data Utilization (1) Characteristics of the Bank Marketing Data Set: • This dataset comprehensively reflects various customer attributes, marketing campaign details, and economic factors, making it highly applicable for tasks such as financial product recommendation, marketing targeting, and customer behavior analysis.

    (2) Applications of the Bank Marketing Data Set: • Marketing Performance Prediction: By leveraging multiple input variables, this dataset can be used to develop machine learning models that predict whether a customer will subscribe to a financial product (term deposit). • Customer Segmentation and Strategy Development: Prediction results can be used to establish customized marketing strategies for each customer or to select effective call lists.

  5. Retail Transactions Dataset

    • kaggle.com
    Updated May 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prasad Patil (2024). Retail Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/prasad22/retail-transactions-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Prasad Patil
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset was created to simulate a market basket dataset, providing insights into customer purchasing behavior and store operations. The dataset facilitates market basket analysis, customer segmentation, and other retail analytics tasks. Here's more information about the context and inspiration behind this dataset:

    Context:

    Retail businesses, from supermarkets to convenience stores, are constantly seeking ways to better understand their customers and improve their operations. Market basket analysis, a technique used in retail analytics, explores customer purchase patterns to uncover associations between products, identify trends, and optimize pricing and promotions. Customer segmentation allows businesses to tailor their offerings to specific groups, enhancing the customer experience.

    Inspiration:

    The inspiration for this dataset comes from the need for accessible and customizable market basket datasets. While real-world retail data is sensitive and often restricted, synthetic datasets offer a safe and versatile alternative. Researchers, data scientists, and analysts can use this dataset to develop and test algorithms, models, and analytical tools.

    Dataset Information:

    The columns provide information about the transactions, customers, products, and purchasing behavior, making the dataset suitable for various analyses, including market basket analysis and customer segmentation. Here's a brief explanation of each column in the Dataset:

    • Transaction_ID: A unique identifier for each transaction, represented as a 10-digit number. This column is used to uniquely identify each purchase.
    • Date: The date and time when the transaction occurred. It records the timestamp of each purchase.
    • Customer_Name: The name of the customer who made the purchase. It provides information about the customer's identity.
    • Product: A list of products purchased in the transaction. It includes the names of the products bought.
    • Total_Items: The total number of items purchased in the transaction. It represents the quantity of products bought.
    • Total_Cost: The total cost of the purchase, in currency. It represents the financial value of the transaction.
    • Payment_Method: The method used for payment in the transaction, such as credit card, debit card, cash, or mobile payment.
    • City: The city where the purchase took place. It indicates the location of the transaction.
    • Store_Type: The type of store where the purchase was made, such as a supermarket, convenience store, department store, etc.
    • Discount_Applied: A binary indicator (True/False) representing whether a discount was applied to the transaction.
    • Customer_Category: A category representing the customer's background or age group.
    • Season: The season in which the purchase occurred, such as spring, summer, fall, or winter.
    • Promotion: The type of promotion applied to the transaction, such as "None," "BOGO (Buy One Get One)," or "Discount on Selected Items."

    Use Cases:

    • Market Basket Analysis: Discover associations between products and uncover buying patterns.
    • Customer Segmentation: Group customers based on purchasing behavior.
    • Pricing Optimization: Optimize pricing strategies and identify opportunities for discounts and promotions.
    • Retail Analytics: Analyze store performance and customer trends.

    Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.

  6. m

    B2B Marketing Dataset

    • data.mendeley.com
    Updated Jan 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Olkan Budak (2024). B2B Marketing Dataset [Dataset]. http://doi.org/10.17632/9h5fjdxhrm.1
    Explore at:
    Dataset updated
    Jan 2, 2024
    Authors
    Olkan Budak
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The authors applied a survey to 1049 sales representatives selected using the random sampling method, working full-time in the field of B2B marketing in different sectors, which are considered boundary-spanning personnel in the literature, working in different provinces of Turkey. The scales used in the study were used in previous studies and their reliability and validity were tested and they were found to be within the acceptable range. Analysis of the data of the study was carried out using the AMOS statistical analysis program.

  7. h

    marketing_social_media

    • huggingface.co
    Updated Aug 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafael Montanez (2024). marketing_social_media [Dataset]. https://huggingface.co/datasets/RafaM97/marketing_social_media
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 22, 2024
    Authors
    Rafael Montanez
    Description

    Marketing Campaigns Dataset

    This repository contains a dataset specifically designed for generating marketing content. The dataset includes various features that are crucial for crafting effective marketing strategies, such as industry, channel, objective, and more. This dataset is ideal for use in machine learning models, AI-powered marketing tools, and data-driven marketing analyses.

      Dataset Overview
    

    The dataset consists of multiple entries, each representing a specific… See the full description on the dataset page: https://huggingface.co/datasets/RafaM97/marketing_social_media.

  8. h

    banking-dataset-marketing-targets

    • huggingface.co
    Updated Apr 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bowen Li (2024). banking-dataset-marketing-targets [Dataset]. https://huggingface.co/datasets/aisuko/banking-dataset-marketing-targets
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 30, 2024
    Authors
    Bowen Li
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Card for Dataset Name

    This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information Needed] License: [More Information Needed]

      Dataset Sources [optional]
    

    Repository:… See the full description on the dataset page: https://huggingface.co/datasets/aisuko/banking-dataset-marketing-targets.

  9. P

    Sentiment Analysis for Social Media Monitoring Dataset

    • paperswithcode.com
    Updated Mar 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Sentiment Analysis for Social Media Monitoring Dataset [Dataset]. https://paperswithcode.com/dataset/sentiment-analysis-for-social-media
    Explore at:
    Dataset updated
    Mar 6, 2025
    Description

    Problem Statement

    👉 Download the case studies here

    A global consumer goods company struggled to understand customer sentiment across various social media platforms. With millions of posts, reviews, and comments generated daily, manually tracking and analyzing public opinion was inefficient. The company needed an automated solution to monitor brand perception, address negative feedback promptly, and leverage insights for marketing strategies.

    Challenge

    Analyzing social media sentiment posed the following challenges:

    Processing vast amounts of unstructured text data from multiple platforms like Twitter, Facebook, and Instagram.

    Accurately interpreting slang, emojis, and nuanced language used by social media users.

    Identifying trends and actionable insights in real-time to respond to potential crises or opportunities effectively.

    Solution Provided

    An advanced sentiment analysis system was developed using Natural Language Processing (NLP) and sentiment analysis algorithms. The solution was designed to:

    Classify social media posts into positive, negative, and neutral sentiments.

    Extract key topics and trends related to the brand and its products.

    Provide real-time dashboards for monitoring customer sentiment and identifying areas of improvement.

    Development Steps

    Data Collection

    Aggregated data from major social media platforms using APIs, focusing on brand mentions, hashtags, and product keywords.

    Preprocessing

    Cleaned and normalized text data, including handling slang, emojis, and misspellings, to prepare it for analysis.

    Model Training

    Trained NLP models for sentiment classification using supervised learning. Implemented topic modeling algorithms to identify recurring themes and discussions.

    Validation

    Tested the sentiment analysis models on labeled datasets to ensure high accuracy and relevance in classifying social media posts.

    Deployment

    Integrated the sentiment analysis system with a real-time analytics dashboard, enabling the marketing and customer support teams to track trends and respond proactively.

    Monitoring & Improvement

    Established a continuous feedback mechanism to refine models based on evolving language patterns and new social media trends.

    Results

    Gained Actionable Insights

    The system provided detailed insights into customer opinions, helping the company identify strengths and areas for improvement.

    Improved Brand Reputation Management

    Real-time monitoring enabled swift responses to negative feedback, mitigating potential reputation risks.

    Informed Marketing Strategies

    Insights from sentiment analysis guided targeted marketing campaigns, resulting in higher engagement and ROI.

    Enhanced Customer Relationships

    Proactive engagement with customers based on sentiment analysis improved customer satisfaction and loyalty.

    Scalable Monitoring Solution

    The system scaled efficiently to analyze data across multiple languages and platforms, broadening the company’s reach and understanding.

  10. t

    Bank Marketing Dataset (UCI) - Test Upload

    • invenio01-demo.tugraz.at
    zip
    Updated Apr 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    S. Moro; P. Rita; P. Cortez; S. Moro; P. Rita; P. Cortez (2025). Bank Marketing Dataset (UCI) - Test Upload [Dataset]. http://doi.org/10.24432/c5k306
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 8, 2025
    Dataset provided by
    UCI Machine Learning Repository
    Authors
    S. Moro; P. Rita; P. Cortez; S. Moro; P. Rita; P. Cortez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is related to direct marketing campaigns conducted by a Portuguese banking institution, with campaigns relying on phone calls. Often multiple contacts with the same client were necessary to determine whether they would subscribe ('yes') or not ('no') to a bank term deposit. The dataset includes four files:

    1. bank-additional-full.csv: Contains all 41,188 examples with 20 input features, organized chronologically from May 2008 to November 2010, closely aligned with the data analyzed in [Moro et al., 2014].
    2. bank-additional.csv: A subset of 4,119 examples (10% of the full data), randomly selected, with 20 input features.
    3. bank-full.csv: The older version of the dataset, comprising all examples (41,188) with 17 input features, also organized chronologically.
    4. bank.csv: A 10% random subset of the older version, containing 4,119 examples and 17 input features.

    The smaller subsets are designed for testing computationally intensive machine learning algorithms (e.g., SVM). The primary classification objective is to predict whether a client will subscribe to a term deposit ('yes' or 'no'), based on the target variable y.

  11. Bank Marketing

    • kaggle.com
    Updated Jul 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdelaziz Sami (2024). Bank Marketing [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/bank-marketing
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 14, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Abdelaziz Sami
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Information

    The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was required, in order to access if the product (bank term deposit) would be ('yes') or not ('no') subscribed.

    There are four datasets: 1) bank-additional-full.csv with all examples (41188) and 20 inputs, ordered by date (from May 2008 to November 2010), very close to the data analyzed in [Moro et al., 2014] 2) bank-additional.csv with 10% of the examples (4119), randomly selected from 1), and 20 inputs. 3) bank-full.csv with all examples and 17 inputs, ordered by date (older version of this dataset with less inputs). 4) bank.csv with 10% of the examples and 17 inputs, randomly selected from 3 (older version of this dataset with less inputs). The smallest datasets are provided to test more computationally demanding machine learning algorithms (e.g., SVM).

    The classification goal is to predict if the client will subscribe (yes/no) a term deposit (variable y).

    Dataset Characteristics Multivariate

    Associated Tasks Classification

  12. c

    Easy Analysis Of Company's Ideal Customers Dataset

    • cubig.ai
    Updated Jun 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Easy Analysis Of Company's Ideal Customers Dataset [Dataset]. https://cubig.ai/store/products/505/easy-analysis-of-companys-ideal-customers-dataset
    Explore at:
    Dataset updated
    Jun 22, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Easy Analysis Of Company's Ideal Customers Dataset is a structured dataset designed to identify ideal customer segments and support the development of effective marketing strategies based on customer demographics, purchasing patterns, and campaign responses. It includes a wide range of features such as age, income, family composition, product spending, and discount usage, with a focus on the response variable indicating whether the customer responded to the last marketing campaign.

    2) Data Utilization (1) Characteristics of the Easy Analysis Of Company's Ideal Customers Dataset: • The dataset includes diverse features useful for customer segmentation, such as education level, marital status, annual income, number of children, and marketing campaign participation history. The response field serves as a binary classification label indicating whether the customer responded to the final campaign.

    (2) Applications of the Easy Analysis Of Company's Ideal Customers Dataset: • Marketing campaign response prediction: This dataset can be used to train machine learning classification models to predict the likelihood of a customer responding to a marketing campaign. • Customer segmentation and strategic planning: By identifying customer segments with high response potential, the dataset can support targeted marketing, personalized promotion design, and customer retention strategies.

  13. P

    aeo digital marketing Dataset

    • paperswithcode.com
    Updated Jun 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). aeo digital marketing Dataset [Dataset]. https://paperswithcode.com/dataset/aeo-digital-marketing
    Explore at:
    Dataset updated
    Jun 22, 2025
    Description
  14. f

    SPSS Dataset

    • figshare.com
    bin
    Updated Nov 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bankole temitope (2023). SPSS Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.23118368.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Nov 1, 2023
    Dataset provided by
    figshare
    Authors
    bankole temitope
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Quantitative analysis of adolescent exposure to fast food marketing on Instagram. Descriptive statistics were calculated and the total frequency of each marketing strategy was obtained. For the continuous variables mean and standard deviation values were obtained. Mann-Whitney U tests were conducted to examine the association between the marketing strategies and user engagement, while the Kruskal-Wallis H test was completed to test for associations between brand name and engagement.

  15. h

    open-email-marketing-dataset

    • huggingface.co
    Updated Jul 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luther Johnson (2025). open-email-marketing-dataset [Dataset]. https://huggingface.co/datasets/emailmarketingdataset/open-email-marketing-dataset
    Explore at:
    Dataset updated
    Jul 12, 2025
    Authors
    Luther Johnson
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Open Email Marketing Dataset

    This repository contains the Open Email Marketing Dataset, a collection of 1,000 question-and-answer pairs in JSONL format. This dataset is created and maintained by LeadsBlue.com to provide a high-quality, public resource for developers, researchers, and marketers. It is specifically designed for tasks such as fine-tuning Large Language Models (LLMs), building advanced Q&A engines, developing cold email tools, and enhancing SEO systems.… See the full description on the dataset page: https://huggingface.co/datasets/emailmarketingdataset/open-email-marketing-dataset.

  16. h

    mmlu-marketing-verbal-neg-prepend

    • huggingface.co
    Updated Apr 28, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mmlu-marketing-verbal-neg-prepend [Dataset]. https://huggingface.co/datasets/joey234/mmlu-marketing-verbal-neg-prepend
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 28, 2023
    Authors
    Thinh Truong
    Description

    Dataset Card for "mmlu-marketing-verbal-neg-prepend"

    More Information needed

  17. Secondary Market Activities - Datasets - Government of the Republic of...

    • data.gov.tt
    Updated Sep 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.gov.tt (2023). Secondary Market Activities - Datasets - Government of the Republic of Trinidad and Tobago Open Data Platform [Dataset]. https://data.gov.tt/dataset/secondary-market-activities
    Explore at:
    Dataset updated
    Sep 28, 2023
    Dataset provided by
    Data.govhttps://data.gov/
    Area covered
    Trinidad and Tobago
    Description

    This dataset provides an overview of the secondary market activities, expressed in millions of Trinidad and Tobago dollars (TT$).

  18. D

    Category characteristics for IRI Marketing Science Dataset used in: How well...

    • dataverse.nl
    • test.dataverse.nl
    Updated May 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hannes Datta; Kusum Ailawadi; Harald J. Van Heerde; Hannes Datta; Kusum Ailawadi; Harald J. Van Heerde (2018). Category characteristics for IRI Marketing Science Dataset used in: How well does consumer-based brand equity align with sales-based brand equity and marketing-mix response?, Journal of Marketing (2017) [Dataset]. http://doi.org/10.34894/ELVF0J
    Explore at:
    csv(1729310), application/x-spss-sav(492243), csv(4506), pdf(101571), type/x-r-syntax(5269), docx(27176), txt(4581)Available download formats
    Dataset updated
    May 2, 2018
    Dataset provided by
    DataverseNL
    Authors
    Hannes Datta; Kusum Ailawadi; Harald J. Van Heerde; Hannes Datta; Kusum Ailawadi; Harald J. Van Heerde
    License

    https://dataverse.nl/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34894/ELVF0Jhttps://dataverse.nl/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34894/ELVF0J

    Description

    Category characteristics survey for the IRI Marketing Science dataset (Bronnenberg, Kruger, and Mela 2008), collected by Datta, Ailawadi, and Van Heerde (2017) on Amazon Mechanical Turk (May 2016) to measure the following category characteristics: (1) Hedonic nature of category, (2) Functional / performance risk of category, (3) Social value / social demonstrance of category, (4) Category involvement, (5) Utilitarian nature of category. For details on data collection and constructs, see codebook (PDF/docx).

  19. Instagram Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Apr 26, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2022). Instagram Dataset [Dataset]. https://brightdata.com/products/datasets/instagram
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Apr 26, 2022
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Use our Instagram dataset (public data) to extract business and non-business information from complete public profiles and filter by hashtags, followers, account type, or engagement score. Depending on your needs, you may purchase the entire dataset or a customized subset. Popular use cases include sentiment analysis, brand monitoring, influencer marketing, and more. The dataset includes all major data points: # of followers, verified status, account type (business / non-business), links, posts, comments, location, engagement score, hashtags, and much more.

  20. Spotify Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data, Spotify Dataset [Dataset]. https://brightdata.com/products/datasets/spotify
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Gain valuable insights into music trends, artist popularity, and streaming analytics with our comprehensive Spotify Dataset. Designed for music analysts, marketers, and businesses, this dataset provides structured and reliable data from Spotify to enhance market research, content strategy, and audience engagement.

    Dataset Features

    Track Information: Access detailed data on songs, including track name, artist, album, genre, and release date. Streaming Popularity: Extract track popularity scores, listener engagement metrics, and ranking trends. Artist & Album Insights: Analyze artist performance, album releases, and genre trends over time. Related Searches & Recommendations: Track related search terms and suggested content for deeper audience insights. Historical & Real-Time Data: Retrieve historical streaming data or access continuously updated records for real-time trend analysis.

    Customizable Subsets for Specific Needs Our Spotify Dataset is fully customizable, allowing you to filter data based on track popularity, artist, genre, release date, or listener engagement. Whether you need broad coverage for industry analysis or focused data for content optimization, we tailor the dataset to your needs.

    Popular Use Cases

    Market Analysis & Trend Forecasting: Identify emerging music trends, genre popularity, and listener preferences. Artist & Label Performance Tracking: Monitor artist rankings, album success, and audience engagement. Competitive Intelligence: Analyze competitor music strategies, playlist placements, and streaming performance. AI & Machine Learning Applications: Use structured music data to train AI models for recommendation engines, playlist curation, and predictive analytics. Advertising & Sponsorship Insights: Identify high-performing tracks and artists for targeted advertising and sponsorship opportunities.

    Whether you're optimizing music marketing, analyzing streaming trends, or enhancing content strategies, our Spotify Dataset provides the structured data you need. Get started today and customize your dataset to fit your business objectives.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
Organization logo

Market Basket Analysis

Analyzing Consumer Behaviour Using MBA Association Rule Mining

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 9, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aslan Ahmedov
Description

Market Basket Analysis

Market basket analysis with Apriori algorithm

The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

Introduction

Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

An Example of Association Rules

Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

Strategy

  • Data Import
  • Data Understanding and Exploration
  • Transformation of the data – so that is ready to be consumed by the association rules algorithm
  • Running association rules
  • Exploring the rules generated
  • Filtering the generated rules
  • Visualization of Rule

Dataset Description

  • File name: Assignment-1_Data
  • List name: retaildata
  • File format: . xlsx
  • Number of Row: 522065
  • Number of Attributes: 7

    • BillNo: 6-digit number assigned to each transaction. Nominal.
    • Itemname: Product name. Nominal.
    • Quantity: The quantities of each product per transaction. Numeric.
    • Date: The day and time when each transaction was generated. Numeric.
    • Price: Product price. Numeric.
    • CustomerID: 5-digit number assigned to each customer. Nominal.
    • Country: Name of the country where each customer resides. Nominal.

imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

Libraries in R

First, we need to load required libraries. Shortly I describe all libraries.

  • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
  • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
  • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
  • readxl - Read Excel Files in R.
  • plyr - Tools for Splitting, Applying and Combining Data.
  • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
  • knitr - Dynamic Report generation in R.
  • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
  • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
  • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

Data Pre-processing

Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

After we will clear our data frame, will remove missing values.

imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

Search
Clear search
Close search
Google apps
Main menu