8 datasets found
  1. Market Basket Analysis

    • kaggle.com
    Updated Dec 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 9, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aslan Ahmedov
    Description

    Market Basket Analysis

    Market basket analysis with Apriori algorithm

    The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

    Introduction

    Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

    An Example of Association Rules

    Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

    Strategy

    • Data Import
    • Data Understanding and Exploration
    • Transformation of the data – so that is ready to be consumed by the association rules algorithm
    • Running association rules
    • Exploring the rules generated
    • Filtering the generated rules
    • Visualization of Rule

    Dataset Description

    • File name: Assignment-1_Data
    • List name: retaildata
    • File format: . xlsx
    • Number of Row: 522065
    • Number of Attributes: 7

      • BillNo: 6-digit number assigned to each transaction. Nominal.
      • Itemname: Product name. Nominal.
      • Quantity: The quantities of each product per transaction. Numeric.
      • Date: The day and time when each transaction was generated. Numeric.
      • Price: Product price. Numeric.
      • CustomerID: 5-digit number assigned to each customer. Nominal.
      • Country: Name of the country where each customer resides. Nominal.

    imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

    Libraries in R

    First, we need to load required libraries. Shortly I describe all libraries.

    • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
    • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
    • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
    • readxl - Read Excel Files in R.
    • plyr - Tools for Splitting, Applying and Combining Data.
    • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
    • knitr - Dynamic Report generation in R.
    • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
    • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
    • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

    imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

    Data Pre-processing

    Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

    imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

    After we will clear our data frame, will remove missing values.

    imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

    To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

  2. Z

    Clickstream Analytics Market By Deployment mode (Cloud and On-premise), By...

    • zionmarketresearch.com
    pdf
    Updated Jul 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zion Market Research (2025). Clickstream Analytics Market By Deployment mode (Cloud and On-premise), By Component (Services and Software), By Application (Traffic Analysis, Click Path Optimization, Basket Analysis & Personalization, Customer Analysis, Website/Application Optimization, And Others), By Industry Vertical (BFSI, Transportation & Logistics, Media & Entertainment, Energy & Utilities, Government, Travel & Hospitality, Telecommunications & IT, And Other Industry Verticals), And By Region: - Global and Regional Industry Overview, Market Intelligence, Comprehensive Analysis, Historical Data, and Forecasts, 2023-2030 [Dataset]. https://www.zionmarketresearch.com/report/clickstream-analytics-market
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 22, 2025
    Dataset authored and provided by
    Zion Market Research
    License

    https://www.zionmarketresearch.com/privacy-policyhttps://www.zionmarketresearch.com/privacy-policy

    Time period covered
    2022 - 2030
    Area covered
    Global
    Description

    The global Clickstream Analytics Market was valued at $615.37 Million in 2022, and is projected to $1,298.63 Million by 2030, growing at a CAGR of 11.26%.

  3. m

    Geomarketing Market Size, Share & Trends Analysis 2033

    • marketresearchintellect.com
    Updated Jun 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Geomarketing Market Size, Share & Trends Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-geomarketing-market-size-forecast/
    Explore at:
    Dataset updated
    Jun 2, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    The size and share of this market is categorized based on Location-based Marketing (Proximity Marketing, Geofencing, Location Analytics, Geotargeting, Mobile Advertising) and Data Analytics (Predictive Analytics, Customer Segmentation, Market Basket Analysis, Trend Analysis, Spatial Analysis) and Software Solutions (GIS Software, Mapping Software, Data Visualization Tools, CRM Integration, Business Intelligence Platforms) and geographical regions (North America, Europe, Asia-Pacific, South America, Middle-East and Africa).

  4. w

    Global Data Mining Tool Market Research Report: By Deployment Mode...

    • wiseguyreports.com
    Updated Jan 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wWiseguy Research Consultants Pvt Ltd (2025). Global Data Mining Tool Market Research Report: By Deployment Mode (On-Premises, Cloud-Based, Hybrid), By Application (Fraud Detection, Customer Segmentation, Market Basket Analysis, Risk Management, Predictive Maintenance), By End User (BFSI, Healthcare, Retail, Telecommunications, Manufacturing), By Data Type (Structured Data, Unstructured Data, Semi-structured Data) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2032. [Dataset]. https://www.wiseguyreports.com/de/reports/data-mining-tool-market
    Explore at:
    Dataset updated
    Jan 3, 2025
    Dataset authored and provided by
    wWiseguy Research Consultants Pvt Ltd
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2024
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20238.36(USD Billion)
    MARKET SIZE 20249.25(USD Billion)
    MARKET SIZE 203220.74(USD Billion)
    SEGMENTS COVEREDDeployment Mode, Application, End User, Data Type, Regional
    COUNTRIES COVEREDNorth America, Europe, APAC, South America, MEA
    KEY MARKET DYNAMICSGrowing demand for big data analytics, Increasing adoption of AI technologies, Rising importance of customer insights, Expanding applications across industries, Enhanced data privacy regulations
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDSAS Institute, Domo, RapidMiner, Microsoft, IBM, DataRobot, TIBCO Software, Oracle, H2O.ai, Sisense, Alteryx, SAP, Tableau, Qlik, Teradata
    MARKET FORECAST PERIOD2025 - 2032
    KEY MARKET OPPORTUNITIESIncreased demand for data analytics, Growth in AI and machine learning, Rising need for big data processing, Cloud-based data mining solutions, Expanding applications across industries
    COMPOUND ANNUAL GROWTH RATE (CAGR) 10.63% (2025 - 2032)
  5. Admission Management Software Market Size & Competitive Landscape 2030

    • mordorintelligence.com
    pdf,excel,csv,ppt
    Updated Jun 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mordor Intelligence (2025). Admission Management Software Market Size & Competitive Landscape 2030 [Dataset]. https://www.mordorintelligence.com/industry-reports/admission-management-software-market
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Jun 22, 2025
    Dataset authored and provided by
    Mordor Intelligence
    License

    https://www.mordorintelligence.com/privacy-policyhttps://www.mordorintelligence.com/privacy-policy

    Time period covered
    2019 - 2030
    Area covered
    Global
    Description

    Admission Management Market is Segmented by Component (Software, Services), Deployment Mode (Cloud, On-Premises), Application (Click Path and Website Optimization, Basket Analysis and Personalization and More), Industry Vertical (Retail and E-Commerce, Media and Entertainment and More), Organization Size (Large Enterprises and SMEs), and Geography. The Market Forecasts are Provided in Terms of Value (USD).

  6. P

    Price Comparison Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Price Comparison Software Report [Dataset]. https://www.datainsightsmarket.com/reports/price-comparison-software-1455891
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Jun 25, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The price comparison software market is experiencing robust growth, driven by increasing e-commerce adoption, heightened consumer demand for value, and the proliferation of mobile devices. The market's expansion is fueled by several key factors. Firstly, the ever-increasing number of online retailers and product variations necessitates efficient tools for consumers to compare prices and features. Secondly, the rising popularity of mobile shopping apps has created a significant demand for user-friendly, mobile-optimized price comparison platforms. Thirdly, the strategic partnerships between price comparison engines and retailers further enhance market penetration and user engagement. Competition is fierce, with established players like Google and innovative startups vying for market share. However, the market presents ample opportunities for companies that can provide unique features, personalized experiences, and advanced functionalities, such as real-time price tracking and product review aggregation. We project a steady CAGR of 15% over the forecast period (2025-2033), with significant regional variations driven by varying levels of internet penetration and e-commerce maturity. While challenges exist, such as data accuracy concerns and the need to adapt to evolving consumer preferences and technological advancements, the long-term outlook for price comparison software remains positive. The market segmentation reveals a diversified landscape. Different software solutions cater to specific consumer needs, from simple price comparisons to sophisticated tools integrating features like product reviews, coupon integration, and loyalty program management. Key players are focusing on enhancing their user interfaces, improving data accuracy through advanced algorithms, and expanding their product offerings to consolidate their market positions. Geographic segmentation reveals higher penetration in developed economies like North America and Europe, while emerging markets in Asia and Latin America represent significant growth potential. Future growth will likely depend on innovative approaches that seamlessly integrate price comparison into the overall shopping experience, leverage artificial intelligence for personalized recommendations, and address evolving consumer privacy concerns.

  7. C

    Clickstream Analytics Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Feb 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Clickstream Analytics Report [Dataset]. https://www.datainsightsmarket.com/reports/clickstream-analytics-1420179
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Feb 16, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Clickstream analytics market is projected to grow from USD 948 million in 2025 to USD 2,546 million by 2033, at a CAGR of 12.8%. The growth of the market is attributed to the increasing adoption of digital marketing and advertising, the need for better customer experience, and the growing volume of data generated by online activities. The market is segmented by application, type, and region. By application, the market is divided into click path optimization, website/application optimization, customer analysis, basket analysis and personalization, traffic analysis, and others. By type, the market is segmented into software and services. By region, the market is divided into North America, South America, Europe, Middle East & Africa, and Asia Pacific. Explore our Comprehensive Clickstream Analytics Report, offering unrivaled market insights and analysis worth over $500 Million. Our report provides a deep dive into the industry's dynamics, key players, trends, challenges, and growth opportunities.

  8. m

    Рынок розничного программного обеспечения Анализ размера, доли и тенденций...

    • marketresearchintellect.com
    Updated Jun 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Рынок розничного программного обеспечения Анализ размера, доли и тенденций 2033 г. [Dataset]. https://www.marketresearchintellect.com/ru/product/global-retail-software-market-size-forecast/
    Explore at:
    Dataset updated
    Jun 3, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/ru/privacy-policyhttps://www.marketresearchintellect.com/ru/privacy-policy

    Area covered
    Global
    Description

    Размер и доля сегментированы по Point of Sale (POS) Systems (Mobile POS, Cloud-based POS, Traditional POS, Self-service Kiosks, Contactless Payment Solutions) and Retail Management Software (Inventory Management, Customer Relationship Management (CRM), Supply Chain Management, E-commerce Platforms, Workforce Management) and Analytics and Reporting Tools (Sales Analytics, Customer Analytics, Inventory Analytics, Market Basket Analysis, Predictive Analytics) and Omni-channel Retailing Solutions (Unified Commerce Platforms, Order Management Systems, Customer Experience Management, In-store and Online Integration, Click and Collect Solutions) and E-commerce Solutions (Shopping Cart Software, Payment Gateway Solutions, Shipping and Fulfillment Solutions, Website Builders for E-commerce, Digital Marketing Tools) and регионам (Северная Америка, Европа, Азиатско-Тихоокеанский регион, Южная Америка, Ближний Восток и Африка)

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
Organization logo

Market Basket Analysis

Analyzing Consumer Behaviour Using MBA Association Rule Mining

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 9, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aslan Ahmedov
Description

Market Basket Analysis

Market basket analysis with Apriori algorithm

The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

Introduction

Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

An Example of Association Rules

Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

Strategy

  • Data Import
  • Data Understanding and Exploration
  • Transformation of the data – so that is ready to be consumed by the association rules algorithm
  • Running association rules
  • Exploring the rules generated
  • Filtering the generated rules
  • Visualization of Rule

Dataset Description

  • File name: Assignment-1_Data
  • List name: retaildata
  • File format: . xlsx
  • Number of Row: 522065
  • Number of Attributes: 7

    • BillNo: 6-digit number assigned to each transaction. Nominal.
    • Itemname: Product name. Nominal.
    • Quantity: The quantities of each product per transaction. Numeric.
    • Date: The day and time when each transaction was generated. Numeric.
    • Price: Product price. Numeric.
    • CustomerID: 5-digit number assigned to each customer. Nominal.
    • Country: Name of the country where each customer resides. Nominal.

imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

Libraries in R

First, we need to load required libraries. Shortly I describe all libraries.

  • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
  • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
  • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
  • readxl - Read Excel Files in R.
  • plyr - Tools for Splitting, Applying and Combining Data.
  • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
  • knitr - Dynamic Report generation in R.
  • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
  • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
  • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

Data Pre-processing

Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

After we will clear our data frame, will remove missing values.

imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

Search
Clear search
Close search
Google apps
Main menu