100+ datasets found
  1. Data from: Credit Card Transactions Dataset

    • kaggle.com
    zip
    Updated Jul 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priyam Choksi (2024). Credit Card Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/priyamchoksi/credit-card-transactions-dataset
    Explore at:
    zip(152554916 bytes)Available download formats
    Dataset updated
    Jul 23, 2024
    Authors
    Priyam Choksi
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Credit Card Transactions Dataset provides detailed records of credit card transactions, including information about transaction times, amounts, and associated personal and merchant details. This dataset has over 1.85M rows.

    How This Dataset Can Be Used:

    Fraud Detection : Use machine learning models to identify fraudulent transactions by examining patterns in transaction amounts, locations, and user profiles. Enhancing fraud detection systems becomes feasible by analyzing behavioral patterns.

    Customer Segmentation : Segment customers based on spending patterns, location, and demographics. Tailor marketing strategies and personalized offers to these different customer segments for better engagement.

    Transaction Classification : Classify transactions into categories such as grocery or entertainment to understand spending behaviors. This helps in improving recommendation systems by identifying transaction categories and preferences.

    Geospatial Analysis : Analyze transaction data geographically to map spending patterns and detect regional trends or anomalies based on latitude and longitude.

    Predictive Modeling : Build models to forecast future spending behavior using historical transaction data. Predict potential fraudulent activities and financial trends.

    Behavioral Analysis : Examine how factors like transaction amount, merchant type, and time influence spending behavior. Study the relationships between user demographics and transaction patterns.

    Anomaly Detection : Identify unusual transaction patterns that deviate from normal behavior to detect potential fraud early. Employ anomaly detection techniques to spot outliers and suspicious activities.

  2. Daily Transactions Dataset

    • kaggle.com
    zip
    Updated May 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prasad Patil (2024). Daily Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/prasad22/daily-transactions-dataset
    Explore at:
    zip(34903 bytes)Available download formats
    Dataset updated
    May 14, 2024
    Authors
    Prasad Patil
    Description

    The "Daily Transactions" dataset contains information on dummy transactions made by an individual on a daily basis. The dataset includes data on the products that were purchased, the amount spent on each product, the date and time of each transaction, the payment mode of each transaction, and the source of each record (Expense/Income).

    This dataset can be used to analyze purchasing behavior and money management, forecasting expenses, and optimizing savings and budgeting strategies. The dataset is well-suited for data analysis and machine learning applications,it can be used to train predictive models and make data-driven decisions.

    Column Descriptors

    • Date: The date and time when the transaction was made
    • Mode: The payment mode used for the transaction
    • Category: Each record is divided into a set of categories of transactions
    • Subcategory: Categories are further broken down into Subcategories of transactions
    • Note: A brief description of the transaction made
    • Amount: The transactional amount
    • Income/Expense: The indicator of each transaction representing either expense or income
    • Currency: All transactions are recorded in official currency of India
  3. Consumer Transaction Data | UK & FR | 600K+ daily active users | Airlines -...

    • datarade.ai
    .csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ExactOne, Consumer Transaction Data | UK & FR | 600K+ daily active users | Airlines - Regional / Budget | Raw, Aggregated & Ticker Level [Dataset]. https://datarade.ai/data-products/clearscore-dataset-individual-tickers-uk-consumer-transacti-clearscore
    Explore at:
    .csvAvailable download formats
    Dataset provided by
    Exactone
    Authors
    ExactOne
    Area covered
    United Kingdom
    Description

    ExactOne delivers unparalleled consumer transaction insights to help investors and corporate clients uncover market opportunities, analyze trends, and drive better decisions.

    Dataset Highlights - Source: Debit and credit card transactions from 600K+ active users and 2M accounts connected via Open Banking. Scale: Covers 250M+ annual transactions, mapped to 1,800+ merchants and 400+ tickers. Historical Depth: Over 6 years of transaction data. Flexibility: Analyse transactions by merchant/ticker, category/industry, or timeframe (daily, weekly, monthly, or quarterly).

    ExactOne data offers visibility into key consumer industries, including: Airlines - Regional / Budget Airlines - Cargo Airlines - Full Service Autos - OEMs Communication Services - Cable & Satellite Communication Services - Integrated Telecommunications Communication Services - Wireless Telecom Consumer - Services Consumer - Health & Fitness Consumer Staples - Household Supplies Energy - Utilities Energy - Integrated Oil & Gas Financial Services - Insurance Grocers - Traditional Hotels - C-corp Industrial - Misc Industrial - Tools And Hardware Internet - E-commerce Internet - B2B Services Internet - Ride Hailing & Delivery Leisure - Online Gambling Media - Digital Subscription Real Estate - Brokerage Restaurants - Quick Service Restaurants - Fast Casual Restaurants - Pubs Restaurants - Specialty Retail - Softlines Retail - Mass Merchants Retail - European Luxury Retail - Specialty Retail - Sports & Athletics Retail - Footwear Retail - Dept Stores Retail - Luxury Retail - Convenience Stores Retail - Hardlines Technology - Enterprise Software Technology - Electronics & Appliances Technology - Computer Hardware Utilities - Water Utilities

    Use Cases

    For Private Equity & Venture Capital Firms: - Deal Sourcing: Identify high-growth opportunities. - Due Diligence: Leverage transaction data to evaluate investment potential. - Portfolio Monitoring: Track performance post-investment with real-time data.

    For Consumer Insights & Strategy Teams: - Market Dynamics: Compare sales trends, average transaction size, and customer loyalty. - Competitive Analysis: Benchmark market share and identify emerging competitors. - E-commerce vs. Brick & Mortar Trends: Assess channel performance and strategic opportunities. - Demographic & Geographic Insights: Uncover growth drivers by demo and geo segments.

    For Investor Relations Teams: - Shareholder Insights: Monitor brand performance relative to competitors. - Real-Time Intelligence: Analyse sales and market dynamics for public and private companies. - M&A Opportunities: Evaluate market share and growth potential for strategic investments.

    Key Benefits of ExactOne - Understand Market Share: Benchmark against competitors and uncover emerging players. - Analyse Customer Loyalty: Evaluate repeat purchase behavior and retention rates. - Track Growth Trends: Identify key drivers of sales by geography, demographic, and channel. - Granular Insights: Drill into transaction-level data or aggregated summaries for in-depth analysis.

    With ExactOne, investors and corporate leaders gain actionable, real-time insights into consumer behaviour and market dynamics, enabling smarter decisions and sustained growth.

  4. Current account transactions - credits, debits and balance

    • data.europa.eu
    • db.nomics.world
    csv, html, tsv, xml
    Updated Dec 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eurostat (2024). Current account transactions - credits, debits and balance [Dataset]. https://data.europa.eu/data/datasets/fbijvdkpeugldn9czfskba?locale=en
    Explore at:
    csv(11635), tsv(6440), xml(10668), xml(9921), htmlAvailable download formats
    Dataset updated
    Dec 30, 2024
    Dataset authored and provided by
    Eurostathttps://ec.europa.eu/eurostat
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The balance of payments is a record of a country's international transactions with the rest of the world. It is composed of the current account and the capital and financial account. The current account is itself subdivided into goods, services, income and current transfers; it registers the value of exports (credits) and imports (debits). The difference between these two values is the "balance".

  5. Transaction Log Data for Analyzing the Abnormal behaviors in The Financial...

    • figshare.com
    application/cdfv2
    Updated Jul 26, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fang Lyu (2019). Transaction Log Data for Analyzing the Abnormal behaviors in The Financial Domain [Dataset]. http://doi.org/10.6084/m9.figshare.9108602.v1
    Explore at:
    application/cdfv2Available download formats
    Dataset updated
    Jul 26, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Fang Lyu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In recent years, we have been exploring computational models to classify bank accounts in combating illegal pyramid selling. The department of economic investigation provides us with plenty of transaction data of real bank accounts. An account contains a lot of transaction records, each of which includes bilateral transaction accounts, timestamp, amount of money and transaction direction, etc. We sample out the transaction records belonging to 10145 bank accounts to form out dataset for training our model. There are 9270 normal accounts and 875 accounts involving a MLM organization respectively. The number of transaction records generated by the normal accounts run up to 6732730 and the fraud records created by MLM members amount to 275804 rows. These MLM members are manually annotated as ``illegal'' by economic investigators. Before training the models, we filtered out some noisy data, i.e. deleting the duplicate records, incomplete records and the records whose transaction amounts no more than 50. Therefore, 1371914 records is filtered out from the set of normal accounts' transaction records and 91341 records created by illegal accounts are deleted. In general, more than 5 million transaction records are used after denoising.

  6. c

    Customer Transactions Dataset

    • cubig.ai
    zip
    Updated Jun 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Customer Transactions Dataset [Dataset]. https://cubig.ai/store/products/496/customer-transactions-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 22, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction • The Customer Transactions Dataset is an actual transaction-based customer analysis dataset that records 100,000 customer-specific transaction details (such as payment method, purchased product, amount, date, status, type, etc.) in a tabular format.

    2) Data Utilization (1) Customer Transactions Dataset has characteristics that: • Each row contains customer ID, payment method, purchased goods, transaction amount, transaction date, transaction status (success/failure, etc.), and transaction type (purchase/refund, etc.). • The data is organized appropriately for customer segmentation and behavioral analysis, such as customer-specific iterations, various payment methods, and product-specific purchase patterns. (2) Customer Transactions Dataset can be used to: • Customer Segmentation and Target Marketing: Use transaction patterns, payment methods, purchase history, etc. to define customer groups and use them to develop customized marketing strategies. • Purchase behavior and departure prediction: Based on data such as transaction type, amount, status, etc., it can be applied to customer purchase behavior analysis, departure risk prediction, and finding loyal customers.

  7. Online Retail Transaction Data

    • kaggle.com
    zip
    Updated Dec 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Online Retail Transaction Data [Dataset]. https://www.kaggle.com/datasets/thedevastator/online-retail-transaction-data
    Explore at:
    zip(9098240 bytes)Available download formats
    Dataset updated
    Dec 21, 2023
    Authors
    The Devastator
    Description

    Online Retail Transaction Data

    UK Online Retail Sales and Customer Transaction Data

    By UCI [source]

    About this dataset

    Comprehensive Dataset on Online Retail Sales and Customer Data

    Welcome to this comprehensive dataset offering a wide array of information related to online retail sales. This data set provides an in-depth look at transactions, product details, and customer information documented by an online retail company based in the UK. The scope of the data spans vastly, from granular details about each product sold to extensive customer data sets from different countries.

    This transnational data set is a treasure trove of vital business insights as it meticulously catalogues all the transactions that happened during its span. It houses rich transactional records curated by a renowned non-store online retail company based in the UK known for selling unique all-occasion gifts. A considerable portion of its clientele includes wholesalers; ergo, this dataset can prove instrumental for companies looking for patterns or studying purchasing trends among such businesses.

    The available attributes within this dataset offer valuable pieces of information:

    • InvoiceNo: This attribute refers to invoice numbers that are six-digit integral numbers uniquely assigned to every transaction logged in this system. Transactions marked with 'c' at the beginning signify cancellations - adding yet another dimension for purchase pattern analysis.

    • StockCode: Stock Code corresponds with specific items as they're represented within the inventory system via 5-digit integral numbers; these allow easy identification and distinction between products.

    • Description: This refers to product names, giving users qualitative knowledge about what kind of items are being bought and sold frequently.

    • Quantity: These figures ascertain the volume of each product per transaction – important figures that can help understand buying trends better.

    • InvoiceDate: Invoice Dates detail when each transaction was generated down to precise timestamps – invaluable when conducting time-based trend analysis or segmentation studies.

    • UnitPrice: Unit prices represent how much each unit retails at — crucial for revenue calculations or cost-related analyses.

    Finally,

    • Country: This locational attribute shows where each customer hails from, adding geographical segmentation to your data investigation toolkit.

    This dataset was originally collated by Dr Daqing Chen, Director of the Public Analytics group based at the School of Engineering, London South Bank University. His research studies and business cases with this dataset have been published in various papers contributing to establishing a solid theoretical basis for direct, data and digital marketing strategies.

    Access to such records can ensure enriching explorations or formulating insightful hypotheses about consumer behavior patterns among wholesalers. Whether it's managing inventory or studying transactional trends over time or spotting cancellation patterns - this dataset is apt for multiple forms of retail analysis

    How to use the dataset

    1. Sales Analysis:

    Sales data forms the backbone of this dataset, and it allows users to delve into various aspects of sales performance. You can use the Quantity and UnitPrice fields to calculate metrics like revenue, and further combine it with InvoiceNo information to understand sales over individual transactions.

    2. Product Analysis:

    Each product in this dataset comes with its unique identifier (StockCode) and its name (Description). You could analyse which products are most popular based on Quantity sold or look at popularity per transaction by considering both Quantity and InvoiceNo.

    3. Customer Segmentation:

    If you associated specific business logic onto the transactions (such as calculating total amounts), then you could use standard machine learning methods or even RFM (Recency, Frequency, Monetary) segmentation techniques combining it with 'CustomerID' for your customer base to understand customer behavior better. Concatenating invoice numbers (which stand for separate transactions) per client will give insights about your clients as well.

    4. Geographical Analysis:

    The Country column enables analysts to study purchase patterns across different geographical locations.

    Practical applications

    Understand what products sell best where - It can help drive tailored marketing strategies. Anomalies detection – Identify unusual behaviors that might lead frau...

  8. d

    SKU-Level Transaction Data | Point-of-Sale (POS) Data | 1M+ Grocery,...

    • datarade.ai
    Updated Jan 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MealMe (2025). SKU-Level Transaction Data | Point-of-Sale (POS) Data | 1M+ Grocery, Restaurant, and Retail stores stores with SKU level transactions [Dataset]. https://datarade.ai/data-products/sku-level-transaction-data-point-of-sale-pos-data-1m-g-mealme
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 22, 2025
    Dataset authored and provided by
    MealMe
    Area covered
    Kosovo, Japan, Swaziland, Indonesia, Ghana, Moldova (Republic of), Slovenia, Ecuador, Åland Islands, New Zealand
    Description

    MealMe provides comprehensive grocery and retail SKU-level product data, including real-time pricing, from the top 100 retailers in the USA and Canada. Our proprietary technology ensures accurate and up-to-date insights, empowering businesses to excel in competitive intelligence, pricing strategies, and market analysis.

    Retailers Covered: MealMe’s database includes detailed SKU-level data and pricing from leading grocery and retail chains such as Walmart, Target, Costco, Kroger, Safeway, Publix, Whole Foods, Aldi, ShopRite, BJ’s Wholesale Club, Sprouts Farmers Market, Albertsons, Ralphs, Pavilions, Gelson’s, Vons, Shaw’s, Metro, and many more. Our coverage spans the most influential retailers across North America, ensuring businesses have the insights needed to stay competitive in dynamic markets.

    Key Features: SKU-Level Granularity: Access detailed product-level data, including product descriptions, categories, brands, and variations. Real-Time Pricing: Monitor current pricing trends across major retailers for comprehensive market comparisons. Regional Insights: Analyze geographic price variations and inventory availability to identify trends and opportunities. Customizable Solutions: Tailored data delivery options to meet the specific needs of your business or industry. Use Cases: Competitive Intelligence: Gain visibility into pricing, product availability, and assortment strategies of top retailers like Walmart, Costco, and Target. Pricing Optimization: Use real-time data to create dynamic pricing models that respond to market conditions. Market Research: Identify trends, gaps, and consumer preferences by analyzing SKU-level data across leading retailers. Inventory Management: Streamline operations with accurate, real-time inventory availability. Retail Execution: Ensure on-shelf product availability and compliance with merchandising strategies. Industries Benefiting from Our Data CPG (Consumer Packaged Goods): Optimize product positioning, pricing, and distribution strategies. E-commerce Platforms: Enhance online catalogs with precise pricing and inventory information. Market Research Firms: Conduct detailed analyses to uncover industry trends and opportunities. Retailers: Benchmark against competitors like Kroger and Aldi to refine assortments and pricing. AI & Analytics Companies: Fuel predictive models and business intelligence with reliable SKU-level data. Data Delivery and Integration MealMe offers flexible integration options, including APIs and custom data exports, for seamless access to real-time data. Whether you need large-scale analysis or continuous updates, our solutions scale with your business needs.

    Why Choose MealMe? Comprehensive Coverage: Data from the top 100 grocery and retail chains in North America, including Walmart, Target, and Costco. Real-Time Accuracy: Up-to-date pricing and product information ensures competitive edge. Customizable Insights: Tailored datasets align with your specific business objectives. Proven Expertise: Trusted by diverse industries for delivering actionable insights. MealMe empowers businesses to unlock their full potential with real-time, high-quality grocery and retail data. For more information or to schedule a demo, contact us today!

  9. y

    All Payments to Suppliers - Dataset - York Open Data

    • data.yorkopendata.org
    • ckan.york.staging.datopian.com
    Updated Nov 5, 2015
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2015). All Payments to Suppliers - Dataset - York Open Data [Dataset]. https://data.yorkopendata.org/dataset/all-payments-to-suppliers
    Explore at:
    Dataset updated
    Nov 5, 2015
    License

    Open Government Licence 2.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/2/
    License information was derived automatically

    Area covered
    York
    Description

    Payments to suppliers made by City of York Council from April 2011 onwards. Resources are split according to financial years. Date: The date shown is the date the transaction was input to the system, not the payment date. Transaction number: Our internal reference number to enable us to identify an individual transaction. Transaction numbers beginning with CR relate to entries from the creditor system, usually straightforward payments or credit notes. Transaction numbers beginning with J relate to journal entries, which are usually an accounting entry to correct a miscoding error. Amount: All payments shown exclude VAT. Negative amounts relate to credit notes or corrections. Corrections: Miscoding errors may occur, for example the allocation of a payment to an incorrect expense area or expense type. These are usually corrected in the next month. One of the principles of the spending guidance is to make the data available quickly and to reflect how each individual item was originally recorded in the financial system. Therefore since this report includes only one months data it is likely to include some miscoding errors which have not been corrected yet. These corrections will not be back dated so will appear in the next months report. In the month that the correction occurs a credit (negative) amount will show against the incorrect expense area/ expense type and the corresponding payment will show against the correct expense area/expense type. Supplier Name: The name of the supplier or recipient of the payment. Payments to individuals which may contain sensitive information have been redacted. Supplier ID: Our internal reference number to enable us to identify the supplier. Expense Area: The department where the expenditure is incurred. Expense Type: The description of the nature of the spend.

  10. G

    Bank Transaction Categorization Sample

    • gomask.ai
    csv, json
    Updated Dec 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Bank Transaction Categorization Sample [Dataset]. https://gomask.ai/marketplace/datasets/bank-transaction-categorization-sample
    Explore at:
    csv(10 MB), jsonAvailable download formats
    Dataset updated
    Dec 2, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    amount, category, currency, account_id, description, subcategory, is_recurring, label_source, posting_date, location_city, and 7 more
    Description

    This dataset contains labeled bank transaction records, including detailed transaction metadata, merchant information, and manually or automatically assigned expense categories. It is ideal for developing, training, and benchmarking automated expense categorization models for personal finance, budgeting, and regulatory compliance applications.

  11. Retail Market Basket Transactions Dataset

    • kaggle.com
    Updated Aug 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wasiq Ali (2025). Retail Market Basket Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/wasiqaliyasir/retail-market-basket-transactions-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 25, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Wasiq Ali
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Overview

    The Market_Basket_Optimisation dataset is a classic transactional dataset often used in association rule mining and market basket analysis.
    It consists of multiple transactions where each transaction represents the collection of items purchased together by a customer in a single shopping trip.

    • File Name: Market_Basket_Optimisation.csv
    • Format: CSV (Comma-Separated Values)
    • Structure: Each row corresponds to one shopping basket. Each column in that row contains an item purchased in that basket.
    • Nature of Data: Transactional, categorical, sparse.
    • Primary Use Case: Discovering frequent itemsets and association rules to understand shopping patterns, product affinities, and to build recommender systems.

    Detailed Information

    📊 Dataset Composition

    • Transactions: 7,501 (each row = one basket).
    • Items (unique): Around 120 distinct products (e.g., bread, mineral water, chocolate, etc.).
    • Columns per row: Up to 20 possible items (not fixed; some rows have fewer, some more).
    • Data Type: Purely categorical (no numerical or continuous features).
    • Missing Values: Present in the form of empty cells (since not every basket has all 20 columns).
    • Duplicates: Some baskets may appear more than once — this is acceptable in transactional data as multiple customers can buy the same set of items.

    🛒 Nature of Transactions

    • Basket Definition: Each row captures items bought together during a single visit to the store.
    • Variability: Basket size varies from 1 to 20 items. Some customers buy only one product, while others purchase a full set of groceries.
    • Sparsity: Since there are ~120 unique items but only a handful appear in each basket, the dataset is sparse. Most entries in the one-hot encoded representation are zeros.

    🔎 Examples of Data

    Example transaction rows (simplified):

    Item 1Item 2Item 3Item 4...
    BreadButterJam
    Mineral waterChocolateEggsMilk
    SpaghettiTomato sauceParmesan

    Here, empty cells mean no item was purchased in that slot.

    📈 Applications of This Dataset

    This dataset is frequently used in data mining, analytics, and recommendation systems. Common applications include:

    1. Association Rule Mining (Apriori, FP-Growth):

      • Discover rules like {Bread, Butter} ⇒ {Jam} with high support and confidence.
      • Identify cross-selling opportunities.
    2. Product Affinity Analysis:

      • Understand which items tend to be purchased together.
      • Helps with store layout decisions (placing related items near each other).
    3. Recommendation Engines:

      • Build systems that suggest "You may also like" products.
      • Example: If a customer buys pasta and tomato sauce, recommend cheese.
    4. Marketing Campaigns:

      • Bundle promotions and discounts on frequently co-purchased products.
      • Personalized offers based on buying history.
    5. Inventory Management:

      • Anticipate demand for certain product combinations.
      • Prevent stockouts of items that drive the purchase of others.

    📌 Key Insights Potentially Hidden in the Dataset

    • Popular Items: Some items (like mineral water, eggs, spaghetti) occur far more frequently than others.
    • Product Pairs: Frequent pairs and triplets (e.g., pasta + sauce + cheese) reflect natural meal-prep combinations.
    • Basket Size Distribution: Most customers buy fewer than 5 items, but a small fraction buy 10+ items, showing long-tail behavior.
    • Seasonality (if extended with timestamps): Certain items might show peaks in demand during weekends or holidays (though timestamps are not included in this dataset).

    📂 Dataset Limitations

    1. No Customer Identifiers:

      • We cannot track repeated purchases by the same customer.
      • Analysis is limited to basket-level insights.
    2. No Timestamps:

      • No temporal analysis (trends over time, seasonality) is possible.
    3. No Quantities or Prices:

      • We only know whether an item was purchased, not how many units or its cost.
    4. Sparse & Noisy:

      • Many baskets are small (1–2 items), which may produce weak or trivial rules.

    🔮 Potential Extensions

    • Synthetic Timestamps: Assign simulated timestamps to study temporal buying patterns.
    • Add Customer IDs: If merged with external data, one can perform personalized recommendations.
    • Price Data: Adding cost allows for profit-driven association rules (not just frequency-based).
    • Deep Learning Models: Sequence models (RNNs, Transformers) could be applied if temporal ordering of items is introduced.

    ...

  12. Simulated Transactions

    • kaggle.com
    zip
    Updated Mar 15, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Conor (2022). Simulated Transactions [Dataset]. https://www.kaggle.com/datasets/conorsully1/simulated-transactions
    Explore at:
    zip(5725992721 bytes)Available download formats
    Dataset updated
    Mar 15, 2022
    Authors
    Conor
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    NOTE: these transactions are randomly generated. The customers represented in the dataset are not real.

    This is a large transaction dataset for data visualisation and processing tutorials. Transactions are generated for 75,000 customers and are classified into 12 expenditure types: - Groceries - Clothing
    - Housing
    - Education - Health
    - Motor/Travel
    - Entertainment - Gambling
    - Savings
    - Bills and Utilities
    - Tax
    - Fines

    Notebook used to generate data: here

  13. d

    Granular SKU-Level Transaction Data | Global Ecommerce Transactional Data...

    • datarade.ai
    Updated Jun 15, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Measurable AI (2022). Granular SKU-Level Transaction Data | Global Ecommerce Transactional Data for Emerging Markets [Dataset]. https://datarade.ai/data-products/granular-e-commerce-transactional-data-for-emerging-markets-measurable-ai
    Explore at:
    Dataset updated
    Jun 15, 2022
    Dataset authored and provided by
    Measurable AI
    Area covered
    Colombia, Mexico, Philippines, Malaysia, Italy, India, Indonesia, Saudi Arabia, France, United States of America
    Description

    Granular SKU-level transaction data from Measurable AI's proprietary email receipt panel across e-commerce companies in emerging markets.

    Our data is attained with consumer consent from our two consumer apps. We then aggregate and anonymize all the metrics across our panel to produce consumer insights for our end users. Our datasets are available on a granular and aggregate level.

    Key clients range from the e-commerce companies themselves, buyside firms, financial institutions, consultancies, market research agencies and academia.

  14. Bank Customer Segmentation (1M+ Transactions)

    • kaggle.com
    zip
    Updated Oct 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shivam Bansal (2021). Bank Customer Segmentation (1M+ Transactions) [Dataset]. https://www.kaggle.com/shivamb/bank-customer-segmentation
    Explore at:
    zip(25360448 bytes)Available download formats
    Dataset updated
    Oct 26, 2021
    Authors
    Shivam Bansal
    Description

    Bank Customer Segmentation

    Most banks have a large customer base - with different characteristics in terms of age, income, values, lifestyle, and more. Customer segmentation is the process of dividing a customer dataset into specific groups based on shared traits.

    According to a report from Ernst & Young, “A more granular understanding of consumers is no longer a nice-to-have item, but a strategic and competitive imperative for banking providers. Customer understanding should be a living, breathing part of everyday business, with insights underpinning the full range of banking operations.

    About this Dataset

    This dataset consists of 1 Million+ transaction by over 800K customers for a bank in India. The data contains information such as - customer age (DOB), location, gender, account balance at the time of the transaction, transaction details, transaction amount, etc.

    Interesting Analysis Ideas

    The dataset can be used for different analysis, example -

    1. Perform Clustering / Segmentation on the dataset and identify popular customer groups along with their definitions/rules
    2. Perform Location-wise analysis to identify regional trends in India
    3. Perform transaction-related analysis to identify interesting trends that can be used by a bank to improve / optimi their user experiences
    4. Customer Recency, Frequency, Monetary analysis
    5. Network analysis or Graph analysis of customer data.
  15. d

    Vision EUR Retail & Ecommerce Sales Data | Austria, France, Germany, Italy,...

    • datarade.ai
    .csv, .sql
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Consumer Edge, Vision EUR Retail & Ecommerce Sales Data | Austria, France, Germany, Italy, Spain, UK | 6.7M Accounts, 5K Merchants, 600 Companies [Dataset]. https://datarade.ai/data-products/consumer-edge-vision-eur-retail-ecommerce-sales-data-aust-consumer-edge
    Explore at:
    .csv, .sqlAvailable download formats
    Dataset authored and provided by
    Consumer Edge
    Area covered
    France, Germany, Italy, Spain, Austria, United Kingdom
    Description

    Global Spend Analysis with Consumer Edge Credit & Debit Card Transaction Data

    Consumer Edge is a leader in alternative consumer data for public and private investors and corporate clients. CE Vision EUR is an aggregated transaction feed that includes consumer transaction data on 6.7M+ Europe-domiciled payment accounts, including 5.3M+ active monthly users. Capturing online, offline, and 3rd-party consumer spending on public and private companies, data covers 4.4K+ brands and 620 symbols including 490 public tickers. Track detailed consumer behavior patterns, including retention, purchase frequency, and cross shop in addition to total spend, transactions, and dollars per transaction.

    Consumer Edge’s consumer transaction datasets offer insights into industries across consumer and discretionary spend such as: • Apparel, Accessories, & Footwear • Automotive • Beauty • Commercial – Hardlines • Convenience / Drug / Diet • Department Stores • Discount / Club • Education • Electronics / Software • Financial Services • Full-Service Restaurants • Grocery • Ground Transportation • Health Products & Services • Home & Garden • Insurance • Leisure & Recreation • Limited-Service Restaurants • Luxury • Miscellaneous Services • Online Retail – Broadlines • Other Specialty Retail • Pet Products & Services • Sporting Goods, Hobby, Toy & Game • Telecom & Media • Travel

    This data sample illustrates how Consumer Edge data can be used to understand a company’s growth by country for a specific time period (Ex: What was McDonald’s year-over-year growth by country from 2019-2020?)

    Inquire about a CE subscription to perform more complex, near real-time global spend analysis functions on public tickers and private brands like: • Analyze year-over-year spend growth for a company for a subindustry by country • Analyze spend growth for a company vs. its competitors by country through most recent time

    Consumer Edge offers a variety of datasets covering the US and Europe (UK, Austria, France, Germany, Italy, Spain), with subscription options serving a wide range of business needs.

    Use Case: Global Spend Analysis

    Problem A global retailer wants to understand company performance by geography to identify growth and expansion opportunities.

    Solution Consumer Edge transaction data can be used to analyze shopper behavior across geographies and track: • Growth trends by country vs. competitors • Brand performance vs. subindustry by country • Opportunities for product and location expansion

    Impact Marketing and Consumer Insights were able to: • Develop weekly reporting KPI's on key growth drivers by geography for company-wide reporting • Refine strategy in underperforming geographies, both online and offline • Identify areas for investment and expansion by country • Understand how different cohorts are performing compared to key competitors

    Corporate researchers and consumer insights teams use CE Vision for:

    Corporate Strategy Use Cases • Ecommerce vs. brick & mortar trends • Real estate opportunities • Economic spending shifts

    Marketing & Consumer Insights • Total addressable market view • Competitive threats & opportunities • Cross-shopping trends for new partnerships • Demo and geo growth drivers • Customer loyalty & retention

    Investor Relations • Shareholder perspective on brand vs. competition • Real-time market intelligence • M&A opportunities

    Most popular use cases for private equity and venture capital firms include: • Deal Sourcing • Live Diligences • Portfolio Monitoring

    Public and private investors can leverage insights from CE’s synthetic data to assess investment opportunities, while consumer insights, marketing, and retailers can gain visibility into transaction data’s potential for competitive analysis, understanding shopper behavior, and capturing market intelligence.

    Most popular use cases among public and private investors include: • Track Key KPIs to Company-Reported Figures • Understanding TAM for Focus Industries • Competitive Analysis • Evaluating Public, Private, and Soon-to-be-Public Companies • Ability to Explore Geographic & Regional Differences • Cross-Shop & Loyalty • Drill Down to SKU Level & Full Purchase Details • Customer lifetime value • Earnings predictions • Uncovering macroeconomic trends • Analyzing market share • Performance benchmarking • Understanding share of wallet • Seeing subscription trends

    Fields Include: • Day • Merchant • Subindustry • Industry • Spend • Transactions • Spend per Transaction (derivable) • Cardholder State • Cardholder CBSA • Cardholder CSA • Age • Income • Wealth • Ethnicity • Political Affiliation • Children in Household • Adults in Household • Homeowner vs. Renter • Business Owner • Retention by First-Shopped Period • Churn • Cross-Shop • Average Ticket Buckets

  16. c

    Transaction Graph Dataset for the Ethereum Blockchain - Dataset - CryptoData...

    • cryptodata.center
    Updated Dec 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Transaction Graph Dataset for the Ethereum Blockchain - Dataset - CryptoData Hub [Dataset]. https://cryptodata.center/dataset/transaction-graph-dataset-for-the-ethereum-blockchain
    Explore at:
    Dataset updated
    Dec 4, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Уникальный идентификатор https://doi.org/10.5281/zenodo.4718440 Набор данных обновлен Dec 19, 2022 Набор данных предоставлен Zenodo Авторы Can Özturan; Can Özturan; Alper Şen; Alper Şen; Baran Kılıç; Baran Kılıç Лицензия Attribution 4.0 (CC BY 4.0) Информация о лицензии была получена автоматически Описание This dataset contains ether as well as popular ERC20 token transfer transactions extracted from the Ethereum Mainnet blockchain. Only send ether, contract function call, contract deployment transactions are present in the dataset. Miner reward (static block reward) and "uncle block inclusion reward" are added as transactions to the dataset. Transaction fee reward and "uncles reward" are not currently included in the dataset. Details of the datasets are given below: FILENAME FORMAT: The filenames have the following format: eth-tx- where For example file eth-tx-1000000-1099999.txt.bz2 contains transactions from block 1000000 to block 1099999 inclusive. The files are compressed with bzip2. They can be uncompressed using command bunzip2. TRANSACTION FORMAT: Each line in a file corresponds to a transaction. The transaction has the following format: units. ERC20 tokens transfers (transfer and transferFrom function calls in ERC20 contract) are indicated by token symbol. For example GUSD is Gemini USD stable coin. The JSON file erc20tokens.json given below contains the details of ERC20 tokens. Failed transactions are prefixed with "F-". BLOCK TIME FORMAT: The block time file has the following format: erc20tokens.json FILE: This file contains the list of popular ERC20 token contracts whose transfer/transferFrom transactions appear in the data files. ERC20 token list: USDT TRYb XAUt BNB LEO LINK HT HEDG MKR CRO VEN INO PAX INB SNX REP MOF ZRX SXP OKB XIN OMG SAI HOT DAI EURS HPT BUSD USDC SUSD HDG QCAD PLUS BTCB WBTC cWBTC renBTC sBTC imBTC pBTC IMPORTANT NOTE: Public Ethereum Mainnet blockchain data is open and can be obtained by connecting as a node on the blockchain or by using the block explorer web sites such as http://etherscan.io . The downloaders and users of this dataset accept the full responsibility of using the data in GDPR compliant manner or any other regulations. We provide the data as is and we cannot be held responsible for anything. NOTE: If you use this dataset, please do not forget to add the DOI number to the citation. If you use our dataset in your research, please also cite our paper: https://link.springer.com/article/10.1007/s10586-021-03511-0 @article{kilic2022parallel, title={Parallel Analysis of Ethereum Blockchain Transaction Data using Cluster Computing}, journal={Cluster Computing}, author={K{\i}l{\i}{\c{c}}, Baran and {"O}zturan, Can and Sen, Alper}, year={2022}, month={Jan} }

  17. Simulated Transactions Parquet Format

    • kaggle.com
    zip
    Updated Apr 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amanda Martin62 (2022). Simulated Transactions Parquet Format [Dataset]. https://www.kaggle.com/datasets/amandamartin62/simulated-transactions-parquet-format
    Explore at:
    zip(5184522564 bytes)Available download formats
    Dataset updated
    Apr 12, 2022
    Authors
    Amanda Martin62
    Description

    This data is a parquet format of conorsully1/simulated-transactions.

    NOTE: these transactions are randomly generated. The customers represented in the dataset are not real.

    This is a large transaction dataset for data visualisation and processing tutorials. Transactions are generated for 75,000 customers and are classified into 12 expenditure types:

    Groceries Clothing Housing Education Health Motor/Travel Entertainment Gambling Savings Bills and Utilities Tax Fines Notebook used to generate data: here

  18. d

    Example transaction

    • dune.com
    Updated Jun 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    cflan (2025). Example transaction [Dataset]. https://dune.com/discover/content/relevant?q=author:cflan&resource-type=queries
    Explore at:
    Dataset updated
    Jun 21, 2025
    Authors
    cflan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Blockchain data query: Example transaction

  19. d

    SFMTA Parking Meter Detailed Revenue Transactions

    • catalog.data.gov
    • data.sfgov.org
    • +1more
    Updated Oct 25, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.sfgov.org (2025). SFMTA Parking Meter Detailed Revenue Transactions [Dataset]. https://catalog.data.gov/dataset/sfmta-parking-meter-detailed-revenue-transactions
    Explore at:
    Dataset updated
    Oct 25, 2025
    Dataset provided by
    data.sfgov.org
    Description

    Parking meter transaction records where each row equals a single transaction by a single customer at a single meter. Subsequent transactions by a customer to extend time, for example, are captured as new transaction record and indicated with a METER_EVENT_TYPE of AT (Additional Time). POST_ID is a join key that refers to the identifier for a meter. You can find that inventory including location of the meter here: https://data.sfgov.org/d/8vzz-qzz9

  20. c

    Data from: CreditCardTransactions Dataset

    • cubig.ai
    zip
    Updated Jul 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). CreditCardTransactions Dataset [Dataset]. https://cubig.ai/store/products/554/creditcardtransactions-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 8, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Credit_Card_Transactions Dataset is a representative sample data for building fraud detection models, including anonymized real-world transaction data such as financial transaction type, amount, sender/receiver account balance, and fraud indicators.

    2) Data Utilization (1) Credit_Card_Transactions Dataset has characteristics that: • This dataset provides individual transaction records on a row-by-row basis, reflecting the real-world class imbalance problem with the extremely low percentage of fraudulent transactions (isFraud=1). • It is an unprocessed raw data structure that allows you to directly utilize key variables such as transaction time, amount, and account change. (2) Credit_Card_Transactions Dataset can be used to: • Binary classification modeling: Fraud transaction detection models can be developed by applying imbalance processing techniques such as SMOTE and undersampling, and appropriate evaluation indicators such as F1-score and ROC-AUC. • Real-time anomaly detection: It can be used to build a real-time anomaly signal detection system through analysis of transaction patterns (amount, frequency, account change).

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Priyam Choksi (2024). Credit Card Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/priyamchoksi/credit-card-transactions-dataset
Organization logo

Data from: Credit Card Transactions Dataset

Using Transactional Data for Financial Analysis and Fraud Detection

Related Article
Explore at:
zip(152554916 bytes)Available download formats
Dataset updated
Jul 23, 2024
Authors
Priyam Choksi
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

The Credit Card Transactions Dataset provides detailed records of credit card transactions, including information about transaction times, amounts, and associated personal and merchant details. This dataset has over 1.85M rows.

How This Dataset Can Be Used:

Fraud Detection : Use machine learning models to identify fraudulent transactions by examining patterns in transaction amounts, locations, and user profiles. Enhancing fraud detection systems becomes feasible by analyzing behavioral patterns.

Customer Segmentation : Segment customers based on spending patterns, location, and demographics. Tailor marketing strategies and personalized offers to these different customer segments for better engagement.

Transaction Classification : Classify transactions into categories such as grocery or entertainment to understand spending behaviors. This helps in improving recommendation systems by identifying transaction categories and preferences.

Geospatial Analysis : Analyze transaction data geographically to map spending patterns and detect regional trends or anomalies based on latitude and longitude.

Predictive Modeling : Build models to forecast future spending behavior using historical transaction data. Predict potential fraudulent activities and financial trends.

Behavioral Analysis : Examine how factors like transaction amount, merchant type, and time influence spending behavior. Study the relationships between user demographics and transaction patterns.

Anomaly Detection : Identify unusual transaction patterns that deviate from normal behavior to detect potential fraud early. Employ anomaly detection techniques to spot outliers and suspicious activities.

Search
Clear search
Close search
Google apps
Main menu