100+ datasets found

Credit Card Fraud Detection Dataset
kaggle.com
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ghanshyam Saini (2025). Credit Card Fraud Detection Dataset [Dataset]. https://www.kaggle.com/datasets/ghnshymsaini/credit-card-fraud-detection-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 15, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ghanshyam Saini
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Credit Card Fraud Detection Dataset (European Cardholders, September 2013)

As a data contributor, I'm sharing this crucial dataset focused on the detection of fraudulent credit card transactions. Recognizing these illicit activities is paramount for protecting customers and the integrity of financial systems.

About the Dataset:

This dataset encompasses credit card transactions made by European cardholders during a two-day period in September 2013. It presents a real-world scenario with a significant class imbalance, where fraudulent transactions are considerably less frequent than legitimate ones. Out of a total of 284,807 transactions, only 492 are instances of fraud, representing a mere 0.172% of the entire dataset.

Content of the Data:

Due to confidentiality concerns, the majority of the input features in this dataset have undergone a Principal Component Analysis (PCA) transformation. This means the original meaning and context of features V1, V2, ..., V28 are not directly provided. However, these principal components capture the variance in the underlying transaction data.

The only features that have not been transformed by PCA are:

Time: Numerical. Represents the number of seconds elapsed between each transaction and the very first transaction recorded in the dataset.

Amount: Numerical. The transaction amount in Euros (€). This feature could be valuable for cost-sensitive learning approaches.

The target variable for this classification task is:

Class: Integer. Takes the value 1 in the case of a fraudulent transaction and 0 otherwise.

Important Note on Evaluation:

Given the substantial class imbalance (far more legitimate transactions than fraudulent ones), traditional accuracy metrics based on the confusion matrix can be misleading. It is strongly recommended to evaluate models using the Area Under the Precision-Recall Curve (AUPRC), as this metric is more sensitive to the performance on the minority class (fraudulent transactions).

How to Use This Dataset:

Download the dataset file (likely in CSV format).

Load the data using libraries like Pandas.

Understand the class imbalance: Be aware that fraudulent transactions are rare.

Explore the features: Analyze the distributions of 'Time', 'Amount', and the PCA-transformed features (V1-V28).

Address the class imbalance: Consider using techniques like oversampling the minority class, undersampling the majority class, or using specialized algorithms designed for imbalanced datasets.

Build and train binary classification models to predict the 'Class' variable.

Evaluate your models using AUPRC to get a meaningful assessment of performance in detecting fraud.

Acknowledgements and Citation:

This dataset has been collected and analyzed through a research collaboration between Worldline and the Machine Learning Group (MLG) of ULB (Université Libre de Bruxelles).

When using this dataset in your research or projects, please cite the following works as appropriate:

Andrea Dal Pozzolo, Olivier Caelen, Reid A. Johnson and Gianluca Bontempi. Calibrating Probability with Undersampling for Unbalanced Classification. In Symposium on Computational Intelligence and Data Mining (CIDM), IEEE, 2015.

Dal Pozzolo, Andrea; Caelen, Olivier; Le Borgne, Yann-Ael; Waterschoot, Serge; Bontempi, Gianluca. Learned lessons in credit card fraud detection from a practitioner perspective, Expert systems with applications,41,10,4915-4928,2014, Pergamon.

Dal Pozzolo, Andrea; Boracchi, Giacomo; Caelen, Olivier; Alippi, Cesare; Bontempi, Gianluca. Credit card fraud detection: a realistic modeling and a novel learning strategy, IEEE transactions on neural networks and learning systems,29,8,3784-3797,2018,IEEE.

Andrea Dal Pozzolo. Adaptive Machine learning for credit card fraud detection ULB MLG PhD thesis (supervised by G. Bontempi).

Fabrizio Carcillo, Andrea Dal Pozzolo, Yann-Aël Le Borgne, Olivier Caelen, Yannis Mazzer, Gianluca Bontempi. Scarff: a scalable framework for streaming credit card fraud detection with Spark, Information fusion,41, 182-194,2018,Elsevier.

Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Gianluca Bontempi. Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization, International Journal of Data Science and Analytics, 5,4,285-300,2018,Springer International Publishing.

Bertrand Lebichot, Yann-Aël Le Borgne, Liyun He, Frederic Oblé, Gianluca Bontempi Deep-Learning Domain Adaptation Techniques for Credit Cards Fraud Detection, INNSBDDL 2019: Recent Advances in Big Data and Deep Learning, pp 78-88, 2019.

Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Frederic Oblé, Gianluca Bontempi *Combining Unsupervised and Supervised...
g
Credit Card Fraud Detection
gts.ai
json
Updated Jun 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GTS (2024). Credit Card Fraud Detection [Dataset]. https://gts.ai/dataset-download/credit-card-fraud-detection/
Explore at:
jsonAvailable download formats
Dataset updated
Jun 25, 2024
Dataset provided by
GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
Authors
GTS
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Explore our anonymized card fraud detection dataset, perfect for developing robust machine learning models.
C
Credit Card Fraud Detection Platform Report
archivemarketresearch.com
doc, pdf, ppt
Updated Mar 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Credit Card Fraud Detection Platform Report [Dataset]. https://www.archivemarketresearch.com/reports/credit-card-fraud-detection-platform-56852
Explore at:
ppt, doc, pdfAvailable download formats
Dataset updated
Mar 14, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global credit card fraud detection platform market is experiencing robust growth, driven by the escalating volume of digital transactions and the increasing sophistication of fraudulent activities. While precise figures for market size and CAGR are not provided, based on industry reports and observed trends, a reasonable estimation places the 2025 market size at approximately $15 billion. Considering the rapid adoption of advanced technologies like AI and machine learning in fraud detection, a conservative Compound Annual Growth Rate (CAGR) of 15% is projected for the forecast period (2025-2033). This growth is fueled by several factors, including the rising prevalence of e-commerce, the expanding adoption of mobile payments, and the increasing demand for robust security solutions from both personal and enterprise users. The market is segmented by screening type (manual and automatic) and application (personal and enterprise), with the automatic screening and enterprise segments expected to witness faster growth due to their efficiency and scalability. The competitive landscape is highly dynamic, with a mix of established players like Visa, Mastercard, and FICO, alongside innovative technology companies like Kount, Riskified, and Feedzai. These companies are continuously developing and deploying advanced algorithms and analytics to stay ahead of evolving fraud techniques. Regional growth varies, with North America and Europe currently holding significant market share, though Asia-Pacific is projected to exhibit rapid expansion due to increasing internet penetration and e-commerce adoption in developing economies. Challenges to market growth include the high cost of implementation and maintenance of these platforms, along with the need for continuous updates to counter evolving fraud tactics. However, the increasing financial losses incurred due to fraud are incentivizing businesses and consumers to invest in more sophisticated fraud detection solutions, thereby sustaining the market's upward trajectory.
Credit card fraud detection Date 25th of June 2015
kaggle.com
Updated Oct 29, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zohair ahmed (2023). Credit card fraud detection Date 25th of June 2015 [Dataset]. https://www.kaggle.com/datasets/qnqfbqfqo/credit-card-fraud-detection-date-25th-of-june-2015
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 29, 2023
Dataset provided by
Kaggle
Authors
Zohair ahmed
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The datasets contains transactions made by credit cards in September 2013 by european cardholders. This dataset present transactions that occurred in two days, where we have 492 frauds out of 284,807 transactions. The dataset is highly unbalanced, the positive class (frauds) account for 0.172% of all transactions.

It contains only numerical input variables which are the result of a PCA transformation. Unfortunately, due to confidentiality issues, we cannot provide the original features and more background information about the data. Features V1, V2, ... V28 are the principal components obtained with PCA, the only features which have not been transformed with PCA are 'Time' and 'Amount'. Feature 'Time' contains the seconds elapsed between each transaction and the first transaction in the dataset. The feature 'Amount' is the transaction Amount, this feature can be used for example-dependant cost-senstive learning. Feature 'Class' is the response variable and it takes value 1 in case of fraud and 0 otherwise.

The dataset has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (mlg.ulb.ac.be) of ULB (Université Libre de Bruxelles) on big data mining and fraud detection. More details on current and past projects on related topics are available on http://mlg.ulb.ac.be/BruFence and http://mlg.ulb.ac.be/ARTML.
Credit Card Fraud Detection Dataset
kaggle.com
Updated May 30, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shuvom Dhar (2025). Credit Card Fraud Detection Dataset [Dataset]. https://www.kaggle.com/datasets/shuvomdhar/credit-card-fraud-detection-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 30, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shuvom Dhar
Description
Dataset

This dataset was created by Shuvom Dhar

Contents
t
Credit Card Fraud Detection
test.researchdata.tuwien.ac.at
zenodo.org
+1more
csv, json, pdf +2
Updated Apr 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja (2025). Credit Card Fraud Detection [Dataset]. http://doi.org/10.82556/yvxj-9t22
Explore at:
text/markdown, csv, pdf, txt, jsonAvailable download formats
Unique identifier
https://doi.org/10.82556/yvxj-9t22
Dataset updated
Apr 28, 2025
Dataset provided by
TU Wien
Authors
Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Apr 28, 2025
Description
Below is a draft DMP–style description of your credit‐card fraud detection experiment, modeled on the antiquities example:

1. Dataset Description

Research Domain
This work resides in the domain of financial fraud detection and applied machine learning. We focus on detecting anomalous credit‐card transactions in real time to reduce financial losses and improve trust in digital payment systems.

Purpose
The goal is to train and evaluate a binary classification model that flags potentially fraudulent transactions. By publishing both the code and data splits via FAIR repositories, we enable reproducible benchmarking of fraud‐detection algorithms and support future research on anomaly detection in transaction data.

Data Sources
We used the publicly available credit‐card transaction dataset from Kaggle (original source: https://www.kaggle.com/mlg-ulb/creditcardfraud), which contains anonymized transactions made by European cardholders over two days in September 2013. The dataset includes 284 807 transactions, of which 492 are fraudulent.

Method of Dataset Preparation

Schema validation: Renamed columns to snake_case (e.g. transaction_amount, is_declined) so they conform to DBRepo’s requirements.

Data import: Uploaded the full CSV into DBRepo, assigned persistent identifiers (PIDs).

Splitting: Programmatically derived three subsets—training (70%), validation (15%), test (15%)—using range‐based filters on the primary key actionnr. Each subset was materialized in DBRepo and assigned its own PID for precise citation.

Cleaning: Converted the categorical flags (is_declined, isforeigntransaction, ishighriskcountry, isfradulent) from “Y”/“N” to 1/0 and dropped non‐feature identifiers (actionnr, merchant_id).

Modeling: Trained a RandomForest classifier on the training split, tuned on validation, and evaluated on the held‐out test set.

2. Technical Details

Dataset Structure

The raw data is a single CSV with columns:

actionnr (integer transaction ID)

merchant_id (string)

average_amount_transaction_day (float)

transaction_amount (float)

is_declined, isforeigntransaction, ishighriskcountry, isfradulent (binary flags)

total_number_of_declines_day, daily_chargeback_avg_amt, sixmonth_avg_chbk_amt, sixmonth_chbk_freq (numeric features)

Naming Conventions

All columns use lowercase snake_case.

Subsets are named creditcard_training, creditcard_validation, creditcard_test in DBRepo.

Files in the code repo follow a clear structure:

├── data/ # local copies only; raw data lives in DBRepo ├── notebooks/Task.ipynb ├── models/rf_model_v1.joblib ├── outputs/ # confusion_matrix.png, roc_curve.png, predictions.csv ├── README.md ├── requirements.txt └── codemeta.json

Required Software

Python 3.9+

pandas, numpy (data handling)

scikit-learn (modeling, metrics)

matplotlib (visualizations)

dbrepo‐client.py (DBRepo API)

requests (TU WRD API)

Additional Resources

Original dataset: https://www.kaggle.com/mlg-ulb/creditcardfraud

Scikit-learn docs: https://scikit-learn.org/stable

DBRepo API guide: via the starter notebook’s dbrepo_client.py template

TU WRD REST API spec: https://test.researchdata.tuwien.ac.at/api/docs

3. Further Details

Data Limitations

Highly imbalanced: only ~0.17% of transactions are fraudulent.

Anonymized PCA features (V1–V28) hidden; we extended with domain features but cannot reverse engineer raw variables.

Time‐bounded: only covers two days of transactions, may not capture seasonal patterns.

Licensing and Attribution

Raw data: CC-0 (per Kaggle terms)

Code & notebooks: MIT License

Model artifacts & outputs: CC-BY 4.0

DUWRD records include ORCID identifiers for the author.

Recommended Uses

Benchmarking new fraud‐detection algorithms on a standard imbalanced dataset.

Educational purposes: demonstrating model‐training pipelines, FAIR data practices.

Extension: adding time‐series or deep‐learning models.

Known Issues

Possible temporal leakage if date/time features not handled correctly.

Model performance may degrade on live data due to concept drift.

Binary flags may oversimplify nuanced transaction outcomes.
m
Complète Marché de la plate-forme de détection de fraude par carte de crédit...
marketresearchintellect.com
Updated May 19, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Intellect (2025). Complète Marché de la plate-forme de détection de fraude par carte de crédit Taille, part et perspectives sectorielles 2033 [Dataset]. https://www.marketresearchintellect.com/fr/product/credit-card-fraud-detection-platform-market/
Explore at:
Dataset updated
May 19, 2025
Dataset authored and provided by
Market Research Intellect
License
https://www.marketresearchintellect.com/fr/privacy-policyhttps://www.marketresearchintellect.com/fr/privacy-policy
Area covered
Global
Description
Learn more about Market Research Intellect's Credit Card Fraud Detection Platform Market Report, valued at USD 3.5 billion in 2024, and set to grow to USD 8.2 billion by 2033 with a CAGR of 10.5% (2026-2033).
f
creditcard Dataset
figshare.com
csv
Updated Jun 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohammad Shanaa; Sherief Abdallah (2025). creditcard Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.29270873.v1
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.29270873.v1
Dataset updated
Jun 9, 2025
Dataset provided by
figshare
Authors
Mohammad Shanaa; Sherief Abdallah
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Title: Credit Card Transactions Dataset for Fraud Detection (Used in: A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning)Description:This dataset, commonly known as creditcard.csv, contains anonymized credit card transactions made by European cardholders in September 2013. It includes 284,807 transactions, with 492 labeled as fraudulent. Due to confidentiality constraints, features have been transformed using PCA, except for 'Time' and 'Amount'.This dataset was used in the research article titled "A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning for Credit Card Fraud Detection". The study proposes an ensemble model integrating techniques such as Autoencoders, Isolation Forest, Local Outlier Factor, and supervised classifiers including XGBoost and Random Forest, aiming to improve the detection of rare fraudulent patterns while maintaining efficiency and scalability.Key Features:30 numerical input features (V1–V28, Time, Amount)Class label indicating fraud (1) or normal (0)Imbalanced class distribution typical in real-world fraud detectionUse Case:Ideal for benchmarking and evaluating anomaly detection and classification algorithms in highly imbalanced data scenarios.Source:Originally published by the Machine Learning Group at Université Libre de Bruxelles.https://www.kaggle.com/mlg-ulb/creditcardfraudLicense:This dataset is distributed for academic and research purposes only. Please cite the original source when using the dataset.
c
Data from: Credit Card Transactions Dataset
cubig.ai
Updated May 28, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CUBIG (2025). Credit Card Transactions Dataset [Dataset]. https://cubig.ai/store/products/336/credit-card-transactions-dataset
Explore at:
Dataset updated
May 28, 2025
Dataset authored and provided by
CUBIG
License
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
Measurement technique
Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
Description
1) Data Introduction • The Credit Card Transactions Dataset includes more than 20 million credit card transactions over the decades of 2,000 U.S. resident consumers created by IBM's simulations, providing details of each transaction and fraudulent labels.

2) Data Utilization (1) Credit Card Transactions Dataset has characteristics that: • This dataset provides a variety of properties that are similar to real credit card transactions, including transaction amount, time, card information, purchase location, and store category (MCC). (2) Credit Card Transactions Dataset can be used to: • Development of Credit Card Fraud Detection Model: Using transaction history and properties, you can build a fraud (abnormal transaction) detection model based on machine learning. • Analysis of consumption patterns and risks: Long-term and diverse transaction data can be used to analyze customer consumption behavior and identify risk factors.
C
Credit Card Fraud Detection Platform Report
archivemarketresearch.com
doc, pdf, ppt
Updated Feb 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Credit Card Fraud Detection Platform Report [Dataset]. https://www.archivemarketresearch.com/reports/credit-card-fraud-detection-platform-14208
Explore at:
ppt, pdf, docAvailable download formats
Dataset updated
Feb 8, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
Market Analysis for Credit Card Fraud Detection Platform The global credit card fraud detection platform market is estimated to reach USD 25.2 billion by 2033, growing at a CAGR of 14.3% from 2025 to 2033. The increasing adoption of digital payment methods, rising incidences of cybercrime, and stringent regulations on data security drive the market growth. The adoption of advanced technologies like machine learning and artificial intelligence in fraud detection solutions further fuels market expansion. The market is segmented into application (personal, enterprise) and type (manual screening, automatic screening). The enterprise segment dominates the market due to the growing demand for fraud protection in corporate environments. Automatic screening solutions are gaining popularity as they automate the fraud detection process, reducing operational costs and improving efficiency. Key market players include Kount, ClearSale, Stripe Radar, Riskified, Sift, SEON, Visa Advanced Authorization, Mastercard, Akkio, and Grid Dynamics. North America holds the largest market share due to the high adoption of advanced fraud detection technologies and the presence of major financial institutions in the region.
C
Credit Card Fraud Detection Platform Report
datainsightsmarket.com
doc, pdf, ppt
Updated Jul 7, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Data Insights Market (2025). Credit Card Fraud Detection Platform Report [Dataset]. https://www.datainsightsmarket.com/reports/credit-card-fraud-detection-platform-1982870
Explore at:
pdf, doc, pptAvailable download formats
Dataset updated
Jul 7, 2025
Dataset authored and provided by
Data Insights Market
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The global credit card fraud detection platform market is experiencing robust growth, driven by the increasing prevalence of digital transactions and the sophistication of fraudulent activities. The market, estimated at $15 billion in 2025, is projected to maintain a healthy Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching approximately $45 billion by 2033. This expansion is fueled by several key factors: the rising adoption of e-commerce and mobile payments, the increasing volume of online transactions, the growing need for robust security measures among businesses to protect customer data and prevent financial losses, and the continuous evolution of fraud techniques necessitating advanced detection capabilities. Furthermore, the increasing regulatory scrutiny and compliance requirements are pushing organizations to invest heavily in sophisticated fraud detection systems. The market is segmented by deployment (cloud-based and on-premise), by organization size (small, medium, and large enterprises), and by industry vertical (banking, financial services, and insurance, retail, healthcare, and others). Key players in this dynamic market include established companies like Kount, ClearSale, Stripe Radar, Riskified, and FICO, alongside emerging technology providers like Akkio and Dataiku. These companies are constantly innovating to improve detection accuracy, reduce false positives, and offer seamless integration with existing payment processing systems. While challenges remain, such as the rising complexity of fraud schemes and the need to balance security with user experience, the market is poised for continued strong growth, driven by technological advancements in machine learning, artificial intelligence, and big data analytics. The increasing adoption of real-time fraud detection and advanced analytics capabilities will further shape the market landscape in the coming years, creating opportunities for both established and emerging players.
G
Credit Card Fraud Detection
gomask.ai
Updated Jul 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GoMask.ai (2025). Credit Card Fraud Detection [Dataset]. https://gomask.ai/marketplace/datasets/credit-card-fraud-detection
Explore at:
(Unknown)Available download formats
Dataset updated
Jul 12, 2025
Dataset provided by
GoMask.ai
License
https://gomask.ai/licensehttps://gomask.ai/license
Variables measured
is_fraud, entry_mode, card_number, merchant_id, cardholder_id, currency_code, cardholder_age, transaction_id, is_international, transaction_city, and 7 more
Description
This dataset provides detailed, labeled records of simulated credit card transactions, including transaction amounts, merchant and cardholder information, and fraud indicators. It is ideal for developing and benchmarking machine learning models aimed at detecting fraudulent activity and reducing financial risk in payment systems. The inclusion of transaction context and cardholder demographics supports advanced analytics and feature engineering.
A Novel Credit Card Fraud Detection Method
zenodo.org
bin
Updated Jul 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xiaoyan Zhao; Xiaoyan Zhao (2023). A Novel Credit Card Fraud Detection Method [Dataset]. http://doi.org/10.5281/zenodo.8159789
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.8159789
Dataset updated
Jul 19, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Xiaoyan Zhao; Xiaoyan Zhao
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Credit card fraud can lead to significant financial losses for both individuals and financial institutions. In this paper, we propose a novel method called CTCN, which uses Conditional Tabular Generative Adversarial Networks (CTGAN) and Temporal Convolutional Network (TCN) for credit card fraud detection. Our approach includes an oversampling algorithm that uses CTGAN to balance the dataset, and Neighborhood Cleaning Rule (NCL) to filter out majority class samples that overlap with the minority class. We generate synthetic minority class samples that conform to the original data distribution, resulting in a balanced dataset. We then employ TCN to analyze transaction sequences and capture long-term dependencies between data, revealing potential relationships between transaction sequences, thus achieving accurate credit card fraud detection. Experiments on three public datasets demonstrate that our proposed method outperforms current machine learning and deep learning methods, as measured by recall, F1-Score, and AUC-ROC.
G
Credit Card Fraud Patterns
gomask.ai
Updated Jul 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GoMask.ai (2025). Credit Card Fraud Patterns [Dataset]. https://gomask.ai/marketplace/datasets/credit-card-fraud-patterns
Explore at:
(Unknown)Available download formats
Dataset updated
Jul 12, 2025
Dataset provided by
GoMask.ai
License
https://gomask.ai/licensehttps://gomask.ai/license
Variables measured
is_fraud, device_id, is_online, entry_mode, fraud_type, card_number, merchant_id, cardholder_id, currency_code, location_city, and 11 more
Description
This dataset contains simulated credit card transaction records, including detailed information on transaction amounts, merchant details, geolocation, device usage, and fraud labels. It is designed for training and evaluating fraud detection models, supporting the identification of both typical and anomalous transaction patterns. The dataset is ideal for fintech AI development, security analytics, and research into payment fraud behaviors.
G
Credit Card Transaction Fraud Flags
gomask.ai
Updated Jul 12, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GoMask.ai (2025). Credit Card Transaction Fraud Flags [Dataset]. https://gomask.ai/marketplace/datasets/credit-card-transaction-fraud-flags
Explore at:
(Unknown)Available download formats
Dataset updated
Jul 12, 2025
Dataset provided by
GoMask.ai
License
https://gomask.ai/licensehttps://gomask.ai/license
Variables measured
amount, currency, entry_mode, fraud_flag, fraud_score, merchant_id, terminal_id, card_present, merchant_name, transaction_id, and 11 more
Description
This dataset provides detailed credit card transaction records enriched with fraud suspicion flags, risk scores, and contextual information such as merchant, location, and transaction method. It is ideal for developing, training, and evaluating fraud detection models, as well as for analyzing transaction patterns and identifying emerging fraud tactics in the financial sector.
v
Global Credit Card Fraud Detection Platform Market Size By Deployment...
verifiedmarketresearch.com
pdf,excel,csv,ppt
Updated Jul 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Verified Market Research (2025). Global Credit Card Fraud Detection Platform Market Size By Deployment (Cloud-Based, On-Premise), By Technology (Machine Learning, Rule-Based, Hybrid), By End-User (Banks, Payment Processors, E-commerce), By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/credit-card-fraud-detection-platform-market/
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset updated
Jul 14, 2025
Dataset authored and provided by
Verified Market Research
License
https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/
Time period covered
2026 - 2032
Area covered
Global
Description
Credit Card Fraud Detection Platform Market size was valued at USD 3.4 Billion in 2024 and is projected to reach USD 12.44 Billion by 2032, growing at a CAGR of 17.6% during the forecast period 2026 to 2032.Global Credit Card Fraud Detection Platform Market Drivers:The market drivers for the credit card fraud detection platform market can be influenced by various factors. These may include:Rising Incidence of Online Payment Fraud: The increasing number of fraud attempts during online transactions pushes financial institutions to adopt platforms that monitor and detect unauthorized credit card activity in real time.Growth in E-Commerce Transactions: With more consumers shopping online, the volume of card-not-present transactions rises, creating higher exposure to fraud and driving demand for detection platforms to secure digital payments.
f
CCFD_dataset
figshare.com
xlsx
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nur Amirah Ishak; Keng-Hoong Ng; Gee-Kok Tong; Suraya Nurain Kalid; Kok-Chin Khor (2023). CCFD_dataset [Dataset]. http://doi.org/10.6084/m9.figshare.16695616.v3
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.16695616.v3
Dataset updated
May 30, 2023
Dataset provided by
figshare
Authors
Nur Amirah Ishak; Keng-Hoong Ng; Gee-Kok Tong; Suraya Nurain Kalid; Kok-Chin Khor
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
The dataset has been released by [1], which had been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (http://mlg.ulb.ac.be) of Université Libre de Bruxelles (ULB) on big data mining and fraud detection. [1] Pozzolo, A. D., Caelan, O., Johnson, R. A., and Bontempi, G. (2015). Calibrating Probability with Undersampling for Unbalanced Classification. 2015 IEEE Symposium Series on Computational, pp. 159-166, doi: 10.1109/SSCI.2015.33 open source kaggle : https://www.kaggle.com/mlg-ulb/creditcardfraud
Abstract data set for Credit card fraud detection
kaggle.com
Updated Apr 9, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shubham Joshi (2018). Abstract data set for Credit card fraud detection [Dataset]. https://www.kaggle.com/shubhamjoshi2130of/abstract-data-set-for-credit-card-fraud-detection/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 9, 2018
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Shubham Joshi
Description
Dataset

This dataset was created by Shubham Joshi

Released under Data files © Original Authors

Contents
Card fraud in the U.S. versus rest of the world 2014-2023, with global...
statista.com
Updated Jun 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Card fraud in the U.S. versus rest of the world 2014-2023, with global forecasts 2028 [Dataset]. https://www.statista.com/statistics/1264329/value-fraudulent-card-transactions-worldwide/
Explore at:
Dataset updated
Jun 25, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Dec 2024
Area covered
United States
Description
Payment card fraud - including both credit cards and debit cards - is forecast to grow by over ** billion U.S. dollars between 2022 and 2028. Especially outside the United States, the amount of fraudulent payments almost doubled from 2014 to 2021. In total, fraudulent card payments reached ** billion U.S. dollars in 2021. Card fraud losses across the world increased by more than ** percent between 2020 and 2021, the largest increase since 2018.
credit card fraud detection
kaggle.com
Updated Dec 21, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Karthik (2024). credit card fraud detection [Dataset]. https://www.kaggle.com/kr1kaggle/credit-card-fraud-detection/discussion
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 21, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Karthik
Description
Dataset

This dataset was created by Karthik

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

Ghanshyam Saini (2025). Credit Card Fraud Detection Dataset [Dataset]. https://www.kaggle.com/datasets/ghnshymsaini/credit-card-fraud-detection-dataset

Credit Card Fraud Detection Dataset

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 15, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Ghanshyam Saini

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

Credit Card Fraud Detection Dataset (European Cardholders, September 2013)

As a data contributor, I'm sharing this crucial dataset focused on the detection of fraudulent credit card transactions. Recognizing these illicit activities is paramount for protecting customers and the integrity of financial systems.

About the Dataset:

This dataset encompasses credit card transactions made by European cardholders during a two-day period in September 2013. It presents a real-world scenario with a significant class imbalance, where fraudulent transactions are considerably less frequent than legitimate ones. Out of a total of 284,807 transactions, only 492 are instances of fraud, representing a mere 0.172% of the entire dataset.

Content of the Data:

Due to confidentiality concerns, the majority of the input features in this dataset have undergone a Principal Component Analysis (PCA) transformation. This means the original meaning and context of features V1, V2, ..., V28 are not directly provided. However, these principal components capture the variance in the underlying transaction data.

The only features that have not been transformed by PCA are:

Time: Numerical. Represents the number of seconds elapsed between each transaction and the very first transaction recorded in the dataset.
Amount: Numerical. The transaction amount in Euros (€). This feature could be valuable for cost-sensitive learning approaches.

The target variable for this classification task is:

Class: Integer. Takes the value 1 in the case of a fraudulent transaction and 0 otherwise.

Important Note on Evaluation:

Given the substantial class imbalance (far more legitimate transactions than fraudulent ones), traditional accuracy metrics based on the confusion matrix can be misleading. It is strongly recommended to evaluate models using the Area Under the Precision-Recall Curve (AUPRC), as this metric is more sensitive to the performance on the minority class (fraudulent transactions).

How to Use This Dataset:

Download the dataset file (likely in CSV format).
Load the data using libraries like Pandas.
Understand the class imbalance: Be aware that fraudulent transactions are rare.
Explore the features: Analyze the distributions of 'Time', 'Amount', and the PCA-transformed features (V1-V28).
Address the class imbalance: Consider using techniques like oversampling the minority class, undersampling the majority class, or using specialized algorithms designed for imbalanced datasets.
Build and train binary classification models to predict the 'Class' variable.
Evaluate your models using AUPRC to get a meaningful assessment of performance in detecting fraud.

Acknowledgements and Citation:

This dataset has been collected and analyzed through a research collaboration between Worldline and the Machine Learning Group (MLG) of ULB (Université Libre de Bruxelles).

When using this dataset in your research or projects, please cite the following works as appropriate:

Andrea Dal Pozzolo, Olivier Caelen, Reid A. Johnson and Gianluca Bontempi. Calibrating Probability with Undersampling for Unbalanced Classification. In Symposium on Computational Intelligence and Data Mining (CIDM), IEEE, 2015.
Dal Pozzolo, Andrea; Caelen, Olivier; Le Borgne, Yann-Ael; Waterschoot, Serge; Bontempi, Gianluca. Learned lessons in credit card fraud detection from a practitioner perspective, Expert systems with applications,41,10,4915-4928,2014, Pergamon.
Dal Pozzolo, Andrea; Boracchi, Giacomo; Caelen, Olivier; Alippi, Cesare; Bontempi, Gianluca. Credit card fraud detection: a realistic modeling and a novel learning strategy, IEEE transactions on neural networks and learning systems,29,8,3784-3797,2018,IEEE.
Andrea Dal Pozzolo. Adaptive Machine learning for credit card fraud detection ULB MLG PhD thesis (supervised by G. Bontempi).
Fabrizio Carcillo, Andrea Dal Pozzolo, Yann-Aël Le Borgne, Olivier Caelen, Yannis Mazzer, Gianluca Bontempi. Scarff: a scalable framework for streaming credit card fraud detection with Spark, Information fusion,41, 182-194,2018,Elsevier.
Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Gianluca Bontempi. Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization, International Journal of Data Science and Analytics, 5,4,285-300,2018,Springer International Publishing.
Bertrand Lebichot, Yann-Aël Le Borgne, Liyun He, Frederic Oblé, Gianluca Bontempi Deep-Learning Domain Adaptation Techniques for Credit Cards Fraud Detection, INNSBDDL 2019: Recent Advances in Big Data and Deep Learning, pp 78-88, 2019.
Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Frederic Oblé, Gianluca Bontempi *Combining Unsupervised and Supervised...

Clear search

Close search

Google apps

Main menu

Credit Card Fraud Detection Dataset

Credit Card Fraud Detection Dataset (European Cardholders, September 2013)

Credit Card Fraud Detection

Credit Card Fraud Detection Platform Report

Credit card fraud detection Date 25th of June 2015

Credit Card Fraud Detection Dataset

Dataset

Contents

Credit Card Fraud Detection

1. Dataset Description

2. Technical Details

3. Further Details

Complète Marché de la plate-forme de détection de fraude par carte de crédit...

creditcard Dataset

Data from: Credit Card Transactions Dataset

Credit Card Fraud Detection Platform Report

Credit Card Fraud Detection Platform Report

Credit Card Fraud Detection

A Novel Credit Card Fraud Detection Method

Credit Card Fraud Patterns

Credit Card Transaction Fraud Flags

Global Credit Card Fraud Detection Platform Market Size By Deployment...

CCFD_dataset

Abstract data set for Credit card fraud detection

Dataset

Contents

Card fraud in the U.S. versus rest of the world 2014-2023, with global...

credit card fraud detection

Dataset

Contents

Credit Card Fraud Detection Dataset

Credit Card Fraud Detection Dataset (European Cardholders, September 2013)