100+ datasets found
  1. t

    Credit Card Fraud Detection

    • test.researchdata.tuwien.ac.at
    • kaggle.com
    csv, json, pdf +2
    Updated Apr 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja (2025). Credit Card Fraud Detection [Dataset]. http://doi.org/10.82556/yvxj-9t22
    Explore at:
    text/markdown, csv, pdf, txt, jsonAvailable download formats
    Dataset updated
    Apr 28, 2025
    Dataset provided by
    TU Wien
    Authors
    Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Apr 28, 2025
    Description

    Below is a draft DMP–style description of your credit‐card fraud detection experiment, modeled on the antiquities example:

    1. Dataset Description

    Research Domain
    This work resides in the domain of financial fraud detection and applied machine learning. We focus on detecting anomalous credit‐card transactions in real time to reduce financial losses and improve trust in digital payment systems.

    Purpose
    The goal is to train and evaluate a binary classification model that flags potentially fraudulent transactions. By publishing both the code and data splits via FAIR repositories, we enable reproducible benchmarking of fraud‐detection algorithms and support future research on anomaly detection in transaction data.

    Data Sources
    We used the publicly available credit‐card transaction dataset from Kaggle (original source: https://www.kaggle.com/mlg-ulb/creditcardfraud), which contains anonymized transactions made by European cardholders over two days in September 2013. The dataset includes 284 807 transactions, of which 492 are fraudulent.

    Method of Dataset Preparation

    1. Schema validation: Renamed columns to snake_case (e.g. transaction_amount, is_declined) so they conform to DBRepo’s requirements.

    2. Data import: Uploaded the full CSV into DBRepo, assigned persistent identifiers (PIDs).

    3. Splitting: Programmatically derived three subsets—training (70%), validation (15%), test (15%)—using range‐based filters on the primary key actionnr. Each subset was materialized in DBRepo and assigned its own PID for precise citation.

    4. Cleaning: Converted the categorical flags (is_declined, isforeigntransaction, ishighriskcountry, isfradulent) from “Y”/“N” to 1/0 and dropped non‐feature identifiers (actionnr, merchant_id).

    5. Modeling: Trained a RandomForest classifier on the training split, tuned on validation, and evaluated on the held‐out test set.

    2. Technical Details

    Dataset Structure

    • The raw data is a single CSV with columns:

      • actionnr (integer transaction ID)

      • merchant_id (string)

      • average_amount_transaction_day (float)

      • transaction_amount (float)

      • is_declined, isforeigntransaction, ishighriskcountry, isfradulent (binary flags)

      • total_number_of_declines_day, daily_chargeback_avg_amt, sixmonth_avg_chbk_amt, sixmonth_chbk_freq (numeric features)

    Naming Conventions

    • All columns use lowercase snake_case.

    • Subsets are named creditcard_training, creditcard_validation, creditcard_test in DBRepo.

    • Files in the code repo follow a clear structure:

      ├── data/         # local copies only; raw data lives in DBRepo 
      ├── notebooks/Task.ipynb 
      ├── models/rf_model_v1.joblib 
      ├── outputs/        # confusion_matrix.png, roc_curve.png, predictions.csv 
      ├── README.md 
      ├── requirements.txt 
      └── codemeta.json 
      

    Required Software

    • Python 3.9+

    • pandas, numpy (data handling)

    • scikit-learn (modeling, metrics)

    • matplotlib (visualizations)

    • dbrepo‐client.py (DBRepo API)

    • requests (TU WRD API)

    Additional Resources

    3. Further Details

    Data Limitations

    • Highly imbalanced: only ~0.17% of transactions are fraudulent.

    • Anonymized PCA features (V1V28) hidden; we extended with domain features but cannot reverse engineer raw variables.

    • Time‐bounded: only covers two days of transactions, may not capture seasonal patterns.

    Licensing and Attribution

    • Raw data: CC-0 (per Kaggle terms)

    • Code & notebooks: MIT License

    • Model artifacts & outputs: CC-BY 4.0

    • DUWRD records include ORCID identifiers for the author.

    Recommended Uses

    • Benchmarking new fraud‐detection algorithms on a standard imbalanced dataset.

    • Educational purposes: demonstrating model‐training pipelines, FAIR data practices.

    • Extension: adding time‐series or deep‐learning models.

    Known Issues

    • Possible temporal leakage if date/time features not handled correctly.

    • Model performance may degrade on live data due to concept drift.

    • Binary flags may oversimplify nuanced transaction outcomes.

  2. h

    creditcard-fraud-detection

    • huggingface.co
    Updated Aug 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Keith Gauvin (2024). creditcard-fraud-detection [Dataset]. https://huggingface.co/datasets/kgauvin603/creditcard-fraud-detection
    Explore at:
    Dataset updated
    Aug 29, 2024
    Authors
    Keith Gauvin
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    kgauvin603/creditcard-fraud-detection dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. CREDIT CARD FRAUD DETECTION (NEW)

    • kaggle.com
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    somnath paul 71 (2025). CREDIT CARD FRAUD DETECTION (NEW) [Dataset]. https://www.kaggle.com/datasets/somnathpaul71/credit-card-fraud-detection-new
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 25, 2025
    Dataset provided by
    Kaggle
    Authors
    somnath paul 71
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    🔍 Dataset Description: Credit Card Fraud Detection This dataset is designed for building and evaluating machine learning models for credit card fraud detection. It contains anonymized transaction records where the goal is to classify transactions as fraudulent (1) or non-fraudulent (0) based on several features.

    📁 Dataset Overview: Each row represents a single credit card transaction.

    Features include a mix of numerical and transformed variables (e.g., V1 to V28) derived from PCA for confidentiality.

    The Amount and Hour_of_Day features represent the transaction value and time, respectively.

    The Class column is the target variable:

    0 → Legitimate transaction

    1 → Fraudulent transaction

    ✅ Key Highlights: The dataset contains both classes (0 and 1) to ensure balanced evaluation for binary classification.

    Suitable for testing anomaly detection, binary classification, and imbalanced dataset handling techniques like SMOTE or under-sampling.

    Ideal for learners, researchers, and practitioners working on fraud detection in real-world scenarios.

    🧠 Suggested Use Cases: Model evaluation with metrics like precision, recall, F1-score (due to class imbalance).

    Experimentation with algorithms such as Logistic Regression, Random Forest, XGBoost, and Neural Networks.

    Feature engineering and explainability techniques (e.g., SHAP values).

  4. b

    Credit Card Fraud Detection

    • berd-platform.de
    csv
    Updated Jul 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaggle (2025). Credit Card Fraud Detection [Dataset]. http://doi.org/10.82939/qcqqe-g6q16
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Kaggle
    Description

    The dataset contains transactions made by credit cards in September 2013 by European cardholders. This dataset presents transactions that occurred in two days, where we have 492 frauds out of 284,807 transactions. The dataset is highly unbalanced, the positive class (frauds) account for 0.172% of all transactions. The dataset is 0.15 GB large.

    The dataset has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (http://mlg.ulb.ac.be) of ULB (Université Libre de Bruxelles) on big data mining and fraud detection.

  5. C

    Credit Card Fraud Detection Platform Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Credit Card Fraud Detection Platform Report [Dataset]. https://www.archivemarketresearch.com/reports/credit-card-fraud-detection-platform-56852
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global credit card fraud detection platform market is experiencing robust growth, driven by the escalating volume of digital transactions and the increasing sophistication of fraudulent activities. While precise figures for market size and CAGR are not provided, based on industry reports and observed trends, a reasonable estimation places the 2025 market size at approximately $15 billion. Considering the rapid adoption of advanced technologies like AI and machine learning in fraud detection, a conservative Compound Annual Growth Rate (CAGR) of 15% is projected for the forecast period (2025-2033). This growth is fueled by several factors, including the rising prevalence of e-commerce, the expanding adoption of mobile payments, and the increasing demand for robust security solutions from both personal and enterprise users. The market is segmented by screening type (manual and automatic) and application (personal and enterprise), with the automatic screening and enterprise segments expected to witness faster growth due to their efficiency and scalability. The competitive landscape is highly dynamic, with a mix of established players like Visa, Mastercard, and FICO, alongside innovative technology companies like Kount, Riskified, and Feedzai. These companies are continuously developing and deploying advanced algorithms and analytics to stay ahead of evolving fraud techniques. Regional growth varies, with North America and Europe currently holding significant market share, though Asia-Pacific is projected to exhibit rapid expansion due to increasing internet penetration and e-commerce adoption in developing economies. Challenges to market growth include the high cost of implementation and maintenance of these platforms, along with the need for continuous updates to counter evolving fraud tactics. However, the increasing financial losses incurred due to fraud are incentivizing businesses and consumers to invest in more sophisticated fraud detection solutions, thereby sustaining the market's upward trajectory.

  6. f

    Performance comparison with other credit card fraud detection dataset.

    • plos.figshare.com
    xls
    Updated Jul 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Al Mahmud Siam; Pankaj Bhowmik; Md Palash Uddin (2025). Performance comparison with other credit card fraud detection dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0326975.t015
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jul 16, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Al Mahmud Siam; Pankaj Bhowmik; Md Palash Uddin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Performance comparison with other credit card fraud detection dataset.

  7. C

    Credit Card Fraud Detection Platform Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jan 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Credit Card Fraud Detection Platform Report [Dataset]. https://www.datainsightsmarket.com/reports/credit-card-fraud-detection-platform-1969188
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    Jan 15, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global credit card fraud detection platform market is projected to reach a value of USD 10.3 billion by 2033, exhibiting a CAGR of 12.5% during the forecast period (2023-2033). The increasing adoption of digital payment methods, the growing number of online transactions, and the rising incidences of fraudulent activities are driving the market's growth. However, factors such as data privacy concerns and the high cost of implementing these platforms may hinder the market's growth. The market is segmented based on application, type, and region. The application segments include e-commerce, banking and financial institutions, and other applications. The e-commerce segment holds the largest market share due to the increasing popularity of online shopping. The types segment includes rule-based systems, statistical techniques, machine learning algorithms, and others. Machine learning algorithms are expected to witness the highest growth rate during the forecast period due to their ability to learn from data and identify fraudulent patterns accurately. The regional segments include North America, Europe, Asia Pacific, and the Rest of the World. North America currently dominates the market and is expected to maintain its dominance throughout the forecast period.

  8. c

    Data from: Credit Card Transactions Dataset

    • cubig.ai
    zip
    Updated May 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CUBIG (2025). Credit Card Transactions Dataset [Dataset]. https://cubig.ai/store/products/336/credit-card-transactions-dataset
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 28, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service

    Measurement technique
    Privacy-preserving data transformation via differential privacy, Synthetic data generation using AI techniques for model training
    Description

    1) Data Introduction • The Credit Card Transactions Dataset includes more than 20 million credit card transactions over the decades of 2,000 U.S. resident consumers created by IBM's simulations, providing details of each transaction and fraudulent labels.

    2) Data Utilization (1) Credit Card Transactions Dataset has characteristics that: • This dataset provides a variety of properties that are similar to real credit card transactions, including transaction amount, time, card information, purchase location, and store category (MCC). (2) Credit Card Transactions Dataset can be used to: • Development of Credit Card Fraud Detection Model: Using transaction history and properties, you can build a fraud (abnormal transaction) detection model based on machine learning. • Analysis of consumption patterns and risks: Long-term and diverse transaction data can be used to analyze customer consumption behavior and identify risk factors.

  9. Credit Card Fraud Detection

    • kaggle.com
    Updated Nov 20, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sriseshagiri (2021). Credit Card Fraud Detection [Dataset]. https://www.kaggle.com/datasets/sriseshagiri/credit-card-fraud-detection
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 20, 2021
    Dataset provided by
    Kaggle
    Authors
    Sriseshagiri
    Description

    Dataset

    This dataset was created by Sriseshagiri

    Contents

  10. CCFD_dataset

    • figshare.com
    xlsx
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nur Amirah Ishak; Keng-Hoong Ng; Gee-Kok Tong; Suraya Nurain Kalid; Kok-Chin Khor (2023). CCFD_dataset [Dataset]. http://doi.org/10.6084/m9.figshare.16695616.v3
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Nur Amirah Ishak; Keng-Hoong Ng; Gee-Kok Tong; Suraya Nurain Kalid; Kok-Chin Khor
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The dataset has been released by [1], which had been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (http://mlg.ulb.ac.be) of Université Libre de Bruxelles (ULB) on big data mining and fraud detection. [1] Pozzolo, A. D., Caelan, O., Johnson, R. A., and Bontempi, G. (2015). Calibrating Probability with Undersampling for Unbalanced Classification. 2015 IEEE Symposium Series on Computational, pp. 159-166, doi: 10.1109/SSCI.2015.33 open source kaggle : https://www.kaggle.com/mlg-ulb/creditcardfraud

  11. Credit Card Fraud Dataset

    • kaggle.com
    Updated Sep 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cyber Cop (2021). Credit Card Fraud Dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/2624805
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 17, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Cyber Cop
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Context

    The context of this dataset is to find fraudulent credit cards by analyzing the features. The detection of fraudulent credit card can be done using ML or DL.

    Acknowledgements

    The data actually collected from Weka Repository: https://weka.8497.n7.nabble.com/file/n23121/credit_fruad.arff

  12. Credit Card Fraud Detection

    • kaggle.com
    Updated Oct 2, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AMAN GOEL (2020). Credit Card Fraud Detection [Dataset]. https://www.kaggle.com/amangoel766/credit-card-fraud-detection/activity
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 2, 2020
    Dataset provided by
    Kaggle
    Authors
    AMAN GOEL
    Description

    Dataset

    This dataset was created by AMAN GOEL

    Contents

  13. m

    Comprehensive Credit Card Fraud Detection Platform Market Size, Share &...

    • marketresearchintellect.com
    Updated Jul 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Comprehensive Credit Card Fraud Detection Platform Market Size, Share & Industry Insights 2033 [Dataset]. https://www.marketresearchintellect.com/product/credit-card-fraud-detection-platform-market/
    Explore at:
    Dataset updated
    Jul 7, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Market Research Intellect's Credit Card Fraud Detection Platform Market Report highlights a valuation of USD 3.5 billion in 2024 and anticipates growth to USD 8.2 billion by 2033, with a CAGR of 10.5% from 2026-2033.Explore insights on demand dynamics, innovation pipelines, and competitive landscapes.

  14. G

    Credit Card Fraud Detection

    • gomask.ai
    csv, json
    Updated Jul 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Credit Card Fraud Detection [Dataset]. https://gomask.ai/marketplace/datasets/credit-card-fraud-detection
    Explore at:
    csv(10 MB), jsonAvailable download formats
    Dataset updated
    Jul 12, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    is_fraud, entry_mode, card_number, merchant_id, cardholder_id, currency_code, cardholder_age, transaction_id, is_international, transaction_city, and 7 more
    Description

    This dataset provides detailed, labeled records of simulated credit card transactions, including transaction amounts, merchant and cardholder information, and fraud indicators. It is ideal for developing and benchmarking machine learning models aimed at detecting fraudulent activity and reducing financial risk in payment systems. The inclusion of transaction context and cardholder demographics supports advanced analytics and feature engineering.

  15. Credit Card Fraud Detection Platform Market Growth Report 2034

    • polarismarketresearch.com
    Updated Oct 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Polaris Market Research & Consulting, Inc. (2025). Credit Card Fraud Detection Platform Market Growth Report 2034 [Dataset]. https://www.polarismarketresearch.com/industry-analysis/credit-card-fraud-detection-platform-market
    Explore at:
    Dataset updated
    Oct 14, 2025
    Dataset provided by
    Polaris Market Research & Consulting
    Authors
    Polaris Market Research & Consulting, Inc.
    License

    https://www.polarismarketresearch.com/privacy-policyhttps://www.polarismarketresearch.com/privacy-policy

    Description

    The global Credit Card Fraud Detection Platform Market size was valued at USD 3.59 billion in 2024 and is expected to grow at a CAGR of 15.3% from 2025 to 2034.

  16. Credit Card Fraud

    • kaggle.com
    Updated Jul 14, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Manish Kumar (2020). Credit Card Fraud [Dataset]. https://www.kaggle.com/datasets/hellbuoy/credit-card-fraud
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 14, 2020
    Dataset provided by
    Kaggle
    Authors
    Manish Kumar
    Description

    Dataset

    This dataset was created by Manish Kumar

    Contents

  17. Annual card fraud - credit cards and debit cards combined - worldwide...

    • statista.com
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Annual card fraud - credit cards and debit cards combined - worldwide 2014-2023 [Dataset]. https://www.statista.com/statistics/1394119/global-card-fraud-losses/
    Explore at:
    Dataset updated
    Jun 30, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Dec 2024
    Area covered
    Worldwide
    Description

    Card fraud losses across the world increased by more than ** percent between 2020 and 2021, the largest increase since 2018. It was estimated that merchants and card acquirers lost well over ** billion U.S. dollars, with - so the source adds - roughly ** billion U.S. dollar coming from the United States alone. Note that the figures provided here included both credit card fraud and debit card fraud. The source does not separate between the two, and also did not provide figures on the United States - a country known for its reliance on credit cards.

  18. G

    Credit Card Fraud Patterns

    • gomask.ai
    csv, json
    Updated Jul 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GoMask.ai (2025). Credit Card Fraud Patterns [Dataset]. https://gomask.ai/marketplace/datasets/credit-card-fraud-patterns
    Explore at:
    json, csv(10 MB)Available download formats
    Dataset updated
    Jul 12, 2025
    Dataset provided by
    GoMask.ai
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Variables measured
    is_fraud, device_id, is_online, entry_mode, fraud_type, card_number, merchant_id, cardholder_id, currency_code, location_city, and 11 more
    Description

    This dataset contains simulated credit card transaction records, including detailed information on transaction amounts, merchant details, geolocation, device usage, and fraud labels. It is designed for training and evaluating fraud detection models, supporting the identification of both typical and anomalous transaction patterns. The dataset is ideal for fintech AI development, security analytics, and research into payment fraud behaviors.

  19. Credit Card Fraud Detection

    • kaggle.com
    zip
    Updated Sep 14, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prasanna Venkatesh (2019). Credit Card Fraud Detection [Dataset]. https://www.kaggle.com/prasy46/credit-card-fraud-detection
    Explore at:
    zip(70543178 bytes)Available download formats
    Dataset updated
    Sep 14, 2019
    Authors
    Prasanna Venkatesh
    Description

    Data

    We provide you with a data set in CSV format. The data set contains 2 lakhh+ record train instances and 56 thousand test instance There are 31 input features, labeled V1 to V28 and Amount .

    The target variable is labeled Class.

    Task

    Create a Classification model to predict the target variable Class.

    1. A report - A Power point presentation
    2. Any custom code you used
    3. Instructions for me to run your model on a separate data set

    What should be in the report?

    1. List of any assumptions that you made
    2. Description of your methodology and solution path
    3. List of algorithms and techniques you used
    4. List of tools and frameworks you used
    5. Results and evaluation of your models

    How to evaluate the model

    1. Use the F1 Score for metrics
    2. Any other evaluation measure that you believe is appropriate other than Accuracy.
  20. Global Credit Card Fraud Detection Platform Market Size By Deployment...

    • verifiedmarketresearch.com
    pdf,excel,csv,ppt
    Updated Jul 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Verified Market Research (2025). Global Credit Card Fraud Detection Platform Market Size By Deployment (Cloud-Based, On-Premise), By Technology (Machine Learning, Rule-Based, Hybrid), By End-User (Banks, Payment Processors, E-commerce), By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/credit-card-fraud-detection-platform-market/
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Jul 14, 2025
    Dataset authored and provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2026 - 2032
    Area covered
    Global
    Description

    Credit Card Fraud Detection Platform Market size was valued at USD 3.4 Billion in 2024 and is projected to reach USD 12.44 Billion by 2032, growing at a CAGR of 17.6% during the forecast period 2026 to 2032.Global Credit Card Fraud Detection Platform Market Drivers:The market drivers for the credit card fraud detection platform market can be influenced by various factors. These may include:Rising Incidence of Online Payment Fraud: The increasing number of fraud attempts during online transactions pushes financial institutions to adopt platforms that monitor and detect unauthorized credit card activity in real time.Growth in E-Commerce Transactions: With more consumers shopping online, the volume of card-not-present transactions rises, creating higher exposure to fraud and driving demand for detection platforms to secure digital payments.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja (2025). Credit Card Fraud Detection [Dataset]. http://doi.org/10.82556/yvxj-9t22

Credit Card Fraud Detection

Explore at:
text/markdown, csv, pdf, txt, jsonAvailable download formats
Dataset updated
Apr 28, 2025
Dataset provided by
TU Wien
Authors
Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja; Ajdina Grizhja
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Time period covered
Apr 28, 2025
Description

Below is a draft DMP–style description of your credit‐card fraud detection experiment, modeled on the antiquities example:

1. Dataset Description

Research Domain
This work resides in the domain of financial fraud detection and applied machine learning. We focus on detecting anomalous credit‐card transactions in real time to reduce financial losses and improve trust in digital payment systems.

Purpose
The goal is to train and evaluate a binary classification model that flags potentially fraudulent transactions. By publishing both the code and data splits via FAIR repositories, we enable reproducible benchmarking of fraud‐detection algorithms and support future research on anomaly detection in transaction data.

Data Sources
We used the publicly available credit‐card transaction dataset from Kaggle (original source: https://www.kaggle.com/mlg-ulb/creditcardfraud), which contains anonymized transactions made by European cardholders over two days in September 2013. The dataset includes 284 807 transactions, of which 492 are fraudulent.

Method of Dataset Preparation

  1. Schema validation: Renamed columns to snake_case (e.g. transaction_amount, is_declined) so they conform to DBRepo’s requirements.

  2. Data import: Uploaded the full CSV into DBRepo, assigned persistent identifiers (PIDs).

  3. Splitting: Programmatically derived three subsets—training (70%), validation (15%), test (15%)—using range‐based filters on the primary key actionnr. Each subset was materialized in DBRepo and assigned its own PID for precise citation.

  4. Cleaning: Converted the categorical flags (is_declined, isforeigntransaction, ishighriskcountry, isfradulent) from “Y”/“N” to 1/0 and dropped non‐feature identifiers (actionnr, merchant_id).

  5. Modeling: Trained a RandomForest classifier on the training split, tuned on validation, and evaluated on the held‐out test set.

2. Technical Details

Dataset Structure

  • The raw data is a single CSV with columns:

    • actionnr (integer transaction ID)

    • merchant_id (string)

    • average_amount_transaction_day (float)

    • transaction_amount (float)

    • is_declined, isforeigntransaction, ishighriskcountry, isfradulent (binary flags)

    • total_number_of_declines_day, daily_chargeback_avg_amt, sixmonth_avg_chbk_amt, sixmonth_chbk_freq (numeric features)

Naming Conventions

  • All columns use lowercase snake_case.

  • Subsets are named creditcard_training, creditcard_validation, creditcard_test in DBRepo.

  • Files in the code repo follow a clear structure:

    ├── data/         # local copies only; raw data lives in DBRepo 
    ├── notebooks/Task.ipynb 
    ├── models/rf_model_v1.joblib 
    ├── outputs/        # confusion_matrix.png, roc_curve.png, predictions.csv 
    ├── README.md 
    ├── requirements.txt 
    └── codemeta.json 
    

Required Software

  • Python 3.9+

  • pandas, numpy (data handling)

  • scikit-learn (modeling, metrics)

  • matplotlib (visualizations)

  • dbrepo‐client.py (DBRepo API)

  • requests (TU WRD API)

Additional Resources

3. Further Details

Data Limitations

  • Highly imbalanced: only ~0.17% of transactions are fraudulent.

  • Anonymized PCA features (V1V28) hidden; we extended with domain features but cannot reverse engineer raw variables.

  • Time‐bounded: only covers two days of transactions, may not capture seasonal patterns.

Licensing and Attribution

  • Raw data: CC-0 (per Kaggle terms)

  • Code & notebooks: MIT License

  • Model artifacts & outputs: CC-BY 4.0

  • DUWRD records include ORCID identifiers for the author.

Recommended Uses

  • Benchmarking new fraud‐detection algorithms on a standard imbalanced dataset.

  • Educational purposes: demonstrating model‐training pipelines, FAIR data practices.

  • Extension: adding time‐series or deep‐learning models.

Known Issues

  • Possible temporal leakage if date/time features not handled correctly.

  • Model performance may degrade on live data due to concept drift.

  • Binary flags may oversimplify nuanced transaction outcomes.

Search
Clear search
Close search
Google apps
Main menu