27 datasets found

f
creditcard Dataset
figshare.com
csv
Updated Jun 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mohammad Shanaa; Sherief Abdallah (2025). creditcard Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.29270873.v1
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.29270873.v1
Dataset updated
Jun 9, 2025
Dataset provided by
figshare
Authors
Mohammad Shanaa; Sherief Abdallah
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Title: Credit Card Transactions Dataset for Fraud Detection (Used in: A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning)Description:This dataset, commonly known as creditcard.csv, contains anonymized credit card transactions made by European cardholders in September 2013. It includes 284,807 transactions, with 492 labeled as fraudulent. Due to confidentiality constraints, features have been transformed using PCA, except for 'Time' and 'Amount'.This dataset was used in the research article titled "A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning for Credit Card Fraud Detection". The study proposes an ensemble model integrating techniques such as Autoencoders, Isolation Forest, Local Outlier Factor, and supervised classifiers including XGBoost and Random Forest, aiming to improve the detection of rare fraudulent patterns while maintaining efficiency and scalability.Key Features:30 numerical input features (V1–V28, Time, Amount)Class label indicating fraud (1) or normal (0)Imbalanced class distribution typical in real-world fraud detectionUse Case:Ideal for benchmarking and evaluating anomaly detection and classification algorithms in highly imbalanced data scenarios.Source:Originally published by the Machine Learning Group at Université Libre de Bruxelles.https://www.kaggle.com/mlg-ulb/creditcardfraudLicense:This dataset is distributed for academic and research purposes only. Please cite the original source when using the dataset.
Credit Card Fraud Detection Dataset
kaggle.com
Updated May 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ghanshyam Saini (2025). Credit Card Fraud Detection Dataset [Dataset]. https://www.kaggle.com/datasets/ghnshymsaini/credit-card-fraud-detection-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 15, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Ghanshyam Saini
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Credit Card Fraud Detection Dataset (European Cardholders, September 2013)

As a data contributor, I'm sharing this crucial dataset focused on the detection of fraudulent credit card transactions. Recognizing these illicit activities is paramount for protecting customers and the integrity of financial systems.

About the Dataset:

This dataset encompasses credit card transactions made by European cardholders during a two-day period in September 2013. It presents a real-world scenario with a significant class imbalance, where fraudulent transactions are considerably less frequent than legitimate ones. Out of a total of 284,807 transactions, only 492 are instances of fraud, representing a mere 0.172% of the entire dataset.

Content of the Data:

Due to confidentiality concerns, the majority of the input features in this dataset have undergone a Principal Component Analysis (PCA) transformation. This means the original meaning and context of features V1, V2, ..., V28 are not directly provided. However, these principal components capture the variance in the underlying transaction data.

The only features that have not been transformed by PCA are:

Time: Numerical. Represents the number of seconds elapsed between each transaction and the very first transaction recorded in the dataset.

Amount: Numerical. The transaction amount in Euros (€). This feature could be valuable for cost-sensitive learning approaches.

The target variable for this classification task is:

Class: Integer. Takes the value 1 in the case of a fraudulent transaction and 0 otherwise.

Important Note on Evaluation:

Given the substantial class imbalance (far more legitimate transactions than fraudulent ones), traditional accuracy metrics based on the confusion matrix can be misleading. It is strongly recommended to evaluate models using the Area Under the Precision-Recall Curve (AUPRC), as this metric is more sensitive to the performance on the minority class (fraudulent transactions).

How to Use This Dataset:

Download the dataset file (likely in CSV format).

Load the data using libraries like Pandas.

Understand the class imbalance: Be aware that fraudulent transactions are rare.

Explore the features: Analyze the distributions of 'Time', 'Amount', and the PCA-transformed features (V1-V28).

Address the class imbalance: Consider using techniques like oversampling the minority class, undersampling the majority class, or using specialized algorithms designed for imbalanced datasets.

Build and train binary classification models to predict the 'Class' variable.

Evaluate your models using AUPRC to get a meaningful assessment of performance in detecting fraud.

Acknowledgements and Citation:

This dataset has been collected and analyzed through a research collaboration between Worldline and the Machine Learning Group (MLG) of ULB (Université Libre de Bruxelles).

When using this dataset in your research or projects, please cite the following works as appropriate:

Andrea Dal Pozzolo, Olivier Caelen, Reid A. Johnson and Gianluca Bontempi. Calibrating Probability with Undersampling for Unbalanced Classification. In Symposium on Computational Intelligence and Data Mining (CIDM), IEEE, 2015.

Dal Pozzolo, Andrea; Caelen, Olivier; Le Borgne, Yann-Ael; Waterschoot, Serge; Bontempi, Gianluca. Learned lessons in credit card fraud detection from a practitioner perspective, Expert systems with applications,41,10,4915-4928,2014, Pergamon.

Dal Pozzolo, Andrea; Boracchi, Giacomo; Caelen, Olivier; Alippi, Cesare; Bontempi, Gianluca. Credit card fraud detection: a realistic modeling and a novel learning strategy, IEEE transactions on neural networks and learning systems,29,8,3784-3797,2018,IEEE.

Andrea Dal Pozzolo. Adaptive Machine learning for credit card fraud detection ULB MLG PhD thesis (supervised by G. Bontempi).

Fabrizio Carcillo, Andrea Dal Pozzolo, Yann-Aël Le Borgne, Olivier Caelen, Yannis Mazzer, Gianluca Bontempi. Scarff: a scalable framework for streaming credit card fraud detection with Spark, Information fusion,41, 182-194,2018,Elsevier.

Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Gianluca Bontempi. Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization, International Journal of Data Science and Analytics, 5,4,285-300,2018,Springer International Publishing.

Bertrand Lebichot, Yann-Aël Le Borgne, Liyun He, Frederic Oblé, Gianluca Bontempi Deep-Learning Domain Adaptation Techniques for Credit Cards Fraud Detection, INNSBDDL 2019: Recent Advances in Big Data and Deep Learning, pp 78-88, 2019.

Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, Frederic Oblé, Gianluca Bontempi *Combining Unsupervised and Supervised...
Predicting Credit Card Customer Segmentation
kaggle.com
Updated Mar 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2024). Predicting Credit Card Customer Segmentation [Dataset]. https://www.kaggle.com/datasets/thedevastator/predicting-credit-card-customer-attrition-with-m
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 10, 2024
Dataset provided by
Kaggle
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Predicting Credit Card Customer Segmentation

Exploring Key Customer Characteristics

By [source]

About this dataset

This dataset contains a wealth of customer information collected from within a consumer credit card portfolio, with the aim of helping analysts predict customer attrition. It includes comprehensive demographic details such as age, gender, marital status and income category, as well as insight into each customer’s relationship with the credit card provider such as the card type, number of months on book and inactive periods. Additionally it holds key data about customers’ spending behavior drawing closer to their churn decision such as total revolving balance, credit limit, average open to buy rate and analyzable metrics like total amount of change from quarter 4 to quarter 1, average utilization ratio and Naive Bayes classifier attrition flag (Card category is combined with contacts count in 12months period alongside dependent count plus education level & months inactive). Faced with this set of useful predicted data points across multiple variables capture up-to-date information that can determine long term account stability or an impending departure therefore offering us an equipped understanding when seeking to manage a portfolio or serve individual customers

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset can be used to analyze the key factors that influence customer attrition. Analysts can use this dataset to understand customer demographics, spending patterns, and relationship with the credit card provider to better predict customer attrition.

Research Ideas

Using the customer demographics, such as gender, marital status, education level and income category to determine which customer demographic is more likely to churn.

Analyzing the customer’s spending behavior leading up to churning and using this data to better predict the likelihood of a customer of churning in the future.

Creating a classifier that can predict potential customers who are more susceptible to attrition based on their credit score, credit limit, utilization ratio and other spending behavior metrics over time; this could be used as an early warning system for predicting potential attrition before it happens

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: BankChurners.csv | Column name | Description | |:---------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------| | CLIENTNUM | Unique identifier for each customer. (Integer) | | Attrition_Flag | Flag indicating whether or not the customer has churned out. (Boolean) | | Customer_Age | Age of customer. (Integer) | | Gender | Gender of customer. (String) | | Dependent_count | Number of dependents that customer has. (Integer) | | Education_Level ...
credit card csv
kaggle.com
Updated Jun 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Patyam Satya Lokesh (2025). credit card csv [Dataset]. https://www.kaggle.com/datasets/patyamsatyalokesh/credit-card-csv
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 20, 2025
Dataset provided by
Kaggle
Authors
Patyam Satya Lokesh
License
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
Description
Dataset

This dataset was created by Patyam Satya Lokesh

Released under Database: Open Database, Contents: Database Contents

Contents
Data from: Credit-Card-Default
kaggle.com
Updated Aug 13, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad Waqas (2020). Credit-Card-Default [Dataset]. https://www.kaggle.com/datasets/hafizwaqas101/creditcarddefault
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 13, 2020
Dataset provided by
Kaggle
Authors
Muhammad Waqas
Description
Dataset

This dataset was created by Muhammad Waqas

Contents
Credit card fraud detection Date 25th of June 2015
kaggle.com
Updated Oct 29, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zohair ahmed (2023). Credit card fraud detection Date 25th of June 2015 [Dataset]. https://www.kaggle.com/datasets/qnqfbqfqo/credit-card-fraud-detection-date-25th-of-june-2015
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Oct 29, 2023
Dataset provided by
Kaggle
Authors
Zohair ahmed
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The datasets contains transactions made by credit cards in September 2013 by european cardholders. This dataset present transactions that occurred in two days, where we have 492 frauds out of 284,807 transactions. The dataset is highly unbalanced, the positive class (frauds) account for 0.172% of all transactions.

It contains only numerical input variables which are the result of a PCA transformation. Unfortunately, due to confidentiality issues, we cannot provide the original features and more background information about the data. Features V1, V2, ... V28 are the principal components obtained with PCA, the only features which have not been transformed with PCA are 'Time' and 'Amount'. Feature 'Time' contains the seconds elapsed between each transaction and the first transaction in the dataset. The feature 'Amount' is the transaction Amount, this feature can be used for example-dependant cost-senstive learning. Feature 'Class' is the response variable and it takes value 1 in case of fraud and 0 otherwise.

The dataset has been collected and analysed during a research collaboration of Worldline and the Machine Learning Group (mlg.ulb.ac.be) of ULB (Université Libre de Bruxelles) on big data mining and fraud detection. More details on current and past projects on related topics are available on http://mlg.ulb.ac.be/BruFence and http://mlg.ulb.ac.be/ARTML.
Credit Card Balance Prediction
kaggle.com
Updated Jul 13, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abdalrahman Ali El nashar (2022). Credit Card Balance Prediction [Dataset]. https://www.kaggle.com/datasets/abdalrahmanelnashar/credit-card-balance-prediction
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 13, 2022
Dataset provided by
Kaggle
Authors
Abdalrahman Ali El nashar
Description
This dataset contains information about credit card balance. This data can be used for a lot of purposes such as credit card balance prediction. The columns in the given dataset are as follows: Income: Income of the customer. Limit: Credit limit provided to the customer. Rating: The customer's credit rating. Cards: The number of credit cards the customer has. Age: Age of the customer. Education: Educational level of the customer. Gender: Sex of the customer. Student: If the customer is a student or not. Married: If the customer is married. Ethnicity: Ethnicity of the customer. Balance: Credit balance of the customer.

$ Income : num 14.9 106 104.6 148.9 55.9 ...

$ Limit : int 3606 6645 7075 9504 4897 8047 3388 7114 3300 6819 ...

$ Rating : int 283 483 514 681 357 569 259 512 266 491 ...

$ Cards : int 2 3 4 3 2 4 2 2 5 3 ...

$ Age : int 34 82 71 36 68 77 37 87 66 41 ...

$ Education: int 11 15 11 11 16 10 12 9 13 19 ...

$ Gender : Factor w/ 2 levels " Male","Female": 1 2 1 2 1 1 2 1 2 2 ...

$ Student : Factor w/ 2 levels "No","Yes": 1 2 1 1 1 1 1 1 1 2 ...

$ Married : Factor w/ 2 levels "No","Yes": 2 2 1 1 2 1 1 1 1 2 ...

$ Ethnicity: Factor w/ 3 levels "African American",..: 3 2 2 2 3 3 1 2 3 1 ...

$ Balance : int 333 903 580 964 331 1151 203 872 279 1350 ...
D
Student Banking and College Credit Card marketing Agreements Data
datalumos.org
delimited
Updated May 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Consumer Financial Protection Bureau (2025). Student Banking and College Credit Card marketing Agreements Data [Dataset]. http://doi.org/10.3886/E230722V2
Explore at:
delimitedAvailable download formats
Unique identifier
https://doi.org/10.3886/E230722V2
Dataset updated
May 21, 2025
Dataset authored and provided by
Consumer Financial Protection Bureau
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2009 - 2023
Description
Overview: This record includes three datasets collected by the Consumer Financial Protection Bureau: Marketing Agreements and DataStudent Banking Reports to CongressDeposit Product Marking Agreements and DataCollege credit card marketing agreements and dataAs required by the Credit CARD Act of 2009, the Consumer Financial Protection Bureau (CFPB) collects information annually from credit card issuers who have marketing agreements with universities, colleges, or affiliated organizations such as alumni associations, sororities, fraternities, and foundations.The CFPB intends to continue updating the CSV file each year as it collects new data from college credit card issuers. The CFPB intends to ensure that the publicly available dataset is as accurate and complete as possible. This means that the dataset (as well as some of the charts and figures in this report) may not be completely consistent with past iterations of this report because submitting entities sometimes make corrections to earlier submissions. In all cases, the CFPB intends for the public dataset to be the CFPB’s definitive account of the data and it will be updated each year as new data becomes availableStudent banking reports to CongressThe Dodd-Frank Wall Street Reform and Consumer Protection Act of 2010 instructs the Bureau to monitor for risks to consumers in the offering or provision of consumer financial products or services, particularly when those products pose a disproportionate risk to traditionally underserved populations.College deposit product marketing agreements and dataThis page presents information about banking products provided to college students pursuant to agreements between institutions of higher education and financial service providers and governed in part by the Department of Education's cash management regulations.The agreements and related information presented here are a sample of the data used in the CFPB's annual report to Congress and should not be considered comprehensive. The scope of the CFPB's observations was limited to the agreements and other public disclosures that were published by institutions related to each award year (interested parties should note that any information in place at the time of publication but absent from the institutional disclosures as of June of each award year may not have been evaluated). Nevertheless, review of publicly available information is helpful in providing an overview of significant market dynamics at a point in time.The CFPB intends to ensure that the publicly available dataset is as accurate and complete as possible. This means that the dataset may not be completely consistent with past iterations of this report because the CFPB sometimes makes corrections to the dataset. In all cases, the CFPB intends for the public dataset to be the CFPB’s definitive account of the data.
Familiarity of use of credit card numbers and CVV for authentication...
statista.com
Updated Aug 8, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Familiarity of use of credit card numbers and CVV for authentication Thailand 2019 [Dataset]. https://www.statista.com/statistics/1181793/thailand-share-familiarity-use-credit-card-cvv-for-payment-authentication/
Explore at:
Dataset updated
Aug 8, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2019
Area covered
Thailand
Description
In 2019, around **** percent of internet users in Thailand were familiar with the use of credit card numbers and CVV authentication for online payments. CVV stands for card verification value which is used for verifying that the customer has a physical credit or debit card.
Credit Card Fraud
kaggle.com
zip
Updated Sep 14, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Venkat Murali (2019). Credit Card Fraud [Dataset]. https://www.kaggle.com/datasets/venky12347/credit-card-fraud
Explore at:
zip(70543178 bytes)Available download formats
Dataset updated
Sep 14, 2019
Authors
Venkat Murali
Description
Data

We provide you with a data set in CSV format. The data set contains 2 lakhh+ record train instances and 56 thousand test instance There are 31 input features, labeled V1 to V28 and Amount .

The target variable is labeled Class.

Task - Create a Classification model to predict the target variable Class.

List of any assumptions that you made

Description of your methodology and solution path

List of algorithms and techniques you used

List of tools and frameworks you used

Results and evaluation of your models

How to evaluate the model 1. Use the F1 Score for metrics 2. Any other evaluation measure that you believe is appropriate other than Accuracy.
CreditCar_Fraud
kaggle.com
Updated Aug 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prasun Maity (2023). CreditCar_Fraud [Dataset]. https://www.kaggle.com/datasets/prasunmaity/creditcar-fraud
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 23, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Prasun Maity
Description
The provided JSON file, derived from the project available at the specified Kaggle link, has been transformed into a CSV format for ease of analysis. This dataset likely encompasses credit card fraud-related information. It is structured as a tabular collection of data, with rows representing individual instances and columns containing various attributes. This dataset may include details such as transaction timestamps, transaction amounts, merchant information, and features related to fraud detection. Researchers and analysts can utilize this CSV dataset to investigate patterns, trends, and anomalies related to credit card fraud. The transformation to CSV simplifies data manipulation and exploration, facilitating data-driven insights and potentially aiding in the development of fraud detection algorithms and strategies. SOURCE https://www.kaggle.com/datasets/joebeachcapital/credit-card-fraud
A
‘Credit Risk Classification Dataset’ analyzed by Analyst-2
analyst-2.ai
Updated Nov 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Credit Risk Classification Dataset’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-credit-risk-classification-dataset-a5f6/76e42b23/?iid=035-990&v=presentation
Explore at:
Dataset updated
Nov 13, 2021
Dataset authored and provided by
Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Analysis of ‘Credit Risk Classification Dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/praveengovi/credit-risk-classification-dataset on 30 September 2021.

--- Dataset description provided by original source is as follows ---

Context

This is Customer Transaction and Demographic related data , It holds Risky and Not Risky customer for specific banking products

Content

Dataset is small in nature , It helps budding data scientist 👨‍🔬 👩‍🔬& Data Analyst to experiment Machine Learning and Statistical modelling concept

Data:

payment_data.csv:

payment_data.csv: customer’s card payment history. id: customer id OVD_t1: number of times overdue type 1 OVD_t2: number of times overdue type 2 OVD_t3: number of times overdue type 3 OVD_sum: total overdue days pay_normal: number of times normal payment prod_code: credit product code prod_limit: credit limit of product update_date: account update date new_balance: current balance of product highest_balance: highest balance in history report_date: date of recent payment

customer_data.csv:

customer’s demographic data and category attributes which have been encoded. Category features are fea_1, fea_3, fea_5, fea_6, fea_7, fea_9. label is 1, the customer is in high credit risk label is 0, the customer is in low credit risk

Acknowledgements

Thanks to Google Datasets search

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?

This dataset help to find out weather customer is Credit Risky or Credit Worthy in Banking perspective

Q1 - What are the factors contributing to Credit Risky customer ? Q2 - Behaviour of Credit Worthy Customer ?

--- Original source retains full ownership of the source dataset ---
CFPB Student Banking and College Credit Card Marketing Agreements
datalumos.org
openicpsr.org
delimited
Updated Feb 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Consumer Finance Protection Bureau (2025). CFPB Student Banking and College Credit Card Marketing Agreements [Dataset]. http://doi.org/10.3886/E219741V1
Explore at:
delimitedAvailable download formats
Unique identifier
https://doi.org/10.3886/E219741V1
Dataset updated
Feb 16, 2025
Dataset provided by
Consumer Financial Protection Bureauhttp://www.consumerfinance.gov/
Authors
Consumer Finance Protection Bureau
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Institutions of higher education play a critical role in supporting and promoting students’ overall financial health and well-being. A growing body of evidence suggests that relatively small financial shocks may cause acute financial hardship for students, potentially derailing their academic pursuits.The Dodd-Frank Wall Street Reform and Consumer Protection Act of 2010 instructs the Bureau to monitor for risks to consumers in the offering or provision of consumer financial products or services, particularly when those products pose a disproportionate risk to traditionally underserved populations. Student banking reports to Congress These reports monitor the growth and impacts of financial products offered by or in conjunction with colleges, specifically focusing on marketing agreements for college-sponsored deposit and prepaid accounts and college-sponsored credit cards. College credit card marketing agreements and data As required by the Credit CARD Act of 2009, we collect information annually from credit card issuers who have marketing agreements with universities, colleges, or affiliated organizations such as alumni associations, sororities, fraternities, and foundations. We maintain publicly accessible files of the agreements.
W
Procurement Card Transactions Jan to Mar 2016
cloud.csiss.gmu.edu
html
Updated Dec 30, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
United Kingdom (2019). Procurement Card Transactions Jan to Mar 2016 [Dataset]. https://cloud.csiss.gmu.edu/uddi/dataset/procurement-card-transactions-jan-to-mar-2016
Explore at:
htmlAvailable download formats
Dataset updated
Dec 30, 2019
Dataset provided by
United Kingdom
License
http://reference.data.gov.uk/id/open-government-licencehttp://reference.data.gov.uk/id/open-government-licence
Description
CSV file detailing all individual transactions made by Surrey County Council using purchase cards (corporate credit card) during the period Jan-Mar 2016 (quarter 4). Purchase cards are used mainly by frontline services to support the work we do for residents. See Metadata tab for more details.

CSV file detailing all individual transactions made by Surrey County Council using purchase cards (corporate credit card) during the period Jan-Mar 2016 (quarter 4). Purchase cards are used mainly by frontline services to support the work we do for residents. Specific data schema details can be found on the Local Government Association's (LGA) website http://schemas.opendata.esd.org.uk/Spend.

The same information is available to download as 5 star Linked Data.

This data is published as part of Surrey's obligations for transparency, as set out in the Local Government Transparency Code 2014.

Update frequency: Quarterly

Review date: No later than end of the month after the quarter end

Temporal coverage: Q4 - Jan - Mar

Geographical coverage: pan-Surrey (though no spatial data published)

Data lineage: Data extracted from SAP, processed to remove irrelevant fields, personal data redacted and re-formatted according to LGA data schema (see above link)

Maintainer contact: Payments Team, Orbis Business Services
CFPB Credit Trends
datalumos.org
openicpsr.org
delimited
Updated Feb 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Consumer Finance Protection Bureau (2025). CFPB Credit Trends [Dataset]. http://doi.org/10.3886/E220144V1
Explore at:
delimitedAvailable download formats
Unique identifier
https://doi.org/10.3886/E220144V1
Dataset updated
Feb 19, 2025
Dataset provided by
Consumer Financial Protection Bureauhttp://www.consumerfinance.gov/
Authors
Consumer Finance Protection Bureau
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset provides access to data about general purpose credit cards, which are open-end loans used by consumers to pay for day-to-day expenses, finance purchases, or provide cash advances.

Credit Card Approval Prediction

kaggle.com

Updated Mar 24, 2020

Facebook

Twitter

Click to copy link

Link copied

Cite

Seanny (2020). Credit Card Approval Prediction [Dataset]. https://www.kaggle.com/datasets/rikdifos/credit-card-approval-prediction

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

Mar 24, 2020

Dataset provided by

Kaggle

Authors

Seanny

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

A Credit Card Dataset for Machine Learning!

Don't ask me where this data come from, the answer is I don't know!

Context

Credit score cards are a common risk control method in the financial industry. It uses personal information and data submitted by credit card applicants to predict the probability of future defaults and credit card borrowings. The bank is able to decide whether to issue a credit card to the applicant. Credit scores can objectively quantify the magnitude of risk.

Generally speaking, credit score cards are based on historical data. Once encountering large economic fluctuations. Past models may lose their original predictive power. Logistic model is a common method for credit scoring. Because Logistic is suitable for binary classification tasks and can calculate the coefficients of each feature. In order to facilitate understanding and operation, the score card will multiply the logistic regression coefficient by a certain value (such as 100) and round it.

At present, with the development of machine learning algorithms. More predictive methods such as Boosting, Random Forest, and Support Vector Machines have been introduced into credit card scoring. However, these methods often do not have good transparency. It may be difficult to provide customers and regulators with a reason for rejection or acceptance.

Task

Build a machine learning model to predict if an applicant is 'good' or 'bad' client, different from other tasks, the definition of 'good' or 'bad' is not given. You should use some techique, such as vintage analysis to construct you label. Also, unbalance data problem is a big problem in this task.

Content & Explanation

There're two tables could be merged by ID:

application_record.csv
Feature name	Explanation	Remarks
`ID`	Client number
`CODE_GENDER`	Gender
`FLAG_OWN_CAR`	Is there a car
`FLAG_OWN_REALTY`	Is there a property
`CNT_CHILDREN`	Number of children
`AMT_INCOME_TOTAL`	Annual income
`NAME_INCOME_TYPE`	Income category
`NAME_EDUCATION_TYPE`	Education level
`NAME_FAMILY_STATUS`	Marit...

Retail Credit Bank Data
kaggle.com
Updated Sep 10, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
SR (2021). Retail Credit Bank Data [Dataset]. https://www.kaggle.com/datasets/surekharamireddy/credit-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 10, 2021
Dataset provided by
Kaggle
Authors
SR
Description
Context

A retail bank would like to hire you to build a credit default model for their credit card portfolio. The bank expects the model to identify the consumers who are likely to default on their credit card payments over the next 12 months. This model will be used to reduce the bank’s future losses. The bank is willing to provide you with some sample datathat they can currently extract from their systems. This data set (credit_data.csv) consists of 13,444 observations with 14 variables.

Content

Based on the bank’s experience, the number of derogatory reports is a strong indicator of default. This is all that the information you are able to get from the bank at the moment. Currently, they do not have the expertise to provide any clarification on this data and are also unsure about other variables captured by their systems
AmExpert CodeLab 2021
kaggle.com
Updated Nov 24, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pradip Basak (2021). AmExpert CodeLab 2021 [Dataset]. https://www.kaggle.com/datasets/pradip11/amexpert-codelab-2021/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Nov 24, 2021
Dataset provided by
Kaggle
Authors
Pradip Basak
Description
Context

Credit card default risk is the chance that companies or individuals will not be able to return the money lent on time.

Content

This dataset contains the following files: - train.csv: 45528 x 19 - test.csv: 11383 x 18 - sample_submission.csv: 5 x 2

Acknowledgements

The dataset belongs to American Express. It's shared here only for educational purpose.

Inspiration

Find out which customer might default.
Online Sales Dataset - Popular Marketplace Data
kaggle.com
Updated May 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ShreyanshVerma27 (2024). Online Sales Dataset - Popular Marketplace Data [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/online-sales-dataset-popular-marketplace-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 25, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
ShreyanshVerma27
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.

Columns:

Order ID: Unique identifier for each sales order.

Date:Date of the sales transaction.

Category:Broad category of the product sold (e.g., Electronics, Home Appliances, Clothing, Books, Beauty Products, Sports).

Product Name:Specific name or model of the product sold.

Quantity:Number of units of the product sold in the transaction.

Unit Price:Price of one unit of the product.

Total Price: Total revenue generated from the sales transaction (Quantity * Unit Price).

Region:Geographic region where the transaction occurred (e.g., North America, Europe, Asia).

Payment Method: Method used for payment (e.g., Credit Card, PayPal, Debit Card).

Insights:

1. Analyze sales trends over time to identify seasonal patterns or growth opportunities.

2. Explore the popularity of different product categories across regions.

3. Investigate the impact of payment methods on sales volume or revenue.

4. Identify top-selling products within each category to optimize inventory and marketing strategies.

5. Evaluate the performance of specific products or categories in different regions to tailor marketing campaigns accordingly.
Retail Transactions Dataset
kaggle.com
Updated May 18, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prasad Patil (2024). Retail Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/prasad22/retail-transactions-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 18, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Prasad Patil
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset was created to simulate a market basket dataset, providing insights into customer purchasing behavior and store operations. The dataset facilitates market basket analysis, customer segmentation, and other retail analytics tasks. Here's more information about the context and inspiration behind this dataset:

Context:

Retail businesses, from supermarkets to convenience stores, are constantly seeking ways to better understand their customers and improve their operations. Market basket analysis, a technique used in retail analytics, explores customer purchase patterns to uncover associations between products, identify trends, and optimize pricing and promotions. Customer segmentation allows businesses to tailor their offerings to specific groups, enhancing the customer experience.

Inspiration:

The inspiration for this dataset comes from the need for accessible and customizable market basket datasets. While real-world retail data is sensitive and often restricted, synthetic datasets offer a safe and versatile alternative. Researchers, data scientists, and analysts can use this dataset to develop and test algorithms, models, and analytical tools.

Dataset Information:

The columns provide information about the transactions, customers, products, and purchasing behavior, making the dataset suitable for various analyses, including market basket analysis and customer segmentation. Here's a brief explanation of each column in the Dataset:

Transaction_ID: A unique identifier for each transaction, represented as a 10-digit number. This column is used to uniquely identify each purchase.

Date: The date and time when the transaction occurred. It records the timestamp of each purchase.

Customer_Name: The name of the customer who made the purchase. It provides information about the customer's identity.

Product: A list of products purchased in the transaction. It includes the names of the products bought.

Total_Items: The total number of items purchased in the transaction. It represents the quantity of products bought.

Total_Cost: The total cost of the purchase, in currency. It represents the financial value of the transaction.

Payment_Method: The method used for payment in the transaction, such as credit card, debit card, cash, or mobile payment.

City: The city where the purchase took place. It indicates the location of the transaction.

Store_Type: The type of store where the purchase was made, such as a supermarket, convenience store, department store, etc.

Discount_Applied: A binary indicator (True/False) representing whether a discount was applied to the transaction.

Customer_Category: A category representing the customer's background or age group.

Season: The season in which the purchase occurred, such as spring, summer, fall, or winter.

Promotion: The type of promotion applied to the transaction, such as "None," "BOGO (Buy One Get One)," or "Discount on Selected Items."

Use Cases:

Market Basket Analysis: Discover associations between products and uncover buying patterns.

Customer Segmentation: Group customers based on purchasing behavior.

Pricing Optimization: Optimize pricing strategies and identify opportunities for discounts and promotions.

Retail Analytics: Analyze store performance and customer trends.

Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.

Facebook

Twitter

Click to copy link

Link copied

Cite

Mohammad Shanaa; Sherief Abdallah (2025). creditcard Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.29270873.v1

creditcard Dataset

Explore at:

csvAvailable download formats

Unique identifier

https://doi.org/10.6084/m9.figshare.29270873.v1

Dataset updated

Jun 9, 2025

Dataset provided by

figshare

Authors

Mohammad Shanaa; Sherief Abdallah

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Title: Credit Card Transactions Dataset for Fraud Detection (Used in: A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning)Description:This dataset, commonly known as creditcard.csv, contains anonymized credit card transactions made by European cardholders in September 2013. It includes 284,807 transactions, with 492 labeled as fraudulent. Due to confidentiality constraints, features have been transformed using PCA, except for 'Time' and 'Amount'.This dataset was used in the research article titled "A Hybrid Anomaly Detection Framework Combining Supervised and Unsupervised Learning for Credit Card Fraud Detection". The study proposes an ensemble model integrating techniques such as Autoencoders, Isolation Forest, Local Outlier Factor, and supervised classifiers including XGBoost and Random Forest, aiming to improve the detection of rare fraudulent patterns while maintaining efficiency and scalability.Key Features:30 numerical input features (V1–V28, Time, Amount)Class label indicating fraud (1) or normal (0)Imbalanced class distribution typical in real-world fraud detectionUse Case:Ideal for benchmarking and evaluating anomaly detection and classification algorithms in highly imbalanced data scenarios.Source:Originally published by the Machine Learning Group at Université Libre de Bruxelles.https://www.kaggle.com/mlg-ulb/creditcardfraudLicense:This dataset is distributed for academic and research purposes only. Please cite the original source when using the dataset.

Clear search

Close search

Google apps

Main menu

creditcard Dataset

Credit Card Fraud Detection Dataset

Credit Card Fraud Detection Dataset (European Cardholders, September 2013)

Predicting Credit Card Customer Segmentation

Predicting Credit Card Customer Segmentation

Exploring Key Customer Characteristics

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

credit card csv

Dataset

Contents

Data from: Credit-Card-Default

Dataset

Contents

Credit card fraud detection Date 25th of June 2015

Credit Card Balance Prediction

$ Income : num 14.9 106 104.6 148.9 55.9 ...

$ Limit : int 3606 6645 7075 9504 4897 8047 3388 7114 3300 6819 ...

$ Rating : int 283 483 514 681 357 569 259 512 266 491 ...

$ Cards : int 2 3 4 3 2 4 2 2 5 3 ...

$ Age : int 34 82 71 36 68 77 37 87 66 41 ...

$ Education: int 11 15 11 11 16 10 12 9 13 19 ...

$ Gender : Factor w/ 2 levels " Male","Female": 1 2 1 2 1 1 2 1 2 2 ...

$ Student : Factor w/ 2 levels "No","Yes": 1 2 1 1 1 1 1 1 1 2 ...

$ Married : Factor w/ 2 levels "No","Yes": 2 2 1 1 2 1 1 1 1 2 ...

$ Ethnicity: Factor w/ 3 levels "African American",..: 3 2 2 2 3 3 1 2 3 1 ...

$ Balance : int 333 903 580 964 331 1151 203 872 279 1350 ...

Student Banking and College Credit Card marketing Agreements Data

Familiarity of use of credit card numbers and CVV for authentication...

Credit Card Fraud

CreditCar_Fraud

‘Credit Risk Classification Dataset’ analyzed by Analyst-2

Context

Content

Data:

payment_data.csv:

customer_data.csv:

Acknowledgements

Inspiration

CFPB Student Banking and College Credit Card Marketing Agreements

Procurement Card Transactions Jan to Mar 2016

CFPB Credit Trends

Credit Card Approval Prediction

A Credit Card Dataset for Machine Learning!

Context

Task

Content & Explanation

Retail Credit Bank Data

Context

Content

AmExpert CodeLab 2021

Context

Content

Acknowledgements

Inspiration

Online Sales Dataset - Popular Marketplace Data

Columns:

Insights:

Retail Transactions Dataset

Context:

Inspiration:

Dataset Information:

Use Cases:

Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.

creditcard Dataset

`Context:`

`Inspiration:`

`Dataset Information:`

`Use Cases:`