100+ datasets found
  1. Fraud Database

    • data.wu.ac.at
    • data.europa.eu
    Updated Aug 31, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Service Personnel and Veterans Agency (2013). Fraud Database [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/MjFmNDViNDYtOGNlNy00MDQzLTliODQtNmM5NTg5ODhkMThm
    Explore at:
    Dataset updated
    Aug 31, 2013
    Dataset provided by
    Service Personnel and Veterans Agency
    Description

    Details of fraud referrals relating to war pensions & compensation

  2. Credit fraud

    • kaggle.com
    zip
    Updated Mar 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    José Henrique Gaspar (2021). Credit fraud [Dataset]. https://www.kaggle.com/henriquegaspar/credit-fraud
    Explore at:
    zip(59081 bytes)Available download formats
    Dataset updated
    Mar 13, 2021
    Authors
    José Henrique Gaspar
    Description

    Dataset

    This dataset was created by José Henrique Gaspar

    Contents

  3. Fraud Database - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Aug 30, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2013). Fraud Database - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/fraud-database
    Explore at:
    Dataset updated
    Aug 30, 2013
    Dataset provided by
    CKANhttps://ckan.org/
    Description

    Details of fraud referrals relating to war pensions & compensation

  4. G

    Graph Databases for Fraud Detection Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Graph Databases for Fraud Detection Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/graph-databases-for-fraud-detection-market
    Explore at:
    csv, pptx, pdfAvailable download formats
    Dataset updated
    Aug 22, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Graph Databases for Fraud Detection Market Outlook



    As per our latest research, the global Graph Databases for Fraud Detection market size in 2024 stands at USD 1.12 billion, with the market demonstrating robust momentum. The sector is experiencing a compound annual growth rate (CAGR) of 22.7%, positioning it for substantial expansion. By 2033, the market is forecasted to reach a remarkable USD 8.65 billion, driven by the escalating sophistication of fraudulent activities and the increasing adoption of advanced analytics and artificial intelligence across industries. The primary growth factor is the urgent need for real-time, scalable, and highly accurate fraud detection solutions that can adapt to evolving threat landscapes, especially in sectors such as BFSI, retail, and healthcare.




    The proliferation of digital transactions, e-commerce, and online banking has led to a surge in complex fraud schemes, necessitating the deployment of advanced technologies like graph databases. These databases excel at mapping intricate relationships and patterns across vast datasets, making them indispensable in detecting organized fraud rings, identity theft, and money laundering operations. Their ability to visualize and analyze interconnected data points in real time significantly enhances the accuracy and speed of fraud detection, minimizing false positives and enabling proactive risk mitigation. Moreover, the integration of graph databases with machine learning and AI algorithms has further amplified their effectiveness, allowing organizations to uncover hidden patterns and anticipate fraudulent behaviors before they result in significant financial losses.




    Another key growth driver for the graph databases for fraud detection market is the increasing regulatory scrutiny and compliance requirements imposed by governments and international bodies. Financial institutions, healthcare providers, and e-commerce platforms are under mounting pressure to comply with anti-money laundering (AML), know your customer (KYC), and data privacy regulations. Graph databases provide the agility and transparency needed to trace the origin and flow of transactions, ensuring robust audit trails and facilitating regulatory reporting. As regulatory frameworks continue to evolve and penalties for non-compliance become more stringent, organizations are investing heavily in advanced fraud detection infrastructure, further fueling market growth.




    The rapid advancements in cloud computing and the widespread adoption of Software-as-a-Service (SaaS) models have also played a pivotal role in democratizing access to graph database solutions for fraud detection. Cloud-based deployment offers scalability, cost-effectiveness, and ease of integration with existing IT ecosystems, making it an attractive option for both large enterprises and small-to-medium businesses. Furthermore, the rise of API-driven architectures and microservices has enabled seamless interoperability between graph databases and other analytics tools, enhancing their utility across diverse industry verticals. As digital transformation accelerates globally, the demand for flexible, cloud-native fraud detection solutions is expected to witness exponential growth.




    Regionally, North America continues to dominate the graph databases for fraud detection market, accounting for the largest share in 2024, followed closely by Europe and the Asia Pacific. The presence of major financial institutions, advanced technological infrastructure, and a robust regulatory environment are key factors driving adoption in these regions. However, emerging economies in Asia Pacific and Latin America are rapidly catching up, propelled by the digitalization of banking and commerce, rising cybercrime rates, and increased investments in cybersecurity. The Middle East and Africa are also witnessing steady growth, albeit from a smaller base, as governments and enterprises prioritize fraud prevention as part of their digital agendas.





    Component Analysis



    The component segment of the graph databases

  5. Synthetic Financial Datasets For Fraud Detection

    • kaggle.com
    zip
    Updated Apr 3, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Edgar Lopez-Rojas (2017). Synthetic Financial Datasets For Fraud Detection [Dataset]. https://www.kaggle.com/datasets/ealaxi/paysim1
    Explore at:
    zip(186385561 bytes)Available download formats
    Dataset updated
    Apr 3, 2017
    Authors
    Edgar Lopez-Rojas
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    There is a lack of public available datasets on financial services and specially in the emerging mobile money transactions domain. Financial datasets are important to many researchers and in particular to us performing research in the domain of fraud detection. Part of the problem is the intrinsically private nature of financial transactions, that leads to no publicly available datasets.

    We present a synthetic dataset generated using the simulator called PaySim as an approach to such a problem. PaySim uses aggregated data from the private dataset to generate a synthetic dataset that resembles the normal operation of transactions and injects malicious behaviour to later evaluate the performance of fraud detection methods.

    Content

    PaySim simulates mobile money transactions based on a sample of real transactions extracted from one month of financial logs from a mobile money service implemented in an African country. The original logs were provided by a multinational company, who is the provider of the mobile financial service which is currently running in more than 14 countries all around the world.

    This synthetic dataset is scaled down 1/4 of the original dataset and it is created just for Kaggle.

    NOTE: Transactions which are detected as fraud are cancelled, so for fraud detection these columns (oldbalanceOrg, newbalanceOrig, oldbalanceDest, newbalanceDest ) must not be used.

    Headers

    This is a sample of 1 row with headers explanation:

    1,PAYMENT,1060.31,C429214117,1089.0,28.69,M1591654462,0.0,0.0,0,0

    step - maps a unit of time in the real world. In this case 1 step is 1 hour of time. Total steps 744 (30 days simulation).

    type - CASH-IN, CASH-OUT, DEBIT, PAYMENT and TRANSFER.

    amount - amount of the transaction in local currency.

    nameOrig - customer who started the transaction

    oldbalanceOrg - initial balance before the transaction

    newbalanceOrig - new balance after the transaction.

    nameDest - customer who is the recipient of the transaction

    oldbalanceDest - initial balance recipient before the transaction. Note that there is not information for customers that start with M (Merchants).

    newbalanceDest - new balance recipient after the transaction. Note that there is not information for customers that start with M (Merchants).

    isFraud - This is the transactions made by the fraudulent agents inside the simulation. In this specific dataset the fraudulent behavior of the agents aims to profit by taking control or customers accounts and try to empty the funds by transferring to another account and then cashing out of the system.

    isFlaggedFraud - The business model aims to control massive transfers from one account to another and flags illegal attempts. An illegal attempt in this dataset is an attempt to transfer more than 200.000 in a single transaction.

    Past Research

    There are 5 similar files that contain the run of 5 different scenarios. These files are better explained at my PhD thesis chapter 7 (PhD Thesis Available here http://urn.kb.se/resolve?urn=urn:nbn:se:bth-12932.

    We ran PaySim several times using random seeds for 744 steps, representing each hour of one month of real time, which matches the original logs. Each run took around 45 minutes on an i7 intel processor with 16GB of RAM. The final result of a run contains approximately 24 million of financial records divided into the 5 types of categories: CASH-IN, CASH-OUT, DEBIT, PAYMENT and TRANSFER.

    Acknowledgements

    This work is part of the research project ”Scalable resource-efficient systems for big data analytics” funded by the Knowledge Foundation (grant: 20140032) in Sweden.

    Please refer to this dataset using the following citations:

    PaySim first paper of the simulator:

    E. A. Lopez-Rojas , A. Elmir, and S. Axelsson. "PaySim: A financial mobile money simulator for fraud detection". In: The 28th European Modeling and Simulation Symposium-EMSS, Larnaca, Cyprus. 2016

  6. D

    Graph Databases For Fraud Detection Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Graph Databases For Fraud Detection Market Research Report 2033 [Dataset]. https://dataintelo.com/report/graph-databases-for-fraud-detection-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Graph Databases for Fraud Detection Market Outlook



    According to our latest research, the global graph databases for fraud detection market size reached USD 2.34 billion in 2024, reflecting the rapid adoption of advanced analytics in combating increasingly sophisticated fraud schemes. The market is set to expand at a robust CAGR of 22.1% from 2025 to 2033, propelled by heightened demand for real-time fraud detection and the proliferation of complex digital transactions. By 2033, the market is forecasted to reach USD 16.91 billion, underlining the critical role of graph database technologies in modern fraud prevention strategies. This growth trajectory is primarily driven by the escalating need for advanced, scalable, and intuitive data models that can efficiently uncover hidden relationships and patterns indicative of fraudulent activities.




    A significant growth factor for the graph databases for fraud detection market is the exponential rise in digital transactions and the parallel increase in cybercrime sophistication. As organizations across sectors such as banking, retail, and healthcare accelerate their digital transformation, they face unprecedented challenges in identifying and mitigating fraud that exploits complex, multi-layered networks of transactions and identities. Traditional relational databases often fall short in tracking and analyzing these intricate relationships in real time. In contrast, graph databases, with their inherent capability to map and analyze interconnected data points, offer a powerful solution for detecting anomalies and suspicious patterns indicative of fraud. The surge in online payments, peer-to-peer transactions, and digital onboarding further amplifies the need for robust fraud detection systems, positioning graph databases as a cornerstone technology for security-conscious enterprises.




    Another pivotal driver of market growth is the increasing regulatory scrutiny and compliance requirements imposed by governments and industry bodies worldwide. Financial institutions, in particular, are under mounting pressure to implement stringent anti-fraud measures and demonstrate proactive risk management practices. Graph databases facilitate compliance by enabling organizations to conduct comprehensive network analyses, trace the flow of funds, and generate detailed audit trails for regulatory reporting. The ability to seamlessly integrate with existing anti-money laundering (AML) and know-your-customer (KYC) frameworks enhances their value proposition. Additionally, the shift toward open banking and the proliferation of fintech innovations create new vectors for fraud, necessitating the adoption of advanced graph-based analytics to safeguard consumer trust and organizational reputations.




    The market is further buoyed by advancements in artificial intelligence (AI) and machine learning (ML), which, when combined with graph database architectures, significantly enhance the accuracy and efficiency of fraud detection mechanisms. AI-driven graph analytics can automatically learn from vast datasets, adapt to emerging fraud tactics, and provide actionable insights in real time. This synergy enables organizations to move beyond rule-based detection toward predictive and prescriptive analytics, reducing false positives and operational overhead. The growing availability of cloud-based graph database solutions also lowers entry barriers, allowing small and medium-sized enterprises (SMEs) to leverage cutting-edge fraud detection capabilities without substantial upfront investments in infrastructure.




    From a regional perspective, North America continues to dominate the graph databases for fraud detection market, accounting for the largest share in 2024. This leadership is attributed to the region's advanced digital infrastructure, high adoption rates of graph technology among financial institutions, and stringent regulatory frameworks. Europe follows closely, driven by robust data protection laws and a proactive stance on cybersecurity. The Asia Pacific region is emerging as the fastest-growing market, fueled by rapid digitalization, expanding e-commerce ecosystems, and increasing awareness of fraud risks. Latin America and the Middle East & Africa, while smaller in market share, are witnessing steady growth as organizations in these regions ramp up investments in fraud prevention and digital security.



    Component Analysis



    The graph databases for fraud detection market is segmented by component into softwar

  7. 21st Century Corporate Financial Fraud, United States, 2005-2010

    • catalog.data.gov
    • icpsr.umich.edu
    Updated Nov 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Justice (2025). 21st Century Corporate Financial Fraud, United States, 2005-2010 [Dataset]. https://catalog.data.gov/dataset/21st-century-corporate-financial-fraud-united-states-2005-2010-22a9e
    Explore at:
    Dataset updated
    Nov 14, 2025
    Dataset provided by
    National Institute of Justicehttp://nij.ojp.gov/
    Area covered
    United States
    Description

    The Corporate Financial Fraud project is a study of company and top-executive characteristics of firms that ultimately violated Securities and Exchange Commission (SEC) financial accounting and securities fraud provisions compared to a sample of public companies that did not. The fraud firm sample was identified through systematic review of SEC accounting enforcement releases from 2005-2010, which included administrative and civil actions, and referrals for criminal prosecution that were identified through mentions in enforcement release, indictments, and news searches. The non-fraud firms were randomly selected from among nearly 10,000 US public companies censused and active during at least one year between 2005-2010 in Standard and Poor's Compustat data. The Company and Top-Executive (CEO) databases combine information from numerous publicly available sources, many in raw form that were hand-coded (e.g., for fraud firms: Accounting and Auditing Enforcement Releases (AAER) enforcement releases, investigation summaries, SEC-filed complaints, litigation proceedings and case outcomes). Financial and structural information on companies for the year leading up to the financial fraud (or around year 2000 for non-fraud firms) was collected from Compustat financial statement data on Form 10-Ks, and supplemented by hand-collected data from original company 10-Ks, proxy statements, or other financial reports accessed via Electronic Data Gathering, Analysis, and Retrieval (EDGAR), SEC's data-gathering search tool. For CEOs, data on personal background characteristics were collected from Execucomp and BoardEx databases, supplemented by hand-collection from proxy-statement biographies.

  8. [🛞🕵🏻 Fraudulent european roulette database]

    • kaggle.com
    zip
    Updated Jul 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Pérez Hernández (2024). [🛞🕵🏻 Fraudulent european roulette database] [Dataset]. https://www.kaggle.com/datasets/danielprezhernndez/fraudulent-european-roulette-dataset
    Explore at:
    zip(9505903 bytes)Available download formats
    Dataset updated
    Jul 9, 2024
    Authors
    Daniel Pérez Hernández
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Deception in Roulette: A Look at Random Number Manipulation

    Roulette has been a cornerstone in the study of randomness and statistics since its invention, influencing not only physical casinos but also online platforms. I have created a unique dataset that simulates a roulette wheel, not only to explore the random generation of numbers but also to illustrate how certain techniques can be easily employed by online casinos for fraudulent activities.

    Dynamic Factors at Play:

    -Temporal and Climatic Variables: Each spin is precisely recorded, integrating sports results and weather conditions that influence fraud techniques.

    -Dynamic Fraud Techniques: I have created 53 different fraud techniques, including 5 advanced hybrid techniques that combine various manipulation methods. I select and change fraud techniques daily, adjusting them according to the 'peak hours' of casino traffic to reflect realistic manipulation methods.

    -Influence of Historical Results: I use spin histories to determine 'hot' (more frequent) and 'cold' (less frequent) numbers, which are key to deciding the fraud techniques at any given moment.

    -Distributions and Biases: The distributions of resulting numbers are adjusted based on these analyses, showing how historical information can be used to manipulate future results.

    -Majority of Legitimate Spins: Almost 95% of the spins in this dataset are completely legitimate, without any manipulation, reflecting the normal operation of a roulette wheel.

    -Fraud Concentrated During Peak Hours, Weeks, Months, and Days: The remaining 5% corresponds to fraudulent spins, strategically distributed during peak hours, weeks, months, and days, covering a period of one year. This proportion highlights the importance of thoroughly auditing these high-activity periods.

    I would love to see more studies on this database, so I encourage everyone who reads this post to share the insights you discover.

    Here is the list of strategies used in the dataset (some of them are not as intuitive as they might seem by their names):

    0 == No Fraud 1. 'number_bias' 2. 'predictable_sequences' 3. 'color_omission' 4. 'low_range_bias' 5. 'sequence_repetition' 6. 'cyclic_alteration' 7. 'day_night_bias' 8. 'altered_zero_frequency' 9. 'random_alterations' 10. 'temporal_bias' 11. 'day_hour_bias' 12. 'day_of_week_bias' 13. 'day_of_month_bias' 14. 'bimodal_distribution' 15. 'fibonacci_bias' 16. 'parity_alteration' 17. 'prime_sequence' 18. 'double_sinusoidal_distribution' 19. 'normal_distribution' 20. 'time_series_patterns' 21. 'adaptive_variation' 22. 'wear_simulation' 23. 'advanced_hybrid_1' 24. 'advanced_hybrid_2' 25. 'advanced_hybrid_3' 26. 'advanced_hybrid_4' 27. 'advanced_hybrid_5' 28. 'previous_result_sum_bias' 29. 'special_dates_bias' 30. 'weighted_global_events_distribution' 31. 'previous_winning_combinations_bias' 32. 'sentiment_analysis_alteration' 33. 'weighted_day_of_month_bias' 34. 'weather_patterns_bias' 35. 'weighted_hour_of_day_distribution' 36. 'sports_events_bias' 37. 'lunar_cycles_modulation' 38. 'high_range_bias' 39. 'inverse_prime_sequence' 40. 'alternate_parity_bias' 41. 'zero_series_frequency' 42. 'game_history_bias' 43. 'gaussian_noise_modulation' 44. 'time_weighted_distribution_bias' 45. 'last_digit_bias' 46. 'cumulative_temporal_bias' 47. 'hidden_previous_results_patterns' 48. 'weighted_hot_cold_oscillation' 49. 'adaptive_hot_cold_sequence' 50. 'cold_number_mirage' 51. 'hot_number_evasion' 52. 'false_cold' 53. 'hot_deviation'

    Attached is an example of analysis for a specific hour using a specific strategy, in this case, "double_sinusoidal_distribution":

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F9698182%2Ff536eaa650aeebb5737a9d9a2ec53665%2Foutputexample.png?generation=1720566276284440&alt=media" alt="">

  9. Controlling Fraud in Small Business Health Benefits Programs in the United...

    • icpsr.umich.edu
    • catalog.data.gov
    ascii, sas, spss
    Updated Jun 16, 1999
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tillman, Robert (1999). Controlling Fraud in Small Business Health Benefits Programs in the United States, 1990-1996 [Dataset]. http://doi.org/10.3886/ICPSR02627.v1
    Explore at:
    spss, ascii, sasAvailable download formats
    Dataset updated
    Jun 16, 1999
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    Authors
    Tillman, Robert
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/2627/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/2627/terms

    Time period covered
    1990 - 1996
    Area covered
    United States
    Description

    The focus of this project was insider fraud -- crimes committed by the owners and operators of insurance companies that were established for the purposes of defrauding businesses and employees. The quantitative data for this collection were taken from a database maintained by the National Association of Insurance Commissioners (NAIC), an organization that represents state insurance departments collectively and acts as a clearinghouse for information obtained from individual departments. Created in 1988, the Regulatory Information Retrieval System (RIRS) database contains information on actions taken by state insurance departments against individuals and firms, including cease and desist orders, license revocations, fines, and penalties imposed. Data available for this project include a total of 123 actions taken against firms labeled as Multiple Employer Welfare Arrangements or Multiple Employer Trusts (MEWA/MET) in the RIRS database. Variables available in this data collection include the date action was taken, state where action was taken, dollar amount of the penalty imposed in the action, and disposition for action taken.

  10. a

    Corporate Laundering Database

    • amlnetwork.org
    csv, pdf, tsv, xlsx
    Updated Nov 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AML Network (2025). Corporate Laundering Database [Dataset]. https://amlnetwork.org/watchdog-database/corporate-laundering/
    Explore at:
    tsv, xlsx, pdf, csvAvailable download formats
    Dataset updated
    Nov 11, 2025
    Dataset authored and provided by
    AML Network
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Discover a dataset of companies and institutions tied to corporate laundering, with names, countries, and AML network risk ratings for compliance insight.

  11. National Investigative Case Management System

    • catalog.data.gov
    • data.wu.ac.at
    Updated Nov 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Social Security Administration (2025). National Investigative Case Management System [Dataset]. https://catalog.data.gov/dataset/national-investigative-case-management-system
    Explore at:
    Dataset updated
    Nov 22, 2025
    Dataset provided by
    Social Security Administrationhttp://ssa.gov/
    Description

    Database of allegations of fraud and dispositions of those allegations that warrant further investigation.

  12. Identity Theft Protection Services in the UK - Market Research Report...

    • ibisworld.com
    Updated Aug 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IBISWorld (2024). Identity Theft Protection Services in the UK - Market Research Report (2015-2030) [Dataset]. https://www.ibisworld.com/united-kingdom/market-research-reports/identity-theft-protection-services-industry/
    Explore at:
    Dataset updated
    Aug 25, 2024
    Dataset authored and provided by
    IBISWorld
    License

    https://www.ibisworld.com/about/termsofuse/https://www.ibisworld.com/about/termsofuse/

    Time period covered
    2015 - 2030
    Area covered
    United Kingdom
    Description

    Rising online fraud and data breaches have supported modest but uneven growth in the Identity Theft Protection Services industry. Free protections from banks and credit agencies, subdued lending conditions and intense competition have held back paid fraud-alert plans, standalone card-protection registration and some lower-tier consumer subscriptions. Over the five years through 2025-26, industry revenue is projected to rise at a compound annual rate of 0.8% to reach £378 million, including growth of 2.2% in 2025-26. Cifas reported that identity fraud accounted for 64% of National Fraud Database filings in 2023 and almost 250,000 cases in 2024, confirming that identity misuse remains the UK’s dominant cybercrime. Growing exposure to phishing, deepfakes and account takeovers is encouraging both consumers and businesses to maintain subscriptions for monitoring, breach alerts and recovery. Profit has remained steady as companies upgrade their detection and compliance systems to meet tougher data-handling rules, although profit margins remain below those in broader information services due to high regulatory and labour costs. Policy and technological shifts are reshaping the competitive landscape. In October 2024, new Payment Systems Regulator rules made banks jointly liable for authorised push-payment fraud, spurring demand for behavioural and biometric verification tools that prevent mis-payments. Rising breach notifications have also driven corporate contracts for identity restoration, with 12,412 filed with the Information Commissioner’s Office in 2024-25. Consolidation is accelerating as global data and cybersecurity groups expand through deals, like Entrust’s acquisition of Onfido in April 2024 and Experian’s purchase of KYC360 in October 2025. Over the five years through 2030-31, industry revenue is forecast to grow at a compound annual rate of 4.5% to £471.8 million. The rollout of GOV.UK One Login, the Data (Use and Access) Act 2025 and rapid advances in AI-driven fraud detection will embed verified digital identity as a standard security layer across UK finance and public services, supporting stronger, technology-led growth. At the same time, free monitoring from banks and credit agencies, privacy concerns about wider data sharing and persistent labour and compliance costs are expected to cap pricing power and limit profit margins for industry companies.

  13. Z

    Food fraud data based on the European Rapid Alert System for Food and Feed...

    • data.niaid.nih.gov
    Updated Jun 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hoenderdaal, Wouter (2020). Food fraud data based on the European Rapid Alert System for Food and Feed (RASFF) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3904124
    Explore at:
    Dataset updated
    Jun 23, 2020
    Dataset provided by
    Wageningen Food Safety Research, Akkermaalsbos 2, 6708 PD, Wageningen, The Netherlands
    Authors
    Hoenderdaal, Wouter
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data contains information on food fraud and was used to predict food fraud type using a Bayesian Network model. Food fraud notifications for the period 2000-2014 were downloaded from the Rapid Alert System for Food and Feed (RASFF) database. Each record contains detailed information on the kind of notification and the products and countries involved. Based on the description in each notification we added a variable "food fraud type" (i.e. six different types of food fraud). A set of 749 notifications for the years 2000-2013 was used to train a Bayesian Network model to predict food fraud type. This model was validated using the 88 notifications for the year 2014.

    Interpretation of the data and details on the performance of the BN model can be found in the research article titled “Prediction of food fraud type using data from Rapid Alert System for Food and Feed (RASFF) and Bayesian network modelling” https://doi.org/10.1016/j.foodcont.2015.09.026

    Column names

    year - year notification was made

    product - categorization of the different products

    notification - categorization of the notifications

    notified - country that made the notification

    origin - country where the product originated from

    fraud - classification of fraud type

  14. HEALTHCARE PROVIDER FRAUD DETECTION ANALYSIS

    • kaggle.com
    zip
    Updated May 9, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rohit Anand Gupta (2019). HEALTHCARE PROVIDER FRAUD DETECTION ANALYSIS [Dataset]. https://www.kaggle.com/datasets/rohitrox/healthcare-provider-fraud-detection-analysis
    Explore at:
    zip(26631783 bytes)Available download formats
    Dataset updated
    May 9, 2019
    Authors
    Rohit Anand Gupta
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Project Objectives Provider Fraud is one of the biggest problems facing Medicare. According to the government, the total Medicare spending increased exponentially due to frauds in Medicare claims. Healthcare fraud is an organized crime which involves peers of providers, physicians, beneficiaries acting together to make fraud claims.

    Rigorous analysis of Medicare data has yielded many physicians who indulge in fraud. They adopt ways in which an ambiguous diagnosis code is used to adopt costliest procedures and drugs. Insurance companies are the most vulnerable institutions impacted due to these bad practices. Due to this reason, insurance companies increased their insurance premiums and as result healthcare is becoming costly matter day by day.

    Healthcare fraud and abuse take many forms. Some of the most common types of frauds by providers are:

    a) Billing for services that were not provided.

    b) Duplicate submission of a claim for the same service.

    c) Misrepresenting the service provided.

    d) Charging for a more complex or expensive service than was actually provided.

    e) Billing for a covered service when the service actually provided was not covered.

    Problem Statement The goal of this project is to " predict the potentially fraudulent providers " based on the claims filed by them.along with this, we will also discover important variables helpful in detecting the behaviour of potentially fraud providers. further, we will study fraudulent patterns in the provider's claims to understand the future behaviour of providers.

    Introduction to the Dataset For the purpose of this project, we are considering Inpatient claims, Outpatient claims and Beneficiary details of each provider. Lets s see their details :

    A) Inpatient Data

    This data provides insights about the claims filed for those patients who are admitted in the hospitals. It also provides additional details like their admission and discharge dates and admit d diagnosis code.

    B) Outpatient Data

    This data provides details about the claims filed for those patients who visit hospitals and not admitted in it.

    C) Beneficiary Details Data

    This data contains beneficiary KYC details like health conditions,regioregion they belong to etc.

  15. Predicting Fraud for Mobile Payment Services

    • kaggle.com
    zip
    Updated Nov 26, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Faraz Ahmad (2018). Predicting Fraud for Mobile Payment Services [Dataset]. https://www.kaggle.com/faraz2402/predicting-fraud-for-mobile-payment-services
    Explore at:
    zip(186385509 bytes)Available download formats
    Dataset updated
    Nov 26, 2018
    Authors
    Faraz Ahmad
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by Faraz Ahmad

    Released under Database: Open Database, Contents: Database Contents

    Contents

  16. H

    Hungary Fraud Attempts: Volume

    • ceicdata.com
    Updated Sep 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2025). Hungary Fraud Attempts: Volume [Dataset]. https://www.ceicdata.com/en/hungary/card-and-electronic-payment-frauds/fraud-attempts-volume
    Explore at:
    Dataset updated
    Sep 15, 2025
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 1, 2017 - Dec 1, 2019
    Area covered
    Hungary
    Description

    Hungary Fraud Attempts: Volume data was reported at 32.000 Unit in Dec 2019. This records an increase from the previous number of 12.000 Unit for Sep 2019. Hungary Fraud Attempts: Volume data is updated quarterly, averaging 68.000 Unit from Mar 2010 (Median) to Dec 2019, with 40 observations. The data reached an all-time high of 243.000 Unit in Mar 2013 and a record low of 12.000 Unit in Sep 2019. Hungary Fraud Attempts: Volume data remains active status in CEIC and is reported by National Bank of Hungary. The data is categorized under Global Database’s Hungary – Table HU.KA013: Card and Electronic Payment Frauds.

  17. fraud.

    • kaggle.com
    Updated Sep 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PANKAJ BHOSALE (2024). fraud. [Dataset]. https://www.kaggle.com/datasets/pankajbhosale/fraud/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 8, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    PANKAJ BHOSALE
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Dataset

    This dataset was created by PANKAJ BHOSALE

    Released under Database: Open Database, Contents: Database Contents

    Contents

  18. C

    Credit Risk Database Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Credit Risk Database Report [Dataset]. https://www.datainsightsmarket.com/reports/credit-risk-database-1407617
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    May 29, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global credit risk database market is experiencing robust growth, driven by the increasing need for accurate and timely credit risk assessment across diverse sectors. The market's expansion is fueled by several key factors, including the rising adoption of digital technologies in lending and credit underwriting, the growing complexity of financial regulations demanding more sophisticated risk management strategies, and the increasing prevalence of fraud and credit defaults. This necessitates comprehensive credit risk databases offering detailed information on individuals and businesses, enabling financial institutions and other stakeholders to make informed decisions, minimize losses, and optimize their credit portfolios. The market is segmented by database type (consumer, commercial, etc.), deployment model (cloud-based, on-premise), and end-user (banks, insurance companies, etc.). Key players are actively investing in advanced analytics, machine learning, and data enrichment capabilities to enhance the accuracy and predictive power of their credit risk databases, further driving market expansion. Competition is intensifying, with companies focusing on strategic partnerships, acquisitions, and technological innovation to maintain a competitive edge. The forecast period (2025-2033) anticipates continued growth, fueled by burgeoning adoption of sophisticated credit scoring models and the expansion of fintech companies leveraging these databases for lending and other financial services. Regulatory changes impacting credit reporting and data privacy are likely to shape the market landscape, necessitating compliance and adaptation among database providers. While challenges such as data security concerns and the cost of data acquisition and maintenance persist, the overall market outlook remains positive, with substantial growth potential across various geographic regions, particularly in emerging economies experiencing rapid economic development and financial sector expansion. Estimating a reasonable market size requires making assumptions about the provided CAGR and the market's initial value. Let's assume a base year (2025) market size of $5 billion and a CAGR of 12% for illustration purposes. This would indicate significant growth over the forecast period.

  19. g

    Fraud Database | gimi9.com

    • gimi9.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fraud Database | gimi9.com [Dataset]. https://gimi9.com/dataset/uk_fraud-database
    Explore at:
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    🇬🇧 영국

  20. Z

    GPU Database Market: By Tools (GPU-Accelerated Databases and GPU-Accelerated...

    • zionmarketresearch.com
    pdf
    Updated Nov 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zion Market Research (2025). GPU Database Market: By Tools (GPU-Accelerated Databases and GPU-Accelerated Analytics), By Application (GRC, CEM, Threat Intelligence, SCM, and Fraud Detection and Prevention), and by Regions: Global Industry Perspective, Comprehensive Analysis and Forecast, 2024-2032 [Dataset]. https://www.zionmarketresearch.com/report/gpu-database-market
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Nov 22, 2025
    Dataset authored and provided by
    Zion Market Research
    License

    https://www.zionmarketresearch.com/privacy-policyhttps://www.zionmarketresearch.com/privacy-policy

    Time period covered
    2022 - 2030
    Area covered
    Global
    Description

    Global GPU Database Market size is set to expand from $ 509.36 Million in 2023 to $ 2368.35 Million by 2032, with an anticipated CAGR of around 18.62% from 2024 to 2032.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Service Personnel and Veterans Agency (2013). Fraud Database [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/MjFmNDViNDYtOGNlNy00MDQzLTliODQtNmM5NTg5ODhkMThm
Organization logo

Fraud Database

Explore at:
Dataset updated
Aug 31, 2013
Dataset provided by
Service Personnel and Veterans Agency
Description

Details of fraud referrals relating to war pensions & compensation

Search
Clear search
Close search
Google apps
Main menu