64 datasets found
  1. Insurance Claims Dataset

    • kaggle.com
    zip
    Updated May 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sergey Litvinenko (2024). Insurance Claims Dataset [Dataset]. https://www.kaggle.com/datasets/litvinenko630/insurance-claims
    Explore at:
    zip(688768 bytes)Available download formats
    Dataset updated
    May 9, 2024
    Authors
    Sergey Litvinenko
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Description: Insurance Claims Prediction

    Introduction: In the insurance industry, accurately predicting the likelihood of claims is essential for risk assessment and policy pricing. However, insurance claims datasets frequently suffer from class imbalance, where the number of non-claims instances far exceeds that of actual claims. This class imbalance poses challenges for predictive modeling, often leading to biased models favoring the majority class, resulting in subpar performance for the minority class, which is typically of greater interest.

    Dataset Overview: The dataset utilized in this project comprises historical data on insurance claims, encompassing a variety of information about the policyholders, their demographics, past claim history, and other pertinent features. The dataset is structured to facilitate predictive modeling tasks aimed at accurately identifying the likelihood of future insurance claims.

    Key Features: 1. Policyholder Information: This includes demographic details such as age, gender, occupation, marital status, and geographical location. 2. Claim History: Information regarding past insurance claims, including claim amounts, types of claims (e.g., medical, automobile), frequency of claims, and claim durations. 3. Policy Details: Details about the insurance policies held by the policyholders, such as coverage type, policy duration, premium amount, and deductibles. 4. Risk Factors: Variables indicating potential risk factors associated with policyholders, such as credit score, driving record (for automobile insurance), health status (for medical insurance), and property characteristics (for home insurance). 5. External Factors: Factors external to the policyholders that may influence claim likelihood, such as economic indicators, weather conditions, and regulatory changes.

    Objective: The primary objective of utilizing this dataset is to develop robust predictive models capable of accurately assessing the likelihood of insurance claims. By leveraging advanced machine learning techniques, such as classification algorithms and ensemble methods, the aim is to mitigate the effects of class imbalance and produce models that demonstrate high predictive performance across both majority and minority classes.

    Application Areas: 1. Risk Assessment: Assessing the risk associated with insuring a particular policyholder based on their characteristics and historical claim behavior. 2. Policy Pricing: Determining appropriate premium amounts for insurance policies by estimating the expected claim frequency and severity. 3. Fraud Detection: Identifying fraudulent insurance claims by detecting anomalous patterns in claim submissions and policyholder behavior. 4. Customer Segmentation: Segmenting policyholders into distinct groups based on their risk profiles and insurance needs to tailor marketing strategies and policy offerings.

    Conclusion: The insurance claims dataset serves as a valuable resource for developing predictive models aimed at enhancing risk management, policy pricing, and overall operational efficiency within the insurance industry. By addressing the challenges posed by class imbalance and leveraging the rich array of features available, organizations can gain valuable insights into insurance claim likelihood and make informed decisions to mitigate risk and optimize business outcomes.

    FeatureDescription
    policy_idUnique identifier for the insurance policy.
    subscription_lengthThe duration for which the insurance policy is active.
    customer_ageAge of the insurance policyholder, which can influence the likelihood of claims.
    vehicle_ageAge of the vehicle insured, which may affect the probability of claims due to factors like wear and tear.
    modelThe model of the vehicle, which could impact the claim frequency due to model-specific characteristics.
    fuel_typeType of fuel the vehicle uses (e.g., Petrol, Diesel, CNG), which might influence the risk profile and claim likelihood.
    max_torque, max_powerEngine performance characteristics that could relate to the vehicle’s mechanical condition and claim risks.
    engine_typeThe type of engine, which might have implications for maintenance and claim rates.
    displacement, cylinderSpecifications related to the engine size and construction, affec...
  2. Car Insurance Data

    • kaggle.com
    zip
    Updated Jul 5, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sagnik Roy (2021). Car Insurance Data [Dataset]. https://www.kaggle.com/datasets/sagnik1511/car-insurance-data
    Explore at:
    zip(227013 bytes)Available download formats
    Dataset updated
    Jul 5, 2021
    Authors
    Sagnik Roy
    Description

    Context

    The company has shared its annual car insurance data. Now, you have to find out the real customer behaviors over the data.

    Content

    The columns are resembling practical world features. The outcome column indicates 1 if a customer has claimed his/her loan else 0. The data has 19 features from there 18 of them are corresponding logs which were taken by the company.

    Acknowledgements

    Mostly the data is real and some part of it is also generated by me.

    Inspiration

    The data is so well balanced that it will help kagglers find a better intuition of real customers and find the deepest story lien within it.

  3. m

    Dataset of an actual motor vehicle insurance portfolio

    • data.mendeley.com
    • openicpsr.org
    • +1more
    Updated Jul 30, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Josep Lledó (2024). Dataset of an actual motor vehicle insurance portfolio [Dataset]. http://doi.org/10.17632/5cxyb5fp4f.2
    Explore at:
    Dataset updated
    Jul 30, 2024
    Authors
    Josep Lledó
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data is formatted as a spreadsheet, encompassing the primary activities over a span of three full years (November 2015 to December 2018) concerning non-life motor insurance portfolio. This dataset comprises 105,555 rows and 30 columns. Each row signifies a policy transaction, while each column represents a distinct variable.

  4. m

    insurance_claims

    • data.mendeley.com
    • kaggle.com
    Updated Aug 22, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ABDELRAHIM AQQAD (2023). insurance_claims [Dataset]. http://doi.org/10.17632/992mh7dk9y.2
    Explore at:
    Dataset updated
    Aug 22, 2023
    Authors
    ABDELRAHIM AQQAD
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset, named "insurance_claims.csv", is a comprehensive collection of insurance claim records. Each row represents an individual claim, and the columns represent various features associated with that claim.

    The dataset is, highlighting features like 'months_as_customer', 'age', policy_number, ...etc. The main focus is the 'fraud_reported' variable, which indicates claim legitimacy.

    Claims data were sourced from various insurance providers, encompassing a diverse array of insurance types including vehicular, property, and personal injury. Each claim's record provides an in-depth look into the individual's background, claim specifics, associated documentation, and feedback from insurance professionals.

    The dataset further includes specific indicators and parameters that were considered during the claim's assessment, offering a granular look into the complexities of each claim.

    For privacy reasons, and in agreement with the participating insurance providers, certain personal details and specific identifiers have been anonymized. Instead of names or direct identifiers, each entry is associated with a unique ID, ensuring data privacy while retaining data integrity.

    The insurance claims were subjected to rigorous examination, encompassing both manual assessments and automated checks. The end result of this examination, specifically whether a claim was deemed fraudulent or not, is clearly indicated for each record.

  5. Insurance Claims Data

    • kaggle.com
    zip
    Updated Jan 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Satish Varma (2022). Insurance Claims Data [Dataset]. https://www.kaggle.com/datasets/saisatish09/insuranceclaimsdata
    Explore at:
    zip(1959661 bytes)Available download formats
    Dataset updated
    Jan 30, 2022
    Authors
    Satish Varma
    Description

    Autobi(Automobile Bodily Injury Claims) -

    The data contains information on demographic information about the claimant, attorney involvement and the economic loss (LOSS, in thousands), among other variables.The full data contains over 70,000 closed claims based on data from thirty-two insurers.

    A data frame with 1340 observations on the following 8 variables.

    CASENUM- Case number to identify the claim, a numeric vector ATTORNEY- Whether the claimant is represented by an attorney (=1 if yes and =2 if no), a numeric vector CLMSEX - Claimant's gender (=1 if male and =2 if female), a numeric vector MARITAL- claimant's marital status (=1 if married, =2 if single, =3 if widowed, and =4 if divorced/separated), a numeric vector CLMINSUR- Whether or not the driver of the claimant's vehicle was uninsured (=1 if yes, =2 if no, and =3 if not applicable), a numeric vector SEATBELT- Whether or not the claimant was wearing a seatbelt/child restraint (=1 if yes, =2 if no, and =3 if not applicable), a numeric vector CLMAGE- Claimant's age, a numeric vector LOSS- The claimant's total economic loss (in thousands), a numeric vector

    AutoClaims(Automobile Insurance Claims) -

    A data frame with 6773 observations on the following 5 variables.

    STATE CLASS - Rating class of operator, based on age, gender, marital status, use of vehicle GENDER AGE - Age of operator PAID - Amount paid to settle and close a claim

    AutoCollision(Automobile UK Collision Claims)

    8,942 collision losses from private passenger United Kingdom (UK) automobile insurance policies. The average severity is in pounds sterling adjusted for inflation.

    A data frame with 32 observations on the following 4 variables.

    Age - Age of driver Vehicle_Use - Purpose of the vehicle use Severity - Average amount of claims Claim_Count - Number of claims

    Additional information can be found in the document: https://cran.r-project.org/web/packages/insuranceData/index.html

  6. G

    Insurance Premium and Claims Data by Class of Insurance, Alberta, 2013

    • open.canada.ca
    • data.wu.ac.at
    csv, html, xlsx
    Updated Jul 24, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Government of Alberta (2024). Insurance Premium and Claims Data by Class of Insurance, Alberta, 2013 [Dataset]. https://open.canada.ca/data/en/dataset/34eb85a2-1558-46b7-adca-a40c446cb05f
    Explore at:
    xlsx, csv, htmlAvailable download formats
    Dataset updated
    Jul 24, 2024
    Dataset provided by
    Government of Alberta
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Jan 1, 2013 - Dec 31, 2013
    Area covered
    Alberta
    Description

    Data provided by insurers, on the premiums written and claims incurred for the 2013 fiscal year. Based on reporting on the consolidated pages of the P&C-1 or Life-1 Annual returns. This data is also reported in the Superintendent of Insurance’s Annual Report.

  7. Car Insurance Claim Data

    • kaggle.com
    zip
    Updated Oct 15, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    xiaomengsun (2018). Car Insurance Claim Data [Dataset]. https://www.kaggle.com/xiaomengsun/car-insurance-claim-data
    Explore at:
    zip(387449 bytes)Available download formats
    Dataset updated
    Oct 15, 2018
    Authors
    xiaomengsun
    Description

    Dataset

    This dataset was created by xiaomengsun

    Contents

  8. Average annual minimum and full car insurance premiums in the U.S. 2024, by...

    • statista.com
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Average annual minimum and full car insurance premiums in the U.S. 2024, by age [Dataset]. https://www.statista.com/statistics/675367/annual-auto-insurance-premiums-usa-by-state/
    Explore at:
    Dataset updated
    Nov 24, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2024
    Area covered
    United States
    Description

    Louisiana had the most expensive annual car insurance premiums at ***** U.S. dollars for full coverage. Alaska ranked in first place, having the highest annual cost for minimum car insurance coverage at *** U.S. dollars.Why it varies state by state The huge variance in premiums between states is due to the difference in state laws, the percentage of uninsured drivers in the state, the frequency of natural disasters, and claim rates. For instance, Michigan has a no-fault car insurance system, which means that claims are more common. This drives up the cost of insurance for all drivers because insurers need to pay out more money in claims. Male drivers also pay more There is also a difference between premiums among different age groups. In 2025, 25-year-old male drivers paid more per month than 25-year-old female drivers did. This is due to the higher incidence of accidents among young male drivers. This means that young drivers in states that already have higher premiums must pay a lot for car insurance.

  9. Comprehensive car claim frequency for physical damage in the U.S. 2007-2023

    • statista.com
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Comprehensive car claim frequency for physical damage in the U.S. 2007-2023 [Dataset]. https://www.statista.com/statistics/830114/comprehensive-car-claim-frequency-physical-damage-usa/
    Explore at:
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    The frequency of private passenger comprehensive auto insurance claims for physical damage in the United States rose to **** per 100 car years in 2023, compared to *** in 2020. This was the highest frequency recorded over the past 15 years.

  10. Auto Insurance Claims Updated to 2024

    • kaggle.com
    zip
    Updated Jul 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Bumpkin (2024). Auto Insurance Claims Updated to 2024 [Dataset]. https://www.kaggle.com/datasets/thebumpkin/auto-insurance-claims-updated-to-2024
    Explore at:
    zip(296702 bytes)Available download formats
    Dataset updated
    Jul 31, 2024
    Authors
    The Bumpkin
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset comprises 9,134 records of auto insurance claims, encompassing a broad range of attributes related to customer profiles and policy details. Key columns include demographic information such as Customer, State, Gender, Income, and Education, along with policy-specific data like Coverage, Policy Type, and Monthly Premium Auto. This dataset also contains indices for various categorical attributes, including Coverage Index, Education Index, and Vehicle Class Index, which facilitate the quantification of qualitative information. Additionally, the dataset tracks metrics related to policy performance and customer interaction, such as the Number of Open Complaints, Months Since Last Claim, and Total Claim Amount.

    To provide a comprehensive view of the insurance landscape, the dataset includes detailed attributes about policy effectiveness and customer engagement. Features such as Effective To Date, Renew Offer Type, Sales Channel, and Vehicle Size contribute to understanding how different factors impact insurance claims. This rich dataset offers valuable insights into customer behavior, policy performance, and overall claim dynamics, making it a robust resource for analyzing trends and patterns in auto insurance claims.

    This dataset was initially created in 2011 with values in 2011 dollars. To reflect current economic conditions, I updated it to 2024 dollars using a factor provided by ChatGPT. Additionally, I incorporated index columns to facilitate research and analysis.

  11. Number of motor insurance claims in Germany 1995-2022

    • statista.com
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Number of motor insurance claims in Germany 1995-2022 [Dataset]. https://www.statista.com/statistics/1130547/motor-insurance-claims-in-germany/
    Explore at:
    Dataset updated
    Nov 24, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Germany
    Description

    In 2022, there were more than ************* auto insurance claims submitted in Germany. The largest share was for comprehensive, or Vollkasko, insurance, which accounted for *** million claims, followed by third-party liability with **** million claims.

  12. R

    Car Damage Images Dataset

    • universe.roboflow.com
    zip
    Updated Mar 29, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Car Damage (2023). Car Damage Images Dataset [Dataset]. https://universe.roboflow.com/car-damage-kadad/car-damage-images/dataset/1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 29, 2023
    Dataset authored and provided by
    Car Damage
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Car Damage Bounding Boxes
    Description

    Here are a few use cases for this project:

    1. Insurance Assessment: This model can be used by insurance companies to automate the process of assessing car damage in insurance claims. By simply using photographs of the damaged vehicle, the model can identify the type and extent of damage, making the claim processing faster and more objective.

    2. Automotive Repair Estimates: Car repair shops can use this model to get an approximate idea of the damage and therefore provide a more accurate cost estimate for their clients. It can also assist in identifying nonobvious damage.

    3. Used Car Market Evaluation: This model can be used in used car platforms to evaluate the current condition of the cars listed for sale. By identifying existing damage, buyers can make more informed decisions and sellers can price their vehicles more accurately.

    4. Law Enforcement and Road Safety: Traffic police and accident investigation teams can utilize this model to evaluate the types of damages after a road accident. It will assist in rebuilding the accident scenario, providing insights during investigations.

    5. Auto-manufacturing Quality Control: Automobile manufacturers can use this model in their factories to automatically inspect new cars for any damage or misaligned/missing parts before they are dispatched from the factory, ensuring quality control.

  13. Leading private passenger auto insurers in the U.S. 2024, by premiums

    • statista.com
    Updated Jul 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista Research Department (2025). Leading private passenger auto insurers in the U.S. 2024, by premiums [Dataset]. https://www.statista.com/topics/3087/car-insurance-in-the-united-states/
    Explore at:
    Dataset updated
    Jul 17, 2025
    Dataset provided by
    Statistahttp://statista.com/
    Authors
    Statista Research Department
    Description

    State Farm Mutual Automobile Insurance was the leading private passenger car insurer in the United States in 2024, with premiums written amounting to approximately 68 billion U.S. dollars. Progressive Corporation, and Berkshire Hathaway Inc. were the next largest insurers in this sector. State Farm: a background State Farm Mutual Automobile Insurance was founded in 1922 and is headquartered in Bloomington, Illinois. In 2024, the insurer was the largest writer of property and casualty insurance in the United States. They provide vehicle, homeowners, renters, life and annuities, health, disability and flood insurance among several other insurance products. Net promoter score and ad spend of State Farm Despite their market leader status, State Farm's net promoter score puts them in the middle of the pack, with only 42 percent of their customers saying they would recommend the insurer. However, their nearest competitors did not score any better, with Progressive receiving a NPS of only 38 percent in the same analysis. The three largest car insurers were also the biggest spenders on advertising.

  14. Car Insurance Premium Dataset

    • kaggle.com
    Updated Jan 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GOVINDARAM SRIRAM (2025). Car Insurance Premium Dataset [Dataset]. https://www.kaggle.com/datasets/govindaramsriram/car-insurance-premium-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    GOVINDARAM SRIRAM
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description: This dataset contains 1,000 rows of synthetic data simulating car insurance premiums, calculated using a linear formula. It incorporates key features such as driver age, driving experience, accident history, annual mileage, and car manufacturing year to predict the insurance premium. The dataset is ideal for exploring linear regression models, feature importance analysis, and predictive modeling in the insurance industry. It was inspired by real-world factors influencing insurance premiums, ensuring realistic patterns and meaningful insights.

  15. Premiums And Claims Of General Insurance Funds, Annual

    • data.gov.sg
    Updated Oct 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Singapore Department of Statistics (2025). Premiums And Claims Of General Insurance Funds, Annual [Dataset]. https://data.gov.sg/datasets/d_abcfd12381e7f8d175280d999cdb2dea/view
    Explore at:
    Dataset updated
    Oct 13, 2025
    Dataset authored and provided by
    Singapore Department of Statistics
    License

    https://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence

    Time period covered
    Jan 1965 - Dec 2023
    Description

    Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_abcfd12381e7f8d175280d999cdb2dea/view

  16. Auto Insurance Claim Metadata & Automation Service

    • kaggle.com
    Updated Sep 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Le Tu Uyen Nguyen (2025). Auto Insurance Claim Metadata & Automation Service [Dataset]. https://www.kaggle.com/datasets/letuuyennguyen/auto-insurance-claim-metadata-and-automation-service
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 22, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Le Tu Uyen Nguyen
    Description

    Description This dataset and project are part of ClaimWise AI, an intelligent automation service designed to streamline auto insurance claim processing. All data in this release was collected and curated by our team, ensuring originality and alignment with real-world claim processing scenarios.

    What’s inside

    • claim_metadata.xlsx: Structured metadata of insurance claims including claim IDs, incident details, vehicle attributes, and fraud indicators.
    • Auto-claim service pipeline: Demonstrates how to leverage machine learning, embeddings, and multimodal AI to process claim data and simulate real-world automation.

    Note on Images The pipeline references car crash and accident images as part of embedding and similarity checks. These images were also collected by our team from publicly available resources and curated for research purposes. They are not redistributed in this dataset but are used internally to illustrate how ClaimWise AI can handle multimodal data.

    Key Features

    • Automates claim intake and metadata standardization.
    • Uses AI embeddings for fraud detection and duplicate claim checks.
    • Supports integration with curated external crash images for damage assessment.
    • Provides an end-to-end demo of an AI-powered insurance claim automation workflow.

    Use Cases

    • Fraud detection in auto claims
    • Accident severity classification
    • Claim similarity and duplicate detection
    • AI-enabled insurance process optimization
  17. d

    Automobile Insurance Company Complaint Rankings: Beginning 2009

    • catalog.data.gov
    • data.ny.gov
    • +2more
    Updated Mar 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.ny.gov (2025). Automobile Insurance Company Complaint Rankings: Beginning 2009 [Dataset]. https://catalog.data.gov/dataset/automobile-insurance-company-complaint-rankings-beginning-2009
    Explore at:
    Dataset updated
    Mar 14, 2025
    Dataset provided by
    data.ny.gov
    Description

    The DFS ranks automobile insurance companies doing business in New York State based on the number of consumer complaints upheld against them as a percentage of their total business over a two-year period. Complaints typically involve issues like delays in the payment of no-fault claims and nonrenewal of policies. Insurers with the fewest upheld complaints per million dollars of premiums appear at the top of the list. Those with the highest complaint ratios are ranked at the bottom.

  18. d

    NCID Private Motor Insurance Data- Ultimate Claims - Dataset - CBI Opendata

    • poc.staging.derilinx.com
    Updated Apr 29, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). NCID Private Motor Insurance Data- Ultimate Claims - Dataset - CBI Opendata [Dataset]. https://poc.staging.derilinx.com/dataset/ncid-part-2-claims-ultimate-claims
    Explore at:
    Dataset updated
    Apr 29, 2024
    Description

    This file contains ultimate claims data taken from the private motor National Claims Iinformation Database (NCID). The claims are grouped together by accident year, the year in which the accident occurred. Not all claims are paid in the lifetime of the policy. Some claims, injury claims in particular, can take many years to be settled and be fully paid. Insurers estimate the cost/number of claims expected for a particular accident year, and this known as the ultimate cost/number of claims. The ultimate cost/number of claims is recalculated regularly, based on the most up-to-date information available. The more time that has passed since the accident year, the more certain the ultimate cost of claims becomes. To view the detailed NCID report kindly refer to the centralbank publication link in the Landing Page section under Additional Info.

  19. S

    Switzerland Non Life Insurance: Claims Paid: Liability and Motor

    • ceicdata.com
    Updated Sep 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CEICdata.com (2025). Switzerland Non Life Insurance: Claims Paid: Liability and Motor [Dataset]. https://www.ceicdata.com/en/switzerland/non-life-insurance-claims-paid/non-life-insurance-claims-paid-liability-and-motor
    Explore at:
    Dataset updated
    Sep 15, 2025
    Dataset provided by
    CEICdata.com
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 1, 2005 - Dec 1, 2016
    Area covered
    Switzerland
    Variables measured
    Insurance Market
    Description

    Switzerland Non Life Insurance: Claims Paid: Liability and Motor data was reported at 4,676.000 CHF mn in 2016. This records a decrease from the previous number of 4,802.000 CHF mn for 2015. Switzerland Non Life Insurance: Claims Paid: Liability and Motor data is updated yearly, averaging 4,628.000 CHF mn from Dec 2000 (Median) to 2016, with 17 observations. The data reached an all-time high of 4,918.000 CHF mn in 2009 and a record low of 3,844.000 CHF mn in 2000. Switzerland Non Life Insurance: Claims Paid: Liability and Motor data remains active status in CEIC and is reported by Swiss Financial Market Supervisory Authority. The data is categorized under Global Database’s Switzerland – Table CH.RG011: Non Life Insurance: Claims Paid.

  20. R

    Damaged Car Parts Dataset

    • universe.roboflow.com
    zip
    Updated Jul 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohamed Abdelhalim (2024). Damaged Car Parts Dataset [Dataset]. https://universe.roboflow.com/mohamed-abdelhalim/damaged-car-parts-1srjx/model/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 11, 2024
    Dataset authored and provided by
    Mohamed Abdelhalim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Car Parts Bounding Boxes
    Description

    Here are a few use cases for this project:

    1. Insurance Claims Processing: This model can be used by auto insurance companies to automate the process of assessing the damage on vehicles involved in accidents. After a collision, policyholders can take pictures of their damaged vehicles, and the model can identify and classify the damage, speeding up the claims process.

    2. Vehicle Repair Estimates: Auto repair shops can utilize this model to quickly generate estimates for vehicle repairs. By using it to catalogue damage, repairs needed for specific parts could be more accurately priced.

    3. Online Car Selling Platforms: This model could be used on platforms where used cars are sold. Sellers could upload images of their cars, and the model could assess any visible damage, providing potential buyers with more information about the condition of the vehicle.

    4. Traffic Accident Analysis: Law enforcement or accident investigators could use this model to help determine the sequence of events in a car accident. By identifying the damaged parts of the vehicles involved, it could offer clues to how the accident happened.

    5. Car Rental Services: Car Rental companies could implement this model to automatically evaluate the condition of cars when they are returned by customers. This could identify any new damages or irregularities as compared to the car's condition at the start of the rental period.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sergey Litvinenko (2024). Insurance Claims Dataset [Dataset]. https://www.kaggle.com/datasets/litvinenko630/insurance-claims
Organization logo

Insurance Claims Dataset

Develop a model for insurance claim likelihood, addressing class imbalance.

Explore at:
zip(688768 bytes)Available download formats
Dataset updated
May 9, 2024
Authors
Sergey Litvinenko
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset Description: Insurance Claims Prediction

Introduction: In the insurance industry, accurately predicting the likelihood of claims is essential for risk assessment and policy pricing. However, insurance claims datasets frequently suffer from class imbalance, where the number of non-claims instances far exceeds that of actual claims. This class imbalance poses challenges for predictive modeling, often leading to biased models favoring the majority class, resulting in subpar performance for the minority class, which is typically of greater interest.

Dataset Overview: The dataset utilized in this project comprises historical data on insurance claims, encompassing a variety of information about the policyholders, their demographics, past claim history, and other pertinent features. The dataset is structured to facilitate predictive modeling tasks aimed at accurately identifying the likelihood of future insurance claims.

Key Features: 1. Policyholder Information: This includes demographic details such as age, gender, occupation, marital status, and geographical location. 2. Claim History: Information regarding past insurance claims, including claim amounts, types of claims (e.g., medical, automobile), frequency of claims, and claim durations. 3. Policy Details: Details about the insurance policies held by the policyholders, such as coverage type, policy duration, premium amount, and deductibles. 4. Risk Factors: Variables indicating potential risk factors associated with policyholders, such as credit score, driving record (for automobile insurance), health status (for medical insurance), and property characteristics (for home insurance). 5. External Factors: Factors external to the policyholders that may influence claim likelihood, such as economic indicators, weather conditions, and regulatory changes.

Objective: The primary objective of utilizing this dataset is to develop robust predictive models capable of accurately assessing the likelihood of insurance claims. By leveraging advanced machine learning techniques, such as classification algorithms and ensemble methods, the aim is to mitigate the effects of class imbalance and produce models that demonstrate high predictive performance across both majority and minority classes.

Application Areas: 1. Risk Assessment: Assessing the risk associated with insuring a particular policyholder based on their characteristics and historical claim behavior. 2. Policy Pricing: Determining appropriate premium amounts for insurance policies by estimating the expected claim frequency and severity. 3. Fraud Detection: Identifying fraudulent insurance claims by detecting anomalous patterns in claim submissions and policyholder behavior. 4. Customer Segmentation: Segmenting policyholders into distinct groups based on their risk profiles and insurance needs to tailor marketing strategies and policy offerings.

Conclusion: The insurance claims dataset serves as a valuable resource for developing predictive models aimed at enhancing risk management, policy pricing, and overall operational efficiency within the insurance industry. By addressing the challenges posed by class imbalance and leveraging the rich array of features available, organizations can gain valuable insights into insurance claim likelihood and make informed decisions to mitigate risk and optimize business outcomes.

FeatureDescription
policy_idUnique identifier for the insurance policy.
subscription_lengthThe duration for which the insurance policy is active.
customer_ageAge of the insurance policyholder, which can influence the likelihood of claims.
vehicle_ageAge of the vehicle insured, which may affect the probability of claims due to factors like wear and tear.
modelThe model of the vehicle, which could impact the claim frequency due to model-specific characteristics.
fuel_typeType of fuel the vehicle uses (e.g., Petrol, Diesel, CNG), which might influence the risk profile and claim likelihood.
max_torque, max_powerEngine performance characteristics that could relate to the vehicle’s mechanical condition and claim risks.
engine_typeThe type of engine, which might have implications for maintenance and claim rates.
displacement, cylinderSpecifications related to the engine size and construction, affec...
Search
Clear search
Close search
Google apps
Main menu