3 datasets found
  1. USA Macroeconomic Rate Of Changes 1993-2025

    • kaggle.com
    Updated Mar 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saint moretz (2025). USA Macroeconomic Rate Of Changes 1993-2025 [Dataset]. https://www.kaggle.com/datasets/spingere/usa-macroeconomic-rate-of-changes-1993-2025
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 28, 2025
    Dataset provided by
    Kaggle
    Authors
    Saint moretz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United States
    Description

    ****Dataset Overview**** This dataset contains historical macroeconomic data, featuring key economic indicators in the United States. It includes important metrics such as the Consumer Price Index (CPI), Retail Sales, Unemployment Rate, Industrial Production, Money Supply (M2), and more. The dataset spans from 1993 to the present and includes monthly data on various economic indicators, processed to show their rate of change (either percentage or absolute difference, depending on the indicator).

    provenance

    The data in this dataset is sourced from the Federal Reserve Economic Data (FRED) database, hosted by the Federal Reserve Bank of St. Louis. FRED provides access to a wide range of economic data, including key macroeconomic indicators for the United States. My work involved calculating the rate of change (ROC) for each indicator and reorganizing the data into a more usable format for analysis. For more information and access to the full database, visit FRED's website.

    Purpose and Use for the Kaggle Community:

    This dataset is a valuable resource for data scientists, economists, and analysts interested in understanding macroeconomic trends, performing time series analysis, or building predictive models. With the rate of change included, users can quickly assess the growth or contraction in these indicators month-over-month. This dataset can be used for:

    • Exploratory Data Analysis (EDA): Understanding historical economic trends. -Time Series Forecasting: Building models to predict future economic conditions. -Macroeconomic Analysis: Analyzing the relationship between various economic indicators. -Machine Learning Projects: Using the data as features to predict financial or economic outcomes. -By utilizing this dataset, users can perform in-depth analysis on the impact of macroeconomic changes, compare the historical performance of various indicators, and experiment with different time series forecasting techniques.

    ****Column Descriptions****

    Year: The year of the observation.

    Month: The month of the observation (1-12).

    Industrial Production: Monthly data on the total output of US factories, mines, and utilities.

    Manufacturers' New Orders: Durable Goods: Measures the value of new orders placed with manufacturers for durable goods, indicating future production activity.

    Consumer Price Index (CPIAUCSL): A measure of the average change over time in the prices paid by urban consumers for a market basket of consumer goods and services.

    Unemployment Rate: The percentage of the total labor force that is unemployed but actively seeking employment.

    Retail Sales: The total receipts of retail stores, indicating consumer spending and economic activity.

    Producer Price Index: Measures the average change over time in the selling prices received by domestic producers for their output.

    Personal Consumption Expenditures (PCE): A measure of the prices paid by consumers for goods and services, used in calculating inflation.

    National Home Price Index: A measure of changes in residential real estate prices across the country.

    All Employees, Total Nonfarm: The number of nonfarm payroll employees, an important indicator of the labor market.

    Labor Force Participation Rate: The percentage of the working-age population that is either employed or actively looking for work.

    Federal Funds Effective Rate: The interest rate at which depository institutions lend reserve balances to other depository institutions overnight.

    Building Permits: The number of building permits issued for residential and non-residential buildings, a leading indicator of construction activity.

    Money Supply (M2): The total money supply, including cash, checking deposits, and easily convertible near money.

    Personal Income: The total income received by individuals from all sources, including wages, investments, and government transfers.

    Trade Balance: The difference between a country's imports and exports, indicating the net trade flow.

    Consumer Sentiment: The index reflecting consumer sentiment and expectations for the future economic outlook.

    Consumer Confidence: A measure of how optimistic or pessimistic consumers are regarding their expected financial situation and the economy.

    Notes on Interest Rates Please note that for the Federal Funds Effective Rate (FEDFUNDS), the dataset includes the absolute change in basis points (bps), not the rate of change. This means that the dataset reflects the direct change in the interest rate rather than the percentage change month-over-month. The change is represented in basis points, where 1 basis point equals 0.01%.

  2. Insurance Claims Dataset

    • kaggle.com
    Updated May 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sergey Litvinenko (2024). Insurance Claims Dataset [Dataset]. https://www.kaggle.com/datasets/litvinenko630/insurance-claims
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 9, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sergey Litvinenko
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset Description: Insurance Claims Prediction

    Introduction: In the insurance industry, accurately predicting the likelihood of claims is essential for risk assessment and policy pricing. However, insurance claims datasets frequently suffer from class imbalance, where the number of non-claims instances far exceeds that of actual claims. This class imbalance poses challenges for predictive modeling, often leading to biased models favoring the majority class, resulting in subpar performance for the minority class, which is typically of greater interest.

    Dataset Overview: The dataset utilized in this project comprises historical data on insurance claims, encompassing a variety of information about the policyholders, their demographics, past claim history, and other pertinent features. The dataset is structured to facilitate predictive modeling tasks aimed at accurately identifying the likelihood of future insurance claims.

    Key Features: 1. Policyholder Information: This includes demographic details such as age, gender, occupation, marital status, and geographical location. 2. Claim History: Information regarding past insurance claims, including claim amounts, types of claims (e.g., medical, automobile), frequency of claims, and claim durations. 3. Policy Details: Details about the insurance policies held by the policyholders, such as coverage type, policy duration, premium amount, and deductibles. 4. Risk Factors: Variables indicating potential risk factors associated with policyholders, such as credit score, driving record (for automobile insurance), health status (for medical insurance), and property characteristics (for home insurance). 5. External Factors: Factors external to the policyholders that may influence claim likelihood, such as economic indicators, weather conditions, and regulatory changes.

    Objective: The primary objective of utilizing this dataset is to develop robust predictive models capable of accurately assessing the likelihood of insurance claims. By leveraging advanced machine learning techniques, such as classification algorithms and ensemble methods, the aim is to mitigate the effects of class imbalance and produce models that demonstrate high predictive performance across both majority and minority classes.

    Application Areas: 1. Risk Assessment: Assessing the risk associated with insuring a particular policyholder based on their characteristics and historical claim behavior. 2. Policy Pricing: Determining appropriate premium amounts for insurance policies by estimating the expected claim frequency and severity. 3. Fraud Detection: Identifying fraudulent insurance claims by detecting anomalous patterns in claim submissions and policyholder behavior. 4. Customer Segmentation: Segmenting policyholders into distinct groups based on their risk profiles and insurance needs to tailor marketing strategies and policy offerings.

    Conclusion: The insurance claims dataset serves as a valuable resource for developing predictive models aimed at enhancing risk management, policy pricing, and overall operational efficiency within the insurance industry. By addressing the challenges posed by class imbalance and leveraging the rich array of features available, organizations can gain valuable insights into insurance claim likelihood and make informed decisions to mitigate risk and optimize business outcomes.

    FeatureDescription
    policy_idUnique identifier for the insurance policy.
    subscription_lengthThe duration for which the insurance policy is active.
    customer_ageAge of the insurance policyholder, which can influence the likelihood of claims.
    vehicle_ageAge of the vehicle insured, which may affect the probability of claims due to factors like wear and tear.
    modelThe model of the vehicle, which could impact the claim frequency due to model-specific characteristics.
    fuel_typeType of fuel the vehicle uses (e.g., Petrol, Diesel, CNG), which might influence the risk profile and claim likelihood.
    max_torque, max_powerEngine performance characteristics that could relate to the vehicle’s mechanical condition and claim risks.
    engine_typeThe type of engine, which might have implications for maintenance and claim rates.
    displacement, cylinderSpecifications related to the engine size and construction, affec...
  3. Brazilian e-commerce company: OLIST

    • kaggle.com
    Updated Oct 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aditya Kumar Pal (2023). Brazilian e-commerce company: OLIST [Dataset]. https://www.kaggle.com/datasets/erak1006/brazilian-e-commerce-company-olist/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 25, 2023
    Dataset provided by
    Kaggle
    Authors
    Aditya Kumar Pal
    Description

    Presentation Outline: 1. Introduction - Company Overview - Objectives 2. Identified Use Cases - Delivery Date Prediction - Sentiment Analysis - Customer Churn - Customer Acquisition Cost Optimization - Fraud Detection - Price Optimization 3. Roadmap for Data Science Adoption 4. Case Study: Delivery Date Prediction 5. Case Study: Sentiment Analysis 6. Case Study: Customer Churn 7. Case Study: Customer Acquisition Cost Optimization 8. Case Study: Fraud Detection 9. Case Study: Price Optimization 10. Conclusion 11. Q&A

    1. Introduction - Company Overview: OLIST is a Brazilian e-commerce marketplace similar to Amazon. - Objectives: OLIST aims to increase the number of active customers, revenue, efficiency of services, and improve customer experience.

    2. Identified Use Cases OLIST's leadership has identified six key use cases to achieve its objectives:

    a. Delivery Date Prediction - Impact: Improve customer satisfaction and on-time delivery. - Feasibility: High, with historical order data. - Data & Skill Requirements: Order data, historical delivery data, machine learning expertise. - DS Solution Approach: Predict delivery dates using ML models. - Process Changes: More accurate delivery predictions. - Proof of Concept: Evaluate prediction accuracy. - Success Metric: Decrease in average delivery time. - Estimated Benefits: Increased customer satisfaction, customer retention, and potentially reduced inventory costs.

    b. Sentiment Analysis - Impact: Understand customer experience and identify areas for improvement. - Feasibility: High, with access to customer reviews. - Data & Skill Requirements: Customer review data, NLP expertise. - DS Solution Approach: Sentiment analysis of reviews. - Process Changes: Improved customer feedback analysis. - Proof of Concept: Evaluate sentiment analysis accuracy. - Success Metric: Better customer reviews and ratings. - Estimated Benefits: Improved customer satisfaction, customer retention, and product/service enhancements.

    c. Customer Churn - Impact: Identify at-risk customers and build retention strategies. - Feasibility: High, with historical customer data. - Data & Skill Requirements: Customer data, machine learning expertise. - DS Solution Approach: Churn prediction models. - Process Changes: Focus on retaining at-risk customers. - Proof of Concept: Evaluate model's predictive accuracy. - Success Metric: Reduced customer churn rate. - Estimated Benefits: Increased customer retention, revenue, and improved marketing ROI.

    d. Customer Acquisition Cost Optimization - Impact: Measure the effectiveness of acquisition campaigns. - Feasibility: High, with campaign and customer lifetime value data. - Data & Skill Requirements: Campaign and customer data, analytical expertise. - DS Solution Approach: Calculate customer acquisition costs. - Process Changes: More efficient campaign spending. - Proof of Concept: Compare campaign costs against lifetime value. - Success Metric: Reduced acquisition cost per customer. - Estimated Benefits: Improved campaign efficiency, marketing ROI, and increased revenue.

    e. Fraud Detection - Impact: Identify and prevent fraudulent transactions. - Feasibility: High, with transaction data. - Data & Skill Requirements: Transaction data, fraud detection expertise. - DS Solution Approach: Develop fraud detection models. - Process Changes: Enhanced security measures. - Proof of Concept: Evaluate model's fraud detection accuracy. - Success Metric: Reduced fraud-related financial losses. - Estimated Benefits: Minimized financial losses, improved brand reputation.

    f. Price Optimization - Impact: Optimize product pricing for increased revenue. - Feasibility: High, with pricing and sales data. - Data & Skill Requirements: Pricing and sales data, pricing strategy expertise. - DS Solution Approach: Price optimization algorithms. - Process Changes: Dynamic pricing strategies. - Proof of Concept: Evaluate pricing model's effectiveness. - Success Metric: Increased sales and revenue. - Estimated Benefits: Improved revenue and profit margins.

    3. Roadmap for Data Science Adoption - Prioritize use cases based on feasibility, potential impact, and data availability. - Build a data science team with required skills. - Develop proof of concepts for selected use cases. - Monitor and fine-tune models for accuracy and effectiveness. - Continuously measure and adapt to business needs.

    10. Conclusion - Data science adoption is key to achieving OLIST's business objectives. - Each use case offers unique benefits and requires different data and skills. - DS adoption will lead to impro...

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Saint moretz (2025). USA Macroeconomic Rate Of Changes 1993-2025 [Dataset]. https://www.kaggle.com/datasets/spingere/usa-macroeconomic-rate-of-changes-1993-2025
Organization logo

USA Macroeconomic Rate Of Changes 1993-2025

let the data speak, beautiful dataset of ROC macro usa

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 28, 2025
Dataset provided by
Kaggle
Authors
Saint moretz
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Area covered
United States
Description

****Dataset Overview**** This dataset contains historical macroeconomic data, featuring key economic indicators in the United States. It includes important metrics such as the Consumer Price Index (CPI), Retail Sales, Unemployment Rate, Industrial Production, Money Supply (M2), and more. The dataset spans from 1993 to the present and includes monthly data on various economic indicators, processed to show their rate of change (either percentage or absolute difference, depending on the indicator).

provenance

The data in this dataset is sourced from the Federal Reserve Economic Data (FRED) database, hosted by the Federal Reserve Bank of St. Louis. FRED provides access to a wide range of economic data, including key macroeconomic indicators for the United States. My work involved calculating the rate of change (ROC) for each indicator and reorganizing the data into a more usable format for analysis. For more information and access to the full database, visit FRED's website.

Purpose and Use for the Kaggle Community:

This dataset is a valuable resource for data scientists, economists, and analysts interested in understanding macroeconomic trends, performing time series analysis, or building predictive models. With the rate of change included, users can quickly assess the growth or contraction in these indicators month-over-month. This dataset can be used for:

  • Exploratory Data Analysis (EDA): Understanding historical economic trends. -Time Series Forecasting: Building models to predict future economic conditions. -Macroeconomic Analysis: Analyzing the relationship between various economic indicators. -Machine Learning Projects: Using the data as features to predict financial or economic outcomes. -By utilizing this dataset, users can perform in-depth analysis on the impact of macroeconomic changes, compare the historical performance of various indicators, and experiment with different time series forecasting techniques.

****Column Descriptions****

Year: The year of the observation.

Month: The month of the observation (1-12).

Industrial Production: Monthly data on the total output of US factories, mines, and utilities.

Manufacturers' New Orders: Durable Goods: Measures the value of new orders placed with manufacturers for durable goods, indicating future production activity.

Consumer Price Index (CPIAUCSL): A measure of the average change over time in the prices paid by urban consumers for a market basket of consumer goods and services.

Unemployment Rate: The percentage of the total labor force that is unemployed but actively seeking employment.

Retail Sales: The total receipts of retail stores, indicating consumer spending and economic activity.

Producer Price Index: Measures the average change over time in the selling prices received by domestic producers for their output.

Personal Consumption Expenditures (PCE): A measure of the prices paid by consumers for goods and services, used in calculating inflation.

National Home Price Index: A measure of changes in residential real estate prices across the country.

All Employees, Total Nonfarm: The number of nonfarm payroll employees, an important indicator of the labor market.

Labor Force Participation Rate: The percentage of the working-age population that is either employed or actively looking for work.

Federal Funds Effective Rate: The interest rate at which depository institutions lend reserve balances to other depository institutions overnight.

Building Permits: The number of building permits issued for residential and non-residential buildings, a leading indicator of construction activity.

Money Supply (M2): The total money supply, including cash, checking deposits, and easily convertible near money.

Personal Income: The total income received by individuals from all sources, including wages, investments, and government transfers.

Trade Balance: The difference between a country's imports and exports, indicating the net trade flow.

Consumer Sentiment: The index reflecting consumer sentiment and expectations for the future economic outlook.

Consumer Confidence: A measure of how optimistic or pessimistic consumers are regarding their expected financial situation and the economy.

Notes on Interest Rates Please note that for the Federal Funds Effective Rate (FEDFUNDS), the dataset includes the absolute change in basis points (bps), not the rate of change. This means that the dataset reflects the direct change in the interest rate rather than the percentage change month-over-month. The change is represented in basis points, where 1 basis point equals 0.01%.

Search
Clear search
Close search
Google apps
Main menu