4 datasets found
  1. Online Sales Dataset - Popular Marketplace Data

    • kaggle.com
    Updated May 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShreyanshVerma27 (2024). Online Sales Dataset - Popular Marketplace Data [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/online-sales-dataset-popular-marketplace-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 25, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ShreyanshVerma27
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.

    Columns:

    • Order ID: Unique identifier for each sales order.
    • Date:Date of the sales transaction.
    • Category:Broad category of the product sold (e.g., Electronics, Home Appliances, Clothing, Books, Beauty Products, Sports).
    • Product Name:Specific name or model of the product sold.
    • Quantity:Number of units of the product sold in the transaction.
    • Unit Price:Price of one unit of the product.
    • Total Price: Total revenue generated from the sales transaction (Quantity * Unit Price).
    • Region:Geographic region where the transaction occurred (e.g., North America, Europe, Asia).
    • Payment Method: Method used for payment (e.g., Credit Card, PayPal, Debit Card).

    Insights:

    • 1. Analyze sales trends over time to identify seasonal patterns or growth opportunities.
    • 2. Explore the popularity of different product categories across regions.
    • 3. Investigate the impact of payment methods on sales volume or revenue.
    • 4. Identify top-selling products within each category to optimize inventory and marketing strategies.
    • 5. Evaluate the performance of specific products or categories in different regions to tailor marketing campaigns accordingly.
  2. Football Players Data

    • kaggle.com
    Updated Nov 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Masood Ahmed (2023). Football Players Data [Dataset]. http://doi.org/10.34740/kaggle/dsv/6960429
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 13, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Masood Ahmed
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Description:

    This comprehensive dataset offers detailed information on approximately 17,000 FIFA football players, meticulously scraped from SoFIFA.com.

    It encompasses a wide array of player-specific data points, including but not limited to player names, nationalities, clubs, player ratings, potential, positions, ages, and various skill attributes. This dataset is ideal for football enthusiasts, data analysts, and researchers seeking to conduct in-depth analysis, statistical studies, or machine learning projects related to football players' performance, characteristics, and career progressions.

    Features:

    • name: Name of the player.
    • full_name: Full name of the player.
    • birth_date: Date of birth of the player.
    • age: Age of the player.
    • height_cm: Player's height in centimeters.
    • weight_kgs: Player's weight in kilograms.
    • positions: Positions the player can play.
    • nationality: Player's nationality.
    • overall_rating: Overall rating of the player in FIFA.
    • potential: Potential rating of the player in FIFA.
    • value_euro: Market value of the player in euros.
    • wage_euro: Weekly wage of the player in euros.
    • preferred_foot: Player's preferred foot.
    • international_reputation(1-5): International reputation rating from 1 to 5.
    • weak_foot(1-5): Rating of the player's weaker foot from 1 to 5.
    • skill_moves(1-5): Skill moves rating from 1 to 5.
    • body_type: Player's body type.
    • release_clause_euro: Release clause of the player in euros.
    • national_team: National team of the player.
    • national_rating: Rating in the national team.
    • national_team_position: Position in the national team.
    • national_jersey_number: Jersey number in the national team.
    • crossing: Rating for crossing ability.
    • finishing: Rating for finishing ability.
    • heading_accuracy: Rating for heading accuracy.
    • short_passing: Rating for short passing ability.
    • volleys: Rating for volleys.
    • dribbling: Rating for dribbling.
    • curve: Rating for curve shots.
    • freekick_accuracy: Rating for free kick accuracy.
    • long_passing: Rating for long passing.
    • ball_control: Rating for ball control.
    • acceleration: Rating for acceleration.
    • sprint_speed: Rating for sprint speed.
    • agility: Rating for agility.
    • reactions: Rating for reactions.
    • balance: Rating for balance.
    • shot_power: Rating for shot power.
    • jumping: Rating for jumping.
    • stamina: Rating for stamina.
    • strength: Rating for strength.
    • long_shots: Rating for long shots.
    • aggression: Rating for aggression.
    • interceptions: Rating for interceptions.
    • positioning: Rating for positioning.
    • vision: Rating for vision.
    • penalties: Rating for penalties.
    • composure: Rating for composure.
    • marking: Rating for marking.
    • standing_tackle: Rating for standing tackle.
    • sliding_tackle: Rating for sliding tackle.

    Use Case:

    This dataset is ideal for data analysis, predictive modeling, and machine learning projects. It can be used for:

    • Player performance analysis and comparison.
    • Market value assessment and wage prediction.
    • Team composition and strategy planning.
    • Machine learning models to predict future player potential and career trajectories.

    Note:

    Please ensure to adhere to the terms of service of SoFIFA.com and relevant data protection laws when using this dataset. The dataset is intended for educational and research purposes only and should not be used for commercial gains without proper authorization.

  3. Heart Disease Prediction Dataset

    • kaggle.com
    Updated Sep 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M. Farhaan Nazirkhan (2024). Heart Disease Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/mfarhaannazirkhan/heart-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 27, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    M. Farhaan Nazirkhan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Heart Disease Prediction Dataset

    This dataset contains 1,888 records merged from five publicly available heart disease datasets. It includes 14 features that are crucial for predicting heart attack and stroke risks, covering both medical and demographic factors. Below is a detailed description of each feature.

    Feature Descriptions:

    1. age: Age of the patient (Numeric).
    2. sex: Gender of the patient. Values: 1 = male, 0 = female.
    3. cp: Chest pain type. Values: 0 = Typical angina, 1 = Atypical angina, 2 = Non-anginal pain, 3 = Asymptomatic.
    4. trestbps: Resting Blood Pressure (in mm Hg) (Numeric).
    5. chol: Serum Cholesterol level (in mg/dl) (Numeric).
    6. fbs: Fasting blood sugar > 120 mg/dl. Values: 1 = true, 0 = false.
    7. restecg: Resting electrocardiographic results. Values: 0 = Normal, 1 = ST-T wave abnormality, 2 = Left ventricular hypertrophy.
    8. thalach: Maximum heart rate achieved (Numeric).
    9. exang: Exercise-induced angina. Values: 1 = yes, 0 = no.
    10. oldpeak: ST depression induced by exercise relative to rest (Numeric).
    11. slope: Slope of the peak exercise ST segment. Values: 0 = Upsloping, 1 = Flat, 2 = Downsloping.
    12. ca: Number of major vessels (0-3) colored by fluoroscopy. Values: 0, 1, 2, 3.
    13. thal: Thalassemia types. Values: 1 = Normal, 2 = Fixed defect, 3 = Reversible defect.
    14. target: Outcome variable (heart attack risk). Values: 1 = more chance of heart attack, 0 = less chance of heart attack.

    Dataset Details:

    This dataset is a combination of five publicly available heart disease datasets, with a total of 1,888 records. Merging these datasets provides a more robust foundation for training machine learning models aimed at predicting heart attack risk.

    Datasets Used:

    1. Heart Attack Analysis & Prediction Dataset
      Number of Records: 304
      Reference: Rahman, 2021

    2. Heart Disease Dataset
      Number of Records: 1,026
      Reference: Lapp, 2019

    3. Heart Attack Prediction (Dataset 3)
      Number of Records: 295
      Reference: Damarla, 2020

    4. Heart Attack Prediction (Dataset 4)
      Number of Records: 271
      Reference: Anand, 2018

    5. Heart CSV Dataset
      Number of Records: 290
      Reference: Nandal, 2022

    Description:

    This dataset includes 14 features known to contribute to heart attack risk. It is ideal for training machine learning models aimed at early detection and prevention of heart disease. The records have been cleaned by removing missing data to ensure data integrity. This dataset can be applied to various machine learning algorithms, including classification models such as Decision Trees, Neural Networks, and others.

  4. Data from: ERA5 hourly data on single levels from 1940 to present

    • cds.climate.copernicus.eu
    • search-sandbox-2.test.dataone.org
    • +1more
    grib
    Updated Sep 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ECMWF (2025). ERA5 hourly data on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.adbb2d47
    Explore at:
    gribAvailable download formats
    Dataset updated
    Sep 27, 2025
    Dataset provided by
    European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
    Authors
    ECMWF
    License

    https://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdfhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdf

    Time period covered
    Jan 1, 1940 - Sep 21, 2025
    Description

    ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. ERA5 provides hourly estimates for a large number of atmospheric, ocean-wave and land-surface quantities. An uncertainty estimate is sampled by an underlying 10-member ensemble at three-hourly intervals. Ensemble mean and spread have been pre-computed for convenience. Such uncertainty estimates are closely related to the information content of the available observing system which has evolved considerably over time. They also indicate flow-dependent sensitive areas. To facilitate many climate applications, monthly-mean averages have been pre-calculated too, though monthly means are not available for the ensemble mean and spread. ERA5 is updated daily with a latency of about 5 days. In case that serious flaws are detected in this early release (called ERA5T), this data could be different from the final release 2 to 3 months later. In case that this occurs users are notified. The data set presented here is a regridded subset of the full ERA5 data set on native resolution. It is online on spinning disk, which should ensure fast and easy access. It should satisfy the requirements for most common applications. An overview of all ERA5 datasets can be found in this article. Information on access to ERA5 data on native resolution is provided in these guidelines. Data has been regridded to a regular lat-lon grid of 0.25 degrees for the reanalysis and 0.5 degrees for the uncertainty estimate (0.5 and 1 degree respectively for ocean waves). There are four main sub sets: hourly and monthly products, both on pressure levels (upper air fields) and single levels (atmospheric, ocean-wave and land surface quantities). The present entry is "ERA5 hourly data on single levels from 1940 to present".

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
ShreyanshVerma27 (2024). Online Sales Dataset - Popular Marketplace Data [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/online-sales-dataset-popular-marketplace-data
Organization logo

Online Sales Dataset - Popular Marketplace Data

Global Transactions Across Various Product Categories

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 25, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
ShreyanshVerma27
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.

Columns:

  • Order ID: Unique identifier for each sales order.
  • Date:Date of the sales transaction.
  • Category:Broad category of the product sold (e.g., Electronics, Home Appliances, Clothing, Books, Beauty Products, Sports).
  • Product Name:Specific name or model of the product sold.
  • Quantity:Number of units of the product sold in the transaction.
  • Unit Price:Price of one unit of the product.
  • Total Price: Total revenue generated from the sales transaction (Quantity * Unit Price).
  • Region:Geographic region where the transaction occurred (e.g., North America, Europe, Asia).
  • Payment Method: Method used for payment (e.g., Credit Card, PayPal, Debit Card).

Insights:

  • 1. Analyze sales trends over time to identify seasonal patterns or growth opportunities.
  • 2. Explore the popularity of different product categories across regions.
  • 3. Investigate the impact of payment methods on sales volume or revenue.
  • 4. Identify top-selling products within each category to optimize inventory and marketing strategies.
  • 5. Evaluate the performance of specific products or categories in different regions to tailor marketing campaigns accordingly.
Search
Clear search
Close search
Google apps
Main menu