4 datasets found

Online Sales Dataset - Popular Marketplace Data
kaggle.com
Updated May 25, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ShreyanshVerma27 (2024). Online Sales Dataset - Popular Marketplace Data [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/online-sales-dataset-popular-marketplace-data
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 25, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
ShreyanshVerma27
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.

Columns:

Order ID: Unique identifier for each sales order.

Date:Date of the sales transaction.

Category:Broad category of the product sold (e.g., Electronics, Home Appliances, Clothing, Books, Beauty Products, Sports).

Product Name:Specific name or model of the product sold.

Quantity:Number of units of the product sold in the transaction.

Unit Price:Price of one unit of the product.

Total Price: Total revenue generated from the sales transaction (Quantity * Unit Price).

Region:Geographic region where the transaction occurred (e.g., North America, Europe, Asia).

Payment Method: Method used for payment (e.g., Credit Card, PayPal, Debit Card).

Insights:

1. Analyze sales trends over time to identify seasonal patterns or growth opportunities.

2. Explore the popularity of different product categories across regions.

3. Investigate the impact of payment methods on sales volume or revenue.

4. Identify top-selling products within each category to optimize inventory and marketing strategies.

5. Evaluate the performance of specific products or categories in different regions to tailor marketing campaigns accordingly.
Football Players Data
kaggle.com
Updated Nov 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Masood Ahmed (2023). Football Players Data [Dataset]. http://doi.org/10.34740/kaggle/dsv/6960429
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.34740/kaggle/dsv/6960429
Dataset updated
Nov 13, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Masood Ahmed
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Description:

This comprehensive dataset offers detailed information on approximately 17,000 FIFA football players, meticulously scraped from SoFIFA.com.

It encompasses a wide array of player-specific data points, including but not limited to player names, nationalities, clubs, player ratings, potential, positions, ages, and various skill attributes. This dataset is ideal for football enthusiasts, data analysts, and researchers seeking to conduct in-depth analysis, statistical studies, or machine learning projects related to football players' performance, characteristics, and career progressions.

Features:

name: Name of the player.

full_name: Full name of the player.

birth_date: Date of birth of the player.

age: Age of the player.

height_cm: Player's height in centimeters.

weight_kgs: Player's weight in kilograms.

positions: Positions the player can play.

nationality: Player's nationality.

overall_rating: Overall rating of the player in FIFA.

potential: Potential rating of the player in FIFA.

value_euro: Market value of the player in euros.

wage_euro: Weekly wage of the player in euros.

preferred_foot: Player's preferred foot.

international_reputation(1-5): International reputation rating from 1 to 5.

weak_foot(1-5): Rating of the player's weaker foot from 1 to 5.

skill_moves(1-5): Skill moves rating from 1 to 5.

body_type: Player's body type.

release_clause_euro: Release clause of the player in euros.

national_team: National team of the player.

national_rating: Rating in the national team.

national_team_position: Position in the national team.

national_jersey_number: Jersey number in the national team.

crossing: Rating for crossing ability.

finishing: Rating for finishing ability.

heading_accuracy: Rating for heading accuracy.

short_passing: Rating for short passing ability.

volleys: Rating for volleys.

dribbling: Rating for dribbling.

curve: Rating for curve shots.

freekick_accuracy: Rating for free kick accuracy.

long_passing: Rating for long passing.

ball_control: Rating for ball control.

acceleration: Rating for acceleration.

sprint_speed: Rating for sprint speed.

agility: Rating for agility.

reactions: Rating for reactions.

balance: Rating for balance.

shot_power: Rating for shot power.

jumping: Rating for jumping.

stamina: Rating for stamina.

strength: Rating for strength.

long_shots: Rating for long shots.

aggression: Rating for aggression.

interceptions: Rating for interceptions.

positioning: Rating for positioning.

vision: Rating for vision.

penalties: Rating for penalties.

composure: Rating for composure.

marking: Rating for marking.

standing_tackle: Rating for standing tackle.

sliding_tackle: Rating for sliding tackle.

Use Case:

This dataset is ideal for data analysis, predictive modeling, and machine learning projects. It can be used for:

Player performance analysis and comparison.

Market value assessment and wage prediction.

Team composition and strategy planning.

Machine learning models to predict future player potential and career trajectories.

Note:

Please ensure to adhere to the terms of service of SoFIFA.com and relevant data protection laws when using this dataset. The dataset is intended for educational and research purposes only and should not be used for commercial gains without proper authorization.
Heart Disease Prediction Dataset
kaggle.com
Updated Sep 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
M. Farhaan Nazirkhan (2024). Heart Disease Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/mfarhaannazirkhan/heart-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 27, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
M. Farhaan Nazirkhan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Heart Disease Prediction Dataset

This dataset contains 1,888 records merged from five publicly available heart disease datasets. It includes 14 features that are crucial for predicting heart attack and stroke risks, covering both medical and demographic factors. Below is a detailed description of each feature.

Feature Descriptions:

age: Age of the patient (Numeric).

sex: Gender of the patient. Values: 1 = male, 0 = female.

cp: Chest pain type. Values: 0 = Typical angina, 1 = Atypical angina, 2 = Non-anginal pain, 3 = Asymptomatic.

trestbps: Resting Blood Pressure (in mm Hg) (Numeric).

chol: Serum Cholesterol level (in mg/dl) (Numeric).

fbs: Fasting blood sugar > 120 mg/dl. Values: 1 = true, 0 = false.

restecg: Resting electrocardiographic results. Values: 0 = Normal, 1 = ST-T wave abnormality, 2 = Left ventricular hypertrophy.

thalach: Maximum heart rate achieved (Numeric).

exang: Exercise-induced angina. Values: 1 = yes, 0 = no.

oldpeak: ST depression induced by exercise relative to rest (Numeric).

slope: Slope of the peak exercise ST segment. Values: 0 = Upsloping, 1 = Flat, 2 = Downsloping.

ca: Number of major vessels (0-3) colored by fluoroscopy. Values: 0, 1, 2, 3.

thal: Thalassemia types. Values: 1 = Normal, 2 = Fixed defect, 3 = Reversible defect.

target: Outcome variable (heart attack risk). Values: 1 = more chance of heart attack, 0 = less chance of heart attack.

Dataset Details:

This dataset is a combination of five publicly available heart disease datasets, with a total of 1,888 records. Merging these datasets provides a more robust foundation for training machine learning models aimed at predicting heart attack risk.

Datasets Used:

Heart Attack Analysis & Prediction Dataset
Number of Records: 304
Reference: Rahman, 2021

Heart Disease Dataset
Number of Records: 1,026
Reference: Lapp, 2019

Heart Attack Prediction (Dataset 3)
Number of Records: 295
Reference: Damarla, 2020

Heart Attack Prediction (Dataset 4)
Number of Records: 271
Reference: Anand, 2018

Heart CSV Dataset
Number of Records: 290
Reference: Nandal, 2022

Description:

This dataset includes 14 features known to contribute to heart attack risk. It is ideal for training machine learning models aimed at early detection and prevention of heart disease. The records have been cleaned by removing missing data to ensure data integrity. This dataset can be applied to various machine learning algorithms, including classification models such as Decision Trees, Neural Networks, and others.
Data from: ERA5 hourly data on single levels from 1940 to present
cds.climate.copernicus.eu
search-sandbox-2.test.dataone.org
+1more
grib
Updated Sep 27, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECMWF (2025). ERA5 hourly data on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.adbb2d47
Explore at:
gribAvailable download formats
Unique identifier
https://doi.org/10.24381/cds.adbb2d47
Dataset updated
Sep 27, 2025
Dataset provided by
European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
Authors
ECMWF
License
https://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdfhttps://object-store.os-api.cci2.ecmwf.int:443/cci2-prod-catalogue/licences/cc-by/cc-by_f24dc630aa52ab8c52a0ac85c03bc35e0abc850b4d7453bdc083535b41d5a5c3.pdf
Time period covered
Jan 1, 1940 - Sep 21, 2025
Description
ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. ERA5 provides hourly estimates for a large number of atmospheric, ocean-wave and land-surface quantities. An uncertainty estimate is sampled by an underlying 10-member ensemble at three-hourly intervals. Ensemble mean and spread have been pre-computed for convenience. Such uncertainty estimates are closely related to the information content of the available observing system which has evolved considerably over time. They also indicate flow-dependent sensitive areas. To facilitate many climate applications, monthly-mean averages have been pre-calculated too, though monthly means are not available for the ensemble mean and spread. ERA5 is updated daily with a latency of about 5 days. In case that serious flaws are detected in this early release (called ERA5T), this data could be different from the final release 2 to 3 months later. In case that this occurs users are notified. The data set presented here is a regridded subset of the full ERA5 data set on native resolution. It is online on spinning disk, which should ensure fast and easy access. It should satisfy the requirements for most common applications. An overview of all ERA5 datasets can be found in this article. Information on access to ERA5 data on native resolution is provided in these guidelines. Data has been regridded to a regular lat-lon grid of 0.25 degrees for the reanalysis and 0.5 degrees for the uncertainty estimate (0.5 and 1 degree respectively for ocean waves). There are four main sub sets: hourly and monthly products, both on pressure levels (upper air fields) and single levels (atmospheric, ocean-wave and land surface quantities). The present entry is "ERA5 hourly data on single levels from 1940 to present".
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

ShreyanshVerma27 (2024). Online Sales Dataset - Popular Marketplace Data [Dataset]. https://www.kaggle.com/datasets/shreyanshverma27/online-sales-dataset-popular-marketplace-data

Online Sales Dataset - Popular Marketplace Data

Global Transactions Across Various Product Categories

Explore at:

3 scholarly articles cite this dataset (View in Google Scholar)

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 25, 2024

Dataset provided by

Kagglehttp://kaggle.com/

Authors

ShreyanshVerma27

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.

Columns:

Order ID: Unique identifier for each sales order.
Date:Date of the sales transaction.
Category:Broad category of the product sold (e.g., Electronics, Home Appliances, Clothing, Books, Beauty Products, Sports).
Product Name:Specific name or model of the product sold.
Quantity:Number of units of the product sold in the transaction.
Unit Price:Price of one unit of the product.
Total Price: Total revenue generated from the sales transaction (Quantity * Unit Price).
Region:Geographic region where the transaction occurred (e.g., North America, Europe, Asia).
Payment Method: Method used for payment (e.g., Credit Card, PayPal, Debit Card).

Insights:

1. Analyze sales trends over time to identify seasonal patterns or growth opportunities.
2. Explore the popularity of different product categories across regions.
3. Investigate the impact of payment methods on sales volume or revenue.
4. Identify top-selling products within each category to optimize inventory and marketing strategies.
5. Evaluate the performance of specific products or categories in different regions to tailor marketing campaigns accordingly.

Clear search

Close search

Google apps

Main menu

Online Sales Dataset - Popular Marketplace Data

Columns:

Insights:

Football Players Data

Description:

Features:

Use Case:

Note:

Heart Disease Prediction Dataset

Heart Disease Prediction Dataset

Feature Descriptions:

Dataset Details:

Datasets Used:

Description:

Data from: ERA5 hourly data on single levels from 1940 to present

Online Sales Dataset - Popular Marketplace Data

Global Transactions Across Various Product Categories

Columns:

Insights: