Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This comprehensive dataset offers detailed information on approximately 17,000 FIFA football players, meticulously scraped from SoFIFA.com.
It encompasses a wide array of player-specific data points, including but not limited to player names, nationalities, clubs, player ratings, potential, positions, ages, and various skill attributes. This dataset is ideal for football enthusiasts, data analysts, and researchers seeking to conduct in-depth analysis, statistical studies, or machine learning projects related to football players' performance, characteristics, and career progressions.
This dataset is ideal for data analysis, predictive modeling, and machine learning projects. It can be used for:
Please ensure to adhere to the terms of service of SoFIFA.com and relevant data protection laws when using this dataset. The dataset is intended for educational and research purposes only and should not be used for commercial gains without proper authorization.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
The FIFA Football Players Dataset is a comprehensive collection of information about football (soccer) players from around the world. This dataset offers a wealth of attributes related to each player, making it a valuable resource for various analyses and insights into the realm of football, both for gaming enthusiasts and real-world sports enthusiasts.
Attributes:
Potential Uses:
Player Performance Analysis: Evaluate the performance of football players based on their attributes. Club Analysis: Investigate clubs, player distribution, and club statistics. Positional Insights: Explore the attributes specific to player positions. Player Valuation Trends: Analyze how player values change over time. Data Visualization:Create visualizations for better data representation. Machine Learning Models: Develop predictive models for various football-related forecasts.
Before using the dataset for analysis, it's advisable to preprocess the data, such as converting the "value" column into a numerical format, handling missing values, and ensuring consistency in column names. This dataset is a valuable resource for gaining insights into football, both in the context of the FIFA video game and real-world football.
All thanks and credit goes to FIFA Index
http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
📂 About This Dataset This dataset combines detailed player performance statistics from WhoScored with team and player meta-data from Transfermarkt. It covers over 1,500 players from top European leagues and includes metrics such as:
Expected Goals (xG) & xG per 90
Tackles, Interceptions, Key Passes, Assists
Pass Accuracy, Crosses, Long Balls
Total Minutes Played & Formations
Player Age, Height, Positioning
🧩 Use Cases Player Rating Prediction
Team Formation Impact Analysis
Identifying Underrated Players via xG vs. Goals
Clustering Players by Style or Efficiency
Fantasy Football Recommendations
🏗️ Data Sources WhoScored.com: Player match stats, tactical analysis.
Transfermarkt.com: Player bio, team formations.
📊 Features Snapshot 32 Columns
Over 20 numerical performance metrics
Cleaned, ready-to-analyze format
Small number of missing values (mostly in passing stats)
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset provides comprehensive data on football players from various clubs around the world for the year 2024. It includes key player attributes such as name, age, height, nationality, and club affiliation, as well as positional details. The dataset is perfect for football analytics, performance tracking, and scouting purposes.
Context: The data is intended for football enthusiasts, analysts, and data scientists who are interested in exploring player statistics and trends in modern football. With information from players across different leagues and countries, this dataset can help in identifying patterns of player performance, comparing attributes, and understanding the distribution of talent across clubs.
Whether you’re interested in understanding how age affects player positions or comparing the height of defenders across leagues, this dataset provides the foundation for in-depth football analysis.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This contains more detailed information than the dataset from https://www.kaggle.com/datasets/codytipton/understat-data, which includes the individual player stats per game for the English Premier League, La Liga, Bundesliga, Serie A, Ligue 1, and the Russian Football Premier League. In particular, it contains each player's xG, xGBuildup, goals, and shots per game. Furthermore, it has the events for each shot in the events table, clubs and their stats per season in the clubs table, and each game with who lost, won, shots, possession, probabilities of who wins, ect..
This is for educational purposes in our data science bootcamp project.
⚽ Explore an extensive dataset featuring detailed player statistics exclusively from the top 7 European football leagues:
EPL (English Premier League)
Bundesliga 🇩🇪
La Liga 🇪🇸
Serie A 🇮🇹
Ligue 1 🇫🇷
Eredivisie 🇳🇱
Primeira Liga 🇵🇹
This dataset provides comprehensive insights into player performances, including attributes like goals, assists, minutes played, and other key metrics. Uncover in-depth player analyses and comparisons across leagues to fuel your football data-driven strategies and player evaluations! 📈🥅⚽
Comprehensive Football Player Statistics: 2023-2024 Season This dataset contains detailed player statistics from top football leagues for the 2023-2024 season. Sourced from FBref, the dataset includes a wide range of metrics covering various aspects of player performance, such as defense, goalkeeping, passing, and shooting.
Key Features Detailed Player Metrics: Statistics for individual players across multiple performance areas. Structured Data: Organized into tables focusing on different aspects of the game for easy analysis. Top Leagues: Includes data from prominent leagues that provide comprehensive detailed stats.
Github Repository link of the project : https://github.com/GuechtouliAnis/Football-Data-Scraping
By: Guechtouli Anis
This dataset provides an in-depth look at the 2023/24 La Liga season, covering various aspects of team and player performances across all matchdays. With over 50 individual CSV files, the dataset includes statistics on passing accuracy, goal-scoring, defensive actions, possession metrics, and player ratings, among others. Whether you're interested in analyzing top scorers, understanding team strengths, or delving into player-specific contributions, this dataset offers a rich foundation for football analytics enthusiasts and professionals.
In addition to the core dataset, we have now added more files related to the league table, expanding the dataset with essential information on match outcomes, league standings, and advanced metrics.
The dataset contains the following types of data:
The file details provide an overview of each dataset, including a brief description of the data structure and potential uses for analysis. This helps users quickly navigate and understand the data available for analysis.
This dataset is ideal for statistical analysis, data visualization, and machine learning applications to uncover patterns in football performance.
This dataset opens up multiple avenues for data analysis and visualization. Here are some ideas:
This dataset is a valuable resource for football enthusiasts, data scientists, and analysts interested in uncovering patterns, building predictive models, or generating insights for La Liga 2023/24.
This dataset is shared for non-commercial, educational, and personal analysis purposes only. It is not intended for redistribution, commercial use, or integration into other public datasets.
This dataset was sourced from FotMob, a proprietary provider of football statistics. All rights to the original data belong to FotMob. The dataset is a restructured collection of publicly viewable data and does not claim ownership over FotMob's data. Users should reference FotMob as the original source when using this dataset for research or analysis.
By using this dataset, you agree to the following: - Non-commercial Use: This dataset is only for educational, analytical, and personal use. It may not be used for commercial purposes or integrated into other public datasets. - Proper Attribution: Please attribu...
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
La Liga Players Performance Dataset
This dataset provides a comprehensive overview of player performance in the La Liga capturing a wide array of metrics related to gameplay, scoring, passing, and defensive actions. With records detailing individual player statistics across different teams, this dataset is a valuable resource for analysts, data scientists, and fans who are interested in diving into player performance data from one of the world’s top soccer leagues.
Each entry represents a single player's profile, featuring data on expected goals (xG), expected assists (xAG), touches, dribbles, tackles, and more. This dataset is ideal for analyzing various aspects of player contribution, both offensively and defensively, and understanding their impact on team performance.
Dataset Columns
Player: Name of the player Team: Team the player belongs to '#' : Player's jersey number Nation: Nationality of the player Position: Primary playing position on the field Age: Age of the player Minutes: Total minutes played Goals: Number of goals scored Assists: Number of assists Penalty Shoot on Goal: Penalty shots taken on goal Penalty Shoot: Total penalty shots attempted Total Shoot: Total shots attempted Shoot on Target: Shots successfully on target Yellow Cards: Number of yellow cards received Red Cards: Number of red cards received Touches: Total ball touches Dribbles: Total dribbles attempted Tackles: Total tackles made Blocks: Total blocks Expected Goals (xG): Expected goals, calculated based on shooting positions and likelihood of scoring Non-Penalty xG (npxG): Expected goals excluding penalties Expected Assists (xAG): Expected assists, based on actions leading to an expected goal (xG) Shot-Creating Actions: Actions leading to a shot attempt Goal-Creating Actions: Actions leading to a goal Passes Completed: Successful passes completed Passes Attempted: Total passes attempted Pass Completion %: Pass completion rate, expressed as a percentage (some entries have missing values here) Progressive Passes: Passes advancing the ball significantly toward the opponent’s goal Carries: Total ball carries Progressive Carries: Carries advancing the ball significantly toward the opponent’s goal Dribble Attempts: Total dribbles attempted Successful Dribbles: Total successful dribbles Date: Date of record collection or game date
Potential Use Cases
Data Visualization: Explore relationships between various performance metrics to identify patterns.
Player Comparisons: Compare individual players based on goals, assists, xG, xAG, and other metrics.
Team Analysis: Evaluate contributions of players within the same team to gain insights into team dynamics.
Predictive Modeling: Use the dataset to build models for predicting game outcomes, goals, or assists based on player performance metrics.
This dataset was created by Jaseem Mohammed
It contains the following files:
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This is a data dump of the football section of Statbunker's searchable football statistics database. I have uploaded League data for these European leagues:
I have pulled data for the following seasons:
Based on the following disciplines:
All data pulled can be found on the Statbunker website: https://www.statbunker.com/
For anyone who enjoys footbal, and analyzing football stats. Please feel free to run kernels!
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
So, I was trying to predict the rating of players in the FIFA21 game which is going to be released in the coming weeks by using their individual performance in the previous season and the rating in the previous edition of the game FIFA20. But, I couldn't find a dataset for this. So I had no option other than to scrape data from the PL website itself.
Each row in the datasets represents each player's performance in that particular season. It starts with Name, Position, Appearances, and the statistics of his performance throughout the season. Some entries are null because those attributes don't correspond to the position in which the player actually plays, for instance, a Forward will not have Number of saves; it doesn't make sense.
To all those football freaks like me, Feel free to use this dataset
Let me know if there's an error
This dataset is undertaken to create a predictive model for the transfer values of football players. We will utilize data from football players and construct a model to predict transfer fees based on that data. Player data includes basic information such as age, height, playing position, as well as professional statistics like goal scoring, assists (in 2 season 2021-2022 and 2022-2023), injuries, along with total individual and team awards in their career.
We had gathered information on players competing in several top-tier global football leagues:
11 European leagues, including the Premier League and Championship in England, Bundesliga in Germany, La Liga in Spain, Serie A in Italy, Ligue 1 in France, Eredivisie in the Netherlands, Liga NOS in Portugal, Premier Liga in Russia, Super Lig in Turkey, and Bundesliga in Austria.
4 American leagues, including Brasileiro in Brazil, Major League Soccer in the United States, Primera División in Argentina, and Liga MX in Mexico.
1 African league, namely the DStv Premiership in South Africa.
4 Asian leagues, comprising J-League in Japan, Saudi Pro League in Saudi Arabia, K-League 1 in South Korea, and A-League in Australia.
Most publicly available football (soccer) statistics are limited to aggregated data such as Goals, Shots, Fouls, Cards. When assessing performance or building predictive models, this simple aggregation, without any context, can be misleading. For example, a team that produced 10 shots on target from long range has a lower chance of scoring than a club that produced the same amount of shots from inside the box. However, metrics derived from this simple count of shots will similarly asses the two teams.
A football game generates much more events and it is very important and interesting to take into account the context in which those events were generated. This dataset should keep sports analytics enthusiasts awake for long hours as the number of questions that can be asked is huge.
This dataset is a result of a very tiresome effort of webscraping and integrating different data sources. The central element is the text commentary. All the events were derived by reverse engineering the text commentary, using regex. Using this, I was able to derive 11 types of events, as well as the main player and secondary player involved in those events and many other statistics. In case I've missed extracting some useful information, you are gladly invited to do so and share your findings. The dataset provides a granular view of 9,074 games, totaling 941,009 events from the biggest 5 European football (soccer) leagues: England, Spain, Germany, Italy, France from 2011/2012 season to 2016/2017 season as of 25.01.2017. There are games that have been played during these seasons for which I could not collect detailed data. Overall, over 90% of the played games during these seasons have event data.
The dataset is organized in 3 files:
I have used this data to:
There are tons of interesting questions a sports enthusiast can answer with this dataset. For example:
And many many more...
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I had the need to collect Europe's top 5 leagues' dataset for my own undergraduate project. The idea was to eliminate human bias from the player scouting process.
More Details: https://github.com/Suwadith/Winning-Eleven-Scout-Evaluation-and-Analysis-to-Enhance-Football-Player-Recommendations-ML-Flask
This dataset contains individual player statistics from Europe's top 5 leagues 2009 - 2018. Leagues included: La Liga, Bundesliga, Serie A, Ligue 1, Premier League Types of stats: Offensive, Defensive, Passing, Overall Summary
This dataset was compiled from the https://www.whoscored.com website
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
NFL passing statistics since 2001. Contains record of every player who attempted a pass within the time period. Tracked metrics include passing yards, passing touchdowns, pass attempts, completions, interceptions, and touchdown/interception/completion percentages. More advanced metrics like yards per attempt, adjusted net yards per attempt, and other similar metrics are also included. I used this dataset, accompanied with the NFL Rushing Statistics dataset to predict the NFL MVP winner in 2024.
This dataset was created by Suv Sanjit Patnaik
It contains the following files:
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset has an exhaustive list of player statistics for each season from 2013 - 2020
Each row is associated with a player and a season. Eg. You will have 7 rows for Lionel Messi: 1 for each season he played
Each row will have 103 unique stats you can look at (Eg. No of Goals Scored, Passing accuracy in %, Minutes Played etc)
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains detailed league performance statistics for the nominees of the 2024 Ballon D'Or
across major European football leagues. The stats cover the 2023-2024
season, showcasing metrics such as goals
, assists
, expected goals (xG)
, expected assists (xAG)
, progression metrics
, and more.
The winner of the Men's Ballon d'Or goes to the best male player voted by a panel of soccer journalists representing the top 100 countries in the FIFA Men's Rankings.
For the first time since 2003, though, Cristiano Ronaldo and Lionel Messi were not included among the nominees!
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Dataset of Arsenal matches and player statistics from 2017/18 season until 28.02.2023.
Content This dataset includes 214 results of Arsenal FC matches starting from 2017/18 season up to 28.02.2022, 2741 record of every player, who played in these matches with advanced statistics and 218 records of every goalkeeper.
Dataset has three files: matches.csv - every Arsenal FC match between the 2017-18 season and February 28, 2023. players.csv - every player who played for Arsenal between the 2017-18 season and February 28, 2023. goalkeepers.csv - every goalkeeper who played for Arsenal between the 2017-18 season and February 28, 2023.
All columns descriptions you can find in README file.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This comprehensive dataset offers detailed information on approximately 17,000 FIFA football players, meticulously scraped from SoFIFA.com.
It encompasses a wide array of player-specific data points, including but not limited to player names, nationalities, clubs, player ratings, potential, positions, ages, and various skill attributes. This dataset is ideal for football enthusiasts, data analysts, and researchers seeking to conduct in-depth analysis, statistical studies, or machine learning projects related to football players' performance, characteristics, and career progressions.
This dataset is ideal for data analysis, predictive modeling, and machine learning projects. It can be used for:
Please ensure to adhere to the terms of service of SoFIFA.com and relevant data protection laws when using this dataset. The dataset is intended for educational and research purposes only and should not be used for commercial gains without proper authorization.