Facebook
TwitterIn our Data Science lesson, we tried to predict the value of some soccer players, using their performance and their last market value. As we have not found a dataset on Kaggle that was convenient to us, we have tried to create our own dataset merging two ones finding on this platform. The 2 datasets are : ''Soccer players values and their statistics'' and ''Top Football Leagues Scorers''.
The data are only from the season 2019-2020. We have 88 players remaining. Our work is not finish and can be significantly improved, particularly by increasing the number of player.
Thanks to Mohamed Hany and RSKriegs for their datasets.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Welcome to my first real-world football dataset, scraped from Transfermarkt, containing detailed market value data for 499 Premier League players (2025).
This dataset includes the following attributes for each player:
Each field was carefully extracted and cleaned from public sources using custom Python scripts (available on GitHub below).
This is just Phase 1. My goal is to:
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F16731800%2Ff0d45220cad473000b1e59942548dd45%2Fanimated_bubble_chart.gif?generation=1705615116968842&alt=media" alt="">This comprehensive football dataset, derived primarily from Transfermarkt, serves as a valuable resource for football enthusiasts, offering structured information on competitions, clubs, and players. With over 60,000 games across major global competitions, the dataset delves into the performance metrics of 400+ clubs and detailed statistics for more than 30,000 players.
Structured in CSV files, each with unique IDs, users can seamlessly join datasets to perform in-depth analyses. The dataset encompasses market values, historical valuations, and detailed player statistics, including physical attributes, contract statuses, and individual performances. A specialized Python-based web scraper ensures consistent updates, with data meticulously processed through Python scripts and SQL databases.
To use the dataset effectively, users are encouraged to understand the relevant files, join datasets using unique IDs, and leverage compatible software tools like Python's pandas or R's ggplot2 for analysis. The guide emphasizes the potential for fantasy football predictions, tracking player value over time, assessing market value versus performance, and exploring the impact of cards on match outcomes.
Research ideas include player performance analysis for fantasy football or recruitment purposes, studying market value trends for economic insights, evaluating club performance for strategic decision-making, developing predictive models for match outcomes, and conducting social network analysis to understand interactions among clubs and players.
Acknowledging the dataset's unknown license, users are encouraged to credit the original authors, particularly David Cereijo, if used in research. The dataset's dedication to accessibility is evident through active discussions on GitHub for improvements and bug fixes.
In conclusion, this football dataset offers a wealth of information, empowering users to explore diverse analyses and research ideas, bridging the gap between structured data and the dynamic world of football.
Facebook
Twitterhttps://datacatalog.worldbank.org/public-licenses?fragment=cchttps://datacatalog.worldbank.org/public-licenses?fragment=cc
Economic Fitness (EF) is both a measure of a country’s diversification and ability to produce complex goods on a globally competitive basis. The Universal Economic Fitness (UEF) extends this assessment to include services. Countries with the highest levels of EF or UEF have capabilities to produce a diverse portfolio of products and services, the ability to upgrade into ever-increasing complex industries, tend to have more predictable long-term growth, and attain a good competitive position relative to other countries. Countries with low EF or UEF levels tend to suffer from poverty, low capabilities, less predictable growth, low value-addition, and trouble upgrading and diversifying faster than other countries. The starting data is the UN-COMTRADE list of products and the IMF-BOP list of services exported by each country. This data defines a bipartite network of countries and industries. A suitably designed mathematical algorithm applied to this network leads to the Fitness of all countries and the Complexity of all sectors. The comparison of Fitness to the GDP reveals hidden information about the capabilities, development, and growth of countries.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
A collection of 2,010 forward players from the 2024-2025 soccer season, extracted from Transfermarkt.com. This dataset provides detailed performance metrics and market valuations for attacking players across the world's top 15 leagues and competitions.
| Variable | Description |
|---|---|
name | Player full name |
position | Attacking position (CF, LW, RW, SS) |
age | Player age in years |
nation | Player nationality |
club | N/A |
league | N/A |
matches | Matches played (2024-25) |
goals | Goals scored |
assists | Assists provided |
points(goals+assists) | Combined performance metric |
value | Market value (Euros) |
player_link | Transfermarkt profile URL |
Facebook
TwitterThis dataset was created by Abdulmajeed33
Facebook
Twitterhttps://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 2.18(USD Billion) |
| MARKET SIZE 2025 | 2.35(USD Billion) |
| MARKET SIZE 2035 | 5.0(USD Billion) |
| SEGMENTS COVERED | Application, Platform, Users, Functionality, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | increasing health consciousness, rising smartphone usage, growing fitness trends, demand for personalized nutrition, integration of wearable technology |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | FatSecret, Nutracheck, Eat This Much, Cronometer, Fitbit, MyFitnessPal, SparkPeople, Lifesum, Yummly, Apple Health, Diet Organizer, Google Fit, Noom, Calorie Counter by Green Guava, Samsung Health, Lose It |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Integration with wearable devices, Personalized nutrition plans, Gamification features, AI-driven insights, Multilingual support features |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 7.8% (2025 - 2035) |
Facebook
TwitterThis dataset was created by sean ngeo
Facebook
TwitterThis dataset is undertaken to create a predictive model for the transfer values of football players. We will utilize data from football players and construct a model to predict transfer fees based on that data. Player data includes basic information such as age, height, playing position, as well as professional statistics like goal scoring, assists (in 2 season 2021-2022 and 2022-2023), injuries, along with total individual and team awards in their career.
We had gathered information on players competing in several top-tier global football leagues:
11 European leagues, including the Premier League and Championship in England, Bundesliga in Germany, La Liga in Spain, Serie A in Italy, Ligue 1 in France, Eredivisie in the Netherlands, Liga NOS in Portugal, Premier Liga in Russia, Super Lig in Turkey, and Bundesliga in Austria.
4 American leagues, including Brasileiro in Brazil, Major League Soccer in the United States, Primera División in Argentina, and Liga MX in Mexico.
1 African league, namely the DStv Premiership in South Africa.
4 Asian leagues, comprising J-League in Japan, Saudi Pro League in Saudi Arabia, K-League 1 in South Korea, and A-League in Australia.
Facebook
TwitterAmong the industries presented in the data set, health and fitness had the highest cost-per-action (CPA) for Facebook ads as of February 2025, with ***U.S. dollars. The lowest value belonged to the real estate industry, with ** U.S. dollars.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
This dataset was built as a supplementary to "[European Soccer Database][1]". It includes data dictionary, extraction of detailed match information previously contains in XML columns.
Original data comes from [European Soccer Database][1] by Hugo Mathien. I personally thank him for all his efforts.
Since this is a open dataset with no specific goals / objectives, I would like to explore the following aspects by data analytics / data mining:
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset provides comprehensive information on the top 500 most valuable footballers (football players) in the world. The dataset includes essential details for each player, such as their position, age, nationality, club affiliation, and market value in million euros. The information is up-to-date till the end of the 2022/2023 season. By analyzing this dataset, researchers can uncover trends, identify players with exceptional market value, track the value fluctuations of individual players over time, and perform in-depth statistical analyses related to player worth.
Whether you are interested in understanding the market dynamics of football or conducting advanced analytics in the domain of sports, this dataset provides a solid foundation for exploration and research. Join the league of data-driven football enthusiasts and dive into the exciting world of player valuations with the Top 500 Most Valuable Footballers in the World dataset.
Facebook
TwitterFor all Football/Soccer Lovers, Find over 50+ stats for over 2 seasons to determine a players market value, asses and visualize various parameters and show your results!
Football and Love for Football!
We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
For most football fans, May - July represents a lull period due to the lack of club football. What makes up for it, is the intense transfer speculation that surrounds all major player transfers today. Their market valuations also lead to a few raised eyebrows, lately more than ever. I was curious to see how good a proxy popularity could be for ability, and the predictive power it would have in a model estimating a player's market value.
name: Name of the player
club: Club of the player
age : Age of the player
position : The usual position on the pitch
position_cat :
1 for attackers
2 for midfielders
3 for defenders
4 for goalkeepers
market_value : As on transfermrkt.com on July 20th, 2017
page_views : Average daily Wikipedia page views from September 1, 2016 to May 1, 2017
fpl_value : Value in Fantasy Premier League as on July 20th, 2017
fpl_sel : % of FPL players who have selected that player in their team
fpl_points : FPL points accumulated over the previous season
region:
1 for England
2 for EU
3 for Americas
4 for Rest of World
nationality
new_foreign : Whether a new signing from a different league, for 2017/18 (till 20th July)
age_cat
club_id
big_club: Whether one of the Top 6 clubs
new_signing: Whether a new signing for 2017/18 (till 20th July)
To statistically analyse the beautiful game.
Facebook
TwitterContains web scrapped (rvest) Market Value information, FIFA variables, and other related data on Players from the English Premier League. This includes but is not limited to, Market Value, Accumulated Market Sums, Highest Ever Market Value, Player Team, Player Name, Position, Agents, Player Sponsors, Birth Places etc....
Facebook
TwitterContext This dataset includes information on the most 100 valuable teams in the world. The information was scraped from www.transfermarkt.com
Content** **Club: The team name Competition: In which league does the club compete? Age: Average age of players Squad_size: The number of players in the team Market Value: ةarket value of the club market value of players: Average market value of team players MV Top-18 players: The total market value of the 18 most valuable players on the team Share of MV: The ratio of the total market value of the 18 most valuable players in the team to the total market value of all the team's players
Acknowledgements The data was scraped from www.transfermarkt.com
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset contains information about 644 Brazilian football players, with the following columns:
The data can be useful for analysis of player performance and market value trends in Brazilian football.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The Dairy Goods Sales Dataset provides a detailed and comprehensive collection of data related to dairy farms, dairy products, sales, and inventory management. This dataset encompasses a wide range of information, including farm location, land area, cow population, farm size, production dates, product details, brand information, quantities, pricing, shelf life, storage conditions, expiration dates, sales information, customer locations, sales channels, stock quantities, stock thresholds, and reorder quantities.
This dataset can be used by researchers, analysts, and businesses in the dairy industry for various purposes, such as:
Note: This dataset includes data from the period between 2019 and 2022, and it specifically focuses on selected dairy brands operating in specific states and union territories of India. There is an intentional drift highlighted in the dataset's figures due to its opensource and creative license, currently !
Facebook
TwitterThis dataset contains detailed information on a wide variety of vegetables from different regions across the world. Each entry includes data on the vegetable's category, color, seasonality, origin, nutritional value, pricing, availability, shelf life, storage requirements, growing conditions, health benefits, and common varieties. The dataset is structured to facilitate research and data analysis, offering insights into agricultural trends, nutritional science, and market dynamics. Ideal for use in academic research, market analysis, and agricultural studies.
Facebook
TwitterOpen Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Context: This dataset has been created to assist gamers, football enthusiasts, and data analysts in discovering valuable insights about the FIFA 23 Ultimate Team (FUT) mode. Ultimate Team is a popular game mode within the FIFA series, where users can build their dream squads by collecting and trading virtual cards representing real-life football players. Player performance in FUT is determined by various statistics and attributes, which are crucial factors when building a competitive team.
Sources: The dataset is sourced from FUTBIN, a widely recognized website that provides comprehensive information on player cards, including statistics, attributes, market values, and more. FUTBIN constantly updates its data to reflect the latest player ratings and market trends, ensuring the dataset remains relevant and accurate.
Inspiration: The inspiration behind this dataset is to provide users with a comprehensive and structured collection of player data, enabling them to make informed decisions when constructing their Ultimate Team. By analyzing this dataset, users can:
Facebook
TwitterIn our Data Science lesson, we tried to predict the value of some soccer players, using their performance and their last market value. As we have not found a dataset on Kaggle that was convenient to us, we have tried to create our own dataset merging two ones finding on this platform. The 2 datasets are : ''Soccer players values and their statistics'' and ''Top Football Leagues Scorers''.
The data are only from the season 2019-2020. We have 88 players remaining. Our work is not finish and can be significantly improved, particularly by increasing the number of player.
Thanks to Mohamed Hany and RSKriegs for their datasets.