Facebook
TwitterThe lack of publicly available National Football League (NFL) data sources has been a major obstacle in the creation of modern, reproducible research in football analytics. While clean play-by-play data is available via open-source software packages in other sports (e.g. nhlscrapr for hockey; PitchF/x data in baseball; the Basketball Reference for basketball), the equivalent datasets are not freely available for researchers interested in the statistical analysis of the NFL. To solve this issue, a group of Carnegie Mellon University statistical researchers including Maksim Horowitz, Ron Yurko, and Sam Ventura, built and released nflscrapR an R package which uses an API maintained by the NFL to scrape, clean, parse, and output clean datasets at the individual play, player, game, and season levels. Using the data outputted by the package, the trio went on to develop reproducible methods for building expected point and win probability models for the NFL. The outputs of these models are included in this dataset and can be accessed using the nflscrapR package.
The dataset made available on Kaggle contains all the regular season plays from the 2009-2016 NFL seasons. The dataset has 356,768 rows and 100 columns. Each play is broken down into great detail containing information on: game situation, players involved, results, and advanced metrics such as expected point and win probability values. Detailed information about the dataset can be found at the following web page, along with more NFL data: https://github.com/ryurko/nflscrapR-data.
This dataset was compiled by Ron Yurko, Sam Ventura, and myself. Special shout-out to Ron for improving our current expected points and win probability models and compiling this dataset. All three of us are proud founders of the Carnegie Mellon Sports Analytics Club.
This dataset is meant to both grow and bring together the community of sports analytics by providing clean and easily accessible NFL data that has never been availabe on this scale for free.
Facebook
Twitterhttps://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
The Sports Data API Interface market is booming, projected to reach $2.5 Billion in 2025 and grow at a 15% CAGR through 2033. This comprehensive analysis explores market drivers, trends, restraints, and key players in sports data APIs for esports, football, basketball, and more. Discover market insights and regional breakdowns for informed business decisions.
Facebook
TwitterUse our trusted SportMonks Football API to build your own sports application and be at the forefront of football data today.
Our Football API is designed for iGaming, media, developers and football enthusiasts alike, ensuring you can create a football application that meets your needs.
Over 20,000 sports fanatics make use of our data. We know what data works best for you, so we ensured that our Football API has all the necessary tools you need to create a successful football application.
Livescores and schedules Our Football API features extremely fast livescores and up-to-date season schedules, meaning your app will be the first to notify its customers about a goal scored. This also works to further improve the look and feel of your website.
Statistics and line-ups We offer various kinds of football statistics, ranging from (live) player statistics to team, match and season statistics. And that’s not all - we also provide pre-match lineups for all important leagues.
Coverage and historical data Our Football API covers over 1,200 leagues, all managed by our in-house scouts and data platform. That means there’s up to 14 years of historical data available.
Bookmakers and odds Build your football sportsbook, odds comparison or betting portal with our pre-match and in-play odds collated from all major bookmakers and markets.
TV Stations and highlights Show your customers where the football games are broadcasted and provide video highlights of major match events.
Standings and topscorers Enhance your football website with standings and live standings, and allow your customers to see the top scorers and what the season's standings are.
Facebook
Twitterhttps://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy
The Sports Data API Interface market is booming, projected to reach $12B+ by 2033 (15% CAGR). This in-depth analysis covers market size, trends, key players (Sportradar, Genius Sports, etc.), and regional insights. Learn how esports, fantasy sports, and betting fuel this explosive growth.
Facebook
TwitterThe lack of publicly available National Football League (NFL) data sources has been a major obstacle in the creation of modern, reproducible research in football analytics. While clean play-by-play data is available via open-source software packages in other sports (e.g. nhlscrapr for hockey; PitchF/x data in baseball; the NBA API for basketball), the equivalent datasets are not freely available for researchers interested in the statistical analysis of the NFL. To solve this issue, a group of Carnegie Mellon University statistical researchers led by recent graduate, Maksim Horowitz, built and released nflscrapR an R package which uses an API maintained by the NFL to scrape, clean, parse, and output clean datasets at the individual play, player, game, and season levels. These datasets allow for the advancement of NFL research in the public domain by allowing analysts to develop from a common source in order to create reproducible NFL research, similar to what is being done currently in other professional sports.
The dataset made available on Kaggle contains all the regular season plays from the 2015-2016 NFL season. The dataset contain 46,129 rows and 63 columns. Each play is broken down into great detail containing information on; game situation, players involved and results. Detailed information about the dataset can be found in the nflscrapR documentation.
# Must install the devtools package using the below code
install.packages('devtools')
library(devtools)
# For now you must install nflscrapR from github
if (!is.element("nflscrapR", installed.packages())) {
# Print Installing nflscrapR
devtools::install_github(repo = "maksimhorowitz/nflscrapR")
}
library(nflscrapR)
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This folder contains data behind the story The Football Hall Of Fame Has A Receiver Problem.
advanced-historical.csv contains advanced career stats for NFL receivers, 1932-2013.
| Header | Definition |
|---|---|
pfr_player_id | Player identification code at Pro-Football-Reference.com |
player_name | The player's name |
career_try | Career True Receiving Yards |
career_ranypa | Adjusted Net Yards Per Attempt (relative to average) of player's career teams, weighted by TRY w/ each team |
career_wowy | The amount by which career_ranypa exceeds what would be expected from his QBs' (age-adjusted) performance without the receiver |
bcs_rating | The number of yards per game by which a player would outgain an average receiver on the same team, after adjusting for teammate quality and age (update of http://www.sabernomics.com/sabernomics/index.php/2005/02/ranking-the-all-time-great-wide-receivers/) |
try-per-game-aging-curve.csv contains receiver aging curve definitions.
| Header | Definition |
|---|---|
age_from | The age (as of December 31st) the player is moving from |
age_to | The age (as of December 31st) the player is moving to |
trypg_change | Expected change in TRY/game from one age-season to the next |
This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!
This dataset is maintained using GitHub's API and Kaggle's API.
This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
Facebook
TwitterWith the SportMonks Football Euro Plan you get access to all data you need for a fantastic website/app about the Top Leagues in Europe. All data is available in JSON en extremely well documented. Our 7 days a week available Service Department will help you when needed. 14 day Free Trial included for all our plans!
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
European Football Leagues Database 2023-2024
Overview This dataset provides comprehensive information about the top 5 European football leagues for the 2023-2024 season. It includes detailed statistics about matches, players, teams, coaches, referees, and more, making it an invaluable resource for sports analysts, researchers, and football enthusiasts.
Dataset Description Leagues Covered: - English Premier League - Spanish La Liga - German Bundesliga - Italian Serie A - French Ligue 1
The database follows a normalized schema design with proper relationships between tables. Here's a simplified view of the main relationships:
leagues
↑
teams → matches ← referees
↓ ↑
players scores
↑
coaches
Here are some example SQL queries to get you started:
Get all matches for a specific team:
sql
SELECT m.*, t1.name as home_team, t2.name as away_team
FROM matches m
JOIN teams t1 ON m.home_team_id = t1.team_id
JOIN teams t2 ON m.away_team_id = t2.team_id
WHERE t1.team_id = [team_id] OR t2.team_id = [team_id];
Get current league standings:
sql
SELECT t.name, s.*
FROM standings s
JOIN teams t ON s.team_id = t.team_id
WHERE s.league_id = [league_id]
ORDER BY s.points DESC;
Get top scorers:
sql
SELECT p.name, p.team_id, COUNT(*) as goals
FROM scores s
JOIN players p ON s.scorer_id = p.player_id
GROUP BY p.player_id, p.name, p.team_id
ORDER BY goals DESC;
import pandas as pd
import sqlite3
# Connect to the SQLite database
conn = sqlite3.connect('sports_league.sqlite')
# Read data into pandas DataFrames
matches_df = pd.read_sql('SELECT * FROM matches', conn)
players_df = pd.read_sql('SELECT * FROM players', conn)
teams_df = pd.read_sql('SELECT * FROM teams', conn)
# Analyze data
team_stats = matches_df.groupby('home_team_id')['home_team_goals'].agg(['mean', 'sum'])
This dataset can be used for: 1. Match outcome prediction 2. Player performance analysis 3. Team strategy analysis 4. Historical trend analysis 5. Sports betting research 6. Fantasy football insights 7. Statistical modeling 8. Machine learning projects
Data Files:
matches.csv
players.csv
teams.csv
coaches.csv
referees.csv
stadiums.csv
standings.csv
scores.csv
seasons.csv
sports_league.sqlite
This dataset is released under the Creative Commons Zero v1.0 Universal license
If you find any issues or have suggestions for improvements, please: 1. Open an issue on the dataset's GitHub repository 2. Submit a pull request with your proposed changes 3. Contact the maintainer directly
Project: https://github.com/kaimg/Sports-League-Management-System
Facebook
TwitterNfl Films Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.
Facebook
TwitterNfl Construction Sa Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This folder contains data behind the story The Rams Are Dead To Me, So I Answered 3,352 Questions To Find A New NFL Team.
team-picking-categories.csv contains grades for each NFL franchise in 16 categories, to be used to pick a new favorite team.
| abbrev | category |
|---|---|
| FRL | Fan relations - Courtesy by players, coaches and front offices toward fans, and how well a team uses technology to reach them |
| OWN | Ownership - Honesty; loyalty to core players and the community |
| PLA | Players - Effort on the field, likability off it |
| FUT | Future wins - Projected wins over next 5 seasons |
| BWG | Bandwagon Factor - Are the team's next 5 years likely to be better than their previous 5? |
| TRD | Tradition - Championships/division titles/wins in team's entire history |
| BNG | Bang for the buck - Wins per fan dollars spent |
| BEH | Behavior - Suspensions by players on team since 2007, with extra weight to transgressions vs. women |
| NYP | Proximity to New York City |
| SLP | Proximity to St. Louis |
| AFF | Affordability - Price of tickets, parking and concessions |
| SMK | Small Market - Size of market in terms of population, where smaller is better |
| STX | Stadium experience - Quality of venue; fan-friendliness of environment; frequency of game-day promotions |
| CCH | Coaching - Strength of on-field leadership |
| UNI | Uniform - Stylishness of uniform design, according to Uni Watch's Paul Lukas |
| BMK | Big Market - Size of market in terms of population, where bigger is better |
Should be used in conjunction with weights derived from a survey structured like this: http://www.allourideas.org/nflteampickingsample.
This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!
This dataset is maintained using GitHub's API and Kaggle's API.
This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
Facebook
TwitterNfl Uk Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.
Facebook
Twitter"These are arrests, charges and citations of NFL players for crimes more serious than common traffic violations. Almost all of the players belonged to an NFL roster at the time of the incident. In rare cases, a free agent is included only if that player later signs with an NFL team. The data comes from media reports and public records. It cannot be considered fully complete because records of some player arrests might not have been found for various reasons, including lack of media coverage or accessible public records. Many resolutions to these cases also are pending or could not be immediately found." (Source)
This data covers January 2000 to March 2017. Like mentioned above, it is not fully complete. In the future I hope to add files to add dimensions like USA crime rates, team info, player info, team season records
The original database was conceived and created by sports writer Brent Schrotenboer of USA Today. http://www.usatoday.com/sports/nfl/arrests/
The Rate of Domestic Violence Arrests Among NFL Players - Benjamin Morris (FiveThirtyEight)
I found this data set August of 2015 and created http://nflarrest.com/ that attempts to provide a visual tool to explore the data set and a RESTful API.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This folder contains data behind the story How Every NFL Team’s Fans Lean Politically.
Google Trends data was derived from comparing 5-year search traffic for the 7 sports leagues we analyzed:
Results are listed by designated market area (DMA).
The percentages are the approximate percentage of major-sports searches that were conducted for each league.
Trump's percentage is his share of the vote within the DMA in the 2016 presidential election.
SurveyMonkey data was derived from a poll of American adults ages 18 and older, conducted between Sept. 1-7, 2017.
Listed numbers are the raw totals for respondents who ranked a given NFL team among their three favorites, and how many identified with a given party (further broken down by race). We also list the percentages of the entire sample that identified with each party, and were of each race.
This is a dataset from FiveThirtyEight hosted on their GitHub. Explore FiveThirtyEight data using Kaggle and all of the data sources available through the FiveThirtyEight organization page!
This dataset is maintained using GitHub's API and Kaggle's API.
This dataset is distributed under the Attribution 4.0 International (CC BY 4.0) license.
Facebook
TwitterHave you ever found yourself with a football dataset that almost had it all, but left you short of happiness? Time after time, promising datasets failed to deliver the statistics that truly matter – match events, player performances, team results, and season standings.
That time is over!
This in-depth football dataset, curated straight from a RapidAPI endpoint, brings you the data points we've all been waiting for. From fixtures and injuries to goals, assists, and tactical breakdowns, this dataset unlocks the full picture of the beautiful game.
What You Get 🏆 - Fixture Stats & Events: Goals, assists, fouls, and match-defining moments across leagues up to 2024. - Player Performances: From tackles to dribbles, passes, and shots – every stat that makes a difference. - Season Stats & League Standings: Discover how teams dominate, stumble, or rise to glory each season. - Team Insights: Analyze home/away performance, goal-scoring patterns, and defensive strengths. - Match Highlights: Real-time events like own goals, red cards, and critical substitutions. - Injuries & Suspensions: Missing players and their impact on team dynamics. - Iconic Stadiums: Explore venues, capacities, and surfaces that set the stage for football's greatest moments.
Why It’s Exciting 🌟
This isn’t just another football dataset – it’s the ultimate resource for fans, analysts, and strategists who want to dig deeper. Whether you're predicting outcomes, analyzing player form, or crafting the next big football insights project, you now have all the tools you need.
Get ready to unlock stories, trends, and insights like never before – because this time, the stats you actually care about are all here. Let’s kick it off! ⚽✨
In terms of fixture stats for players, the endpoint provides data from 2015 up through the 2024 season and I plan to make one more update at the end of all league/cup seasons in June of 2025.
Disclaimer: This dataset is intended for non-commercial, academic purposes and does not infringe upon any intellectual property rights of the original data providers, including RapidAPI or associated sources. For full details, please refer to the respective terms of use provided by the data sources.
If you have questions about the data or simply want to connect, reach out on LinkedIn and if you plan on using this data for any type of analysis, can you please share that with me!
PS: I am a Ronaldo fan... Suiiiii !!!
Leagues/Cups in datasets: - La Liga - Ligue 1 - Serie A - World Cup - Bundesliga - NWSL Women - Pro League - Championship League - Copa America - Premier League - CONCACAF Gold Cup - Euro Championship - UEFA Europa League - MLS - Africa Cup Of Nations - CONCACAF Champions League
Facebook
TwitterNfl League Office Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.
Facebook
TwitterStatsBomb Open Data
StatsBomb are committed to sharing new data and research publicly to enhance understanding of the game of Football. We want to actively encourage new research and analysis at all levels. Therefore we have made certain leagues of StatsBomb Data freely available for public use for research projects and genuine interest in football analytics.
StatsBomb are hoping that by making data freely available, we will extend the wider football analytics community and attract new talent to the industry. We would like to collect some basic personal information about users of our data. By giving us your email address, it means we will let you know when we make more data, tutorials and research available. We will store the information in accordance with our Privacy Policy and the GDPR.
Whilst we are keen to share data and facilitate research, we also urge you to be responsible with the data. Please register your details on https://www.statsbomb.com/resource-centre and read our User Agreement carefully.
Terms & Conditions By using this repository, you are agreeing to the user agreement.
If you publish, share or distribute any research, analysis or insights based on this data, please state the data source as StatsBomb and use our logo, available in our Media Pack.
Getting Started The data is provided as JSON files exported from the StatsBomb Data API, in the following structure:
Competition and seasons stored in competitions.json. Matches for each competition and season, stored in matches. Each folder within is named for a competition ID, each file is named for a season ID within that competition. Events and lineups for each match, stored in events and lineups respectively. Each file is named for a match ID. StatsBomb 360 data for selected matches, stored in three-sixty. Each file is named for a match ID. Some documentation about the meaning of different events and the format of the JSON can be found in the doc directory.
Facebook
TwitterPremier League, Serie A, La Liga, Bundesliga, Ligue 1 from 2017-2018 to 2020-2021.
1 collection for each league of a certain season.
1 document for each player.
Within each document:
- name, age, nationality, height, weight, team, position.
- general stats: games, time, yellow cards, red cards.
- offensive stats: goals, assists, xG, xA, shots, key passes, npg, npxG, xGChain, xGBuildup.
- defensive stats: Tkl, TklW, Past, Press, Succ, Block, Int.
- passing stats: Cmp, Cmp%, 1/3, PPA, CrsPA, Prog.
Three data resources were used: Understat, api-football and Fbref. For more information on the data acquisition phase, I recommend reading the Football players notebook in the Code section.
This dataset is built with the aim of supporting an analysis to try to identify the most probable top performance age range of a player knowing the league in which he plays, his physical characteristics, his role and his nationality.
Facebook
TwitterNfl Kzn Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.
Facebook
TwitterMaryland Sports (http://www.marylandsports.us/) has identified sport venues located within the State of Maryland. These venues offer opportunities to participate in free and fee-based, organized and pick-up, indoor and outdoor sports and physical fitness related activities in the area of Football.This is a MD iMAP hosted service layer. Find more information at https://imap.maryland.gov.Feature Service Layer Link:https://mdgeodata.md.gov/imap/rest/services/Society/MD_SportVenues/FeatureServer/26
Facebook
TwitterThe lack of publicly available National Football League (NFL) data sources has been a major obstacle in the creation of modern, reproducible research in football analytics. While clean play-by-play data is available via open-source software packages in other sports (e.g. nhlscrapr for hockey; PitchF/x data in baseball; the Basketball Reference for basketball), the equivalent datasets are not freely available for researchers interested in the statistical analysis of the NFL. To solve this issue, a group of Carnegie Mellon University statistical researchers including Maksim Horowitz, Ron Yurko, and Sam Ventura, built and released nflscrapR an R package which uses an API maintained by the NFL to scrape, clean, parse, and output clean datasets at the individual play, player, game, and season levels. Using the data outputted by the package, the trio went on to develop reproducible methods for building expected point and win probability models for the NFL. The outputs of these models are included in this dataset and can be accessed using the nflscrapR package.
The dataset made available on Kaggle contains all the regular season plays from the 2009-2016 NFL seasons. The dataset has 356,768 rows and 100 columns. Each play is broken down into great detail containing information on: game situation, players involved, results, and advanced metrics such as expected point and win probability values. Detailed information about the dataset can be found at the following web page, along with more NFL data: https://github.com/ryurko/nflscrapR-data.
This dataset was compiled by Ron Yurko, Sam Ventura, and myself. Special shout-out to Ron for improving our current expected points and win probability models and compiling this dataset. All three of us are proud founders of the Carnegie Mellon Sports Analytics Club.
This dataset is meant to both grow and bring together the community of sports analytics by providing clean and easily accessible NFL data that has never been availabe on this scale for free.