Facebook
TwitterThis archived Paleoclimatology Study is available from the NOAA National Centers for Environmental Information (NCEI), under the World Data Service (WDS) for Paleoclimatology. The associated NCEI study type is Fire. The data include parameters of fire history with a geographic location of Montana, United States Of America. The time period coverage is from 562 to -53 in calendar years before present (BP). See metadata information for parameter and study location details. Please cite this study when using the data.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
All data taken from https://fbref.com/
GitHub to my project: https://github.com/emreguvenilir/fifa23-ml-ratingsystem
There is another statistics dataset here on Kaggle where the data is totally incomplete. So I took the time, mainly because of a final school project, to download the raw data from R. I then cleaned the data to the specifics of my project. The data contains only players from the big 5 leagues (prem, la liga, bundesliga, ligue 1, serie a.)
Column Description
squad: The team of a given player
comp: The league of the team, only includes the “big 5”
player: player name
nation: nationality of the player
pos: position of the player
age: age of the player
born: year born
MP: matches played
Minutes_Played: minutes played in the season
Mn_per_MP: minutes per match played
Mins_Per_90: minutes per 90 minutes (length of a soccer match)
Starts: matches started
PPM_Team.Success: avg # of point earned by the team from matches in which the player appeared with a minimum of 30 minutes
OnG_Team.Success: goals scored by team while on pitch
onGA_Team.Success: Goals allowed by team while on pitch plus_per_minus_Team.Success: goals scored minus allowed while on pitch
Goals: goals scored
Assists: assists that led to goal
GoalsAssists: goals + assists
NonPKG: non penalty kick goals
PK: penalty kicks made
PKatt: penalties attempted
CrdY: yellow cards
CrdR: red cards
xG: expected goals based on all shots taken
xAG: expected assisted goals
npxG+xAG: non penalty expected goals + assisted goals
PrgC: progressive carries in the attacking half of the pitch and went at least 10 yards
PrgP: progressive carries in the attacking half of the pitch and went at least 10 yards
Gls_Per90: goals per 90 minutes
Ast_Per90: assists per 90 minutes
G+A_Per90: goals + assists per 90
G_minus_PK_Per: goals excluding penalties per 90
G+A_minus_PK_Per: goals and assists excluding penalties per 90
xG_Per: xG per 90
xAG_Per: xAG per 90
xG+xAG_Per: xG+xAG per 90
Shots: shots taken
Shots_On_Target: shots on goal frame
SoT_percent: sh/SoT * 100
G_per_Sh: goals per shot taken
G_per_SoT: goal per shot on target
Avg_Shot_Dist: avg shot dist
FK_Standard: shots from free kicks
G_minus_xG_expected: goals minus expected goals
np:G_minus_xG_Expected: non penalty goals minus expected goals
Passes_Completed: passes completed
Passes_attempted: passes attempted
Passes_Cmp_percent: pass completion percentage
PrgDist_Total: progressive pass total distance
Passes_Cmp_Short: short passes completed (5 to 15 yds)
Passes_Att_Short: short passes Attempted (5 to 15 yds)
Passes_Cmp_Percent_Short: short passes completed percentage (5 to 15 yds)
Passes_Cmp_Medium: medium passes completed (15 to 30 yds)
Passes_Att_medium: medium passes Attempted (15 to 30 yds)
Passes_Cmp_Percent_Medium: medium passes completed percentage (15 to 30 yds)
Passes_Cmp_long: long passes completed (30+ yds)
Passes_Att_long : long passes Attempted (30+ yds)
Passes_Cmp_Percent_long : long passes completed percentage (30+ yds)
A_minus_xAG_expected: assists minus expected assists
Key_Passes: passes that lead directly to a shot
Final_third: passes that enter the final third of the field
PPA: passes into the penalty area
CrsPA: crosses into penalty area
TB_pass: through ball passes
Crs_Pass: number of crosses
Offside_passes: passes that resulted in an offside
Blocked_passes: passes blocked by an opponent
Shot_Creating_Actions: shot creating actions
SCA_90: shot creating actions per 90
TakeOnTo_Shot: take ons that led to shot
FoulTo_Shot: fouls draw that led to shot
DefAction_Shot: defensive actions that led to a shot (pressing)
GoalCreatingAction: goal creating actions
GCA90: goal creating actions per 90
TakeOn_Goal: take ons that led to a goal
Fld_goal: fouls drawn that led to a goal
DefAction_Goal: defensive actions that led to a goal (pressing)
Tackles: number of tackles made
Tackles_won: tackles won
Def_3rd_Tackles: tackles in the defensive 1/3 of the pitch
Mid_3rd_Tackles: tackles in the middle 1/3 of the pitch
Att_3rd_Tackles: tackles in the attacking 1/3 of the pitch
Tkl_percent_won: % of dribblers tackled
Lost_challenges: lost challenges, unsuccessful attempts to win the ball
Blocks: # of times blocking the ball by standing in path
Sh_blocked: shots blocked
Passes_blocked: number of passes blocked
Interceptions: interceptions
Clearances; clearances
ErrorsLead_ToShot: errors made leading to a shot
Att_Take: attacking take ons attempted
Succ:Take: attacking take ons successful
Succ_percent_take: percentage of attacking take ons successfully
Tkld_Take: times tackled during a take on
Tkld_percent_Take: percentage of times tackled during a take on
TotDist_Carries: total distance carrying the ball in any direction
PrgDist_carries: progressive carry distance total
Miscontrolls: # of times a player...
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Comprehensive football (soccer) data lake from Transfermarkt, clean and structured for analysis and machine learning.
Everything in raw CSV format – perfect for EDA, ML, and advanced football analytics.
A complete football data lake covering players, teams, transfers, performances, market values, injuries, and national team stats. Perfect for analysts, data scientists, researchers, and enthusiasts.
Here’s the high-level schema to help you understand the dataset structure:
https://i.imgur.com/WXLIx3L.png" alt="Transfermarkt Dataset ER Diagram">
Organized into 10 well-structured CSV categories:
Most football datasets are pre-processed and restrictive. This one is raw, rich, and flexible:
I’m always excited to collaborate on innovative football data projects. If you’ve got an idea, let’s make it happen together!
If this dataset helps you:
- Upvote on Kaggle
- Star the GitHub repo
- Share with others in the football analytics community
football analytics soccer dataset transfermarkt sports analytics machine learning football research player statistics
🔥 Analyze football like never before. Your next AI or analytics project starts here.
Facebook
TwitterThis archived Paleoclimatology Study is available from the NOAA National Centers for Environmental Information (NCEI), under the World Data Service (WDS) for Paleoclimatology. The associated NCEI study type is Fire. The data include parameters of fire history|paleolimnology with a geographic location of Michigan, United States Of America. The time period coverage is from Unavailable begin date to Unavailable end date in calendar years before present (BP). See metadata information for parameter and study location details. Please cite this study when using the data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
USHAP (USHighAirPollutants) is one of the series of long-term, full-coverage, high-resolution, and high-quality datasets of ground-level air pollutants for the United States. It is generated from the big data (e.g., ground-based measurements, satellite remote sensing products, atmospheric reanalysis, and model simulations) using artificial intelligence by considering the spatiotemporal heterogeneity of air pollution. This is the big data-derived seamless (spatial coverage = 100%) daily, monthly, and yearly 1 km (i.e., D1K, M1K, and Y1K) ground-level PM2.5 dataset in the United States from 2000 to 2020. Our daily PM2.5 estimates agree well with ground measurements with an average cross-validation coefficient of determination (CV-R2) of 0.82 and normalized root-mean-square error (NRMSE) of 0.40, respectively. All the data will be made public online once our paper is accepted, and if you want to use the USHighPM2.5 dataset for related scientific research, please contact us (Email: weijing_rs@163.com; weijing@umd.edu). Wei, J., Wang, J., Li, Z., Kondragunta, S., Anenberg, S., Wang, Y., Zhang, H., Diner, D., Hand, J., Lyapustin, A., Kahn, R., Colarco, P., da Silva, A., and Ichoku, C. Long-term mortality burden trends attributed to black carbon and PM2.5 from wildfire emissions across the continental USA from 2000 to 2020: a deep learning modelling study. The Lancet Planetary Health, 2023, 7, e963–e975. https://doi.org/10.1016/S2542-5196(23)00235-8 More air quality datasets of different air pollutants can be found at: https://weijing-rs.github.io/product.html
Facebook
TwitterWhen a quarterback takes a snap and drops back to pass, what happens next may seem like chaos. As offensive players move in various patterns, the defense works together to prevent successful pass completions and then to quickly tackle receivers that do catch the ball. In this year’s Kaggle competition, your goal is to use data science to better understand the schemes and players that make for a successful defense against passing plays.
In American football, there are a plethora of defensive strategies and outcomes. The National Football League (NFL) has used previous Kaggle competitions to focus on offensive plays, but as the old proverb goes, “defense wins championships.” Though metrics for analyzing quarterbacks, running backs, and wide receivers are consistently a part of public discourse, techniques for analyzing the defensive part of the game trail and lag behind. Identifying player, team, or strategic advantages on the defensive side of the ball would be a significant breakthrough for the game.
This competition uses NFL’s Next Gen Stats data, which includes the position and speed of every player on the field during each play. You’ll employ player tracking data for all drop-back pass plays from the 2018 regular season. The goal of submissions is to identify unique and impactful approaches to measure defensive performance on these plays. There are several different directions for participants to ‘tackle’ (ha)—which may require levels of football savvy, data aptitude, and creativity. As examples:
What are coverage schemes (man, zone, etc) that the defense employs? What coverage options tend to be better performing? Which players are the best at closely tracking receivers as they try to get open? Which players are the best at closing on receivers when the ball is in the air? Which players are the best at defending pass plays when the ball arrives? Is there any way to use player tracking data to predict whether or not certain penalties – for example, defensive pass interference – will be called? Who are the NFL’s best players against the pass? How does a defense react to certain types of offensive plays? Is there anything about a player – for example, their height, weight, experience, speed, or position – that can be used to predict their performance on defense? What does data tell us about defending the pass play? You are about to find out.
Note: Are you a university participant? Students have the option to participate in a college-only Competition, where you’ll work on the identical themes above. Students can opt-in for either the Open or College Competitions, but not both.
Facebook
Twitterhttps://sqmagazine.co.uk/privacy-policy/https://sqmagazine.co.uk/privacy-policy/
The AI image generator Midjourney has rapidly shifted from a niche tool to a mainstream creative engine. Artists and brands alike now use it for concept art, marketing visuals, and rapid prototyping, while design teams employ it to streamline workflows and reduce production time. In this article, you’ll find detailed...
Facebook
Twitterhttps://doi.org/10.17026/fp39-0x58https://doi.org/10.17026/fp39-0x58
The datasets used are in public access. Here is the URL for the database of forest fire footprints: https://www.ffm.vic.gov.au/The Landsat images are also in public access: https://earthexplorer.usgs.gov/.The code for the analysis is attached (Bush_fires_v2_2.R). The results can be reproduced using the code.The figures for the publication were created using the other R code attached to this email.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains yearly College Basketball Big 12 Conference data for every active Big 12 season.
The data was collected from Sports Reference then cleaned for data analysis.
Tabular data includes:
- year
- rank
- school
- games: Total games played
- wins
- losses
- win_percentage
- conference_wins
- conference_losses
- home_wins
- home_losses
- away_wins
- away_losses
- offensive_rating
- defensive_rating
- net_rating
- simple_rating
Per Game
————————————————————
- field_goals
- field_goal_attempts
- field_goal_percentage
- 3_pointers
- 3_pointer_attempts
- 3_pointer_percentage
- effective_field_goal_percentage
- free_throws
- free_throw_attempts
- free_throw_percentage
- offensive_rebounds
- total_rebounds
- assists
- steals
- blocks
- turnovers
- personal_fouls
- points
- opponent_points
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
ABSTRACT Introduction Tactical football training is significant in teaching great football teams. Analyzing and discussing existing problems and proposals for corresponding countermeasures should be carried out periodically. Objective Investigate and understand the main factors that affect the development of tactical training activities of big football teams. Methods Large-scale soccer match tactics at the 2018 World Cup are evaluated and treated statistically by dividing the defensive behaviors in the game between individual defensive tactics and collective defensive tactics. Results The primary means of launching a fast defensive attack is a medium to long pass across the court. Launching a fast attack requires combining a pass with a sudden attack. Conclusion Attackers often take the initiative in their confrontation tactics. The aggressive style of the players excels in the initiative and midfield advantage. Evidence level II; Therapeutic Studies - Investigating the results.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterThis archived Paleoclimatology Study is available from the NOAA National Centers for Environmental Information (NCEI), under the World Data Service (WDS) for Paleoclimatology. The associated NCEI study type is Fire. The data include parameters of fire history with a geographic location of Montana, United States Of America. The time period coverage is from 562 to -53 in calendar years before present (BP). See metadata information for parameter and study location details. Please cite this study when using the data.