35 datasets found
  1. Approaches and included features.

    • plos.figshare.com
    xls
    Updated Oct 30, 2024
    + more versions
    Cite
    Ishara Bandara; Sergiy Shelyag; Sutharshan Rajasegarar; Dan Dwyer; Eun-jin Kim; Maia Angelova (2024). Approaches and included features. [Dataset]. http://doi.org/10.1371/journal.pone.0312278.t002
    Explore at:
    Available download formats: xls
    Dataset updated
    Oct 30, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Ishara Bandara; Sergiy Shelyag; Sutharshan Rajasegarar; Dan Dwyer; Eun-jin Kim; Maia Angelova
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In association football, predicting the likelihood and outcome of a shot at goal is useful but challenging. Expected goals (xG) models can be used in a variety of ways, including evaluating performance and designing offensive strategies. This study proposed a novel framework that uses the events preceding a shot to improve the accuracy of the xG metric. The framework combines previously explored and unexplored temporal features. The new features include the “advancement factor” and the “player position column”. A random forest model was used, which performed better than published single-event-based models in the literature. Results further demonstrated a significant improvement in model performance with the inclusion of preceding event information. The proposed framework and model enable the discovery of event sequences that improve xG, which include opportunities built up from the sides of the 18-yard box, shots attempted from in front of the goal within the opposition’s 18-yard box, and shots from successful passes to the far post.

  2. Test data results for comparison between expected goals statistic and...

    • plos.figshare.com
    bin
    Updated Jun 2, 2023
    Cite
    James Mead; Anthony O’Hare; Paul McMenemy (2023). Test data results for comparison between expected goals statistic and traditional metrics. [Dataset]. http://doi.org/10.1371/journal.pone.0282295.t004
    Explore at:
    Available download formats: bin
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    James Mead; Anthony O’Hare; Paul McMenemy
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Test data results for comparison between expected goals statistic and traditional metrics.

  3. Xg Dataset

    • universe.roboflow.com
    zip
    Updated May 16, 2025
    Cite
    Rostyslav (2025). Xg Dataset [Dataset]. https://universe.roboflow.com/rostyslav-egoyn/xg-ykiip/model/1
    Explore at:
    Available download formats: zip
    Dataset updated
    May 16, 2025
    Dataset authored and provided by
    Rostyslav
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Objects Bounding Boxes
    Description

    XG

    ## Overview
    
    XG is a dataset for object detection tasks - it contains Objects annotations for 340 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
    ## License
    
    This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/by/4.0/).
    
  4. Summary of the results of our model compared to published models.

    • plos.figshare.com
    bin
    Updated May 31, 2023
    Cite
    James Mead; Anthony O’Hare; Paul McMenemy (2023). Summary of the results of our model compared to published models. [Dataset]. http://doi.org/10.1371/journal.pone.0282295.t003
    Explore at:
    Available download formats: bin
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOS (http://plos.org/)
    Authors
    James Mead; Anthony O’Hare; Paul McMenemy
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The ROC AUC for the optimal model in this research was computed on test data; the model used players’ FIFA ratings as a proxy for player ability.

  5. Model XG BOOST

    • kaggle.com
    Updated Mar 27, 2025
    Cite
    ShyamSUBEDI (2025). Model XG BOOST [Dataset]. https://www.kaggle.com/datasets/shyamsubedi/model-xg-boost
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 27, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    ShyamSUBEDI
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by ShyamSUBEDI

    Released under MIT

    Contents

  6. Football Shots Data

    • kaggle.com
    Updated Feb 19, 2025
    Cite
    Alba Closa Tarres (2025). Football Shots Data [Dataset]. https://www.kaggle.com/datasets/albaclosatarres/football-shots-data
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 19, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Alba Closa Tarres
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset provides detailed information on football (soccer) shots, capturing various contextual and technical aspects of each attempt. It is designed for sports analytics, machine learning models, and tactical analysis. It was created with the aim of building a basic xG model.

  7. xg-thesis

    • huggingface.co
    Cite
    Fadhil Raihan Akbar, xg-thesis [Dataset]. https://huggingface.co/datasets/fadhilra101/xg-thesis
    Explore at:
    Authors
    Fadhil Raihan Akbar
    Description

    expected-goals-thesis

    A repository for analysis on Expected Goals using StatsBomb and Wyscout data.

      StatsBomb data
    

    This repository assumes that the StatsBomb open-data has already been cloned to a local directory.

      Versioning
    

    The original thesis was run from a particular version of the data and mplsoccer (my football plotting library). The original code is here:… See the full description on the dataset page: https://huggingface.co/datasets/fadhilra101/xg-thesis.

  8. Premier League - Player Stats Season - 24/25

    • kaggle.com
    Updated Dec 7, 2024
    Cite
    Eduardo Palmieri (2024). Premier League - Player Stats Season - 24/25 [Dataset]. https://www.kaggle.com/datasets/eduardopalmieri/premier-league-player-stats-season-2425/versions/8
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 7, 2024
    Dataset provided by
    Kaggle
    Authors
    Eduardo Palmieri
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Premier League Players Performance Dataset

    This dataset provides a comprehensive overview of player performance in the Premier League capturing a wide array of metrics related to gameplay, scoring, passing, and defensive actions. With records detailing individual player statistics across different teams, this dataset is a valuable resource for analysts, data scientists, and fans who are interested in diving into player performance data from one of the world’s top soccer leagues.

    Each entry represents a single player's profile, featuring data on expected goals (xG), expected assists (xAG), touches, dribbles, tackles, and more. This dataset is ideal for analyzing various aspects of player contribution, both offensively and defensively, and understanding their impact on team performance.

    Dataset Columns

    • Player: Name of the player
    • Team: Team the player belongs to
    • #: Player's jersey number
    • Nation: Nationality of the player
    • Position: Primary playing position on the field
    • Age: Age of the player
    • Minutes: Total minutes played
    • Goals: Number of goals scored
    • Assists: Number of assists
    • Penalty Shoot on Goal: Penalty shots taken on goal
    • Penalty Shoot: Total penalty shots attempted
    • Total Shoot: Total shots attempted
    • Shoot on Target: Shots successfully on target
    • Yellow Cards: Number of yellow cards received
    • Red Cards: Number of red cards received
    • Touches: Total ball touches
    • Dribbles: Total dribbles attempted
    • Tackles: Total tackles made
    • Blocks: Total blocks
    • Expected Goals (xG): Expected goals, calculated based on shooting positions and likelihood of scoring
    • Non-Penalty xG (npxG): Expected goals excluding penalties
    • Expected Assists (xAG): Expected assists, based on actions leading to an expected goal (xG)
    • Shot-Creating Actions: Actions leading to a shot attempt
    • Goal-Creating Actions: Actions leading to a goal
    • Passes Completed: Successful passes completed
    • Passes Attempted: Total passes attempted
    • Pass Completion %: Pass completion rate, expressed as a percentage (some entries have missing values here)
    • Progressive Passes: Passes advancing the ball significantly toward the opponent’s goal
    • Carries: Total ball carries
    • Progressive Carries: Carries advancing the ball significantly toward the opponent’s goal
    • Dribble Attempts: Total dribbles attempted
    • Successful Dribbles: Total successful dribbles
    • Date: Date of record collection or game date
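
    Because the column notes say that some Pass Completion % entries are missing, one possible way to fill them is to recompute the rate from Passes Completed and Passes Attempted. The sketch below is only an illustration: the helper name `pass_completion_pct` and the two sample rows are hypothetical and are not taken from the dataset.

```python
from typing import Optional


def pass_completion_pct(completed: Optional[float],
                        attempted: Optional[float]) -> Optional[float]:
    """Return the completion rate in percent, or None if it cannot be computed."""
    if completed is None or attempted is None or attempted == 0:
        return None
    return 100.0 * completed / attempted


# Hypothetical rows using the column names listed above.
rows = [
    {"Player": "A", "Passes Completed": 45, "Passes Attempted": 50,
     "Pass Completion %": None},
    {"Player": "B", "Passes Completed": 0, "Passes Attempted": 0,
     "Pass Completion %": None},
]

# Fill only the entries that are missing; keep existing values untouched.
for row in rows:
    if row["Pass Completion %"] is None:
        row["Pass Completion %"] = pass_completion_pct(
            row["Passes Completed"], row["Passes Attempted"])
```

    A player with no attempted passes keeps a missing value rather than a rate of zero, since the rate is undefined in that case.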

    Potential Use Cases

    Data Visualization: Explore relationships between various performance metrics to identify patterns.

    Player Comparisons: Compare individual players based on goals, assists, xG, xAG, and other metrics.

    Team Analysis: Evaluate contributions of players within the same team to gain insights into team dynamics.

    Predictive Modeling: Use the dataset to build models for predicting game outcomes, goals, or assists based on player performance metrics.

  9. Data from: xG-Loc: 3GPP-compliant datasets for xG location-aware networks

    • ieee-dataport.org
    Updated Nov 29, 2024
    Cite
    Andrea Conti (2024). xG-Loc: 3GPP-compliant datasets for xG location-aware networks [Dataset]. https://ieee-dataport.org/open-access/xg-loc-3gpp-compliant-datasets-xg-location-aware-networks
    Explore at:
    Dataset updated
    Nov 29, 2024
    Authors
    Andrea Conti
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    including location-based services (LBSs) and efficient network management. However

  10. League positions resulting in specific consequences for teams in each...

    • plos.figshare.com
    bin
    Updated Jun 2, 2023
    Cite
    James Mead; Anthony O’Hare; Paul McMenemy (2023). League positions resulting in specific consequences for teams in each league. [Dataset]. http://doi.org/10.1371/journal.pone.0282295.t001
    Explore at:
    Available download formats: bin
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOS ONE
    Authors
    James Mead; Anthony O’Hare; Paul McMenemy
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    League positions resulting in specific consequences for teams in each league.

  11. Research Data for the PhD thesis Advanced Electromagnetic Modelling of the...

    • data.4tu.nl
    zip
    Updated May 30, 2024
    Cite
    Riccardo Ozzola; Daniele Cavallo; Andrea Neto (2024). Research Data for the PhD thesis Advanced Electromagnetic Modelling of the Next Generation (XG) Wireless Communication Systems [Dataset]. http://doi.org/10.4121/9d280382-b6ab-4bf7-9308-d48a7326a38a.v1
    Explore at:
    Available download formats: zip
    Dataset updated
    May 30, 2024
    Dataset provided by
    4TU.ResearchData
    Authors
    Riccardo Ozzola; Daniele Cavallo; Andrea Neto
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These are the measurements (S-parameters and far-field patterns) of the prototype discussed in Chapter 4 of the PhD thesis Advanced Electromagnetic Modelling of the Next Generation (XG) Wireless Communication Systems.

  12. Arsenal - Nottingham Forest xG EPL(12.08.2023)

    • kaggle.com
    Updated Jul 22, 2024
    Cite
    orkunaktas4 (2024). Arsenal - Nottingham Forest xG EPL(12.08.2023) [Dataset]. https://www.kaggle.com/datasets/orkunaktas/arsenal-nottingham-forest-xg-epl12-08-2023
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 22, 2024
    Dataset provided by
    Kaggle
    Authors
    orkunaktas4
    Description

    Context

    This dataset contains shot statistics and xG values for shots taken during the match between Arsenal and Nottingham Forest on 12.08.2023.

    Variables

    • Player: Who shoots to score the goal
    • Squad: Team
    • xG: Expected Goals
    • PSxG: Post-Shot Expected Goals. PSxG is expected goals based on how likely the goalkeeper is to save the shot
    • Outcome: Shot result
    • Distance: Whether Arsenal is the home team or not. 1-yes ,0-no
    • Body Part: the body part used to score the goal
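
    As a rough illustration of how the per-shot variables above might be used, the sketch below sums xG per squad. The three shot rows, the player names, and the Outcome labels are hypothetical placeholders, not values from the dataset, and `xg_by_squad` is a name chosen here for illustration.

```python
from collections import defaultdict

# Hypothetical shot rows using the variable names listed above.
shots = [
    {"Player": "Player 1", "Squad": "Arsenal", "xG": 0.25, "Outcome": "Goal"},
    {"Player": "Player 2", "Squad": "Arsenal", "xG": 0.5, "Outcome": "Saved"},
    {"Player": "Player 3", "Squad": "Nottingham Forest", "xG": 0.125,
     "Outcome": "Off Target"},
]


def xg_by_squad(rows):
    """Sum the xG of every shot, grouped by the Squad column."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["Squad"]] += float(row["xG"])
    return dict(totals)


totals = xg_by_squad(shots)
```

    Summing shot-level xG gives each team's expected goals for the match, which can then be compared with the actual score.
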
  13. Football Analytics (Event data)

    • kaggle.com
    Updated Aug 25, 2020
    + more versions
    Cite
    HARDIK AGARWAL (2020). Football Analytics (Event data) [Dataset]. https://www.kaggle.com/datasets/hardikagarwal1/football-analytics-event-data-statsbomb/data
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 25, 2020
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    HARDIK AGARWAL
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Most publicly available football (soccer) statistics are limited to aggregated data such as Goals, Shots, Fouls, Cards. When assessing performance or building predictive models, this simple aggregation, without any context, can be misleading. For example, a team that produced 10 shots on target from long range has a lower chance of scoring than a club that produced the same number of shots from inside the box. However, metrics derived from this simple count of shots will assess the two teams similarly.

    A football game generates hundreds of events and it is very important and interesting to take into account the context in which those events were generated. This incredibly rich data set should keep football analytics enthusiasts awake for long hours as the size of the data set and number of questions that can be asked is huge.

    Content

    There are 4 main files containing the data:

    1) Competition data: Contains information regarding competition id, competition name, season id, season name, country and gender.

    2) Match data: Match information for each match including competition and season information, stadium and referee information, home and away team information as well as the data version the match was collected under.

    3) Lineup data: Records the lineup information for the players, managers and referees involved with each match. The following variables are collected in the lineups of each match: team id, team name and lineup. The lineup array is a nested data frame inside the lineup object. It contains the following information for each team: player id, player name, player nickname, jersey number and country.

    4) Event data: Event data comprises general attributes and event-specific attributes. General attributes are recorded for most event types, depending only on applicability. Event-specific attributes help describe the event type in more detail as well as describe the outcome of the event type.

    The open data specification document in the doc folder describes the structure of the data along with all attributes in great detail. Take a look at this file for deeper understanding of the data.

    Acknowledgements

    This data is from the StatsBomb Open Data repository. StatsBomb are committed to sharing new data and research publicly to enhance understanding of the game of Football. They want to actively encourage new research and analysis at all levels. Therefore they have made certain leagues of StatsBomb Data freely available for public use for research projects and genuine interest in football analytics.

    Inspiration

    There are many questions we can ask with such detailed event data. Here are just a few examples:

    • What is the value of a shot? Or what is the probability of a shot being a goal given its location, shooter, league, assist method, game state, number of players on the pitch, and time? Models that answer this are known as expected goals (xG) models.
    • When are teams more likely to score?
    • Which teams are the best or sloppiest at holding the lead?
    • Which teams or players make the best use of set pieces?
    • How do players compare when they shoot with their weak foot versus strong foot? Or which players are ambidextrous?
    • Identify different styles of play (shooting from long range vs shooting from the box, crossing the ball vs passing the ball, use of headers).
    • Which teams have a bias for attacking on a particular flank?

  14. Replication Data for: On-chip topological beamformer for multi-link...

    • researchdata.ntu.edu.sg
    tsv, txt, xlsx
    Updated May 28, 2024
    Cite
    DR-NTU (Data) (2024). Replication Data for: On-chip topological beamformer for multi-link terahertz 6G to XG wireless [Dataset]. http://doi.org/10.21979/N9/UKLX3D
    Explore at:
    Available download formats: tsv(84573), txt(10524802), tsv(672230), xlsx(31441085), tsv(49361), tsv(175), txt(10525066), txt(257341170), tsv(7984), txt(55558929), tsv(122129), tsv(731976), xlsx(36710154), txt(130061574), txt(2523595), tsv(140495), txt(10528406), tsv(165699), txt(130108121), tsv(225), txt(11541389), tsv(875362), txt(63239846), xlsx(79337348), tsv(121102), txt(2523009), tsv(557755), tsv(4243349), tsv(40527), xlsx(44807459), txt(99399346), tsv(391), tsv(803554), txt(10528626), txt(43981350), txt(161190852), tsv(109436), txt(83527634), xlsx(27524408), txt(304732790), tsv(12814), tsv(3470408)
    Dataset updated
    May 28, 2024
    Dataset provided by
    DR-NTU (Data)
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Dataset funded by
    National Research Foundation (NRF)
    Agence Nationale de la Recherche (ANR)
    Description

    This dataset contains all the data used in the paper, titled 'On-chip topological beamformer for multi-link terahertz 6G to XG wireless'.

  15. A comparative analysis of DC, CTGAN-DC, XGBoost, CTGAN-XG, and TVAE-XG...

    • plos.figshare.com
    xls
    Updated Dec 31, 2024
    Cite
    Chuan-Sheng Hung; Chun-Hung Richard Lin; Jain-Shing Liu; Shi-Huang Chen; Tsung-Chi Hung; Chih-Min Tsai (2024). A comparative analysis of DC, CTGAN-DC, XGBoost, CTGAN-XG, and TVAE-XG models in Kawasaki Disease experiments. [Dataset]. http://doi.org/10.1371/journal.pone.0314995.t003
    Explore at:
    Available download formats: xls
    Dataset updated
    Dec 31, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Chuan-Sheng Hung; Chun-Hung Richard Lin; Jain-Shing Liu; Shi-Huang Chen; Tsung-Chi Hung; Chih-Min Tsai
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Washington
    Description

    A comparative analysis of DC, CTGAN-DC, XGBoost, CTGAN-XG, and TVAE-XG models in Kawasaki Disease experiments.

  16. Data from: Measurement of the Interference Structure Function Xg(3) (X) in...

    • hepdata.net
    Updated 1984
    Cite
    Argento, A.; Benvenuti, A.C.; Bollini, D.; Bruni, G.; Camporesi, T.; Heiman, G.; Monari, L.; Navarria, F.L.; Bozzo, M.; Deiters, K. (1984). Measurement of the Interference Structure Function Xg(3) (X) in Muon - Nucleon Scattering [Dataset]. http://doi.org/10.17182/hepdata.13025.v1
    Explore at:
    Dataset updated
    1984
    Dataset provided by
    HEPData
    Authors
    Argento, A.; Benvenuti, A.C.; Bollini, D.; Bruni, G.; Camporesi, T.; Heiman, G.; Monari, L.; Navarria, F.L.; Bozzo, M.; Deiters, K.
    Description

    DATA REQUESTED IN JUNE 1985 BY YGS. DATA SUPPLIED BY AUTHORS IN SEP 1985. FIRST MEASUREMENT OF INTERFERENCE STRUCTURE FUNCTION XG3(X) SCATTERING POSITIVE AND NEGATIVE MUONS OFF CARBON TARGET. BCDMS (NA4) COLLABORATION. Q*2 IN THE RANGE 40 TO 180 GEV**2. WARNING(VVE-94): SOURCE OF THE NUMERICAL DATA UNKNOWN, DATA ON XG3(X) FROM THE SAME EXPERIMENT SUPPLIED BY AUTHORS SEE IN PART=1 OF THE RECORD.

  17. Athlete country ranking table.

    • figshare.com
    xls
    Updated Aug 22, 2024
    + more versions
    Cite
    Beat Knechtle; Katja Weiss; David Valero; Elias Villiger; Pantelis T. Nikolaidis; Marilia Santos Andrade; Volker Scheer; Ivan Cuk; Robert Gajda; Mabliny Thuany (2024). Athlete country ranking table. [Dataset]. http://doi.org/10.1371/journal.pone.0303960.t001
    Explore at:
    Available download formats: xls
    Dataset updated
    Aug 22, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Beat Knechtle; Katja Weiss; David Valero; Elias Villiger; Pantelis T. Nikolaidis; Marilia Santos Andrade; Volker Scheer; Ivan Cuk; Robert Gajda; Mabliny Thuany
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The present study intended to determine the nationality of the fastest 100-mile ultra-marathoners and the country/events where the fastest 100-mile races are held. A machine learning model based on the XG Boost algorithm was built to predict the running speed from the athlete’s age (Age group), gender (Gender), country of origin (Athlete country) and where the race occurred (Event country). Model explainability tools were then used to investigate how each independent variable influenced the predicted running speed. A total of 172,110 race records from 65,392 unique runners from 68 different countries participating in races held in 44 different countries were used for analyses. The model rates Event country (0.53) as the most important predictor (based on data entropy reduction), followed by Athlete country (0.21), Age group (0.14), and Gender (0.13). In terms of participation, the United States leads by far, followed by Great Britain, Canada, South Africa, and Japan, in both athlete and event counts. The fastest 100-mile races are held in Romania, Israel, Switzerland, Finland, Russia, the Netherlands, France, Denmark, Czechia, and Taiwan. The fastest athletes come mostly from Eastern European countries (Lithuania, Latvia, Ukraine, Finland, Russia, Hungary, Slovakia) and also Israel. In contrast, the slowest athletes come from Asian countries like China, Thailand, Vietnam, Indonesia, Malaysia, and Brunei. The difference between male and female predictions is relatively small at about 0.25 km/h. The fastest age group is 25–29 years, but the average speeds of groups 20–24 and 30–34 years are close. Participation, however, peaks for the age group 40–44 years. The model predicts the event location (country of event) as the most important predictor for a fast 100-mile race time. Athletes and coaches can use these findings for their race preparation to find the most appropriate racecourse for a fast 100-mile race time.

  18. Arsenal Squad Shooting Stats 23/24

    • kaggle.com
    Updated Aug 1, 2024
    Cite
    orkunaktas4 (2024). Arsenal Squad Shooting Stats 23/24 [Dataset]. http://doi.org/10.34740/kaggle/dsv/9077993
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 1, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    orkunaktas4
    Description

    Context

    This dataset contains shot statistics of Arsenal players for the 2023/24 season.

    Player: The name of the player.

    Nation: The name or abbreviation of the player's country.

    Pos: The position the player plays (e.g., Forward, Midfielder, Defender).

    Age: The age of the player.

    90s: The number of minutes the player has played in terms of 90-minute units. This represents the total amount of time played converted into 90-minute segments.

    Gls: The total number of goals scored by the player.

    Sh: The total number of shots taken by the player.

    SoT: The number of shots on target by the player (shots that are on goal).

    SoT%: The percentage of shots on target by the player. Calculation: (SoT / Sh) * 100.

    Sh/90: The average number of shots per match by the player. Calculation: Sh / 90s.

    SoT/90: The average number of shots on target per match by the player. Calculation: SoT / 90s.

    G/Sh: The average number of goals per shot by the player. Calculation: Gls / Sh.

    G/SoT: The average number of goals per shot on target by the player. Calculation: Gls / SoT.

    Dist: The average distance of the player’s shots (typically measured from the goal).

    FK: The number of goals scored from free kicks by the player.

    PK: The number of goals scored from penalties by the player.

    PKatt: The number of penalties taken by the player.

    xG: The expected goals of the player. This metric measures the probability of a shot resulting in a goal, accounting for the quality of the chances.

    npxG: The expected goals of the player excluding penalties. This calculates xG without considering penalty shots.

    npxG/Sh: The ratio of expected goals excluding penalties per shot by the player. Calculation: npxG / Sh.

    G-xG: The difference between the goals scored and the expected goals. Calculation: Gls - xG. This shows how much more or less the player has scored compared to the expected goals.

    np
    The difference between non-penalty goals and expected goals. Calculation: Gls - npxG. This shows how much more or less the player has scored in open play compared to the expected goals.
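
    The "Calculation" notes above can be checked with a few lines of code. The sketch below recomputes the derived columns from the raw totals. The function name `shooting_rates` and the example numbers are hypothetical and do not come from the dataset; returning None for a zero denominator is a choice made here, not something the source specifies.

```python
def shooting_rates(gls, sh, sot, nineties, xg, npxg):
    """Recompute the derived shooting columns from raw season totals."""

    def ratio(num, den):
        # Undefined ratios (zero denominator) are reported as None.
        return num / den if den else None

    sot_ratio = ratio(sot, sh)
    return {
        "SoT%": None if sot_ratio is None else 100 * sot_ratio,
        "Sh/90": ratio(sh, nineties),
        "SoT/90": ratio(sot, nineties),
        "G/Sh": ratio(gls, sh),
        "G/SoT": ratio(gls, sot),
        "npxG/Sh": ratio(npxg, sh),
        "G-xG": gls - xg,
    }


# Hypothetical player: 10 goals from 40 shots (20 on target) in 20.0 90s.
example = shooting_rates(gls=10, sh=40, sot=20, nineties=20.0, xg=8.5, npxg=8.0)
```

    A positive G-xG means the player scored more than the chances suggested, and a negative value means fewer.
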
  19. Champions League 23/24

    • kaggle.com
    Updated May 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sharvagya (2024). Champions League 23/24 [Dataset]. http://doi.org/10.34740/kaggle/ds/5071658
    Explore at:
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 24, 2024
    Dataset provided by
    Kaggle
    Authors
    Sharvagya
    Description

    Champions League 2023/2024 Dataset

    Overview

    This dataset provides detailed statistics for the UEFA Champions League 2023/2024 season, focusing on team performance across various metrics. The data is sourced from FBref, a comprehensive platform for football statistics. This single-table dataset includes metrics such as matches played, wins, losses, goals scored, expected goals (xG), and more for each team participating in the Champions League.

    Dataset Content

    The dataset is structured as a single CSV file with the following headers:

    • Rk: Rank of the team based on the stage of the competition reached.
    • Country: The country of the club.
    • Squad: The name of the club.
    • MP: Matches played.
    • W: Matches won.
    • D: Matches drawn.
    • L: Matches lost.
    • GF: Goals for - total goals scored by the team.
    • GA: Goals against - total goals conceded by the team.
    • GD: Goal difference (GF - GA).
    • Pts: Total points accumulated by the team
    • xG: Expected goals - a metric that estimates the number of goals a team should have scored based on the quality of their chances.
    • xGA: Expected goals against - a metric that estimates the number of goals a team should have conceded based on the quality of chances they allowed.
    • xGD: Expected goal difference (xG - xGA).
    • xGD/90: Expected goal difference per 90 minutes.
    • Last 5: Results of the last 5 matches (e.g., WWDWL for 3 wins, 1 draw, and 1 loss).
    • Attendance: Average attendance for home matches.
    • Top Team Scorer: The name of the top scorer for the team.
    • Goalkeeper: The name of the main goalkeeper for the team.
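
    The difference columns in the list above follow directly from the other headers. The sketch below recomputes GD and xGD and tallies a "Last 5" string. It is a minimal illustration with hypothetical numbers: the function names are chosen here, and dividing xGD by MP to get xGD/90 assumes every match lasted exactly 90 minutes, which ignores extra time.

```python
from collections import Counter


def goal_differences(gf, ga, xg, xga, mp):
    """Recompute GD, xGD and an approximate xGD/90 from the raw columns."""
    xgd = xg - xga
    return {
        "GD": gf - ga,
        "xGD": xgd,
        # Assumption: one match = 90 minutes, so per-90 equals per-match.
        "xGD/90": xgd / mp if mp else None,
    }


def parse_last5(results):
    """Count wins, draws and losses in a string such as 'WWDWL'."""
    counts = Counter(results)
    return {"W": counts["W"], "D": counts["D"], "L": counts["L"]}


# Hypothetical team row.
diffs = goal_differences(gf=20, ga=10, xg=18.5, xga=12.5, mp=12)
form = parse_last5("WWDWL")
```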

    Data Source

    The data has been scraped from FBref, a well-known source for football statistics. FBref provides detailed and historical data for various football competitions worldwide, including the UEFA Champions League.

    Acknowledgements

    • FBref: For providing the comprehensive data used to compile this dataset.
    • Kaggle: For hosting and facilitating data science competitions and datasets.
  20. Streamlining Orders

    • open.canada.ca
    • ouvert.canada.ca
    csv, pdf
    Updated Feb 24, 2024
    + more versions
    Cite
    Canada Energy Regulator (2024). Streamlining Orders [Dataset]. https://open.canada.ca/data/en/dataset/e13f8b67-c13b-4fab-8968-42cc3afe92d7
    Explore at:
    Available download formats: pdf, csv
    Dataset updated
    Feb 24, 2024
    Dataset provided by
    Canadian Energy Regulator (https://www.cer-rec.gc.ca/en/index.html)
    License

    Open Government Licence - Canada 2.0: https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Oct 1, 2003 - Mar 31, 2017
    Description

    This data presents details of projects undertaken pursuant to the Streamlining Order XG/XO-100-2012. This Order provides the Canada Energy Regulator’s approval for the construction of certain classes of oil and gas projects regulated under the Canada Energy Regulator Act. The data ranges from 2003 to current; it is updated annually.
