Below you will find a collection of data files containing match data from the LCS, LEC, LCK, LPL, PCS, CBLoL, and many more leagues. Files are in .csv format.
All data has been aggregated and released by Tim Sevenhuysen of OraclesElixir.com. It is provided free of charge and is intended for use by analysts, commentators, and fans.
Changelogs, news, and updates are maintained on the Oracle's Elixir Discord server in the oe-data-updates channel.
Definitions for the data in these files can be found or inferred from the information on theDefinitions page.
Questions or requests? Get in touch, or join the Oracle's Elixir Discord server.
If you find this downloadable data useful, please consider helping out with the cost of running the site by subscribing on Patreon.
Game statistics are the property of Riot Games, and any usage of such data must follow Riot Games' terms and policies.
Access files via Google Drive.
Released in October 2020, League of Legends: Wild Rift is a mobile MOBA (multiplayer online battle arena) gaming title which was developed and published by Riot Games. The title is a mobile version of the MOBA genre classic League of Legends and generated approximately 970 thousand app downloads in December 2023.
I hope this will be great!!
I am writing articles on League of Legends and Machine Learning. You can find the full repository where this information is stored here.
This statistic illustrates the number of League of Legends (LoL) monthly active users worldwide from 2011 to 2016. In 2016, LoL had 100 million MAU, up from 90 million in 2015. Being one of the most prominent eSports games, in 2015 LoL championship finals attracted 36 million viewers worldwide. What is more, with the rise of gaming video content popularity, video games enthusiasts can now watch other players try their chances in the game, from the comfort of their own home. And so, League of Legends was one of the top games on Twitch, based on the number of hours viewed in 2016. It comes a no surprise that LoL held the highest share of the MOBA games revenues that year.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This data set is the High elo Ranked Games data for a TFT game called TeamFights Tactics.
It is a game in which 8 participants fight against each other by forming various combinations suitable for their strategy, and it is composed of individual exhibitions.
The data was collected only from High elo Ranked Games, and we built three ranking games, Rolltoches' Top Rank Challenger, Grandmaster, and Master.
A similar game is Auto Chess, which combines approximately 50 champions to create their own combination. There are 10 series synergies and 13 job synergies, and you can create various combinations.
The final level is 9 levels, and there is a limit to what you can put on the board for each level. You can raise 1 champion per level, and when you reach the final 9 level, you can raise 9 champions on the board. (May vary slightly depending on champion skill and combination characteristics)
There are a total of 8 variables, and each description is described in the data set column description below.
Most of the columns are explained intuitively, but we'll cover the two columns in more detail.
First, "Combination" Column
Second, "Champion" Columns
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This dataset represents, at the time of writing and to the best of the author's knowledge, the largest and most comprehensive dataset of League of Legends ranked matches from NA. There are approximately 10 000 matches in this data set, with each match containing over 700 individual items, ranging from champion/spell choice, team stats, to individual player performance.
This dataset is the result of my personal project, and I wanted a comprehensive dataset of ranked matches from NA. Each match is pulled from players who are approximately Gold-ranked.
This dataset would not have been possible without the help of the RiotWatcher wrapper library (https://riot-watcher.readthedocs.io/en/latest/), which is primarily responsible for calling the Riot Games API.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Game stats for all matches in the League of Legends Worlds 2021 Play-in Groups
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This is a collection of over 50,000 ranked EUW games from the game League of Legends, as well as json files containing a way to convert between champion and summoner spell IDs and their names. For each game, there are fields for:
This dataset was collected using the Riot Games API, which makes it easy to lookup and collect information on a users ranked history and collect their games. However finding a list of usernames is the hard part, in this case I am using a list of usernames scraped from 3rd party LoL sites.
There is a vast amount of data in just a single LoL game. This dataset takes the most relevant information and makes it available easily for use in things such as attempting to predict the outcome of a LoL game, analysing which in-game events are most likely to lead to victory, understanding how big of an effect bans of a specific champion have, and more.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains start & end timestamps for League of Legends games played via the Duowan plugin. We randomly sampled 100k user_ids and gathered all games associated with those users.
The continuous development of e-sports is generating a daily trail of insightful data in high volume, to the point that justifies the use of exploratory data analysis.
In particular, the multiplayer online battle arena (MOBA) game League of Legends (LoL), organizes one of the most viewed tournaments, attracting over 4 million peak viewers.
The game lets participants choose between more than 161 champions with different characteristics and game play mechanics affecting the dynamics of team composition. Thus, champion selection is of capital importance for pro players.
Multiple works focused on champion selection data in order to predict team victory for DOTA, a MOBA similar to League of Legends, but LoL is still under-researched. And with the regular new patches received, it is difficult to compare predictor performances across time.
To this objective, we are releasing this curated dataset such that others can try their own architectures on victory prediction from champion selection data, thus offering a benchmark dataset for the community.
This dataset has been collected by Devoteam Revolve from Riot Developer API
http://france.devoteam.com/wp-content/uploads/sites/21/2021/05/logo-cartouches-RVB-ROUGE.png" alt="Devoteam logo">
The dataset has a total of 84440 games that are from 2022 at the version 12.12 of the game.
The games are only from the highest ELO players, with ranks of either Master, Grand Master and Challenger. This ranks represents the top 1.2% of all players.
The dataset comes pre splitted
Set | Proportion | size |
---|---|---|
Training | 90% | 75970 |
Validation | 5% | 4239 |
Test | 5% | 4231 |
Dataset organization:
12.12.-splits
├── test
| ├── df_00000.csv
| | ...
| └── df_xxxxx.csv
|
├── train
| ├── df_00000.csv
| | ...
| └── df_xxxxx.csv
|
└── val
| ├── df_00000.csv
| | ...
| └── df_xxxxx.csv
|
└── champion.json
All champions information can be found under ./12.12.-splits/champion.json
This file allows the conversion from Player_{Player_id}_pick
id number to the champion name.
Multiple other information are also freely available such has champion damages, HP, etc ...
All the matches are collected in the 3 directories:
./12.12.-splits/train/
./12.12.-splits/val/
./12.12.-splits/test/
Each of these directories contain multiple df_xxxxx.csv
files detailing up to 100 matches.
The description of each column can be read in the below table.
The column which possess {Player_id}
in their name are repeated 10 times, one for each player.
For example, the column name Player_{Player_id}_team
can be found in each csv as 10 different columns with names ranging from Player_1_team
to Player_10_team
.
Column name | Use das input | Path from Match-V5 | type | description |
---|---|---|---|---|
gameId | No | info/gameId | str | unique value for each match |
matchId | No | metadata/matchId | str | gameId prefixed with the players region |
gameVersion | No | info/gameVersion | str | game version, the first two parts can be used to determine the patch |
gameDuration | No | info/gameDuration | int | game duration in seconds |
teamVictory | No | info/teams[t]/win | int | Team victory, either 100 for blue, or 200 for red |
team_100_gold | No | info/participants[]/goldEarned | int | Total gold earned by blue team |
team_200_gold | No | info/participants[]/goldEarned | int | Total gold earned by red team |
Player_id | Yes | info/participants/participantId | int | Player id ranging from 1 to 10 included |
Player_{Player_id}_team | Yes | info/participants/teamId | int | Player team, either 100 for blue team, or 200 for red team |
Player_{Player_id}_ban | Yes | info/teams[t]/bans[i]/championId | int | Player champion banned |
Player_{Player_id}_pick | Yes | info/participants[i]/championId | int | Player champion picked |
Player_{Player_id}_ban_turn | Yes | info/teams[t]/bans[i]/pickTurn | int | Player pick order |
Player_{Player_id}_victory | No | info/teams[t]/win | int | Either 1 for victory or 0 for defeat |
Player_{Player_id}_role | No | info/participants[i]/role | str | Role declared by the player before match. Possible values: DUO, DUO_CARRY, DUO_SUPPORT, NONE, and SOLO |
Player_{Player_id}_position | No | info/participants[i]/teamPosition | str | Role deduced after match from every players position. Possible values: TOP, MIDDLE, JUNGLE, BOTTOM, UTILITY, APEX, and NONE |
Player_{Player_id}_time_game | No | info/gameDuration | int | Game duration in seconds |
Player_{Player_id}_gold | No | info/participants[i]/goldEarned | int | Total gold earned |
Player_{Player_id}_xp | No | info/participants[i]/champExperience | int | Total XP accumulated |
Player_{Player_id}_dmg_dealt | No | info/participants[i]/totalDamageDealtToChampions | int | Total damages dealt to other champions |
Player_{Player_id}_dmg_taken | No | info/participants[i]/totalDamageTaken | int | Total damages received |
Player_{Player_id}_time_ccing | No | info/participants[i]/timeCCingOthers | int | Total time of crowd control inflicted to other champs |
A loading example for the dataset can be found under https://www.kaggle.com/ezalos/loading-lol-dataset
This dataset was created by Danilo Donato
It contains the following files:
This dataset comprises comprehensive information from ranked matches played in the game League of Legends, spanning the time frame between January 12, 2023, and May 18, 2023. The matches cover a wide range of skill levels, specifically from the Iron tier to the Diamond tier. The dataset is structured based on time intervals, presenting game data at various percentages of elapsed game time, including 20%, 40%, 60%, 80%, and 100%. For each interval, detailed match statistics, player performance metrics, objective control, gold distribution, and other vital in-game information are provided. This collection of data not only offers insights into how matches evolve and strategies change over different phases of the game but also enables the exploration of player behavior and decision-making as matches progress. Researchers and analysts in the field of esports and game analytics will find this dataset valuable for studying trends, developing predictive models, and gaining a deeper understanding of the dynamics within ranked League of Legends matches across different skill tiers.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
League of Legends KR High Elo 5v5 Match Data
Related project link: GitHub
The dataset is retrieved using the Riot API. For documentation of the API please visit the website.
The dataset contains information about all League of Legends KR server challengers (n=300) as of 2022-05-23. The account information is stored in accounts.json, whereas the information about the challenger league is in kr_challenger_league.json.
Match data was retrieved from the 5 most recent 5v5 ranked solo matches for each challenger account. There are in total 811 unique matches, and the information is stored in matches.json. The matches are further cleaned only to include games that last more than 16 minutes (n=787), which are stored in matches_cleaned.json.
This dataset is coming from Oracle Elixir (https://oracleselixir.com/) and gather the games information of every single official competitive game of League of Legends in 2024.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘League of Legends Diamond Ranked Games (10 min)’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/bobbyscience/league-of-legends-diamond-ranked-games-10-min on 21 November 2021.
--- Dataset description provided by original source is as follows ---
League of Legends is a MOBA (multiplayer online battle arena) where 2 teams (blue and red) face off. There are 3 lanes, a jungle, and 5 roles. The goal is to take down the enemy Nexus to win the game.
This dataset contains the first 10min. stats of approx. 10k ranked games (SOLO QUEUE) from a high ELO (DIAMOND I to MASTER). Players have roughly the same level.
Each game is unique. The gameId can help you to fetch more attributes from the Riot API.
There are 19 features per team (38 in total) collected after 10min in-game. This includes kills, deaths, gold, experience, level... It's up to you to do some feature engineering to get more insights.
The column blueWins is the target value (the value we are trying to predict). A value of 1 means the blue team has won. 0 otherwise.
So far I know, there is no missing value.
Thanks, Rito Gaming.
--- Original source retains full ownership of the source dataset ---
This dataset was created by James Jung
This dataset was created by Rei Cripto
Includes data and code for both analysis and data mining.
Github link with a .README file https://github.com/Vutya/LoL_player_segmenation
This dataset contains the statistic data related to each League of Legends champion's profile of the op.gg webpage. It contains data related to every champion within the last month of ranked season 11. It has been created only for educational purposes with a web scraper based on scrapy.
Below you will find a collection of data files containing match data from the LCS, LEC, LCK, LPL, PCS, CBLoL, and many more leagues. Files are in .csv format.
All data has been aggregated and released by Tim Sevenhuysen of OraclesElixir.com. It is provided free of charge and is intended for use by analysts, commentators, and fans.
Changelogs, news, and updates are maintained on the Oracle's Elixir Discord server in the oe-data-updates channel.
Definitions for the data in these files can be found or inferred from the information on theDefinitions page.
Questions or requests? Get in touch, or join the Oracle's Elixir Discord server.
If you find this downloadable data useful, please consider helping out with the cost of running the site by subscribing on Patreon.
Game statistics are the property of Riot Games, and any usage of such data must follow Riot Games' terms and policies.
Access files via Google Drive.