85 datasets found
  1. standard-chess-games

    • huggingface.co
    Cite
    lichess, standard-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/standard-chess-games
    Explore at:
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    [!CAUTION] This dataset is still a work in progress and some breaking changes might occur.

      Lichess Rated Standard Chess Games Dataset

      Dataset Description
    

    6,771,826,271 standard rated games, played on lichess.org, updated monthly from the database dumps. This version of the data is meant for data analysis. If you need PGN files you can find those here. That said, once you have a subset of interest, it is trivial to convert it back to PGN as shown in the Dataset Usage… See the full description on the dataset page: https://huggingface.co/datasets/Lichess/standard-chess-games.
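The description above says a subset can be converted back to PGN. The dataset's own conversion lives in its Dataset Usage section and is not reproduced here; what follows is only a minimal sketch of the idea. The column names (`Event`, `Site`, `White`, `Black`, `Result`, `movetext`, and so on) are assumptions made for illustration and should be checked against the dataset page.

```python
# Hypothetical column names; verify against the real dataset schema.
SEVEN_TAG_ROSTER = ["Event", "Site", "Date", "Round", "White", "Black", "Result"]
# PGN placeholders for unknown values.
DEFAULTS = {"Date": "????.??.??", "Result": "*"}


def row_to_pgn(row: dict) -> str:
    """Render one game record as PGN: tag pairs, a blank line, then movetext."""
    lines = []
    for tag in SEVEN_TAG_ROSTER:
        value = str(row.get(tag, DEFAULTS.get(tag, "?")))
        # Escape backslashes and double quotes inside tag values.
        value = value.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'[{tag} "{value}"]')

    result = str(row.get("Result", DEFAULTS["Result"]))
    movetext = str(row.get("movetext", "")).strip()
    # PGN movetext ends with the game termination marker.
    if not movetext.endswith(result):
        movetext = f"{movetext} {result}".strip()

    return "\n".join(lines) + "\n\n" + movetext + "\n"


if __name__ == "__main__":
    example = {
        "Event": "Rated Blitz game",
        "White": "alice",
        "Black": "bob",
        "Result": "1-0",
        "movetext": "1. e4 e5 2. Qh5 Ke7 3. Qxe5# 1-0",
    }
    print(row_to_pgn(example))
```

Writing the strings returned by `row_to_pgn` one after another into a file, separated by blank lines, gives a multi-game PGN file.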

  2. Chess Game (Lichess) Dataset

    • cubig.ai
    Updated May 28, 2025
    Cite
    CUBIG (2025). Chess Game (Lichess) Dataset [Dataset]. https://cubig.ai/store/products/326/chess-game-lichess-dataset
    Explore at:
    Dataset updated
    May 28, 2025
    Dataset authored and provided by
    CUBIG
    License

    https://cubig.ai/store/terms-of-service

    Measurement technique
    Synthetic data generation using AI techniques for model training, Privacy-preserving data transformation via differential privacy
    Description

    1) Data Introduction
    • The Chess Game Dataset (Lichess) includes details of more than 20,000 chess matches collected on Lichess.org, as well as players, openings, and match results.

    2) Data Utilization
    (1) Characteristics of the Chess Game Dataset (Lichess):
    • The dataset provides a variety of variables related to chess matches, including match ID, match start and end times, turn count, winner, player rating, opening code and name, and the full move sequence.
    (2) Uses of the Chess Game Dataset (Lichess):
    • Opening win rate analysis: analyzing how often each opening is used and how often it wins can support the study of effective chess strategies.
    • Player skill prediction: player ratings, match results, and move sequence data can be used to predict wins and losses and to analyze factors behind performance improvement.

  3. Data from: chess-puzzles

    • huggingface.co
    Updated Oct 6, 2025
    Cite
    lichess (2025). chess-puzzles [Dataset]. https://huggingface.co/datasets/Lichess/chess-puzzles
    Explore at:
    Dataset updated
    Oct 6, 2025
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for Lichess Puzzles

      Dataset Description
    

    5,423,662 chess puzzles, rated and tagged. See them in action on Lichess. This dataset is updated monthly, and was last updated on October 7th, 2025.

      Dataset Creation
    

    Generating the initial dataset of chess puzzles took more than 50 years of CPU time. We went through 300,000,000 analyzed games from the Lichess database, and re-analyzed interesting positions with Stockfish 12/13/14/15 NNUE at 40 meganodes. The… See the full description on the dataset page: https://huggingface.co/datasets/Lichess/chess-puzzles.

  4. chess-position-evaluations

    • huggingface.co
    Updated Oct 6, 2025
    + more versions
    Cite
    lichess (2025). chess-position-evaluations [Dataset]. https://huggingface.co/datasets/Lichess/chess-position-evaluations
    Explore at:
    Dataset updated
    Oct 6, 2025
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for the Lichess Evaluations dataset

      Dataset Description
    

    302,517,109 chess positions evaluated with Stockfish at various depths and node counts. Produced by, and for, the Lichess analysis board, running various flavours of Stockfish within user browsers. This version of the dataset is a de-normalized version of the original dataset and contains 752,452,094 rows. This dataset is updated monthly, and was last updated on Thursday 16th, 2025.

      Dataset… See the full description on the dataset page: https://huggingface.co/datasets/Lichess/chess-position-evaluations.
    
  5. Pawnster101 Chess Games on Lichess - Dataset - Datopian CKAN instance

    • demo.dev.datopian.com
    Updated Aug 27, 2025
    Cite
    (2025). Pawnster101 Chess Games on Lichess - Dataset - Datopian CKAN instance [Dataset]. https://demo.dev.datopian.com/dataset/ellazero-portal--pawnster101-chess-games-on-lichess
    Explore at:
    Dataset updated
    Aug 27, 2025
    Description

    A collection of 25,940 chess games played by the user 'pawnster101' on the online platform Lichess.org. The dataset includes game data and player interactions in a clean interface without ads or registration requirements.

  6. Chess Game Dataset

    • kaggle.com
    zip
    Updated Sep 19, 2020
    + more versions
    Cite
    Praveen Kumar (2020). Chess Game Dataset [Dataset]. https://www.kaggle.com/penchalaiah123/chess-game-dataset
    Explore at:
    Available download formats: zip (2903760 bytes)
    Dataset updated
    Sep 19, 2020
    Authors
    Praveen Kumar
    Description

    General Info

    This is a set of just over 20,000 games collected from a selection of users on the site Lichess.org, and how to collect more. I will also upload more games in the future as I collect them. This set contains the:

    • Game ID;
    • Rated (T/F);
    • Start Time;
    • End Time;
    • Number of Turns;
    • Game Status;
    • Winner;
    • Time Increment;
    • White Player ID;
    • White Player Rating;
    • Black Player ID;
    • Black Player Rating;
    • All Moves in Standard Chess Notation;
    • Opening Eco (Standardised Code for any given opening);
    • Opening Name;
    • Opening Ply (Number of moves in the opening phase)

    for each of these separate games from Lichess. I collected this data using the Lichess API, which enables collection of any given user's game history. The difficult part was collecting usernames to use; however, the API also enables dumping of all users in a Lichess team. There are several teams on Lichess with over 1,500 players, so this proved an effective way to get users to collect games.

    Possible Uses

    Lots of information is contained within a single chess game, let alone a full dataset of multiple games. It is primarily a game of patterns, and data science is all about detecting patterns in data, which is why chess has been one of the most invested in areas of AI in the past. This dataset collects all of the information available from 20,000 games and presents it in a format that is easy to process for analysis of, for example, what allows a player to win as black or white, how much meta (out-of-game) factors affect a game, the relationship between openings and victory for black and white and more.
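One of the analyses mentioned above, the relationship between openings and victory, can be sketched in a few lines. This is an illustrative sketch only: the field names `opening_name` and `winner`, and the winner values `"white"`, `"black"`, and `"draw"`, are assumptions based on the field list above and may not match the actual file.

```python
from collections import Counter, defaultdict


def opening_win_rates(games, min_games=1):
    """Return per-opening game counts and white/black/draw shares.

    `games` is an iterable of dicts with the assumed keys
    "opening_name" and "winner".
    """
    counts = defaultdict(Counter)
    for game in games:
        counts[game["opening_name"]][game["winner"]] += 1

    rates = {}
    for opening, by_winner in counts.items():
        total = sum(by_winner.values())
        # Skip openings with too few games to say anything useful.
        if total < min_games:
            continue
        rates[opening] = {
            "games": total,
            "white": by_winner["white"] / total,
            "black": by_winner["black"] / total,
            "draw": by_winner["draw"] / total,
        }
    return rates


if __name__ == "__main__":
    sample = [
        {"opening_name": "Sicilian Defense", "winner": "white"},
        {"opening_name": "Sicilian Defense", "winner": "black"},
        {"opening_name": "French Defense", "winner": "black"},
    ]
    for name, stats in opening_win_rates(sample).items():
        print(name, stats)
```

Raising `min_games` filters out rarely played openings, whose win rates would otherwise be dominated by noise.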

  7. lichess-puzzles

    • huggingface.co
    Updated May 10, 2024
    Cite
    EleutherAI (2024). lichess-puzzles [Dataset]. https://huggingface.co/datasets/EleutherAI/lichess-puzzles
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    May 10, 2024
    Dataset authored and provided by
    EleutherAI (https://eleuther.ai/)
    Description

    This is a collection of ~1.5M chess puzzles from the Lichess database of ~3.9M puzzles (as of 2024-05-09). The set of puzzles from "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" is included, with the exception of 26,079 puzzles that are no longer in the Lichess database (on the assumption that they might have been removed for a good reason). For each puzzle, ctx is a SAN transcript (with every half-move numbered) of an actual Lichess game, up to… See the full description on the dataset page: https://huggingface.co/datasets/EleutherAI/lichess-puzzles.

  8. chess-openings

    • huggingface.co
    Updated Oct 6, 2025
    Cite
    lichess (2025). chess-openings [Dataset]. https://huggingface.co/datasets/Lichess/chess-openings
    Explore at:
    Dataset updated
    Oct 6, 2025
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for Lichess Puzzles

      Dataset Description
    

    3546 chess openings with their Encyclopaedia of Chess Openings (ECO) classification.

      Dataset Creation
    

    Creating this dataset is detailed in its original GitHub repository. Updates to the original repo will also be reflected in this version. Dataset last updated on October 7th, 2025.

      Dataset Usage
    

    Using the datasets library: from datasets import load_dataset

    dset =… See the full description on the dataset page: https://huggingface.co/datasets/Lichess/chess-openings.

  9. lichess.org Traffic Analytics Data

    • analytics.explodingtopics.com
    Updated Aug 1, 2025
    Cite
    (2025). lichess.org Traffic Analytics Data [Dataset]. https://analytics.explodingtopics.com/website/lichess.org
    Explore at:
    Dataset updated
    Aug 1, 2025
    Variables measured
    Global Rank, Monthly Visits, Authority Score, US Country Rank, Computer & Video Games Category Rank
    Description

    Traffic analytics, rankings, and competitive metrics for lichess.org as of August 2025

  10. Chess positions

    • kaggle.com
    Updated Nov 4, 2023
    Cite
    NikiTricky (2023). Chess positions [Dataset]. https://www.kaggle.com/datasets/nikitricky/chess-positions
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Nov 4, 2023
    Dataset provided by
    Kaggle
    Authors
    NikiTricky
    License

    http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    This dataset is a comprehensive collection of chess positions sourced from Lichess, one of the most popular online chess platforms. This dataset offers a valuable resource for researchers, enthusiasts, and developers interested in exploring and analyzing chess games.

    Key Features

    • Chess Positions: The dataset comprises 983,014 unique chess positions obtained from Lichess. Each position represents a snapshot of a game at a specific moment and includes the placement of chess pieces on the board.
    • Stockfish 15.1 Scores: Every chess position in the dataset is accompanied by a score assigned by Stockfish 15.1, a powerful open-source chess engine renowned for its exceptional playing strength. The scores provide an assessment of the position's relative strength, with positive values indicating an advantage for white and negative values indicating an advantage for black.
    • Additional Information: Alongside the chess positions and Stockfish scores, the dataset includes supplementary information associated with each position. This information includes usernames, elo ratings of the players, and the final results of the corresponding games, among other relevant details. This data enriches the dataset and enables various analyses, such as studying the performance of specific players or exploring the relationship between player ratings and position evaluations.

    Potential Use Cases

    • Chess AI Development: Researchers and developers interested in developing or improving chess-playing algorithms can leverage this dataset to train machine learning models. The combination of chess positions and associated Stockfish scores offers a valuable resource for training AI models to make informed decisions and evaluate the strength of chess positions.
    • Chess Strategy Analysis: Chess enthusiasts can utilize this dataset to study and analyze various chess positions and the corresponding evaluations by Stockfish 15.1. By examining positions where the engine assigns significantly higher or lower scores, users can gain insights into strategic moves, patterns, and potential winning strategies.
    • Player Performance Evaluation: The additional information provided in the dataset, such as usernames and elo ratings, allows for an analysis of player performance. Researchers can explore how different players' strategies and decision-making impact the Stockfish evaluations, shedding light on various playing styles and approaches.

    Note: While some positions may appear multiple times in the dataset, they represent distinct moments within different games. The dataset encourages a comprehensive understanding of chess dynamics by offering a diverse range of positions and evaluations.

  11. Lichess Chess Dataset h5 All Wins

    • kaggle.com
    Updated Aug 31, 2025
    Cite
    Seqaeon (2025). Lichess Chess Dataset h5 All Wins [Dataset]. https://www.kaggle.com/datasets/pearsejim01/lichess-chess-dataset-h5-all-wins
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 31, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Seqaeon
    Description

    Dataset

    This dataset was created by Seqaeon

    Contents

  12. Me_Jarvis Lichess Matches

    • kaggle.com
    zip
    Updated Oct 22, 2020
    Cite
    Gaurav Bisht (2020). Me_Jarvis Lichess Matches [Dataset]. https://www.kaggle.com/gauravbisht005/me-jarvis-lichess-matches
    Explore at:
    Available download formats: zip (5974 bytes)
    Dataset updated
    Oct 22, 2020
    Authors
    Gaurav Bisht
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    It all started with the Covid-19 pandemic. The world got stuck at home. I started watching videos from a lot of content creators on YouTube. Then I started watching the videos of a stand-up comedian turned streamer; he introduced chess and, lo and behold, my fondness for chess started!

    Content

    The data in this dataset are all stats of chess matches that I've played either in the morning or in the evening, with my friends or with random opponents. All the data is mine, but I collected it from Lichess.org, an online web app for playing and learning chess. All the data was written down manually by me!

    Acknowledgements

    Thanks to Lichess.org for having come up with such a thought that I was able to access this wonderful place.

    Inspiration

    I'm currently having trouble crossing a 1000 rating in 5 min + 0 increment Blitz. So, what can I do, what openings should I go for, which pieces am I good with, et cetera?

  13. Chess Commentary dataset

    • kaggle.com
    Updated Jul 18, 2025
    Cite
    Jayanth Raj G (2025). Chess Commentary dataset [Dataset]. http://doi.org/10.34740/kaggle/dsv/12511134
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Jul 18, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Jayanth Raj G
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    🧠 About This Dataset

    This dataset showcases synthetic chess commentary generated from real Lichess games, designed to empower research in LLMs, sports analytics, and game understanding.

    🚀 Dataset Creation Pipeline

    • Game Source: Real match data fetched from Lichess.org
    • Analysis Engine: Each game was processed using Stockfish, extracting turn-by-turn evaluations and chess insights
    • Commentary Generator: These insights were passed to Gemini API, producing commentary in Alpaca-style JSON format, structured for LLM compatibility

    🧩 Dataset Contents

    Each JSON object contains:
    • Unique match ID
    • Player moves and board states
    • Stockfish's turn-level analysis
    • Synthetic commentary in alpaca format with each move

    📈 Intended Use Cases

    This dataset supports:
    • Fine-tuning LLMs for dynamic game commentary
    • Evaluating generative models in sports settings
    • Building intelligent chess dashboards or companion apps

    ⚙️ Format & Licensing

    • Format: Single JSON file with game annotated matches
    • License: Apache 2.0—open for reuse and adaptation
  14. lichess_sf

    • huggingface.co
    Updated Feb 29, 2024
    Cite
    Maurice Thomalzig (2024). lichess_sf [Dataset]. https://huggingface.co/datasets/mauricett/lichess_sf
    Explore at:
    Dataset updated
    Feb 29, 2024
    Authors
    Maurice Thomalzig
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    Condensed Lichess Database

    This dataset is a condensed version of the Lichess database. It only includes games for which Stockfish evaluations were available. Currently, the dataset contains the entire year 2023, which consists of >100M games and >2B positions. Games are stored in a format that is much faster to process than the original PGN data.

    Requirements: pip install zstandard python-chess datasets

      Quick Guide
    

    In the following, I explain the data format… See the full description on the dataset page: https://huggingface.co/datasets/mauricett/lichess_sf.

  15. Lichess 2013-2020 Chess Dataset h5 Checkmates_Res

    • kaggle.com
    Updated Aug 30, 2025
    Cite
    Seqaeon (2025). Lichess 2013-2020 Chess Dataset h5 Checkmates_Res [Dataset]. https://www.kaggle.com/datasets/pearsejim01/lichess-2013-2020-chess-dataset
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 30, 2025
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Seqaeon
    Description

    Dataset

    This dataset was created by Seqaeon

    Contents

  16. three-check-chess-games

    • huggingface.co
    Cite
    lichess, three-check-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/three-check-chess-games
    Explore at:
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    [!CAUTION] This dataset is still a work in progress and some breaking changes might occur. In the meantime, please use https://database.lichess.org/#variant_games

  17. chess-co

    • kaggle.com
    Updated Jul 4, 2021
    Cite
    Ankit (2021). chess-co [Dataset]. https://www.kaggle.com/mecha027ankitsingh/chessco/code
    Explore at:
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Jul 4, 2021
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Ankit
    Description

    Context

    This dataset contains more than 7 million chess matches between players rated 2200+ on Lichess, excluding bullet games.

    Content

    This contains a folder of monthly rapid games played on Lichess from June 2020 up to June 2021, in PGN format.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    It was collected from Lichess to build a chess engine capable of making decent moves and then learning by itself to improve.

  18. chess960-chess-games

    • huggingface.co
    Cite
    lichess, chess960-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/chess960-chess-games
    Explore at:
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    [!CAUTION] This dataset is still a work in progress and some breaking changes might occur.

      Note
    

    The FEN column has 961 unique values instead of the expected 960, because some rematches were recorded with invalid castling rights in their starting FEN in November 2023.
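One way to work around the extra value is to compare only the piece-placement field of each FEN (the first space-separated field) and ignore the castling field. The sketch below assumes the extra value differs from a valid one only in its castling rights, which the note suggests but does not state outright. The function name is hypothetical.

```python
def count_start_arrangements(fens):
    """Count distinct starting arrangements by piece placement only.

    A FEN string has six space-separated fields; the first is the piece
    placement and the third is the castling availability.
    """
    placements = {fen.split()[0] for fen in fens}
    return len(placements)


if __name__ == "__main__":
    fens = [
        "bnrqkrnb/pppppppp/8/8/8/8/PPPPPPPP/BNRQKRNB w KQkq - 0 1",
        # Same arrangement, but recorded without castling rights.
        "bnrqkrnb/pppppppp/8/8/8/8/PPPPPPPP/BNRQKRNB w - - 0 1",
        "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1",
    ]
    print(len(set(fens)), count_start_arrangements(fens))
```

If the count over the real column still exceeds 960 after this normalization, the assumption above does not hold and the anomalous rows need to be inspected directly.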

  19. tournament-chess-games

    • huggingface.co
    Updated Jul 10, 2025
    Cite
    lichess (2025). tournament-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/tournament-chess-games
    Explore at:
    Dataset updated
    Jul 10, 2025
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0): https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Lichess Broadcasts

    This is a dataset of chess games from chess tournaments tracked using Lichess Broadcasts. Lichess Broadcasts show live games as they unfold with new moves arriving in real time. They are built to connect to the live-updating PGN file produced by DGT boards but can work with other sources as well. Broadcasts are organized in "tournaments" and "rounds."

  20. king-of-the-hill-chess-games

    • huggingface.co
    Cite
    lichess, king-of-the-hill-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/king-of-the-hill-chess-games
    Explore at:
    Dataset provided by
    Lichess (http://lichess.org/)
    Authors
    lichess
    License

    https://choosealicense.com/licenses/cc0-1.0/

    Description

    [!CAUTION] This dataset is still a work in progress and some breaking changes might occur. In the meantime, please use https://database.lichess.org/#variant_games

Cite
lichess, standard-chess-games [Dataset]. https://huggingface.co/datasets/Lichess/standard-chess-games

standard-chess-games

Lichess Standard Rated Games

Lichess/standard-chess-games

Explore at:
18 scholarly articles cite this dataset (View in Google Scholar)