21 datasets found
  1. NBA Players

    • kaggle.com
    zip
    Updated Oct 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Justinas Cirtautas (2023). NBA Players [Dataset]. https://www.kaggle.com/datasets/justinas/nba-players-data/discussion
    Explore at:
    zip(577071 bytes)Available download formats
    Dataset updated
    Oct 13, 2023
    Authors
    Justinas Cirtautas
    Description

    Update 2023-10-13: The data now includes 2022 season.

    Update 2022-08-06: The data now includes 2021 season.

    Update 2021-08-02: The data now includes 2020 season and metrics for 2019 have been updated.

    Update 2020-08-03: The data now includes 2017, 2018 and 2019 seasons. Keep in mind that metrics like gp, pts, reb, etc. are not complete for 2019 season, as it is ongoing at the time of upload.

    Context

    As a life-long fan of basketball, I always wanted to combine my enthusiasm for the sport with passion for analytics 🏀📊. So, I utilized the NBA Stats API to pull together this data set. I hope it will prove to be as interesting to work with for you as it has been for me!

    Content

    The data set contains over two decades of data on each player who has been part of an NBA teams' roster. It captures demographic variables such as age, height, weight and place of birth, biographical details like the team played for, draft year and round. In addition, it has basic box score statistics such as games played, average number of points, rebounds, assists, etc.

    The pull initially contained 52 rows of missing data. The gaps have been manually filled using data from Basketball Reference. I am not aware of any other data quality issues.

    Analysis Ideas

    The data set can be used to explore how age/height/weight tendencies have changed over time due to changes in game philosophy and player development strategies. Also, it could be interesting to see how geographically diverse the NBA is and how oversees talents have influenced it. A longitudinal study on players' career arches can also be performed.

  2. NBA Anthropometric

    • kaggle.com
    zip
    Updated Jan 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Timmy (2024). NBA Anthropometric [Dataset]. https://www.kaggle.com/datasets/tymoteuszdobrucki/nba-anthropometric
    Explore at:
    zip(35391 bytes)Available download formats
    Dataset updated
    Jan 21, 2024
    Authors
    Timmy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The National Basketball Association (NBA) is a professional basketball league in North America composed of 30 teams. It is one of the major professional sports leagues in the United States and Canada and is considered the premier professional basketball league in the world.

    The NBA draft combine is a multi-day showcase that takes place every May before the annual NBA draft. At the combine, college basketball players are measured and take medical tests, are interviewed, perform various athletic tests and shooting drills, and play in five-on-five drills for an audience of National Basketball Association (NBA) coaches, general managers, and scouts. Athletes attend by invitation only. An athlete's performance during the combine can affect perception, draft status, salary, and ultimately the player's career.

    This dataset includes the anthropometric measurements collected during Draft Combine events in years 2000-2023. It has been acquired using NBA Stats API. The units have been converted from imperial to metric system (inches to centimeters and pounds to kilograms). The numbers have been rounded to two decimal points.

  3. Birthday Paradox Visitor Data

    • kaggle.com
    zip
    Updated Jan 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Birthday Paradox Visitor Data [Dataset]. https://www.kaggle.com/datasets/thedevastator/birthday-paradox-visitor-data
    Explore at:
    zip(8451 bytes)Available download formats
    Dataset updated
    Jan 22, 2023
    Authors
    The Devastator
    Description

    Birthday Paradox Visitor Data

    Exploring Probability and Patterns of Day of the Week Birthdays

    By data.world's Admin [source]

    About this dataset

    This dataset contains daily visitor-submitted birthdays and associated data from an ongoing experimentation known as the Birthday Paradox. Be enlightened as you learn how many people have chosen the same day of their birthday as yours. Get a better perspective on how this phenomenon varies day-to-day, including recent submissions within the last 24 hours. This experiment is published under the MIT License, giving you access to detailed information behind this perplexing cognitive illusion. Find out now why the probability of two people in the same room having birthday matches is much higher than one might expect!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset provides data on the Birthday Paradox Visitor Experiments. It contains information such as daily visitor-submitted birthdays, the total number of visitors who have submitted birthdays, the total number of visitors who guessed the same day as their birthday, and more. This dataset can be used to analyze patterns in visitor behavior related to the Birthday Paradox Experiment.

    In order to use this dataset effectively and efficiently, it is important to understand its fields and variables:
    - Updated: The date when this data was last updated
    - Count: The total number of visitors who have submitted birthdays
    - Recent: The number of visitors who have submitted birthdays in the last 24 hours
    - binnedDay: The day of the week for a given visitor's birthday submission
    - binnedGuess: The day of week that a given visitor guessed their birthday would fall on 6) Tally: Total number of visitors who guessed same day as their birthday 7) binnedTally: Total number of visitors grouped by guess day

    To begin using this dataset you should first filter your data based on desired criteria such as date range or binnedDay. For instance, if you are interested in analyzing Birthady Paradox Experiment results for Monday submissions only then you can filter your data by binnedDay = 'Monday'. Then further analyze your filtered query by examining other fields such as binnedGuess and comparing it with tally or binnedTally results accordingly. For example if we look at Monday entries above we should compare 'Monday' tallies with 'Tuesday' guesses (or any other weekday). ` Furthermore understanding updates from recent field can also provide interesting insights into user behavior related to Birthady Paradox Experiment -- trackingt recent entries may yield valuable trends over time.

    By exploring various combinations offields available in this dataset users will be ableto gain a better understandingof how user behaviordiffers across different daysofweek both within a singledayandover periodsoftimeaccordingtodifferent criteria providedbythisdataset

    Research Ideas

    • Analyzing the likelihood of whether a person will guess their own birthday correctly.
    • Estimating which day of the week is seeing the most number of visitors submitting their birthdays each day and analyzing how this varies over time.
    • Investigating how likely it is for two people from different regions to have the same birthday by comparing their respective submission rates on each day of the week

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    See the dataset description for more information.

    Columns

    File: data.csv | Column name | Description | |:----------------|:-----------------------------------------------------------------------------------| | updated | The date and time the data was last updated. (DateTime) | | count | The total number of visitor submissions. (Integer) | | recent | The number of visitor submissions in the last 24 hours. (Integer) | | binnedDay | The day of the week the visitor submitted their birthday. (String) | | binnedGuess | The day of the week the visitor guessed their birthday. (String) | | tally | The total number of visitor guesses that matched their actual birthdays. (Integer) | | binnedTally | The day of the week the visitor guessed their birthday correctly. (String) |

    Acknowledgement...

  4. Social Power NBA

    • kaggle.com
    zip
    Updated Aug 1, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Noah Gift (2017). Social Power NBA [Dataset]. https://www.kaggle.com/noahgift/social-power-nba
    Explore at:
    zip(1397766 bytes)Available download formats
    Dataset updated
    Aug 1, 2017
    Authors
    Noah Gift
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Context

    This data set contains combined on-court performance data for NBA players in the 2016-2017 season, alongside salary, Twitter engagement, and Wikipedia traffic data.

    Further information can be found in a series of articles for IBM Developerworks: "Explore valuation and attendance using data science and machine learning" and "Exploring the individual NBA players".

    A talk about this dataset has slides from March, 2018, Strata:

    https://www.slideshare.net/noahgift/social-power-andinfluenceinthenba-89807740?qid=3f9f835a-f3d7-4174-8a8c-c97f9c82e614&v=&b=&from_search=1

    Further reading on this dataset is in the book Pragmatic AI, in Chapter 6 or full book, Pragmatic AI: An introduction to Cloud-based Machine Learning and watch lesson 9 in Essential Machine Learning and AI with Python and Jupyter Notebook

    Followup Items

    Acknowledgement

    Data sources include ESPN, Basketball-Reference, Twitter, Five-ThirtyEight, and Wikipedia. The source code for this dataset (in Python and R) can be found on GitHub. Links to more writing can be found at noahgift.com.

    Inspiration

    • Do NBA fans know more about who the best players are, or do owners?
    • What is the true worth of the social media presence of athletes in the NBA?
  5. NBA regular season TV viewers 2019-2025

    • statista.com
    • abripper.com
    Updated Nov 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). NBA regular season TV viewers 2019-2025 [Dataset]. https://www.statista.com/statistics/289993/nba-number-of-tv-viewers-usa/
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    An average of **** million viewers tuned in to watch NBA regular season games across ABC, ESPN and TNT in the 2024/25 season. This marked a slight decline in the number of viewers from the previous season.

  6. NBA player data by game from 1949 to 2019

    • kaggle.com
    Updated Feb 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haris Beslic (2020). NBA player data by game from 1949 to 2019 [Dataset]. https://www.kaggle.com/harisbeslic/nba-player-data-by-game-from-1949-to-2019
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 21, 2020
    Dataset provided by
    Kaggle
    Authors
    Haris Beslic
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    I wanted to learn web scraping in order to make website for basketball, so I created this dataset as part of my learning. I will try to keep it updated as much as possible.

  7. NBA & WNBA annual salaries in 2024/25

    • statista.com
    Updated Nov 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). NBA & WNBA annual salaries in 2024/25 [Dataset]. https://www.statista.com/statistics/1120680/annual-salaries-nba-wnba/
    Explore at:
    Dataset updated
    Nov 26, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    North America
    Description

    The NBA and WNBA are the two top leagues for basketball in the United States for men and women, respectively. In the NBA, players took home an average annual salary of over ** million U.S. dollars for the 2024/25 season, with the league's minimum salary set at **** million U.S. dollars that year. In comparison, players in the WNBA received an average annual pay of ******* U.S. dollars in the 2025 season, with the highest-earning players in the WNBA receiving around ******* U.S. dollars annually.

  8. Illustrious Careers of NBA Legends ✨🏀

    • kaggle.com
    zip
    Updated May 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alfredo (2024). Illustrious Careers of NBA Legends ✨🏀 [Dataset]. https://www.kaggle.com/datasets/alfredkondoro/exploring-kobe-bryants-nba-journey
    Explore at:
    zip(76645 bytes)Available download formats
    Dataset updated
    May 13, 2024
    Authors
    Alfredo
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Introduction:

    Embark on an enthralling exploration into the illustrious careers of basketball's most iconic figures in the NBA Legends Dataset. This meticulously curated collection chronicles the remarkable odysseys of legendary players, offering intimate glimpses into their unparalleled skills, unwavering determination, and relentless pursuit of excellence. As a tribute to the enduring legacies and profound impacts these legends have had on the game and countless lives, this dataset encapsulates their transcendent influences, both on and off the court.

    Column Descriptions:

    1. season: The NBA season during which the game took place.
    2. date: The date of the game.
    3. age: Kobe Bryant's age at the time of the game.
    4. team_played: The team Kobe Bryant played for during the game.
    5. game_type: The type of game (regular season, playoffs, etc.).
    6. venue: The arena or stadium where the game was held.
    7. opponent: The opposing team.
    8. win_lose: Indicates whether Kobe's team won or lost the game.
    9. point_difference: The difference in points between Kobe's team and the opposing team.
    10. game_started: Whether Kobe started the game or came off the bench.
    11. minutes_played: The total minutes Kobe played in the game.
    12. fieldgoal: The number of field goals Kobe made.
    13. fieldgoal_attempts: The total number of field goal attempts by Kobe.
    14. fieldgoal_percent: Kobe's shooting percentage for field goals.
    15. 3pointers: The number of three-pointers Kobe made.
    16. 3pointers_attempts: The total number of three-point attempts by Kobe.
    17. 3pointers_percent: Kobe's shooting percentage for three-pointers.
    18. freethrows: The number of free throws Kobe made.
    19. freethrows_attempt: The total number of free throw attempts by Kobe.
    20. freethrow_percent: Kobe's shooting percentage for free throws.
    21. offensive_rebounds: The number of offensive rebounds by Kobe.
    22. defensive_rebounds: The number of defensive rebounds by Kobe.
    23. total_rebounds: The total number of rebounds by Kobe.
    24. assists: The number of assists by Kobe.
    25. steals: The number of steals by Kobe.
    26. blocks: The number of blocks by Kobe.
    27. turnovers: The number of turnovers by Kobe.
    28. personal_fouls: The number of personal fouls committed by Kobe.
    29. points: The total points scored by Kobe in the game.

    Influence of NBA Legends:

    The enduring legacies of NBA legends transcend basketball, serving as timeless sources of inspiration for athletes and enthusiasts alike. Their remarkable achievements, unwavering work ethics, and unyielding self-belief epitomize the essence of greatness and resilience. As we delve into the intricacies of their journeys through this dataset, may their indelible spirits continue to inspire and motivate us to pursue excellence in every aspect of life

    Photo by JC Gellidon on Unsplash

  9. Major US Sports Venues Usage and Affiliations

    • kaggle.com
    zip
    Updated Jan 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Major US Sports Venues Usage and Affiliations [Dataset]. https://www.kaggle.com/datasets/thedevastator/major-us-sports-venues-usage-and-affiliations
    Explore at:
    zip(36399 bytes)Available download formats
    Dataset updated
    Jan 15, 2023
    Authors
    The Devastator
    Area covered
    United States
    Description

    Major US Sports Venues Usage and Affiliations

    Team, League, Conference and Population Usage Records

    By Homeland Infrastructure Foundation [source]

    About this dataset

    This dataset provides detailed information on major sport venues, along with their usage and affiliations. It includes data related to the National Association for Stock Car Auto Racing, Indy Racing League, Major League Soccer, Major League Baseball, National Basketball Association, Women's National Basketball Association, National Hockey League, National Football League, PGA Tour, NCAA Division 1 FBS Football, NCAA Division 1 Basketball and thoroughbred horse racing.* This dataset contains columns such as USE (which describes the type of use for the venue), TEAM (the team associated with the venue), LEAGUE (the league associated with the venue) , CONFERENCE (the conference associated with the venue), DIVISION (the division associated with the venue), INST_AFFIL(the institution affiliation associatedwith the venue), TRACK_TYPE(type of track at a specific point in time or over its complete life-cycle) as well as LENGTH_MILEGE ('length of track in milege') ROOF_TYPE(The type of roof covering used at a specific point in time or over its complete life-cycle) and plenty other variables. With this astounding range and quantity of data points -- spanning countries across different continents and leagues -- explore patterns in sports games you never even thought were possible!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    The MajorUS Sports Venues Usage and Affiliations dataset includes data on major sports venues from leagues including National Association for Stock Car Auto Racing (NASCAR), Indy Racing League (IRL), Major League Soccer (MLS), Major League Baseball (MLB), National Basketball Association (NBA), Women's National Basketball Association (WNBA), National Hockey League (NHL), National Football League(NFL), PGA Tour, NCAA Division 1 FBS Football, NCAA Division 1 Basketball, and thoroughbred horse racing. The columns provided include USE_, USE_POP, TEAM, LEAGUE,CONFERENCE,DIVISION ,INST_AFFIL,TRACK_TYPE. LENGTH_MI,ROOF_TYPESTADIUM_SH,`ADDDATAE , USEWEBSITE',and'COMMENTS'.

    The `USE~ column specifies the type of usage of each venue at which point can be college athletics or professional athletics. The corresponding column to this is the ‘USE~POP’ which informs you about how many people are using each venue for a particular sport at a given time. For example if there were 6 NHL games being played that day then USE~ would say “professional Athletics” while USE~POP would state “NNN” reflecting there were NNN people spectating those events collectively: The next column is TEAM which represents what team sponsors or manages each venue or what teams will be playing in them.

     Following on from TEAM is LEAGUE; here you can find out what league each team represents such as MLB, NBA etc… The next three columns CONFERENCE/DIVISION/INST ~ AFFIL provide more specific details as they blur into collegiate level as well where CONFERENCE indicates which conference they belong within their respective division: while INST ~ AFFIL states its affiliated school body e.g.: Southeastern Conference > University of Arkansas Razorbacks . Rounding up our overview these last three columns TRACK ~ TYPE/LENGTH
    

    Research Ideas

    • Analyzing the affiliations and usage of different sports venues to determine which teams or leagues have the most presence across a certain geographic area.
    • Comparing different stadiums within a given conference in terms of their roof type, track length, and stadium shape for optimal design features for new construction projects.
    • Placing sponsorships or advertisements within each sporting arena based on audience size, league popularity, and team affiliation within a given conference or division

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contribut...

  10. Best Ever Basketball Players Stats

    • kaggle.com
    zip
    Updated Feb 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akul Vaishnavi (2024). Best Ever Basketball Players Stats [Dataset]. https://www.kaggle.com/akulvaishnavi/best-ever-basketball-players-stats
    Explore at:
    zip(5722 bytes)Available download formats
    Dataset updated
    Feb 18, 2024
    Authors
    Akul Vaishnavi
    License

    https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/

    Description

    This dataset contains data and statistics for some of the greatest players who have played in the National Basketball Association (NBA). You can use these stats to assess for various aspects for these players - and maybe even find out who is the all-time GOAT of basketball.

    Explaining some statistics to people unfamiliar to Basketball (Assuming points, assists etc. are obvious)

    PER - Player Efficiency Rating - The player efficiency rating (PER) is John Hollinger's all-in-one basketball rating, which attempts to collect or boil down all of a player's contributions into one number. Using a detailed formula, Hollinger developed a system that rates every player's statistical performance.

    EWA - Estimated Wins Added - EWA is similar to PER where it boils down all player contributions into 1 statistic. But it is used in a way to show how many wins are added to a team when that certain player plays on the court

    WS & WS/48 - Win shares & Win shares per 48 - Win Share is a measure that is assigned to players based on their offense, defense, and playing time. WS/48 is win shares per 48 minutes and invented by Justin Kubatko who explains: “A win share is worth one-third of a team win. If a team wins 60 games, there are 180 'Win Shares' to distribute among the players.”

  11. NBA All-Time career blocks leaders

    • kaggle.com
    zip
    Updated Jan 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    kabhishm (2023). NBA All-Time career blocks leaders [Dataset]. https://www.kaggle.com/datasets/kabhishm/list-of-nba-career-blocks-leaders
    Explore at:
    zip(3706 bytes)Available download formats
    Dataset updated
    Jan 13, 2023
    Authors
    kabhishm
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The dataset contains information on players who are all time career leaders in blocks. It includes information like player name, position, teams played, number of seasons played, number of games played, birth place, birth date etc.

  12. NBASeasonStats(2018-20)

    • kaggle.com
    zip
    Updated Feb 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HeeebsInc (2021). NBASeasonStats(2018-20) [Dataset]. https://www.kaggle.com/heeebsinc/nbaseasonstats201820
    Explore at:
    zip(18653301 bytes)Available download formats
    Dataset updated
    Feb 20, 2021
    Authors
    HeeebsInc
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Individual game stats for every NBA player in the 2018-19 and 2019-20 season.

    Data used to develop machine learning algorithm that determines the best fantasy basketball lineups. Follow along in the tutorial and learn how to scrape any NBA season you choose with one Python function.
    If you want to stay up to date with the tutorial and learn how to scrape data and implement ML in daily fantasy basketball, check out my blog.

    Code available on my Github

  13. NBA Warriors vs Celtics ALL Past Games

    • kaggle.com
    zip
    Updated Jun 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Neel Gajare (2022). NBA Warriors vs Celtics ALL Past Games [Dataset]. https://www.kaggle.com/datasets/neelgajare/warriors-vs-celtics-all-past-games
    Explore at:
    zip(3623 bytes)Available download formats
    Dataset updated
    Jun 17, 2022
    Authors
    Neel Gajare
    Description

    Context

    The National Basketball Association is a professional basketball league in North America. The league is composed of 30 teams and is one of the major professional sports leagues in the United States and Canada. The Golden State Warriors and the Boston Celtics will be competing in the NBA Finals in June.

    Content

    • This dataset contains all the past games between the 2 NBA Finals contenders.
    • They have competed against each other for a total of 346 regular season games, with the Celtics winning 208 games and the Warriors winning 138.
    • It contains the score of the game, the date the game was played, which team played in their home stadium (or sometimes in neither of the teams' stadiums which is labelled 'None'), and the streaks (number of consecutive wins) of the 2 teams.

    Purpose

    • This dataset can be useful to see the history between the 2 teams, and how they have fared against each other in the past. It may or may not be relevant to their upcoming matches in the finals...

    Acknowledgement

  14. NBA Play-by-Play 2019-2020 Season

    • kaggle.com
    zip
    Updated Mar 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harry Wang (2024). NBA Play-by-Play 2019-2020 Season [Dataset]. https://www.kaggle.com/datasets/harrywang/nba-play-by-play-2019-2020-season/versions/1
    Explore at:
    zip(5144332 bytes)Available download formats
    Dataset updated
    Mar 12, 2024
    Authors
    Harry Wang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset is revised based on https://www.kaggle.com/datasets/schmadam97/nba-playbyplay-data-20182019

    This dataset offers a comprehensive play-by-play log of NBA games, detailing not only scoring plays but also player movements, fouls, rebounds, and other significant actions within each game.

    • GameType: Indicates the type of game, such as regular season or playoffs.
    • Date: The date on which the game was played.
    • WinningTeam: The abbreviation of the team that won the game.
    • Quarter: The quarter in which the play occurred, ranging from 1 to 4 for regular quarters, and additional entries for overtime periods.
    • SecLeft: Seconds left in the quarter when the play occurred.
    • AwayTeam: The abbreviation of the away team.
    • HomeTeam: The abbreviation of the home team.
    • Shooter: The player who attempted a field goal, identified by name and player ID.
    • ShotType: Describes the type of shot taken (e.g., 2-pt jump shot, 3-pt shot, layup).
    • ShotOutcome: The outcome of the shot (make or miss).
    • PointsAttempted: The point value of the shot attempted.
    • PointsMade: The points scored from the shot if it was successful.
    • FreeThrowShooter: The player who attempted a free throw, identified by name and player ID.
    • ReboundType: Indicates the type of rebound (offensive, defensive).
    • ReboundPlayer: The player who secured the rebound.
    • AssistPlayer: The player who assisted on the shot.
    • FoulType: Describes the type of foul committed.
    • FouledPlayer: The player who was fouled.
    • FoulingPlayer: The player who committed the foul.
    • SubIn: The player who entered the game as a substitute.
    • SubOut: The player who exited the game for a substitute.
    • TimeoutTeam: The team that called a timeout.
    • EnterGame: Indicates a player entering the game, potentially redundant with SubIn.
    • LeaveGame: Indicates a player leaving the game, potentially redundant with SubOut.
    • TurnoverPlayer: The player who committed a turnover.
    • TurnoverType: The type of turnover committed (e.g., bad pass, travel).
    • TurnoverCause: A more detailed cause of the turnover, if applicable.
    • TurnoverCauser: The player who caused the turnover, if applicable.
    • JumpballAwayPlayer: The away team player involved in a jump ball.
    • JumpballHomePlayer: The home team player involved in a jump ball.
    • JumpballPoss: The player who gained possession following the jump ball.
  15. 2017-2018 NBA Regular Season Game Data

    • kaggle.com
    zip
    Updated May 31, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael McFarlane (2018). 2017-2018 NBA Regular Season Game Data [Dataset]. https://www.kaggle.com/datasets/michaelmcfarlane/20172018-nba-regular-season-game-data
    Explore at:
    zip(73756 bytes)Available download formats
    Dataset updated
    May 31, 2018
    Authors
    Michael McFarlane
    Description

    Context

    Game log data of the 2017-2018 NBA Regular Season.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  16. Data to Prove Who's the GOAT Once and For All

    • kaggle.com
    zip
    Updated Aug 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    dahoopster23 (2024). Data to Prove Who's the GOAT Once and For All [Dataset]. https://www.kaggle.com/datasets/dahoopster23/data-to-prove-whos-the-goat-once-and-for-all/discussion
    Explore at:
    zip(3040507 bytes)Available download formats
    Dataset updated
    Aug 15, 2024
    Authors
    dahoopster23
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This is my take on the tiresome topic of who is the NBA GOAT. I've always had my strong opinion, but this takes a deep dive into the stats to prove it with numbers. I took a dataset of the career stats for all NBA players and organized it or narrowed it down to the top 10 players with the most points all time. I then took a look at their individual stats compared to each other. I figured to keep it easy, I only looked at the # of games played, minutes played, total points, rebounds, assists, steals, blocks and turnovers. I then took a deeper look at who most people think is the GOAT, either LeBron James or Michael Jordan. I looked at their total career stats, then per game stats, then per minute stats. But this could still be an unfair comparison because LeBron James has played roughly 500 more games than Michael Jordan did. So I did a what if scenario to show what the stats would look like if the total games and minutes played were reversed, but they each kept their individual stats averages for points, rebounds, assists, steals, blocks and turnovers. The results are impressive! Take a look, and enjoy.

    I was unable to create a notebook with a sample of my work using R because I passed the time limit for the free trial after I exported the data to excel.

  17. NBA Top Shot Moment List

    • kaggle.com
    zip
    Updated Mar 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Isaac Trussell (2021). NBA Top Shot Moment List [Dataset]. https://www.kaggle.com/itrussell15/nba-top-shot-moment-list
    Explore at:
    zip(87682 bytes)Available download formats
    Dataset updated
    Mar 13, 2021
    Authors
    Isaac Trussell
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    NBA Top Shot is just one of the many NFT platforms that is exploding right now. This website serializes highlights of NBA players and puts them on the blockchain to immortalize them forever. The community is similar to the trading card community with collectors and a marketplace to buy and sell your moments.

    Content

    This dataset has all of the moments that are being offered by NBA Topshot to collect. Every card has the play type, date of event, team of the player, the lowest asking price on the moment (from the date that it was published), number of listings on the marketplace (from the date that it was published), the rarity of the moment, the number of moments minted, and whether there will be more of these moments made.

    There is also some interesting data about each series of card that was released.

    I scraped all of this data from 2 different website. - nbatopshot.com - evaluate.market

    I used selenium to gather all of this data since both of them had javascript interfaces.

    Inspiration

    After buying into NBA Topshot, I saw the prices of cards fluctuate drastically. I wanted to understand what drove the insane prices that people were paying for a moment. Even better, I wanted to know if I could tell if a card was undervalued so that I could buy it low and sell it after it had recovered.

    I still don't understand why cards are worth what they are worth, but that is what lead me to collect this data and try to see if I could make sense of it all. It could all just be hype surrounding NFT's and FOMO of not being apart of the next big thing that could be worth millions.

    Hope you find this dataset interesting and I am excited to see what people do/uncover with it. :) Please let me know if there is anything else that would be helpful to include in this dataset. I am hoping to complete some exploratory analysis on it sometime shortly myself.

  18. US state_trends.csv

    • kaggle.com
    zip
    Updated Jan 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ANKITHA SRIDHAR (2024). US state_trends.csv [Dataset]. https://www.kaggle.com/datasets/ankithasridhar/us-state-trends-csv
    Explore at:
    zip(64366 bytes)Available download formats
    Dataset updated
    Jan 18, 2024
    Authors
    ANKITHA SRIDHAR
    Area covered
    United States
    Description

    This dataset, named "state_trends.csv," contains information about different U.S. states. Let's break down the attributes and understand what each column represents:

    1. state: The name of the U.S. state.
    2. state_code: The two-letter postal code abbreviation for the state.
    3. population: The population of the state.
    4. sq_miles: The total land area of the state in square miles.
    5. pop_density: Population density, which is the number of people per square mile.
    6. region: The geographical region of the United States to which the state belongs (e.g., South, West).
    7. psych_region: A description of the psychological region based on personality traits.
    8. psy_reg: A shortened version of the psychological region.
    9. extraversion: A measure of the state's population tendency toward extraversion.
    10. agreeableness: A measure of the state's population tendency toward agreeableness.
    11. conscientiousness: A measure of the state's population tendency toward conscientiousness.
    12. neuroticism: A measure of the state's population tendency toward neuroticism.
    13. openness: A measure of the state's population tendency toward openness.
    14. data_science: A score related to the state's interest or proficiency in the field of data science.
    15. artificial_intelligence: A score related to the state's interest or proficiency in artificial intelligence.
    16. machine_learning: A score related to the state's interest or proficiency in machine learning.
    17. data_analysis: A score related to the state's interest or proficiency in data analysis.
    18. business_intelligence: A score related to the state's interest or proficiency in business intelligence.
    19. spreadsheet: A score related to the state's interest or proficiency in spreadsheet usage.
    20. statistics: A score related to the state's interest or proficiency in statistics.
    21. art: A score related to the state's interest or involvement in the field of art.
    22. dance: A score related to the state's interest or involvement in dance.
    23. museum: A score related to the state's interest or presence of museums.
    24. basketball: A score related to the state's interest or involvement in basketball.
    25. football: A score related to the state's interest or involvement in football.
    26. baseball: A score related to the state's interest or involvement in baseball.
    27. soccer: A score related to the state's interest or involvement in soccer.
    28. hockey: A score related to the state's interest or involvement in hockey.
    29. has_nba: Indicates whether the state has a National Basketball Association (NBA) team (Yes/No).
    30. has_nfl: Indicates whether the state has a National Football League (NFL) team (Yes/No).
    31. has_mlb: Indicates whether the state has a Major League Baseball (MLB) team (Yes/No).
    32. has_mls: Indicates whether the state has a Major League Soccer (MLS) team (Yes/No).
    33. has_nhl: Indicates whether the state has a National Hockey League (NHL) team (Yes/No).
    34. has_any: Indicates whether the state has any of the mentioned professional sports teams (Yes/No).

    In summary, this dataset provides a variety of information about U.S. states, including demographic data, geographical region, psychological region, personality traits, and scores related to interests or proficiencies in various fields such as data science, art, and sports.

  19. NBA Writer Rank

    • kaggle.com
    zip
    Updated Oct 3, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aaron Miles (2017). NBA Writer Rank [Dataset]. https://www.kaggle.com/amiles/nba-writer-rank
    Explore at:
    zip(6638550 bytes)Available download formats
    Dataset updated
    Oct 3, 2017
    Authors
    Aaron Miles
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    NBARank is an annual preseason tradition at basketball sites like ESPN and Slam. Aggrieved at some of the rankings, CJ McCollum suggested that writers get ranked instead. I set up an allourideas.com survey to do exactly this and let twitter decide who the best writers are.

  20. Top 10 Highest-Paid Athletes 2011 - 2021

    • kaggle.com
    zip
    Updated Jan 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dimitris Angelides (2022). Top 10 Highest-Paid Athletes 2011 - 2021 [Dataset]. https://www.kaggle.com/datasets/dimitrisangelide/top-10-highestpaid-athletes-tennis-nba-soccer
    Explore at:
    zip(21962 bytes)Available download formats
    Dataset updated
    Jan 3, 2022
    Authors
    Dimitris Angelides
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Context

    Forbes is not just one of the most popular business magazines!! It contains countless articles on numerous subjects (e.g., business, investing, technology, entrepreneurship, etc.), reporting valuable data and insights.

    For instance, Forbes publishes annual lists of wealthy people reporting their worth such as "Forbes 400" and "Forbes World's Billionaires list".

    Athletes are not an exception, and every year lists are published for the top highest paid individuals.

    Content

    The data are scrapped manually from Forbes articles listing the top 10 highest-paid athletes in tennis, NBA, and soccer.

    Athletes can have multiple sources of income. • Team sports athletes earn a salary paid by their team whereas individual sports athletes compete in tournaments for prize money (such as tennis players). • Most of the time, brands are paying athletes to promote their products (on and off the court) as a marketing promotional strategy to reach a wider target audience and boost their sales / profit.

    The dataset contains 11 years of data starting from 2011.

    Source

    Forbes official website: https://www.forbes.com/ (dataset last updated on 3rd of January 2022)

    Inspiration

    • Which sport rewarded its athletes the most in each year? • Is there a trend across years for the total earnings of the top 10 highest-paid athletes of each sport? • Does this trend change when looking into salaries (or prize money) and endorsements separately? • Which country 'earns' the most out of those three sports each year?

    These are examples of interesting questions that could be answered by analysing this dataset.

    If you are interested, please have a look at the Tableau dashboard that I have created to help answer the above questions, and report some of my insights. Tableau dashboard: https://public.tableau.com/views/AthelesSaleries/SportsEarningsAnalysis?:language=en-US&publish=yes&:display_count=n&:origin=viz_share_link

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Justinas Cirtautas (2023). NBA Players [Dataset]. https://www.kaggle.com/datasets/justinas/nba-players-data/discussion
Organization logo

NBA Players

Biometric, biographic and basic box score stats from 1996 to 2022 season

Explore at:
zip(577071 bytes)Available download formats
Dataset updated
Oct 13, 2023
Authors
Justinas Cirtautas
Description

Update 2023-10-13: The data now includes 2022 season.

Update 2022-08-06: The data now includes 2021 season.

Update 2021-08-02: The data now includes 2020 season and metrics for 2019 have been updated.

Update 2020-08-03: The data now includes 2017, 2018 and 2019 seasons. Keep in mind that metrics like gp, pts, reb, etc. are not complete for 2019 season, as it is ongoing at the time of upload.

Context

As a life-long fan of basketball, I always wanted to combine my enthusiasm for the sport with passion for analytics 🏀📊. So, I utilized the NBA Stats API to pull together this data set. I hope it will prove to be as interesting to work with for you as it has been for me!

Content

The data set contains over two decades of data on each player who has been part of an NBA teams' roster. It captures demographic variables such as age, height, weight and place of birth, biographical details like the team played for, draft year and round. In addition, it has basic box score statistics such as games played, average number of points, rebounds, assists, etc.

The pull initially contained 52 rows of missing data. The gaps have been manually filled using data from Basketball Reference. I am not aware of any other data quality issues.

Analysis Ideas

The data set can be used to explore how age/height/weight tendencies have changed over time due to changes in game philosophy and player development strategies. Also, it could be interesting to see how geographically diverse the NBA is and how oversees talents have influenced it. A longitudinal study on players' career arches can also be performed.

Search
Clear search
Close search
Google apps
Main menu