https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I wanted to learn web scraping in order to make website for basketball, so I created this dataset as part of my learning. I will try to keep it updated as much as possible.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This data set contains combined on-court performance data for NBA players in the 2016-2017 season, alongside salary, Twitter engagement, and Wikipedia traffic data.
Further information can be found in a series of articles for IBM Developerworks: "Explore valuation and attendance using data science and machine learning" and "Exploring the individual NBA players".
A talk about this dataset has slides from March, 2018, Strata:
Further reading on this dataset is in the book Pragmatic AI, in Chapter 6 or full book, Pragmatic AI: An introduction to Cloud-based Machine Learning and watch lesson 9 in Essential Machine Learning and AI with Python and Jupyter Notebook
You can watch a breakdown of using cluster analysis on the Pragmatic AI YouTube channel
Learn to deploy a Kaggle project into a production Machine Learning sklearn + flask + container by reading Python for Devops: Learn Ruthlessly Effective Automation, Chapter 14: MLOps and Machine learning engineering
Use social media to predict a winning season with this notebook: https://github.com/noahgift/core-stats-datascience/blob/master/Lesson2_7_Trends_Supervized_Learning.ipynb
Learn to use the cloud for data analysis.
Data sources include ESPN, Basketball-Reference, Twitter, Five-ThirtyEight, and Wikipedia. The source code for this dataset (in Python and R) can be found on GitHub. Links to more writing can be found at noahgift.com.
The National Basketball Association has one of the highest percentages of African American players from the big four professional sports leagues in North America. In 2023, approximately **** percent of NBA players were African American. Meanwhile, ethnically white players constituted a **** percent share of all NBA players that year. After the WNBA and NBA, the National Football League had the largest share of African Americans in a professional sports league in North America. How do other roles in the NBA compare? When it comes to African American representation in the NBA, no other role in the NBA is as well represented by African Americans as players. Meanwhile, on the opposite end of the scale, less than **** percent of team governors in the NBA were African American in 2023. During the 2022/23 season, the role with the second-highest share of African Americans was head coach, with a share of ** percent. That season, the number of African American head coaches in the NBA exceeded the number of white head coaches for the first time. African Americans in the NFL In 2022, the greatest share of players by ethnicity in the NFL were African American, with more than half of all NFL players falling within this group. The representation of African Americans in American Football extended beyond the playing field, with **** percent of NFL assistant coaches being African American in 2022 as well. However, positions such as vice presidents and head coaches were less representative of the African American population, as less than ** percent of the individuals fulfilling these roles in 2022 were African American.
As of 2024, the largest luxury tax bill footed by a team in the NBA came in the 2023/24 season, when the Golden State Warriors were taxed 176.9 million U.S. dollars by the league. The Warriors also held the other top-three spots, bringing their overall luxury tax payments from 2021/22 to 2023/24 to 510.9 million U.S. dollars.
We examine whether social data can be used to predict how members of Major League Baseball (MLB) and members of the National Basketball Association (NBA) transition between teams during their career. We find that incorporating social data into various machine learning algorithms substantially improves the algorithms' ability to correctly determine these transitions in the NBA but only marginally in MLB. We also measure the extent to which player performance and team fitness data can be used to predict transitions between teams. This data, however, only slightly improves our predictions for players for both basketball and baseball players. We also consider whether social, performance, and team fitness data can be used to infer past transitions. Here we find that social data significantly improves our inference accuracy in both the NBA and MLB but player performance and team fitness data again does little to improve this score.
An average of **** million viewers tuned in to watch NBA regular season games across ABC, ESPN and TNT in the 2024/25 season. This marked a slight decline in the number of viewers from the previous season.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This is my take on the tiresome topic of who is the NBA GOAT. I've always had my strong opinion, but this takes a deep dive into the stats to prove it with numbers. I took a dataset of the career stats for all NBA players and organized it or narrowed it down to the top 10 players with the most points all time. I then took a look at their individual stats compared to each other. I figured to keep it easy, I only looked at the # of games played, minutes played, total points, rebounds, assists, steals, blocks and turnovers. I then took a deeper look at who most people think is the GOAT, either LeBron James or Michael Jordan. I looked at their total career stats, then per game stats, then per minute stats. But this could still be an unfair comparison because LeBron James has played roughly 500 more games than Michael Jordan did. So I did a what if scenario to show what the stats would look like if the total games and minutes played were reversed, but they each kept their individual stats averages for points, rebounds, assists, steals, blocks and turnovers. The results are impressive! Take a look, and enjoy.
I was unable to create a notebook with a sample of my work using R because I passed the time limit for the free trial after I exported the data to excel.
Data was collected using a mixed qualitative-quantitative web survey, which was administered using E-lomake survey software. The survey included 20 questions of which 3 included multiple statements on a 5-point Likert-like scale. 12 of the questions were open-ended and the focus of both data collection and analysis was on qualitative understanding rather than quantification. The respondents were asked to describe and rate their experiences of the development-led archaeology process, usefulness and use of archaeological information, and to indicate the branch and size of the organisation they represented. The sample is essentially a convenience sample of Finnish and Swedish organisations, which contracted archaeological investigations in 2013-2014. For Sweden, the names of the organisations were harvested semi- automatically using custom-written php-scripts from the PDF reports covering the chosen timeframe and available at the Samla database of the NHB (samla.raa.se). For Finland, the same data was collected from Muinaisjäännösten hankerekisteri (engl. Antiquities Project Registry) database (http://kulttuuriymparisto. nba.fi) maintained by the National Board of Antiquities of Finland. Email addresses of the organisations and, as possible, individuals working at relevant parts of the organisation (depending on the type of the organisation, generally planning, development and property management related functions) were collected using public online sources, including the websites of the organisations. Invitations were sent during the summer and autumn of 2015 to 241 Swedish organisations and 131 Finnish organisations. One reminder to participate in the survey was submitted to all organisations. Nine invitations were returned as definitely undeliverable. In total 34 organisations participated in the survey, 14 from Finland and 20 from Sweden. Twenty of the 34 respondents classified their organisations as municipal which corresponds relatively well with the distribution of the organisations in the original population (126 of the 241 Swedish and 87 of the 131 Finnish organisations were municipalities, excluding municipal e.g. energy and water supply companies). Five of the 34 respondents represented construction companies, 3 organisations in the energy branch, 3 regional and 2 national public bodies. One organisation from the property development, mining and environmental consulting branches participated in the survey. Amounts of employess varied - eight of 34 organisations had less than 10, 14 of the 34 organizations had between 11 and 100, five of the 34 had between 101 and 999, and seven of the 34 organizations had over 1000 employees. Especially for Sweden, it is important to note that the collection of reports is not complete, partly because only a part of the available reports contained information on the organisations who had contracted and/or financed investigations. It is also possible that the semi-automated harvesting process failed to find a small number of organisations. In addition, it is likely that in a number of organisations, the invitation did not reach the relevant respondents even if the invitation contained a request to forward it to a colleague if the recipient considered herself to be unable to take the survey. Therefore, even if the sampling approach was designed to reach a reasonable level of systematicity, coverage and comparability, the lack of a comprehensive project or central report registry in Sweden, technical issues, variation in the reporting of the contracting organisations, and the varying specificity of contact details mean that the final sample is closer to a convenience sample than a systematic cross section. (Description from Huvila, I. Land developers and archaeological information. Open Information Science, 2017, 1(1), 71-90) The dataset was originally published in DiVA and moved to SND in 2024. Se engelsk version av denna katalogpost för beskrivning. Datasetet har ursprungligen publicerats i DiVA och flyttades över till SND 2024.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global sporting events market is experiencing significant growth, with a market size that was valued at approximately $600 billion in 2023 and is projected to reach around $900 billion by 2032, reflecting a CAGR of 4.5%. This robust growth is driven by an array of factors including the increasing popularity of sports among a global audience, advancements in digital technology that enhance fan engagement, and the rising investment in sports infrastructure across various regions. The expanding reach of sports through digital platforms and live streaming services has made sporting events more accessible than ever before, fostering a larger fan base and, consequently, greater revenue opportunities.
The growing globalization of sports, where international events are becoming more frequent and accessible, significantly propels market growth. Major sporting events like the Olympics, FIFA World Cup, and regional tournaments attract massive global audiences, translating into substantial revenue through diverse streams such as broadcasting rights and sponsorships. Technological advancements are also revolutionizing the way audiences engage with sports, with innovations like virtual and augmented reality enhancing the viewing experience. Additionally, the integration of data analytics in sports helps in improving team performances and in tailoring strategies to enhance fan engagement, further driving the market expansion.
Another substantial growth factor is the rising investment in sports infrastructure. Governments and private entities are increasingly investing in the development of new stadiums and renovation of existing facilities to attract major events. This trend is particularly evident in developing countries, where hosting international sports events is seen as a means to boost tourism and enhance the country's global image. Moreover, the proliferation of sports leagues and tournaments in these regions is generating numerous opportunities for market participants. Such investments not only improve the quality of sporting events but also contribute to the economic development of the host locations, thereby driving the overall market growth.
The commercialization of sports is an essential factor influencing the market's growth trajectory. Sports have become a lucrative industry, attracting investments from corporations looking to tap into the extensive visibility that sports offer. Sponsorship and advertising opportunities remain abundant, with brands keen to associate their image with popular sports events to reach a broad audience. This commercial interest has led to increased monetization opportunities within the sports industry, which is further propelled by the emergence of eSports as a competitive and financially rewarding segment. The inclusion of eSports in mainstream sporting events is expanding the traditional sports market, engaging younger demographics and tech-savvy audiences.
Regionally, North America continues to dominate the sporting events market owing to its established sporting culture and infrastructure. The region boasts several major sports leagues like the NFL, NBA, and MLB, which attract large audiences and generate substantial revenue from broadcasting and sponsorship deals. The Asia Pacific region, however, is emerging as a significant market player, driven by a growing middle class and increasing disposable income. The enthusiasm for sports such as cricket, football, and basketball is on the rise, along with the popularity of eSports, positioning the region as a key growth area. Europe continues to hold a strong position thanks to its rich sporting history and hosting of major events like the UEFA Champions League, while Latin America and the Middle East & Africa are showing promising growth due to rising sports participation and infrastructure development.
The sporting events market is broadly categorized into team sports, individual sports, and eSports. Team sports, which include popular games like football, basketball, and cricket, dominate the market owing to their massive global fan base and extensive media coverage. These sports have a long-standing tradition and enjoy immense popularity across various continents, contributing significantly to the revenue generated from ticket sales, broadcasting, and sponsorships. The collaborative nature of team sports fosters community spirit and regional loyalty, which translates into sustained audience engagement and financial support from local businesses and global corporations alike.
Individual sports, such as tennis, golf, and ath
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Over the past few years, the US has seen a significant shift in the political sentiment surrounding sports betting and gambling, following the Supreme Court overruling a law which forbade states from...
The NBA and WNBA are the two top leagues for basketball in the United States for men and women, respectively. In the NBA, players took home an average annual salary of over ** million U.S. dollars for the 2024/25 season, with the league's minimum salary set at **** million U.S. dollars that year. In comparison, players in the WNBA received an average annual pay of ******* U.S. dollars in the 2025 season, with the highest-earning players in the WNBA receiving around ******* U.S. dollars annually.
As of April 2024, the National Basketball Association’s (NBA) official X (formerly Twitter) page had ***** million followers. Meanwhile, the NBA’s Facebook page had ***** million fans. Both social media followings have experienced near-constant growth since September 2012; however, as of April 2024, the league’s Facebook fan base has decreased. Conversely, the NBA’s X following has increased by approximately *** million followers every six months since March 2021. Which NBA team has the largest social media following? With approximately ** million followers, the Los Angeles Lakers were the most-followed NBA franchise on X as of 2024. Having last won the Finals in 2020, the Lakers are also tied with the Boston Celtics as the NBA team with the most championships. A key component of their Finals victory in 2020 was LeBron James, who won the Finals MVP that season. James’ status as a modern-day sporting icon is reflected in his sponsorship deals, which made him the NBA’s highest-paid player in 2023. Do NFL teams have larger social media presences? In what is perhaps a reflection of the NBA’s global appeal rather than its popularity in North America, the Lakers’ Twitter following was roughly ***** times larger than that of the NFL team with the most X followers, the New England Patriots, in 2023. Looking outside of North America, NBA franchises are, however, still far behind soccer teams such as Manchester United, whose X following of over ** million made it the most followed soccer club in the Premier League.
In the 2023/24 season, the Golden State Warriors generated the most revenue from the National Basketball Association franchises. Specifically, the Golden State Warriors generated 800 million U.S. dollars in revenue by the end of the season.
The 2024 Women's National Basketball Association (WNBA) regular season games were watched by an average of ******* viewers across media platforms in the United States. This was a significant increase from the previous year and made the **** season the league’s highest-viewed regular season in the last 24 years.
The statistic depicts wholesale sales of basketball team uniforms in the United States from 2007 to 2024. In 2024, U.S. wholesale sales of basketball team uniforms amounted to about *** million U.S. dollars.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I wanted to learn web scraping in order to make website for basketball, so I created this dataset as part of my learning. I will try to keep it updated as much as possible.