100+ datasets found
  1. Compare Baseball Player Statistics using Visualiza

    • kaggle.com
    zip
    Updated Sep 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdelaziz Sami (2024). Compare Baseball Player Statistics using Visualiza [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/compare-baseball-player-statistics-using-visualiza
    Explore at:
    zip(1030978 bytes)Available download formats
    Dataset updated
    Sep 28, 2024
    Authors
    Abdelaziz Sami
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    To compare baseball player statistics effectively using visualization, we can create some insightful plots. Below are the steps to accomplish this in Python using libraries like Pandas and Matplotlib or Seaborn.

    1. Load the Data

    First, we need to load the judge.csv file into a DataFrame. This will allow us to manipulate and analyze the data easily.

    2. Explore the Data

    Before creating visualizations, it’s good to understand the data structure and identify the columns we want to compare. The relevant columns in your data include pitch_type, release_speed, game_date, and events.

    3. Visualization

    We can create various visualizations, such as: - A bar chart to compare the average release speed of different pitch types. - A line plot to visualize trends over time based on game dates. - A scatter plot to analyze the relationship between release speed and the outcome of the pitches (e.g., strikeouts, home runs).

    Example Code

    Here is a sample code to demonstrate how to create these visualizations using Matplotlib and Seaborn:

    import pandas as pd
    import matplotlib.pyplot as plt
    import seaborn as sns
    
    # Load the data
    df = pd.read_csv('judge.csv')
    
    # Display the first few rows of the dataframe
    print(df.head())
    
    # Set the style of seaborn
    sns.set(style="whitegrid")
    
    # 1. Average Release Speed by Pitch Type
    plt.figure(figsize=(12, 6))
    avg_speed = df.groupby('pitch_type')['release_speed'].mean().sort_values()
    sns.barplot(x=avg_speed.values, y=avg_speed.index, palette="viridis")
    plt.title('Average Release Speed by Pitch Type')
    plt.xlabel('Average Release Speed (mph)')
    plt.ylabel('Pitch Type')
    plt.show()
    
    # 2. Trends in Release Speed Over Time
    # First, convert the 'game_date' to datetime
    df['game_date'] = pd.to_datetime(df['game_date'])
    
    plt.figure(figsize=(14, 7))
    sns.lineplot(data=df, x='game_date', y='release_speed', estimator='mean', ci=None)
    plt.title('Trends in Release Speed Over Time')
    plt.xlabel('Game Date')
    plt.ylabel('Average Release Speed (mph)')
    plt.xticks(rotation=45)
    plt.tight_layout()
    plt.show()
    
    # 3. Scatter Plot of Release Speed vs. Events
    plt.figure(figsize=(12, 6))
    sns.scatterplot(data=df, x='release_speed', y='events', hue='pitch_type', alpha=0.7)
    plt.title('Release Speed vs. Events')
    plt.xlabel('Release Speed (mph)')
    plt.ylabel('Event Type')
    plt.legend(title='Pitch Type', bbox_to_anchor=(1.05, 1), loc='upper left')
    plt.show()
    

    Explanation of the Code

    • Data Loading: The CSV file is loaded into a Pandas DataFrame.
    • Average Release Speed: A bar chart shows the average release speed for each pitch type.
    • Trends Over Time: A line plot illustrates the trend in release speed over time, which can indicate changes in performance or strategy.
    • Scatter Plot: A scatter plot visualizes the relationship between release speed and different events, providing insight into performance outcomes.

    Conclusion

    These visualizations will help you compare player statistics in a meaningful way. You can customize the plots further based on your specific needs, such as filtering data for specific players or seasons. If you have any specific comparisons in mind or additional data to visualize, let me know!

  2. i

    Grant Giving Statistics for Pandas Resource Network Inc

    • instrumentl.com
    Updated Feb 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Grant Giving Statistics for Pandas Resource Network Inc [Dataset]. https://www.instrumentl.com/990-report/pandas-resource-network-inc
    Explore at:
    Dataset updated
    Feb 19, 2023
    Variables measured
    Total Assets
    Description

    Financial overview and grant giving statistics of Pandas Resource Network Inc

  3. i

    Grant Giving Statistics for Pandas International

    • instrumentl.com
    Updated Dec 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Grant Giving Statistics for Pandas International [Dataset]. https://www.instrumentl.com/990-report/pandas-international
    Explore at:
    Dataset updated
    Dec 26, 2021
    Variables measured
    Total Assets, Total Giving, Average Grant Amount
    Description

    Financial overview and grant giving statistics of Pandas International

  4. stats1_practice

    • kaggle.com
    zip
    Updated Jun 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    J H Lee (2021). stats1_practice [Dataset]. https://www.kaggle.com/goen01/stats-ex3
    Explore at:
    zip(19866 bytes)Available download formats
    Dataset updated
    Jun 17, 2021
    Authors
    J H Lee
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    For statistics data analysis such as correlation, linear regression and so on.

  5. Panda's YouTube Channel Statistics

    • vidiq.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vidIQ, Panda's YouTube Channel Statistics [Dataset]. https://vidiq.com/youtube-stats/channel/UCj0sKQk9qjiIfNH22ZBWpxw/
    Explore at:
    Dataset authored and provided by
    vidIQ
    Time period covered
    Nov 1, 2025 - Nov 28, 2025
    Area covered
    SE
    Variables measured
    subscribers, video count, video views, engagement rate, upload frequency, estimated earnings
    Description

    Comprehensive YouTube channel statistics for Panda, featuring 13,300,000 subscribers and 3,141,620,262 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Gaming category and is based in SE. Track 2,112 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.

  6. Dataset Nba stats

    • kaggle.com
    zip
    Updated Jun 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thomas Brescou (2023). Dataset Nba stats [Dataset]. https://www.kaggle.com/datasets/thomasbrescou/dataset-nba-stats
    Explore at:
    zip(11234569 bytes)Available download formats
    Dataset updated
    Jun 13, 2023
    Authors
    Thomas Brescou
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    NBA Statistics Repository

    Welcome to the NBA Statistics Repository for teams and players. This repository contains a rich and diverse dataset spanning from 1996 to 2023, drawn from NBA game statistics. It's ideal for data analysts, basketball fans, researchers, and anyone interested in the detailed numbers behind the sport.

    Repo Structure

    This repository contains a series of CSV files detailing the performances of teams and players from 1996 to 2023. A list of these files is provided below:

    1. player_index.csv: An index of all players with general information.
    2. player_stats_advanced_po.csv and player_stats_advanced_rs.csv: Advanced statistics for players during playoffs (po) and regular season (rs).
    3. player_stats_defense_po.csv and player_stats_defense_rs.csv: Defensive statistics for players during the playoffs and regular season.
    4. player_stats_misc_po.csv and player_stats_misc_rs.csv: Miscellaneous player statistics for the playoffs and regular season.
    5. player_stats_scoring_po.csv and player_stats_scoring_rs.csv: Scoring statistics for players during the playoffs and regular season.
    6. player_stats_traditional_po.csv and player_stats_traditionnal_rs.csv: Traditional player statistics during the playoffs and regular season.
    7. player_stats_usage_po.csv and player_stats_usage_rs.csv: Player usage statistics during the playoffs and regular season.
    8. team_stats_advanced_po.csv and team_stats_advanced_rs.csv: Advanced team statistics during the playoffs and regular season.
    9. team_stats_defense_po.csv and team_stats_defense_rs.csv: Defensive team statistics during the playoffs and regular season.
    10. team_stats_four_factors_po.csv and team_stats_four_factors_rs.csv: Four factors team statistics during the playoffs and regular season.
    11. team_stats_misc_po.csv and team_stats_misc_rs.csv: Miscellaneous team statistics during the playoffs and regular season.
    12. team_stats_opponent_po.csv and team_stats_opponent_rs.csv: Team opponent statistics during the playoffs and regular season.
    13. team_stats_scoring_po.csv and team_stats_scoring_rs.csv: Scoring team statistics during the playoffs and regular season.
    14. team_stats_traditional_po.csv and team_stats_traditional_rs.csv: Traditional team statistics during the playoffs and regular season.

    How to Use this Data

    To use this data, simply clone this repository and use a software capable of reading CSV files, such as Excel, R, Python (with pandas), etc.

    Contributions

    Contributions to this repo are welcome. If you have additional data to add or corrections to make, please feel free to open a pull request.

    License

    These data are released under the MIT License. See the LICENSE file for more information.

  7. All Pandas Operations Reference

    • kaggle.com
    zip
    Updated Nov 8, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reuben (2019). All Pandas Operations Reference [Dataset]. https://www.kaggle.com/reubenn/all-pandas-operations-reference
    Explore at:
    zip(10449 bytes)Available download formats
    Dataset updated
    Nov 8, 2019
    Authors
    Reuben
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Pandas is a very useful library, probably the most useful for data munging in Python. This notebook is an attempt to collate all pandas dataframes operations that a data scientist might use.

    Content

    You'll see how to create dataframes, read in files (even ones with anomalies), check out descriptive stats on columns, filter on different values and in different ways as well as perform some of the more oft-used operations

    Acknowledgements

    A big "thank you" to Data School. You'll find plenty of notebooks and videos here: https://github.com/justmarkham/pandas-videos

  8. i

    Grant Giving Statistics for Southeastern Pans-Pandas Association Inc.

    • instrumentl.com
    Updated Jun 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Grant Giving Statistics for Southeastern Pans-Pandas Association Inc. [Dataset]. https://www.instrumentl.com/990-report/southeastern-pans-pandas-association-inc
    Explore at:
    Dataset updated
    Jun 19, 2023
    Variables measured
    Total Assets, Total Giving
    Description

    Financial overview and grant giving statistics of Southeastern Pans-Pandas Association Inc.

  9. Python frameworks used in data science 2021

    • statista.com
    Updated Jun 15, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Python frameworks used in data science 2021 [Dataset]. https://www.statista.com/statistics/1338424/python-use-frameworks-data-science/
    Explore at:
    Dataset updated
    Jun 15, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Oct 2021 - Dec 2021
    Area covered
    Worldwide
    Description

    Python is one of the most popular programming languages among data scientists, partly due to its varied packages and capabilities. In 2021, Numpy and Pandas were the most used Python frameworks for data science, with a ** percent and ** percent share respectively.

  10. UK_Flight_Data Statistics_2018

    • kaggle.com
    zip
    Updated Jan 29, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ferhat Culfaz (2019). UK_Flight_Data Statistics_2018 [Dataset]. https://www.kaggle.com/ferhat00/uk-flight-stats-2018
    Explore at:
    zip(103089 bytes)Available download formats
    Dataset updated
    Jan 29, 2019
    Authors
    Ferhat Culfaz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    United Kingdom
    Description

    An analysis of the flight punctuality statistics using pandas and seaborn. Source data from: https://www.caa.co.uk/Data-and-analysis/UK-aviation-market/Flight-reliability/Datasets/Punctuality-data/Punctuality-statistics-2018/

    Open the csv into a pandas dataframe and analyse using Seaborn.

  11. Panda Gaming's YouTube Channel Statistics

    • vidiq.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vidIQ, Panda Gaming's YouTube Channel Statistics [Dataset]. https://vidiq.com/youtube-stats/channel/UCpcuN4m4C69XMzGd8QypFYg/
    Explore at:
    Dataset authored and provided by
    vidIQ
    Time period covered
    Dec 1, 2025 - Dec 2, 2025
    Area covered
    PL, YouTube
    Variables measured
    subscribers, video count, video views, engagement rate, upload frequency, estimated earnings
    Description

    Comprehensive YouTube channel statistics for Panda Gaming, featuring 370,000 subscribers and 71,502,594 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Gaming category and is based in PL. Track 2,106 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.

  12. Crafty Panda's YouTube Channel Statistics

    • vidiq.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vidIQ, Crafty Panda's YouTube Channel Statistics [Dataset]. https://vidiq.com/youtube-stats/channel/UC03RvJoIhm_fMwlUpm9ZvFw/
    Explore at:
    Dataset authored and provided by
    vidIQ
    Time period covered
    Dec 1, 2025 - Dec 2, 2025
    Area covered
    US, YouTube
    Variables measured
    subscribers, video count, video views, engagement rate, upload frequency, estimated earnings
    Description

    Comprehensive YouTube channel statistics for Crafty Panda, featuring 19,100,000 subscribers and 494,412,352 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Lifestyle category and is based in US. Track 171 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.

  13. i

    Grant Giving Statistics for Pandas Network-Orgnon-Profit to Cure Auto...

    • instrumentl.com
    Updated Oct 17, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Grant Giving Statistics for Pandas Network-Orgnon-Profit to Cure Auto Neuropsychiatric Syndrom [Dataset]. https://www.instrumentl.com/990-report/pandas-networkorg-a-non-profit-to-cure-pediatric-acute-neuropsychiatric
    Explore at:
    Dataset updated
    Oct 17, 2021
    Variables measured
    Total Assets, Total Giving, Average Grant Amount
    Description

    Financial overview and grant giving statistics of Pandas Network-Orgnon-Profit to Cure Auto Neuropsychiatric Syndrom

  14. t

    Tibia Player Statistics Dataset

    • tibia-statistic.com
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TibiaStatistic (2025). Tibia Player Statistics Dataset [Dataset]. https://www.tibia-statistic.com/statistics/players/pandas%20pet
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    TibiaStatistic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2024 - Present
    Area covered
    Tibia Game Worlds
    Description

    Detailed online statistics for player Pandas Pet from world Mystera. View daily activity and session history.

  15. Lost Panda's YouTube Channel Statistics

    • vidiq.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vidIQ, Lost Panda's YouTube Channel Statistics [Dataset]. https://vidiq.com/youtube-stats/channel/UCyh5g11KbG_YdbRw1ktAJqA/
    Explore at:
    Dataset authored and provided by
    vidIQ
    Time period covered
    Nov 1, 2025 - Nov 26, 2025
    Area covered
    GB
    Variables measured
    subscribers, video count, video views, engagement rate, upload frequency, estimated earnings
    Description

    Comprehensive YouTube channel statistics for Lost Panda, featuring 1,160,000 subscribers and 1,014,676,950 total views. This dataset includes detailed performance metrics such as subscriber growth, video views, engagement rates, and estimated revenue. The channel operates in the Music category and is based in GB. Track 1,589 videos with daily and monthly performance data, including view counts, subscriber changes, and earnings estimates. Analyze growth trends, engagement patterns, and compare performance against similar channels in the same category.

  16. H

    Daily Statistics for Discharge at USGS 09380000 Colorado River at Lee’s...

    • hydroshare.org
    • search.dataone.org
    zip
    Updated Dec 5, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amber Jones (2018). Daily Statistics for Discharge at USGS 09380000 Colorado River at Lee’s Ferry, AZ: Jupyter Notebook [Dataset]. https://www.hydroshare.org/resource/63ef4e48947c40898c56c0e2ed9c3fc5
    Explore at:
    zip(5.3 KB)Available download formats
    Dataset updated
    Dec 5, 2018
    Dataset provided by
    HydroShare
    Authors
    Amber Jones
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Aug 1, 2018 - Nov 26, 2018
    Area covered
    Description

    This resource contains a Jupyter Notebook that uses Python to access and visualize data for the USGS flow gage on the Colorado River at Lee’s Ferry, AZ (09380000). This site monitors water quantity and quality for water released from Glen Canyon Dam that then flows through the Grand Canyon. To call these services in Python, the suds-py3 package was used. Using this package, a “GetValuesObject” request, as defined by WaterOneFlow, was passed to the server using inputs for the web service url, site code, variable code, and dates of interest. For this case, 15-minute discharge from August 1, 2018 to the current date was used. The web service returned an object from which the dates and the data values were obtained, as well as the site name. The Python libraries Pandas and Matplotlib were used to manipulate and view the results. The time series data were converted to lists and then to a Pandas series object. Using the “resample” function of Pandas, values for mean, minimum, and maximum were determined on a daily basis from the 15-minute data. Using Matplotlib, a figure object was created to which Pandas series objects were added using the Pandas plot method. The daily mean, minimum, maximum, and the 15-minute flow values were added to illustrate the differences in the daily ranges of data.

  17. NBA History | Seasonal Data 1995-2023

    • kaggle.com
    zip
    Updated Oct 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bendik Flåt Aas (2023). NBA History | Seasonal Data 1995-2023 [Dataset]. https://www.kaggle.com/datasets/bendikfltaas/nba-history-seasonal-data-1995-2023/data
    Explore at:
    zip(1818612 bytes)Available download formats
    Dataset updated
    Oct 15, 2023
    Authors
    Bendik Flåt Aas
    Description

    Greetings!

    If you are reading this then you might be an NBA junkie like me. I always wanted to have access to some data pertaining to my favorite players and teams across the years, and here i've tried to compile and accumulate data i could get hands on since 1995.

    A lot of the columns have been kept as raw as possible, but with additions like:

    • Adding a season column to indicate which season it is relevant for, and will help doing aggregations across different years
    • Adding a team column to indicate the team that the datapoints were relevant for. Makes making aggregations over time on a team level a bit easier
    • Adding a team_retconcolumn which will map franchise renames to reflect their current date team name.

    Note that duplicate player entries for a given season indicates a trade or switch of teams! Have fun!

    About the data

    There are 7 parquet files in this dataset:

    1. total.parq is a parquet file containing regular season stat totals per player for each team for observed years 1995-2023
    2. total_playoffs.parq is a parquet file containing playoff stat totals per player and for each team for observed years 1995-2023
    3. advanced.parq is a parquet file containing regular seasonal advanced stats (re: VOIP) per player for each team for observed years 1995-2023
    4. advanced_playoffs.parq is a parquet file containing more playoff advanced stats (re: VOIP) per player for each team for observed years 1995-2023
    5. average.parq is a parquet file containing regular season stats averages per player for each team for observed years 1995-2023
    6. average_playoffs.parq is a parquet file containing playoff stat averages per player for each team for observed years 1995-2023
    7. roster.parq is a parquet file containing the roster per team for seasons 1995-2023

    Loading the data

    If you are familiar with pandas, it is just as easy to read a parquet file as it is reading a standard csv file. The compression and space occupancy for parquet is however much lower!

    you can load it by simply writing:

    import pandas as pd
    df= pd.read_parquet('total.parq')
    

    in a notebook.

    All data is sourced and can be found at basketball-reference.com

  18. I

    Global Panda Polarization Maintaining Fibers Market Overview and Outlook...

    • statsndata.org
    excel, pdf
    Updated Nov 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats N Data (2025). Global Panda Polarization Maintaining Fibers Market Overview and Outlook 2025-2032 [Dataset]. https://www.statsndata.org/report/panda-polarization-maintaining-fibers-market-76737
    Explore at:
    pdf, excelAvailable download formats
    Dataset updated
    Nov 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The Panda Polarization Maintaining Fibers market has emerged as a crucial segment within the optical fiber industry, characterized by its unique ability to preserve the polarization of light passing through it. This feature is integral to various applications, particularly in telecommunications, aerospace, and medic

  19. I

    Global Panda PM Fiber Market Overview and Outlook 2025-2032

    • statsndata.org
    excel, pdf
    Updated Nov 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats N Data (2025). Global Panda PM Fiber Market Overview and Outlook 2025-2032 [Dataset]. https://www.statsndata.org/report/panda-pm-fiber-market-42839
    Explore at:
    excel, pdfAvailable download formats
    Dataset updated
    Nov 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The Panda PM Fiber market is an emerging segment within the broader textiles and fiber industry, renowned for its sustainable attributes and versatility. As a high-performance fiber, Panda PM Fiber is primarily used in various applications, ranging from apparel and fashion to home textiles and industrial products. I

  20. t

    Tibia Player Statistics Dataset

    • tibia-statistic.com
    Updated Nov 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    TibiaStatistic (2025). Tibia Player Statistics Dataset [Dataset]. https://www.tibia-statistic.com/statistics/players/kawaiii%20panda
    Explore at:
    Dataset updated
    Nov 16, 2025
    Dataset authored and provided by
    TibiaStatistic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2024 - Present
    Area covered
    Tibia Game Worlds
    Description

    Detailed online statistics for player Kawaiii Panda from world Bona. View daily activity and session history.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Abdelaziz Sami (2024). Compare Baseball Player Statistics using Visualiza [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/compare-baseball-player-statistics-using-visualiza
Organization logo

Compare Baseball Player Statistics using Visualiza

Analyzing Performance Metrics for Enhanced Insights

Explore at:
zip(1030978 bytes)Available download formats
Dataset updated
Sep 28, 2024
Authors
Abdelaziz Sami
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

To compare baseball player statistics effectively using visualization, we can create some insightful plots. Below are the steps to accomplish this in Python using libraries like Pandas and Matplotlib or Seaborn.

1. Load the Data

First, we need to load the judge.csv file into a DataFrame. This will allow us to manipulate and analyze the data easily.

2. Explore the Data

Before creating visualizations, it’s good to understand the data structure and identify the columns we want to compare. The relevant columns in your data include pitch_type, release_speed, game_date, and events.

3. Visualization

We can create various visualizations, such as: - A bar chart to compare the average release speed of different pitch types. - A line plot to visualize trends over time based on game dates. - A scatter plot to analyze the relationship between release speed and the outcome of the pitches (e.g., strikeouts, home runs).

Example Code

Here is a sample code to demonstrate how to create these visualizations using Matplotlib and Seaborn:

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load the data
df = pd.read_csv('judge.csv')

# Display the first few rows of the dataframe
print(df.head())

# Set the style of seaborn
sns.set(style="whitegrid")

# 1. Average Release Speed by Pitch Type
plt.figure(figsize=(12, 6))
avg_speed = df.groupby('pitch_type')['release_speed'].mean().sort_values()
sns.barplot(x=avg_speed.values, y=avg_speed.index, palette="viridis")
plt.title('Average Release Speed by Pitch Type')
plt.xlabel('Average Release Speed (mph)')
plt.ylabel('Pitch Type')
plt.show()

# 2. Trends in Release Speed Over Time
# First, convert the 'game_date' to datetime
df['game_date'] = pd.to_datetime(df['game_date'])

plt.figure(figsize=(14, 7))
sns.lineplot(data=df, x='game_date', y='release_speed', estimator='mean', ci=None)
plt.title('Trends in Release Speed Over Time')
plt.xlabel('Game Date')
plt.ylabel('Average Release Speed (mph)')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()

# 3. Scatter Plot of Release Speed vs. Events
plt.figure(figsize=(12, 6))
sns.scatterplot(data=df, x='release_speed', y='events', hue='pitch_type', alpha=0.7)
plt.title('Release Speed vs. Events')
plt.xlabel('Release Speed (mph)')
plt.ylabel('Event Type')
plt.legend(title='Pitch Type', bbox_to_anchor=(1.05, 1), loc='upper left')
plt.show()

Explanation of the Code

  • Data Loading: The CSV file is loaded into a Pandas DataFrame.
  • Average Release Speed: A bar chart shows the average release speed for each pitch type.
  • Trends Over Time: A line plot illustrates the trend in release speed over time, which can indicate changes in performance or strategy.
  • Scatter Plot: A scatter plot visualizes the relationship between release speed and different events, providing insight into performance outcomes.

Conclusion

These visualizations will help you compare player statistics in a meaningful way. You can customize the plots further based on your specific needs, such as filtering data for specific players or seasons. If you have any specific comparisons in mind or additional data to visualize, let me know!

Search
Clear search
Close search
Google apps
Main menu