100+ datasets found

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm
plos.figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pbio.1002128
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.
Compare Baseball Player Statistics using Visualiza
kaggle.com
zip
Updated Sep 28, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abdelaziz Sami (2024). Compare Baseball Player Statistics using Visualiza [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/compare-baseball-player-statistics-using-visualiza
Explore at:
zip(1030978 bytes)Available download formats
Dataset updated
Sep 28, 2024
Authors
Abdelaziz Sami
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
To compare baseball player statistics effectively using visualization, we can create some insightful plots. Below are the steps to accomplish this in Python using libraries like Pandas and Matplotlib or Seaborn.

1. Load the Data

First, we need to load the judge.csv file into a DataFrame. This will allow us to manipulate and analyze the data easily.

2. Explore the Data

Before creating visualizations, it’s good to understand the data structure and identify the columns we want to compare. The relevant columns in your data include pitch_type, release_speed, game_date, and events.

3. Visualization

We can create various visualizations, such as: - A bar chart to compare the average release speed of different pitch types. - A line plot to visualize trends over time based on game dates. - A scatter plot to analyze the relationship between release speed and the outcome of the pitches (e.g., strikeouts, home runs).

Example Code

Here is a sample code to demonstrate how to create these visualizations using Matplotlib and Seaborn:

import pandas as pd import matplotlib.pyplot as plt import seaborn as sns # Load the data df = pd.read_csv('judge.csv') # Display the first few rows of the dataframe print(df.head()) # Set the style of seaborn sns.set(style="whitegrid") # 1. Average Release Speed by Pitch Type plt.figure(figsize=(12, 6)) avg_speed = df.groupby('pitch_type')['release_speed'].mean().sort_values() sns.barplot(x=avg_speed.values, y=avg_speed.index, palette="viridis") plt.title('Average Release Speed by Pitch Type') plt.xlabel('Average Release Speed (mph)') plt.ylabel('Pitch Type') plt.show() # 2. Trends in Release Speed Over Time # First, convert the 'game_date' to datetime df['game_date'] = pd.to_datetime(df['game_date']) plt.figure(figsize=(14, 7)) sns.lineplot(data=df, x='game_date', y='release_speed', estimator='mean', ci=None) plt.title('Trends in Release Speed Over Time') plt.xlabel('Game Date') plt.ylabel('Average Release Speed (mph)') plt.xticks(rotation=45) plt.tight_layout() plt.show() # 3. Scatter Plot of Release Speed vs. Events plt.figure(figsize=(12, 6)) sns.scatterplot(data=df, x='release_speed', y='events', hue='pitch_type', alpha=0.7) plt.title('Release Speed vs. Events') plt.xlabel('Release Speed (mph)') plt.ylabel('Event Type') plt.legend(title='Pitch Type', bbox_to_anchor=(1.05, 1), loc='upper left') plt.show()

Explanation of the Code

Data Loading: The CSV file is loaded into a Pandas DataFrame.

Average Release Speed: A bar chart shows the average release speed for each pitch type.

Trends Over Time: A line plot illustrates the trend in release speed over time, which can indicate changes in performance or strategy.

Scatter Plot: A scatter plot visualizes the relationship between release speed and different events, providing insight into performance outcomes.

Conclusion

These visualizations will help you compare player statistics in a meaningful way. You can customize the plots further based on your specific needs, such as filtering data for specific players or seasons. If you have any specific comparisons in mind or additional data to visualize, let me know!
f
UC_vs_US Statistic Analysis.xlsx
figshare.com
xlsx
Updated Jul 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.23644/uu.12631628.v1
Dataset updated
Jul 9, 2020
Dataset provided by
Utrecht University
Authors
F. (Fabiano) Dalpiaz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

Tagging scheme: Aligned (AL) - A concept is represented as a class in both models, either

with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

All the calculations and information provided in the following sheets

originate from that raw data.

Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,

including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

Sheet 3 (Size-Ratio):

The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

Sheet 4 (Overall):

Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

For sheet 4 as well as for the following four sheets, diverging stacked bar

charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

Sheet 5 (By-Notation):

Model correctness and model completeness is compared by notation - UC, US.

Sheet 6 (By-Case):

Model correctness and model completeness is compared by case - SIM, HOS, IFA.

Sheet 7 (By-Process):

Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

Sheet 8 (By-Grade):

Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.
Ad hoc Statistical Analysis for surveys: 2020/21 Quarter 3
gov.uk
s3.amazonaws.com
Updated Dec 4, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department for Digital, Culture, Media & Sport (2020). Ad hoc Statistical Analysis for surveys: 2020/21 Quarter 3 [Dataset]. https://www.gov.uk/government/statistical-data-sets/ad-hoc-statistical-analysis-202021-quarter-3
Explore at:
Dataset updated
Dec 4, 2020
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Department for Digital, Culture, Media & Sport
Description
This page lists ad-hoc statistics released during the period October to December 2020. These are additional analyses not included in any of the Department for Digital, Culture, Media and Sport’s standard publications.

If you would like any further information please contact evidence@dcms.gov.uk.

October 2020 - Taking Part: Lotteries request

This piece of analysis covers:

The proportion of adults who had played a National Lottery Game, who also had played any society lotteries in the last 12 months

The proportion of adults who had played a Society Lottery Game, who also had played any National Lottery game in the last 12 months.

Here is a link to the lotteries and gambling page for the annual Taking Part survey.

https://assets.publishing.service.gov.uk/media/5f7c439dd3bf7f2d4df83aeb/Lottery_data_table.xlsx">

https://assets.publishing.service.gov.uk/media/5f7c439dd3bf7f2d4df83aeb/Lottery_data_table.xlsx">National Lottery and Society Lottery Participation

MS Excel Spreadsheet, 70.2 KB This file may not be suitable for users of assistive technology. <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">

Request an accessible format.

If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:enquiries@dcms.gov.uk" target="_blank" class="govuk-link">enquiries@dcms.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.

October 2020 - Community Life Survey: Loneliness request

This piece of analysis covers how often people feel they lack companionship, feel left out and feel isolated. This analysis also provides demographic breakdowns of the loneliness indicators.

Here is a link to the wellbeing and loneliness page for the annual Community Life survey.

https://assets.publishing.service.gov.uk/media/5f7c45e78fa8f5545098fe82/Loneliness_Demographic_Tables.xlsx">
Data generation volume worldwide 2010-2029
statista.com
Updated Nov 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Data generation volume worldwide 2010-2029 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
Explore at:
Dataset updated
Nov 19, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly. While it was estimated at ***** zettabytes in 2025, the forecast for 2029 stands at ***** zettabytes. Thus, global data generation will triple between 2025 and 2029. Data creation has been expanding continuously over the past decade. In 2020, the growth was higher than previously expected, caused by the increased demand due to the coronavirus (COVID-19) pandemic, as more people worked and learned from home and used home entertainment options more often.
C
Global Interactive LED Display for Business Market Industry Best Practices...
statsndata.org
excel, pdf
Updated Oct 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stats N Data (2025). Global Interactive LED Display for Business Market Industry Best Practices 2025-2032 [Dataset]. https://www.statsndata.org/report/interactive-led-display-for-business-market-304559
Explore at:
pdf, excelAvailable download formats
Dataset updated
Oct 2025
Dataset authored and provided by
Stats N Data
License
https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order
Area covered
Global
Description
The Interactive LED Display for Business market is rapidly evolving, revolutionizing the way organizations engage with their audiences. These dynamic displays serve as multifaceted communication tools, enhancing presentations, advertisements, and customer interactions in retail spaces, corporate environments, educat
ERA5 post-processed daily statistics on single levels from 1940 to present
cds.climate.copernicus.eu
grib
Updated Dec 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ECMWF (2025). ERA5 post-processed daily statistics on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.4991cf48
Explore at:
gribAvailable download formats
Unique identifier
https://doi.org/10.24381/cds.4991cf48
Dataset updated
Dec 3, 2025
Dataset provided by
European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
Authors
ECMWF
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. This catalogue entry provides post-processed ERA5 hourly single-level data aggregated to daily time steps. In addition to the data selection options found on the hourly page, the following options can be selected for the daily statistic calculation:

The daily aggregation statistic (daily mean, daily max, daily min, daily sum*) The sub-daily frequency sampling of the original data (1 hour, 3 hours, 6 hours) The option to shift to any local time zone in UTC (no shift means the statistic is computed from UTC+00:00)

*The daily sum is only available for the accumulated variables (see ERA5 documentation for more details). Users should be aware that the daily aggregation is calculated during the retrieval process and is not part of a permanently archived dataset. For more details on how the daily statistics are calculated, including demonstrative code, please see the documentation. For more details on the hourly data used to calculate the daily statistics, please refer to the ERA5 hourly single-level data catalogue entry and the documentation found therein.
f
Data from: Aspects of University Students' Graph Sense in a Virtual Learning...
scielo.figshare.com
jpeg
Updated Jun 3, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fabiana Chagas de Andrade; Carolina Vieira Schiller; Dione Aparecido Ferreira da Silva; Larissa Pereira Menezes; Alexandre Sousa da Silva (2023). Aspects of University Students' Graph Sense in a Virtual Learning Environment [Dataset]. http://doi.org/10.6084/m9.figshare.14304727.v1
Explore at:
jpegAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.14304727.v1
Dataset updated
Jun 3, 2023
Dataset provided by
SciELO journals
Authors
Fabiana Chagas de Andrade; Carolina Vieira Schiller; Dione Aparecido Ferreira da Silva; Larissa Pereira Menezes; Alexandre Sousa da Silva
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract To break with the traditional model of Basic Statistics classes in Higher Education, we sought on Statistical Literacy and Critical Education to develop an activity about graphic interpretation, which took place in a Virtual Learning Environment (VLE), as a complement to classroom meetings. Twenty-three engineering students from a public higher education institution in Rio de Janeiro took part in the research. Our objective was to analyze elements of graphic comprehension in an activity that consisted of identifying incorrect statistical graphs, conveyed by the media, followed by argumentation and interaction among students about these errors. The main results evidenced that elements of the Graphic Sense were present in the discussions and were the goal of the students' critical analysis. The VLE was responsible for facilitating communication, fostering student participation, and linguistic writing, so the use of digital technologies and activities favored by collaboration and interaction are important for statistical development, but such construction is a gradual process.
Statistical Data Return 2018 to 2019
gov.uk
Updated Oct 16, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Regulator of Social Housing (2020). Statistical Data Return 2018 to 2019 [Dataset]. https://www.gov.uk/government/statistics/statistical-data-return-2018-to-2019
Explore at:
Dataset updated
Oct 16, 2020
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Regulator of Social Housing
Description
The Regulator of Social Housing’s SDR collects data on stock size, types, location and rents at 31 March each year, and data on sales and acquisitions made between 1 April and 31 March.

The statistics derived from the SDR data and published as Private registered provider social housing stock in England are considered by the United Kingdom Statistics Authority regulatory arm – the Office for Statistics Regulation – to have met the highest standards of trustworthiness, quality and public value, and are considered a national statistic. For more information see the data quality and methodology note.

As part of our commitment to making the statistics based on these data timely and accessible, stock information was released on 19 September 2019 and rent information on 26 September. This page was updated on 10 October 2019 with all other data.

The responsible statistician for this statistical release was Amanda Hall. The lead official was Jonathan Walters.

Statistical queries on this publication should be directed to the Referrals and Regulatory Enquiries team on 0300 124 5225 or email enquiries@rsh.gov.uk.

Users are encouraged to provide comments and feedback on how these statistics are used and how they meet user needs. Please send these entitled “SDR Feedback” to enquiries@rsh.gov.uk.

We have issued press notices for each of this year’s SDR releases: stock profile; rents profile and all data including sector characteristics and stock movement.

The annual SDR releases are available on the Statistical Data Return statistical releases collections page.

An accessible HTML summary of the key findings from the report has been included on this page. If you require any further information please contact enquiries@rsh.gov.uk
Z
Data from: A 24-hour dynamic population distribution dataset based on mobile...
data.niaid.nih.gov
zenodo.org
Updated Feb 16, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen (2022). A 24-hour dynamic population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4724388
Explore at:
Dataset updated
Feb 16, 2022
Dataset provided by
Elisa Corporation
Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
Unit of Urban Research and Statistics, City of Helsinki / Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
Department of Built Environment, Aalto University / Centre for Advanced Spatial Analysis, University College London
Authors
Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Finland, Helsinki Metropolitan Area
Description
Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.

In this dataset:

We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.

Please cite this dataset as:

Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4

Organization of data

The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:

HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.

HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.

HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.

target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.

Column names

YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.

H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)

In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.

License Creative Commons Attribution 4.0 International.

Related datasets

Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612

Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564
Data from: Sports Data Analysis
kaggle.com
zip
Updated Aug 14, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mukesh Manral (2022). Sports Data Analysis [Dataset]. https://www.kaggle.com/datasets/mukeshmanral/fifa-data-for-eda-and-stats/code
Explore at:
zip(1523740 bytes)Available download formats
Dataset updated
Aug 14, 2022
Authors
Mukesh Manral
License
http://www.gnu.org/licenses/lgpl-3.0.htmlhttp://www.gnu.org/licenses/lgpl-3.0.html
Description
You want to create your own football club named ‘ultralearnManral’. - Your club don't have a team yet. - Team will require to hire players for their roster. - You wants to make players selection decisions using past data.

Create some reports/kind of things which recommends data backed players for main team - To start with, a total 14-16 players are required. - Collected data contains information about players, clubs they are currently playing for and various performance measures.

NOTE: As always assume budget for hiring players to be limited, team needs 18-22 possible players to choose from. - Formulating a report will help management/stack-holders make some decision regarding potential players.
C
Statistical Data Catalog Cologne
ckan.mobidatalab.eu
Updated Jul 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Köln (2023). Statistical Data Catalog Cologne [Dataset]. https://ckan.mobidatalab.eu/dataset/statisticaldatacatalogue-coln
Explore at:
http://publications.europa.eu/resource/authority/file-type/csv(307022), http://publications.europa.eu/resource/authority/file-type/csv(272780), http://publications.europa.eu/resource/authority/file-type/json, http://publications.europa.eu/resource/authority/file-type/csv(3746), http://publications.europa.eu/resource/authority/file-type/csv(3752), http://publications.europa.eu/resource/authority/file-type/csv(274184), http://publications.europa.eu/resource/authority/file-type/csv(3735), http://publications.europa.eu/resource/authority/file-type/csv(275264), http://publications.europa.eu/resource/authority/file-type/csv(5356), http://publications.europa.eu/resource/authority/file-type/csv(273265), http://publications.europa.eu/resource/authority/file-type/csv(3730), http://publications.europa.eu/resource/authority/file-type/csv(19787), http://publications.europa.eu/resource/authority/file-type/csv(273515), http://publications.europa.eu/resource/authority/file-type/csv(272571), http://publications.europa.eu/resource/authority/file-type/csv(3748), http://publications.europa.eu/resource/authority/file-type/csv(3753), http://publications.europa.eu/resource/authority/file-type/csv(271286), http://publications.europa.eu/resource/authority/file-type/csv(3754), http://publications.europa.eu/resource/authority/file-type/csv(273516), http://publications.europa.eu/resource/authority/file-type/csv(273403), http://publications.europa.eu/resource/authority/file-type/csv(3764), http://publications.europa.eu/resource/authority/file-type/csv(1215), http://publications.europa.eu/resource/authority/file-type/csv(3758)Available download formats
Dataset updated
Jul 26, 2023
Dataset provided by
Köln
License
Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically
Description
Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.

This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.

Notes:

Data sources are indicated in the summary tables. When using the data, the data license Germany - attribution - version 2.0 must be observed.

Some values cannot be given to protect statistical confidentiality. For the data sets of the Federal Employment Agency, these are values from 1 to < 10, for all further data records values from 1 to < 5. This is marked in the data by a * .

The differentiation of population figures by gender is currently made according to female and male residents. The case numbers of those who define themselves as non-binary/diverse are so low at a small-scale level that they cannot be reported for reasons of statistical confidentiality.

The determination of residents with a migration background is carried out by combination various characteristics from the resident registration procedure. The data are to be interpreted as estimates. The statistical yearbook of the city of Cologne provides further details.

The information on households comes from the household generation process. This is a statistical procedure in which residents within an address are assigned to a household as far as possible by querying certain criteria. If the procedure does not identify any connections, the allocation to single-person households takes place. The statistical yearbook of the city of Cologne provides further details.

The data set pupils* at general schools (spatial location by place of residence) is available from 2013.

The number of the statistical quarter or district is a spatial location and can be linked to the geodata (see related resource below).
n
Data from: Evaluating modularity in morphometric data: challenges with the...
data.niaid.nih.gov
datadryad.org
+1more
zip
Updated Nov 17, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dean C. Adams (2016). Evaluating modularity in morphometric data: challenges with the RV coefficient and a new test measure [Dataset]. http://doi.org/10.5061/dryad.2kt43
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.2kt43
Dataset updated
Nov 17, 2016
Dataset provided by
Iowa State University
Authors
Dean C. Adams
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Modularity describes the case where patterns of trait covariation are unevenly dispersed across traits. Specifically, trait correlations are high and concentrated within subsets of variables (modules), but the correlations between traits across modules are relatively weaker. For morphometric data sets, hypotheses of modularity are commonly evaluated using the RV coefficient, an association statistic used in a wide variety of fields. In this article, I explore the properties of the RV coefficient using simulated data sets. Using data drawn from a normal distribution where the data were neither modular nor integrated in structure, I show that the RV coefficient is adversely affected by attributes of the data (sample size and the number of variables) that do not characterize the covariance structure between sets of variables. Thus, with the RV coefficient, patterns of modularity or integration in data are confounded with trends generated by sample size and the number of variables, which limits biological interpretations and renders comparisons of RV coefficients across data sets uninformative. As an alternative, I propose the covariance ratio (CR) for quantifying modular structure and show that it is unaffected by sample size or the number of variables. Further, statistical tests based on the CR exhibit appropriate type I error rates and display higher statistical power relative to the RV coefficient when evaluating modular data. Overall, these findings demonstrate that the RV coefficient does not display statistical characteristics suitable for reliable assessment of hypotheses of modular or integrated structure and therefore should not be used to evaluate these patterns in morphological data sets. By contrast, the covariance ratio meets these criteria and provides a useful alternative method for assessing the degree of modular structure in morphological data.
Statistical Rankings of Award-Winning Roller
kaggle.com
zip
Updated Dec 25, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2022). Statistical Rankings of Award-Winning Roller [Dataset]. https://www.kaggle.com/datasets/thedevastator/statistical-rankings-of-award-winning-roller-coa/discussion?sort=undefined
Explore at:
zip(59126 bytes)Available download formats
Dataset updated
Dec 25, 2022
Authors
The Devastator
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Roller Coasters Of The World

Performance, Quality, and Innovation

By [source]

About this dataset

This dataset offers a comprehensive analysis of roller coaster performance quality. It contains detailed information about everything from seating arrangements and speeds to points awarded, rankings, and even awards won! The three key data files are Golden_Ticket_Award_Winners_Steel.csv, Golden_Ticket_Award_Winners_Wood.csv and roller coasters.csv - all of which provide statistical data or rankings that accurately catalog the roller coaster performances available today. This dataset features an array of columns covering all facets from length and speed to rank, name, location and material type allowing for a detailed look at the modern day roller coaster performance analysis like never before! Unlock the power this data holds in deciphering what makes up some of today's most thrilling amusement park rides worldwide – providing users with statistics that will leave them exhilarated yet awed!

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This dataset provides all the necessary information for you to get an insight into what roller coasters have received high awards and their individual scores. It is useful for finding out which roller coasters are most sought after and how each one was rated from an objective point of view. This information can be used in various ways, such as determining which amusement parks have the best rides, or looking up reviews and experiences from other people.

Research Ideas

Creating a heatmap visualizing the number of award-winning roller coasters and the locations of amusement parks across different countries.

Creating an interactive timeline to compare and track the changes in rankings and points awarded over time for different types of roller coasters, such as steel or wood.

Creating a graph comparing speed, height and length for top-ranked roller coasters to show how their performance varies based on these parameters

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

Columns

File: Golden_Ticket_Award_Winners_Steel.csv | Column name | Description | |:---------------|:-----------------------------------------------------------------| | Rank | The ranking of the roller coaster in the year. (Integer) | | Name | The name of the roller coaster. (String) | | Park | The amusement park where the roller coaster is located. (String) | | Location | The location of the amusement park. (String) | | Supplier | The manufacturer of the roller coaster. (String) | | Year Built | The year the roller coaster was built. (Integer) | | Points | The points awarded to the roller coaster. (Integer) |

File: Golden_Ticket_Award_Winners_Wood.csv | Column name | Description | |:---------------|:-----------------------------------------------------------------| | Rank | The ranking of the roller coaster in the year. (Integer) | | Name | The name of the roller coaster. (String) | | Park | The amusement park where the roller coaster is located. (String) | | Location | The location of the amusement park. (String) | | Supplier | The manufacturer of the roller coaster. (String) | | Year Built | The year the roller coaster was built. (Integer) | | Points | The points awarded to the roller coaster. (Integer) |

File: roller_coasters.csv | Column name | Description | |:-------------------|:--------------------------------------------------------------------| | name | The name of the roller coaster. (String) | | material_type | The type of material used to construct the roller coaster. (String) | | **seating...
n
Data from: A rapid and scalable method for multilocus species delimitation...
data.niaid.nih.gov
zip
Updated Apr 5, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tomochika Fujisawa; Amr Aswad; Timothy G. Barraclough (2016). A rapid and scalable method for multilocus species delimitation using Bayesian model comparison and rooted triplets [Dataset]. http://doi.org/10.5061/dryad.3cb25
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.3cb25
Dataset updated
Apr 5, 2016
Dataset provided by
Imperial College London
Kyoto University
Authors
Tomochika Fujisawa; Amr Aswad; Timothy G. Barraclough
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Area covered
UK
Description
Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases.
M
Global 18-22 Inch Industrial Display Market Industry Best Practices...
statsndata.org
excel, pdf
Updated Oct 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stats N Data (2025). Global 18-22 Inch Industrial Display Market Industry Best Practices 2025-2032 [Dataset]. https://www.statsndata.org/report/18-22-inch-industrial-display-market-288219
Explore at:
pdf, excelAvailable download formats
Dataset updated
Oct 2025
Dataset authored and provided by
Stats N Data
License
https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order
Area covered
Global
Description
The 18-22 inch industrial display market is a dynamic segment within the broader display technology landscape, catering to a variety of industries including manufacturing, healthcare, transportation, and retail. These displays are integral for applications such as process controls, data visualization, and user inter
Ad-hoc statistical analysis: 2020/21 Quarter 2
gov.uk
s3.amazonaws.com
Updated Sep 11, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department for Digital, Culture, Media & Sport (2020). Ad-hoc statistical analysis: 2020/21 Quarter 2 [Dataset]. https://www.gov.uk/government/statistical-data-sets/ad-hoc-statistical-analysis-202021-quarter-2
Explore at:
Dataset updated
Sep 11, 2020
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Department for Digital, Culture, Media & Sport
Description
This page lists ad-hoc statistics released during the period July - September 2020. These are additional analyses not included in any of the Department for Digital, Culture, Media and Sport’s standard publications.

If you would like any further information please contact evidence@dcms.gov.uk.

July 2020 - DCMS Economic Estimates: Number of businesses and Gross Value Added (GVA) by turnover band (2018)

This analysis considers businesses in the DCMS Sectors split by whether they had reported annual turnover above or below £500 million, at one time the threshold for the Coronavirus Business Interruption Loan Scheme (CBILS). Please note the DCMS Sectors totals here exclude the Tourism and Civil Society sectors, for which data is not available or has been excluded for ease of comparability.

The analysis looked at number of businesses; and total GVA generated for both turnover bands. In 2018, an estimated 112 DCMS Sector businesses had an annual turnover of £500m or more (0.03% of the total DCMS Sector businesses). These businesses generated 35.3% (£73.9bn) of all GVA by the DCMS Sectors.

These are trends are broadly similar for the wider non-financial UK business economy, where an estimated 823 businesses had an annual turnover of £500m or more (0.03% of the total) and generated 24.3% (£409.9bn) of all GVA.

The Digital Sector had an estimated 89 businesses (0.04% of all Digital Sector businesses) – the largest number – with turnover of £500m or more; and these businesses generated 41.5% (£61.9bn) of all GVA for the Digital Sector. By comparison, the Creative Industries had an estimated 44 businesses with turnover of £500m or more (0.01% of all Creative Industries businesses), and these businesses generated 23.9% (£26.7bn) of GVA for the Creative Industries sector.

https://assets.publishing.service.gov.uk/media/5f05e78ce90e0712cc90b6f7/dcms-businesses-turnover-split-by-number-and-gva-2018.xlsx">

https://assets.publishing.service.gov.uk/media/5f05e78ce90e0712cc90b6f7/dcms-businesses-turnover-split-by-number-and-gva-2018.xlsx">Number and Gross Value Added by businesses in DCMS sectors, split by annual turnover, 2018

MS Excel Spreadsheet, 42.5 KB

July 2020 - ONS Opinions and Lifestyle Omnibus Survey, February 2020 Data Module

This analysis shows estimates from the ONS Opinion and Lifestyle Omnibus Survey Data Module, commissioned by DCMS in February 2020. The Opinions and Lifestyles Survey (OPN) is run by the Office for National Statistics. For more information on the survey, please see the https://www.ons.gov.uk/aboutus/whatwedo/paidservices/opinions" class="govuk-link">ONS website.

DCMS commissioned 19 questions to be included in the February 2020 survey relating to the public’s views on a range of data related issues, such as trust in different types of organisations when handling personal data, confidence using data skills at work, understanding of how data is managed by companies and the use of data skills at work.

The high level results are included in the accompanying tables. The survey samples adults (16+) across the whole of Great Britain (excluding the Isles of Scilly).

<a class="govuk-link" target="_s
w
Fire statistics data tables
gov.uk
s3.amazonaws.com
Updated Oct 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ministry of Housing, Communities and Local Government (2025). Fire statistics data tables [Dataset]. https://www.gov.uk/government/statistical-data-sets/fire-statistics-data-tables
Explore at:
Dataset updated
Oct 23, 2025
Dataset provided by
GOV.UK
Authors
Ministry of Housing, Communities and Local Government
Description

On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.

This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collect information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.

MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England but some (0101, 0103, 0201, 0501, 1401) tables are for Great Britain split by nation. In the past the Department for Communities and Local Government (who previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for devolved administrations are available at https://www.firescotland.gov.uk/about/statistics/">Scotland: Fire and Rescue Statistics, https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety">Wales: Community safety and https://www.nifrs.org/home/about-us/publications/">Northern Ireland: Fire and Rescue Statistics.

If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.

Related content

Fire statistics guidance
Fire statistics incident level datasets

Incidents attended

https://assets.publishing.service.gov.uk/media/68f0f810e8e4040c38a3cf96/FIRE0101.xlsx">FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 143 KB) Previous FIRE0101 tables

https://assets.publishing.service.gov.uk/media/68f0ffd528f6872f1663ef77/FIRE0102.xlsx">FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.12 MB) Previous FIRE0102 tables

https://assets.publishing.service.gov.uk/media/68f20a3e06e6515f7914c71c/FIRE0103.xlsx">FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 197 KB) Previous FIRE0103 tables

https://assets.publishing.service.gov.uk/media/68f20a552f0fc56403a3cfef/FIRE0104.xlsx">FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 443 KB) Previous FIRE0104 tables

Dwelling fires attended

https://assets.publishing.service.gov.uk/media/68f100492f0fc56403a3cf94/FIRE0201.xlsx">FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet, 192 KB) Previous FIRE0201 tables

<span class="gem
Men's Mile Run World Record Progression History
kaggle.com
zip
Updated Jan 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2023). Men's Mile Run World Record Progression History [Dataset]. https://www.kaggle.com/datasets/thedevastator/men-s-mile-run-world-record-progression-history
Explore at:
zip(3258 bytes)Available download formats
Dataset updated
Jan 14, 2023
Authors
The Devastator
Description
Men's Mile Run World Record Progression History (1861-Present)

Examining the Athlete, Nationality and Venue Influence on Race Times

By Ben Jones [source]

About this dataset

This remarkable dataset chronicles the world record progression of the men's mile run, containing detailed information on each athlete's time, their name, nationality, date of their accomplishment and the location of their event. It allows us to look back in history and get a comprehensive overview of how this track event has progressed over time. Analyzing this information can help us understand how training and technology have improved the event over the years; as well as study different athletes' performances and learn how some athletes have pushed beyond their limits or fell short. This valuable resource is an essential source for anyone intrigued by the cutting edge achievements in men's mile running world records. Discovering powerful insights from this dataset can allow us to gain perspective into not only our own personal goals but also uncover ideas on how we could continue pushing our physical boundaries by watching past successes. Explore and comprehend for yourself what it means to be a true athlete at heart!

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This guide provides an introduction on how best to use this dataset in order to analyze various aspects involving the men’s mile run world records. We will focus on analyzing specific fields such as date, athlete name & nationality, time taken for completion and auto status by using statistical methods and graphical displays of data.

In order to use this data effectively it is important that you understand what each field measures: • Time: The amount of time it took for an athlete to finish a race - measured in minutes and seconds (example: 3:54).
• Auto: Whether or not a pacemaker was used during a specific race (example ; yes/no).
• Athlete Name & Nationality: The name and nationality associated with an athlete who set \record(example; Usain Bolt - Jamaica).
• Date : Year representing when a specific record was set by an individual( example-2021 ). •Venue : Location at which the record is set.(example; London Olympic Stadium )

Now that you understand which fields measure what let’s discuss various ways that you can use these datasets features. Analyzing trends in historical sporting performances has long been utilized as means for understanding changes brought about by new training methods/technologies etc., over time . This can be done with our dataset by using basic statistical displays like bar graphs & average analysis or more advanced methods such as regression analysis or even Bayesian approaches etc..The first thing anyone interested should do when dealing with this sort of data is inspect any wacky outliers before beginning more rigorous analysis; if one discovers any potential unreasonable values it would be best to discard them before building after models or readings based off them (this sort of elimination is common practice).After cleaning your work space let’s move onto building interactive visual display through graphics ,plotting different columns against one another e.g., – plotting time against date allows us see changes overtime from 1861 until now . Additionally plotting time vs Auto allows us see any

Research Ideas

Comparing individual athletes and identifying those who have consistently pushed the event to higher levels of performance.

Analyzing national trends related to improvement in track records over time, based on differences in training and technology.

Creating a heatmap to visualize the progression of track records around the world and locate regions with a particularly strong historical performance in this event

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. -...
Regional trade statistics interactive analysis: fourth quarter 2019
gov.uk
Updated Mar 17, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HM Revenue & Customs (2020). Regional trade statistics interactive analysis: fourth quarter 2019 [Dataset]. https://www.gov.uk/government/statistical-data-sets/regional-trade-statistics-interactive-analysis-fourth-quarter-2019
Explore at:
Dataset updated
Mar 17, 2020
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
HM Revenue & Customs
Description
They enable further analysis and comparison of Regional Trade in goods data and contain information that includes:

Quarterly information on the number of goods exporters and importers, by UK region and destination country.

Data on number of businesses exporting or importing

Average value of exports and imports by business per region.

Export and Import value by region.

The spreadsheets provide data on businesses using both the whole number and proportion number methodology, (see section 3.24 (page 14) of the RTS methodology document).

The spreadsheets will cover:

Importers by whole number business count

Importers by proportional business count

Exporters by whole number business count

Exporters by proportional business count

The Exporters by proportional business count spreadsheet was previously produced by the Department for International Trade.

https://assets.publishing.service.gov.uk/media/5e6ba3bde90e070ac9b2137a/2019_Q4_RTS_Exports_Proportion_Interactive_Spreadsheet.xlsm">

https://assets.publishing.service.gov.uk/media/5e6ba3bde90e070ac9b2137a/2019_Q4_RTS_Exports_Proportion_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using proportional business count method

4.89 MB This file may not be suitable for users of assistive technology. <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">

Request an accessible format.

If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:different.format@hmrc.gov.uk" target="_blank" class="govuk-link">different.format@hmrc.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.

https://assets.publishing.service.gov.uk/media/5e6ba3ee86650c727adb43ae/2019_Q4_RTS_Exports_Whole_number_Interactive_Spreadsheet.xlsm">

https://assets.publishing.service.gov.uk/media/5e6ba3ee86650c727adb43ae/2019_Q4_RTS_Exports_Whole_number_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using whole number count method

4.9 MB This file may not be suitable for users of assistive technology. <details data-module="g

Facebook

Twitter

Click to copy link

Link copied

Cite

Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

Explore at:

312 scholarly articles cite this dataset (View in Google Scholar)

docxAvailable download formats

Unique identifier

https://doi.org/10.1371/journal.pbio.1002128

Dataset updated

May 31, 2023

Dataset provided by

PLOShttp://plos.org/

Authors

Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.

Clear search

Close search

Google apps

Main menu

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

Compare Baseball Player Statistics using Visualiza

1. Load the Data

2. Explore the Data

3. Visualization

Example Code

Explanation of the Code

Conclusion

UC_vs_US Statistic Analysis.xlsx

Ad hoc Statistical Analysis for surveys: 2020/21 Quarter 3

October 2020 - Taking Part: Lotteries request

https://assets.publishing.service.gov.uk/media/5f7c439dd3bf7f2d4df83aeb/Lottery_data_table.xlsx">National Lottery and Society Lottery Participation

October 2020 - Community Life Survey: Loneliness request

Data generation volume worldwide 2010-2029

Global Interactive LED Display for Business Market Industry Best Practices...

ERA5 post-processed daily statistics on single levels from 1940 to present

Data from: Aspects of University Students' Graph Sense in a Virtual Learning...

Statistical Data Return 2018 to 2019

Data from: A 24-hour dynamic population distribution dataset based on mobile...

Data from: Sports Data Analysis

Statistical Data Catalog Cologne

Data from: Evaluating modularity in morphometric data: challenges with the...

Statistical Rankings of Award-Winning Roller

Roller Coasters Of The World

Performance, Quality, and Innovation

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Columns

Data from: A rapid and scalable method for multilocus species delimitation...

Global 18-22 Inch Industrial Display Market Industry Best Practices...

Ad-hoc statistical analysis: 2020/21 Quarter 2

July 2020 - DCMS Economic Estimates: Number of businesses and Gross Value Added (GVA) by turnover band (2018)

https://assets.publishing.service.gov.uk/media/5f05e78ce90e0712cc90b6f7/dcms-businesses-turnover-split-by-number-and-gva-2018.xlsx">Number and Gross Value Added by businesses in DCMS sectors, split by annual turnover, 2018

July 2020 - ONS Opinions and Lifestyle Omnibus Survey, February 2020 Data Module

Fire statistics data tables

Related content

Incidents attended

Dwelling fires attended

Men's Mile Run World Record Progression History

Men's Mile Run World Record Progression History (1861-Present)

Examining the Athlete, Nationality and Venue Influence on Race Times

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Research Ideas

Acknowledgements

License

Regional trade statistics interactive analysis: fourth quarter 2019

https://assets.publishing.service.gov.uk/media/5e6ba3bde90e070ac9b2137a/2019_Q4_RTS_Exports_Proportion_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using proportional business count method

https://assets.publishing.service.gov.uk/media/5e6ba3ee86650c727adb43ae/2019_Q4_RTS_Exports_Whole_number_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using whole number count method

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm