100+ datasets found
  1. Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

    • plos.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.

  2. Compare Baseball Player Statistics using Visualiza

    • kaggle.com
    zip
    Updated Sep 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdelaziz Sami (2024). Compare Baseball Player Statistics using Visualiza [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/compare-baseball-player-statistics-using-visualiza
    Explore at:
    zip(1030978 bytes)Available download formats
    Dataset updated
    Sep 28, 2024
    Authors
    Abdelaziz Sami
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    To compare baseball player statistics effectively using visualization, we can create some insightful plots. Below are the steps to accomplish this in Python using libraries like Pandas and Matplotlib or Seaborn.

    1. Load the Data

    First, we need to load the judge.csv file into a DataFrame. This will allow us to manipulate and analyze the data easily.

    2. Explore the Data

    Before creating visualizations, it’s good to understand the data structure and identify the columns we want to compare. The relevant columns in your data include pitch_type, release_speed, game_date, and events.

    3. Visualization

    We can create various visualizations, such as: - A bar chart to compare the average release speed of different pitch types. - A line plot to visualize trends over time based on game dates. - A scatter plot to analyze the relationship between release speed and the outcome of the pitches (e.g., strikeouts, home runs).

    Example Code

    Here is a sample code to demonstrate how to create these visualizations using Matplotlib and Seaborn:

    import pandas as pd
    import matplotlib.pyplot as plt
    import seaborn as sns
    
    # Load the data
    df = pd.read_csv('judge.csv')
    
    # Display the first few rows of the dataframe
    print(df.head())
    
    # Set the style of seaborn
    sns.set(style="whitegrid")
    
    # 1. Average Release Speed by Pitch Type
    plt.figure(figsize=(12, 6))
    avg_speed = df.groupby('pitch_type')['release_speed'].mean().sort_values()
    sns.barplot(x=avg_speed.values, y=avg_speed.index, palette="viridis")
    plt.title('Average Release Speed by Pitch Type')
    plt.xlabel('Average Release Speed (mph)')
    plt.ylabel('Pitch Type')
    plt.show()
    
    # 2. Trends in Release Speed Over Time
    # First, convert the 'game_date' to datetime
    df['game_date'] = pd.to_datetime(df['game_date'])
    
    plt.figure(figsize=(14, 7))
    sns.lineplot(data=df, x='game_date', y='release_speed', estimator='mean', ci=None)
    plt.title('Trends in Release Speed Over Time')
    plt.xlabel('Game Date')
    plt.ylabel('Average Release Speed (mph)')
    plt.xticks(rotation=45)
    plt.tight_layout()
    plt.show()
    
    # 3. Scatter Plot of Release Speed vs. Events
    plt.figure(figsize=(12, 6))
    sns.scatterplot(data=df, x='release_speed', y='events', hue='pitch_type', alpha=0.7)
    plt.title('Release Speed vs. Events')
    plt.xlabel('Release Speed (mph)')
    plt.ylabel('Event Type')
    plt.legend(title='Pitch Type', bbox_to_anchor=(1.05, 1), loc='upper left')
    plt.show()
    

    Explanation of the Code

    • Data Loading: The CSV file is loaded into a Pandas DataFrame.
    • Average Release Speed: A bar chart shows the average release speed for each pitch type.
    • Trends Over Time: A line plot illustrates the trend in release speed over time, which can indicate changes in performance or strategy.
    • Scatter Plot: A scatter plot visualizes the relationship between release speed and different events, providing insight into performance outcomes.

    Conclusion

    These visualizations will help you compare player statistics in a meaningful way. You can customize the plots further based on your specific needs, such as filtering data for specific players or seasons. If you have any specific comparisons in mind or additional data to visualize, let me know!

  3. f

    UC_vs_US Statistic Analysis.xlsx

    • figshare.com
    xlsx
    Updated Jul 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 9, 2020
    Dataset provided by
    Utrecht University
    Authors
    F. (Fabiano) Dalpiaz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

    Tagging scheme:
    Aligned (AL) - A concept is represented as a class in both models, either
    

    with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

    All the calculations and information provided in the following sheets
    

    originate from that raw data.

    Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,
    

    including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

    Sheet 3 (Size-Ratio):
    

    The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

    Sheet 4 (Overall):
    

    Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

    For sheet 4 as well as for the following four sheets, diverging stacked bar
    

    charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

    Sheet 5 (By-Notation):
    

    Model correctness and model completeness is compared by notation - UC, US.

    Sheet 6 (By-Case):
    

    Model correctness and model completeness is compared by case - SIM, HOS, IFA.

    Sheet 7 (By-Process):
    

    Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

    Sheet 8 (By-Grade):
    

    Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.

  4. Ad hoc Statistical Analysis for surveys: 2020/21 Quarter 3

    • gov.uk
    • s3.amazonaws.com
    Updated Dec 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Digital, Culture, Media & Sport (2020). Ad hoc Statistical Analysis for surveys: 2020/21 Quarter 3 [Dataset]. https://www.gov.uk/government/statistical-data-sets/ad-hoc-statistical-analysis-202021-quarter-3
    Explore at:
    Dataset updated
    Dec 4, 2020
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Digital, Culture, Media & Sport
    Description

    This page lists ad-hoc statistics released during the period October to December 2020. These are additional analyses not included in any of the Department for Digital, Culture, Media and Sport’s standard publications.

    If you would like any further information please contact evidence@dcms.gov.uk.

    October 2020 - Taking Part: Lotteries request

    This piece of analysis covers:

    1. The proportion of adults who had played a National Lottery Game, who also had played any society lotteries in the last 12 months
    2. The proportion of adults who had played a Society Lottery Game, who also had played any National Lottery game in the last 12 months.

    Here is a link to the lotteries and gambling page for the annual Taking Part survey.

    https://assets.publishing.service.gov.uk/media/5f7c439dd3bf7f2d4df83aeb/Lottery_data_table.xlsx">National Lottery and Society Lottery Participation

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">70.2 KB</span></p>
    
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
    

    Request an accessible format.

      If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:enquiries@dcms.gov.uk" target="_blank" class="govuk-link">enquiries@dcms.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.
    

    October 2020 - Community Life Survey: Loneliness request

    This piece of analysis covers how often people feel they lack companionship, feel left out and feel isolated. This analysis also provides demographic breakdowns of the loneliness indicators.

    Here is a link to the wellbeing and loneliness page for the annual Community Life survey.

  5. Data generation volume worldwide 2010-2029

    • statista.com
    Updated Nov 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Data generation volume worldwide 2010-2029 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly. While it was estimated at ***** zettabytes in 2025, the forecast for 2029 stands at ***** zettabytes. Thus, global data generation will triple between 2025 and 2029. Data creation has been expanding continuously over the past decade. In 2020, the growth was higher than previously expected, caused by the increased demand due to the coronavirus (COVID-19) pandemic, as more people worked and learned from home and used home entertainment options more often.

  6. C

    Global Interactive LED Display for Business Market Industry Best Practices...

    • statsndata.org
    excel, pdf
    Updated Oct 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats N Data (2025). Global Interactive LED Display for Business Market Industry Best Practices 2025-2032 [Dataset]. https://www.statsndata.org/report/interactive-led-display-for-business-market-304559
    Explore at:
    pdf, excelAvailable download formats
    Dataset updated
    Oct 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The Interactive LED Display for Business market is rapidly evolving, revolutionizing the way organizations engage with their audiences. These dynamic displays serve as multifaceted communication tools, enhancing presentations, advertisements, and customer interactions in retail spaces, corporate environments, educat

  7. ERA5 post-processed daily statistics on single levels from 1940 to present

    • cds.climate.copernicus.eu
    grib
    Updated Dec 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ECMWF (2025). ERA5 post-processed daily statistics on single levels from 1940 to present [Dataset]. http://doi.org/10.24381/cds.4991cf48
    Explore at:
    gribAvailable download formats
    Dataset updated
    Dec 3, 2025
    Dataset provided by
    European Centre for Medium-Range Weather Forecastshttp://ecmwf.int/
    Authors
    ECMWF
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ERA5 is the fifth generation ECMWF reanalysis for the global climate and weather for the past 8 decades. Data is available from 1940 onwards. ERA5 replaces the ERA-Interim reanalysis. Reanalysis combines model data with observations from across the world into a globally complete and consistent dataset using the laws of physics. This principle, called data assimilation, is based on the method used by numerical weather prediction centres, where every so many hours (12 hours at ECMWF) a previous forecast is combined with newly available observations in an optimal way to produce a new best estimate of the state of the atmosphere, called analysis, from which an updated, improved forecast is issued. Reanalysis works in the same way, but at reduced resolution to allow for the provision of a dataset spanning back several decades. Reanalysis does not have the constraint of issuing timely forecasts, so there is more time to collect observations, and when going further back in time, to allow for the ingestion of improved versions of the original observations, which all benefit the quality of the reanalysis product. This catalogue entry provides post-processed ERA5 hourly single-level data aggregated to daily time steps. In addition to the data selection options found on the hourly page, the following options can be selected for the daily statistic calculation:

    The daily aggregation statistic (daily mean, daily max, daily min, daily sum*) The sub-daily frequency sampling of the original data (1 hour, 3 hours, 6 hours) The option to shift to any local time zone in UTC (no shift means the statistic is computed from UTC+00:00)

    *The daily sum is only available for the accumulated variables (see ERA5 documentation for more details). Users should be aware that the daily aggregation is calculated during the retrieval process and is not part of a permanently archived dataset. For more details on how the daily statistics are calculated, including demonstrative code, please see the documentation. For more details on the hourly data used to calculate the daily statistics, please refer to the ERA5 hourly single-level data catalogue entry and the documentation found therein.

  8. f

    Data from: Aspects of University Students' Graph Sense in a Virtual Learning...

    • scielo.figshare.com
    jpeg
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fabiana Chagas de Andrade; Carolina Vieira Schiller; Dione Aparecido Ferreira da Silva; Larissa Pereira Menezes; Alexandre Sousa da Silva (2023). Aspects of University Students' Graph Sense in a Virtual Learning Environment [Dataset]. http://doi.org/10.6084/m9.figshare.14304727.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    SciELO journals
    Authors
    Fabiana Chagas de Andrade; Carolina Vieira Schiller; Dione Aparecido Ferreira da Silva; Larissa Pereira Menezes; Alexandre Sousa da Silva
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract To break with the traditional model of Basic Statistics classes in Higher Education, we sought on Statistical Literacy and Critical Education to develop an activity about graphic interpretation, which took place in a Virtual Learning Environment (VLE), as a complement to classroom meetings. Twenty-three engineering students from a public higher education institution in Rio de Janeiro took part in the research. Our objective was to analyze elements of graphic comprehension in an activity that consisted of identifying incorrect statistical graphs, conveyed by the media, followed by argumentation and interaction among students about these errors. The main results evidenced that elements of the Graphic Sense were present in the discussions and were the goal of the students' critical analysis. The VLE was responsible for facilitating communication, fostering student participation, and linguistic writing, so the use of digital technologies and activities favored by collaboration and interaction are important for statistical development, but such construction is a gradual process.

  9. Statistical Data Return 2018 to 2019

    • gov.uk
    Updated Oct 16, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Regulator of Social Housing (2020). Statistical Data Return 2018 to 2019 [Dataset]. https://www.gov.uk/government/statistics/statistical-data-return-2018-to-2019
    Explore at:
    Dataset updated
    Oct 16, 2020
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Regulator of Social Housing
    Description

    The Regulator of Social Housing’s SDR collects data on stock size, types, location and rents at 31 March each year, and data on sales and acquisitions made between 1 April and 31 March.

    The statistics derived from the SDR data and published as Private registered provider social housing stock in England are considered by the United Kingdom Statistics Authority regulatory arm – the Office for Statistics Regulation – to have met the highest standards of trustworthiness, quality and public value, and are considered a national statistic. For more information see the data quality and methodology note.

    As part of our commitment to making the statistics based on these data timely and accessible, stock information was released on 19 September 2019 and rent information on 26 September. This page was updated on 10 October 2019 with all other data.

    The responsible statistician for this statistical release was Amanda Hall. The lead official was Jonathan Walters.

    Statistical queries on this publication should be directed to the Referrals and Regulatory Enquiries team on 0300 124 5225 or email enquiries@rsh.gov.uk.

    Users are encouraged to provide comments and feedback on how these statistics are used and how they meet user needs. Please send these entitled “SDR Feedback” to enquiries@rsh.gov.uk.

    We have issued press notices for each of this year’s SDR releases: stock profile; rents profile and all data including sector characteristics and stock movement.

    The annual SDR releases are available on the Statistical Data Return statistical releases collections page.

    An accessible HTML summary of the key findings from the report has been included on this page. If you require any further information please contact enquiries@rsh.gov.uk

  10. Z

    Data from: A 24-hour dynamic population distribution dataset based on mobile...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen (2022). A 24-hour dynamic population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4724388
    Explore at:
    Dataset updated
    Feb 16, 2022
    Dataset provided by
    Elisa Corporation
    Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
    Unit of Urban Research and Statistics, City of Helsinki / Digital Geography Lab, Department of Geosciences and Geography, University of Helsinki
    Department of Built Environment, Aalto University / Centre for Advanced Spatial Analysis, University College London
    Authors
    Claudia Bergroth; Olle Järv; Henrikki Tenkanen; Matti Manninen; Tuuli Toivonen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Finland, Helsinki Metropolitan Area
    Description

    Related article: Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39.

    In this dataset:

    We present temporally dynamic population distribution data from the Helsinki Metropolitan Area, Finland, at the level of 250 m by 250 m statistical grid cells. Three hourly population distribution datasets are provided for regular workdays (Mon – Thu), Saturdays and Sundays. The data are based on aggregated mobile phone data collected by the biggest mobile network operator in Finland. Mobile phone data are assigned to statistical grid cells using an advanced dasymetric interpolation method based on ancillary data about land cover, buildings and a time use survey. The data were validated by comparing population register data from Statistics Finland for night-time hours and a daytime workplace registry. The resulting 24-hour population data can be used to reveal the temporal dynamics of the city and examine population variations relevant to for instance spatial accessibility analyses, crisis management and planning.

    Please cite this dataset as:

    Bergroth, C., Järv, O., Tenkanen, H., Manninen, M., Toivonen, T., 2022. A 24-hour population distribution dataset based on mobile phone data from Helsinki Metropolitan Area, Finland. Scientific Data 9, 39. https://doi.org/10.1038/s41597-021-01113-4

    Organization of data

    The dataset is packaged into a single Zipfile Helsinki_dynpop_matrix.zip which contains following files:

    HMA_Dynamic_population_24H_workdays.csv represents the dynamic population for average workday in the study area.

    HMA_Dynamic_population_24H_sat.csv represents the dynamic population for average saturday in the study area.

    HMA_Dynamic_population_24H_sun.csv represents the dynamic population for average sunday in the study area.

    target_zones_grid250m_EPSG3067.geojson represents the statistical grid in ETRS89/ETRS-TM35FIN projection that can be used to visualize the data on a map using e.g. QGIS.

    Column names

    YKR_ID : a unique identifier for each statistical grid cell (n=13,231). The identifier is compatible with the statistical YKR grid cell data by Statistics Finland and Finnish Environment Institute.

    H0, H1 ... H23 : Each field represents the proportional distribution of the total population in the study area between grid cells during a one-hour period. In total, 24 fields are formatted as “Hx”, where x stands for the hour of the day (values ranging from 0-23). For example, H0 stands for the first hour of the day: 00:00 - 00:59. The sum of all cell values for each field equals to 100 (i.e. 100% of total population for each one-hour period)

    In order to visualize the data on a map, the result tables can be joined with the target_zones_grid250m_EPSG3067.geojson data. The data can be joined by using the field YKR_ID as a common key between the datasets.

    License Creative Commons Attribution 4.0 International.

    Related datasets

    Järv, Olle; Tenkanen, Henrikki & Toivonen, Tuuli. (2017). Multi-temporal function-based dasymetric interpolation tool for mobile phone data. Zenodo. https://doi.org/10.5281/zenodo.252612

    Tenkanen, Henrikki, & Toivonen, Tuuli. (2019). Helsinki Region Travel Time Matrix [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3247564

  11. Data from: Sports Data Analysis

    • kaggle.com
    zip
    Updated Aug 14, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mukesh Manral (2022). Sports Data Analysis [Dataset]. https://www.kaggle.com/datasets/mukeshmanral/fifa-data-for-eda-and-stats/code
    Explore at:
    zip(1523740 bytes)Available download formats
    Dataset updated
    Aug 14, 2022
    Authors
    Mukesh Manral
    License

    http://www.gnu.org/licenses/lgpl-3.0.htmlhttp://www.gnu.org/licenses/lgpl-3.0.html

    Description

    You want to create your own football club named ‘ultralearnManral’. - Your club don't have a team yet. - Team will require to hire players for their roster. - You wants to make players selection decisions using past data.

    Create some reports/kind of things which recommends data backed players for main team - To start with, a total 14-16 players are required. - Collected data contains information about players, clubs they are currently playing for and various performance measures.

    NOTE: As always assume budget for hiring players to be limited, team needs 18-22 possible players to choose from. - Formulating a report will help management/stack-holders make some decision regarding potential players.

  12. C

    Statistical Data Catalog Cologne

    • ckan.mobidatalab.eu
    Updated Jul 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Köln (2023). Statistical Data Catalog Cologne [Dataset]. https://ckan.mobidatalab.eu/dataset/statisticaldatacatalogue-coln
    Explore at:
    http://publications.europa.eu/resource/authority/file-type/csv(307022), http://publications.europa.eu/resource/authority/file-type/csv(272780), http://publications.europa.eu/resource/authority/file-type/json, http://publications.europa.eu/resource/authority/file-type/csv(3746), http://publications.europa.eu/resource/authority/file-type/csv(3752), http://publications.europa.eu/resource/authority/file-type/csv(274184), http://publications.europa.eu/resource/authority/file-type/csv(3735), http://publications.europa.eu/resource/authority/file-type/csv(275264), http://publications.europa.eu/resource/authority/file-type/csv(5356), http://publications.europa.eu/resource/authority/file-type/csv(273265), http://publications.europa.eu/resource/authority/file-type/csv(3730), http://publications.europa.eu/resource/authority/file-type/csv(19787), http://publications.europa.eu/resource/authority/file-type/csv(273515), http://publications.europa.eu/resource/authority/file-type/csv(272571), http://publications.europa.eu/resource/authority/file-type/csv(3748), http://publications.europa.eu/resource/authority/file-type/csv(3753), http://publications.europa.eu/resource/authority/file-type/csv(271286), http://publications.europa.eu/resource/authority/file-type/csv(3754), http://publications.europa.eu/resource/authority/file-type/csv(273516), http://publications.europa.eu/resource/authority/file-type/csv(273403), http://publications.europa.eu/resource/authority/file-type/csv(3764), http://publications.europa.eu/resource/authority/file-type/csv(1215), http://publications.europa.eu/resource/authority/file-type/csv(3758)Available download formats
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    Köln
    License

    Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
    License information was derived automatically

    Description

    Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.

    This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.

    Notes:

    • Data sources are indicated in the summary tables. When using the data, the data license Germany - attribution - version 2.0 must be observed.
    • Some values ​​cannot be given to protect statistical confidentiality. For the data sets of the Federal Employment Agency, these are values ​​from 1 to < 10, for all further data records values ​​from 1 to < 5. This is marked in the data by a * .
    • The differentiation of population figures by gender is currently made according to female and male residents. The case numbers of those who define themselves as non-binary/diverse are so low at a small-scale level that they cannot be reported for reasons of statistical confidentiality.
    • The determination of residents with a migration background is carried out by combination various characteristics from the resident registration procedure. The data are to be interpreted as estimates. The statistical yearbook of the city of Cologne provides further details.
    • The information on households comes from the household generation process. This is a statistical procedure in which residents within an address are assigned to a household as far as possible by querying certain criteria. If the procedure does not identify any connections, the allocation to single-person households takes place. The statistical yearbook of the city of Cologne provides further details.
    • The data set pupils* at general schools (spatial location by place of residence) is available from 2013.
    • The number of the statistical quarter or district is a spatial location and can be linked to the geodata (see related resource below).

  13. n

    Data from: Evaluating modularity in morphometric data: challenges with the...

    • data.niaid.nih.gov
    • datadryad.org
    • +1more
    zip
    Updated Nov 17, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dean C. Adams (2016). Evaluating modularity in morphometric data: challenges with the RV coefficient and a new test measure [Dataset]. http://doi.org/10.5061/dryad.2kt43
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 17, 2016
    Dataset provided by
    Iowa State University
    Authors
    Dean C. Adams
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Modularity describes the case where patterns of trait covariation are unevenly dispersed across traits. Specifically, trait correlations are high and concentrated within subsets of variables (modules), but the correlations between traits across modules are relatively weaker. For morphometric data sets, hypotheses of modularity are commonly evaluated using the RV coefficient, an association statistic used in a wide variety of fields. In this article, I explore the properties of the RV coefficient using simulated data sets. Using data drawn from a normal distribution where the data were neither modular nor integrated in structure, I show that the RV coefficient is adversely affected by attributes of the data (sample size and the number of variables) that do not characterize the covariance structure between sets of variables. Thus, with the RV coefficient, patterns of modularity or integration in data are confounded with trends generated by sample size and the number of variables, which limits biological interpretations and renders comparisons of RV coefficients across data sets uninformative. As an alternative, I propose the covariance ratio (CR) for quantifying modular structure and show that it is unaffected by sample size or the number of variables. Further, statistical tests based on the CR exhibit appropriate type I error rates and display higher statistical power relative to the RV coefficient when evaluating modular data. Overall, these findings demonstrate that the RV coefficient does not display statistical characteristics suitable for reliable assessment of hypotheses of modular or integrated structure and therefore should not be used to evaluate these patterns in morphological data sets. By contrast, the covariance ratio meets these criteria and provides a useful alternative method for assessing the degree of modular structure in morphological data.

  14. Statistical Rankings of Award-Winning Roller

    • kaggle.com
    zip
    Updated Dec 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2022). Statistical Rankings of Award-Winning Roller [Dataset]. https://www.kaggle.com/datasets/thedevastator/statistical-rankings-of-award-winning-roller-coa/discussion?sort=undefined
    Explore at:
    zip(59126 bytes)Available download formats
    Dataset updated
    Dec 25, 2022
    Authors
    The Devastator
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Roller Coasters Of The World

    Performance, Quality, and Innovation

    By [source]

    About this dataset

    This dataset offers a comprehensive analysis of roller coaster performance quality. It contains detailed information about everything from seating arrangements and speeds to points awarded, rankings, and even awards won! The three key data files are Golden_Ticket_Award_Winners_Steel.csv, Golden_Ticket_Award_Winners_Wood.csv and roller coasters.csv - all of which provide statistical data or rankings that accurately catalog the roller coaster performances available today. This dataset features an array of columns covering all facets from length and speed to rank, name, location and material type allowing for a detailed look at the modern day roller coaster performance analysis like never before! Unlock the power this data holds in deciphering what makes up some of today's most thrilling amusement park rides worldwide – providing users with statistics that will leave them exhilarated yet awed!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This dataset provides all the necessary information for you to get an insight into what roller coasters have received high awards and their individual scores. It is useful for finding out which roller coasters are most sought after and how each one was rated from an objective point of view. This information can be used in various ways, such as determining which amusement parks have the best rides, or looking up reviews and experiences from other people.

    Research Ideas

    • Creating a heatmap visualizing the number of award-winning roller coasters and the locations of amusement parks across different countries.
    • Creating an interactive timeline to compare and track the changes in rankings and points awarded over time for different types of roller coasters, such as steel or wood.
    • Creating a graph comparing speed, height and length for top-ranked roller coasters to show how their performance varies based on these parameters

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.

    Columns

    File: Golden_Ticket_Award_Winners_Steel.csv | Column name | Description | |:---------------|:-----------------------------------------------------------------| | Rank | The ranking of the roller coaster in the year. (Integer) | | Name | The name of the roller coaster. (String) | | Park | The amusement park where the roller coaster is located. (String) | | Location | The location of the amusement park. (String) | | Supplier | The manufacturer of the roller coaster. (String) | | Year Built | The year the roller coaster was built. (Integer) | | Points | The points awarded to the roller coaster. (Integer) |

    File: Golden_Ticket_Award_Winners_Wood.csv | Column name | Description | |:---------------|:-----------------------------------------------------------------| | Rank | The ranking of the roller coaster in the year. (Integer) | | Name | The name of the roller coaster. (String) | | Park | The amusement park where the roller coaster is located. (String) | | Location | The location of the amusement park. (String) | | Supplier | The manufacturer of the roller coaster. (String) | | Year Built | The year the roller coaster was built. (Integer) | | Points | The points awarded to the roller coaster. (Integer) |

    File: roller_coasters.csv | Column name | Description | |:-------------------|:--------------------------------------------------------------------| | name | The name of the roller coaster. (String) | | material_type | The type of material used to construct the roller coaster. (String) | | **seating...

  15. n

    Data from: A rapid and scalable method for multilocus species delimitation...

    • data.niaid.nih.gov
    zip
    Updated Apr 5, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomochika Fujisawa; Amr Aswad; Timothy G. Barraclough (2016). A rapid and scalable method for multilocus species delimitation using Bayesian model comparison and rooted triplets [Dataset]. http://doi.org/10.5061/dryad.3cb25
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 5, 2016
    Dataset provided by
    Imperial College London
    Kyoto University
    Authors
    Tomochika Fujisawa; Amr Aswad; Timothy G. Barraclough
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    UK
    Description

    Multilocus sequence data provide far greater power to resolve species limits than the single locus data typically used for broad surveys of clades. However, current statistical methods based on a multispecies coalescent framework are computationally demanding, because of the number of possible delimitations that must be compared and time-consuming likelihood calculations. New methods are therefore needed to open up the power of multilocus approaches to larger systematic surveys. Here, we present a rapid and scalable method that introduces 2 new innovations. First, the method reduces the complexity of likelihood calculations by decomposing the tree into rooted triplets. The distribution of topologies for a triplet across multiple loci has a uniform trinomial distribution when the 3 individuals belong to the same species, but a skewed distribution if they belong to separate species with a form that is specified by the multispecies coalescent. A Bayesian model comparison framework was developed and the best delimitation found by comparing the product of posterior probabilities of all triplets. The second innovation is a new dynamic programming algorithm for finding the optimum delimitation from all those compatible with a guide tree by successively analyzing subtrees defined by each node. This algorithm removes the need for heuristic searches used by current methods, and guarantees that the best solution is found and potentially could be used in other systematic applications. We assessed the performance of the method with simulated, published, and newly generated data. Analyses of simulated data demonstrate that the combined method has favorable statistical properties and scalability with increasing sample sizes. Analyses of empirical data from both eukaryotes and prokaryotes demonstrate its potential for delimiting species in real cases.

  16. M

    Global 18-22 Inch Industrial Display Market Industry Best Practices...

    • statsndata.org
    excel, pdf
    Updated Oct 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats N Data (2025). Global 18-22 Inch Industrial Display Market Industry Best Practices 2025-2032 [Dataset]. https://www.statsndata.org/report/18-22-inch-industrial-display-market-288219
    Explore at:
    pdf, excelAvailable download formats
    Dataset updated
    Oct 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The 18-22 inch industrial display market is a dynamic segment within the broader display technology landscape, catering to a variety of industries including manufacturing, healthcare, transportation, and retail. These displays are integral for applications such as process controls, data visualization, and user inter

  17. Ad-hoc statistical analysis: 2020/21 Quarter 2

    • gov.uk
    • s3.amazonaws.com
    Updated Sep 11, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Digital, Culture, Media & Sport (2020). Ad-hoc statistical analysis: 2020/21 Quarter 2 [Dataset]. https://www.gov.uk/government/statistical-data-sets/ad-hoc-statistical-analysis-202021-quarter-2
    Explore at:
    Dataset updated
    Sep 11, 2020
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Digital, Culture, Media & Sport
    Description

    This page lists ad-hoc statistics released during the period July - September 2020. These are additional analyses not included in any of the Department for Digital, Culture, Media and Sport’s standard publications.

    If you would like any further information please contact evidence@dcms.gov.uk.

    July 2020 - DCMS Economic Estimates: Number of businesses and Gross Value Added (GVA) by turnover band (2018)

    This analysis considers businesses in the DCMS Sectors split by whether they had reported annual turnover above or below £500 million, at one time the threshold for the Coronavirus Business Interruption Loan Scheme (CBILS). Please note the DCMS Sectors totals here exclude the Tourism and Civil Society sectors, for which data is not available or has been excluded for ease of comparability.

    The analysis looked at number of businesses; and total GVA generated for both turnover bands. In 2018, an estimated 112 DCMS Sector businesses had an annual turnover of £500m or more (0.03% of the total DCMS Sector businesses). These businesses generated 35.3% (£73.9bn) of all GVA by the DCMS Sectors.

    These are trends are broadly similar for the wider non-financial UK business economy, where an estimated 823 businesses had an annual turnover of £500m or more (0.03% of the total) and generated 24.3% (£409.9bn) of all GVA.

    The Digital Sector had an estimated 89 businesses (0.04% of all Digital Sector businesses) – the largest number – with turnover of £500m or more; and these businesses generated 41.5% (£61.9bn) of all GVA for the Digital Sector. By comparison, the Creative Industries had an estimated 44 businesses with turnover of £500m or more (0.01% of all Creative Industries businesses), and these businesses generated 23.9% (£26.7bn) of GVA for the Creative Industries sector.

    https://assets.publishing.service.gov.uk/media/5f05e78ce90e0712cc90b6f7/dcms-businesses-turnover-split-by-number-and-gva-2018.xlsx">Number and Gross Value Added by businesses in DCMS sectors, split by annual turnover, 2018

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">42.5 KB</span></p>
    

    July 2020 - ONS Opinions and Lifestyle Omnibus Survey, February 2020 Data Module

    This analysis shows estimates from the ONS Opinion and Lifestyle Omnibus Survey Data Module, commissioned by DCMS in February 2020. The Opinions and Lifestyles Survey (OPN) is run by the Office for National Statistics. For more information on the survey, please see the https://www.ons.gov.uk/aboutus/whatwedo/paidservices/opinions" class="govuk-link">ONS website.

    DCMS commissioned 19 questions to be included in the February 2020 survey relating to the public’s views on a range of data related issues, such as trust in different types of organisations when handling personal data, confidence using data skills at work, understanding of how data is managed by companies and the use of data skills at work.

    The high level results are included in the accompanying tables. The survey samples adults (16+) across the whole of Great Britain (excluding the Isles of Scilly).

    <a class="govuk-link" target="_s

  18. w

    Fire statistics data tables

    • gov.uk
    • s3.amazonaws.com
    Updated Oct 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Housing, Communities and Local Government (2025). Fire statistics data tables [Dataset]. https://www.gov.uk/government/statistical-data-sets/fire-statistics-data-tables
    Explore at:
    Dataset updated
    Oct 23, 2025
    Dataset provided by
    GOV.UK
    Authors
    Ministry of Housing, Communities and Local Government
    Description

    On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.

    This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collect information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.

    MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England but some (0101, 0103, 0201, 0501, 1401) tables are for Great Britain split by nation. In the past the Department for Communities and Local Government (who previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for devolved administrations are available at https://www.firescotland.gov.uk/about/statistics/">Scotland: Fire and Rescue Statistics, https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety">Wales: Community safety and https://www.nifrs.org/home/about-us/publications/">Northern Ireland: Fire and Rescue Statistics.

    If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.

    Related content

    Fire statistics guidance
    Fire statistics incident level datasets

    Incidents attended

    https://assets.publishing.service.gov.uk/media/68f0f810e8e4040c38a3cf96/FIRE0101.xlsx">FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 143 KB) Previous FIRE0101 tables

    https://assets.publishing.service.gov.uk/media/68f0ffd528f6872f1663ef77/FIRE0102.xlsx">FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.12 MB) Previous FIRE0102 tables

    https://assets.publishing.service.gov.uk/media/68f20a3e06e6515f7914c71c/FIRE0103.xlsx">FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 197 KB) Previous FIRE0103 tables

    https://assets.publishing.service.gov.uk/media/68f20a552f0fc56403a3cfef/FIRE0104.xlsx">FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 443 KB) Previous FIRE0104 tables

    Dwelling fires attended

    https://assets.publishing.service.gov.uk/media/68f100492f0fc56403a3cf94/FIRE0201.xlsx">FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet, 192 KB) Previous FIRE0201 tables

    <span class="gem

  19. Men's Mile Run World Record Progression History

    • kaggle.com
    zip
    Updated Jan 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Men's Mile Run World Record Progression History [Dataset]. https://www.kaggle.com/datasets/thedevastator/men-s-mile-run-world-record-progression-history
    Explore at:
    zip(3258 bytes)Available download formats
    Dataset updated
    Jan 14, 2023
    Authors
    The Devastator
    Description

    Men's Mile Run World Record Progression History (1861-Present)

    Examining the Athlete, Nationality and Venue Influence on Race Times

    By Ben Jones [source]

    About this dataset

    This remarkable dataset chronicles the world record progression of the men's mile run, containing detailed information on each athlete's time, their name, nationality, date of their accomplishment and the location of their event. It allows us to look back in history and get a comprehensive overview of how this track event has progressed over time. Analyzing this information can help us understand how training and technology have improved the event over the years; as well as study different athletes' performances and learn how some athletes have pushed beyond their limits or fell short. This valuable resource is an essential source for anyone intrigued by the cutting edge achievements in men's mile running world records. Discovering powerful insights from this dataset can allow us to gain perspective into not only our own personal goals but also uncover ideas on how we could continue pushing our physical boundaries by watching past successes. Explore and comprehend for yourself what it means to be a true athlete at heart!

    More Datasets

    For more datasets, click here.

    Featured Notebooks

    • 🚨 Your notebook can be here! 🚨!

    How to use the dataset

    This guide provides an introduction on how best to use this dataset in order to analyze various aspects involving the men’s mile run world records. We will focus on analyzing specific fields such as date, athlete name & nationality, time taken for completion and auto status by using statistical methods and graphical displays of data.

    In order to use this data effectively it is important that you understand what each field measures: • Time: The amount of time it took for an athlete to finish a race - measured in minutes and seconds (example: 3:54).
    • Auto: Whether or not a pacemaker was used during a specific race (example ; yes/no).
    • Athlete Name & Nationality: The name and nationality associated with an athlete who set \record(example; Usain Bolt - Jamaica).
    • Date : Year representing when a specific record was set by an individual( example-2021 ). •Venue : Location at which the record is set.(example; London Olympic Stadium )

    Now that you understand which fields measure what let’s discuss various ways that you can use these datasets features. Analyzing trends in historical sporting performances has long been utilized as means for understanding changes brought about by new training methods/technologies etc., over time . This can be done with our dataset by using basic statistical displays like bar graphs & average analysis or more advanced methods such as regression analysis or even Bayesian approaches etc..The first thing anyone interested should do when dealing with this sort of data is inspect any wacky outliers before beginning more rigorous analysis; if one discovers any potential unreasonable values it would be best to discard them before building after models or readings based off them (this sort of elimination is common practice).After cleaning your work space let’s move onto building interactive visual display through graphics ,plotting different columns against one another e.g., – plotting time against date allows us see changes overtime from 1861 until now . Additionally plotting time vs Auto allows us see any

    Research Ideas

    • Comparing individual athletes and identifying those who have consistently pushed the event to higher levels of performance.
    • Analyzing national trends related to improvement in track records over time, based on differences in training and technology.
    • Creating a heatmap to visualize the progression of track records around the world and locate regions with a particularly strong historical performance in this event

    Acknowledgements

    If you use this dataset in your research, please credit the original authors. Data Source

    License

    License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. -...

  20. Regional trade statistics interactive analysis: fourth quarter 2019

    • gov.uk
    Updated Mar 17, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HM Revenue & Customs (2020). Regional trade statistics interactive analysis: fourth quarter 2019 [Dataset]. https://www.gov.uk/government/statistical-data-sets/regional-trade-statistics-interactive-analysis-fourth-quarter-2019
    Explore at:
    Dataset updated
    Mar 17, 2020
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    HM Revenue & Customs
    Description

    They enable further analysis and comparison of Regional Trade in goods data and contain information that includes:

    • Quarterly information on the number of goods exporters and importers, by UK region and destination country.
    • Data on number of businesses exporting or importing
    • Average value of exports and imports by business per region.
    • Export and Import value by region.

    The spreadsheets provide data on businesses using both the whole number and proportion number methodology, (see section 3.24 (page 14) of the RTS methodology document).

    The spreadsheets will cover:

    • Importers by whole number business count
    • Importers by proportional business count
    • Exporters by whole number business count
    • Exporters by proportional business count

    The Exporters by proportional business count spreadsheet was previously produced by the Department for International Trade.

    https://assets.publishing.service.gov.uk/media/5e6ba3bde90e070ac9b2137a/2019_Q4_RTS_Exports_Proportion_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using proportional business count method

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">4.89 MB</span></p>
    
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
    

    Request an accessible format.

      If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:different.format@hmrc.gov.uk" target="_blank" class="govuk-link">different.format@hmrc.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.
    

    https://assets.publishing.service.gov.uk/media/5e6ba3ee86650c727adb43ae/2019_Q4_RTS_Exports_Whole_number_Interactive_Spreadsheet.xlsm">Q4 2019: Exports using whole number count method

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">4.9 MB</span></p>
    
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="g
    
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic (2023). Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm [Dataset]. http://doi.org/10.1371/journal.pbio.1002128
Organization logo

Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm

Explore at:
312 scholarly articles cite this dataset (View in Google Scholar)
docxAvailable download formats
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Tracey L. Weissgerber; Natasa M. Milic; Stacey J. Winham; Vesna D. Garovic
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Figures in scientific publications are critically important because they often show the data supporting key findings. Our systematic review of research articles published in top physiology journals (n = 703) suggests that, as scientists, we urgently need to change our practices for presenting continuous data in small sample size studies. Papers rarely included scatterplots, box plots, and histograms that allow readers to critically evaluate continuous data. Most papers presented continuous data in bar and line graphs. This is problematic, as many different data distributions can lead to the same bar or line graph. The full data may suggest different conclusions from the summary statistics. We recommend training investigators in data presentation, encouraging a more complete presentation of data, and changing journal editorial policies. Investigators can quickly make univariate scatterplots for small sample size studies using our Excel templates.

Search
Clear search
Close search
Google apps
Main menu