24 datasets found
  1. Statistical Function in Excel

    • kaggle.com
    zip
    Updated Feb 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sanjana Murthy (2024). Statistical Function in Excel [Dataset]. https://www.kaggle.com/datasets/sanjanamurthy392/statistical-function
    Explore at:
    zip(1412940 bytes)Available download formats
    Dataset updated
    Feb 7, 2024
    Authors
    Sanjana Murthy
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This data contains functions like: Sum, Average, Max, Min, Sumif, Sumifs, Count, Countblank, Countifs, Counta, Averageif, Averageifs.

  2. Data from: Excel Templates: A Helpful Tool for Teaching Statistics

    • tandf.figshare.com
    zip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alejandro Quintela-del-Rƭo; Mario Francisco-FernƔndez (2023). Excel Templates: A Helpful Tool for Teaching Statistics [Dataset]. http://doi.org/10.6084/m9.figshare.3408052.v2
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Taylor & Francishttps://taylorandfrancis.com/
    Authors
    Alejandro Quintela-del-Rƭo; Mario Francisco-FernƔndez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This article describes a free, open-source collection of templates for the popular Excel (2013, and later versions) spreadsheet program. These templates are spreadsheet files that allow easy and intuitive learning and the implementation of practical examples concerning descriptive statistics, random variables, confidence intervals, and hypothesis testing. Although they are designed to be used with Excel, they can also be employed with other free spreadsheet programs (changing some particular formulas). Moreover, we exploit some possibilities of the ActiveX controls of the Excel Developer Menu to perform interactive Gaussian density charts. Finally, it is important to note that they can be often embedded in a web page, so it is not necessary to employ Excel software for their use. These templates have been designed as a useful tool to teach basic statistics and to carry out data analysis even when the students are not familiar with Excel. Additionally, they can be used as a complement to other analytical software packages. They aim to assist students in learning statistics, within an intuitive working environment. Supplementary materials with the Excel templates are available online.

  3. Extended 1.0 Dataset of "Concentration and Geospatial Modelling of Health...

    • zenodo.org
    bin, csv, pdf
    Updated Sep 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peter Domjan; Peter Domjan; Viola Angyal; Viola Angyal; Istvan Vingender; Istvan Vingender (2024). Extended 1.0 Dataset of "Concentration and Geospatial Modelling of Health Development Offices' Accessibility for the Total and Elderly Populations in Hungary" [Dataset]. http://doi.org/10.5281/zenodo.13826993
    Explore at:
    bin, pdf, csvAvailable download formats
    Dataset updated
    Sep 23, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Peter Domjan; Peter Domjan; Viola Angyal; Viola Angyal; Istvan Vingender; Istvan Vingender
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Sep 23, 2024
    Area covered
    Hungary
    Description

    Introduction

    We are enclosing the database used in our research titled "Concentration and Geospatial Modelling of Health Development Offices' Accessibility for the Total and Elderly Populations in Hungary", along with our statistical calculations. For the sake of reproducibility, further information can be found in the file Short_Description_of_Data_Analysis.pdf and Statistical_formulas.pdf

    The sharing of data is part of our aim to strengthen the base of our scientific research. As of March 7, 2024, the detailed submission and analysis of our research findings to a scientific journal has not yet been completed.

    The dataset was expanded on 23rd September 2024 to include SPSS statistical analysis data, a heatmap, and buffer zone analysis around the Health Development Offices (HDOs) created in QGIS software.

    Short Description of Data Analysis and Attached Files (datasets):

    Our research utilised data from 2022, serving as the basis for statistical standardisation. The 2022 Hungarian census provided an objective basis for our analysis, with age group data available at the county level from the Hungarian Central Statistical Office (KSH) website. The 2022 demographic data provided an accurate picture compared to the data available from the 2023 microcensus. The used calculation is based on our standardisation of the 2022 data. For xlsx files, we used MS Excel 2019 (version: 1808, build: 10406.20006) with the SOLVER add-in.

    Hungarian Central Statistical Office served as the data source for population by age group, county, and regions: https://www.ksh.hu/stadat_files/nep/hu/nep0035.html, (accessed 04 Jan. 2024.) with data recorded in MS Excel in the Data_of_demography.xlsx file.

    In 2022, 108 Health Development Offices (HDOs) were operational, and it's noteworthy that no developments have occurred in this area since 2022. The availability of these offices and the demographic data from the Central Statistical Office in Hungary are considered public interest data, freely usable for research purposes without requiring permission.

    The contact details for the Health Development Offices were sourced from the following page (Hungarian National Population Centre (NNK)): https://www.nnk.gov.hu/index.php/efi (n=107). The Semmelweis University Health Development Centre was not listed by NNK, hence it was separately recorded as the 108th HDO. More information about the office can be found here: https://semmelweis.hu/egeszsegfejlesztes/en/ (n=1). (accessed 05 Dec. 2023.)

    Geocoordinates were determined using Google Maps (N=108): https://www.google.com/maps. (accessed 02 Jan. 2024.) Recording of geocoordinates (latitude and longitude according to WGS 84 standard), address data (postal code, town name, street, and house number), and the name of each HDO was carried out in the: Geo_coordinates_and_names_of_Hungarian_Health_Development_Offices.csv file.

    The foundational software for geospatial modelling and display (QGIS 3.34), an open-source software, can be downloaded from:

    https://qgis.org/en/site/forusers/download.html. (accessed 04 Jan. 2024.)

    The HDOs_GeoCoordinates.gpkg QGIS project file contains Hungary's administrative map and the recorded addresses of the HDOs from the

    Geo_coordinates_and_names_of_Hungarian_Health_Development_Offices.csv file,

    imported via .csv file.

    The OpenStreetMap tileset is directly accessible from www.openstreetmap.org in QGIS. (accessed 04 Jan. 2024.)

    The Hungarian county administrative boundaries were downloaded from the following website: https://data2.openstreetmap.hu/hatarok/index.php?admin=6 (accessed 04 Jan. 2024.)

    HDO_Buffers.gpkg is a QGIS project file that includes the administrative map of Hungary, the county boundaries, as well as the HDO offices and their corresponding buffer zones with a radius of 7.5 km.

    Heatmap.gpkg is a QGIS project file that includes the administrative map of Hungary, the county boundaries, as well as the HDO offices and their corresponding heatmap (Kernel Density Estimation).

    A brief description of the statistical formulas applied is included in the Statistical_formulas.pdf.

    Recording of our base data for statistical concentration and diversification measurement was done using MS Excel 2019 (version: 1808, build: 10406.20006) in .xlsx format.

    • Aggregated number of HDOs by county: Number_of_HDOs.xlsx
    • Standardised data (Number of HDOs per 100,000 residents): Standardized_data.xlsx
    • Calculation of the Lorenz curve: Lorenz_curve.xlsx
    • Calculation of the Gini index: Gini_Index.xlsx
    • Calculation of the LQ index: LQ_Index.xlsx
    • Calculation of the Herfindahl-Hirschman Index: Herfindahl_Hirschman_Index.xlsx
    • Calculation of the Entropy index: Entropy_Index.xlsx
    • Regression and correlation analysis calculation: Regression_correlation.xlsx

    Using the SPSS 29.0.1.0 program, we performed the following statistical calculations with the databases Data_HDOs_population_without_outliers.sav and Data_HDOs_population.sav:

    • Regression curve estimation with elderly population and number of HDOs, excluding outlier values (Types of analyzed equations: Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S, Growth, Exponential, Logistic, with summary and ANOVA analysis table): Curve_estimation_elderly_without_outlier.spv
    • Pearson correlation table between the total population, elderly population, and number of HDOs per county, excluding outlier values such as Budapest and Pest County: Pearson_Correlation_populations_HDOs_number_without_outliers.spv.
    • Dot diagram including total population and number of HDOs per county, excluding outlier values such as Budapest and Pest Counties: Dot_HDO_total_population_without_outliers.spv.
    • Dot diagram including elderly (64<) population and number of HDOs per county, excluding outlier values such as Budapest and Pest Counties: Dot_HDO_elderly_population_without_outliers.spv
    • Regression curve estimation with total population and number of HDOs, excluding outlier values (Types of analyzed equations: Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S, Growth, Exponential, Logistic, with summary and ANOVA analysis table): Curve_estimation_without_outlier.spv
    • Dot diagram including elderly (64<) population and number of HDOs per county: Dot_HDO_elderly_population.spv
    • Dot diagram including total population and number of HDOs per county: Dot_HDO_total_population.spv
    • Pearson correlation table between the total population, elderly population, and number of HDOs per county: Pearson_Correlation_populations_HDOs_number.spv
    • Regression curve estimation with total population and number of HDOs, (Types of analyzed equations: Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S, Growth, Exponential, Logistic, with summary and ANOVA analysis table): Curve_estimation_total_population.spv

    For easier readability, the files have been provided in both SPV and PDF formats.

    The translation of these supplementary files into English was completed on 23rd Sept. 2024.

    If you have any further questions regarding the dataset, please contact the corresponding author: domjan.peter@phd.semmelweis.hu

  4. Employee Analysis In Excel

    • kaggle.com
    zip
    Updated Mar 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Afolabi Raymond (2024). Employee Analysis In Excel [Dataset]. https://www.kaggle.com/datasets/afolabiraymond/employee-analysis-in-excel
    Explore at:
    zip(190258 bytes)Available download formats
    Dataset updated
    Mar 20, 2024
    Authors
    Afolabi Raymond
    Description

    In this project, I analysed the employees of an organization located in two distinct countries using Excel. This project covers:

    1) How to approach a data analysis project 2) How to systematically clean data 3) Doing EDA with Excel formulas & tables 4) How to use Power Query to combine two datasets 5) Statistical Analysis of data 6) Using formulas like COUNTIFS, SUMIFS, XLOOKUP 7) Making an information finder with your data 8) Male vs. Female Analysis with Pivot tables 9) Calculating Bonuses based on business rules 10) Visual analytics of data with 4 topics 11) Analysing the salary spread (Histograms & Box plots) 12) Relationship between Salary & Rating 13) Staff growth over time - trend analysis 14) Regional Scorecard to compare NZ with India

    Including various Excel features such as: 1) Using Tables 2) Working with Power Query 3) Formulas 4) Pivot Tables 5) Conditional formatting 6) Charts 7) Data Validation 8) Keyboard Shortcuts & tricks 9) Dashboard Design

  5. m

    Dataset of development of business during the COVID-19 crisis

    • data.mendeley.com
    • narcis.nl
    Updated Nov 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tatiana N. Litvinova (2020). Dataset of development of business during the COVID-19 crisis [Dataset]. http://doi.org/10.17632/9vvrd34f8t.1
    Explore at:
    Dataset updated
    Nov 9, 2020
    Authors
    Tatiana N. Litvinova
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    To create the dataset, the top 10 countries leading in the incidence of COVID-19 in the world were selected as of October 22, 2020 (on the eve of the second full of pandemics), which are presented in the Global 500 ranking for 2020: USA, India, Brazil, Russia, Spain, France and Mexico. For each of these countries, no more than 10 of the largest transnational corporations included in the Global 500 rating for 2020 and 2019 were selected separately. The arithmetic averages were calculated and the change (increase) in indicators such as profitability and profitability of enterprises, their ranking position (competitiveness), asset value and number of employees. The arithmetic mean values of these indicators for all countries of the sample were found, characterizing the situation in international entrepreneurship as a whole in the context of the COVID-19 crisis in 2020 on the eve of the second wave of the pandemic. The data is collected in a general Microsoft Excel table. Dataset is a unique database that combines COVID-19 statistics and entrepreneurship statistics. The dataset is flexible data that can be supplemented with data from other countries and newer statistics on the COVID-19 pandemic. Due to the fact that the data in the dataset are not ready-made numbers, but formulas, when adding and / or changing the values in the original table at the beginning of the dataset, most of the subsequent tables will be automatically recalculated and the graphs will be updated. This allows the dataset to be used not just as an array of data, but as an analytical tool for automating scientific research on the impact of the COVID-19 pandemic and crisis on international entrepreneurship. The dataset includes not only tabular data, but also charts that provide data visualization. The dataset contains not only actual, but also forecast data on morbidity and mortality from COVID-19 for the period of the second wave of the pandemic in 2020. The forecasts are presented in the form of a normal distribution of predicted values and the probability of their occurrence in practice. This allows for a broad scenario analysis of the impact of the COVID-19 pandemic and crisis on international entrepreneurship, substituting various predicted morbidity and mortality rates in risk assessment tables and obtaining automatically calculated consequences (changes) on the characteristics of international entrepreneurship. It is also possible to substitute the actual values identified in the process and following the results of the second wave of the pandemic to check the reliability of pre-made forecasts and conduct a plan-fact analysis. The dataset contains not only the numerical values of the initial and predicted values of the set of studied indicators, but also their qualitative interpretation, reflecting the presence and level of risks of a pandemic and COVID-19 crisis for international entrepreneurship.

  6. Data from: Student Academic Performance Dataset

    • kaggle.com
    Updated Oct 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hackathon data (2025). Student Academic Performance Dataset [Dataset]. https://www.kaggle.com/datasets/aryancodes12fyds/student-academic-performance-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Hackathon data
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    šŸ“˜ Description

    The Student Academic Performance Dataset contains detailed academic and lifestyle information of 250 students, created to analyze how various factors — such as study hours, sleep, attendance, stress, and social media usage — influence their overall academic outcomes and GPA.

    This dataset is synthetic but realistic, carefully generated to reflect believable academic patterns and relationships. It’s perfect for learning data analysis, statistics, and visualization using Excel, Python, or R.

    The data includes 12 attributes, primarily numerical, ensuring that it’s suitable for a wide range of analytical tasks — from basic descriptive statistics (mean, median, SD) to correlation and regression analysis.

    šŸ“Š Key Features

    🧮 250 rows and 12 columns

    šŸ’” Mostly numerical — great for Excel-based statistical functions

    šŸ” No missing values — ready for direct use

    šŸ“ˆ Balanced and realistic — ideal for clear visualizations and trend analysis

    šŸŽÆ Suitable for:

    Descriptive statistics

    Correlation & regression

    Data visualization projects

    Dashboard creation (Excel, Tableau, Power BI)

    šŸ’” Possible Insights to Explore

    How do study hours impact GPA?

    Is there a relationship between stress levels and performance?

    Does social media usage reduce study efficiency?

    Do students with higher attendance achieve better grades?

    āš™ļø Data Generation Details

    Each record represents a unique student.

    GPA is calculated using a weighted formula based on midterm and final scores.

    Relationships are designed to be realistic — for example:

    Higher study hours → higher scores and GPA

    Higher stress → slightly lower sleep hours

    Excessive social media time → reduced academic performance

    āš ļø Disclaimer

    This dataset is synthetically generated using statistical modeling techniques and does not contain any real student data. It is intended purely for educational, analytical, and research purposes.

  7. The Cost of Wider Fire Access Codes (Supplementary Material)

    • figshare.com
    xlsx
    Updated May 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scott Brody (2023). The Cost of Wider Fire Access Codes (Supplementary Material) [Dataset]. http://doi.org/10.6084/m9.figshare.22778735.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 8, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Scott Brody
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    There are 4 files which supplement my manuscript. Excel File Raw Data contains elements including street name, street width, dwelling units, neighborhood area, and parking characteristics.

    Neighborhood averages were copied to the Excel file Measurement Summary for additional analysis.

    More detailed statistical calculations were performed using Texas Instruments TI-nspire software. Outputs are available in the PDF Statistical Calculations.

    Following review, the paper’s methodology was simplified. The simplified methodology produces the same result as the original process, and the older calculation files were not modified. The Excel file Updated Street Width Formulas displays both processes side by side.

  8. f

    The raw data_calculations Microsoft Excel file contains all the raw...

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Mar 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lebar, Matthew D.; Carter-Wientjes, Carol H.; Lohmar, Jessica M.; Cary, Jeffrey W.; Wei, Qijian; Mack, Brian M.; Gross, Stephanie R. (2025). The raw data_calculations Microsoft Excel file contains all the raw numerical data and calculations that were used to make figure 2, figure 3, figure 4, figure 5, figure 6, figure 7, S1 Fig , and S3 Fig . [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002080010
    Explore at:
    Dataset updated
    Mar 3, 2025
    Authors
    Lebar, Matthew D.; Carter-Wientjes, Carol H.; Lohmar, Jessica M.; Cary, Jeffrey W.; Wei, Qijian; Mack, Brian M.; Gross, Stephanie R.
    Description

    Additionally, all P-values used to determine statistical significance have also been included in this file. (XLSX)

  9. Formula 1 Stats 1998-2021

    • kaggle.com
    zip
    Updated Jan 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nick Peterselie (2022). Formula 1 Stats 1998-2021 [Dataset]. https://www.kaggle.com/datasets/nickpeterselie/formula-1-stats-19982021
    Explore at:
    zip(67314 bytes)Available download formats
    Dataset updated
    Jan 21, 2022
    Authors
    Nick Peterselie
    Description

    An excel file containing the following on the seasons 1998 to 2021: -Personal stats of drivers (championship finishes, wins/season, total wins, podiums, points, fastest laps and pole positions) -Championship stats (drivers and teams, with colours, and their championship positions at the end of each season) -Table with the wins per circuit per year (also with colours) and the wins per team per year

    This dataset was mainly made for fun / nice looking visualization so first open it in excel to see the colours as well. If you want to use it for more complex purposes, I would recommend to do some data-prepping

  10. r

    Data from: INDILACT – Extended voluntary waiting period in primiparous dairy...

    • researchdata.se
    Updated Mar 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anna Edvardsson Rasmussen (2025). INDILACT – Extended voluntary waiting period in primiparous dairy cows. Part 2: Customized VWP – Metadata and R–scripts with statistical calculations [Dataset]. https://researchdata.se/en/catalogue/dataset/2024-424
    Explore at:
    (108812), (428272), (22393), (5799), (3243), (23818), (5034), (747913), (7170)Available download formats
    Dataset updated
    Mar 13, 2025
    Dataset provided by
    Swedish University of Agricultural Sciences (SLU)
    Authors
    Anna Edvardsson Rasmussen
    Time period covered
    Jan 1, 2019 - Oct 27, 2022
    Area covered
    Sweden
    Description

    This is part 2 of INDILACT, part 1 is published separately.

    The objective of this study is to investigate how a customized voluntary waiting period before first insemination in prmiparous dairy cows would affect milk production, fertility and health of primparous dairy cows during their first calving interval.

    The data was registered between January 2019 and october 2022.

    This data is archived: - Metadata (publically available) - Raw data (.txt files) from the Swedish national herd recording scheme (SNDRS), operated by VƤxa Sverige: access restricted due to agreements with the principle owners of the data, VƤxa Sverige and the farms. Code lists are available in INDILACT part 1. - Aggregated data (Excel files): access restricted due to agreements with the principle owners of the data, VƤxa Sverige and the farms - R- scripts with statistical calculations (Openly available)

    Metadata (3 filer): - Metadata gentypning: The only new file type compared to INDILACT Part 1, description of how this data category have been handled. The other file-types have been handled in the same way as in INDILACT Part 1. - Metadata - del 2 - General summary of initioal data handeling for aggregation of the files of the same types (dates etc.) to create excel-files used in the R-scripts. - DisCodes: Divisions of the diagnoses into categories.

    Raw data: -59 .txt files containing data retrieved from SNDRS from 8 separate occacions. -Data from 18 Swedish farms from Jan 2019 to Oct 2022.

    Aggregeated data: - 29 Excelfiles. The textfiles have been transformed to Excel formate and all data from the same file type is aggregated into one file. - Data collected from the farms by email and phone contact, about individual cows enrolled in the trial, from Oct 2020 to Oct 2022. - One merged Script derived from initial data handeling in R where relevant variables were calculated and aggregated to be used for statistical calculations.

    R-script with data handeling and statistical calculations: - "Data analysis part 2 - final": Data handeling to create the file used in the statistical calculations. - "Part 2 - Binomial models - Fertility": Statistiscal calculations of variables using Binomial models. - "Part 2 - glmmTMB models - Fertility": Statistiscal calculations of variables using glmmTMB models. - "Part 2 - linear models - Fertility": Statistiscal calculations of fertility variables using linear models. - "Part 2 - linear models": Statistiscal calculations of milk variables using linear models.

    Running the R scripts requires access to the restricted files. The files should be unpacked in a subdirectory "data" relative to the working directory for the scripts. See also the file "sessionInfo.txt" for information on R packages used.

  11. qPCR calculations and statistical analysis from "Transcriptomics illuminates...

    • figshare.com
    xlsx
    Updated Jan 29, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Courtney Stairs (2019). qPCR calculations and statistical analysis from "Transcriptomics illuminates the invasive lifestyle of the salmon gut pathogen Spironucleus salmonicida" [Dataset]. http://doi.org/10.6084/m9.figshare.7583846.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jan 29, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Courtney Stairs
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Excel table with Pfaffl calculations (qPCR_calc.xlsx)Prism file with statistical analysis (qPCRcalc.pzfx)Excel table of statistical analysis (qPCR_tests.xlsx; if reader does not have access to prism)

  12. 2011 skills for life survey: small area estimation data

    • gov.uk
    Updated Dec 12, 2012
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Business, Innovation & Skills (2012). 2011 skills for life survey: small area estimation data [Dataset]. https://www.gov.uk/government/statistical-data-sets/2011-skills-for-life-survey-small-area-estimation-data
    Explore at:
    Dataset updated
    Dec 12, 2012
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Business, Innovation & Skills
    Description

    Small area estimation modelling methods have been applied to the 2011 Skills for Life survey data in order to generate local level area estimates of the number and proportion of adults (aged 16-64 years old) in England living in households with defined skill levels in:

    • literacy
    • numeracy
    • information and communication technology (ICT); including emailing, word processing, spreadsheet use and a multiple-choice assessment of ICT awareness

    The number and proportion of adults in households who do not speak English as a first language are also included.

    Two sets of small area estimates are provided for 7 geographies; middle layer super output areas (MSOAs), standard table wards, 2005 statistical wards, 2011 council wards, 2011 parliamentary constituencies, local authorities, and local enterprise partnership areas.

    Regional estimates have also been provided, however, unlike the other geographies, these estimates are based on direct survey estimates and not modelled estimates.

    The files are available as both Excel and csv files – the user guide explains the estimates and modelling approach in more detail.

    How to use the small area estimation files, an example

    To find the estimate for the proportion of adults with entry level 1 or below literacy in the Manchester Central parliamentary constituency, you need to:

    1. select the link to the ā€˜parliamentary-constituencies-2009-all’ Excel file in the table above
    2. select the ā€˜literacy proportions’ page of the Excel spreadsheet
    3. use the ā€˜find’ function to locate ā€˜Manchester Central’
    4. note the proportion listed for Entry Level and below

    It is estimated that 8.1% of adults aged 16-64 in Manchester Central have entry level or below literacy. The Credible Intervals for this estimate are 7.0 and 9.3% at the 95 per cent level. This means that while the estimate is 8.1%, there is a 95% likelihood that the actual value lies between 7.0 and 9.3%.

    https://assets.publishing.service.gov.uk/media/5a79d91240f0b670a8025dd8/middle-layer-super-output-areas-2001-all_1_.xlsx">Middle layer super output areas: 2001 all skill level estimates

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">14.5 MB</span></p>
    
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
    

    Request an accessible format.

      If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:enquiries@beis.gov.uk" target="_blank" class="govuk-link">enquiries@beis.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.
    

    <div class="gem-c-attachmen

  13. d

    Data from: The importance of a medical chaperone: a quality improvement...

    • datadryad.org
    • data.niaid.nih.gov
    • +2more
    zip
    Updated Jun 4, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Katy Rose; Sarah Eshleby; Paul Thiruchelvam; Anna Khoo; Katy Hogben (2015). The importance of a medical chaperone: a quality improvement study exploring the use of a note stamp in a tertiary breast surgery unit [Dataset]. http://doi.org/10.5061/dryad.8bc05
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 4, 2015
    Dataset provided by
    Dryad
    Authors
    Katy Rose; Sarah Eshleby; Paul Thiruchelvam; Anna Khoo; Katy Hogben
    Time period covered
    Jun 3, 2015
    Area covered
    London, UK
    Description

    Raw data for three parts of a QIP studyData was collected in paper format by researchers at the end of an outpatient clinic and inputted each day into a cumulative excel spreadsheet. The raw data is included in these files. Statistical calculations for this data were calculated using an open access software. Results of a two tailed z-calculation have also been included. Confidence intervals where relevant were included in the paper and calculated by hand.Dryad Data The Importance of a Medical Chaperone - a quality improvement study exploring the use of a note stamp in a tertiary breast surgery unit.xlsx

  14. Hive Annotation Job Results - Cleaned and Audited

    • kaggle.com
    zip
    Updated Apr 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brendan Kelley (2021). Hive Annotation Job Results - Cleaned and Audited [Dataset]. https://www.kaggle.com/brendankelley/hive-annotation-job-results-cleaned-and-audited
    Explore at:
    zip(471571 bytes)Available download formats
    Dataset updated
    Apr 28, 2021
    Authors
    Brendan Kelley
    Description

    Context

    This notebook serves to showcase my problem solving ability, knowledge of the data analysis process, proficiency with Excel and its various tools and functions, as well as my strategic mindset and statistical prowess. This project consist of an auditing prompt provided by Hive Data, a raw Excel data set, a cleaned and audited version of the raw Excel data set, and my description of my thought process and knowledge used during completion of the project. The prompt can be found below:

    Hive Data Audit Prompt

    The raw data that accompanies the prompt can be found below:

    Hive Annotation Job Results - Raw Data

    ^ These are the tools I was given to complete my task. The rest of the work is entirely my own.

    To summarize broadly, my task was to audit the dataset and summarize my process and results. Specifically, I was to create a method for identifying which "jobs" - explained in the prompt above - needed to be rerun based on a set of "background facts," or criteria. The description of my extensive thought process and results can be found below in the Content section.

    Content

    Brendan Kelley April 23, 2021

    Hive Data Audit Prompt Results

    This paper explains the auditing process of the ā€œHive Annotation Job Resultsā€ data. It includes the preparation, analysis, visualization, and summary of the data. It is accompanied by the results of the audit in the excel file ā€œHive Annotation Job Results – Auditedā€.

    Observation

    The ā€œHive Annotation Job Resultsā€ data comes in the form of a single excel sheet. It contains 7 columns and 5,001 rows, including column headers. The data includes ā€œfileā€, ā€œobject idā€, and the pseudonym for five questions that each client was instructed to answer about their respective table: ā€œtabularā€, ā€œsemanticā€, ā€œdefinition listā€, ā€œheader rowā€, and ā€œheader columnā€. The ā€œfileā€ column includes non-unique (that is, there are multiple instances of the same value in the column) numbers separated by a dash. The ā€œobject idā€ column includes non-unique numbers ranging from 5 to 487539. The columns containing the answers to the five questions include Boolean values - TRUE or FALSE – which depend upon the yes/no worker judgement.

    Use of the COUNTIF() function reveals that there are no values other than TRUE or FALSE in any of the five question columns. The VLOOKUP() function reveals that the data does not include any missing values in any of the cells.

    Assumptions

    Based on the clean state of the data and the guidelines of the Hive Data Audit Prompt, the assumption is that duplicate values in the ā€œfileā€ column are acceptable and should not be removed. Similarly, duplicated values in the ā€œobject idā€ column are acceptable and should not be removed. The data is therefore clean and is ready for analysis/auditing.

    Preparation

    The purpose of the audit is to analyze the accuracy of the yes/no worker judgement of each question according to the guidelines of the background facts. The background facts are as follows:

    • A table that is a definition list should automatically be tabular and also semantic • Semantic tables should automatically be tabular • If a table is NOT tabular, then it is definitely not semantic nor a definition list • A tabular table that has a header row OR header column should definitely be semantic

    These background facts serve as instructions for how the answers to the five questions should interact with one another. These facts can be re-written to establish criteria for each question:

    For tabular column: - If the table is a definition list, it is also tabular - If the table is semantic, it is also tabular

    For semantic column: - If the table is a definition list, it is also semantic - If the table is not tabular, it is not semantic - If the table is tabular and has either a header row or a header column...

  15. Introducing the new RPIJ measure of Consumer Price Inflation

    • data.wu.ac.at
    html
    Updated Apr 26, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2014). Introducing the new RPIJ measure of Consumer Price Inflation [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/YmYyM2EzZTQtMWRmYS00NmM5LTkzZjYtNGE3ZDg0MTFmYzZi
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Apr 26, 2014
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This article describes the new RPIJ measure of Consumer Price Inflation. RPIJ is a Retail Prices Index (RPI) based measure that will use a geometric (Jevons) formula in place of one type of arithmetic formula (Carli). It is being launched in response to the National Statistician's conclusion that the RPI does not meet international standards due to the use of the Carli formula in its calculation. The accompanying Excel file includes a back series for RPIJ from 1997 to 2012.

    Source agency: Office for National Statistics

    Designation: National Statistics

    Language: English

    Alternative title: New RPIJ measure of Consumer Price Inflation

  16. Supplement 1. Excel-sheet calculator and calculator instructions.

    • wiley.figshare.com
    html
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anne Chao; Robert K. Colwell; Chih-Wei Lin; Nicholas J. Gotelli (2023). Supplement 1. Excel-sheet calculator and calculator instructions. [Dataset]. http://doi.org/10.6084/m9.figshare.3530930.v1
    Explore at:
    htmlAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Wileyhttps://www.wiley.com/
    Authors
    Anne Chao; Robert K. Colwell; Chih-Wei Lin; Nicholas J. Gotelli
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    File List Supp1ExcelGuide.pdf Supp2ExcelCalculator.xls ExcelCalculatorAbundanceData.pdf ExcelCalculatorIncidenceData.pdf Description Supp1ExcelGuide.pdf contains a complete description of the variables and how to use the Excel Spreadsheet calculator. Supp2ExcelCalculator.xls is an Excel spreadsheet with formulas to calculate the statistics described in the paper.

  17. Microsoft Excel sheet with QC data from [69] used in Figs 5 and C in S1...

    • plos.figshare.com
    xlsx
    Updated May 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zahra Vahdat; Oliver Gambrell; Jonas Fisch; Eckhard Friauf; Abhyudai Singh (2025). Microsoft Excel sheet with QC data from [69] used in Figs 5 and C in S1 File. [Dataset]. http://doi.org/10.1371/journal.pcbi.1013067.s002
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 23, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zahra Vahdat; Oliver Gambrell; Jonas Fisch; Eckhard Friauf; Abhyudai Singh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Microsoft Excel sheet with QC data from [69] used in Figs 5 and C in S1 File.

  18. An Excel sheet representing the coded data of the study population.

    • plos.figshare.com
    xlsx
    Updated Oct 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wissam Ghach; May M. Bakkar; Mona Aridi; Mohammad A. Alebrahim (2025). An Excel sheet representing the coded data of the study population. [Dataset]. http://doi.org/10.1371/journal.pone.0335254.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 27, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Wissam Ghach; May M. Bakkar; Mona Aridi; Mohammad A. Alebrahim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    An Excel sheet representing the coded data of the study population.

  19. Excel spreadsheet containing the numerical data and details of statistical...

    • figshare.com
    bin
    Updated Aug 29, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Francesca Mattedi; Ethlyn Lloyd-Morris; Frank Hirth; Alessio Vagnoni (2023). Excel spreadsheet containing the numerical data and details of statistical analysis for Figs 1D, 1E, 1F, 1G, 2C, 2D, 2F, 2G, 2H, 3B–3D, 3F, 3G, 4B, 4C, 4D, 4E, 4G, 4H, 5C, 5D, 5E, 5F, 6C, 6D–6F, 7A, 7C, 7D, 7E, 7F, 7G, 7H, 7I, 7J, 7K, S1C, S1D, S1F, S1G, S2B, S2C, S2G, S2H, S2I, S2J, S2K, S3A, S3C, S3D, S3F, S3G, S3I, S4B, S5C, S5D, S5E, S5F, S5G and S5H. [Dataset]. http://doi.org/10.1371/journal.pbio.3002273.s002
    Explore at:
    binAvailable download formats
    Dataset updated
    Aug 29, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Francesca Mattedi; Ethlyn Lloyd-Morris; Frank Hirth; Alessio Vagnoni
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Excel spreadsheet containing the numerical data and details of statistical analysis for Figs 1D, 1E, 1F, 1G, 2C, 2D, 2F, 2G, 2H, 3B–3D, 3F, 3G, 4B, 4C, 4D, 4E, 4G, 4H, 5C, 5D, 5E, 5F, 6C, 6D–6F, 7A, 7C, 7D, 7E, 7F, 7G, 7H, 7I, 7J, 7K, S1C, S1D, S1F, S1G, S2B, S2C, S2G, S2H, S2I, S2J, S2K, S3A, S3C, S3D, S3F, S3G, S3I, S4B, S5C, S5D, S5E, S5F, S5G and S5H.

  20. Microsoft Excel workbook provided source data matrices and associated...

    • plos.figshare.com
    xlsx
    Updated Apr 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mengmeng Yu; Li Zhang; Ying Wang; Suyan Wang; Yongzhen Liu; Peng Liu; Yuntong Chen; Ru Guo; Lingzhai Meng; Tao Zhang; Wenrui Fan; Xiaole Qi; Yulu Duan; Yanping Zhang; Hongyu Cui; Yulong Gao (2025). Microsoft Excel workbook provided source data matrices and associated statistical computations used to generate the graphical representations in Figures. [Dataset]. http://doi.org/10.1371/journal.ppat.1013064.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 30, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Mengmeng Yu; Li Zhang; Ying Wang; Suyan Wang; Yongzhen Liu; Peng Liu; Yuntong Chen; Ru Guo; Lingzhai Meng; Tao Zhang; Wenrui Fan; Xiaole Qi; Yulu Duan; Yanping Zhang; Hongyu Cui; Yulong Gao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Microsoft Excel workbook provided source data matrices and associated statistical computations used to generate the graphical representations in Figures.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sanjana Murthy (2024). Statistical Function in Excel [Dataset]. https://www.kaggle.com/datasets/sanjanamurthy392/statistical-function
Organization logo

Statistical Function in Excel

Sum, average, max, min, sumif, sumifs, count, countblank, countif, countifs, etc

Explore at:
35 scholarly articles cite this dataset (View in Google Scholar)
zip(1412940 bytes)Available download formats
Dataset updated
Feb 7, 2024
Authors
Sanjana Murthy
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

This data contains functions like: Sum, Average, Max, Min, Sumif, Sumifs, Count, Countblank, Countifs, Counta, Averageif, Averageifs.

Search
Clear search
Close search
Google apps
Main menu