31 datasets found
  1. Pivot table - Data analysis project

    • kaggle.com
    Updated Jul 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gamal Khattab (2022). Pivot table - Data analysis project [Dataset]. https://www.kaggle.com/datasets/gamalkhattab/pivot-table
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 18, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gamal Khattab
    Description

    Summarize big data with pivot table and charts and slicers

  2. SPORTS_DATA_ANALYSIS_ON_EXCEL

    • kaggle.com
    zip
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nil kamal Saha (2024). SPORTS_DATA_ANALYSIS_ON_EXCEL [Dataset]. https://www.kaggle.com/datasets/nilkamalsaha/sports-data-analysis-on-excel
    Explore at:
    zip(1203633 bytes)Available download formats
    Dataset updated
    Dec 12, 2024
    Authors
    Nil kamal Saha
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    PROJECT OBJECTIVE

    We are a part of XYZ Co Pvt Ltd company who is in the business of organizing the sports events at international level. Countries nominate sportsmen from different departments and our team has been given the responsibility to systematize the membership roster and generate different reports as per business requirements.

    Questions (KPIs)

    TASK 1: STANDARDIZING THE DATASET

    • Populate the FULLNAME consisting of the following fields ONLY, in the prescribed format: PREFIX FIRSTNAME LASTNAME.{Note: All UPPERCASE)
    • Get the COUNTRY NAME to which these sportsmen belong to. Make use of LOCATION sheet to get the required data
    • Populate the LANGUAGE_!poken by the sportsmen. Make use of LOCTION sheet to get the required data
    • Generate the EMAIL ADDRESS for those members, who speak English, in the prescribed format :lastname.firstnamel@xyz .org {Note: All lowercase) and for all other members, format should be lastname.firstname@xyz.com (Note: All lowercase)
    • Populate the SPORT LOCATION of the sport played by each player. Make use of SPORT sheet to get the required data

    TASK 2: DATA FORMATING

    • Display MEMBER IDas always 3 digit number {Note: 001,002 ...,D2D,..etc)
    • Format the BIRTHDATE as dd mmm'yyyy (Prescribed format example: 09 May' 1986)
    • Display the units for the WEIGHT column (Prescribed format example: 80 kg)
    • Format the SALARY to show the data In thousands. If SALARY is less than 100,000 then display data with 2 decimal places else display data with one decimal place. In both cases units should be thousands (k) e.g. 87670 -> 87.67 k and 12 250 -> 123.2 k

    TASK 3: SUMMARIZE DATA - PIVOT TABLE (Use SPORTSMEN worksheet after attempting TASK 1) • Create a PIVOT table in the worksheet ANALYSIS, starting at cell B3,with the following details:

    • In COLUMNS; Group : GENDER.
    • In ROWS; Group : COUNTRY (Note: use COUNTRY NAMES).
    • In VALUES; calculate the count of candidates from each COUNTRY and GENDER type, Remove GRAND TOTALs.

    TASK 4: SUMMARIZE DATA - EXCEL FUNCTIONS (Use SPORTSMEN worksheet after attempting TASK 1)

    • Create a SUMMARY table in the worksheet ANALYSIS,starting at cell G4, with the following details:

    • Starting from range RANGE H4; get the distinct GENDER. Use remove duplicates option and transpose the data.
    • Starting from range RANGE GS; get the distinct COUNTRY (Note: use COUNTRY NAMES).
    • In the cross table,get the count of candidates from each COUNTRY and GENDER type.

    TASK 5: GENERATE REPORT - PIVOT TABLE (Use SPORTSMEN worksheet after attempting TASK 1)

    • Create a PIVOT table report in the worksheet REPORT, starting at cell A3, with the following information:

    • Change the report layout to TABULAR form.
    • Remove expand and collapse buttons.
    • Remove GRAND TOTALs.
    • Allow user to filter the data by SPORT LOCATION.

    Process

    • Verify data for any missing values and anomalies, and sort out the same.
    • Made sure data is consistent and clean with respect to data type, data format and values used.
    • Created pivot tables according to the questions asked.
  3. VLOOKUP & PIVOT TABLE

    • kaggle.com
    zip
    Updated May 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Derrick Mallison (2023). VLOOKUP & PIVOT TABLE [Dataset]. https://www.kaggle.com/datasets/derrickmallison/vlookup-and-pivot-table
    Explore at:
    zip(58855 bytes)Available download formats
    Dataset updated
    May 3, 2023
    Authors
    Derrick Mallison
    Description

    Dataset

    This dataset was created by Derrick Mallison

    Contents

  4. d

    Hospital Annual Utilization Report & Pivot Tables

    • catalog.data.gov
    • data.chhs.ca.gov
    • +4more
    Updated Nov 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Health Care Access and Information (2025). Hospital Annual Utilization Report & Pivot Tables [Dataset]. https://catalog.data.gov/dataset/hospital-annual-utilization-report-pivot-tables-81847
    Explore at:
    Dataset updated
    Nov 23, 2025
    Dataset provided by
    Department of Health Care Access and Information
    Description

    The complete data set of annual utilization data reported by hospitals contains basic licensing information including bed classifications; patient demographics including occupancy rates, the number of discharges and patient days by bed classification, and the number of live births; as well as information on the type of services provided including the number of surgical operating rooms, number of surgeries performed (both inpatient and outpatient), the number of cardiovascular procedures performed, and licensed emergency medical services provided.

  5. HUD FHA Single Family Portfolio Snapshot

    • datalumos.org
    • openicpsr.org
    Updated Feb 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States Department of Housing and Urban Development (2025). HUD FHA Single Family Portfolio Snapshot [Dataset]. http://doi.org/10.3886/E220223V2
    Explore at:
    Dataset updated
    Feb 20, 2025
    Dataset authored and provided by
    United States Department of Housing and Urban Developmenthttp://www.hud.gov/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Feb 2010 - Nov 2024
    Area covered
    United States of America
    Description

    The Single-Family Portfolio Snapshot consists of a monthly data table and a report generator (Excel pivot table) that can be used to quickly create new reports of interest to the user from the data records. The data records themselves are loan level records using all of the categorical variables highlighted on the report generator table. Users may download and save the Excel file that contains the data records and the pivot table.The report generator sheet consists of an Excel pivot table that gives individual users some ability to analyze monthly trends on dimensions of interest to them. There are six choice dimensions: property state, property county, loan purpose, loan type, property product type, and downpayment source.Each report generator selection variable has an associated drop-down menu that is accessed by clicking once on the associated arrows. Only single selections can be made from each menu. For example, users must choose one state or all states, one county or all counties. If a county is chosen that does not correspond with the selected state, the result will be null values.The data records include each report generator choice variable plus the property zip code, originating mortgagee (lender) number, sponsor-lender name, sponsor number, nonprofit gift provider tax identification number, interest rate, and FHA insurance endorsement year and month. The report generator only provides output for the dollar amount of loans. Users who desire to analyze other data that are available on the data table, for example, interest rates or sponsor number, must first download the Excel file. See the data definitions (PDF in top folder) for details on each data element.Files switch from .zip to excel in August 2017.

  6. Europe Bike Store Sales

    • kaggle.com
    zip
    Updated Mar 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    PrepInsta Technologies (2023). Europe Bike Store Sales [Dataset]. https://www.kaggle.com/datasets/prepinstaprime/europe-bike-store-sales/versions/1
    Explore at:
    zip(1209546 bytes)Available download formats
    Dataset updated
    Mar 21, 2023
    Authors
    PrepInsta Technologies
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    Europe
    Description

    In the Europe bikes dataset, Extract the insight into sales in each country and each state of their countries using Excel.

  7. d

    Community Survey: 2021 Random Sample Results

    • catalog.data.gov
    • data.bloomington.in.gov
    Updated May 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.bloomington.in.gov (2023). Community Survey: 2021 Random Sample Results [Dataset]. https://catalog.data.gov/dataset/community-survey-2021-random-sample-results-69942
    Explore at:
    Dataset updated
    May 20, 2023
    Dataset provided by
    data.bloomington.in.gov
    Description

    A random sample of households were invited to participate in this survey. In the dataset, you will find the respondent level data in each row with the questions in each column. The numbers represent a scale option from the survey, such as 1=Excellent, 2=Good, 3=Fair, 4=Poor. The question stem, response option, and scale information for each field can be found in the var "variable labels" and "value labels" sheets. VERY IMPORTANT NOTE: The scientific survey data were weighted, meaning that the demographic profile of respondents was compared to the demographic profile of adults in Bloomington from US Census data. Statistical adjustments were made to bring the respondent profile into balance with the population profile. This means that some records were given more "weight" and some records were given less weight. The weights that were applied are found in the field "wt". If you do not apply these weights, you will not obtain the same results as can be found in the report delivered to the Bloomington. The easiest way to replicate these results is likely to create pivot tables, and use the sum of the "wt" field rather than a count of responses.

  8. RAAAP-123 Main Datasets (for Workshop).xlsx

    • figshare.com
    xlsx
    Updated Jun 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Simon Kerridge; Melinda Fischer (2024). RAAAP-123 Main Datasets (for Workshop).xlsx [Dataset]. http://doi.org/10.6084/m9.figshare.26123668.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 28, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Simon Kerridge; Melinda Fischer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This spreadsheet contains a number of sheets.Three sheets contain the main datasets from each of the first three RAAAP surveys.In addition there is a combined sheet containing data from all three sheets (where the semantics are the same) with an additional field indicating which survey it is from. This sheet has fewer colunms as it only has the shared variables.There is also a sheet for each survey listing the variables, and one showing the mappings between surveys, and one showing the common variables.Finally there is an example pivot table to show how the data can be easily visualised.This spreasheet was developed for the RAAAP workshop delivered at the 2023 INORMS Conference in May 2023 in Durban, South Africa.This spreadsheet contains all of the common data from the first 3 RAAAP surveys.These data are presented on separate

  9. e

    Road traffic accidents

    • data.europa.eu
    • gimi9.com
    csv
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leeds City Council, Road traffic accidents [Dataset]. https://data.europa.eu/data/datasets/road-traffic-accidents?locale=da
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    Leeds City Council
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Information on accidents across Leeds. Data includes location, number of people and vehicles involved, road surface, weather conditions and severity of any casualties.

    Please note

    • The Eastings and Northings are generated at the roadside where the accident occurred. Sometimes due to poor internet connectivity this data is may not be as accurate as it could be. If you notice any errors please contact accident.studies@leeds.gov.uk.

    Due to the format of the report a number of figures in the columns are repeated, these are:

    • Reference Number
    • Easting
    • Northing
    • Number of Vehicles
    • Accident Date
    • Time (24hr)
    • 1st Road Class
    • Road Surface
    • Lighting Conditions
    • Weather Conditions

    Reference Number

    Grid Ref: Easting

    Grid Ref: Northing

    Number of vehicles

    Accident Date

    Time (24hr)

    21G0539

    427798

    426248

    5

    16/01/2015

    1205

    21G0539

    427798

    426248

    5

    16/01/2015

    1205

    21G1108

    431142

    430087

    1

    16/01/2015

    1732

    21H0565

    434602

    436699>

    1

    17/01/2015

    930

    21H0638

    434254

    434318

    2

    17/01/2015

    1315

    21H0638

    434254

    434318

    2

    17/01/2015

    1315

    Therefore the number of vehicles involved in accident 21G0539 were 5, and in accident 21H0638 were 2. Overall in the example above a total of 9 vehicles were involved in accidents

    A useful tool to analyse the data is Excel pivot tables, these help summarise large amounts of data in a easy to view table, for further information on pivot table visit here.

    Further Information

    • Please see the guidance document for further information on categories.
  10. w

    Road Traffic Accident Data

    • data.wu.ac.at
    csv, html, json
    Updated Aug 24, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Calderdale Council (2018). Road Traffic Accident Data [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/MDUzYTY1MjktNmM4Yy00MmFjLWFlMWUtNDU1YjI3MDhlNTM1
    Explore at:
    json, html, csvAvailable download formats
    Dataset updated
    Aug 24, 2018
    Dataset provided by
    Calderdale Council
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    Information on accidents casualites across Calderdale. Data includes location, number of people and vehicles involved, road surface, weather conditions and severity of any casualties.

    Please note

    • The Eastings and Northings are generated at the roadside where the accident occurred. Sometimes due to poor internet connectivity this data is may not be as accurate as it could be. If you notice any errors please contact accident.studies@leeds.gov.uk.

    Due to the format of the report a number of figures in the columns are repeated, these are:

    • Reference Number
    • Easting
    • Northing
    • Number of Vehicles
    • Accident Date
    • Time (24hr)
    • 1st Road Class
    • Road Surface
    • Lighting Conditions
    • Weather Conditions

    Reference Number

    Grid Ref: Easting

    Grid Ref: Northing

    Number of vehicles

    Accident Date

    Time (24hr)

    21G0539

    427798

    426248

    5

    16/01/2015

    1205

    21G0539

    427798

    426248

    5

    16/01/2015

    1205

    21G1108

    431142

    430087

    1

    16/01/2015

    1732

    21H0565

    434602

    436699>

    1

    17/01/2015

    930

    21H0638

    434254

    434318

    2

    17/01/2015

    1315

    21H0638

    434254

    434318

    2

    17/01/2015

    1315

    Therefore the number of vehicles involved in accident 21G0539 were 5, and in accident 21H0638 were 2. Overall in the example above a total of 9 vehicles were involved in accidents

    A useful tool to analyse the data is Excel pivot tables, these help summarise large amounts of data in a easy to view table, for further information on pivot tables visit here.

  11. Analysis of performance vs annual salary by region

    • kaggle.com
    zip
    Updated Aug 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    JSeebs (2024). Analysis of performance vs annual salary by region [Dataset]. https://www.kaggle.com/datasets/jseebs/analysis-of-performance-vs-annual-salary-by-region/data
    Explore at:
    zip(113971 bytes)Available download formats
    Dataset updated
    Aug 11, 2024
    Authors
    JSeebs
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Original dataset by user Abdallah Wagih Ibrahim https://www.kaggle.com/datasets/abdallahwagih/company-employees/data

    I created a pivot table visualizing the relationship between annual salary and job rate(performance) by region.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F21036995%2F0ae505c2b2c7262a7fbda9acd9e90d2d%2FEmployeesPivotTable.png?generation=1723379896090719&alt=media" alt="">

  12. Checklist derived from plant species of Benin downloaded from GBIF site

    • gbif.org
    Updated May 9, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jean GANGLO; Jean GANGLO (2019). Checklist derived from plant species of Benin downloaded from GBIF site [Dataset]. http://doi.org/10.15468/mid3vk
    Explore at:
    Dataset updated
    May 9, 2019
    Dataset provided by
    Global Biodiversity Information Facilityhttps://www.gbif.org/
    Laboratory of Forest Sciences (University of Abomey-Calavi)
    Authors
    Jean GANGLO; Jean GANGLO
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1780 - Jan 11, 2019
    Area covered
    Description

    Plant species collected throughout Benin were published on GBIF site. Data concerning those species were downloaded from GBIF site. Using Excel dynamic pivot table we derived and achieved the checklist of plant species of Benin from the dataset downloaded.

  13. d

    Community Survey: 2019 Survey Data

    • catalog.data.gov
    • data.bloomington.in.gov
    • +2more
    Updated May 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.bloomington.in.gov (2023). Community Survey: 2019 Survey Data [Dataset]. https://catalog.data.gov/dataset/community-survey-2019-survey-data-ac78c
    Explore at:
    Dataset updated
    May 20, 2023
    Dataset provided by
    data.bloomington.in.gov
    Description

    The City of Bloomington contracted with National Research Center, Inc. to conduct the 2019 Bloomington Community Survey. This was the second time a scientific citywide survey had been completed covering resident opinions on service delivery satisfaction by the City of Bloomington and quality of life issues. The first was in 2017. The survey captured the responses of 610 households from a representative sample of 3,000 residents of Bloomington who were randomly selected to complete the survey. VERY IMPORTANT NOTE: The scientific survey data were weighted, meaning that the demographic profile of respondents was compared to the demographic profile of adults in Bloomington from US Census data. Statistical adjustments were made to bring the respondent profile into balance with the population profile. This means that some records were given more "weight" and some records were given less weight. The weights that were applied are found in the field "wt". If you do not apply these weights, you will not obtain the same results as can be found in the report delivered to the City of Bloomington. The easiest way to replicate these results is likely to create pivot tables, and use the sum of the "wt" field rather than a count of responses.

  14. e

    Collections database

    • data.europa.eu
    • ckan.publishing.service.gov.uk
    • +1more
    csv, html
    Updated Nov 8, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tate (2013). Collections database [Dataset]. https://data.europa.eu/data/datasets/collections-database?locale=en
    Explore at:
    html, csvAvailable download formats
    Dataset updated
    Nov 8, 2013
    Dataset authored and provided by
    Tate
    Description

    The Tate Collection Here we present the metadata for around 70,000 artworks that Tate owns or jointly owns with the National Galleries of Scotland as part of ARTIST ROOMS. Metadata for around 3,500 associated artists is also included.

    The metadata here is released under the Creative Commons Public Domain CC0 licence. Please see the enclosed LICENCE file for more detail.

    Images are not included and are not part of the dataset. Use of Tate images is covered on the Copyright and permissions page. You may also license images for commercial use.

    Please review the full usage guidelines.

    Repository Contents

    We offer two data formats:

    A richer dataset is provided in the JSON format, which is organised by the directory structure of the Git repository. JSON supports more hierarchical or nested information such as subjects.

    We also provide CSVs of flattened data, which is less comprehensive but perhaps easier to grok. The CSVs provide a good introduction to overall contents of the Tate metadata and create opportunities for artistic pivot tables.

    JSON

    Artists

    Each artist has his or her own JSON file. They are found in the artists folder, then filed away by first letter of the artist’s surname.

    Artworks

    Artworks are found in the artworks folder. They are filed away by accession number. This is the unique identifier given to artworks when they come into the Tate collection. In many cases, the format has significance. For example, the ar accession number prefix indicates that the artwork is part of ARTIST ROOMS collection. The n prefix indicates works that once were part of the National Gallery collection.

    CSV

    There is one CSV file for artists (artist_data.csv) and one (very large) for artworks (artwork_data.csv), which we may one day break up into more manageable chunks. The CSV headings should be helpful. Let us know if not. Entrepreneurial hackers could use the CSVs as an index to the JSON collections if they wanted richer data.

    Usage guidelines for open data

    These usage guidelines are based on goodwill. They are not a legal contract but Tate requests that you follow these guidelines if you use Metadata from our Collection dataset.

    The Metadata published by Tate is available free of restrictions under the Creative Commons Zero Public Domain Dedication.

    This means that you can use it for any purpose without having to give attribution. However, Tate requests that you actively acknowledge and give attribution to Tate wherever possible. Attribution supports future efforts to release other data. It also reduces the amount of ‘orphaned data’, helping retain links to authoritative sources.

    Give attribution to Tate

    Make sure that others are aware of the rights status of Tate and are aware of these guidelines by keeping intact links to the Creative Commons Zero Public Domain Dedication.

    If for technical or other reasons you cannot include all the links to all sources of the Metadata and rights information directly with the Metadata, you should consider including them separately, for example in a separate document that is distributed with the Metadata or dataset.

    If for technical or other reasons you cannot include all the links to all sources of the Metadata and rights information, you may consider linking only to the Metadata source on Tate’s website, where all available sources and rights information can be found, including in machine readable formats.

    Metadata is dynamic

    When working with Metadata obtained from Tate, please be aware that this Metadata is not static. It sometimes changes daily. Tate continuously updates its Metadata in order to correct mistakes and include new and additional information. Museum collections are under constant study and research, and new information is frequently added to objects in the collection.

    Mention your modifications of the Metadata and contribute your modified Metadata back

    Whenever you transform, translate or otherwise modify the Metadata, make it clear that the resulting Metadata has been modified by you. If you enrich or otherwise modify Metadata, consider publishing the derived Metadata without reuse restrictions, preferably via the Creative Commons Zero Public Domain Dedication.

    Be responsible

    Ensure that you do not use the Metadata in a way that suggests any official status or that Tate endorses you or your use of the Metadata, unless you have prior permission to do so.

    Ensure that you do not mislead others or misrepresent the Metadata or its sources

    Ensure that your use of the Metadata does not breach any national legislation based thereon, notably concerning (but not limited to) data protection, defamation or copyright. Please note that you use the Metadata at your own risk. Tate offers the Metadata as-is and makes no representations or warranties of any kind concerning any Metadata published by Tate.

    The writers of these guidelines are deeply indebted to the Smithsonian Cooper-Hewitt

  15. SIMPaCT Soil Moisture Sensor Dataset

    • zenodo.org
    zip
    Updated Aug 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mesut Koçyiğit; Bahman Javadi; Russell Thomson; Sebastian Pfautsch; Oliver Obst; Mesut Koçyiğit; Bahman Javadi; Russell Thomson; Sebastian Pfautsch; Oliver Obst (2025). SIMPaCT Soil Moisture Sensor Dataset [Dataset]. http://doi.org/10.5281/zenodo.16811020
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 13, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Mesut Koçyiğit; Bahman Javadi; Russell Thomson; Sebastian Pfautsch; Oliver Obst; Mesut Koçyiğit; Bahman Javadi; Russell Thomson; Sebastian Pfautsch; Oliver Obst
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This is the first public release of the SIMPaCT Soil Moisture Sensor Dataset.

    Contents

    • Example volumetric water content (VWC) data from a test deployment of 13 soil moisture sensors in Bicentennial Park, Sydney, Australia.
    • Preprocessing script to convert raw readings into a pivot table with indexed timestamps.
    • Analysis scripts to:
      • Compute mutual information between sensors
      • Select the best neighbour sensor
      • Train a small MLP-based virtual sensor
    • Generated analysis files and example plots
      (may vary slightly if regenerated due to nondeterminism).

    Licenses

    • Code: MIT License
    • Data: CC BY 4.0 License

    For dataset description and usage instructions, see:

  16. h

    SheetBench-50

    • huggingface.co
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hud, SheetBench-50 [Dataset]. https://huggingface.co/datasets/hud-evals/SheetBench-50
    Explore at:
    Dataset authored and provided by
    hud
    Description

    Task Categories

      1. Data Preparation and Hygiene (29 tasks)
    

    De-duplication, type normalization, time parsing, joins/FX conversions, pivot tables

      2. Derivations & Extraction (16 tasks)
    

    Correlations, z-scores, grouping logic, compliance filters (e.g., 1099)

      3. Modeling & Forecasts (5 tasks)
    

    Revenue/breakeven projections, amortization schedules, depreciation calculations, scenario tables

      Example Task
    

    For the ticker that has the greatest… See the full description on the dataset page: https://huggingface.co/datasets/hud-evals/SheetBench-50.

  17. Z

    Dataset: Preliminary analysis of open data pertaining to the services...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Nov 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Petravić, Luka; Ivetić, Vojislav (2023). Dataset: Preliminary analysis of open data pertaining to the services available through the Health Insurance Institute of Slovenia and provided by family medicine [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_8305762
    Explore at:
    Dataset updated
    Nov 23, 2023
    Dataset provided by
    Medicinska fakulteta, Univerza v Mariboru, Taborska ulica 8, 2000 Maribor, Slovenija; Sava Med d.o.o., Cesta k Dravi 8, 2241 Spodnji Duplek, Slovenija
    Medicinska fakulteta, Univerza v Mariboru, Taborska ulica 8, 2000 Maribor, Slovenija
    Authors
    Petravić, Luka; Ivetić, Vojislav
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Slovenia
    Description

    BACKGROUND: The Health Insurance Institute of Slovenia (ZZZS) began publishing service-related data in May 2023, following a directive from the Ministry of Health (MoH). The ZZZS website provides easily accessible information about the services provided by individual doctors, including their names. The user is provided relevant information about the doctor's employer, including whether it is a public or private institution. The data provided is useful for studying the public system's operations and identifying any errors or anomalies.

    METHODS: The data for services provided in May 2023 was downloaded and analysed. The published data were cross-referenced using the provider's RIZDDZ number with the daily updated data on ambulatory workload from June 9, 2023, published by ZZZS. The data mentioned earlier were found to be inaccurate and were improved using alerts from the zdravniki.sledilnik.org portal. Therefore, they currently provide an accurate representation of the current situation. The total number of services provided by each provider in a given month was determined by adding up the individual services and then assigning them to the corresponding provider.

    RESULTS: A pivot table was created to identify 307 unique operators, with 15 operators not appearing in both lists. There are 66 public providers, which make up about 72% of the contractual programme in the public system. There are 241 private providers, accounting for about 28% of the contractual programme. In May 2023, public providers accounted for 69% (n=646,236) of services in the family medicine system, while private providers contributed 31% (n=291,660). The total number of services provided by public and private providers was 937,896. Three linear correlations were analysed. The initial analysis of the entire sample yielded a high R-squared value of .998 (adjusted R-squared value of .996) and a significant level below 0.001. The second analysis of the data from private providers showed a high R Squared value of .904 (Adjusted R Squared = .886), indicating a strong correlation between the variables. Furthermore, the significance level was < 0.001, providing additional support for the statistical significance of the results. The third analysis used data from public providers and showed a strong level of explanatory power, with a R Squared value of 1.000 (Adjusted R Squared = 1.000). Furthermore, the statistical significance of the findings was established with a p-value < 0.001.

    CONCLUSION: Our analysis shows a strong linear correlation between contract size of the program signed and number services rendered by family medicine providers. A stronger linear correlation is observed among providers in the public system compared to those in the private system. Our study found that private providers generally offer more services than public providers. However, it is important to acknowledge that the evaluation framework for assessing services may have inherent flaws when examining the data. Prescribing a prescription and resuscitating a patient are both assigned a rating of one service. It is crucial to closely monitor trends and identify comparable databases for pairing at the secondary and tertiary levels.

  18. Student Performance_Example SOW

    • kaggle.com
    zip
    Updated Jul 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    huyen_nguyen_63 (2024). Student Performance_Example SOW [Dataset]. https://www.kaggle.com/huyennguyen63/student-performance-example-sow
    Explore at:
    zip(457 bytes)Available download formats
    Dataset updated
    Jul 3, 2024
    Authors
    huyen_nguyen_63
    License

    http://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Source: Kaggle users Data analysis: Student's placement scores with their years to join the club. The file includes some worksheets such as raw data, summary data, pivot table and a chart, SMART questions and SOW. In the chart: You can see the connection between student's placement scores and their time to join the club. The table involves 5 columns, and then it was added more two columns (Year and Time_Join_Club) so that you can calculate the number of years in which the students have joined the club.

    Math_Score| Reading_Score| Writing_Score |Placement_Score |Club_Join_Date|Year | Time _Join_Club|

  19. USDA Census of Agriculture 2022 - Chicken Production

    • usdadatalibrary-lnr.hub.arcgis.com
    Updated Apr 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esri (2024). USDA Census of Agriculture 2022 - Chicken Production [Dataset]. https://usdadatalibrary-lnr.hub.arcgis.com/datasets/esri::usda-census-of-agriculture-2022-chicken-production
    Explore at:
    Dataset updated
    Apr 18, 2024
    Dataset authored and provided by
    Esrihttp://esri.com/
    Area covered
    Description

    The Census of Agriculture, produced by the United States Department of Agriculture (USDA), provides a complete count of America's farms, ranches and the people who grow our food. The census is conducted every five years, most recently in 2022, and provides an in-depth look at the agricultural industry. This layer was produced from data obtained from the USDA National Agriculture Statistics Service (NASS) Large Datasets download page. The data were transformed and prepared for publishing using the Pivot Table geoprocessing tool in ArcGIS Pro and joined to county boundaries. The county boundaries are 2022 vintage and come from Living Atlas ACS 2022 feature layers.Dataset SummaryPhenomenon Mapped: Chicken productionGeographic Extent: 48 contiguous United States, Alaska, Hawaii, and Puerto RicoProjection: Web Mercator Auxiliary SphereSource: USDA National Agricultural Statistics ServiceUpdate Frequency: 5 yearsData Vintage: 2022Publication Date: April 2024AttributesNote that some values are suppressed as "Withheld to avoid disclosing data for individual operations", "Not applicable", or "Less than half the rounding unit". These have been coded in the data as -999, -888, and -777 respectively. You should account for these values when symbolizing or doing any calculations.Some chicken production commodity fields are broken out into ranges based on the number of head of chickens. For space reasons, a general sample of the fields is listed here.Commodities included in this layer: Chickens, Broilers - InventoryChickens, Broilers - Operations with InventoryChickens, Broilers - Operations with Sales - Sales: (Based on number of head)Chickens, Broilers - Operations with SalesChickens, Broilers - Sales, Measured in HeadChickens, Broilers, Production Contract - Operations with ProductionChickens, Broilers, Production Contract - Production, Measured in HeadChickens, Layers - InventoryChickens, Layers - Operations with Inventory - Inventory: (Based on number of head)Chickens, Layers - Operations with InventoryChickens, Layers - Operations with SalesChickens, Layers - Sales, Measured in HeadChickens, Layers, Production Contract - Operations with ProductionChickens, Layers, Production Contract - Production, Measured in HeadChickens, Pullets, Replacement - InventoryChickens, Pullets, Replacement - Operations with InventoryChickens, Pullets, Replacement - Operations with SalesChickens, Pullets, Replacement - Sales, Measured in HeadChickens, Pullets, Replacement, Production Contract - Operations with ProductionChickens, Pullets, Replacement, Production Contract - Production, Measured in HeadChickens, Roosters - InventoryChickens, Roosters - Operations with InventoryChickens, Roosters - Operations with SalesChickens, Roosters - Sales, Measured in HeadGeography NoteIn Alaska, one or more county-equivalent entities (borough, census area, city, municipality) are included in an agriculture census area.What can you do with this layer?This layer is designed for data visualization. Identify features by clicking on the map to reveal the pre-configured pop-up. You may change the field(s) being symbolized. When symbolizing other fields, you will need to update the popup accordingly. Simple summary statistics are supported by this data.Questions?Please leave a comment below if you have a question about this layer, and we will get back to you as soon as possible.

  20. Bulk data files for all years – releases, disposals, transfers and facility...

    • open.canada.ca
    csv, html
    Updated Jul 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Environment and Climate Change Canada (2025). Bulk data files for all years – releases, disposals, transfers and facility locations [Dataset]. https://open.canada.ca/data/en/dataset/40e01423-7728-429c-ac9d-2954385ccdfb
    Explore at:
    csv, htmlAvailable download formats
    Dataset updated
    Jul 15, 2025
    Dataset provided by
    Environment And Climate Change Canadahttps://www.canada.ca/en/environment-climate-change.html
    License

    Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
    License information was derived automatically

    Time period covered
    Jan 1, 1993 - Dec 31, 2023
    Description

    The National Pollutant Release Inventory (NPRI) is Canada's public inventory of pollutant releases (to air, water and land), disposals and transfers for recycling. Each file contains data from 1993 to the latest reporting year. These CSV format datasets are in normalized or ‘list’ format and are optimized for pivot table analyses. Here is a description of each file: - The RELEASES file contains all substance release quantities. - The DISPOSALS file contains all on-site and off-site disposal quantities, including tailings and waste rock (TWR). - The TRANSFERS file contains all quantities transferred for recycling or treatment prior to disposal. - The COMMENTS file contains all the comments provided by facilities about substances included in their report. - The GEO LOCATIONS file contains complete geographic information for all facilities that have reported to the NPRI. Please consult the following resources to enhance your analysis: - Guide on using and Interpreting NPRI Data: https://www.canada.ca/en/environment-climate-change/services/national-pollutant-release-inventory/using-interpreting-data.html - Access additional data from the NPRI, including datasets and mapping products: https://www.canada.ca/en/environment-climate-change/services/national-pollutant-release-inventory/tools-resources-data/exploredata.html Supplemental Information More NPRI datasets and mapping products are available here: https://www.canada.ca/en/environment-climate-change/services/national-pollutant-release-inventory/tools-resources-data/access.html Supporting Projects: National Pollutant Release Inventory (NPRI)

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Gamal Khattab (2022). Pivot table - Data analysis project [Dataset]. https://www.kaggle.com/datasets/gamalkhattab/pivot-table
Organization logo

Pivot table - Data analysis project

Summarize big data with pivot table and charts and slicers

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 18, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Gamal Khattab
Description

Summarize big data with pivot table and charts and slicers

Search
Clear search
Close search
Google apps
Main menu