8 datasets found
  1. f

    UC_vs_US Statistic Analysis.xlsx

    • figshare.com
    xlsx
    Updated Jul 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 9, 2020
    Dataset provided by
    Utrecht University
    Authors
    F. (Fabiano) Dalpiaz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

    Tagging scheme:
    Aligned (AL) - A concept is represented as a class in both models, either
    

    with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

    All the calculations and information provided in the following sheets
    

    originate from that raw data.

    Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,
    

    including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

    Sheet 3 (Size-Ratio):
    

    The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

    Sheet 4 (Overall):
    

    Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

    For sheet 4 as well as for the following four sheets, diverging stacked bar
    

    charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

    Sheet 5 (By-Notation):
    

    Model correctness and model completeness is compared by notation - UC, US.

    Sheet 6 (By-Case):
    

    Model correctness and model completeness is compared by case - SIM, HOS, IFA.

    Sheet 7 (By-Process):
    

    Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

    Sheet 8 (By-Grade):
    

    Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.

  2. e

    Comparison of estimates of 0-18 year olds

    • data.europa.eu
    • data.wu.ac.at
    pdf, unknown
    Updated Sep 28, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Greater London Authority (2021). Comparison of estimates of 0-18 year olds [Dataset]. https://data.europa.eu/data/datasets/comparison-estimates-0-18-year-olds
    Explore at:
    pdf, unknownAvailable download formats
    Dataset updated
    Sep 28, 2021
    Dataset authored and provided by
    Greater London Authority
    Description

    This report from the GLA Intelligence Unit compares 2011 census estimates of the population aged 0-18 to the following alternative data sources:

    • ONS 2010 based sub-national population projections (SNPP);

    • GLA 2011 round population projections;

    • General Practitioner registrations; and

    • Child benefit claims.

    The report is available to download here.

    An Excel file containing the data behind charts and tables in the report is available to download here

  3. ROAD ACCIDENT DASHBOARD

    • kaggle.com
    zip
    Updated Jan 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ANJALI KB (2024). ROAD ACCIDENT DASHBOARD [Dataset]. https://www.kaggle.com/datasets/anjalikb/road-accident-dashboard/code
    Explore at:
    zip(72609359 bytes)Available download formats
    Dataset updated
    Jan 23, 2024
    Authors
    ANJALI KB
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    📊 Road Accident Data Analysis: Interactive Excel Dashboard 🚗

    Excited to share my Kaggle project focusing on road accident data analysis. Leveraging Excel's power, I've developed an interactive dashboard offering comprehensive insights for safer roads.

    Key Aspects:

    Data Processing & Cleaning: Ensured data reliability through meticulous processing. KPIs: Primarily focused on Total Casualties, with detailed breakdowns for Fatal, Serious, Slight, and by Car type. Visualizations: Engaging charts - Doughnuts, Line, Bar, and Pie - offering a holistic view of accident trends. Interactivity: User-friendly features include Urban/Rural and Year filters for dynamic exploration. Unique Insights:

    Monthly Trends: Line chart for a nuanced comparison of current vs. previous year casualties. Road Type Breakdown: Bar chart to showcase casualties distributed across different road types. Geospatial Analysis: Doughnut charts detailing casualties by location and area. Call for Collaboration: Seeking Kaggle community input for refinement and optimization. Let's collectively contribute to making our roads safer through data-driven insights!

    DataAnalysis #RoadSafety #InteractiveDashboard #KaggleProject #Excel #DataVisualization #CollaborationOpportunity

    Looking forward to your feedback and contributions! 🚀🌐

  4. H

    Data from: From CAS to EAS – Calculating and Plotting the Compressibility...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Apr 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Danny Steeven Sarmiento Beltran (2024). From CAS to EAS – Calculating and Plotting the Compressibility Correction Chart [Dataset]. http://doi.org/10.7910/DVN/6QWEX1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 10, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Danny Steeven Sarmiento Beltran
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/6QWEX1https://dataverse.harvard.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.7910/DVN/6QWEX1

    Description

    Purpose – The conversion between calibrated airspeed (CAS) and equivalent airspeed (EAS) is relatively cumbersome, because it involves the calculation of incompressible flow, for which the equations are quite long. If calculations on the computer are required, conversions with equations are necessary. In contrast, this project calculates a CAS to EAS Compressibility Correction Chart, which allows to convert CAS to EAS very quickly by reading the correction from a graph. --- Methodology – In Excel, compressibility correction is achieved through flight mechanics formulas. The correction is calculated with two distinct functions, one based on Mach Number and the other on pressure altitude. These functions are graphed individually and then integrated to produce the Compressibility Correction Chart. --- Findings – The Compressibility Correction Chart was successfully recreated as a 2-D graph. Upon comparison with other correction charts, the EAS-CAS-results demonstrate a mere 0% deviation, proving the accuracy of the findings and validating their near-perfect alignment. --- Research Limitations – Due to a limitation in Excel, which allows for 255 series for plotting, the range of input parameters had to be adjusted accordingly. The iterations of altitude span 1000 ft intervals, while those for Mach Number span 0.05 intervals. --- Practical Implications – Pilots can easily use the Compressibility Correction Chart for quick and highly accurate calculations when needed. --- Originality – CAS-EAS Compressibility Correction Charts are available in other sources. This paper represents a recreation of the 2-D Correction Chart by the combination of plots: one as function of Mach Number and the other of pressure altitude, using the Excel Software.

  5. Does insecticide resistance expand the host range potential of the aphid...

    • figshare.com
    xlsx
    Updated Mar 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maddie Church (2024). Does insecticide resistance expand the host range potential of the aphid Myzus persicae? - Data [Dataset]. http://doi.org/10.6084/m9.figshare.25476112.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Mar 26, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Maddie Church
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    All Comparisons of Differentially Expressed Genes - excel sheet containing the annotations and fold change values of the all the differentially expressed genes between the different clone comparisonsFinal List of Common Genes - excel sheet containing the list of genes that were commonly differentially expressed between all the aphid clone comparisons. Also contains table and bar chart presenting the number of times each candidate gene selected from previous literature was found in each aphid clone comparison.Non-direct and Direct Competition - excel sheet containing number of nymphs produced by all 6 clones on the 3 host plants in the non-direct competition, and the number of nymphs produced by the two clones NS and Viola in the direct competition experiment.sterror - excel sheet containing the means and standard error values of the 6 grouped resistant and susceptible clones in the non-direct competition experiment, used to make the bar plot for the non-direct competition experiment.sterror2 - excel sheet containing the means and standard error values of the resistant clone Viola and susceptible clone NS in the direct competition experiment, used to make the bar plot for the direct competition experiment.cabbagettest - excel sheet containing the number of nymphs produce by the 6 grouped resistant and susceptible clones on the 3 host plants, used to conduct the unpaired t tests to compare the reproductive performance of resistant and susceptible clones on the 3 different host plants when in not in competitiondirectcompetition - excel sheet containing the number of nymphs produce by the resistant clone Viola and susceptible clone NS on the 3 host plants, used to conduct the unpaired t tests comparing the reproductive performance of resistant and susceptible clones on the 3 different host plants when in direct competitionAPHID HOST SHIFT DISS Rscript - R script containing all my statistical tests: unpaired t tests of resistant and susceptible clones on the 3 host plants when in direct and non direct competition, and kruskal Wallis tests and post hoc Dunns test to identify significant differences between individual and resistant and susceptible clones on the different host plants. Also contains all my code for my bar charts for the non-direct and direct competition experiments and the code for my box plots showing the significant differences between individual clones and resistant and susceptible clones on the different host plants.Up and Down-regulated Genes Graph - excel sheet containing the number of and and down regulated genes in each aphid clone comparison and the bar graph generated from this data.

  6. f

    Table1_EasySSR: a user-friendly web application with full command-line...

    • figshare.com
    • frontiersin.figshare.com
    xlsx
    Updated Aug 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sandy Ingrid Aguiar Alves; Victor Benedito Costa Ferreira; Carlos Willian Dias Dantas; Artur Luiz da Costa da Silva; Rommel Thiago Jucá Ramos (2023). Table1_EasySSR: a user-friendly web application with full command-line features for large-scale batch microsatellite mining and samples comparison.XLSX [Dataset]. http://doi.org/10.3389/fgene.2023.1228552.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 24, 2023
    Dataset provided by
    Frontiers
    Authors
    Sandy Ingrid Aguiar Alves; Victor Benedito Costa Ferreira; Carlos Willian Dias Dantas; Artur Luiz da Costa da Silva; Rommel Thiago Jucá Ramos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Microsatellites, also known as SSRs or STRs, are polymorphic DNA regions with tandem repetitions of a nucleotide motif of size 1–6 base pairs with a broad range of applications in many fields, such as comparative genomics, molecular biology, and forensics. However, the majority of researchers do not have computational training and struggle while running command-line tools or very limited web tools for their SSR research, spending a considerable amount of time learning how to execute the software and conducting the post-processing data tabulation in other tools or manually—time that could be used directly in data analysis. We present EasySSR, a user-friendly web tool with command-line full functionality, designed for practical use in batch identifying and comparing SSRs in sequences, draft, or complete genomes, not requiring previous bioinformatic skills to run. EasySSR requires only a FASTA and an optional GENBANK file of one or more genomes to identify and compare STRs. The tool can automatically analyze and compare SSRs in whole genomes, convert GenBank to PTT files, identify perfect and imperfect SSRs and coding and non-coding regions, compare their frequencies, abundancy, motifs, flanking sequences, and iterations, producing many outputs ready for download such as PTT files, interactive charts, and Excel tables, giving the user the data ready for further analysis in minutes. EasySSR was implemented as a web application, which can be executed from any browser and is available for free at https://computationalbiology.ufpa.br/easyssr/. Tutorials, usage notes, and download links to the source code can be found at https://github.com/engbiopct/EasySSR.

  7. q

    MS Excel Refresher - Lizards, iguanas, and snakes! Oh my! | Data Nuggets

    • qubeshub.org
    Updated Jan 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristen Kaczynski (2023). MS Excel Refresher - Lizards, iguanas, and snakes! Oh my! | Data Nuggets [Dataset]. http://doi.org/10.25334/NZWH-HQ21
    Explore at:
    Dataset updated
    Jan 13, 2023
    Dataset provided by
    QUBES
    Authors
    Kristen Kaczynski
    Description

    This resource, a MS Excel refresher, extends the level for this Data Nugget. Students are given an Excel workbook with the data and asked to graph and calculate diversity using Excel functions (rather than drawing graphs by hand as in the original data nugget). The data set used is the same. I use this activity in an upper division Environmental Science course for majors that focuses on Restoration Ecology. The simplicity of the data set and the comparisons of reptile diversity among urban, non-urban and urban rehabilitated lend for a great example for doing calculations in spreadsheets.

  8. CMS TRENDS REPORT

    • kaggle.com
    zip
    Updated Oct 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr Dev Kambhampati (2020). CMS TRENDS REPORT [Dataset]. https://www.kaggle.com/devkambhampati/cms-trends-report
    Explore at:
    zip(237157 bytes)Available download formats
    Dataset updated
    Oct 8, 2020
    Authors
    Dr Dev Kambhampati
    Description

    https://www.cms.gov/Medicare/Health-Plans/HealthPlansGenInfo/ReportsFilesData

    Trend Reports

    This page was added due to popular request at the Sep 2004 Enrollment conference. It contains several useful trend and competition indicators. Certain files will be updated monthly while others will be updated quarterly. This page Contains links to the following 4 Excel files: 13 Month Trend Report

    (Zipped Excel and cvs files 127K) Updated 11-02-2005

    Note: The typically 1% difference among CMS reports on monthly plan enrollment is due to different run times. Despite these differences, growth indicators tend to be very stable. Therefore, the data is reliable and can be used for projections. Effective September 2005 the 13-month report will contain snapshots from two components of the CMS data system.

    This file contains all Medicare Advantage Plans.
    This file contains 13 Months of enrollments for each plan.
    The file contains two trend indicators:
      Current Month Growth and
      Month Growth over year
    (Eff 9-05) This file contains 13-month snapshots from two components of the CMS data system
    The file also contains for each plan: Regional Office, Effective Date Organization Type, Geographic Region
    

    Trends By State By Quarter Report

    (Zipped Excel file 30K) Updated 10-17-2005

    The file contains trends for all 50 states and some protectorates. For each state and each quarter the file contains:

    total state eligibles
    total state Medicare Advantage Enrollment
    Penetration
    

    The file contains excel bar charts showing trends in Penetration for the year. The file is organized by the 10, Center For Medicare And Medicaid Services, Regional Offices. Each Regional Office oversees several states. Competition Macro by Plan / Counties

    The Competition Macro may be found in the Geographic Service Area file which is linked to the HealthPlans/ReportFilesData page. The competition macro enables one to review:

    Enrollment, eligibles and penetration
      for all plans
      and/or for all relevant counties
      In a given area of counties
    

    By RO Type Report

    (Zipped Excel and csv files 87K) Updated 11-02-2005

    Note: The 1% difference among CMS reports in July 2005 total plan enrollment is due to different run times. These run time differences will be corrected in the future. The By RO Type report presents:

    Both the Number and Percent of
    Both Medicare Advantage Organizations and Enrollees
    By Regional Office and Organization Type
    
  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1

UC_vs_US Statistic Analysis.xlsx

Explore at:
xlsxAvailable download formats
Dataset updated
Jul 9, 2020
Dataset provided by
Utrecht University
Authors
F. (Fabiano) Dalpiaz
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

Tagging scheme:
Aligned (AL) - A concept is represented as a class in both models, either

with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

All the calculations and information provided in the following sheets

originate from that raw data.

Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,

including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

Sheet 3 (Size-Ratio):

The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

Sheet 4 (Overall):

Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

For sheet 4 as well as for the following four sheets, diverging stacked bar

charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

Sheet 5 (By-Notation):

Model correctness and model completeness is compared by notation - UC, US.

Sheet 6 (By-Case):

Model correctness and model completeness is compared by case - SIM, HOS, IFA.

Sheet 7 (By-Process):

Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

Sheet 8 (By-Grade):

Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.

Search
Clear search
Close search
Google apps
Main menu