100+ datasets found
  1. Market share of leading data analytics tools globally 2023

    • statista.com
    Updated Jun 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Market share of leading data analytics tools globally 2023 [Dataset]. https://www.statista.com/statistics/982516/most-popular-data-analytics-software/
    Explore at:
    Dataset updated
    Jun 26, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Dec 2022 - Mar 2023
    Area covered
    Worldwide
    Description

    In 2023, Morningstar Advisor Workstation was by far the most popular data analytics software worldwide. According to a survey carried out between December 2022 and March 2023, the market share of Morningstar Advisor Workstation was ***** percent. It was followed by Riskalyze Elite, with ***** percent, and YCharts, with ***** percent.

  2. Global Statistical Analysis Software Market Size By Deployment Model, By...

    • verifiedmarketresearch.com
    Updated Mar 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Global Statistical Analysis Software Market Size By Deployment Model, By Application, By Component, By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/statistical-analysis-software-market/
    Explore at:
    Dataset updated
    Mar 7, 2024
    Dataset provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    Authors
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2030
    Area covered
    Global
    Description

    Statistical Analysis Software Market size was valued at USD 7,963.44 Million in 2023 and is projected to reach USD 13,023.63 Million by 2030, growing at a CAGR of 7.28% during the forecast period 2024-2030.

    Global Statistical Analysis Software Market Drivers

    The market drivers for the Statistical Analysis Software Market can be influenced by various factors. These may include:

    Growing Data Complexity and Volume: The demand for sophisticated statistical analysis tools has been fueled by the exponential rise in data volume and complexity across a range of industries. Robust software solutions are necessary for organizations to evaluate and extract significant insights from huge datasets. Growing Adoption of Data-Driven Decision-Making: Businesses are adopting a data-driven approach to decision-making at a faster rate. Utilizing statistical analysis tools, companies can extract meaningful insights from data to improve operational effectiveness and strategic planning. Developments in Analytics and Machine Learning: As these fields continue to progress, statistical analysis software is now capable of more. These tools' increasing popularity can be attributed to features like sophisticated modeling and predictive analytics. A greater emphasis is being placed on business intelligence: Analytics and business intelligence are now essential components of corporate strategy. In order to provide business intelligence tools for studying trends, patterns, and performance measures, statistical analysis software is essential. Increasing Need in Life Sciences and Healthcare: Large volumes of data are produced by the life sciences and healthcare sectors, necessitating complex statistical analysis. The need for data-driven insights in clinical trials, medical research, and healthcare administration is driving the market for statistical analysis software. Growth of Retail and E-Commerce: The retail and e-commerce industries use statistical analytic tools for inventory optimization, demand forecasting, and customer behavior analysis. The need for analytics tools is fueled in part by the expansion of online retail and data-driven marketing techniques. Government Regulations and Initiatives: Statistical analysis is frequently required for regulatory reporting and compliance with government initiatives, particularly in the healthcare and finance sectors. In these regulated industries, statistical analysis software uptake is driven by this. Big Data Analytics's Emergence: As big data analytics has grown in popularity, there has been a demand for advanced tools that can handle and analyze enormous datasets effectively. Software for statistical analysis is essential for deriving valuable conclusions from large amounts of data. Demand for Real-Time Analytics: In order to make deft judgments fast, there is a growing need for real-time analytics. Many different businesses have a significant demand for statistical analysis software that provides real-time data processing and analysis capabilities. Growing Awareness and Education: As more people become aware of the advantages of using statistical analysis in decision-making, its use has expanded across a range of academic and research institutions. The market for statistical analysis software is influenced by the academic sector. Trends in Remote Work: As more people around the world work from home, they are depending more on digital tools and analytics to collaborate and make decisions. Software for statistical analysis makes it possible for distant teams to efficiently examine data and exchange findings.

  3. A

    App Data Statistics Tool Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). App Data Statistics Tool Report [Dataset]. https://www.archivemarketresearch.com/reports/app-data-statistics-tool-58979
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Mar 15, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global market for App Data Statistics Tools is experiencing robust growth, driven by the increasing adoption of mobile applications across various sectors and the rising need for data-driven decision-making. This market, estimated at $2.5 billion in 2025, is projected to achieve a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033. This significant expansion is fueled by several key factors, including the escalating demand for precise user behavior analysis, the necessity for enhanced app performance optimization, and the growing importance of personalized user experiences. The market is segmented by tool type (customized vs. universal) and application (social, information, gaming, e-commerce, tools, and others). The rise of sophisticated analytics platforms offering comprehensive data visualization and insightful reporting contributes significantly to the market's growth. Furthermore, the increasing adoption of cloud-based solutions simplifies data storage and analysis, enabling businesses of all sizes to leverage app data effectively. Competitive forces are shaping the landscape, with established players and emerging startups continuously innovating to offer advanced features and cater to the diverse needs of developers and businesses. The North American market currently holds a significant share, largely due to the concentration of technology companies and early adoption of advanced analytics tools. However, Asia-Pacific is expected to exhibit the fastest growth during the forecast period, driven by the burgeoning mobile app market in countries like India and China. The market faces certain restraints, such as data privacy concerns and the complexity of integrating different analytics tools. Nevertheless, the continued evolution of mobile app technology, alongside the development of more user-friendly and cost-effective analytics platforms, will continue to propel market expansion over the next decade. This growth underscores the strategic value of app data analytics in understanding user behavior, improving app functionality, and ultimately maximizing business success in the competitive mobile landscape.

  4. m

    Statistical Analysis Software Market Size, Share & Industry Trends Analysis...

    • marketresearchintellect.com
    Updated Aug 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2024). Statistical Analysis Software Market Size, Share & Industry Trends Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-statistical-analysis-software-market-size-and-forecast/
    Explore at:
    Dataset updated
    Aug 18, 2024
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Check Market Research Intellect's Statistical Analysis Software Market Report, pegged at USD 5.25 billion in 2024 and projected to reach USD 10.12 billion by 2033, advancing with a CAGR of 8.6% (2026-2033).Explore factors such as rising applications, technological shifts, and industry leaders.

  5. Leading data compilation and analytics presentation/reporting tools in U.S....

    • statista.com
    Updated Apr 30, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2016). Leading data compilation and analytics presentation/reporting tools in U.S. 2015 [Dataset]. https://www.statista.com/statistics/562654/united-states-data-analytics-data-compilation-and-presentation-tools/
    Explore at:
    Dataset updated
    Apr 30, 2016
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    This statistic depicts the distribution of tools used to compile data and present analytics and/or reports to management, according to a marketing survey of C-level executives, conducted in ************* by Black Ink. As of *************, * percent of respondents used statistical modeling tools, such as IBM's SPSS or the SAS Institute's Statistical Analysis System package, to compile and present their reports.

  6. f

    UC_vs_US Statistic Analysis.xlsx

    • figshare.com
    xlsx
    Updated Jul 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jul 9, 2020
    Dataset provided by
    Utrecht University
    Authors
    F. (Fabiano) Dalpiaz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

    Tagging scheme:
    Aligned (AL) - A concept is represented as a class in both models, either
    

    with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

    All the calculations and information provided in the following sheets
    

    originate from that raw data.

    Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,
    

    including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

    Sheet 3 (Size-Ratio):
    

    The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

    Sheet 4 (Overall):
    

    Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

    For sheet 4 as well as for the following four sheets, diverging stacked bar
    

    charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

    Sheet 5 (By-Notation):
    

    Model correctness and model completeness is compared by notation - UC, US.

    Sheet 6 (By-Case):
    

    Model correctness and model completeness is compared by case - SIM, HOS, IFA.

    Sheet 7 (By-Process):
    

    Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

    Sheet 8 (By-Grade):
    

    Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.

  7. Collection of example datasets used for the book - R Programming -...

    • figshare.com
    txt
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kingsley Okoye; Samira Hosseini (2023). Collection of example datasets used for the book - R Programming - Statistical Data Analysis in Research [Dataset]. http://doi.org/10.6084/m9.figshare.24728073.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Dec 4, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Kingsley Okoye; Samira Hosseini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This book is written for statisticians, data analysts, programmers, researchers, teachers, students, professionals, and general consumers on how to perform different types of statistical data analysis for research purposes using the R programming language. R is an open-source software and object-oriented programming language with a development environment (IDE) called RStudio for computing statistics and graphical displays through data manipulation, modelling, and calculation. R packages and supported libraries provides a wide range of functions for programming and analyzing of data. Unlike many of the existing statistical softwares, R has the added benefit of allowing the users to write more efficient codes by using command-line scripting and vectors. It has several built-in functions and libraries that are extensible and allows the users to define their own (customized) functions on how they expect the program to behave while handling the data, which can also be stored in the simple object system.For all intents and purposes, this book serves as both textbook and manual for R statistics particularly in academic research, data analytics, and computer programming targeted to help inform and guide the work of the R users or statisticians. It provides information about different types of statistical data analysis and methods, and the best scenarios for use of each case in R. It gives a hands-on step-by-step practical guide on how to identify and conduct the different parametric and non-parametric procedures. This includes a description of the different conditions or assumptions that are necessary for performing the various statistical methods or tests, and how to understand the results of the methods. The book also covers the different data formats and sources, and how to test for reliability and validity of the available datasets. Different research experiments, case scenarios and examples are explained in this book. It is the first book to provide a comprehensive description and step-by-step practical hands-on guide to carrying out the different types of statistical analysis in R particularly for research purposes with examples. Ranging from how to import and store datasets in R as Objects, how to code and call the methods or functions for manipulating the datasets or objects, factorization, and vectorization, to better reasoning, interpretation, and storage of the results for future use, and graphical visualizations and representations. Thus, congruence of Statistics and Computer programming for Research.

  8. f

    Data from: ODM Data Analysis—A tool for the automatic validation, monitoring...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Jun 22, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Doods, Justin; Ständer, Sonja; Brix, Tobias Johannes; Bruland, Philipp; Ernsting, Jan; Dugas, Martin; Neuhaus, Philipp; Storck, Michael; Sarfraz, Saad (2018). ODM Data Analysis—A tool for the automatic validation, monitoring and generation of generic descriptive statistics of patient data [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000711292
    Explore at:
    Dataset updated
    Jun 22, 2018
    Authors
    Doods, Justin; Ständer, Sonja; Brix, Tobias Johannes; Bruland, Philipp; Ernsting, Jan; Dugas, Martin; Neuhaus, Philipp; Storck, Michael; Sarfraz, Saad
    Description

    IntroductionA required step for presenting results of clinical studies is the declaration of participants demographic and baseline characteristics as claimed by the FDAAA 801. The common workflow to accomplish this task is to export the clinical data from the used electronic data capture system and import it into statistical software like SAS software or IBM SPSS. This software requires trained users, who have to implement the analysis individually for each item. These expenditures may become an obstacle for small studies. Objective of this work is to design, implement and evaluate an open source application, called ODM Data Analysis, for the semi-automatic analysis of clinical study data.MethodsThe system requires clinical data in the CDISC Operational Data Model format. After uploading the file, its syntax and data type conformity of the collected data is validated. The completeness of the study data is determined and basic statistics, including illustrative charts for each item, are generated. Datasets from four clinical studies have been used to evaluate the application’s performance and functionality.ResultsThe system is implemented as an open source web application (available at https://odmanalysis.uni-muenster.de) and also provided as Docker image which enables an easy distribution and installation on local systems. Study data is only stored in the application as long as the calculations are performed which is compliant with data protection endeavors. Analysis times are below half an hour, even for larger studies with over 6000 subjects.DiscussionMedical experts have ensured the usefulness of this application to grant an overview of their collected study data for monitoring purposes and to generate descriptive statistics without further user interaction. The semi-automatic analysis has its limitations and cannot replace the complex analysis of statisticians, but it can be used as a starting point for their examination and reporting.

  9. B

    Biostatistics Software Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Biostatistics Software Report [Dataset]. https://www.archivemarketresearch.com/reports/biostatistics-software-53353
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Mar 7, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The biostatistics software market is experiencing robust growth, driven by the increasing adoption of data-driven approaches in pharmaceutical research, clinical trials, and academic studies. The market, valued at approximately $2.5 billion in 2025, is projected to exhibit a Compound Annual Growth Rate (CAGR) of 12% from 2025 to 2033. This expansion is fueled by several key factors. Firstly, the rising volume of complex biological data necessitates sophisticated software solutions for analysis and interpretation. Secondly, advancements in machine learning and artificial intelligence are enhancing the capabilities of biostatistics software, enabling more accurate and efficient data processing. Thirdly, regulatory pressures demanding robust data analysis in the pharmaceutical and healthcare sectors are boosting demand for validated and compliant biostatistics tools. The market is segmented by software type (general-purpose versus specialized) and end-user (pharmaceutical companies, academic institutions, and others). Pharmaceutical companies represent a significant portion of the market due to their extensive reliance on clinical trial data analysis. However, the academic and research segments are also exhibiting strong growth due to increased research activities and funding. Geographically, North America and Europe currently dominate the market, but Asia-Pacific is expected to witness substantial growth in the coming years due to increasing healthcare spending and technological advancements in the region. The competitive landscape is characterized by a mix of established players offering comprehensive suites and specialized niche vendors. While leading players like IBM SPSS Statistics and Minitab enjoy significant market share based on their brand recognition and established user bases, smaller companies specializing in specific statistical methods or user interfaces are gaining traction by catering to niche demands. This competitive dynamic will likely drive innovation and further segmentation within the market, resulting in specialized software offerings tailored to particular research areas and user requirements. The challenges the market faces include the high cost of software licensing, the need for specialized training for effective utilization, and the potential integration complexities with existing data management systems. However, the overall growth trajectory remains positive, driven by the inherent need for sophisticated biostatistical analysis in various sectors.

  10. A

    App Data Statistics Tool Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). App Data Statistics Tool Report [Dataset]. https://www.archivemarketresearch.com/reports/app-data-statistics-tool-58940
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Mar 15, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The booming App Data Statistics Tool market is projected to reach $9.66 billion by 2033, growing at a CAGR of 18%. This report analyzes market size, trends, key players (like App Annie, Firebase, Mixpanel), segmentation (social, gaming, e-commerce apps), and regional growth. Discover insights to optimize your app strategy.

  11. Z

    Data Analysis for the Systematic Literature Review of DL4SE

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Jul 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cody Watson; Nathan Cooper; David Nader; Kevin Moran; Denys Poshyvanyk (2024). Data Analysis for the Systematic Literature Review of DL4SE [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4768586
    Explore at:
    Dataset updated
    Jul 19, 2024
    Dataset provided by
    College of William and Mary
    Washington and Lee University
    Authors
    Cody Watson; Nathan Cooper; David Nader; Kevin Moran; Denys Poshyvanyk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data Analysis is the process that supports decision-making and informs arguments in empirical studies. Descriptive statistics, Exploratory Data Analysis (EDA), and Confirmatory Data Analysis (CDA) are the approaches that compose Data Analysis (Xia & Gong; 2014). An Exploratory Data Analysis (EDA) comprises a set of statistical and data mining procedures to describe data. We ran EDA to provide statistical facts and inform conclusions. The mined facts allow attaining arguments that would influence the Systematic Literature Review of DL4SE.

    The Systematic Literature Review of DL4SE requires formal statistical modeling to refine the answers for the proposed research questions and formulate new hypotheses to be addressed in the future. Hence, we introduce DL4SE-DA, a set of statistical processes and data mining pipelines that uncover hidden relationships among Deep Learning reported literature in Software Engineering. Such hidden relationships are collected and analyzed to illustrate the state-of-the-art of DL techniques employed in the software engineering context.

    Our DL4SE-DA is a simplified version of the classical Knowledge Discovery in Databases, or KDD (Fayyad, et al; 1996). The KDD process extracts knowledge from a DL4SE structured database. This structured database was the product of multiple iterations of data gathering and collection from the inspected literature. The KDD involves five stages:

    Selection. This stage was led by the taxonomy process explained in section xx of the paper. After collecting all the papers and creating the taxonomies, we organize the data into 35 features or attributes that you find in the repository. In fact, we manually engineered features from the DL4SE papers. Some of the features are venue, year published, type of paper, metrics, data-scale, type of tuning, learning algorithm, SE data, and so on.

    Preprocessing. The preprocessing applied was transforming the features into the correct type (nominal), removing outliers (papers that do not belong to the DL4SE), and re-inspecting the papers to extract missing information produced by the normalization process. For instance, we normalize the feature “metrics” into “MRR”, “ROC or AUC”, “BLEU Score”, “Accuracy”, “Precision”, “Recall”, “F1 Measure”, and “Other Metrics”. “Other Metrics” refers to unconventional metrics found during the extraction. Similarly, the same normalization was applied to other features like “SE Data” and “Reproducibility Types”. This separation into more detailed classes contributes to a better understanding and classification of the paper by the data mining tasks or methods.

    Transformation. In this stage, we omitted to use any data transformation method except for the clustering analysis. We performed a Principal Component Analysis to reduce 35 features into 2 components for visualization purposes. Furthermore, PCA also allowed us to identify the number of clusters that exhibit the maximum reduction in variance. In other words, it helped us to identify the number of clusters to be used when tuning the explainable models.

    Data Mining. In this stage, we used three distinct data mining tasks: Correlation Analysis, Association Rule Learning, and Clustering. We decided that the goal of the KDD process should be oriented to uncover hidden relationships on the extracted features (Correlations and Association Rules) and to categorize the DL4SE papers for a better segmentation of the state-of-the-art (Clustering). A clear explanation is provided in the subsection “Data Mining Tasks for the SLR od DL4SE”. 5.Interpretation/Evaluation. We used the Knowledge Discover to automatically find patterns in our papers that resemble “actionable knowledge”. This actionable knowledge was generated by conducting a reasoning process on the data mining outcomes. This reasoning process produces an argument support analysis (see this link).

    We used RapidMiner as our software tool to conduct the data analysis. The procedures and pipelines were published in our repository.

    Overview of the most meaningful Association Rules. Rectangles are both Premises and Conclusions. An arrow connecting a Premise with a Conclusion implies that given some premise, the conclusion is associated. E.g., Given that an author used Supervised Learning, we can conclude that their approach is irreproducible with a certain Support and Confidence.

    Support = Number of occurrences this statement is true divided by the amount of statements Confidence = The support of the statement divided by the number of occurrences of the premise

  12. A

    App Data Statistics Tool Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Mar 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2025). App Data Statistics Tool Report [Dataset]. https://www.marketresearchforecast.com/reports/app-data-statistics-tool-44049
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Mar 20, 2025
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Discover the booming App Data Statistics Tool market! This comprehensive analysis reveals key trends, growth drivers, and leading companies shaping this dynamic sector, including market size projections to 2033. Learn about regional market share and unlock actionable insights for your app strategy.

  13. A

    AI Tools for Data Analysis Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). AI Tools for Data Analysis Report [Dataset]. https://www.archivemarketresearch.com/reports/ai-tools-for-data-analysis-18014
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Feb 10, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The size of the AI Tools for Data Analysis market was valued at USD 24160 million in 2024 and is projected to reach USD XXX million by 2033, with an expected CAGR of XX % during the forecast period.

  14. S

    Statistics Software Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Nov 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Statistics Software Report [Dataset]. https://www.datainsightsmarket.com/reports/statistics-software-540803
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Nov 2, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global Statistics Software market is projected to experience robust growth, with a current estimated market size of approximately $4,500 million in 2025 and an anticipated Compound Annual Growth Rate (CAGR) of around 11% during the forecast period of 2025-2033. This expansion is significantly driven by the increasing demand for advanced analytical capabilities across various sectors. The Scientific Research segment is a primary beneficiary, leveraging statistical software for hypothesis testing, data modeling, and drawing conclusive insights from complex datasets. In parallel, the Finance industry is witnessing a surge in adoption, fueled by the need for sophisticated tools for risk assessment, algorithmic trading, fraud detection, and predictive modeling. Industrial applications are also contributing to market growth, with businesses utilizing statistical software for quality control, process optimization, and supply chain management to enhance efficiency and reduce operational costs. Emerging economies, particularly in the Asia Pacific region, are becoming crucial growth hubs due to increased investment in data analytics and a growing pool of skilled professionals. Key trends shaping the Statistics Software market include the rise of cloud-based solutions, offering greater accessibility, scalability, and cost-effectiveness for businesses of all sizes. Furthermore, the integration of artificial intelligence (AI) and machine learning (ML) within statistical software is a significant development, enabling automated data analysis, pattern recognition, and more accurate forecasting. While the market is poised for substantial growth, certain restraints could temper this trajectory. The high cost of advanced statistical software and the need for specialized expertise to effectively utilize its full potential can be prohibitive for smaller organizations. Data privacy and security concerns, especially with the increasing volume of sensitive data being processed, also pose a challenge, necessitating robust security features and compliance with evolving regulations. However, the continuous innovation by leading companies like Microsoft, IBM, and SAS Institute, alongside emerging players like RapidMiner and Knime, in developing user-friendly interfaces and powerful analytical tools, is expected to mitigate these restraints and propel the market forward.

  15. Change over time analysis (CoTA) tool

    • gov.uk
    Updated Mar 31, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Energy & Climate Change (2011). Change over time analysis (CoTA) tool [Dataset]. https://www.gov.uk/government/statistics/change-over-time-analysis-cota-tool
    Explore at:
    Dataset updated
    Mar 31, 2011
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department of Energy & Climate Change
    Description

    Change over Time Analysis (CoTA) Viewer is a visual tool with accompanying Excel worksheets, which assists the analysis of change over time for small areas. In this version, electricity and gas data from 2005 to 2009 are used to analyse change at Middle – Layer Super Output Area in England and Wales.

    This tool supports the strategy for analysing change over time for small areas created by Neighborhood Statistics.

    The tool is available from the http://webarchive.nationalarchives.gov.uk/20130109092117/http:/www.decc.gov.uk/en/content/cms/statistics/energy_stats/regional/analytical/analytical.aspx">National Archives: Analytical tools web page.

    Access the http://www.neighbourhood.statistics.gov.uk/dissemination/Info.do;jessionid=Xb1mQqlJXRcJdnCtQZpzlQJXGpxd7XcsJ3PkXcvpG9dwpDTNVQGM!452292141!1357522281515?m=0&s=1357522281515&enc=1&page=analysisandguidance/analysistoolkit/analysis-toolkit.htm&nsjs=true&nsck=true&nssvg=false&nswid=1680">Neighbourhood Statistics Analysis Toolkit.

  16. Data from: Additional file 1 of MEPHAS: an interactive graphical user...

    • figshare.com
    zip
    Updated Feb 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yi Zhou; Siu-wai Leung; Shosuke Mizutani; Tatsuya Takagi; Yu-Shi Tian (2024). Additional file 1 of MEPHAS: an interactive graphical user interface for medical and pharmaceutical statistical analysis with R and Shiny [Dataset]. http://doi.org/10.6084/m9.figshare.12285290.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Feb 14, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Yi Zhou; Siu-wai Leung; Shosuke Mizutani; Tatsuya Takagi; Yu-Shi Tian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1: AF_table1.docx Graphic user interfaces and statistical methods in MEPHAS; AF_table2.docx Comparison of methods in MEPHAS with EZR, FSFS, and Radiant; AF_list.docx R packages used in MEPHAS; AF_result.docx The results in Example 1 and Example 2.

  17. q

    Data from: A Customizable Inquiry-Based Statistics Teaching Application for...

    • qubeshub.org
    Updated Apr 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mikus Abolins-Abols*; Natalie Christian; Jeffery Masters; Rachel Pigg (2024). A Customizable Inquiry-Based Statistics Teaching Application for Introductory Biology Students [Dataset]. https://qubeshub.org/publications/4651/?v=1
    Explore at:
    Dataset updated
    Apr 5, 2024
    Dataset provided by
    QUBES
    Authors
    Mikus Abolins-Abols*; Natalie Christian; Jeffery Masters; Rachel Pigg
    Description

    Building strong quantitative skills prepares undergraduate biology students for successful careers in science and medicine. While math and statistics anxiety can negatively impact student learning within biology classrooms, instructors may reduce this anxiety by steadily building student competency in quantitative reasoning through instructional scaffolding, application-based approaches, and simple computer program interfaces. However, few statistical programs exist that meet all needs of an inclusive, inquiry-based laboratory course. These needs include an open-source program, a simple interface, little required background knowledge in statistics for student users, and customizability to minimize cognitive load, align with course learning outcomes, and create desirable difficulty. To address these needs, we used the Shiny package in R to develop a custom statistical analysis application. Our “BioStats” app provides students with scaffolded learning experiences in applied statistics that promotes student agency and is customizable by the instructor. It introduces students to the strengths of the R interface, while eliminating the need for complex coding in the R programming language. It also prioritizes practical implementation of statistical analyses over learning statistical theory. To our knowledge, this is the first statistics teaching tool where students are presented basic statistics initially, more complex analyses as they advance, and includes an option to learn R statistical coding. The BioStats app interface yields a simplified introduction to applied statistics that is adaptable to many biology laboratory courses.

    Primary Image: Singing Junco. A sketch of a junco singing on a pine tree branch, created by the lead author of this paper.

  18. m

    Survey results for Third Party Risk Management Tool Selection Criteria

    • data.mendeley.com
    Updated Jun 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rakesh Venugopal (2024). Survey results for Third Party Risk Management Tool Selection Criteria [Dataset]. http://doi.org/10.17632/rzzd7nz8vv.1
    Explore at:
    Dataset updated
    Jun 4, 2024
    Authors
    Rakesh Venugopal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The research data utilized in this study primarily consists of responses from a survey administered to information security and risk management professionals globally. The survey was designed to elicit the relative importance of various criteria when selecting a Third-Party Risk Management (TPRM) tool. The survey employed a weighted scale, allowing respondents to assign a level of importance (e.g., not important, somewhat important, very important) to each of the criteria identified in the applied framework.

    The survey sample encompassed a diverse range of roles and levels of seniority within organizations, including CEOs, VPs, auditors, and managers. This diversity aimed to capture a comprehensive view of the priorities and preferences across the industry.

    The collected survey data was then analyzed using descriptive statistics and a weighted average approach to determine the mean and median scores for each evaluation criterion. This analysis allowed for a quantitative assessment of the relative importance of different features and functionalities in TPRM tools, providing valuable insights into the decision-making process of industry professionals.

    Additionally, the study incorporated a review of existing literature on TPRM and tool selection. This literature review served to identify key concepts, trends, and gaps in knowledge, informing the development of the TPRMTSF framework and the selection of survey criteria.

    The combination of survey data and literature review provides a comprehensive foundation for the research findings and recommendations presented in this study. By analyzing both empirical data and existing knowledge, the study offers a well-rounded perspective on the challenges and opportunities associated with TPRM tool selection.

  19. h

    Data Acquisition & Analysis Software Market Statistics & Facts

    • htfmarketinsights.com
    pdf & excel
    Updated Oct 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HTF Market Intelligence (2025). Data Acquisition & Analysis Software Market Statistics & Facts [Dataset]. https://htfmarketinsights.com/report/4390503-data-acquisition-analysis-software-market
    Explore at:
    pdf & excelAvailable download formats
    Dataset updated
    Oct 23, 2025
    Dataset authored and provided by
    HTF Market Intelligence
    License

    https://www.htfmarketinsights.com/privacy-policyhttps://www.htfmarketinsights.com/privacy-policy

    Time period covered
    2019 - 2031
    Area covered
    Global
    Description

    Global Data Acquisition & Analysis Software Market is segmented by Application (Pharma R&D, Clinical Trials, Academic Research, Quality Control, Environmental Testing), Type (Lab Data Management Software, Process Analytics Software, Statistical Analysis Software, Instrument Control Software, LIMS Integration Tools), and Geography (North America_ LATAM_ West Europe_Central & Eastern Europe_ Northern Europe_ Southern Europe_ East Asia_ Southeast Asia_ South Asia_ Central Asia_ Oceania_ MEA)

  20. b

    Guidelines for Computing Summary Statistics for Data-Sets Containing...

    • datahub.bvcentre.ca
    Updated Jun 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Guidelines for Computing Summary Statistics for Data-Sets Containing Non-Detects - Dataset - BVRC DataHub [Dataset]. https://datahub.bvcentre.ca/dataset/guidelines-for-computing-summary-statistics-for-data-sets-containing-non-detects
    Explore at:
    Dataset updated
    Jun 3, 2024
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    INTRODUCTION As part of its responsibilities, the BC Ministry of Environment monitors water quality in the province’s streams, rivers, and lakes. Often, it is necessary to compile statistics involving concentrations of contaminants or other compounds. Quite often the instruments used cannot measure concentrations below certain values. These observations are called non-detects or less thans. However, non-detects pose a difficulty when it is necessary to compute statistical measurements such as the mean, the median, and the standard deviation for a data set. The way non-detects are handled can affect the quality of any statistics generated. Non-detects, or censored data are found in many fields such as medicine, engineering, biology, and environmetrics. In such fields, it is often the case that the measurements of interest are below some threshold. Dealing with non-detects is a significant issue and statistical tools using survival or reliability methods have been developed. Basically, there are three approaches for treating data containing censored values: 1. substitution, which gives poor results and therefore, is not recommended in the literature; 2. maximum likelihood estimation, which requires an assumption of some distributional form; and 3. and nonparametric methods which assess the shape of the data based on observed percentiles rather than a strict distributional form. This document provides guidance on how to record censor data, and on when and how to use certain analysis methods when the percentage of censored observations is less than 50%. The methods presented in this document are:1. substitution; 2. Kaplan-Meier, as part of nonparametric methods; 3. lognormal model based on maximum likelihood estimation; 4. and robust regression on order statistics, which is a semiparametric method. Statistical software suitable for survival or reliability analysis is available for dealing with censored data. This software has been widely used in medical and engineering environments. In this document, methods are illustrated with both R and JMP software packages, when possible. JMP often requires some intermediate steps to obtain summary statistics with most of the methods described in this document. R, with the NADA package is usually straightforward. The package NADA was developed specifically for computing statistics with non-detects in environmental data based on Helsel (2005b). The data used to illustrate the methods described for computing summary statistics for non-detects are either simulated or based on information acquired from the B.C. Ministry of Environment. This document is strongly based on the book Nondetects And Data Analysis written by Dennis R. Helsel in 2005 (Helsel, 2005b).

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2025). Market share of leading data analytics tools globally 2023 [Dataset]. https://www.statista.com/statistics/982516/most-popular-data-analytics-software/
Organization logo

Market share of leading data analytics tools globally 2023

Explore at:
Dataset updated
Jun 26, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Dec 2022 - Mar 2023
Area covered
Worldwide
Description

In 2023, Morningstar Advisor Workstation was by far the most popular data analytics software worldwide. According to a survey carried out between December 2022 and March 2023, the market share of Morningstar Advisor Workstation was ***** percent. It was followed by Riskalyze Elite, with ***** percent, and YCharts, with ***** percent.

Search
Clear search
Close search
Google apps
Main menu