https://www.statsndata.org/how-to-order
The Next-Generation Sequencing (NGS) Informatics market has rapidly evolved over the past decade, becoming an integral component in genomics research, personalized medicine, and various biomedical applications. This market encompasses software and analytics tools that handle the vast data generated from NGS technologies.
This graph presents the results of a survey, conducted by BARC in 2014/15, into the current and planned use of technology for the analysis of big data. At the beginning of 2015, ** percent of respondents indicated that their company was already using a big data analytical appliance for big data analysis.
https://www.archivemarketresearch.com/privacy-policy
The market for statistical analysis software is segmented by various factors.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The complete dataset used in the analysis comprises 36 samples, each described by 11 numeric features and 1 target. The attributes considered were caspase 3/7 activity, Mitotracker red CMXRos area and intensity (3 h and 24 h incubations with both compounds), Mitosox oxidation (3 h incubation with the referred compounds) and oxidation rate, DCFDA fluorescence (3 h and 24 h incubations with either compound) and oxidation rate, and DQ BSA hydrolysis. The target of each instance corresponds to one of the 9 possible classes (4 samples per class): Control, 6.25, 12.5, 25 and 50 µM for 6-OHDA, and 0.03, 0.06, 0.125 and 0.25 µM for rotenone. The dataset is balanced, contains no missing values, and was standardized across features. The small number of samples precluded a full statistical analysis of the results; nevertheless, it allowed the identification of relevant hidden patterns and trends.
Exploratory data analysis, information gain, hierarchical clustering, and supervised predictive modeling were performed using Orange Data Mining version 3.25.1 [41]. Hierarchical clustering was performed using the Euclidean distance metric and weighted linkage. Cluster maps were plotted to relate the features with higher mutual information (in rows) to instances (in columns), with the color of each cell representing the normalized level of a particular feature in a specific instance. The information is grouped both in rows and in columns by a two-way hierarchical clustering method using Euclidean distances and average linkage. Stratified cross-validation was used to train the supervised decision tree. A set of preliminary empirical experiments was performed to choose the best parameters for each algorithm, and we verified that, within moderate variations, there were no significant changes in the outcome. The following settings were adopted for the decision tree algorithm: minimum number of samples in leaves: 2; minimum number of samples required to split an internal node: 5; stop splitting when the majority reaches: 95%; criterion: gain ratio. The performance of the supervised model was assessed using accuracy, precision, recall, F-measure and area under the ROC curve (AUC) metrics.
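As a rough illustration only (the analyses above were run in the Orange Data Mining GUI, not in R), a minimal R sketch mirroring the clustering choices described here might look as follows; the matrix `X` is a placeholder, not the published data.

```r
# Minimal sketch of the clustering choices above (an assumption: the authors
# used the Orange GUI). `X` stands in for the standardized 36 x 11 data matrix.
set.seed(1)
X <- matrix(rnorm(36 * 11), nrow = 36)

d  <- dist(X, method = "euclidean")     # Euclidean distance metric
hc <- hclust(d, method = "mcquitty")    # "mcquitty" = weighted (WPGMA) linkage
plot(hc)

# Two-way clustered heatmap analogous to the cluster maps described above:
# Euclidean distances with average (UPGMA) linkage on both rows and columns.
heatmap(X,
        distfun   = function(m) dist(m, method = "euclidean"),
        hclustfun = function(d) hclust(d, method = "average"))
```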
https://www.statsndata.org/how-to-order
The Tabletop Role-Playing Game (TTRPG) market has evolved dramatically over the past few decades, emerging as a vibrant community of storytellers and strategists engaging in immersive gameplay experiences. Initially characterized by traditional pen-and-paper formats, the industry has diversified to include a plethora of new formats.
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This is the experimental fostering care publication, comprising several datasets.
Source agency: Office for Standards in Education, Children's Services and Skills
Designation: Experimental Official Statistics
Language: English
Alternative title: Experimental statistics: fostering care datasets
https://www.verifiedindustryinsights.com/privacy-policy
The market size of the Statistics Software Market is categorized by Deployment Type (On-Premise, Cloud-Based), Application (Data Analysis, Data Visualization, Predictive Analytics, Statistical Analysis, Reporting), End-User Industry (Healthcare, Finance, Retail, Education, Government), and geographical region (North America, Europe, Asia-Pacific, South America, and Middle East and Africa).
https://data.gov.tw/license
This dataset provides statistics on the number of food poisoning cases in eating places after the year 1981. It is available for use by the general public, industry, academic institutions, and others.
Chemical concentration, exposure, and health risk data for U.S. census tracts from National Scale Air Toxics Assessment (NATA). This dataset is associated with the following publication: Huang, H., R. Tornero-Velez, and T. Barzyk. Associations between socio-demographic characteristics and chemical concentrations contributing to cumulative exposures in the United States. Journal of Exposure Science and Environmental Epidemiology. Nature Publishing Group, London, UK, 27(6): 544-550, (2017).
The National Energy Efficiency Data-Framework (NEED) was set up to provide a better understanding of energy use and energy efficiency in domestic and non-domestic buildings in Great Britain. The data framework brings together data about a property - including energy consumption and energy efficiency measures installed - at household level.
We identified 2 processing errors in this edition of the Domestic NEED Annual report and corrected them. The changes are small and do not affect the overall findings of the report, only the domestic energy consumption estimates. The impact of energy efficiency measures analysis remains unchanged. The revisions are summarised here:
Error 2: Some properties incorrectly excluded from the Scotland multiple attributes tables
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In the last decade, a plethora of algorithms have been developed for spatial ecology studies. In our case, we use some of these codes for underwater research work in the applied ecology analysis of threatened endemic fishes and their natural habitat. For this, we developed scripts in the RStudio® environment to run spatial and statistical analyses for ecological response and spatial distribution models (e.g., Hijmans & Elith, 2017; Den Burg et al., 2020). The employed R packages are as follows: caret (Kuhn et al., 2020), corrplot (Wei & Simko, 2017), devtools (Wickham, 2015), dismo (Hijmans & Elith, 2017), gbm (Freund & Schapire, 1997; Friedman, 2002), ggplot2 (Wickham et al., 2019), lattice (Sarkar, 2008), lattice (Musa & Mansor, 2021), maptools (Hijmans & Elith, 2017), modelmetrics (Hvitfeldt & Silge, 2021), pander (Wickham, 2015), plyr (Wickham & Wickham, 2015), pROC (Robin et al., 2011), raster (Hijmans & Elith, 2017), RColorBrewer (Neuwirth, 2014), Rcpp (Eddelbeuttel & Balamura, 2018), rgdal (Verzani, 2011), sdm (Naimi & Araujo, 2016), sf (e.g., Zainuddin, 2023), sp (Pebesma, 2020) and usethis (Gladstone, 2022).
It is important to follow all the codes in order to obtain results from the ecological response and spatial distribution models. In particular, for the ecological scenario we selected the Generalized Linear Model (GLM), and for the geographic scenario we selected DOMAIN, also known as Gower's metric (Carpenter et al., 1993). We selected this regression method and this distance similarity metric because of their adequacy and robustness for studies with endemic or threatened species (e.g., Naoki et al., 2006). Next, we explain the statistical parameterization of the codes used in the GLM and DOMAIN runs:
In the first instance, we generated the background points and extracted the values of the variables (Code2_Extract_values_DWp_SC.R). Barbet-Massin et al. (2012) recommend the use of 10,000 background points when using regression methods (e.g., Generalized Linear Model) or distance-based models (e.g., DOMAIN). However, we consider factors such as the extent of the area and the type of study species to be important for the correct selection of the number of points (Pers. Obs.). Then, we extracted the values of the predictor variables (e.g., bioclimatic, topographic, demographic, habitat) at the presence and background points (e.g., Hijmans and Elith, 2017), as sketched below.
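A hedged sketch of this step (the original Code2_Extract_values_DWp_SC.R is not shown; `preds` and `occ` are hypothetical names):

```r
# `preds` is a hypothetical RasterStack of predictor variables and `occ` a
# two-column matrix of presence coordinates; both are assumptions.
library(raster)
library(dismo)

bg <- randomPoints(preds, n = 10000)   # 10,000 background points (Barbet-Massin et al., 2012)

pres_vals <- extract(preds, occ)       # predictor values at presence points
bg_vals   <- extract(preds, bg)        # predictor values at background points
```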
Subsequently, we subdivided both the presence and background point groups into 75% training data and 25% test data each, following the method of Soberón & Nakamura (2009) and Hijmans & Elith (2017). For training control, the 10-fold cross-validation method was selected, with the response variable presence assigned as a factor. If some other variable is important for the study species, it should also be assigned as a factor (Kim, 2009). A sketch of this step follows.
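A hedged sketch of the subdivision and training control (`dat` and `pa` are hypothetical names for the combined presence/background table and its 0/1 response column):

```r
library(caret)

dat$pa   <- factor(dat$pa)                         # response assigned as a factor
in_train <- createDataPartition(dat$pa, p = 0.75, list = FALSE)
train    <- dat[in_train, ]                        # 75% training data
test     <- dat[-in_train, ]                       # 25% test data

ctrl <- trainControl(method = "cv", number = 10)   # 10-fold cross-validation
```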
After that, we ran the code for the GBM method (Gradient Boosting Machine; Code3_GBM_Relative_contribution.R and Code4_Relative_contribution.R), from which we obtained the relative contribution of the variables used in the model. We parameterized the code with a Gaussian distribution and 5,000 cross-validated iterations (e.g., Friedman, 2002; Kim, 2009; Hijmans and Elith, 2017). In addition, we selected a validation interval of 4 random training points (personal test). The obtained plots were partial dependence plots for each predictor variable.
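A hedged sketch of this step (the Code3/Code4 scripts are not shown; `train` as above, with `pa01` a 0/1 numeric copy of the response for the Gaussian fit, and cv.folds = 4 our reading of the 4-point validation interval):

```r
library(gbm)

train$pa01 <- as.numeric(as.character(train$pa))

fit <- gbm(pa01 ~ . - pa, data = train,
           distribution = "gaussian",   # Gaussian distribution, as parameterized above
           n.trees = 5000,              # 5,000 iterations
           cv.folds = 4)                # assumed mapping of the validation interval

summary(fit)                            # relative contribution of each variable
plot(fit, i.var = 1)                    # partial dependence for the first predictor
```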
Subsequently, the correlation between variables was computed by Pearson's method (Code5_Pearson_Correlation.R) to evaluate multicollinearity (Guisan & Hofer, 2003). It is recommended to use a bivariate correlation threshold of ±0.70 to discard highly correlated variables (e.g., Awan et al., 2021), as in the sketch below.
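A hedged sketch of the multicollinearity check (`pred_cols` is a hypothetical vector of predictor column names):

```r
library(corrplot)

cm <- cor(train[, pred_cols], method = "pearson")   # Pearson correlation matrix
corrplot(cm, method = "number")

# Pairs with |r| > 0.70 are candidates for removal
which(abs(cm) > 0.70 & abs(cm) < 1, arr.ind = TRUE)
```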
Once the above codes were run, we uploaded the same subgroups (i.e., presence and background groups with 75% training and 25% testing) (Code6_Presence&backgrounds.R) for the GLM method code (Code7_GLM_model.R). Here, we first ran the GLM models per variable to obtain the p-significance value of each variable (alpha ≤ 0.05); we selected the value one (i.e., presence) as the likelihood factor. The generated models include polynomial terms to obtain linear and quadratic responses (e.g., Fielding and Bell, 1997; Allouche et al., 2006). From these results, we ran ecological response curve models, where the resulting plots show the probability of occurrence against the values of continuous variables or the categories of discrete variables. The points of the presence and background training group are also included.
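A hedged sketch of a per-variable GLM with linear and quadratic terms (`bio1` is a hypothetical predictor; a binomial GLM on the 0/1 presence response is our assumed reading):

```r
m <- glm(pa01 ~ poly(bio1, 2), data = train, family = binomial)
summary(m)$coefficients                 # term-wise p-values (alpha <= 0.05)

# Response curve: probability of occurrence across the variable's range
nd <- data.frame(bio1 = seq(min(train$bio1), max(train$bio1), length.out = 100))
plot(nd$bio1, predict(m, newdata = nd, type = "response"), type = "l",
     xlab = "bio1", ylab = "Probability of occurrence")
```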
On the other hand, a global GLM was also run, and the generalized model was evaluated by means of a 2 x 2 contingency matrix including both observed and predicted records. A representation of this is shown in Table 1 (adapted from Allouche et al., 2006). In this process we selected an arbitrary threshold of 0.5 to obtain better modeling performance and to avoid a high percentage of type I (omission) or type II (commission) errors (e.g., Carpenter et al., 1993; Fielding and Bell, 1997; Allouche et al., 2006; Kim, 2009; Hijmans and Elith, 2017).
Table 1. Example of 2 x 2 contingency matrix for calculating performance metrics for GLM models. A represents true presence records (true positives), B represents false presence records (false positives - error of commission), C represents true background points (true negatives) and D represents false backgrounds (false negatives - errors of omission).
                 Validation set
Model            True        False
Presence         A           B
Background       C           D
We then calculated the Overall accuracy and True Skill Statistic (TSS) metrics. The first is used to assess the proportion of correctly predicted cases, while the second assesses the prevalence of correctly predicted cases (Olden and Jackson, 2002). TSS gives equal importance to the prevalence of presence prediction and to the correction for random performance (Fielding and Bell, 1997; Allouche et al., 2006). Both can be computed directly from Table 1, as sketched below.
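A direct transcription of the standard formulas, using A, B, C, D as defined in Table 1:

```r
# Overall accuracy: proportion of correctly predicted cases (A and C).
overall_acc <- function(A, B, C, D) (A + C) / (A + B + C + D)

# TSS = sensitivity + specificity - 1, with sensitivity = A / (A + D)
# and specificity = C / (B + C).
tss <- function(A, B, C, D) A / (A + D) + C / (B + C) - 1
```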
The last code (i.e., Code8_DOMAIN_SuitHab_model.R) is for species distribution modelling using the DOMAIN algorithm (Carpenter et al., 1993). Here, we loaded the variable stack and the presence and background groups, each subdivided into 75% training and 25% test data. Only the presence training subset and the predictor variable stack were included in the calculation of the DOMAIN metric, as well as in the evaluation and validation of the model.
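A hedged sketch of this step (Code8_DOMAIN_SuitHab_model.R is not shown; dismo::domain implements the Gower-metric algorithm of Carpenter et al. (1993), and `preds` and `pres_train` are the hypothetical names used above):

```r
library(dismo)

dm   <- domain(preds, pres_train)   # fit on the presence training subset only
suit <- predict(preds, dm)          # habitat suitability surface
plot(suit)
```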
Regarding the model evaluation and estimation, we selected the following estimators:
1) partial ROC, which evaluates the separation between the curves of positive (i.e., correctly predicted presence) and negative (i.e., correctly predicted absence) cases. The farther apart these curves are, the better the model's prediction performance for the correct spatial distribution of the species (Manzanilla-Quiñones, 2020).
2) the ROC/AUC curve for model validation, where an optimal performance threshold is estimated to give an expected confidence of 75% to 99% probability (De Long et al., 1988); see the sketch below.
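A hedged sketch of the ROC/AUC validation with pROC (Robin et al., 2011), scoring the suitability surface at the 25% test presences and backgrounds (`pres_test` and `bg_test` are hypothetical names):

```r
library(raster)
library(pROC)

obs  <- c(rep(1, nrow(pres_test)), rep(0, nrow(bg_test)))   # observed labels
pred <- c(extract(suit, pres_test), extract(suit, bg_test)) # predicted suitability
r    <- roc(obs, pred)
auc(r)                                                      # area under the ROC curve
plot(r)
```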
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Many statistical tools have been developed to aid in the assessment of a numerical model's quality at reproducing observations. Some of these techniques focus on the identification of events within the data set: times when the observed value is beyond some threshold that defines it as a value of keen interest. An example is whether it will rain, in which events are defined as any precipitation above some defined amount. A method called the sliding threshold of observation for numeric evaluation (STONE) curve sweeps the event definition threshold of both the model output and the observations, resulting in the identification of threshold intervals for which the model does well at sorting the observations into events and nonevents. An excellent data-model comparison will have a smooth STONE curve, but the STONE curve can have wiggles and ripples in it. These features reveal clusters where the model systematically overestimates or underestimates the observations. This study establishes the connection between features in the STONE curve and attributes of the data-model relationship. The method is applied to a space weather example.
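The abstract does not give the STONE formulas; a toy sketch based only on the description above (one event-definition threshold swept over both observations and model output, tracing detection rates), and not the authors' implementation, might look like this:

```r
# Toy STONE-style threshold sweep: for each threshold, classify both the
# observations and the model output as event/nonevent and record the
# probability of detection (pod) and probability of false detection (pofd).
stone_curve <- function(obs, mod, thresholds) {
  t(sapply(thresholds, function(th) {
    eo <- obs >= th                              # observed events
    em <- mod >= th                              # modeled events
    hits         <- sum(em & eo)
    false_alarms <- sum(em & !eo)
    misses       <- sum(!em & eo)
    correct_neg  <- sum(!em & !eo)
    c(pofd = false_alarms / (false_alarms + correct_neg),
      pod  = hits / (hits + misses))
  }))
}

# Example usage on synthetic data:
# stone_curve(obs = rnorm(100), mod = rnorm(100), thresholds = seq(-2, 2, 0.1))
```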
https://www.statsndata.org/how-to-order
The Underwater Inspection Solutions market has witnessed significant growth over the past decade, driven by the increasing need for asset integrity management in various industries, including oil and gas, marine, civil engineering, and environmental monitoring. Demand continues to rise with the growing complexity of underwater infrastructure.
https://data.gov.tw/license
The statistical analysis table for the adjustments of income and deductions provided by the Taipei National Taxation Bureau of the Ministry of Finance.
Officer Involved Shooting (OIS) Database and Statistical Analysis. Data is updated after there is an officer involved shooting.
PIU# / Incident # - the number associated with either the incident or used as reference to store the items in our evidence rooms
Date of Occurrence Month - month the incident occurred (Note: the year is labeled on the tab of the spreadsheet)
Date of Occurrence Day - day of the month the incident occurred (Note: the year is labeled on the tab of the spreadsheet)
Time of Occurrence - time the incident occurred
Address of Incident - the location where the incident occurred
Division - the LMPD division in which the incident actually occurred
Beat - the LMPD beat in which the incident actually occurred
Investigation Type - the type of investigation (shooting or death)
Case Status - status of the case (open or closed)
Suspect Name - the name of the suspect involved in the incident
Suspect Race - the race of the suspect involved in the incident (W-White, B-Black)
Suspect Sex - the gender of the suspect involved in the incident
Suspect Age - the age of the suspect involved in the incident
Suspect Ethnicity - the ethnicity of the suspect involved in the incident (H-Hispanic, N-Not Hispanic)
Suspect Weapon - the type of weapon the suspect used in the incident
Officer Name - the name of the officer involved in the incident
Officer Race - the race of the officer involved in the incident (W-White, B-Black, A-Asian)
Officer Sex - the gender of the officer involved in the incident
Officer Age - the age of the officer involved in the incident
Officer Ethnicity - the ethnicity of the officer involved in the incident (H-Hispanic, N-Not Hispanic)
Officer Years of Service - the number of years the officer had been serving at the time of the incident
Lethal Y/N - whether or not the incident involved a death (Y-Yes, N-No, continued-pending)
Narrative - a description of what was determined from the investigation
Contact: Carol Boyle, carol.boyle@louisvilleky.gov
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository contains material related to the analysis performed in the article "Best Practices for Your Exploratory Factor Analysis: Factor Tutorial". The material includes the data used in the analyses in .dat format, the labels (.txt) of the variables used in the Factor software, the outputs (.txt) evaluated in the article, and videos (.mp4 with English subtitles) recorded for the purpose of explaining the article. The videos can also be accessed in the following playlist: https://youtube.com/playlist?list=PLDfyRtHbxiZ3R-T3H1cY8dusz273aUFVe. Below is a summary of the article:
"Exploratory Factor Analysis (EFA) is one of the statistical methods most widely used in Administration, however, its current practice coexists with rules of thumb and heuristics given half a century ago. The purpose of this article is to present the best practices and recent recommendations for a typical EFA in Administration through a practical solution accessible to researchers. In this sense, in addition to discussing current practices versus recommended practices, a tutorial with real data on Factor is illustrated, a software that is still little known in the Administration area, but freeware, easy to use (point and click) and powerful. The step-by-step illustrated in the article, in addition to the discussions raised and an additional example, is also available in the format of tutorial videos. Through the proposed didactic methodology (article-tutorial + video-tutorial), we encourage researchers/methodologists who have mastered a particular technique to do the same. Specifically, about EFA, we hope that the presentation of the Factor software, as a first solution, can transcend the current outdated rules of thumb and heuristics, by making best practices accessible to Administration researchers"
This paper provides a review of three different advanced machine learning algorithms for anomaly detection in continuous data streams from a ground-test firing of a subscale Solid Rocket Motor (SRM). The study compares Orca, one-class support vector machines, and the Inductive Monitoring System (IMS) for anomaly detection on the data streams. We measure the performance of the algorithms with respect to the detection horizon for situations where fault information is available. These algorithms have also been studied by the present authors (and other co-authors) as applied to liquid propulsion systems. The trade space between these algorithms will be explored for both types of propulsion systems.
Objective: The objective of this review is to identify what is known about Indigenous data analysis methods for research. Introduction: Understanding Indigenous data analysis methods for research is crucial in health research with Indigenous participants, to support culturally appropriate interpretation of research data and culturally inclusive analyses in cross-cultural research teams. Inclusion Criteria: This review will consider primary research studies that report on Indigenous data analysis methods for research. Method: Medline (via Ovid SP), PsycINFO (via Ovid SP), Web of Science (Clarivate Analytics), Scopus (Elsevier), the Cumulative Index to Nursing and Allied Health Literature CINAHL (EBSCOhost), ProQuest Central, and ProQuest Social Sciences Premium (Clarivate) will be searched. ProQuest (Theses and Dissertations) will be searched for unpublished material. Studies published from inception onwards and written in English will be assessed for inclusion. Studies meeting the inclusion criteria will be assessed for methodological quality and data will be extracted.
https://www.mordorintelligence.com/privacy-policy
The Data Analytics in Retail Industry market is segmented by Application (Merchandising and Supply Chain Analytics, Social Media Analytics, Customer Analytics, Operational Intelligence, Other Applications), by Business Type (Small and Medium Enterprises, Large-scale Organizations), and by Geography. The market size and forecasts are provided in terms of value (USD billion) for all the above segments.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
With the rapid development of data acquisition and storage, massive datasets with large sample sizes emerge increasingly, creating an urgent need for more advanced statistical tools. To accommodate such big volumes in analysis, a variety of methods have been proposed for complete or right-censored survival data. However, existing development of big data methodology has not attended to interval-censored outcomes, which are ubiquitous in cross-sectional or periodical follow-up studies. In this work, we propose an easily implemented divide-and-combine approach for analyzing massive interval-censored survival data under the additive hazards model. We establish the asymptotic properties of the proposed estimator, including consistency and asymptotic normality. In addition, the divide-and-combine estimator is shown to be asymptotically equivalent to the full-data-based estimator obtained from analyzing all data together. Simulation studies suggest that, relative to the full-data-based approach, the proposed divide-and-combine approach has a desirable advantage in terms of computation time, making it more applicable to large-scale data analysis. An application to a set of interval-censored data also demonstrates the practical utility of the proposed method.
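As a toy illustration of the divide-and-combine idea only (a stand-in: the paper's estimator targets additive-hazards models with interval-censored outcomes, which this sketch does not implement), the split-estimate-average pattern looks like this:

```r
# Divide-and-combine on an ordinary linear model: split the data into K
# chunks, fit each chunk separately, then average the per-chunk estimates.
set.seed(1)
n <- 1e5; K <- 10
x <- rnorm(n); y <- 1 + 2 * x + rnorm(n)

chunks <- split(seq_len(n), rep(1:K, length.out = n))
est    <- sapply(chunks, function(i) coef(lm(y[i] ~ x[i])))

rowMeans(est)   # combined estimator: average of the K per-chunk estimates
```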