The market for Data Analysis Application Solutions is projected to reach $345.8 million by 2033, exhibiting a CAGR of 12.3% during the forecast period (2023-2033). The increasing adoption of cloud-based data analysis solutions, the growing need for data-driven decision-making, and the rising adoption of big data analytics are the key factors propelling market growth. Cloud-based solutions offer reduced IT costs, increased flexibility, and greater accessibility. Businesses increasingly recognize the importance of data in making informed decisions, and data analysis solutions give them the ability to analyze data and gain insights that support better decisions. Big data analytics, in turn, allows businesses to analyze large volumes of data, identifying patterns and trends that would not be detectable with traditional data analysis methods.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Vitamin D insufficiency appears to be prevalent in SLE patients. Multiple factors potentially contribute to lower vitamin D levels, including limited sun exposure, the use of sunscreen, darker skin complexion, aging, obesity, specific medical conditions, and certain medications. The study aims to assess the risk factors associated with low vitamin D levels in SLE patients in the southern part of Bangladesh, a region noted for a high prevalence of SLE. The research additionally investigates the possible correlation between vitamin D and the SLEDAI score, seeking to understand the potential benefits of vitamin D in enhancing disease outcomes for SLE patients. The study incorporates a dataset consisting of 50 patients from the southern part of Bangladesh and evaluates their clinical and demographic data. An initial exploratory data analysis is conducted to gain insights into the data, which includes calculating means and standard deviations, performing correlation analysis, and generating heat maps. Relevant inferential statistical tests, such as the Student’s t-test, are also employed. In the machine learning part of the analysis, this study utilizes supervised learning algorithms, specifically Linear Regression (LR) and Random Forest (RF). To optimize the hyperparameters of the RF model and mitigate the risk of overfitting given the small dataset, a 3-Fold cross-validation strategy is implemented. The study also calculates bootstrapped confidence intervals to provide robust uncertainty estimates and further validate the approach. A comprehensive feature importance analysis is carried out using RF feature importance, permutation-based feature importance, and SHAP values. The LR model yields an RMSE of 4.83 (CI: 2.70, 6.76) and MAE of 3.86 (CI: 2.06, 5.86), whereas the RF model achieves better results, with an RMSE of 2.98 (CI: 2.16, 3.76) and MAE of 2.68 (CI: 1.83, 3.52).
Both models identify Hb, CRP, ESR, and age as significant contributors to vitamin D level predictions. Despite the lack of a significant association between SLEDAI and vitamin D in the statistical analysis, the machine learning models suggest a potential nonlinear dependency of vitamin D on SLEDAI. These findings highlight the importance of these factors in managing vitamin D levels in SLE patients. The study concludes that there is a high prevalence of vitamin D insufficiency in SLE patients. Although a direct linear correlation between the SLEDAI score and vitamin D levels is not observed, machine learning models suggest the possibility of a nonlinear relationship. Furthermore, factors such as Hb, CRP, ESR, and age are identified as more significant in predicting vitamin D levels. Thus, the study suggests that monitoring these factors may be advantageous in managing vitamin D levels in SLE patients. Given the immunological nature of SLE, the potential role of vitamin D in SLE disease activity could be substantial. Therefore, it underscores the need for further large-scale studies to corroborate this hypothesis.
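The tuning-plus-uncertainty procedure described above (3-fold cross-validation followed by bootstrapped confidence intervals) can be sketched as follows. This is a minimal illustration, not the authors' code: the synthetic features stand in for the 50-patient clinical variables (Hb, CRP, ESR, age, SLEDAI), the hyperparameter grid is an assumption, and the bootstrap here resamples in-sample predictions for brevity.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, KFold
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)

# Synthetic stand-in for the 50-patient dataset (columns ~ Hb, CRP, ESR, age, SLEDAI).
n = 50
X = rng.normal(size=(n, 5))
y = 20 + 2.0 * X[:, 0] - 1.5 * X[:, 1] + rng.normal(scale=2.0, size=n)

# 3-fold CV grid search to tune the RF and limit overfitting on a small sample.
grid = GridSearchCV(
    RandomForestRegressor(random_state=0),
    param_grid={"n_estimators": [100, 300], "max_depth": [2, 4, None]},
    cv=KFold(n_splits=3, shuffle=True, random_state=0),
    scoring="neg_root_mean_squared_error",
)
grid.fit(X, y)
best_rf = grid.best_estimator_

# Bootstrapped 95% CI for RMSE: resample (prediction, truth) pairs with replacement.
# (In-sample predictions used here for brevity; held-out predictions would be preferred.)
preds = best_rf.predict(X)
boot_rmse = []
for _ in range(1000):
    idx = rng.integers(0, n, n)
    boot_rmse.append(np.sqrt(mean_squared_error(y[idx], preds[idx])))
lo, hi = np.percentile(boot_rmse, [2.5, 97.5])
print(f"RMSE 95% CI: ({lo:.2f}, {hi:.2f})")
```

The same resampling loop, applied to the LR model's errors, yields the comparable intervals reported for both models.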
The Exploratory Data Analysis (EDA) tools market is experiencing robust growth, driven by the increasing need for businesses to derive actionable insights from their ever-expanding datasets. The market, currently estimated at $15 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 15% from 2025 to 2033, reaching an estimated $45 billion by 2033. This growth is fueled by several factors, including the rising adoption of big data analytics, the proliferation of cloud-based solutions offering enhanced accessibility and scalability, and the growing demand for data-driven decision-making across diverse industries like finance, healthcare, and retail. The market is segmented by application (large enterprises and SMEs) and type (graphical and non-graphical tools), with graphical tools currently holding a larger market share due to their user-friendly interfaces and ability to effectively communicate complex data patterns. Large enterprises are currently the dominant segment, but the SME segment is anticipated to experience faster growth due to increasing affordability and accessibility of EDA solutions. Geographic expansion is another key driver, with North America currently holding the largest market share due to early adoption and a strong technological ecosystem. However, regions like Asia-Pacific are exhibiting high growth potential, fueled by rapid digitalization and a burgeoning data science talent pool. Despite these opportunities, the market faces certain restraints, including the complexity of some EDA tools requiring specialized skills and the challenge of integrating EDA tools with existing business intelligence platforms. Nonetheless, the overall market outlook for EDA tools remains highly positive, driven by ongoing technological advancements and the increasing importance of data analytics across all sectors. 
The competition among established players like IBM Cognos Analytics and Altair RapidMiner, and emerging innovative companies like Polymer Search and KNIME, further fuels market dynamism and innovation.
This statistic illustrates the importance of various data sources for business analytics, compared to the level of access businesses have to those data sources, according to a marketing survey of C-level executives, conducted in ************* by Black Ink. As of *************, product and service usage data was listed as important by ** percent of respondents, but the degree of access to that data was put at ** percent.
This dataset contains the monthly means of ECMWF ERA-40 reanalysis isentropic level analysis data.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
ABSTRACT Experimental statistical procedures, used in almost all scientific papers, are fundamental for clear interpretation of the results of experiments conducted in the agrarian sciences. However, incorrect use of these procedures can lead the researcher to incorrect or incomplete conclusions. Therefore, the aim of this study was to evaluate the characteristics of the experiments and the quality of the use of statistical procedures in soil science in order to promote better use of statistical procedures. For that purpose, 200 articles published between 2010 and 2014, involving only experimentation and studies by sampling in the soil areas of fertility, chemistry, physics, biology, and use and management, were randomly selected. A questionnaire containing 28 questions was used to assess the characteristics of the experiments, the statistical procedures used, and the quality of selection and use of these procedures. Most of the articles evaluated presented data from studies conducted under field conditions, and 27 % of all papers involved studies by sampling. Most studies did not mention testing to verify normality and homoscedasticity, and most used the Tukey test for mean comparisons. Among studies with a factorial structure of treatments, many ignored this structure and compared data assuming the absence of a factorial structure, or decomposed the interaction without showing or mentioning its significance. Almost none of the papers with split-block factorial designs considered the factorial structure, or they treated it as a split-plot design. Among the articles that performed regression analysis, only a few tested non-polynomial fit models, and none reported verifying lack of fit in the regressions. The articles evaluated thus reflected poor, and in some cases incorrect, choices of experimental design and of procedures for statistical analysis.
The modeled data in these archives are in the NetCDF format (https://www.unidata.ucar.edu/software/netcdf/). NetCDF (Network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. It is also a community standard for sharing scientific data. The Unidata Program Center supports and maintains netCDF programming interfaces for C, C++, Java, and Fortran; programming interfaces are also available for Python, IDL, MATLAB, R, Ruby, and Perl. Data in netCDF format is:
• Self-describing. A netCDF file includes information about the data it contains.
• Portable. A netCDF file can be accessed by computers with different ways of storing integers, characters, and floating-point numbers.
• Scalable. Small subsets of large datasets in various formats may be accessed efficiently through netCDF interfaces, even from remote servers.
• Appendable. Data may be appended to a properly structured netCDF file without copying the dataset or redefining its structure.
• Sharable. One writer and multiple readers may simultaneously access the same netCDF file.
• Archivable. Access to all earlier forms of netCDF data will be supported by current and future versions of the software.

Pub_figures.tar.zip contains the NCL scripts for figures 1-5 and the Chesapeake Bay Airshed shapefile. The directory structure of the archive is ./Pub_figures/Fig#_data, where # is the figure number from 1-5.

EMISS.data.tar.zip contains two NetCDF files with the emission totals for the 2011ec and 2040ei emission inventories. The file names contain the year of the inventory, and the file header contains a description of each variable and its units.
EPIC.data.tar.zip contains the monthly mean EPIC data in NetCDF format for ammonium fertilizer application (files with ANH3 in the name) and soil ammonium concentration (files with NH3 in the name) for the historical (Hist directory) and future (RCP-4.5 directory) simulations.

WRF.data.tar.zip contains mean monthly and seasonal data from the 36km downscaled WRF simulations in NetCDF format for the historical (Hist directory) and future (RCP-4.5 directory) simulations.

CMAQ.data.tar.zip contains mean monthly and seasonal data in NetCDF format from the 36km CMAQ simulations for the historical (Hist directory), future (RCP-4.5 directory), and future with historical emissions (RCP-4.5-hist-emiss directory) simulations.

This dataset is associated with the following publication: Campbell, P., J. Bash, C. Nolte, T. Spero, E. Cooter, K. Hinson, and L. Linker. Projections of Atmospheric Nitrogen Deposition to the Chesapeake Bay Watershed. Journal of Geophysical Research - Biogeosciences. American Geophysical Union, Washington, DC, USA, 12(11): 3307-3326, (2019).
Big Data Market Size 2025-2029
The big data market is projected to increase by USD 193.2 billion at a CAGR of 13.3% from 2024 to 2029. A surge in data generation will drive the big data market.
Major Market Trends & Insights
APAC dominated the market and is expected to account for 36% of growth during the forecast period.
By Deployment - On-premises segment was valued at USD 55.30 billion in 2023
By Type - Services segment accounted for the largest market revenue share in 2023
Market Size & Forecast
Market Opportunities: USD 193.04 billion
Market Future Opportunities: USD 193.20 billion
CAGR from 2024 to 2029: 13.3%
Market Summary
In the dynamic realm of business intelligence, the market continues to expand at an unprecedented pace. According to recent estimates, this market is projected to reach a value of USD 274.3 billion by 2022, underscoring its significant impact on modern industries. This growth is driven by several factors, including the increasing volume, variety, and velocity of data generation. Moreover, the adoption of advanced technologies, such as machine learning and artificial intelligence, is enabling businesses to derive valuable insights from their data. Another key trend is the integration of blockchain solutions into big data implementation, enhancing data security and trust.
However, this rapid expansion also presents challenges, such as ensuring data privacy and security, managing data complexity, and addressing the skills gap. Despite these challenges, the future of the market looks promising, with continued innovation and investment in data analytics and management solutions. As businesses increasingly rely on data to drive decision-making and gain a competitive edge, the importance of effective big data strategies will only grow.
What will be the Size of the Big Data Market during the forecast period?
How is the Big Data Market Segmented?
The big data industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.
Deployment
On-premises
Cloud-based
Hybrid
Type
Services
Software
End-user
BFSI
Healthcare
Retail and e-commerce
IT and telecom
Others
Geography
North America
US
Canada
Europe
France
Germany
UK
APAC
Australia
China
India
Japan
South Korea
Rest of World (ROW)
By Deployment Insights
The on-premises segment is estimated to witness significant growth during the forecast period.
In the ever-evolving landscape of data management, the market continues to expand with innovative technologies and solutions. On-premises big data software deployment, a popular choice for many organizations, offers control over hardware and software functions. Despite the high upfront costs for hardware purchases, it eliminates recurring monthly payments, making it a cost-effective alternative for some. However, cloud-based deployment, with its ease of access and flexibility, is increasingly popular, particularly for businesses dealing with high-velocity data ingestion. Cloud deployment, while convenient, comes with its own challenges, such as potential security breaches and the need for companies to manage their servers.
On-premises solutions, on the other hand, provide enhanced security and control, but require significant capital expenditure. Advanced analytics platforms, such as those employing deep learning models, parallel processing, and machine learning algorithms, are transforming data processing and analysis. Metadata management, data lineage tracking, and data versioning control are crucial components of these solutions, ensuring data accuracy and reliability. Data integration platforms, including IoT data integration and ETL process optimization, are essential for seamless data flow between systems. Real-time analytics, data visualization tools, and business intelligence dashboards enable organizations to make data-driven decisions. Data encryption methods, distributed computing, and data lake architectures further enhance data security and scalability.
The On-premises segment was valued at USD 55.30 billion in 2019 and showed a gradual increase during the forecast period.
With the integration of AI-powered insights, natural language processing, and predictive modeling, businesses can unlock valuable insights from their data, improving operational efficiency and driving growth. A recent study reveals that the market is projected to reach USD 274.3 billion by 2022, underscoring its growing importance in today's data-driven economy. This continuous evolution of big data technologies and solutions underscores the need for robust data governance.
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The validity of empirical research often relies upon the accuracy of self-reported behavior and beliefs. Yet, eliciting truthful answers in surveys is challenging, especially when studying sensitive issues such as racial prejudice, corruption, and support for militant groups. List experiments have attracted much attention recently as a potential solution to this measurement problem. Many researchers, however, have used a simple difference-in-means estimator without being able to efficiently examine multivariate relationships between respondents' characteristics and their answers to sensitive items. Moreover, no systematic means exist to investigate the role of underlying assumptions. We fill these gaps by developing a set of new statistical methods for list experiments. We identify the commonly invoked assumptions, propose new multivariate regression estimators, and develop methods to detect and adjust for potential violations of key assumptions. For empirical illustrations, we analyze list experiments concerning racial prejudice. Open-source software is made available to implement the proposed methodology.
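The simple difference-in-means estimator that the authors seek to improve upon can be sketched in a few lines; the item counts and group sizes below are hypothetical, purely for illustration.

```python
import numpy as np

# Hypothetical list-experiment responses: the control group rates J = 3 baseline
# items; the treatment group rates the same 3 plus the sensitive item. Each value
# is the number of items a respondent endorses.
control = np.array([1, 2, 1, 0, 2, 1, 3, 2, 1, 2])
treatment = np.array([2, 2, 1, 1, 3, 2, 3, 2, 2, 3])

# Difference-in-means estimate of the sensitive item's prevalence.
tau_hat = treatment.mean() - control.mean()

# Standard error assuming independent simple random samples.
se = np.sqrt(treatment.var(ddof=1) / len(treatment)
             + control.var(ddof=1) / len(control))
print(f"estimated prevalence: {tau_hat:.2f} (SE {se:.2f})")
```

Under the usual no-design-effect and no-liars assumptions, this difference identifies the population proportion endorsing the sensitive item; the paper's regression estimators generalize it to condition on respondent covariates.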
In-depth proteome exploration of complex body fluids is a challenging task that requires optimal sample preparation and analysis in order to reach novel and meaningful insights. Analysis of follicular fluid is as difficult as that of blood serum due to the ubiquitous presence of several highly abundant proteins and a wide range of protein concentrations. The accessibility of this complex body fluid for liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis is therefore a challenging opportunity to gain insights into physiological status or to identify new diagnostic and prognostic markers, e.g. for the treatment of infertility. We compared different sample preparation methods (FASP, eFASP and in-solution digestion) and three different data analysis software packages (Proteome Discoverer with SEQUEST and Mascot, MaxQuant with Andromeda) in conjunction with semi- and full-tryptic database search approaches in order to obtain maximum coverage of the proteome.
The global Data Analysis Services market is poised for substantial expansion, projected to reach a significant valuation by 2060. Driven by an ever-increasing volume of digital data and the imperative for businesses to extract actionable insights for strategic decision-making, the market is expected to grow at a Compound Annual Growth Rate (CAGR) of 10.2% from 2025 to 2033. This robust growth is fueled by the transformative power of data in optimizing operations, enhancing customer experiences, and identifying new revenue streams across diverse industries. Key applications such as retail are leveraging data analysis for personalized marketing and inventory management, while the medical industry utilizes it for predictive diagnostics and drug discovery. Manufacturing sectors are benefiting from data-driven process optimization and predictive maintenance, further underscoring the broad applicability and essential nature of these services. The increasing adoption of advanced analytics techniques, including AI and machine learning, is a critical factor propelling this market forward, enabling more sophisticated data interpretation and forecasting. The competitive landscape features a blend of established technology giants and specialized analytics firms, all vying to provide cutting-edge solutions. Major players like IBM, Microsoft, Oracle, and SAP are investing heavily in their data analysis platforms and service offerings, while companies such as Accenture, PwC, and SAS Institute are recognized for their consulting and implementation expertise. Trends like the rise of cloud-based analytics, the demand for real-time data processing, and the growing emphasis on data governance and security are shaping the market's trajectory. 
While the potential for significant returns and competitive advantage through data analysis remains a powerful driver, challenges such as data privacy concerns, the scarcity of skilled data professionals, and the cost of implementing sophisticated analytics solutions can act as restraints. Nevertheless, the overarching demand for data-driven insights to navigate an increasingly complex business environment ensures a dynamic and growth-oriented future for the Data Analysis Services market. This report delves into the dynamic global Data Analysis Services market, providing an in-depth analysis from the historical period of 2019-2024 through to an estimated forecast period of 2025-2033. With a base year of 2025, the study meticulously examines market size, growth drivers, challenges, and future trends, offering actionable insights for stakeholders. The projected market value is expected to reach multi-million dollar figures, reflecting the escalating importance of data-driven decision-making across industries.
https://doi.org/10.17026/fp39-0x58
This dataset is an ATLAS.ti copy bundle that contains the analysis of 86 articles that appeared between March 2011 and March 2013 in the Dutch quality newspaper NRC Handelsblad in the weekly article series 'the last word' [Dutch: 'het laatste woord'] that were written by NRC editor Gijsbert van Es. Newspaper texts have been retrieved from LexisNexis (http://academic.lexisnexis.nl/). These articles describe the experience of the last phase of life of people who were confronted with approaching death due to cancer or other life-threatening diseases, or due to old age and age-related health losses. The analysis focuses on the meanings concerning death and dying that were expressed by these people in their last phase of life. The dataset was analysed with ATLAS.ti and contains a codebook. In the memo manager a memo is included that provides information concerning the analysed data. Culturally embedded meanings concerning death and dying have been interpreted as 'death-related cultural affordances': possibilities for perception and action in the face of death that are offered by the cultural environment. These have been grouped into three different 'cultural niches' (sets of mutually supporting cultural affordances) that are grounded in different mechanisms for determining meaning: a canonical niche (grounding meaning in established (religious) authority and tradition), a utilitarian niche (grounding meaning in rationality and utilitarian function) and an expressive niche (grounding meaning in authentic (and often aesthetic) self-expression). Interviews are in Dutch; codes, analysis and metadata are in English.
This is a pickle file that summarizes the results of the feature importance calculations. The notebook for the training part is here.
https://artefacts.ceda.ac.uk/licences/specific_licences/ecmwf-era-products.pdf
This dataset contains ensemble means of ERA5 surface level analysis parameter data (see the linked dataset for spreads). ERA5 is the 5th generation reanalysis project from the European Centre for Medium-Range Weather Forecasts (ECMWF) - see the linked documentation for further details. The ensemble means and spreads are calculated from the ERA5 10-member ensemble, which is run at a reduced resolution compared with the single high-resolution 'HRES' realisation (hourly output at 31 km grid spacing), and have been produced to provide an uncertainty estimate for HRES. This dataset contains a limited selection of the available variables, which have been converted to netCDF from the original GRIB files held on the ECMWF system and translated onto a regular latitude-longitude grid during the extraction process from the ECMWF holdings. For a fuller set of variables, please see the Copernicus Data Store (CDS) data tool linked to from this record.
Note, the ensemble standard deviation is often referred to as the ensemble spread and is calculated as the standard deviation of the 10 members in the ensemble (i.e., including the control). It is not the sample standard deviation: the sum of squared deviations is divided by 10 (N) rather than 9 (N-1). See the linked datasets for ensemble member and ensemble mean data.
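The distinction between the ensemble spread (divide by N = 10) and the sample standard deviation (divide by N - 1 = 9) can be illustrated in a few lines; the member values below are invented, standing in for one variable at one grid point.

```python
import numpy as np

# Ten illustrative ensemble-member values (control + 9 perturbed members).
members = np.array([280.1, 280.4, 279.8, 280.0, 280.6,
                    279.9, 280.2, 280.3, 279.7, 280.5])

# Ensemble spread as described above: population standard deviation,
# i.e. sum of squared deviations divided by N = 10.
spread = members.std(ddof=0)

# Sample standard deviation (divide by N - 1 = 9) -- NOT what this dataset reports.
sample_sd = members.std(ddof=1)

print(f"spread (ddof=0): {spread:.4f}, sample sd (ddof=1): {sample_sd:.4f}")
```

The two differ by a factor of sqrt(10/9), so the spread is always slightly smaller than the sample estimate for the same members.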
The ERA5 global atmospheric reanalysis covers 1979 to 2 months behind the present month. This follows on from the ERA-15, ERA-40 and ERA-Interim reanalysis projects.
An initial release of ERA5 data (ERA5t) is made roughly 5 days behind the present date. These data are subsequently reviewed before being released by ECMWF as quality-assured data within 3 months. CEDA holds a 6-month rolling copy of the latest ERA5t data; see the related datasets linked to from this record. However, for the period 2000-2006 the initial ERA5 release was found to suffer from stratospheric temperature biases, so new runs were performed to address this issue, resulting in the ERA5.1 release (see linked datasets). Note, though, that Simmons et al. 2020 (technical memo 859) report that "ERA5.1 is very close to ERA5 in the lower and middle troposphere"; users of data from this period should read technical memo 859 for further details.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This research is entitled "A Semiotics Analysis Found in the Music Video of You Belong with Me". The aim of this research was to investigate and analyze the verbal and visual signs, and their meanings, in the music video of "You Belong with Me" by Taylor Swift. This was qualitative research. In collecting data, the writer used the methods of observation and documentation, classifying the video into pictures in the form of sequences. The results of this study indicate that the semiotic signs in this music video take two forms: visual displays, in the body language of a video that tells of a male friend whom Swift likes but who already has a lover, and verbal signs, in a paper containing writing used to communicate. Based on the results of the analysis, the signs fall into two classifications, namely verbal signs and visual signs: eight verbal signs and seven visual signs were found. The concept of the music video of You Belong with Me describes someone who is in love with a person who is with a lover who does not appreciate them at all. In the data found, the verbal and visual signs express caring, disappointment, jealousy, and feelings.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset extracted from the post 10 Important Questions on Fundamental Analysis of Stocks – Meaning, Parameters, and Step-by-Step Guide on Smart Investello.
Animal ecologists often collect hierarchically structured data and analyze these with linear mixed-effects models. Specific complications arise when the effect sizes of covariates vary on multiple levels (e.g., within vs. among subjects). Mean-centering of covariates within subjects offers a useful approach in such situations, but is not without problems. A statistical model represents a hypothesis about the underlying biological process. Mean-centering within clusters assumes that the lower-level responses (e.g., within subjects) depend on the deviation from the subject mean (relative values) rather than on absolute values of the covariate. This may or may not be biologically realistic. We show that mismatch between the nature of the generating (i.e., biological) process and the form of the statistical analysis produces major conceptual and operational challenges for empiricists. We explored the consequences of mismatches by simulating data with three response-generating processes differing in the source of correlation between a covariate and the response. These data were then analyzed with three different analysis equations. We asked how robustly different analysis equations estimate key parameters of interest and under which circumstances biases arise. Mismatches between generating and analytical equations created several intractable problems for estimating key parameters. The most widely misestimated parameter was the among-subject variance in response. We found that no single analysis equation was robust in estimating all parameters generated by all equations. Importantly, even when response-generating and analysis equations matched mathematically, bias in some parameters arose when sampling across the range of the covariate was limited. Our results have general implications for how we collect and analyze data.
They also remind us more generally that conclusions from statistical analysis of data are conditional on a hypothesis, sometimes implicit, for the process(es) that generated the attributes we measure. We discuss strategies for real data analysis in face of uncertainty about the underlying biological process.
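The within-subject mean-centering discussed above can be sketched in pandas; the subjects and covariate values below are hypothetical, chosen only to show how the covariate splits into between- and within-subject components.

```python
import pandas as pd

# Toy repeated-measures data: covariate x measured three times per subject.
df = pd.DataFrame({
    "subject": ["a", "a", "a", "b", "b", "b"],
    "x":       [1.0, 2.0, 3.0, 10.0, 11.0, 12.0],
})

# Split x into a between-subject part (the subject mean) and a
# within-subject part (the deviation from that mean). Fitting both as
# predictors lets their effects differ, which is the crux of the paper.
df["x_between"] = df.groupby("subject")["x"].transform("mean")
df["x_within"] = df["x"] - df["x_between"]
print(df)
```

In a mixed model one would then regress the response on `x_between` and `x_within` (plus a random intercept per subject), rather than on raw `x`, whenever the biological hypothesis is that responses track relative rather than absolute covariate values.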
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Abstract Due to the importance of textbooks within the processes of teaching and learning in Mathematics, this article focuses on the tasks proposed for this topic in five textbooks of 1st of Bachillerato. The goal is to identify meanings of the derivative in the textbooks through the proposed tasks. This is quantitative research in which, by means of a cluster analysis, the tasks were grouped according to similarity. The results show that the books emphasize three meanings of the derivative: a procedural-algebraic one, an algorithmic one, and a conceptual-geometric one, all of them dominated by the symbolic representation system and presented exclusively in a mathematical context.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Abstract The research aimed to identify differences in the meanings of work among beauty salon workers, considering their work contracts and the functions performed (hairdressers and manicurists), in a context of pejotização and an internal hierarchy of functions. We applied questionnaires to 171 manicurists and hairdressers with the following types of employment relationship: employee, informal, MEI pejotizado, and MEI não pejotizado. The results indicated that employees perceive work more intensely as a responsibility and as a way of being socially included, and perceive greater proportionality between social and financial retribution. They also indicated that manicurists experience the characteristics of brutalization, discrimination, and demand more intensely.
License: CC0 1.0 Universal (Public Domain) https://creativecommons.org/publicdomain/zero/1.0/
This synthetic dataset simulates 300 global cities across 6 major geographic regions, designed specifically for unsupervised machine learning and clustering analysis. It explores how economic status, environmental quality, infrastructure, and digital access shape urban lifestyles worldwide.
| Property | Details | Notes |
|---|---|---|
| 10 Features | Economic, environmental & social indicators | Realistically scaled |
| 300 Cities | Europe, Asia, Americas, Africa, Oceania | Diverse distributions |
| Strong Correlations | Income ↔ Rent (+0.8), Density ↔ Pollution (+0.6) | ML-ready |
| No Missing Values | Clean, preprocessed data | Ready for analysis |
| 4-5 Natural Clusters | Metropolitan hubs, eco-towns, developing centers | Pre-validated |
✅ Realistic Correlations: Income strongly predicts rent (+0.8), internet access (+0.7), and happiness (+0.6)
✅ Regional Diversity: Each region has distinct economic and environmental characteristics
✅ Clustering-Ready: Naturally separable into 4-5 lifestyle archetypes
✅ Beginner-Friendly: No data cleaning required, includes example code
✅ Documented: Comprehensive README with methodology and use cases
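The advertised correlation strengths are straightforward to sanity-check with pandas once the file is loaded. Since the CSV itself is not included here, the sketch below simulates two columns under hypothetical names (`avg_income`, `avg_rent`) with an income-driven rent relationship of roughly the advertised +0.8 strength:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)
n = 300

# Hypothetical stand-in for the real columns: rent driven by income plus noise,
# mirroring the advertised strong income-rent correlation.
income = rng.normal(50_000, 15_000, n)
rent = 0.02 * income + rng.normal(0, 150, n)
df_sim = pd.DataFrame({"avg_income": income, "avg_rent": rent})

# Pearson correlation matrix; the income-rent entry should land near +0.8-0.9
corr = df_sim.corr()
print(corr.round(2))
```

With the real dataset, the same `df.corr()` call on the numeric columns is how you would confirm the Income ↔ Rent and Density ↔ Pollution figures in the table above.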
```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Load and prepare: drop identifier columns, keep only numeric features for scaling
df = pd.read_csv('city_lifestyle_dataset.csv')
X = df.drop(['city_name', 'country'], axis=1).select_dtypes(include='number')
X_scaled = StandardScaler().fit_transform(X)

# Cluster
kmeans = KMeans(n_clusters=5, random_state=42, n_init=10)
df['cluster'] = kmeans.fit_predict(X_scaled)

# Analyze: per-cluster means of the numeric features
print(df.groupby('cluster').mean(numeric_only=True))
```
After working with this dataset, you will be able to:
1. Apply K-Means, DBSCAN, and Hierarchical Clustering
2. Use PCA for dimensionality reduction and visualization
3. Interpret correlation matrices and feature relationships
4. Create geographic visualizations with cluster assignments
5. Profile and name discovered clusters based on their characteristics
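For the PCA objective, a minimal sketch is shown below. Random stand-in data replaces the real CSV (whose numeric column names beyond `city_name` and `country` are not specified here); with the actual file you would scale the numeric columns and project them the same way:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Random stand-in for the 10 numeric city features; one correlated pair is
# injected so the first principal component carries extra variance.
X = rng.normal(size=(300, 10))
X[:, 1] = 0.8 * X[:, 0] + rng.normal(scale=0.5, size=300)

# Standardize, then project onto the top 2 components for plotting
X_scaled = StandardScaler().fit_transform(X)
pca = PCA(n_components=2)
coords = pca.fit_transform(X_scaled)
print(coords.shape, pca.explained_variance_ratio_.sum().round(3))
```

The resulting 2-D `coords` array is what you would scatter-plot, colored by cluster assignment, to visualize the lifestyle archetypes.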
| Cluster | Characteristics | Example Cities |
|---|---|---|
| Metropolitan Tech Hubs | High income, density, rent | Silicon Valley, Singapore |
| Eco-Friendly Towns | Low density, clean air, high happiness | Nordic cities |
| Developing Centers | Mid income, high density, poor air | Emerging markets |
| Low-Income Suburban | Low infrastructure, income | Rural areas |
| Industrial Mega-Cities | Very high density, pollution | Manufacturing hubs |
Unlike random synthetic data, this dataset was carefully engineered with:
- ✨ Realistic correlation structures based on urban research
- 🌍 Regional characteristics matching real-world patterns
- 🎯 Optimal cluster separability (validated via silhouette scores)
- 📚 Comprehensive documentation and starter code
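Silhouette-based validation of the kind mentioned above can be sketched as follows. `make_blobs` stands in for the dataset, since the claimed 4-5 natural clusters are exactly the structure the score is meant to surface; with the real file you would run the same loop on the scaled feature matrix:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic stand-in approximating the dataset's claimed 4-5 natural clusters
X, _ = make_blobs(n_samples=300, centers=5, cluster_std=1.0, random_state=42)

# Score candidate cluster counts; the best k maximizes the silhouette score
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, random_state=42, n_init=10).fit_predict(X)
    scores[k] = silhouette_score(X, labels)
print({k: round(v, 3) for k, v in scores.items()})
```

Silhouette values range from -1 to 1; picking the k with the highest score is a standard way to confirm the "4-5 natural clusters" claim on the actual data.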
✓ Learn clustering without data cleaning hassles
✓ Practice PCA and dimensionality reduction
✓ Create beautiful geographic visualizations
✓ Understand feature correlation in real-world contexts
✓ Build a portfolio project with clear business insights
This dataset was designed for educational purposes in machine learning and data science. While synthetic, it reflects real patterns observed in global urban development research.
Happy Clustering! 🎉