In a large network of computers or wireless sensors, each of the components (henceforth, peers) has some data about the global state of the system. Much of the system's functionality, such as message routing, information retrieval, and load sharing, relies on modeling the global state. We refer to the outcome of the function (e.g., the load experienced by each peer) as the model of the system. Since the state of the system is constantly changing, it is necessary to keep the models up to date. Computing global data mining models (e.g., decision trees, k-means clustering) in large distributed systems may be very costly due to the scale of the system and to the potentially high cost of communication. The cost further increases in a dynamic scenario where the data changes rapidly. In this paper we describe a two-step approach for dealing with these costs. First, we describe a highly efficient local algorithm which can be used to monitor a wide class of data mining models. Then, we use this algorithm as a feedback loop for the monitoring of complex functions of the data, such as its k-means clustering. The theoretical claims are corroborated with a thorough experimental analysis.
The Data Mining Tools Market is expected to be valued at $1.24 billion in 2024, with an anticipated expansion at a CAGR of 11.63% to reach $3.73 billion by 2034.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
To characterize each cognitive function per se and to understand the brain as an aggregate of those functions, it is vital to relate dozens of these functions to each other. Knowledge about the relationships among cognitive functions is informative not only for basic neuroscientific research but also for clinical applications and the development of brain-inspired artificial intelligence. In the present study, we propose an exhaustive data mining approach to reveal relationships among cognitive functions based on functional brain mapping and network analysis. We began our analysis with 109 pseudo-activation maps (cognitive function maps; CFM) that were reconstructed from a functional magnetic resonance imaging meta-analysis database, each of which corresponds to one of 109 cognitive functions such as ‘emotion,’ ‘attention,’ ‘episodic memory,’ etc. Based on the resting-state functional connectivity between the CFMs, we mapped the cognitive functions onto a two-dimensional space where related functions were located close to each other, which provided a rough picture of the brain as an aggregate of cognitive functions. Then, we conducted a conceptual analysis of cognitive functions by clustering the voxels in each CFM according to the strength of their connectivity to the other 108 CFMs. As a result, the CFM for each cognitive function was subdivided into several parts, each of which is strongly associated with the CFMs of a subset of the other cognitive functions, yielding sub-concepts (i.e., sub-functions) of the cognitive function. Moreover, we conducted network analysis on the network whose nodes were parcels derived from whole-brain parcellation based on the whole-brain voxel-to-CFM resting-state functional connectivities. Since each parcel is characterized by its associations with the 109 cognitive functions, network analyses using them are expected to be informative about the relationships between cognitive and network characteristics.
Indeed, we found that the informational diversity of interactions between parcels and the density of local connectivity depended on the kinds of associated functions. In addition, we identified network communities that were homogeneous or inhomogeneous with respect to their associated functions. Altogether, these results suggest the effectiveness of our approach, which fuses large-scale meta-analysis of functional brain mapping with the methods of network neuroscience to investigate the relationships among cognitive functions.
Data Mining Tools Market size was valued at USD 915.42 Million in 2024 and is projected to reach USD 2,171.21 Million by 2032, growing at a CAGR of 11.40% from 2026 to 2032.
• Big Data Explosion: Exponential growth in data generation from IoT devices, social media, mobile applications, and digital transactions is creating massive datasets requiring advanced mining tools for analysis. Organizations need sophisticated solutions to extract meaningful insights from structured and unstructured data sources for competitive advantage.
• Digital Transformation Initiatives: Accelerating digital transformation across industries is driving demand for data mining tools that enable data-driven decision making and business intelligence. Companies are investing in analytics capabilities to optimize operations, improve customer experiences, and develop new revenue streams through data monetization strategies.
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 5.92 (USD Billion) |
| MARKET SIZE 2025 | 6.34 (USD Billion) |
| MARKET SIZE 2035 | 12.5 (USD Billion) |
| SEGMENTS COVERED | Application, Deployment Type, End User, Functionality, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | Increasing data complexity, Growing demand for analytics, Rising need for regulatory compliance, Advancements in AI technologies, Enhanced data visualization techniques |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | RapidMiner, Elsevier, IBM, BioStat, Palantir Technologies, Oracle, Tableau, Altair Engineering, Biovia, Microsoft, Wolfram Research, Minitab, Cytel, TIBCO Software, SAS Institute, Qlik |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Growing demand for personalized medicine, Advancements in big data analytics, Increasing use of AI and ML technologies, Rising adoption of cloud-based solutions, Expanding regulatory compliance requirements |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 7.1% (2025 - 2035) |
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 9.0 (USD Billion) |
| MARKET SIZE 2025 | 10.05 (USD Billion) |
| MARKET SIZE 2035 | 30.0 (USD Billion) |
| SEGMENTS COVERED | Application, Deployment Model, End User, Functionality, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | Growing demand for data-driven insights, Increasing adoption of machine learning, Rising need for data visualization tools, Expanding use of big data analytics, Emergence of cloud-based solutions |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | RapidMiner, IBM, Snowflake, TIBCO Software, Datarobot, Oracle, Tableau, Teradata, MathWorks, Microsoft, Cloudera, Google, SAS Institute, Alteryx, Qlik, DataRobot |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Increased demand for AI solutions, Growing importance of big data analytics, Rising adoption of cloud-based tools, Integration of automation technologies, Expanding use cases across industries |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 11.6% (2025 - 2035) |
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 7.26 (USD Billion) |
| MARKET SIZE 2025 | 8.14 (USD Billion) |
| MARKET SIZE 2035 | 25.5 (USD Billion) |
| SEGMENTS COVERED | Deployment Type, Component, Industry, Functionality, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | Growing demand for data-driven insights, Increasing adoption of cloud technologies, Rise in automation across industries, Enhancements in machine learning algorithms, Increased focus on real-time analytics |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | Tableau, Microsoft, Google, Alteryx, Oracle, Domo, TIBCO, SAP, SAS, Qlik, Salesforce, IBM |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Increased demand for predictive analytics, Growing adoption of cloud-based solutions, Integration with IoT devices, Expansion in emerging markets, Enhanced decision-making capabilities. |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 12.1% (2025 - 2035) |
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 5.08 (USD Billion) |
| MARKET SIZE 2025 | 5.61 (USD Billion) |
| MARKET SIZE 2035 | 15.0 (USD Billion) |
| SEGMENTS COVERED | Application, Deployment Type, End User, Functionality, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | Growing data volumes, Increasing AI adoption, Enhanced consumer insights, Competitive differentiation, Rising demand for automation |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | Tableau, Qlik, SAS Institute, Domo, SAP, MicroStrategy, TIBCO Software, Palantir Technologies, Microsoft, Salesforce, Information Builders, Alteryx, IBM, Apache Software Foundation, Sisense, Oracle |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Increased demand for data-driven insights, Growing adoption of AI technologies, Expansion in e-commerce platforms, Rise in personalized marketing strategies, Enhanced need for regulatory compliance. |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 10.4% (2025 - 2035) |
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Experiments conducted for the paper: Stream Clustering Robust to Concept Drift. Please refer to:
Iglesias Vazquez, F., Konzett, S., Zseby, T., & Bifet, A. (2025). Stream Clustering Robust to Concept Drift. In 2025 International Joint Conference on Neural Networks (IJCNN) (pp. 1–10). IEEE. https://doi.org/10.1109/IJCNN64981.2025.11227664
SDOstreamclust is a stream clustering algorithm able to process data incrementally or in batches. It is a combination of the previous SDOstream (anomaly detection in data streams) and SDOclust (static clustering). SDOstreamclust retains the characteristics of SDO algorithms: lightweight, intuitive, self-adjusting, resistant to noise, capable of identifying non-convex clusters, and built on robust parameters and interpretable models. Moreover, it shows excellent adaptation to concept drift.
In this repository, SDOstreamclust is evaluated on 165 datasets (both synthetic and real) and compared with CluStream, DBstream, DenStream, and StreamKMeans.
This repository relates to research in the following domains: algorithm evaluation, stream clustering, unsupervised learning, machine learning, data mining, and streaming data analysis. The datasets and algorithms can be used for experiment replication and for further evaluation and comparison.
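A generic per-batch evaluation loop of the kind used in such comparisons might look as follows. This is only a sketch under stated assumptions: scikit-learn's MiniBatchKMeans stands in for the stream clusterers, ARI is used as the quality metric, and the repository's own evaluation scripts may differ in every detail.

```python
from sklearn.cluster import MiniBatchKMeans
from sklearn.metrics import adjusted_rand_score

def evaluate_per_batch(X, y, batch_size=100, k=3, seed=0):
    """Incrementally fit a clusterer batch by batch and score each
    batch's predictions against the ground-truth labels with ARI."""
    model = MiniBatchKMeans(n_clusters=k, random_state=seed)
    scores = []
    for start in range(0, len(X), batch_size):
        xb = X[start:start + batch_size]
        yb = y[start:start + batch_size]
        model.partial_fit(xb)          # incremental update on this batch
        scores.append(adjusted_rand_score(yb, model.predict(xb)))
    return scores
```

Swapping in a different incremental clusterer only requires that it expose a `partial_fit`/`predict`-style interface.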
Docker
A Docker version is also available in: https://hub.docker.com/r/fiv5/sdostreamclust
Experiments are conducted in Python v3.8.14. The file and folder structure is as follows:
- [algorithms] contains a script with functions related to algorithm configurations.
The CC-BY license applies to all data generated with MDCgen. All distributed code is under the GPLv3+ license.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This zip file contains data used to create figures and tables, describing the results of the paper "Reconstruction of magnetospheric storm-time dynamics using cylindrical basis functions and multi-mission data mining", by N. A. Tsyganenko, V. A. Andreeva, and M. I. Sitnov.
In a large network of computers, wireless sensors, or mobile devices, each of the components (henceforth, peers) has some data about the global status of the system. Many of the functions of the system, such as routing decisions, search strategies, data cleansing, and the assignment of mutual trust, depend on the global status; therefore, it is essential that the system be able to detect, and react to, changes in its global status. Computing global predicates in such systems is usually very costly, mainly because of their scale, and in some cases (e.g., sensor networks) also because of the high cost of communication. The cost further increases when the data changes rapidly (due to state changes, node failure, etc.) and computation has to follow these changes. In this paper we describe a two-step approach for dealing with these costs. First, we describe a highly efficient local algorithm which detects when the L2 norm of the average data surpasses a threshold. Then, we use this algorithm as a feedback loop for the monitoring of complex predicates on the data, such as the data's k-means clustering. The efficiency of the L2 algorithm guarantees that so long as the clustering results represent the data (i.e., the data is stationary), few resources are required. When the data undergoes an epoch change (a change in the underlying distribution) and the model no longer represents it, the feedback loop indicates this and the model is rebuilt. Furthermore, the existence of a feedback loop allows using approximate and "best-effort" methods for constructing the model: if an ill-fitting model is built, the feedback loop indicates so, and the model is rebuilt.
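The feedback-loop control flow described in the abstract can be sketched centrally. This is a toy sketch only: the paper's contribution is a communication-efficient peer-to-peer protocol for the threshold test, whereas here the average is simply computed in one place, and the function names and vector layout are illustrative assumptions.

```python
import numpy as np

def l2_threshold_monitor(local_vectors, threshold):
    """Report whether the L2 norm of the average of the peers'
    local data vectors surpasses the threshold."""
    avg = np.mean(local_vectors, axis=0)
    return float(np.linalg.norm(avg)) > threshold

def monitor_stream(batches, threshold, rebuild_model):
    """Feedback loop: keep the current model while the norm test is
    quiet, and rebuild it only when the threshold is crossed
    (signalling that the model may no longer fit the data)."""
    model = rebuild_model(batches[0])
    rebuilds = 1
    for batch in batches[1:]:
        if l2_threshold_monitor(batch, threshold):
            model = rebuild_model(batch)  # epoch change suspected
            rebuilds += 1
    return model, rebuilds
```

In the paper the monitored vectors would be residuals of the data with respect to the current model, so a quiet monitor means the model still represents the data.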
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
LScDC Word-Category RIG Matrix
April 2020, by Neslihan Suzen, PhD student at the University of Leicester (ns433@leicester.ac.uk / suzenneslihan@hotmail.com)
Supervised by Prof Alexander Gorban and Dr Evgeny Mirkes

Getting Started
This file describes the Word-Category RIG Matrix for the Leicester Scientific Corpus (LSC) [1], the procedure used to build the matrix, and the Leicester Scientific Thesaurus (LScT) together with its construction process. The Word-Category RIG Matrix is a 103,998 by 252 matrix, where rows correspond to words of the Leicester Scientific Dictionary-Core (LScDC) [2] and columns correspond to 252 Web of Science (WoS) categories [3, 4, 5]. Each entry in the matrix corresponds to a pair (category, word); its value shows the Relative Information Gain (RIG) on the belonging of a text from the LSC to the category from observing the word in that text. The CSV file of the Word-Category RIG Matrix in the published archive includes two additional columns: the sum of RIGs over categories and the maximum of RIGs over categories (the last two columns of the matrix). So the file ‘Word-Category RIG Matrix.csv’ contains a total of 254 columns. This matrix was created for future research on quantifying meaning in scientific texts, under the assumption that words have scientifically specific meanings in subject categories and that this meaning can be estimated by the information gained from a word about the categories. The LScT (Leicester Scientific Thesaurus) is a scientific thesaurus of English comprising a list of 5,000 words from the LScDC. We order the words of the LScDC by the sum of their RIGs over categories, i.e., by their informativeness in the scientific corpus LSC; the meaningfulness of a word is thus evaluated by its average informativeness across categories. The most informative 5,000 words are included in the thesaurus.
Words as a Vector of Frequencies in WoS Categories
Each word of the LScDC is represented as a vector of frequencies in WoS categories. Given the collection of LSC texts, each entry of the vector is the number of texts in the corresponding category that contain the word. Note that texts in a corpus do not necessarily belong to a single category; they may correspond to multidisciplinary studies, especially in a corpus of scientific texts, so categories are not exclusive. There are 252 WoS categories, and a text can be assigned to at least 1 and at most 6 categories in the LSC. Using a binary notion of frequency, we record the presence of a word in a category and create a vector of frequencies for each word, whose dimensions are the categories in the corpus. The collection of vectors, over all words and categories in the entire corpus, can be shown as a table in which each entry corresponds to a pair (word, category). This table is built for the LScDC with 252 WoS categories and is presented in the published archive with this file. The value of each entry shows how many times a word of the LScDC appears in a WoS category, where the occurrence of a word in a category is determined by counting the LSC texts in that category which contain the word.

Words as a Vector of Relative Information Gains Extracted for Categories
In this section, we introduce our approach to representing a word as a vector of relative information gains for categories, under the assumption that the meaning of a word can be quantified by the information it provides about the categories. For each category, a function is defined on texts that takes the value 1 if the text belongs to the category, and 0 otherwise. For each word, a function is defined on texts that takes the value 1 if the word belongs to the text, and 0 otherwise. Consider the LSC as a probabilistic sample space (the space of equally probable elementary outcomes). For these Boolean random variables, the joint probability distribution, the entropy, and the information gains are defined. The information gain about the category from the word is the amount of information on the belonging of a text from the LSC to the category obtained from observing the word in the text [6]. We use the Relative Information Gain (RIG), a normalised measure of the information gain, which makes information gains comparable across categories. The calculations of entropy, information gains, and relative information gains can be found in the README file in the published archive. For each word, we create a vector with one component per category, so each word is represented as a vector of relative information gains whose dimension is the number of categories. The set of vectors forms the Word-Category RIG Matrix, in which each column corresponds to a category, each row corresponds to a word, and each entry is the relative information gain from the word to the category. A row vector thus represents the corresponding word as a vector of RIGs over categories, while a column vector holds the RIGs of all words for an individual category. For any chosen category, words can be ordered by their RIGs from the most to the least informative for that category. Words can also be ordered globally by two criteria, the sum and the maximum of their RIGs over categories; the top n words in such a list can be considered the most informative words in scientific texts. For a given word, the sum and maximum of RIGs are calculated from the Word-Category RIG Matrix. RIGs for each word of the LScDC in the 252 categories are calculated and the word vectors are formed; we then form the Word-Category RIG Matrix for the LSC.
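The RIG computation described above can be sketched as follows, treating each text as a pair (set of words, set of categories) and the corpus as a space of equally probable texts, with RIG(category; word) = [H(C) − H(C | W)] / H(C) for the two Boolean variables C and W. The function names and data layout are illustrative assumptions, not the authors' code.

```python
import math

def entropy(p):
    """Shannon entropy (bits) of a Bernoulli(p) variable."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def relative_information_gain(texts, word, category):
    """RIG of `category` from observing `word`. Each text is a pair
    (set_of_words, set_of_categories); texts are equally probable."""
    n = len(texts)
    p_c = sum(category in cats for _, cats in texts) / n
    h_c = entropy(p_c)
    if h_c == 0.0:
        return 0.0  # category carries no uncertainty to reduce
    # Conditional entropy H(C | W) over the two outcomes of W.
    h_c_given_w = 0.0
    for w_val in (True, False):
        subset = [cats for words, cats in texts if (word in words) == w_val]
        if not subset:
            continue
        p_w = len(subset) / n
        p_c_in_subset = sum(category in cats for cats in subset) / len(subset)
        h_c_given_w += p_w * entropy(p_c_in_subset)
    return (h_c - h_c_given_w) / h_c
```

A word that perfectly predicts a category yields RIG 1, and a word carrying no information about it yields RIG 0, matching the normalisation that makes categories comparable.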
For each word, the sum (S) and maximum (M) of RIGs over categories are calculated and appended as the last two columns of the matrix. The Word-Category RIG Matrix for the LScDC with 252 categories, together with these two columns, can be found in the database.

Leicester Scientific Thesaurus (LScT)
The Leicester Scientific Thesaurus (LScT) is a list of 5,000 words from the LScDC [2]. Words of the LScDC are sorted in descending order by the sum (S) of RIGs over categories, and the top 5,000 words are selected for inclusion in the LScT. We consider these 5,000 words the most meaningful words in the scientific corpus: meaningfulness is evaluated by a word's average informativeness across categories, and the resulting list is treated as a ‘thesaurus’ for science. The LScT, with the sum values, is provided as a CSV file in the published archive.

The published archive contains the following files:
1) Word_Category_RIG_Matrix.csv: A 103,998 by 254 matrix whose columns are the 252 WoS categories plus the sum (S) and maximum (M) of RIGs over categories (last two columns), and whose rows are words of the LScDC. Each entry in the first 252 columns is the RIG from the word to the category. Words are ordered as in the LScDC.
2) Word_Category_Frequency_Matrix.csv: A 103,998 by 252 matrix whose columns are the 252 WoS categories and whose rows are words of the LScDC. Each entry is the number of texts in the corresponding category that contain the word. Words are ordered as in the LScDC.
3) LScT.csv: List of words of the LScT with their sum (S) values.
4) Text_No_in_Cat.csv: The number of texts in each category.
5) Categories_in_Documents.csv: List of WoS categories for each document of the LSC.
6) README.txt: Description of the Word-Category RIG Matrix, the Word-Category Frequency Matrix, and the LScT, with their forming procedures.
7) README.pdf: Same as 6), in PDF format.

References
[1] Suzen, Neslihan (2019): LSC (Leicester Scientific Corpus). figshare. Dataset. https://doi.org/10.25392/leicester.data.9449639.v2
[2] Suzen, Neslihan (2019): LScDC (Leicester Scientific Dictionary-Core). figshare. Dataset. https://doi.org/10.25392/leicester.data.9896579.v3
[3] Web of Science. (15 July). Available: https://apps.webofknowledge.com/
[4] WoS Subject Categories. Available: https://images.webofknowledge.com/WOKRS56B5/help/WOS/hp_subject_category_terms_tasca.html
[5] Suzen, N., Mirkes, E. M., & Gorban, A. N. (2019). LScDC-new large scientific dictionary. arXiv preprint arXiv:1912.06858.
[6] Shannon, C. E. (1948). A mathematical theory of communication. Bell System Technical Journal, 27(3), 379-423.
Transcriptional repressor GATA binding 1 (TRPS1) is a newly discovered transcription factor that has been reported in many tumors, but not yet in gastric cancer (GC). In this study, we aimed to explore the clinical significance and biological function of TRPS1 in GC. TRPS1 expression in GC and its relationship with clinicopathological features were analyzed based on public databases and verified by immunohistochemistry and RT-qPCR. Kaplan-Meier survival curves and a Cox regression model were used to estimate the influence of TRPS1 on univariate prognosis and multivariate survival risk factors in GC. The effects of TRPS1 on the malignant biological behaviors of GC cells were studied by CCK8 cell proliferation, scratch, and Transwell assays. The function of TRPS1 was further analyzed by signaling pathway analysis. TRPS1 mRNA expression was up-regulated in GC tissues and was significantly associated with several prognostic factors. Protein expression of TRPS1 in tumor tissues was significantly higher than in paracancerous tissues. Over-expression of TRPS1 was a poor prognostic indicator for GC patients. TRPS1 knockdown inhibited the proliferation, migration, and invasion of GC cells. TRPS1 played an important role in the extracellular matrix and was involved in actin binding and proteoglycans in cancer. The hub genes of TRPS1 (FN1, ITGB1) were identified. TRPS1 may act as a tumor promoter and advance the development of GC by influencing the malignant biological behaviors of GC cells. TRPS1 is expected to be a key diagnostic and prognostic indicator for GC patients.
| BASE YEAR | 2024 |
| HISTORICAL DATA | 2019 - 2023 |
| REGIONS COVERED | North America, Europe, APAC, South America, MEA |
| REPORT COVERAGE | Revenue Forecast, Competitive Landscape, Growth Factors, and Trends |
| MARKET SIZE 2024 | 26.7 (USD Billion) |
| MARKET SIZE 2025 | 28.0 (USD Billion) |
| MARKET SIZE 2035 | 45.0 (USD Billion) |
| SEGMENTS COVERED | Deployment Mode, Functionality, End User, Organization Size, Regional |
| COUNTRIES COVERED | US, Canada, Germany, UK, France, Russia, Italy, Spain, Rest of Europe, China, India, Japan, South Korea, Malaysia, Thailand, Indonesia, Rest of APAC, Brazil, Mexico, Argentina, Rest of South America, GCC, South Africa, Rest of MEA |
| KEY MARKET DYNAMICS | growing demand for data visualization, increasing need for real-time analytics, rise in cloud-based solutions, emphasis on data-driven decision making, integration of AI and machine learning |
| MARKET FORECAST UNITS | USD Billion |
| KEY COMPANIES PROFILED | Sisense, IBM, Domo, BOARD International, Oracle, MicroStrategy, Infor, ThoughtSpot, SAP, Looker, Microsoft, Tableau Software, TIBCO Software, SAS Institute, Alteryx, Qlik, Zoho Corporation |
| MARKET FORECAST PERIOD | 2025 - 2035 |
| KEY MARKET OPPORTUNITIES | Cloud-based analytics solutions expansion, Integration with IoT technologies, Demand for real-time data insights, Adoption of AI-driven analytics, Growth in mobile BI applications |
| COMPOUND ANNUAL GROWTH RATE (CAGR) | 4.9% (2025 - 2035) |
U.S. Government Works: https://www.usa.gov/government-works
License information was derived automatically
In response to NASA SBIR topic A1.05, "Data Mining for Integrated Vehicle Health Management", Michigan Aerospace Corporation (MAC) asserts that our unique SPADE (Sparse Processing Applied to Data Exploitation) technology meets a significant fraction of the stated criteria and has functionality that enables it to handle many applications within the aircraft lifecycle. SPADE distills input data into highly quantized features and uses MAC's novel techniques for constructing ensembles of decision trees to develop extremely accurate diagnostic/prognostic models for classification, regression, clustering, anomaly detection, and semi-supervised learning tasks. These techniques are currently being employed to do threat assessment for satellites in conjunction with researchers at the Air Force Research Lab. Significant advantages to this approach include: 1) completely data driven; 2) training and evaluation are faster than conventional methods; 3) operates effectively on huge datasets (> billion samples × > million features); 4) proven to be as accurate as state-of-the-art techniques in many significant real-world applications. The specific goals for Phase 1 will be to work with domain experts at NASA and with our partners Boeing, SpaceX and GMV Space Systems to delineate a subset of problems that are particularly well-suited to this approach and to determine requirements for deploying algorithms on platforms of opportunity.
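As a rough illustration of the general pattern described above (quantize features coarsely, then train an ensemble of decision trees on the quantized representation), one might sketch the pipeline with scikit-learn stand-ins. SPADE's actual algorithms are proprietary and not shown here; the function name and parameters below are illustrative assumptions.

```python
from sklearn.preprocessing import KBinsDiscretizer
from sklearn.ensemble import RandomForestClassifier

def quantized_forest(X, y, n_bins=8, n_trees=100, seed=0):
    """Coarsely quantize each feature into `n_bins` ordinal levels,
    then fit an ensemble of decision trees on the quantized data."""
    disc = KBinsDiscretizer(n_bins=n_bins, encode="ordinal",
                            strategy="uniform")
    Xq = disc.fit_transform(X)          # highly quantized features
    clf = RandomForestClassifier(n_estimators=n_trees, random_state=seed)
    clf.fit(Xq, y)                      # ensemble of decision trees
    return disc, clf
```

At prediction time, new samples pass through the same discretizer before the forest, so both stages must be kept together.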
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is an extension of a publicly available dataset originally published by Ferenc et al. in their paper: "Ferenc, R.; Hegedus, P.; Gyimesi, P.; Antal, G.; Bán, D.; Gyimóthy, T. Challenging machine learning algorithms in predicting vulnerable javascript functions. 2019 IEEE/ACM 7th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering (RAISE). IEEE, 2019, pp. 8–14." The dataset contained software metrics for source code functions written in the JavaScript (JS) programming language, each labeled as vulnerable or clean. The authors gathered vulnerabilities from publicly available vulnerability databases. In our paper entitled "Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction", cited as "Kalouptsoglou I, Siavvas M, Kehagias D, Chatzigeorgiou A, Ampatzoglou A. Examining the Capacity of Text Mining and Software Metrics in Vulnerability Prediction. Entropy. 2022; 24(5):651. https://doi.org/10.3390/e24050651", we presented an extended version of the dataset by extracting textual features for the labeled JS functions. In particular, we obtained the dataset provided by Ferenc et al. in CSV format and gathered the GitHub URLs of all of the dataset's functions (i.e., methods). Using these URLs, we collected the source code of the corresponding JS files from GitHub. Subsequently, using the start- and end-line information for every function, we extracted the code of each function. Each function was then tokenized to construct a list of tokens per function. To extract text features, we used a text mining technique based on sequences of tokens. As a result, we created a repository with each method's source code, token sequence, and label. To boost the generalizability of type-specific tokens, all comments were eliminated, and all integer and string literals were replaced with two unique IDs.
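A rough sketch of the preprocessing described above (comment removal, literal replacement, tokenization). The regexes and the `__STR__`/`__NUM__` placeholder IDs are illustrative assumptions, not the exact pipeline used in the paper:

```python
import re

def normalize_js(source):
    """Strip comments and replace string/integer literals with two unique IDs."""
    # Remove block comments, then line comments.
    source = re.sub(r"/\*.*?\*/", " ", source, flags=re.S)
    source = re.sub(r"//[^\n]*", " ", source)
    # Replace string literals, then integer literals, with placeholder tokens.
    source = re.sub(r"'[^']*'|\"[^\"]*\"", "__STR__", source)
    source = re.sub(r"\b\d+\b", "__NUM__", source)
    return source

def tokenize(source):
    """Split normalized code into a flat sequence of tokens."""
    return re.findall(r"[A-Za-z_$][\w$]*|[^\s\w]", source)

tokens = tokenize(normalize_js("function f(x) { // add\n return x + 42 + 'hi'; }"))
```

This naive normalizer does not handle every JS literal form (template strings, regex literals, escapes), but it conveys how type-specific concrete values are abstracted away before the token sequences are built.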
The dataset contains 12,106 JavaScript functions, of which 1,493 are considered vulnerable. This dataset was created and utilized during the Vulnerability Prediction Task of the Horizon 2020 IoTAC Project as training and evaluation data for the construction of vulnerability prediction models. The dataset is provided in CSV format. Each row of the CSV file has the following parts:
Label: flag with value '1' for vulnerable and '0' for non-vulnerable methods
Name: the name of the JavaScript method
Longname: the long name of the JavaScript method
Path: the path of the method's file in the repository
Full_repo_path: the GitHub URL of the method's file
TokenX: each subsequent cell holds one token included in the method
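Assuming the five fixed columns come first and the remaining cells hold the token sequence, a row of this CSV could be parsed roughly as follows (the sample rows and field handling are invented for illustration):

```python
import csv
import io

# Two made-up rows in the described layout: label, name, longname, path, repo URL, tokens...
sample = io.StringIO(
    "1,parse,app.js::parse,src/app.js,https://github.com/example/repo/src/app.js,function,parse,(,),{,}\n"
    "0,init,app.js::init,src/app.js,https://github.com/example/repo/src/app.js,function,init,(,),{,}\n"
)

FIXED = ["label", "name", "longname", "path", "full_repo_path"]
functions = []
for row in csv.reader(sample):
    record = dict(zip(FIXED, row[:5]))
    record["tokens"] = [t for t in row[5:] if t]  # remaining cells are the token sequence
    functions.append(record)

vulnerable = sum(1 for r in functions if r["label"] == "1")
```

On the real file, the same loop would recover the 12,106 token sequences and their labels for model training.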
https://www.datainsightsmarket.com/privacy-policy
The Online Analytical Processing (OLAP) tools market is experiencing robust growth, driven by the increasing need for businesses to derive actionable insights from large and complex datasets. The market's expansion is fueled by the widespread adoption of cloud-based solutions, offering scalability and cost-effectiveness compared to on-premise deployments. Furthermore, the rising demand for real-time business intelligence (BI) and advanced analytics capabilities is pushing organizations to invest in sophisticated OLAP tools that enable faster decision-making. Key trends include the integration of artificial intelligence (AI) and machine learning (ML) algorithms into OLAP platforms to automate data analysis and generate predictive insights. The growing adoption of self-service BI tools is also empowering business users to access and analyze data independently, reducing reliance on IT departments. While data security and integration complexities pose challenges, the overall market outlook remains positive, with a projected Compound Annual Growth Rate (CAGR) of approximately 15% from 2025 to 2033. This growth is expected across various segments, including cloud-based OLAP, on-premise OLAP, and industry-specific solutions.
The competitive landscape is characterized by a mix of established players like IBM and Infor, and agile emerging vendors such as AnswerDock and ClicData. The success of these vendors hinges on their ability to deliver innovative solutions that meet the evolving needs of businesses. This includes offering user-friendly interfaces, robust data visualization capabilities, and seamless integration with existing enterprise systems. The market is segmented by deployment type (cloud, on-premise), industry (finance, healthcare, retail), and functionality (reporting, data mining, forecasting).
North America currently holds a significant market share, followed by Europe and Asia-Pacific, but growth is expected to be strong across all regions as businesses globally embrace data-driven decision-making. The continued focus on enhancing data security and improving data governance will be crucial for sustaining the market’s positive trajectory.
Consider a scenario in which the data owner has some private/sensitive data and wants a data miner to access it for studying important patterns without revealing the sensitive information. Privacy preserving data mining aims to solve this problem by randomly transforming the data prior to its release to data miners. Previous work only considered the case of linear data perturbations (additive, multiplicative, or a combination of both) for studying the usefulness of the perturbed output. In this paper, we discuss nonlinear data distortion using potentially nonlinear random data transformations and show how it can be useful for privacy preserving anomaly detection from sensitive datasets. We develop bounds on the expected accuracy of the nonlinear distortion and also quantify privacy using standard definitions. The highlight of this approach is that it allows a user to control the amount of privacy by varying the degree of nonlinearity. We show how our general transformation can be used for anomaly detection in practice for two specific problem instances: a linear model and a popular nonlinear model using the sigmoid function. We also analyze the proposed nonlinear transformation in full generality and then show that for specific cases it is distance preserving. A main contribution of this paper is the discussion of the relationship between the invertibility of a transformation and privacy preservation, and the application of these techniques to outlier detection. Experiments conducted on real-life datasets demonstrate the effectiveness of the approach.
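A minimal sketch of such a sigmoid-based nonlinear perturbation. The Gaussian random projection, the steepness parameter `alpha`, and all data values here are illustrative assumptions, not the paper's exact construction:

```python
import math
import random

def perturb(record, weights, alpha):
    """Randomly project a record and pass each coordinate through a sigmoid.

    The steepness alpha controls the degree of nonlinearity: small alpha keeps
    the map nearly linear, large alpha saturates it and hides magnitudes.
    """
    out = []
    for w_row in weights:
        z = sum(w * x for w, x in zip(w_row, record))
        out.append(1.0 / (1.0 + math.exp(-alpha * z)))
    return out

random.seed(1)
dim_in, dim_out = 3, 3
weights = [[random.gauss(0, 1) for _ in range(dim_in)] for _ in range(dim_out)]

normal_point = [0.1, 0.2, 0.1]   # typical record (invented)
outlier_point = [5.0, -4.0, 6.0]  # anomalous record (invented)

p_normal = perturb(normal_point, weights, alpha=1.0)
p_outlier = perturb(outlier_point, weights, alpha=1.0)
```

Only the perturbed vectors would be released to the data miner; since the sigmoid squashes everything into (0, 1), raw magnitudes are obscured, yet an outlier can still land far from typical records in the transformed space.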