100+ datasets found

Importance of big data analytics and machine learning technologies worldwide...
statista.com
Updated Dec 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Importance of big data analytics and machine learning technologies worldwide 2019 [Dataset]. https://www.statista.com/statistics/919497/worldwide-critical-big-data-analytics-machine-learning-technologies/
Explore at:
Dataset updated
Dec 10, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2019
Area covered
Worldwide
Description
This statistic shows the importance of big data analysis and machine learning technologies worldwide as of 2019. Tensorflow was seen as the most important big data analytics and machine learning technology, with 59 percent of respondents stating that it was important to critial for their organization.
S
Supervised Learning Report
archivemarketresearch.com
doc, pdf, ppt
Updated Mar 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Archive Market Research (2025). Supervised Learning Report [Dataset]. https://www.archivemarketresearch.com/reports/supervised-learning-56899
Explore at:
doc, pdf, pptAvailable download formats
Dataset updated
Mar 14, 2025
Dataset authored and provided by
Archive Market Research
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The supervised learning market is experiencing robust growth, driven by the increasing adoption of artificial intelligence (AI) and machine learning (ML) across various industries. The market, estimated at $20 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033. This substantial growth is fueled by several factors, including the rising volume of data generated by businesses, the need for improved decision-making, and the development of more sophisticated algorithms. The demand for accurate predictions and insights across diverse applications, such as fraud detection, customer segmentation, and risk management, is propelling the market forward. Furthermore, the availability of cloud-based solutions is making supervised learning more accessible and cost-effective for small and medium-sized enterprises (SMEs), contributing significantly to market expansion. The market segmentation reveals strong demand across both enterprise sizes. Large enterprises are driving adoption due to their need for sophisticated analytics and predictive modeling for complex business processes. However, SMEs are rapidly adopting cloud-based supervised learning solutions, simplifying implementation and lowering barriers to entry. The preference between on-premise and cloud-based solutions varies depending on factors such as data security concerns, budget constraints, and IT infrastructure capabilities. Geographic expansion is also noteworthy, with North America and Europe currently holding the largest market shares. However, rapid technological advancement and increasing digitalization in Asia-Pacific regions are expected to fuel significant growth in these markets over the forecast period. The presence of established players like Microsoft, IBM, and Amazon, coupled with the emergence of specialized AI companies like RapidMiner and H2o.AI, fosters innovation and competition, further accelerating market evolution.
L
Large-Scale Model Training Machine Report
datainsightsmarket.com
doc, pdf, ppt
Updated Mar 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AMA Research & Media LLP (2025). Large-Scale Model Training Machine Report [Dataset]. https://www.datainsightsmarket.com/reports/large-scale-model-training-machine-41601
Explore at:
doc, ppt, pdfAvailable download formats
Dataset updated
Mar 16, 2025
Dataset provided by
AMA Research & Media LLP
License
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The Large-Scale Model Training Machine market is experiencing explosive growth, fueled by the increasing demand for advanced artificial intelligence (AI) applications across diverse sectors. The market, estimated at $15 billion in 2025, is projected to witness a robust Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching an estimated $75 billion by 2033. This surge is driven by several factors, including the proliferation of big data, advancements in deep learning algorithms, and the growing need for efficient model training in applications such as natural language processing (NLP), computer vision, and recommendation systems. Key market segments include the Internet, telecommunications, and government sectors, which are heavily investing in AI infrastructure to enhance their services and operational efficiency. The CPU+GPU segment dominates the market due to its superior performance in handling complex computations required for large-scale model training. Leading companies like Google, Amazon, Microsoft, and NVIDIA are at the forefront of innovation, constantly developing more powerful hardware and software solutions to address the evolving needs of this rapidly expanding market. The market's growth trajectory is shaped by several trends. The increasing adoption of cloud-based solutions for model training is significantly lowering the barrier to entry for smaller companies. Simultaneously, the development of specialized hardware like Tensor Processing Units (TPUs) and Field-Programmable Gate Arrays (FPGAs) is further optimizing performance and reducing costs. Despite this positive outlook, challenges remain. High infrastructure costs, the complexity of managing large datasets, and the shortage of skilled AI professionals are significant restraints on the market's expansion. However, ongoing technological advancements and increased investment in AI research are expected to mitigate these challenges, paving the way for sustained growth in the Large-Scale Model Training Machine market. Regional analysis indicates North America and Asia Pacific (particularly China) as the leading markets, with strong growth anticipated in other regions as AI adoption accelerates globally.
i
Data from: Twitter Big Data as a Resource for Exoskeleton Research: A...
ieee-dataport.org
Updated Oct 22, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nirmalya Thakur (2022). Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets and 100 Research Questions [Dataset]. http://doi.org/10.21227/r5mv-ax79
Explore at:
Unique identifier
https://doi.org/10.21227/r5mv-ax79
Dataset updated
Oct 22, 2022
Dataset provided by
IEEE Dataport
Authors
Nirmalya Thakur
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Please cite the following paper when using this dataset:N. Thakur, "Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions", Journal of Analytics, Volume 1, Issue 2, 2022, pp. 72-97, DOI: https://doi.org/10.3390/analytics1020007AbstractThe exoskeleton technology has been rapidly advancing in the recent past due to its multitude of applications and diverse use cases in assisted living, military, healthcare, firefighting, and industry 4.0. The exoskeleton market is projected to increase by multiple times its current value within the next two years. Therefore, it is crucial to study the degree and trends of user interest, views, opinions, perspectives, attitudes, acceptance, feedback, engagement, buying behavior, and satisfaction, towards exoskeletons, for which the availability of Big Data of conversations about exoskeletons is necessary. The Internet of Everything style of today’s living, characterized by people spending more time on the internet than ever before, with a specific focus on social media platforms, holds the potential for the development of such a dataset by the mining of relevant social media conversations. Twitter, one such social media platform, is highly popular amongst all age groups, where the topics found in the conversation paradigms include emerging technologies such as exoskeletons. To address this research challenge, this work makes two scientific contributions to this field. First, it presents an open-access dataset of about 140,000 Tweets about exoskeletons that were posted in a 5-year period from 21 May 2017 to 21 May 2022. Second, based on a comprehensive review of the recent works in the fields of Big Data, Natural Language Processing, Information Retrieval, Data Mining, Pattern Recognition, and Artificial Intelligence that may be applied to relevant Twitter data for advancing research, innovation, and discovery in the field of exoskeleton research, a total of 100 Research Questions are presented for researchers to study, analyze, evaluate, ideate, and investigate based on this dataset.

Data Science Platform Market Analysis North America, Europe, APAC, South...

technavio.com

Updated Feb 13, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Technavio (2025). Data Science Platform Market Analysis North America, Europe, APAC, South America, Middle East and Africa - US, Germany, China, Canada, UK, India, France, Japan, Brazil, UAE - Size and Forecast 2025-2029 [Dataset]. https://www.technavio.com/report/data-science-platform-market-industry-analysis

Explore at:

Dataset updated

Feb 13, 2025

Dataset provided by

TechNavio

Authors

Technavio

Time period covered

2021 - 2025

Area covered

Global, United States, United Kingdom

Description

Snapshot img

Data Science Platform Market Size 2025-2029

The data science platform market size is forecast to increase by USD 763.9 million at a CAGR of 40.2% between 2024 and 2029.

The market is experiencing significant growth, driven by the integration of artificial intelligence (AI) and machine learning (ML). This enhancement enables more advanced data analysis and prediction capabilities, making data science platforms an essential tool for businesses seeking to gain insights from their data. Another trend shaping the market is the emergence of containerization and microservices in platforms. This development offers increased flexibility and scalability, allowing organizations to efficiently manage their projects. 
However, the use of platforms also presents challenges, particularly In the area of data privacy and security. Ensuring the protection of sensitive data is crucial for businesses, and platforms must provide strong security measures to mitigate risks. In summary, the market is witnessing substantial growth due to the integration of AI and ML technologies, containerization, and microservices, while data privacy and security remain key challenges.

What will be the Size of the Data Science Platform Market During the Forecast Period?

Request Free Sample

The market is experiencing significant growth due to the increasing demand for advanced data analysis capabilities in various industries. Cloud-based solutions are gaining popularity as they offer scalability, flexibility, and cost savings. The market encompasses the entire project life cycle, from data acquisition and preparation to model development, training, and distribution. Big data, IoT, multimedia, machine data, consumer data, and business data are prime sources fueling this market's expansion. Unstructured data, previously challenging to process, is now being effectively managed through tools and software. Relational databases and machine learning models are integral components of platforms, enabling data exploration, preprocessing, and visualization.
Moreover, Artificial intelligence (AI) and machine learning (ML) technologies are essential for handling complex workflows, including data cleaning, model development, and model distribution. Data scientists benefit from these platforms by streamlining their tasks, improving productivity, and ensuring accurate and efficient model training. The market is expected to continue its growth trajectory as businesses increasingly recognize the value of data-driven insights.

How is this Data Science Platform Industry segmented and which is the largest segment?

The industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

Deployment

  On-premises
  Cloud


Component

  Platform
  Services


End-user

  BFSI
  Retail and e-commerce
  Manufacturing
  Media and entertainment
  Others


Sector

  Large enterprises
  SMEs


Geography

  North America

    Canada
    US


  Europe

    Germany
    UK
    France


  APAC

    China
    India
    Japan


  South America

    Brazil


  Middle East and Africa

By Deployment Insights

The on-premises segment is estimated to witness significant growth during the forecast period.

On-premises deployment is a traditional method for implementing technology solutions within an organization. This approach involves purchasing software with a one-time license fee and a service contract. On-premises solutions offer enhanced security, as they keep user credentials and data within the company's premises. They can be customized to meet specific business requirements, allowing for quick adaptation. On-premises deployment eliminates the need for third-party providers to manage and secure data, ensuring data privacy and confidentiality. Additionally, it enables rapid and easy data access, and keeps IP addresses and data confidential. This deployment model is particularly beneficial for businesses dealing with sensitive data, such as those in manufacturing and large enterprises. While cloud-based solutions offer flexibility and cost savings, on-premises deployment remains a popular choice for organizations prioritizing data security and control.

Get a glance at the Data Science Platform Industry report of share of various segments. Request Free Sample

The on-premises segment was valued at USD 38.70 million in 2019 and showed a gradual increase during the forecast period.

Regional Analysis

North America is estimated to contribute 48% to the growth of the global market during the forecast period.

Technavio's analysts have elaborately explained the regional trends and drivers that shape the market during the forecast period.

For more insights on the market share of various regions, Request F

TREC 2022 Deep Learning test collection
catalog.data.gov
data.nist.gov
Updated May 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Institute of Standards and Technology (2023). TREC 2022 Deep Learning test collection [Dataset]. https://catalog.data.gov/dataset/trec-2022-deep-learning-test-collection
Explore at:
Dataset updated
May 9, 2023
Dataset provided by
National Institute of Standards and Technologyhttp://www.nist.gov/
Description
This is a test collection for passage and document retrieval, produced in the TREC 2023 Deep Learning track. The Deep Learning Track studies information retrieval in a large training data regime. This is the case where the number of training queries with at least one positive label is at least in the tens of thousands, if not hundreds of thousands or more. This corresponds to real-world scenarios such as training based on click logs and training based on labels from shallow pools (such as the pooling in the TREC Million Query Track or the evaluation of search engines based on early precision).Certain machine learning based methods, such as methods based on deep learning are known to require very large datasets for training. Lack of such large scale datasets has been a limitation for developing such methods for common information retrieval tasks, such as document ranking. The Deep Learning Track organized in the previous years aimed at providing large scale datasets to TREC, and create a focused research effort with a rigorous blind evaluation of ranker for the passage ranking and document ranking tasks.Similar to the previous years, one of the main goals of the track in 2022 is to study what methods work best when a large amount of training data is available. For example, do the same methods that work on small data also work on large data? How much do methods improve when given more training data? What external data and models can be brought in to bear in this scenario, and how useful is it to combine full supervision with other forms of supervision?The collection contains 12 million web pages, 138 million passages from those web pages, search queries, and relevance judgments for the queries.
Deep Learning Market Report by Product Type (Software, Services, Hardware),...
imarcgroup.com
pdf,excel,csv,ppt
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
IMARC Group, Deep Learning Market Report by Product Type (Software, Services, Hardware), Application (Image Recognition, Signal Recognition, Data Mining, and Others), End-Use Industry (Security, Manufacturing, Retail, Automotive, Healthcare, Agriculture, and Others), Architecture (RNN, CNN, DBN, DSN, GRU), and Region 2025-2033 [Dataset]. https://www.imarcgroup.com/deep-learning-market
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset provided by
Imarc Group
Authors
IMARC Group
License
https://www.imarcgroup.com/privacy-policyhttps://www.imarcgroup.com/privacy-policy
Time period covered
2024 - 2032
Area covered
Global
Description
The global deep learning market size reached USD 30.9 Billion in 2024. Looking forward, IMARC Group expects the market to reach USD 423.4 Billion by 2033, exhibiting a growth rate (CAGR) of 29.92% during 2025-2033. The increasing artificial intelligence (AI) adoption, advancements in data processing, the growing demand for image and speech recognition, investments in research and development (R&D), and the introduction of big data and cloud computing technologies are some of the major factors propelling the market.
B
Big Data Analytics Market in Energy Sector Report
marketresearchforecast.com
doc, pdf, ppt
Updated Dec 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Market Research Forecast (2024). Big Data Analytics Market in Energy Sector Report [Dataset]. https://www.marketresearchforecast.com/reports/big-data-analytics-market-in-energy-sector-5888
Explore at:
ppt, doc, pdfAvailable download formats
Dataset updated
Dec 5, 2024
Dataset authored and provided by
Market Research Forecast
License
https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The Big Data Analytics Market in Energy Sector size was valued at USD 9.56 USD Billion in 2023 and is projected to reach USD 13.81 USD Billion by 2032, exhibiting a CAGR of 5.4 % during the forecast period. Big Data Analytics in the energy sector can be defined as the application of sophisticated methods or tools in analyzing vast collections of information that are produced by numerous entities within the energy industry. This process covers descriptive, predictive, and prescriptive analytics to provide valuable information for procedures, costs, and strategies. Real-time analytics, etc are immediate, while predictive analytics focuses on the probability to happen in the future and prescriptive analytics solutions provide recommendations for action. Some of the main characteristics of the data collectors include handling large datasets, compatibility with IoT to stream data, and machine learning features for pattern detection. These can range from grid control and load management to predicting customer demand and equipment reliability and equipment efficiency enhancement. Thus, there is a significant advantage because Big Data Analytics helps global energy companies to increase performance, minimize sick time, and develop effective strategies to meet the necessary legal demands. Key drivers for this market are: Growing Focus on Safety and Organization to Fuel Market Growth. Potential restraints include: Higher Cost of Geotechnical Services to Hinder Market Growth. Notable trends are: Growth of IT Infrastructure to Bolster the Demand for Modern Cable Tray Management Solutions.
m
Data from: Generating Heterogeneous Big Data Set for Healthcare and...
data.mendeley.com
Updated Jan 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Omar Al-Obidi (2023). Generating Heterogeneous Big Data Set for Healthcare and Telemedicine Research Based on ECG, Spo2, Blood Pressure Sensors, and Text Inputs: Data set classified, Analyzed, Organized, And Presented in Excel File Format. [Dataset]. http://doi.org/10.17632/gsmjh55sfy.1
Explore at:
Unique identifier
https://doi.org/10.17632/gsmjh55sfy.1
Dataset updated
Jan 23, 2023
Authors
Omar Al-Obidi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Heterogenous Big dataset is presented in this proposed work: electrocardiogram (ECG) signal, blood pressure signal, oxygen saturation (SpO2) signal, and the text input. This work is an extension version for our relevant formulating of dataset that presented in [1] and a trustworthy and relevant medical dataset library (PhysioNet [2]) was used to acquire these signals. The dataset includes medical features from heterogenous sources (sensory data and non-sensory). Firstly, ECG sensor’s signals which contains QRS width, ST elevation, peak numbers, and cycle interval. Secondly: SpO2 level from SpO2 sensor’s signals. Third, blood pressure sensors’ signals which contain high (systolic) and low (diastolic) values and finally text input which consider non-sensory data. The text inputs were formulated based on doctors diagnosing procedures for heart chronic diseases. Python software environment was used, and the simulated big data is presented along with analyses.
Computer Vision Model Report
archivemarketresearch.com
doc, pdf, ppt
Updated Mar 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AMA Research & Media LLP (2025). Computer Vision Model Report [Dataset]. https://www.archivemarketresearch.com/reports/computer-vision-model-53249
Explore at:
pdf, ppt, docAvailable download formats
Dataset updated
Mar 7, 2025
Dataset provided by
AMA Research & Media
License
https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy
Time period covered
2025 - 2033
Area covered
Global
Variables measured
Market Size
Description
The computer vision market is experiencing robust growth, projected to reach a market size of $556.7 million in 2025, with a compound annual growth rate (CAGR) of 5.1% from 2025 to 2033. This expansion is driven by several key factors. The increasing adoption of AI and machine learning across various sectors, from healthcare and automotive to retail and security, fuels demand for sophisticated computer vision solutions. Advancements in deep learning algorithms and the availability of high-quality image and video data further enhance the accuracy and efficiency of computer vision applications. The market is segmented by deployment model (Software as a Service, Platform as a Service, Infrastructure as a Service) and application (Government, Small and Medium Enterprises, Large Enterprises). The SaaS model currently holds a significant market share due to its scalability and cost-effectiveness, while the government and large enterprise segments are driving significant demand due to their high investment capacity and requirement for advanced security and efficiency solutions. Geographic growth is expected to be broadly distributed, with North America and Europe currently leading the market, followed by a rapidly expanding Asia-Pacific region. The competitive landscape is characterized by a diverse range of companies offering specialized computer vision solutions. Established players are focusing on developing advanced algorithms and expanding their product portfolios to cater to diverse industry needs. Meanwhile, innovative startups are disrupting the market with niche solutions targeting specific applications and sectors. The ongoing technological advancements and increased investment in R&D will continue to shape the future of this dynamic market. Further growth is expected as more businesses recognize the potential of computer vision to enhance operational efficiency, improve decision-making, and create new revenue streams. The increasing integration of computer vision with other technologies, such as IoT and big data analytics, will further broaden the scope and impact of this transformative technology.
f
Table_1_Detecting the Information of Functional Connectivity Networks in...
frontiersin.figshare.com
docx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Wen; Li Dong; Junjie Chen; Jie Xiang; Jie Yang; Hechun Li; Xiaobo Liu; Cheng Luo; Dezhong Yao (2023). Table_1_Detecting the Information of Functional Connectivity Networks in Normal Aging Using Deep Learning From a Big Data Perspective.DOCX [Dataset]. http://doi.org/10.3389/fnins.2019.01435.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fnins.2019.01435.s001
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Xin Wen; Li Dong; Junjie Chen; Jie Xiang; Jie Yang; Hechun Li; Xiaobo Liu; Cheng Luo; Dezhong Yao
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
A resting-state functional connectivity (rsFC)-constructed functional network (FN) derived from functional magnetic resonance imaging (fMRI) data can effectively mine alterations in brain function during aging due to the non-invasive and effective advantages of fMRI. With global health research focusing on aging, several open fMRI datasets have been made available that combine deep learning with big data and are a new, promising trend and open issue for brain information detection in fMRI studies of brain aging. In this study, we proposed a new method based on deep learning from the perspective of big data, named Deep neural network (DNN) with Autoencoder (AE) pretrained Functional connectivity Analysis (DAFA), to deeply mine the important functional connectivity changes in fMRI during brain aging. First, using resting-state fMRI data from 421 subjects from the CamCAN dataset, functional connectivities were calculated using sliding window method, and the complex functional patterns were mined by an AE. Then, to increase the statistical power and reliability of the results, we used an AE-pretrained DNN to relabel the functional connectivities of each subject to classify them as belonging to the attributes of young or old individuals. A method called search-back analysis was performed to find alterations in brain function during aging according to the relabeled functional connectivities. Finally, behavioral data regarding fluid intelligence and response time were used to verify the revealed functional changes. Compared to traditional methods, DAFA revealed additional, important aged-related changes in FC patterns [e.g., FC connections within the default mode (DMN) and the sensorimotor and cingulo-opercular networks, as well as connections between the frontoparietal and cingulo-opercular networks, between the DMN and the frontoparietal/cingulo-opercular/sensorimotor/occipital/cerebellum networks, and between the sensorimotor and frontoparietal/cingulo-opercular networks], which were correlated to behavioral data. These findings demonstrated that the proposed DAFA method was superior to traditional FC-determining methods in discovering changes in brain functional connectivity during aging. In addition, it may be a promising method for exploring important information in other fMRI studies.
o
Replication data for: Big Data: New Tricks for Econometrics
openicpsr.org
Updated May 1, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hal R. Varian (2014). Replication data for: Big Data: New Tricks for Econometrics [Dataset]. http://doi.org/10.3886/E113925V1
Explore at:
Unique identifier
https://doi.org/10.3886/E113925V1
Dataset updated
May 1, 2014
Dataset provided by
American Economic Association
Authors
Hal R. Varian
Time period covered
May 1, 2014
Description
Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by economists.
d
Summary report of the 4th IAEA Technical Meeting on Fusion Data Processing,...
dataone.org
Updated Sep 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
S.M. Gonzalez de Vicente, D. Mazon, M. Xu, S. Pinches, M. Churchill, A. Dinklage, R. Fischer, A. Murari, P. Rodriguez-Fernandez, J. Stillerman, J. Vega, G. Verdoolaege (2024). Summary report of the 4th IAEA Technical Meeting on Fusion Data Processing, Validation and Analysis (FDPVA) [Dataset]. https://dataone.org/datasets/sha256%3A77e8a13f897db6e70537f800ceea9a2c8228a4eb1b33289b7e2c056845298a12
Explore at:
Dataset updated
Sep 24, 2024
Dataset provided by
Harvard Dataverse
Authors
S.M. Gonzalez de Vicente, D. Mazon, M. Xu, S. Pinches, M. Churchill, A. Dinklage, R. Fischer, A. Murari, P. Rodriguez-Fernandez, J. Stillerman, J. Vega, G. Verdoolaege
Description
The objective of the fourth Technical Meeting on Fusion Data Processing, Validation and Analysis was to provide a platform during which a set of topics relevant to fusion data processing, validation and analysis are discussed with the view of extrapolating needs to next step fusion devices such as ITER. The validation and analysis of experimental data obtained from diagnostics used to characterize fusion plasmas are crucial for a knowledge-based understanding of the physical processes governing the dynamics of these plasmas. This paper presents the recent progress and achievements in the domain of plasma diagnostics and synthetic diagnostics data analysis (including image processing, regression analysis, inverse problems, deep learning, machine learning, big data and physics-based models for control) reported at the meeting. The progress in these areas highlight trends observed in current major fusion confinement devices. A special focus is dedicated on data analysis requirements for ITER and DEMO with a particular attention paid to Artificial Intelligence for automatization and improving reliability of control processes.
Data from: Industry-scale Application and Evaluation of Deep Learning for...
zenodo.org
data.niaid.nih.gov
application/gzip
Updated Apr 21, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Noé Sturm; Andreas Mayr; Thanh Le Van; Vladimir Chupakhin; Vladimir Chupakhin; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Nina Jeliazkova; Nina Jeliazkova; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen; Noé Sturm; Andreas Mayr; Thanh Le Van; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen (2020). Industry-scale Application and Evaluation of Deep Learning for Drug Target Prediction [Dataset]. http://doi.org/10.5281/zenodo.3239499
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3239499
Dataset updated
Apr 21, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Noé Sturm; Andreas Mayr; Thanh Le Van; Vladimir Chupakhin; Vladimir Chupakhin; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Nina Jeliazkova; Nina Jeliazkova; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen; Noé Sturm; Andreas Mayr; Thanh Le Van; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Artificial intelligence (AI) is undergoing a revolution thanks to the breakthroughs of machine learning algorithms in computer vision, speech recognition, natural language processing and generative modelling. Recent works on publicly available pharmaceutical data showed that AI methods are highly promising for Drug Target prediction. However, the quality of public data might be different than that of industry data due to different labs reporting measurements, different measurement techniques, fewer samples and less diverse and specialized assays. As part of a European funded project (ExCAPE), that brought together expertise from pharmaceutical industry, machine learning, and high-performance computing, we investigated how well machine learning models obtained from public data can be transferred to internal pharmaceutical industry data. Our results show that machine learning models trained on public data can indeed maintain their predictive power to a large degree when applied to industry data. Moreover, we observed that deep learning derived machine learning models outperformed comparable models, which were trained by other machine learning algorithms, when applied to internal pharmaceutical company datasets. To our knowledge, this is the first large-scale study evaluating the potential of machine learning and especially deep learning directly at the level of industry-scale settings and moreover investigating the transferability of publicly learned target prediction models towards industrial bioactivity prediction pipelines.
The global Graph Analytics market size is USD 2522 million in 2024 and will...
cognitivemarketresearch.com
pdf,excel,csv,ppt
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cognitive Market Research, The global Graph Analytics market size is USD 2522 million in 2024 and will expand at a compound annual growth rate (CAGR) of 34.0% from 2024 to 2031. [Dataset]. https://www.cognitivemarketresearch.com/graph-analytics-market-report
Explore at:
pdf,excel,csv,pptAvailable download formats
Dataset authored and provided by
Cognitive Market Research
License
https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy
Time period covered
2021 - 2033
Area covered
Global
Description
According to Cognitive Market Research, the global Graph Analytics market size will be USD 2522 million in 2024 and will expand at a compound annual growth rate (CAGR) of 34.0% from 2024 to 2031. Market Dynamics of Graph Analytics Market

Key Drivers for Graph Analytics Market

Increasing Recognition of the Advantages of Graph Databases- One of the main reasons for the Graph Analytics market is the increasing recognition of the advantages of graph databases. Unlike traditional relational databases, graph databases excel at handling complex relationships and interconnected data, making them ideal for use cases such as fraud detection, recommendation engines, and social network analysis. Businesses are leveraging these capabilities to uncover insights and patterns that were previously difficult to detect. The rise of big data and the need for real-time analytics are further driving the adoption of graph databases, as they offer enhanced performance and scalability for large-scale data sets. Additionally, advancements in artificial intelligence and machine learning are amplifying the value of graph databases, enabling more sophisticated data modeling and predictive analytics. Growing Uptake of Big Data Tools to Drive the Graph Analytics Market's Expansion in the Years Ahead.

Key Restraints for Graph Analytics Market

Limited Awareness and Understanding pose a serious threat to the Graph Analytics industry. The market also faces significant difficulties related to data security and privacy.

Introduction of the Graph Analytics Market

The Graph Analytics Market is rapidly expanding, driven by the growing need for advanced data analysis techniques in various sectors. Graph analytics leverages graph structures to represent and analyze relationships and dependencies, providing deeper insights than traditional data analysis methods. Key factors propelling this market include the rise of big data, the increasing adoption of artificial intelligence and machine learning, and the demand for real-time data processing. Industries such as finance, healthcare, telecommunications, and retail are major contributors, utilizing graph analytics for fraud detection, personalized recommendations, network optimization, and more. Leading vendors are continually innovating to offer scalable, efficient solutions, incorporating advanced features like graph databases and visualization tools.
m
The RHMCD-20 datasets for Depression and Mental Health Data Analysis with...
data.mendeley.com
Updated Dec 18, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The RHMCD-20 datasets for Depression and Mental Health Data Analysis with Machine Learning [Dataset]. https://data.mendeley.com/datasets/pxjmjyfdh2/1
Explore at:
Unique identifier
https://doi.org/10.17632/pxjmjyfdh2.1
Dataset updated
Dec 18, 2023
Authors
Imrus Salehin
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
the RHMCD-20 dataset, we took care to include information from a wide range of sources, including teenagers from Bangladesh, college students, housewives, professionals from businesses and corporations, and other people.This is survey data for Depression and Mental Health Data Analysis. Survey questions : Age: Represents the age of the participants. Gender: Indicates the gender of the participants. Occupation: Represents the participant's occupations. Days_Indoors :Indicates the number of days the participant has not been out of the house Growing_Stress: Indicates the participant's stress is increasing day by day (Yes/No). Quarantine_Frustration: Frustrations in the first two weeks of quarantine (Yes/Maybe/No). Changes_Habits: Represents major changes in eating habits and sleeping (Yes/Maybe/No). Mental_Health_History : A precedent of mental disorders in the previous generation (Yes/No). Weight_Change :Highlights changes in body weight during quarantine (Yes/Maybe/No) Mood_Swings: Represents extreme mood changes (Low/Medium/High). Coping_Struggles: The inability to cope with daily problems or stress (Yes/Maybe/No). Work_Interest :Represents whether the participant is losing interest in working (Yes/No). Social_Weakness :Conveys feeling mentally weak when interacting with others (Yes/No).
A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and...
zenodo.org
data.niaid.nih.gov
+2more
csv
Updated Jul 20, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nirmalya Thakur; Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian (2024). A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and other sources about the 2024 outbreak of Measles [Dataset]. http://doi.org/10.5281/zenodo.11711230
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.11711230
Dataset updated
Jul 20, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Nirmalya Thakur; Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jun 15, 2024
Area covered
YouTube
Description
Please cite the following paper when using this dataset:

N. Thakur, V. Su, M. Shao, K. Patel, H. Jeong, V. Knieling, and A. Bian “A labelled dataset for sentiment analysis of videos on YouTube, TikTok, and other sources about the 2024 outbreak of measles,” Proceedings of the 26th International Conference on Human-Computer Interaction (HCII 2024), Washington, USA, 29 June - 4 July 2024. (Accepted as a Late Breaking Paper, Preprint Available at: https://doi.org/10.48550/arXiv.2406.07693)

Abstract

This dataset contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder of the websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each of these videos, the URL of the video, title of the post, description of the post, and the date of publication of the video are presented as separate attributes in the dataset. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and video descriptions were performed. This included classifying each video title and video description into (i) one of the sentiment classes i.e. positive, negative, or neutral, (ii) one of the subjectivity classes i.e. highly opinionated, neutral opinionated, or least opinionated, and (iii) one of the fine-grain sentiment classes i.e. fear, surprise, joy, sadness, anger, disgust, or neutral. These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for performing sentiment analysis or subjectivity analysis in this field as well as for other applications. The paper associated with this dataset (please see the above-mentioned citation) also presents a list of open research questions that may be investigated using this dataset.
Global Machine Learning Market Size By Component (Hardware, Software), By...
verifiedmarketresearch.com
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
VERIFIED MARKET RESEARCH (2024). Global Machine Learning Market Size By Component (Hardware, Software), By Enterprise Size (SMEs, Large Enterprises), By End-User (Healthcare, Retail), By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/global-machine-learning-market-size-and-forecast/
Explore at:
Dataset updated
Oct 10, 2024
Dataset provided by
Verified Market Researchhttps://www.verifiedmarketresearch.com/
Authors
VERIFIED MARKET RESEARCH
License
https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/
Time period covered
2024 - 2031
Area covered
Global
Description
Machine Learning Market size was valued at USD 10.24 Billion in 2024 and is projected to reach USD 200.08 Billion by 2031, growing at a CAGR of 10.9% from 2024 to 2031.

Key Market Drivers:

Increasing Data Volume and Complexity: The explosion of digital data is fueling ML adoption across industries. Organizations are leveraging ML to extract insights from vast, complex datasets. According to the European Commission, the volume of data globally is projected to grow from 33 zettabytes in 2018 to 175 zettabytes by 2025. For instance, on September 15, 2023, Google Cloud announced new ML-powered data analytics tools to help enterprises handle increasing data complexity.

Advancements in AI and Deep Learning Algorithms: Continuous improvements in AI algorithms are expanding ML capabilities. Deep learning breakthroughs are enabling more sophisticated applications. The U.S. National Science Foundation reported a 63% increase in AI research publications from 2017 to 2021. For instance, on August 24, 2023, DeepMind unveiled Graphcast, a new ML weather forecasting model achieving unprecedented accuracy.
Importance of data science and machine learning features worldwide 2019
statista.com
Updated Mar 17, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2022). Importance of data science and machine learning features worldwide 2019 [Dataset]. https://www.statista.com/statistics/1053588/data-science-machine-learning-importance-of-features/
Explore at:
Dataset updated
Mar 17, 2022
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2019
Area covered
Worldwide
Description
Support for a wide range of regression models is the most important feature organizations need in data science and machine learning technologies as of 2019, with about 66 percent of respondents reporting this feature to be critical or very important. Hierarchical clustering and textbook statistical functions are also on top of the list.
Big Data as a Service (BDaaS) Market Analysis North...
technavio.com
Updated Dec 20, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Technavio (2023). Big Data as a Service (BDaaS) Market Analysis North America,APAC,Europe,South America,Middle East and Africa - US,Canada,China,Germany,UK - Size and Forecast 2024-2028 [Dataset]. https://www.technavio.com/report/big-data-as-a-service-market-industry-analysis
Explore at:
Dataset updated
Dec 20, 2023
Dataset provided by
TechNavio
Authors
Technavio
Time period covered
2021 - 2025
Area covered
Canada, United States, United Kingdom, Global
Description
Snapshot img

Big Data as a Service Market Size 2024-2028

The big data as a service market size is forecast to increase by USD 41.20 billion at a CAGR of 28.45% between 2023 and 2028.

The market is experiencing significant growth due to the increasing volume of data and the rising demand for advanced data insights. Machine learning algorithms and artificial intelligence are driving product quality and innovation in this sector. Hybrid cloud solutions are gaining popularity, offering the benefits of both private and public cloud platforms for optimal data storage and scalability. Industry standards for data privacy and security are increasingly important, as large amounts of data pose unique risks. The BDaaS market is expected to continue its expansion, providing valuable data insights to businesses across various industries.

What will be the Big Data as a Service Market Size During the Forecast Period?

Request Free Sample

Big Data as a Service (BDaaS) has emerged as a game-changer in the business world, enabling organizations to harness the power of big data without the need for extensive infrastructure and expertise. This service model offers various components such as data management, analytics, and visualization tools, enabling businesses to derive valuable insights from their data. BDaaS encompasses several key components that drive market growth. These include Business Intelligence (BI), Data Science, Data Quality, and Data Security. BI provides organizations with the ability to analyze data and gain insights to make informed decisions. Data Science, on the other hand, focuses on extracting meaningful patterns and trends from large datasets using advanced algorithms. Data Quality is a critical component of BDaaS, ensuring that the data being analyzed is accurate, complete, and consistent. Data Security is another essential aspect, safeguarding sensitive data from cybersecurity threats and data breaches. Moreover, BDaaS offers various data pipelines, enabling seamless data integration and data lifecycle management. Network Analysis, Real-time Analytics, and Predictive Analytics are other essential components, providing businesses with actionable insights in real-time and enabling them to anticipate future trends. Data Mining, Machine Learning Algorithms, and Data Visualization Tools are other essential components of BDaaS.

How is this market segmented and which is the largest segment?

The market research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2024-2028, as well as historical data from 2018-2022 for the following segments.

Type Data analytics-as-a-Service Hadoop-as-a-service Data-as-a-service Deployment Public cloud Hybrid cloud Private cloud Geography North America Canada US APAC China Europe Germany UK South America Middle East and Africa

By Type Insights

The data analytics-as-a-service segment is estimated to witness significant growth during the forecast period.

Big Data as a Service (BDaaS) is a significant market segment, highlighted by the availability of Hadoop-as-a-Service solutions. These offerings enable businesses to access essential datasets on-demand without the burden of expensive infrastructure. DAaaS solutions facilitate real-time data analysis, empowering organizations to make informed decisions. The DAaaS landscape is expanding rapidly as companies acknowledge its value in enhancing internal data. Integrating DAaaS with big data systems amplifies analytics capabilities, creating a vibrant market landscape. Organizations can leverage diverse datasets to gain a competitive edge, driving the growth of the global BDaaS market. In the context of digital transformation, cloud computing, IoT, and 5G technologies, BDaaS solutions offer optimal resource utilization.

However, regulatory scrutiny poses challenges, necessitating stringent data security measures. Retail and other industries stand to benefit significantly from BDaaS, particularly with distributed computing solutions. DAaaS adoption is a strategic investment for businesses seeking to capitalize on the power of external data for valuable insights.

Get a glance at the market report of share of various segments Request Free Sample

The Data analytics-as-a-Service segment was valued at USD 2.59 billion in 2018 and showed a gradual increase during the forecast period.

Regional Analysis

North America is estimated to contribute 35% to the growth of the global market during the forecast period.

Technavio's analysts have elaborately explained the regional trends and drivers that shape the market during the forecast period.

For more insights on the market share of various regions Request Free Sample

Big Data as a Service Market analysis, North America is experiencing signif

Facebook

Twitter

Click to copy link

Link copied

Cite

Statista (2024). Importance of big data analytics and machine learning technologies worldwide 2019 [Dataset]. https://www.statista.com/statistics/919497/worldwide-critical-big-data-analytics-machine-learning-technologies/

Importance of big data analytics and machine learning technologies worldwide 2019

Explore at:

Dataset updated

Dec 10, 2024

Dataset authored and provided by

Statistahttp://statista.com/

Time period covered

2019

Area covered

Worldwide

Description

This statistic shows the importance of big data analysis and machine learning technologies worldwide as of 2019. Tensorflow was seen as the most important big data analytics and machine learning technology, with 59 percent of respondents stating that it was important to critial for their organization.

Clear search

Close search

Google apps

Main menu

Importance of big data analytics and machine learning technologies worldwide...

Supervised Learning Report

Large-Scale Model Training Machine Report

Data from: Twitter Big Data as a Resource for Exoskeleton Research: A...

Data Science Platform Market Analysis North America, Europe, APAC, South...

Snapshot img

TREC 2022 Deep Learning test collection

Deep Learning Market Report by Product Type (Software, Services, Hardware),...

Big Data Analytics Market in Energy Sector Report

Data from: Generating Heterogeneous Big Data Set for Healthcare and...

Computer Vision Model Report

Table_1_Detecting the Information of Functional Connectivity Networks in...

Replication data for: Big Data: New Tricks for Econometrics

Summary report of the 4th IAEA Technical Meeting on Fusion Data Processing,...

Data from: Industry-scale Application and Evaluation of Deep Learning for...

The global Graph Analytics market size is USD 2522 million in 2024 and will...

The RHMCD-20 datasets for Depression and Mental Health Data Analysis with...

A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and...

Global Machine Learning Market Size By Component (Hardware, Software), By...

Importance of data science and machine learning features worldwide 2019

Big Data as a Service (BDaaS) Market Analysis North...

Snapshot img

Importance of big data analytics and machine learning technologies worldwide 2019