100+ datasets found
  1. Importance of big data analytics and machine learning technologies worldwide...

    • statista.com
    Updated Dec 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Importance of big data analytics and machine learning technologies worldwide 2019 [Dataset]. https://www.statista.com/statistics/919497/worldwide-critical-big-data-analytics-machine-learning-technologies/
    Explore at:
    Dataset updated
    Dec 10, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2019
    Area covered
    Worldwide
    Description

    This statistic shows the importance of big data analysis and machine learning technologies worldwide as of 2019. Tensorflow was seen as the most important big data analytics and machine learning technology, with 59 percent of respondents stating that it was important to critial for their organization.

  2. S

    Supervised Learning Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archive Market Research (2025). Supervised Learning Report [Dataset]. https://www.archivemarketresearch.com/reports/supervised-learning-56899
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Archive Market Research
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The supervised learning market is experiencing robust growth, driven by the increasing adoption of artificial intelligence (AI) and machine learning (ML) across various industries. The market, estimated at $20 billion in 2025, is projected to witness a Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033. This substantial growth is fueled by several factors, including the rising volume of data generated by businesses, the need for improved decision-making, and the development of more sophisticated algorithms. The demand for accurate predictions and insights across diverse applications, such as fraud detection, customer segmentation, and risk management, is propelling the market forward. Furthermore, the availability of cloud-based solutions is making supervised learning more accessible and cost-effective for small and medium-sized enterprises (SMEs), contributing significantly to market expansion. The market segmentation reveals strong demand across both enterprise sizes. Large enterprises are driving adoption due to their need for sophisticated analytics and predictive modeling for complex business processes. However, SMEs are rapidly adopting cloud-based supervised learning solutions, simplifying implementation and lowering barriers to entry. The preference between on-premise and cloud-based solutions varies depending on factors such as data security concerns, budget constraints, and IT infrastructure capabilities. Geographic expansion is also noteworthy, with North America and Europe currently holding the largest market shares. However, rapid technological advancement and increasing digitalization in Asia-Pacific regions are expected to fuel significant growth in these markets over the forecast period. The presence of established players like Microsoft, IBM, and Amazon, coupled with the emergence of specialized AI companies like RapidMiner and H2o.AI, fosters innovation and competition, further accelerating market evolution.

  3. L

    Large-Scale Model Training Machine Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Mar 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AMA Research & Media LLP (2025). Large-Scale Model Training Machine Report [Dataset]. https://www.datainsightsmarket.com/reports/large-scale-model-training-machine-41601
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Mar 16, 2025
    Dataset provided by
    AMA Research & Media LLP
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Large-Scale Model Training Machine market is experiencing explosive growth, fueled by the increasing demand for advanced artificial intelligence (AI) applications across diverse sectors. The market, estimated at $15 billion in 2025, is projected to witness a robust Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching an estimated $75 billion by 2033. This surge is driven by several factors, including the proliferation of big data, advancements in deep learning algorithms, and the growing need for efficient model training in applications such as natural language processing (NLP), computer vision, and recommendation systems. Key market segments include the Internet, telecommunications, and government sectors, which are heavily investing in AI infrastructure to enhance their services and operational efficiency. The CPU+GPU segment dominates the market due to its superior performance in handling complex computations required for large-scale model training. Leading companies like Google, Amazon, Microsoft, and NVIDIA are at the forefront of innovation, constantly developing more powerful hardware and software solutions to address the evolving needs of this rapidly expanding market. The market's growth trajectory is shaped by several trends. The increasing adoption of cloud-based solutions for model training is significantly lowering the barrier to entry for smaller companies. Simultaneously, the development of specialized hardware like Tensor Processing Units (TPUs) and Field-Programmable Gate Arrays (FPGAs) is further optimizing performance and reducing costs. Despite this positive outlook, challenges remain. High infrastructure costs, the complexity of managing large datasets, and the shortage of skilled AI professionals are significant restraints on the market's expansion. However, ongoing technological advancements and increased investment in AI research are expected to mitigate these challenges, paving the way for sustained growth in the Large-Scale Model Training Machine market. Regional analysis indicates North America and Asia Pacific (particularly China) as the leading markets, with strong growth anticipated in other regions as AI adoption accelerates globally.

  4. i

    Data from: Twitter Big Data as a Resource for Exoskeleton Research: A...

    • ieee-dataport.org
    Updated Oct 22, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirmalya Thakur (2022). Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets and 100 Research Questions [Dataset]. http://doi.org/10.21227/r5mv-ax79
    Explore at:
    Dataset updated
    Oct 22, 2022
    Dataset provided by
    IEEE Dataport
    Authors
    Nirmalya Thakur
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Please cite the following paper when using this dataset:N. Thakur, "Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions", Journal of Analytics, Volume 1, Issue 2, 2022, pp. 72-97, DOI: https://doi.org/10.3390/analytics1020007AbstractThe exoskeleton technology has been rapidly advancing in the recent past due to its multitude of applications and diverse use cases in assisted living, military, healthcare, firefighting, and industry 4.0. The exoskeleton market is projected to increase by multiple times its current value within the next two years. Therefore, it is crucial to study the degree and trends of user interest, views, opinions, perspectives, attitudes, acceptance, feedback, engagement, buying behavior, and satisfaction, towards exoskeletons, for which the availability of Big Data of conversations about exoskeletons is necessary. The Internet of Everything style of today’s living, characterized by people spending more time on the internet than ever before, with a specific focus on social media platforms, holds the potential for the development of such a dataset by the mining of relevant social media conversations. Twitter, one such social media platform, is highly popular amongst all age groups, where the topics found in the conversation paradigms include emerging technologies such as exoskeletons. To address this research challenge, this work makes two scientific contributions to this field. First, it presents an open-access dataset of about 140,000 Tweets about exoskeletons that were posted in a 5-year period from 21 May 2017 to 21 May 2022. Second, based on a comprehensive review of the recent works in the fields of Big Data, Natural Language Processing, Information Retrieval, Data Mining, Pattern Recognition, and Artificial Intelligence that may be applied to relevant Twitter data for advancing research, innovation, and discovery in the field of exoskeleton research, a total of 100 Research Questions are presented for researchers to study, analyze, evaluate, ideate, and investigate based on this dataset.

  5. Data Science Platform Market Analysis North America, Europe, APAC, South...

    • technavio.com
    Updated Feb 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Data Science Platform Market Analysis North America, Europe, APAC, South America, Middle East and Africa - US, Germany, China, Canada, UK, India, France, Japan, Brazil, UAE - Size and Forecast 2025-2029 [Dataset]. https://www.technavio.com/report/data-science-platform-market-industry-analysis
    Explore at:
    Dataset updated
    Feb 13, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Global, United States, United Kingdom
    Description

    Snapshot img

    Data Science Platform Market Size 2025-2029

    The data science platform market size is forecast to increase by USD 763.9 million at a CAGR of 40.2% between 2024 and 2029.

    The market is experiencing significant growth, driven by the integration of artificial intelligence (AI) and machine learning (ML). This enhancement enables more advanced data analysis and prediction capabilities, making data science platforms an essential tool for businesses seeking to gain insights from their data. Another trend shaping the market is the emergence of containerization and microservices in platforms. This development offers increased flexibility and scalability, allowing organizations to efficiently manage their projects. 
    However, the use of platforms also presents challenges, particularly In the area of data privacy and security. Ensuring the protection of sensitive data is crucial for businesses, and platforms must provide strong security measures to mitigate risks. In summary, the market is witnessing substantial growth due to the integration of AI and ML technologies, containerization, and microservices, while data privacy and security remain key challenges.
    

    What will be the Size of the Data Science Platform Market During the Forecast Period?

    Request Free Sample

    The market is experiencing significant growth due to the increasing demand for advanced data analysis capabilities in various industries. Cloud-based solutions are gaining popularity as they offer scalability, flexibility, and cost savings. The market encompasses the entire project life cycle, from data acquisition and preparation to model development, training, and distribution. Big data, IoT, multimedia, machine data, consumer data, and business data are prime sources fueling this market's expansion. Unstructured data, previously challenging to process, is now being effectively managed through tools and software. Relational databases and machine learning models are integral components of platforms, enabling data exploration, preprocessing, and visualization.
    Moreover, Artificial intelligence (AI) and machine learning (ML) technologies are essential for handling complex workflows, including data cleaning, model development, and model distribution. Data scientists benefit from these platforms by streamlining their tasks, improving productivity, and ensuring accurate and efficient model training. The market is expected to continue its growth trajectory as businesses increasingly recognize the value of data-driven insights.
    

    How is this Data Science Platform Industry segmented and which is the largest segment?

    The industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

    Deployment
    
      On-premises
      Cloud
    
    
    Component
    
      Platform
      Services
    
    
    End-user
    
      BFSI
      Retail and e-commerce
      Manufacturing
      Media and entertainment
      Others
    
    
    Sector
    
      Large enterprises
      SMEs
    
    
    Geography
    
      North America
    
        Canada
        US
    
    
      Europe
    
        Germany
        UK
        France
    
    
      APAC
    
        China
        India
        Japan
    
    
      South America
    
        Brazil
    
    
      Middle East and Africa
    

    By Deployment Insights

    The on-premises segment is estimated to witness significant growth during the forecast period.
    

    On-premises deployment is a traditional method for implementing technology solutions within an organization. This approach involves purchasing software with a one-time license fee and a service contract. On-premises solutions offer enhanced security, as they keep user credentials and data within the company's premises. They can be customized to meet specific business requirements, allowing for quick adaptation. On-premises deployment eliminates the need for third-party providers to manage and secure data, ensuring data privacy and confidentiality. Additionally, it enables rapid and easy data access, and keeps IP addresses and data confidential. This deployment model is particularly beneficial for businesses dealing with sensitive data, such as those in manufacturing and large enterprises. While cloud-based solutions offer flexibility and cost savings, on-premises deployment remains a popular choice for organizations prioritizing data security and control.

    Get a glance at the Data Science Platform Industry report of share of various segments. Request Free Sample

    The on-premises segment was valued at USD 38.70 million in 2019 and showed a gradual increase during the forecast period.

    Regional Analysis

    North America is estimated to contribute 48% to the growth of the global market during the forecast period.
    

    Technavio's analysts have elaborately explained the regional trends and drivers that shape the market during the forecast period.

    For more insights on the market share of various regions, Request F

  6. TREC 2022 Deep Learning test collection

    • catalog.data.gov
    • data.nist.gov
    Updated May 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2023). TREC 2022 Deep Learning test collection [Dataset]. https://catalog.data.gov/dataset/trec-2022-deep-learning-test-collection
    Explore at:
    Dataset updated
    May 9, 2023
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    This is a test collection for passage and document retrieval, produced in the TREC 2023 Deep Learning track. The Deep Learning Track studies information retrieval in a large training data regime. This is the case where the number of training queries with at least one positive label is at least in the tens of thousands, if not hundreds of thousands or more. This corresponds to real-world scenarios such as training based on click logs and training based on labels from shallow pools (such as the pooling in the TREC Million Query Track or the evaluation of search engines based on early precision).Certain machine learning based methods, such as methods based on deep learning are known to require very large datasets for training. Lack of such large scale datasets has been a limitation for developing such methods for common information retrieval tasks, such as document ranking. The Deep Learning Track organized in the previous years aimed at providing large scale datasets to TREC, and create a focused research effort with a rigorous blind evaluation of ranker for the passage ranking and document ranking tasks.Similar to the previous years, one of the main goals of the track in 2022 is to study what methods work best when a large amount of training data is available. For example, do the same methods that work on small data also work on large data? How much do methods improve when given more training data? What external data and models can be brought in to bear in this scenario, and how useful is it to combine full supervision with other forms of supervision?The collection contains 12 million web pages, 138 million passages from those web pages, search queries, and relevance judgments for the queries.

  7. Deep Learning Market Report by Product Type (Software, Services, Hardware),...

    • imarcgroup.com
    pdf,excel,csv,ppt
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IMARC Group, Deep Learning Market Report by Product Type (Software, Services, Hardware), Application (Image Recognition, Signal Recognition, Data Mining, and Others), End-Use Industry (Security, Manufacturing, Retail, Automotive, Healthcare, Agriculture, and Others), Architecture (RNN, CNN, DBN, DSN, GRU), and Region 2025-2033 [Dataset]. https://www.imarcgroup.com/deep-learning-market
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset provided by
    Imarc Group
    Authors
    IMARC Group
    License

    https://www.imarcgroup.com/privacy-policyhttps://www.imarcgroup.com/privacy-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    The global deep learning market size reached USD 30.9 Billion in 2024. Looking forward, IMARC Group expects the market to reach USD 423.4 Billion by 2033, exhibiting a growth rate (CAGR) of 29.92% during 2025-2033. The increasing artificial intelligence (AI) adoption, advancements in data processing, the growing demand for image and speech recognition, investments in research and development (R&D), and the introduction of big data and cloud computing technologies are some of the major factors propelling the market.

  8. B

    Big Data Analytics Market in Energy Sector Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Dec 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2024). Big Data Analytics Market in Energy Sector Report [Dataset]. https://www.marketresearchforecast.com/reports/big-data-analytics-market-in-energy-sector-5888
    Explore at:
    ppt, doc, pdfAvailable download formats
    Dataset updated
    Dec 5, 2024
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Big Data Analytics Market in Energy Sector size was valued at USD 9.56 USD Billion in 2023 and is projected to reach USD 13.81 USD Billion by 2032, exhibiting a CAGR of 5.4 % during the forecast period. Big Data Analytics in the energy sector can be defined as the application of sophisticated methods or tools in analyzing vast collections of information that are produced by numerous entities within the energy industry. This process covers descriptive, predictive, and prescriptive analytics to provide valuable information for procedures, costs, and strategies. Real-time analytics, etc are immediate, while predictive analytics focuses on the probability to happen in the future and prescriptive analytics solutions provide recommendations for action. Some of the main characteristics of the data collectors include handling large datasets, compatibility with IoT to stream data, and machine learning features for pattern detection. These can range from grid control and load management to predicting customer demand and equipment reliability and equipment efficiency enhancement. Thus, there is a significant advantage because Big Data Analytics helps global energy companies to increase performance, minimize sick time, and develop effective strategies to meet the necessary legal demands. Key drivers for this market are: Growing Focus on Safety and Organization to Fuel Market Growth. Potential restraints include: Higher Cost of Geotechnical Services to Hinder Market Growth. Notable trends are: Growth of IT Infrastructure to Bolster the Demand for Modern Cable Tray Management Solutions.

  9. m

    Data from: Generating Heterogeneous Big Data Set for Healthcare and...

    • data.mendeley.com
    Updated Jan 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Omar Al-Obidi (2023). Generating Heterogeneous Big Data Set for Healthcare and Telemedicine Research Based on ECG, Spo2, Blood Pressure Sensors, and Text Inputs: Data set classified, Analyzed, Organized, And Presented in Excel File Format. [Dataset]. http://doi.org/10.17632/gsmjh55sfy.1
    Explore at:
    Dataset updated
    Jan 23, 2023
    Authors
    Omar Al-Obidi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Heterogenous Big dataset is presented in this proposed work: electrocardiogram (ECG) signal, blood pressure signal, oxygen saturation (SpO2) signal, and the text input. This work is an extension version for our relevant formulating of dataset that presented in [1] and a trustworthy and relevant medical dataset library (PhysioNet [2]) was used to acquire these signals. The dataset includes medical features from heterogenous sources (sensory data and non-sensory). Firstly, ECG sensor’s signals which contains QRS width, ST elevation, peak numbers, and cycle interval. Secondly: SpO2 level from SpO2 sensor’s signals. Third, blood pressure sensors’ signals which contain high (systolic) and low (diastolic) values and finally text input which consider non-sensory data. The text inputs were formulated based on doctors diagnosing procedures for heart chronic diseases. Python software environment was used, and the simulated big data is presented along with analyses.

  10. Computer Vision Model Report

    • archivemarketresearch.com
    doc, pdf, ppt
    Updated Mar 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AMA Research & Media LLP (2025). Computer Vision Model Report [Dataset]. https://www.archivemarketresearch.com/reports/computer-vision-model-53249
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Mar 7, 2025
    Dataset provided by
    AMA Research & Media
    License

    https://www.archivemarketresearch.com/privacy-policyhttps://www.archivemarketresearch.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The computer vision market is experiencing robust growth, projected to reach a market size of $556.7 million in 2025, with a compound annual growth rate (CAGR) of 5.1% from 2025 to 2033. This expansion is driven by several key factors. The increasing adoption of AI and machine learning across various sectors, from healthcare and automotive to retail and security, fuels demand for sophisticated computer vision solutions. Advancements in deep learning algorithms and the availability of high-quality image and video data further enhance the accuracy and efficiency of computer vision applications. The market is segmented by deployment model (Software as a Service, Platform as a Service, Infrastructure as a Service) and application (Government, Small and Medium Enterprises, Large Enterprises). The SaaS model currently holds a significant market share due to its scalability and cost-effectiveness, while the government and large enterprise segments are driving significant demand due to their high investment capacity and requirement for advanced security and efficiency solutions. Geographic growth is expected to be broadly distributed, with North America and Europe currently leading the market, followed by a rapidly expanding Asia-Pacific region. The competitive landscape is characterized by a diverse range of companies offering specialized computer vision solutions. Established players are focusing on developing advanced algorithms and expanding their product portfolios to cater to diverse industry needs. Meanwhile, innovative startups are disrupting the market with niche solutions targeting specific applications and sectors. The ongoing technological advancements and increased investment in R&D will continue to shape the future of this dynamic market. Further growth is expected as more businesses recognize the potential of computer vision to enhance operational efficiency, improve decision-making, and create new revenue streams. The increasing integration of computer vision with other technologies, such as IoT and big data analytics, will further broaden the scope and impact of this transformative technology.

  11. f

    Table_1_Detecting the Information of Functional Connectivity Networks in...

    • frontiersin.figshare.com
    docx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Wen; Li Dong; Junjie Chen; Jie Xiang; Jie Yang; Hechun Li; Xiaobo Liu; Cheng Luo; Dezhong Yao (2023). Table_1_Detecting the Information of Functional Connectivity Networks in Normal Aging Using Deep Learning From a Big Data Perspective.DOCX [Dataset]. http://doi.org/10.3389/fnins.2019.01435.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Frontiers
    Authors
    Xin Wen; Li Dong; Junjie Chen; Jie Xiang; Jie Yang; Hechun Li; Xiaobo Liu; Cheng Luo; Dezhong Yao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A resting-state functional connectivity (rsFC)-constructed functional network (FN) derived from functional magnetic resonance imaging (fMRI) data can effectively mine alterations in brain function during aging due to the non-invasive and effective advantages of fMRI. With global health research focusing on aging, several open fMRI datasets have been made available that combine deep learning with big data and are a new, promising trend and open issue for brain information detection in fMRI studies of brain aging. In this study, we proposed a new method based on deep learning from the perspective of big data, named Deep neural network (DNN) with Autoencoder (AE) pretrained Functional connectivity Analysis (DAFA), to deeply mine the important functional connectivity changes in fMRI during brain aging. First, using resting-state fMRI data from 421 subjects from the CamCAN dataset, functional connectivities were calculated using sliding window method, and the complex functional patterns were mined by an AE. Then, to increase the statistical power and reliability of the results, we used an AE-pretrained DNN to relabel the functional connectivities of each subject to classify them as belonging to the attributes of young or old individuals. A method called search-back analysis was performed to find alterations in brain function during aging according to the relabeled functional connectivities. Finally, behavioral data regarding fluid intelligence and response time were used to verify the revealed functional changes. Compared to traditional methods, DAFA revealed additional, important aged-related changes in FC patterns [e.g., FC connections within the default mode (DMN) and the sensorimotor and cingulo-opercular networks, as well as connections between the frontoparietal and cingulo-opercular networks, between the DMN and the frontoparietal/cingulo-opercular/sensorimotor/occipital/cerebellum networks, and between the sensorimotor and frontoparietal/cingulo-opercular networks], which were correlated to behavioral data. These findings demonstrated that the proposed DAFA method was superior to traditional FC-determining methods in discovering changes in brain functional connectivity during aging. In addition, it may be a promising method for exploring important information in other fMRI studies.

  12. o

    Replication data for: Big Data: New Tricks for Econometrics

    • openicpsr.org
    Updated May 1, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hal R. Varian (2014). Replication data for: Big Data: New Tricks for Econometrics [Dataset]. http://doi.org/10.3886/E113925V1
    Explore at:
    Dataset updated
    May 1, 2014
    Dataset provided by
    American Economic Association
    Authors
    Hal R. Varian
    Time period covered
    May 1, 2014
    Description

    Computers are now involved in many economic transactions and can capture data associated with these transactions, which can then be manipulated and analyzed. Conventional statistical and econometric techniques such as regression often work well, but there are issues unique to big datasets that may require different tools. First, the sheer size of the data involved may require more powerful data manipulation tools. Second, we may have more potential predictors than appropriate for estimation, so we need to do some kind of variable selection. Third, large datasets may allow for more flexible relationships than simple linear models. Machine learning techniques such as decision trees, support vector machines, neural nets, deep learning, and so on may allow for more effective ways to model complex relationships. In this essay, I will describe a few of these tools for manipulating and analyzing big data. I believe that these methods have a lot to offer and should be more widely known and used by economists.

  13. d

    Summary report of the 4th IAEA Technical Meeting on Fusion Data Processing,...

    • dataone.org
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    S.M. Gonzalez de Vicente, D. Mazon, M. Xu, S. Pinches, M. Churchill, A. Dinklage, R. Fischer, A. Murari, P. Rodriguez-Fernandez, J. Stillerman, J. Vega, G. Verdoolaege (2024). Summary report of the 4th IAEA Technical Meeting on Fusion Data Processing, Validation and Analysis (FDPVA) [Dataset]. https://dataone.org/datasets/sha256%3A77e8a13f897db6e70537f800ceea9a2c8228a4eb1b33289b7e2c056845298a12
    Explore at:
    Dataset updated
    Sep 24, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    S.M. Gonzalez de Vicente, D. Mazon, M. Xu, S. Pinches, M. Churchill, A. Dinklage, R. Fischer, A. Murari, P. Rodriguez-Fernandez, J. Stillerman, J. Vega, G. Verdoolaege
    Description

    The objective of the fourth Technical Meeting on Fusion Data Processing, Validation and Analysis was to provide a platform during which a set of topics relevant to fusion data processing, validation and analysis are discussed with the view of extrapolating needs to next step fusion devices such as ITER. The validation and analysis of experimental data obtained from diagnostics used to characterize fusion plasmas are crucial for a knowledge-based understanding of the physical processes governing the dynamics of these plasmas. This paper presents the recent progress and achievements in the domain of plasma diagnostics and synthetic diagnostics data analysis (including image processing, regression analysis, inverse problems, deep learning, machine learning, big data and physics-based models for control) reported at the meeting. The progress in these areas highlight trends observed in current major fusion confinement devices. A special focus is dedicated on data analysis requirements for ITER and DEMO with a particular attention paid to Artificial Intelligence for automatization and improving reliability of control processes.

  14. Data from: Industry-scale Application and Evaluation of Deep Learning for...

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated Apr 21, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Noé Sturm; Andreas Mayr; Thanh Le Van; Vladimir Chupakhin; Vladimir Chupakhin; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Nina Jeliazkova; Nina Jeliazkova; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen; Noé Sturm; Andreas Mayr; Thanh Le Van; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen (2020). Industry-scale Application and Evaluation of Deep Learning for Drug Target Prediction [Dataset]. http://doi.org/10.5281/zenodo.3239499
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Apr 21, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Noé Sturm; Andreas Mayr; Thanh Le Van; Vladimir Chupakhin; Vladimir Chupakhin; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Nina Jeliazkova; Nina Jeliazkova; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen; Noé Sturm; Andreas Mayr; Thanh Le Van; Hugo Ceulemans; Joerg Wegner; Jose-Felipe Golib-Dzib; Yves Vandriessche; Stanislav Bohm; Vojtech Cima; Jan Martinovic; Nigel Greene; Tom Vander Aa; Thomas J. Ashby; Sepp Hochreiter; Ola Engkvist; Günter Klambauer; Hongming Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Artificial intelligence (AI) is undergoing a revolution thanks to the breakthroughs of machine learning algorithms in computer vision, speech recognition, natural language processing and generative modelling. Recent works on publicly available pharmaceutical data showed that AI methods are highly promising for Drug Target prediction. However, the quality of public data might be different than that of industry data due to different labs reporting measurements, different measurement techniques, fewer samples and less diverse and specialized assays. As part of a European funded project (ExCAPE), that brought together expertise from pharmaceutical industry, machine learning, and high-performance computing, we investigated how well machine learning models obtained from public data can be transferred to internal pharmaceutical industry data. Our results show that machine learning models trained on public data can indeed maintain their predictive power to a large degree when applied to industry data. Moreover, we observed that deep learning derived machine learning models outperformed comparable models, which were trained by other machine learning algorithms, when applied to internal pharmaceutical company datasets. To our knowledge, this is the first large-scale study evaluating the potential of machine learning and especially deep learning directly at the level of industry-scale settings and moreover investigating the transferability of publicly learned target prediction models towards industrial bioactivity prediction pipelines.

  15. The global Graph Analytics market size is USD 2522 million in 2024 and will...

    • cognitivemarketresearch.com
    pdf,excel,csv,ppt
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cognitive Market Research, The global Graph Analytics market size is USD 2522 million in 2024 and will expand at a compound annual growth rate (CAGR) of 34.0% from 2024 to 2031. [Dataset]. https://www.cognitivemarketresearch.com/graph-analytics-market-report
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset authored and provided by
    Cognitive Market Research
    License

    https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy

    Time period covered
    2021 - 2033
    Area covered
    Global
    Description

    According to Cognitive Market Research, the global Graph Analytics market size will be USD 2522 million in 2024 and will expand at a compound annual growth rate (CAGR) of 34.0% from 2024 to 2031. Market Dynamics of Graph Analytics Market

    Key Drivers for Graph Analytics Market

    Increasing Recognition of the Advantages of Graph Databases- One of the main reasons for the Graph Analytics market is the increasing recognition of the advantages of graph databases. Unlike traditional relational databases, graph databases excel at handling complex relationships and interconnected data, making them ideal for use cases such as fraud detection, recommendation engines, and social network analysis. Businesses are leveraging these capabilities to uncover insights and patterns that were previously difficult to detect. The rise of big data and the need for real-time analytics are further driving the adoption of graph databases, as they offer enhanced performance and scalability for large-scale data sets. Additionally, advancements in artificial intelligence and machine learning are amplifying the value of graph databases, enabling more sophisticated data modeling and predictive analytics.
    Growing Uptake of Big Data Tools to Drive the Graph Analytics Market's Expansion in the Years Ahead.
    

    Key Restraints for Graph Analytics Market

    Limited Awareness and Understanding pose a serious threat to the Graph Analytics industry.
    The market also faces significant difficulties related to data security and privacy.
    

    Introduction of the Graph Analytics Market

    The Graph Analytics Market is rapidly expanding, driven by the growing need for advanced data analysis techniques in various sectors. Graph analytics leverages graph structures to represent and analyze relationships and dependencies, providing deeper insights than traditional data analysis methods. Key factors propelling this market include the rise of big data, the increasing adoption of artificial intelligence and machine learning, and the demand for real-time data processing. Industries such as finance, healthcare, telecommunications, and retail are major contributors, utilizing graph analytics for fraud detection, personalized recommendations, network optimization, and more. Leading vendors are continually innovating to offer scalable, efficient solutions, incorporating advanced features like graph databases and visualization tools.

  16. m

    The RHMCD-20 datasets for Depression and Mental Health Data Analysis with...

    • data.mendeley.com
    Updated Dec 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The RHMCD-20 datasets for Depression and Mental Health Data Analysis with Machine Learning [Dataset]. https://data.mendeley.com/datasets/pxjmjyfdh2/1
    Explore at:
    Dataset updated
    Dec 18, 2023
    Authors
    Imrus Salehin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    the RHMCD-20 dataset, we took care to include information from a wide range of sources, including teenagers from Bangladesh, college students, housewives, professionals from businesses and corporations, and other people.This is survey data for Depression and Mental Health Data Analysis. Survey questions : Age: Represents the age of the participants. Gender: Indicates the gender of the participants. Occupation: Represents the participant's occupations. Days_Indoors :Indicates the number of days the participant has not been out of the house Growing_Stress: Indicates the participant's stress is increasing day by day (Yes/No). Quarantine_Frustration: Frustrations in the first two weeks of quarantine (Yes/Maybe/No). Changes_Habits: Represents major changes in eating habits and sleeping (Yes/Maybe/No). Mental_Health_History : A precedent of mental disorders in the previous generation (Yes/No). Weight_Change :Highlights changes in body weight during quarantine (Yes/Maybe/No) Mood_Swings: Represents extreme mood changes (Low/Medium/High). Coping_Struggles: The inability to cope with daily problems or stress (Yes/Maybe/No). Work_Interest :Represents whether the participant is losing interest in working (Yes/No). Social_Weakness :Conveys feeling mentally weak when interacting with others (Yes/No).

  17. A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and...

    • zenodo.org
    • data.niaid.nih.gov
    • +2more
    csv
    Updated Jul 20, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirmalya Thakur; Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian (2024). A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and other sources about the 2024 outbreak of Measles [Dataset]. http://doi.org/10.5281/zenodo.11711230
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 20, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nirmalya Thakur; Nirmalya Thakur; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian; Vanessa Su; Mingchen Shao; Kesha A. Patel; Hongseok Jeong; Victoria Knieling; Andrew Bian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jun 15, 2024
    Area covered
    YouTube
    Description

    Please cite the following paper when using this dataset:

    N. Thakur, V. Su, M. Shao, K. Patel, H. Jeong, V. Knieling, and A. Bian “A labelled dataset for sentiment analysis of videos on YouTube, TikTok, and other sources about the 2024 outbreak of measles,” Proceedings of the 26th International Conference on Human-Computer Interaction (HCII 2024), Washington, USA, 29 June - 4 July 2024. (Accepted as a Late Breaking Paper, Preprint Available at: https://doi.org/10.48550/arXiv.2406.07693)

    Abstract

    This dataset contains the data of 4011 videos about the ongoing outbreak of measles published on 264 websites on the internet between January 1, 2024, and May 31, 2024. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remainder of the websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each of these videos, the URL of the video, title of the post, description of the post, and the date of publication of the video are presented as separate attributes in the dataset. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and video descriptions were performed. This included classifying each video title and video description into (i) one of the sentiment classes i.e. positive, negative, or neutral, (ii) one of the subjectivity classes i.e. highly opinionated, neutral opinionated, or least opinionated, and (iii) one of the fine-grain sentiment classes i.e. fear, surprise, joy, sadness, anger, disgust, or neutral. These results are presented as separate attributes in the dataset for the training and testing of machine learning algorithms for performing sentiment analysis or subjectivity analysis in this field as well as for other applications. The paper associated with this dataset (please see the above-mentioned citation) also presents a list of open research questions that may be investigated using this dataset.

  18. Global Machine Learning Market Size By Component (Hardware, Software), By...

    • verifiedmarketresearch.com
    Updated Oct 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2024). Global Machine Learning Market Size By Component (Hardware, Software), By Enterprise Size (SMEs, Large Enterprises), By End-User (Healthcare, Retail), By Geographic Scope And Forecast [Dataset]. https://www.verifiedmarketresearch.com/product/global-machine-learning-market-size-and-forecast/
    Explore at:
    Dataset updated
    Oct 10, 2024
    Dataset provided by
    Verified Market Researchhttps://www.verifiedmarketresearch.com/
    Authors
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2024 - 2031
    Area covered
    Global
    Description

    Machine Learning Market size was valued at USD 10.24 Billion in 2024 and is projected to reach USD 200.08 Billion by 2031, growing at a CAGR of 10.9% from 2024 to 2031.

    Key Market Drivers:

    Increasing Data Volume and Complexity: The explosion of digital data is fueling ML adoption across industries. Organizations are leveraging ML to extract insights from vast, complex datasets. According to the European Commission, the volume of data globally is projected to grow from 33 zettabytes in 2018 to 175 zettabytes by 2025. For instance, on September 15, 2023, Google Cloud announced new ML-powered data analytics tools to help enterprises handle increasing data complexity.

    Advancements in AI and Deep Learning Algorithms: Continuous improvements in AI algorithms are expanding ML capabilities. Deep learning breakthroughs are enabling more sophisticated applications. The U.S. National Science Foundation reported a 63% increase in AI research publications from 2017 to 2021. For instance, on August 24, 2023, DeepMind unveiled Graphcast, a new ML weather forecasting model achieving unprecedented accuracy.

  19. Importance of data science and machine learning features worldwide 2019

    • statista.com
    Updated Mar 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2022). Importance of data science and machine learning features worldwide 2019 [Dataset]. https://www.statista.com/statistics/1053588/data-science-machine-learning-importance-of-features/
    Explore at:
    Dataset updated
    Mar 17, 2022
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2019
    Area covered
    Worldwide
    Description

    Support for a wide range of regression models is the most important feature organizations need in data science and machine learning technologies as of 2019, with about 66 percent of respondents reporting this feature to be critical or very important. Hierarchical clustering and textbook statistical functions are also on top of the list.

  20. Big Data as a Service (BDaaS) Market Analysis North...

    • technavio.com
    Updated Dec 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2023). Big Data as a Service (BDaaS) Market Analysis North America,APAC,Europe,South America,Middle East and Africa - US,Canada,China,Germany,UK - Size and Forecast 2024-2028 [Dataset]. https://www.technavio.com/report/big-data-as-a-service-market-industry-analysis
    Explore at:
    Dataset updated
    Dec 20, 2023
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Canada, United States, United Kingdom, Global
    Description

    Snapshot img

    Big Data as a Service Market Size 2024-2028

    The big data as a service market size is forecast to increase by USD 41.20 billion at a CAGR of 28.45% between 2023 and 2028.

    The market is experiencing significant growth due to the increasing volume of data and the rising demand for advanced data insights. Machine learning algorithms and artificial intelligence are driving product quality and innovation in this sector. Hybrid cloud solutions are gaining popularity, offering the benefits of both private and public cloud platforms for optimal data storage and scalability. Industry standards for data privacy and security are increasingly important, as large amounts of data pose unique risks. The BDaaS market is expected to continue its expansion, providing valuable data insights to businesses across various industries.
    

    What will be the Big Data as a Service Market Size During the Forecast Period?

    Request Free Sample

    Big Data as a Service (BDaaS) has emerged as a game-changer in the business world, enabling organizations to harness the power of big data without the need for extensive infrastructure and expertise. This service model offers various components such as data management, analytics, and visualization tools, enabling businesses to derive valuable insights from their data. BDaaS encompasses several key components that drive market growth. These include Business Intelligence (BI), Data Science, Data Quality, and Data Security. BI provides organizations with the ability to analyze data and gain insights to make informed decisions.
    
    
    
    Data Science, on the other hand, focuses on extracting meaningful patterns and trends from large datasets using advanced algorithms. Data Quality is a critical component of BDaaS, ensuring that the data being analyzed is accurate, complete, and consistent. Data Security is another essential aspect, safeguarding sensitive data from cybersecurity threats and data breaches. Moreover, BDaaS offers various data pipelines, enabling seamless data integration and data lifecycle management. Network Analysis, Real-time Analytics, and Predictive Analytics are other essential components, providing businesses with actionable insights in real-time and enabling them to anticipate future trends. Data Mining, Machine Learning Algorithms, and Data Visualization Tools are other essential components of BDaaS.
    

    How is this market segmented and which is the largest segment?

    The market research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2024-2028, as well as historical data from 2018-2022 for the following segments.

    Type
    
      Data analytics-as-a-Service
      Hadoop-as-a-service
      Data-as-a-service
    
    
    Deployment
    
      Public cloud
      Hybrid cloud
      Private cloud
    
    
    Geography
    
      North America
    
        Canada
        US
    
    
      APAC
    
        China
    
    
      Europe
    
        Germany
        UK
    
    
      South America
    
    
    
      Middle East and Africa
    

    By Type Insights

    The data analytics-as-a-service segment is estimated to witness significant growth during the forecast period.
    

    Big Data as a Service (BDaaS) is a significant market segment, highlighted by the availability of Hadoop-as-a-Service solutions. These offerings enable businesses to access essential datasets on-demand without the burden of expensive infrastructure. DAaaS solutions facilitate real-time data analysis, empowering organizations to make informed decisions. The DAaaS landscape is expanding rapidly as companies acknowledge its value in enhancing internal data. Integrating DAaaS with big data systems amplifies analytics capabilities, creating a vibrant market landscape. Organizations can leverage diverse datasets to gain a competitive edge, driving the growth of the global BDaaS market. In the context of digital transformation, cloud computing, IoT, and 5G technologies, BDaaS solutions offer optimal resource utilization.

    However, regulatory scrutiny poses challenges, necessitating stringent data security measures. Retail and other industries stand to benefit significantly from BDaaS, particularly with distributed computing solutions. DAaaS adoption is a strategic investment for businesses seeking to capitalize on the power of external data for valuable insights.

    Get a glance at the market report of share of various segments Request Free Sample

    The Data analytics-as-a-Service segment was valued at USD 2.59 billion in 2018 and showed a gradual increase during the forecast period.

    Regional Analysis

    North America is estimated to contribute 35% to the growth of the global market during the forecast period.
    

    Technavio's analysts have elaborately explained the regional trends and drivers that shape the market during the forecast period.

    For more insights on the market share of various regions Request Free Sample

    Big Data as a Service Market analysis, North America is experiencing signif

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Statista (2024). Importance of big data analytics and machine learning technologies worldwide 2019 [Dataset]. https://www.statista.com/statistics/919497/worldwide-critical-big-data-analytics-machine-learning-technologies/
Organization logo

Importance of big data analytics and machine learning technologies worldwide 2019

Explore at:
Dataset updated
Dec 10, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2019
Area covered
Worldwide
Description

This statistic shows the importance of big data analysis and machine learning technologies worldwide as of 2019. Tensorflow was seen as the most important big data analytics and machine learning technology, with 59 percent of respondents stating that it was important to critial for their organization.

Search
Clear search
Close search
Google apps
Main menu