100+ datasets found
  1. n

    Data from: Trust, AI, and Synthetic Biometrics

    • curate.nd.edu
    pdf
    Updated Nov 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick G Tinsley (2024). Trust, AI, and Synthetic Biometrics [Dataset]. http://doi.org/10.7274/25604631.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Nov 11, 2024
    Dataset provided by
    University of Notre Dame
    Authors
    Patrick G Tinsley
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Artificial Intelligence-based image generation has recently seen remarkable advancements, largely driven by deep learning techniques, such as Generative Adversarial Networks (GANs). With the influx and development of generative models, so too have biometric re-identification models and presentation attack detection models seen a surge in discriminative performance. However, despite the impressive photo-realism of generated samples and the additive value to the data augmentation pipeline, the role and usage of machine learning models has received intense scrutiny and criticism, especially in the context of biometrics, often being labeled as untrustworthy. Problems that have garnered attention in modern machine learning include: humans' and machines' shared inability to verify the authenticity of (biometric) data, the inadvertent leaking of private biometric data through the image synthesis process, and racial bias in facial recognition algorithms. Given the arrival of these unwanted side effects, public trust has been shaken in the blind use and ubiquity of machine learning.

    However, in tandem with the advancement of generative AI, there are research efforts to re-establish trust in generative and discriminative machine learning models. Explainability methods based on aggregate model salience maps can elucidate the inner workings of a detection model, establishing trust in a post hoc manner. The CYBORG training strategy, originally proposed by Boyd, attempts to actively build trust into discriminative models by incorporating human salience into the training process.

    In doing so, CYBORG-trained machine learning models behave more similar to human annotators and generalize well to unseen types of synthetic data. Work in this dissertation also attempts to renew trust in generative models by training generative models on synthetic data in order to avoid identity leakage in models trained on authentic data. In this way, the privacy of individuals whose biometric data was seen during training is not compromised through the image synthesis procedure. Future development of privacy-aware image generation techniques will hopefully achieve the same degree of biometric utility in generative models with added guarantees of trustworthiness.

  2. f

    Table1_Enhancing biomechanical machine learning with limited data:...

    • frontiersin.figshare.com
    pdf
    Updated Feb 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carlo Dindorf; Jonas Dully; Jürgen Konradi; Claudia Wolf; Stephan Becker; Steven Simon; Janine Huthwelker; Frederike Werthmann; Johanna Kniepert; Philipp Drees; Ulrich Betz; Michael Fröhlich (2024). Table1_Enhancing biomechanical machine learning with limited data: generating realistic synthetic posture data using generative artificial intelligence.pdf [Dataset]. http://doi.org/10.3389/fbioe.2024.1350135.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Feb 14, 2024
    Dataset provided by
    Frontiers
    Authors
    Carlo Dindorf; Jonas Dully; Jürgen Konradi; Claudia Wolf; Stephan Becker; Steven Simon; Janine Huthwelker; Frederike Werthmann; Johanna Kniepert; Philipp Drees; Ulrich Betz; Michael Fröhlich
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Objective: Biomechanical Machine Learning (ML) models, particularly deep-learning models, demonstrate the best performance when trained using extensive datasets. However, biomechanical data are frequently limited due to diverse challenges. Effective methods for augmenting data in developing ML models, specifically in the human posture domain, are scarce. Therefore, this study explored the feasibility of leveraging generative artificial intelligence (AI) to produce realistic synthetic posture data by utilizing three-dimensional posture data.Methods: Data were collected from 338 subjects through surface topography. A Variational Autoencoder (VAE) architecture was employed to generate and evaluate synthetic posture data, examining its distinguishability from real data by domain experts, ML classifiers, and Statistical Parametric Mapping (SPM). The benefits of incorporating augmented posture data into the learning process were exemplified by a deep autoencoder (AE) for automated feature representation.Results: Our findings highlight the challenge of differentiating synthetic data from real data for both experts and ML classifiers, underscoring the quality of synthetic data. This observation was also confirmed by SPM. By integrating synthetic data into AE training, the reconstruction error can be reduced compared to using only real data samples. Moreover, this study demonstrates the potential for reduced latent dimensions, while maintaining a reconstruction accuracy comparable to AEs trained exclusively on real data samples.Conclusion: This study emphasizes the prospects of harnessing generative AI to enhance ML tasks in the biomechanics domain.

  3. G

    Generative AI Market Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2025). Generative AI Market Report [Dataset]. https://www.marketresearchforecast.com/reports/generative-ai-market-1667
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Jan 2, 2025
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Generative AI Market size was valued at USD 43.87 USD Billion in 2023 and is projected to reach USD 453.28 USD Billion by 2032, exhibiting a CAGR of 39.6 % during the forecast period. The market's expansion is driven by the increasing adoption of AI in various industries, the growing demand for personalized experiences, and the advancement of machine learning and deep learning technologies. Generative AI is a form of AI technology that come with the capability to generate content in several of forms such us that include text, images, audio data, and artificial data. In the latest trend of the use of generative AI, fingertip friendly interfaces that allow for the creation of top-quality text design, and videos in a brief time of only seconds have been the leading cause of the hype around it. The AI technology called Generative AI employs a variety of techniques that its development is still being improved. Fundamentally, AI foundation models are based on training on a wide spate of unlabelled data that can be used for many tasks; working primarily on specific areas where additional fine-tuning finds its place. Over-simplifying the process, huge amounts of maths and computer power get used to develop AI models. Nevertheless, at its core, it is the predictions amplified. Generative AI relies on deep learning models – sophisticated machine learning models that work as neural networks and learn and take decisions just the human minds do. Such models are based on the detection and emission of codes of complex relationships or patterns in huge information volumes and that data is used to respond to users' original speech requests or questions with native language replies or new content. Recent developments include: June 2023: Salesforce launched two generative artificial intelligence (AI) products for commerce experience and customized consumers –Commerce GPT and Marketing GPT. The Marketing GPT model leverages data from Salesforce's real-time data cloud platform to generate more innovative audience segments, personalized emails, and marketing strategies., June 2023: Accenture and Microsoft are teaming up to help companies primarily transform their businesses by harnessing the power of generative AI accelerated by the cloud. It helps customers find the right way to build and extend technology in their business responsibly., May 2023: SAP SE partnered with Microsoft to help customers solve their fundamental business challenges with the latest enterprise-ready innovations. This integration will enable new experiences to improve how businesses attract, retain and qualify their employees. , April 2023: Amazon Web Services, Inc. launched a global generative AI accelerator for startups. The company’s Generative AI Accelerator offers access to impactful AI tools and models, machine learning stack optimization, customized go-to-market strategies, and more., March 2023: Adobe and NVIDIA have partnered to join the growth of generative AI and additional advanced creative workflows. Adobe and NVIDIA will innovate advanced AI models with new generations aiming at tight integration into the applications that significant developers and marketers use. . Key drivers for this market are: Growing Necessity to Create a Virtual World in the Metaverse to Drive the Market. Potential restraints include: Risks Related to Data Breaches and Sensitive Information to Hinder Market Growth . Notable trends are: Rising Awareness about Conversational AI to Transform the Market Outlook .

  4. S

    Synthetic Data Generation Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jun 16, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Synthetic Data Generation Report [Dataset]. https://www.datainsightsmarket.com/reports/synthetic-data-generation-1124388
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Jun 16, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The synthetic data generation market is experiencing explosive growth, driven by the increasing need for high-quality data in various applications, including AI/ML model training, data privacy compliance, and software testing. The market, currently estimated at $2 billion in 2025, is projected to experience a Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching an estimated $10 billion by 2033. This significant expansion is fueled by several key factors. Firstly, the rising adoption of artificial intelligence and machine learning across industries demands large, high-quality datasets, often unavailable due to privacy concerns or data scarcity. Synthetic data provides a solution by generating realistic, privacy-preserving datasets that mirror real-world data without compromising sensitive information. Secondly, stringent data privacy regulations like GDPR and CCPA are compelling organizations to explore alternative data solutions, making synthetic data a crucial tool for compliance. Finally, the advancements in generative AI models and algorithms are improving the quality and realism of synthetic data, expanding its applicability in various domains. Major players like Microsoft, Google, and AWS are actively investing in this space, driving further market expansion. The market segmentation reveals a diverse landscape with numerous specialized solutions. While large technology firms dominate the broader market, smaller, more agile companies are making significant inroads with specialized offerings focused on specific industry needs or data types. The geographical distribution is expected to be skewed towards North America and Europe initially, given the high concentration of technology companies and early adoption of advanced data technologies. However, growing awareness and increasing data needs in other regions are expected to drive substantial market growth in Asia-Pacific and other emerging markets in the coming years. The competitive landscape is characterized by a mix of established players and innovative startups, leading to continuous innovation and expansion of market applications. This dynamic environment indicates sustained growth in the foreseeable future, driven by an increasing recognition of synthetic data's potential to address critical data challenges across industries.

  5. Scripted Monologues Speech Data | 65,000 Hours | Generative AI Audio Data|...

    • datarade.ai
    Updated Dec 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2023). Scripted Monologues Speech Data | 65,000 Hours | Generative AI Audio Data| Speech Recognition Data | Machine Learning (ML) Data [Dataset]. https://datarade.ai/data-products/nexdata-multilingual-read-speech-data-65-000-hours-aud-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Dec 11, 2023
    Dataset authored and provided by
    Nexdata
    Area covered
    Poland, Puerto Rico, Japan, Chile, France, Uruguay, Italy, Luxembourg, Pakistan, Taiwan
    Description
    1. Specifications Format : 16kHz, 16bit, uncompressed wav, mono channel

    Recording environment : quiet indoor environment, without echo

    Recording content (read speech) : economy, entertainment, news, oral language, numbers, letters

    Speaker : native speaker, gender balance

    Device : Android mobile phone, iPhone

    Language : 100+ languages

    Transcription content : text, time point of speech data, 5 noise symbols, 5 special identifiers

    Accuracy rate : 95% (the accuracy rate of noise symbols and other identifiers is not included)

    Application scenarios : speech recognition, voiceprint recognition

    1. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade
  6. C

    Data about the use of generative artificial intelligence in the training of...

    • dataverse.csuc.cat
    pdf, txt
    Updated Jun 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carlos Lopezosa; Carlos Lopezosa; Lluís Codina; Lluís Codina; Carles Pont-Sorribes; Carles Pont-Sorribes; Mari Vállez; Mari Vállez (2024). Data about the use of generative artificial intelligence in the training of journalists: challenges, uses and training proposal [Dataset]. http://doi.org/10.34810/data1039
    Explore at:
    pdf(38617), pdf(50207), pdf(52988), pdf(36364), pdf(34344), pdf(36477), pdf(37962), pdf(48432), pdf(35020), pdf(36306), pdf(34658), pdf(35943), pdf(37142), pdf(37040), pdf(38893), pdf(34253), pdf(36837), pdf(40050), pdf(33848), pdf(37655), pdf(34281), pdf(35076), pdf(46523), txt(2005), pdf(34261), pdf(35963), pdf(35252), pdf(37754), pdf(34350), pdf(38577), pdf(38895), pdf(38856), pdf(36243)Available download formats
    Dataset updated
    Jun 3, 2024
    Dataset provided by
    CORA.Repositori de Dades de Recerca
    Authors
    Carlos Lopezosa; Carlos Lopezosa; Lluís Codina; Lluís Codina; Carles Pont-Sorribes; Carles Pont-Sorribes; Mari Vállez; Mari Vállez
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The influence of artificial intelligence (AI) on communication and journalism is explored based on in-depth, semi-structured interviews with 32 experts. The ethical and technological use of AI in automatically generating news content is highlighted, along with challenges related to transparency and bias prevention.

  7. A

    AI Training Dataset Market Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Feb 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2025). AI Training Dataset Market Report [Dataset]. https://www.marketresearchforecast.com/reports/ai-training-dataset-market-5125
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Feb 23, 2025
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    Recent developments include: December 2023: TELUS International, a digital customer experience innovator in AI and content moderation, launched Experts Engine, a fully managed, technology-driven, on-demand expert acquisition solution for generative AI models. It programmatically brings together human expertise and Gen AI tasks, such as data collection, data generation, annotation, and validation, to build high-quality training sets for the most challenging master models, including the Large Language Model (LLM)., September 2023: Cogito Tech, a player in data labeling for AI development, launched an appeal to AI vendors globally by introducing a “Nutrition Facts” style model for an AI training dataset known as DataSum. The company has been actively encouraging a more Ethical approach to AI, ML, and employment practices., June 2023: Sama, a provider of data annotation solutions that power AI models, launched Platform 2.0, a new computer vision platform designed to reduce the risk of ML algorithm failure in AI training models., May 2023: Appen Limited, a player in AI lifecycle data, announced a partnership with Reka AI, an emerging AI company making its way from stealth. This partnership aims to combine Appen's data services with Reka's proprietary multimodal language models., March 2022: Appen Limited invested in Mindtech, a synthetic data company focusing on the development of training data for AI computer vision models. This investment is part of Appen's strategy to invest capital in product-led businesses generating new and emerging sources of training data for supporting the AI lifecycle.. Key drivers for this market are: Rapid Adoption of AI Technologies for Training Datasets to Aid Market Growth. Potential restraints include: Lack of Skilled AI Professionals and Data Privacy Concerns to Hinder Market Expansion. Notable trends are: Rising Usage of Synthetic Data for Enhancing Authentication to Propel Market Growth.

  8. Tabular Data to Image Generation - Training Data

    • figshare.com
    application/x-gzip
    Updated Jan 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alex Tang; Ryan Rossi (2023). Tabular Data to Image Generation - Training Data [Dataset]. http://doi.org/10.6084/m9.figshare.21975359.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Jan 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Alex Tang; Ryan Rossi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We defined 300 table-image pairs across 6 categories: meat, wine, sweet, fish, gold, fruit, each with 50 table-image pairs. All images are resized to 256*256 pixles, and all tables consist of 5 to 20 rows.

  9. d

    Film Dataset for AI-Generated Music (Machine Learning (ML) Data)

    • datarade.ai
    .json, .csv, .xls
    Updated Feb 14, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rightsify (2024). Film Dataset for AI-Generated Music (Machine Learning (ML) Data) [Dataset]. https://datarade.ai/data-products/film-dataset-for-ai-generated-music-machine-learning-ml-data-rightsify
    Explore at:
    .json, .csv, .xlsAvailable download formats
    Dataset updated
    Feb 14, 2024
    Dataset authored and provided by
    Rightsify
    Area covered
    Kuwait, Cuba, Guinea, Denmark, Bermuda, Antarctica, Luxembourg, Falkland Islands (Malvinas), Tokelau, Moldova (Republic of)
    Description

    The Film dataset is a large collection of audio files with full metadata, including chords, instrumentation, key, tempo, and timestamps. This dataset is designed for machine learning applications and serves as a reliable resource for generative AI music, Music Information Retrieval (MIR), and source separation. With an emphasis on expanding machine learning attempts, the dataset allows researchers to delve into the complexities of film music, enabling the development of algorithms capable of generating creative compositions that genuinely represent the emotive nuances of various genres.

    Film music, an essential component of cinematic storytelling, plays an important role in increasing spectator engagement and emotional resonance. Composers work collaboratively with filmmakers to create music that enhance visual aspects, set the tone, and reinforce story themes.

    Training models on this cinema dataset allows researchers to better grasp and mimic these artistic details, extending the bounds of AI-generated music and contributing to advances in MIR and source separation.

  10. v

    Synthetic Data Generation Market By Offering (Solution/Platform, Services),...

    • verifiedmarketresearch.com
    Updated Mar 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    VERIFIED MARKET RESEARCH (2025). Synthetic Data Generation Market By Offering (Solution/Platform, Services), Data Type (Tabular, Text, Image, Video), Application (AI/ML Training & Development, Test Data Management), & Region for 2026-2032 [Dataset]. https://www.verifiedmarketresearch.com/product/synthetic-data-generation-market/
    Explore at:
    Dataset updated
    Mar 5, 2025
    Dataset authored and provided by
    VERIFIED MARKET RESEARCH
    License

    https://www.verifiedmarketresearch.com/privacy-policy/https://www.verifiedmarketresearch.com/privacy-policy/

    Time period covered
    2026 - 2032
    Area covered
    Global
    Description

    Synthetic Data Generation Market size was valued at USD 0.4 Billion in 2024 and is projected to reach USD 9.3 Billion by 2032, growing at a CAGR of 46.5 % from 2026 to 2032.

    The Synthetic Data Generation Market is driven by the rising demand for AI and machine learning, where high-quality, privacy-compliant data is crucial for model training. Businesses seek synthetic data to overcome real-data limitations, ensuring security, diversity, and scalability without regulatory concerns. Industries like healthcare, finance, and autonomous vehicles increasingly adopt synthetic data to enhance AI accuracy while complying with stringent privacy laws.

    Additionally, cost efficiency and faster data availability fuel market growth, reducing dependency on expensive, time-consuming real-world data collection. Advancements in generative AI, deep learning, and simulation technologies further accelerate adoption, enabling realistic synthetic datasets for robust AI model development.

  11. h

    hallo3_training_data

    • huggingface.co
    Updated Feb 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fudan Generative AI (2025). hallo3_training_data [Dataset]. https://huggingface.co/datasets/fudan-generative-ai/hallo3_training_data
    Explore at:
    Dataset updated
    Feb 18, 2025
    Dataset authored and provided by
    Fudan Generative AI
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks

    Jiahao Cui1 
    Hui Li1 
    Yun Zhan1 
    Hanlin Shang1 
    Kaihui Cheng1 
    Yuqi Ma1 
    Shan Mu1 
    
    
    Hang Zhou2 
    Jingdong Wang2 
    Siyu Zhu1✉️ 
    
    
    
    1Fudan University  2Baidu Inc 
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    
    
      I. Dataset Overview
    

    This dataset serves as the training data for the open - source Hallo3 model, specifically created for the training of video… See the full description on the dataset page: https://huggingface.co/datasets/fudan-generative-ai/hallo3_training_data.

  12. Data from: MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Sep 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Santiago Miret; Santiago Miret; Kin Long Kelvin Lee; Kin Long Kelvin Lee; Carmelo Gonzales; Carmelo Gonzales; Mikhail Galkin; Mikhail Galkin; Matthew Spelling; Matthew Spelling (2023). MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling [Dataset]. http://doi.org/10.5281/zenodo.8381476
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 28, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Santiago Miret; Santiago Miret; Kin Long Kelvin Lee; Kin Long Kelvin Lee; Carmelo Gonzales; Carmelo Gonzales; Mikhail Galkin; Mikhail Galkin; Matthew Spelling; Matthew Spelling
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We propose MatSci ML, a novel benchmark for modeling MATerials SCIence using Machine Learning (MatSci ML) methods focused on solid-state materials with periodic crystal structures. Applying machine learning methods to solid-state materials is a nascent field with substantial fragmentation largely driven by the great variety of datasets used to develop machine learning models. This fragmentation makes comparing the performance and generalizability of different methods difficult, thereby hindering overall research progress in the field. Building on top of open-source datasets, including large-scale datasets like the OpenCatalyst, OQMD, NOMAD, the Carolina Materials Database, and Materials Project, the MatSci ML benchmark provides a diverse set of materials systems and properties data for model training and evaluation, including simulated energies, atomic forces, material bandgaps, as well as classification data for crystal symmetries via space groups. The diversity of properties in MatSci ML makes the implementation and evaluation of multi-task learning algorithms for solid-state materials possible, while the diversity of datasets facilitates the development of new, more generalized algorithms and methods across multiple datasets. In the multi-dataset learning setting, MatSci ML enables researchers to combine observations from multiple datasets to perform joint prediction of common properties, such as energy and forces. Using MatSci ML, we evaluate the performance of different graph neural networks and equivariant point cloud networks on several benchmark tasks spanning single task, multitask, and multi-data learning scenarios. Our open-source code is available at https://github.com/IntelLabs/matsciml.

  13. G

    Generative AI Market Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Generative AI Market Report [Dataset]. https://www.marketreportanalytics.com/reports/generative-ai-market-89360
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Apr 22, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Generative AI market is experiencing explosive growth, projected to reach $36.06 billion in 2025 and exhibiting a remarkable Compound Annual Growth Rate (CAGR) of 50.87% from 2025 to 2033. This rapid expansion is fueled by several key drivers. Firstly, the increasing availability and affordability of powerful computing resources, particularly GPUs, are making generative AI models more accessible and easier to train. Secondly, advancements in deep learning techniques, particularly in transformer-based architectures, have significantly improved the quality and capabilities of generative AI systems, leading to wider adoption across various sectors. Thirdly, the rising demand for automation and personalization across industries is driving the integration of generative AI solutions for tasks ranging from content creation and customer service to drug discovery and financial modeling. The BFSI (Banking, Financial Services, and Insurance), healthcare, and IT & telecommunication sectors are currently leading the adoption, but significant growth is anticipated across retail and consumer goods, and government sectors as well. The market is segmented into software and services, reflecting the diverse nature of generative AI offerings, ranging from pre-trained models and APIs to customized solutions and ongoing support. The competitive landscape is dynamic, with major technology players like Google, IBM, Microsoft, and Amazon Web Services leading the charge alongside innovative startups like Cohere and Rephrase.ai. While the market enjoys significant momentum, challenges remain. These include the ethical considerations surrounding biased data and potential misuse, concerns about data privacy and security, and the need for skilled professionals to develop, deploy, and manage these complex systems. Despite these challenges, the long-term outlook for the generative AI market remains exceptionally positive, driven by continuous technological innovation, expanding application areas, and increasing investment from both private and public sectors. The market's trajectory indicates a significant transformation across numerous industries in the coming years, promising increased efficiency, productivity, and novel applications previously unimaginable. Recent developments include: April 2024: Cognizant expanded its collaboration with Microsoft to bring Microsoft’s generative AI capabilities to its employees and a million users across its 2,000 global clients. The professional services business has purchased 25,000 Microsoft 365 Copilot seats for its associates, 500 Sales Copilot seats, and 500 Services Copilot seats to enhance productivity, workflows, and customer experiences. Cognizant will also work to deploy Microsoft 365 Copilot to its customers., February 2024: Stack Overflow and Google Cloud reported a strategic collaboration that will deliver new-gen AI-powered abilities to developers through the Stack Overflow platform, Google Cloud Console, and Gemini for Google Cloud. Through the partnership, Stack Overflow will work with Google Cloud to bring new AI-powered features to its widely adopted developer knowledge platform. Google Cloud will integrate Gemini for Google Cloud with Stack Overflow, enabling it to surface important knowledge base information and coding assistance capabilities to developers.. Key drivers for this market are: Increasing Use of AI-Integrated System across Multiple Industries, Increase in Demand for Customization and Personalization Needs. Potential restraints include: Increasing Use of AI-Integrated System across Multiple Industries, Increase in Demand for Customization and Personalization Needs. Notable trends are: BFSI is Expected to Hold a Significant Share of the Market.

  14. G

    Generative AI in Healthcare Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Apr 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Generative AI in Healthcare Report [Dataset]. https://www.datainsightsmarket.com/reports/generative-ai-in-healthcare-1367591
    Explore at:
    ppt, pdf, docAvailable download formats
    Dataset updated
    Apr 12, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Generative AI in Healthcare market is experiencing explosive growth, driven by the increasing need for improved diagnostic accuracy, personalized medicine, and enhanced operational efficiency within the healthcare sector. The market, estimated at $5 billion in 2025, is projected to witness a robust Compound Annual Growth Rate (CAGR) of 25% from 2025 to 2033, reaching an estimated market value of $25 billion by 2033. This expansion is fueled by several key factors. Firstly, the ability of generative AI to analyze complex medical images (X-rays, CT scans, MRIs) and patient data far surpasses human capabilities in speed and accuracy, leading to earlier and more precise diagnoses. Secondly, the development of personalized treatment plans, tailored to individual patient genetic profiles and medical histories, promises to significantly improve treatment outcomes. Finally, automation of administrative tasks through generative AI frees up healthcare professionals' time, allowing them to focus on patient care. Leading technology giants like Google, IBM, and Microsoft, alongside specialized healthcare companies and innovative startups, are actively investing in R&D and deploying generative AI solutions across various applications, including drug discovery, clinical trials, and patient engagement. However, the market faces some challenges. Data privacy and security concerns remain paramount, necessitating robust regulatory frameworks and ethical guidelines. The high cost of implementing and maintaining generative AI systems can also pose a barrier to entry for smaller healthcare providers. Furthermore, the need for extensive data sets to train accurate and reliable AI models, along with potential biases in training data, needs careful consideration. Despite these hurdles, the transformative potential of generative AI in healthcare is undeniable, leading to continued investment and innovation, particularly in applications involving image and text-based analysis within hospitals, clinical research, and diagnostic centers. The market's segmentation across different application areas and AI types further reflects the diversity of this rapidly evolving field. Future growth will likely be shaped by advancements in natural language processing, computer vision, and the development of more robust and explainable AI models.

  15. S

    Synthetic Data Platform Report

    • marketresearchforecast.com
    doc, pdf, ppt
    Updated Mar 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Forecast (2025). Synthetic Data Platform Report [Dataset]. https://www.marketresearchforecast.com/reports/synthetic-data-platform-33672
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Mar 14, 2025
    Dataset authored and provided by
    Market Research Forecast
    License

    https://www.marketresearchforecast.com/privacy-policyhttps://www.marketresearchforecast.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Synthetic Data Platform market is experiencing robust growth, driven by the increasing need for data privacy and security, coupled with the rising demand for AI and machine learning model training. The market's expansion is fueled by several key factors. Firstly, stringent data privacy regulations like GDPR and CCPA are limiting the use of real-world data, creating a surge in demand for synthetic data that mimics the characteristics of real data without compromising sensitive information. Secondly, the expanding applications of AI and ML across diverse sectors like healthcare, finance, and transportation require massive datasets for effective model training. Synthetic data provides a scalable and cost-effective solution to this challenge, enabling organizations to build and test models without the limitations imposed by real data scarcity or privacy concerns. Finally, advancements in synthetic data generation techniques, including generative adversarial networks (GANs) and variational autoencoders (VAEs), are continuously improving the quality and realism of synthetic datasets, making them increasingly viable alternatives to real data. The market is segmented by application (Government, Retail & eCommerce, Healthcare & Life Sciences, BFSI, Transportation & Logistics, Telecom & IT, Manufacturing, Others) and type (Cloud-Based, On-Premises). While the cloud-based segment currently dominates due to its scalability and accessibility, the on-premises segment is expected to witness growth driven by organizations prioritizing data security and control. Geographically, North America and Europe are currently leading the market, owing to the presence of mature technological infrastructure and a high adoption rate of AI and ML technologies. However, Asia-Pacific is anticipated to show significant growth potential in the coming years, driven by increasing digitalization and investments in AI across the region. While challenges remain in terms of ensuring the quality and fidelity of synthetic data and addressing potential biases in generated datasets, the overall outlook for the Synthetic Data Platform market remains highly positive, with substantial growth projected over the forecast period. We estimate a CAGR of 25% from 2025 to 2033.

  16. Generative Artificial Intelligence (AI) Market Analysis, Size, and Forecast...

    • technavio.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio, Generative Artificial Intelligence (AI) Market Analysis, Size, and Forecast 2025-2029: North America (Canada and Mexico), APAC (China, India, Japan, South Korea), Europe (France, Germany, Italy, Spain, The Netherlands, UK), South America (Brazil), and Middle East and Africa (UAE) [Dataset]. https://www.technavio.com/report/generative-ai-market-analysis
    Explore at:
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Global
    Description

    Snapshot img

    Generative Artificial Intelligence (AI) Market Size 2025-2029

    The generative artificial intelligence (AI) market size is forecast to increase by USD 185.82 billion at a CAGR of 59.4% between 2024 and 2029.

    The market is experiencing significant growth due to the increasing demand for AI-generated content. This trend is being driven by the accelerated deployment of large language models (LLMs), which are capable of generating human-like text, music, and visual content. However, the market faces a notable challenge: the lack of quality data. Despite the promising advancements in AI technology, the availability and quality of data remain a significant obstacle. To effectively train and improve AI models, high-quality, diverse, and representative data are essential. The scarcity and biases in existing data sets can limit the performance and generalizability of AI systems, posing challenges for businesses seeking to capitalize on the market opportunities presented by generative AI.
    Companies must prioritize investing in data collection, curation, and ethics to address this challenge and ensure their AI solutions deliver accurate, unbiased, and valuable results. By focusing on data quality, businesses can navigate this challenge and unlock the full potential of generative AI in various industries, including content creation, customer service, and research and development.
    

    What will be the Size of the Generative Artificial Intelligence (AI) Market during the forecast period?

    Request Free Sample

    The market continues to evolve, driven by advancements in foundation models and large language models. These models undergo constant refinement through prompt engineering and model safety measures, ensuring they deliver personalized experiences for various applications. Research and development in open-source models, language modeling, knowledge graph, product design, and audio generation propel innovation. Neural networks, machine learning, and deep learning techniques fuel data analysis, while model fine-tuning and predictive analytics optimize business intelligence. Ethical considerations, responsible AI, and model explainability are integral parts of the ongoing conversation.
    Model bias, data privacy, and data security remain critical concerns. Transformer models and conversational AI are transforming customer service, while code generation, image generation, text generation, video generation, and topic modeling expand content creation possibilities. Ongoing research in natural language processing, sentiment analysis, and predictive analytics continues to shape the market landscape.
    

    How is this Generative Artificial Intelligence (AI) Industry segmented?

    The generative artificial intelligence (AI) industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

    Component
    
      Software
      Services
    
    
    Technology
    
      Transformers
      Generative adversarial networks (GANs)
      Variational autoencoder (VAE)
      Diffusion networks
    
    
    Application
    
      Computer Vision
      NLP
      Robotics & Automation
      Content Generation
      Chatbots & Intelligent Virtual Assistants
      Predictive Analytics
      Others
    
    
    End-Use
    
      Media & Entertainment
      BFSI
      IT & Telecommunication
      Healthcare
      Automotive & Transportation
      Gaming
      Others
    
    
    Model
    
      Large Language Models
      Image & Video Generative Models
      Multi-modal Generative Models
      Others
    
    
    Geography
    
      North America
    
        US
        Canada
        Mexico
    
    
      Europe
    
        France
        Germany
        Italy
        Spain
        The Netherlands
        UK
    
    
      Middle East and Africa
    
        UAE
    
    
      APAC
    
        China
        India
        Japan
        South Korea
    
    
      South America
    
        Brazil
    
    
      Rest of World (ROW)
    

    By Component Insights

    The software segment is estimated to witness significant growth during the forecast period.

    Generative Artificial Intelligence (AI) is revolutionizing the tech landscape with its ability to create unique and personalized content. Foundation models, such as GPT-4, employ deep learning techniques to generate human-like text, while large language models fine-tune these models for specific applications. Prompt engineering and model safety are crucial in ensuring accurate and responsible AI usage. Businesses leverage these technologies for various purposes, including content creation, customer service, and product design. Research and development in generative AI is ongoing, with open-source models and transformer models leading the way. Neural networks and deep learning power these models, enabling advanced capabilities like audio generation, data analysis, and predictive analytics.

    Natural language processing, sentiment analysis, and conversational AI are essential applications, enhancing business intelligence and customer experiences. Ethica

  17. f

    DataSheet1_Generative artificial intelligence model for simulating...

    • frontiersin.figshare.com
    pdf
    Updated Oct 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hiroyuki Yamaguchi; Genichi Sugihara; Masaaki Shimizu; Yuichi Yamashita (2024). DataSheet1_Generative artificial intelligence model for simulating structural brain changes in schizophrenia.pdf [Dataset]. http://doi.org/10.3389/fpsyt.2024.1437075.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Oct 4, 2024
    Dataset provided by
    Frontiers
    Authors
    Hiroyuki Yamaguchi; Genichi Sugihara; Masaaki Shimizu; Yuichi Yamashita
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundRecent advancements in generative artificial intelligence (AI) for image generation have presented significant opportunities for medical imaging, offering a promising way to generate realistic virtual medical images while ensuring patient privacy. The generation of a large number of virtual medical images through AI has the potential to augment training datasets for discriminative AI models, particularly in fields with limited data availability, such as neuroimaging. Current studies on generative AI in neuroimaging have mainly focused on disease discrimination; however, its potential for simulating complex phenomena in psychiatric disorders remains unknown. In this study, as examples of a simulation, we aimed to present a novel generative AI model that transforms magnetic resonance imaging (MRI) images of healthy individuals into images that resemble those of patients with schizophrenia (SZ) and explore its application.MethodsWe used anonymized public datasets from the Center for Biomedical Research Excellence (SZ, 71 patients; healthy subjects [HSs], 71 patients) and the Autism Brain Imaging Data Exchange (autism spectrum disorder [ASD], 79 subjects; HSs, 105 subjects). We developed a model to transform MRI images of HSs into MRI images of SZ using cycle generative adversarial networks. The efficacy of the transformation was evaluated using voxel-based morphometry to assess the differences in brain region volumes and the accuracy of age prediction pre- and post-transformation. In addition, the model was examined for its applicability in simulating disease comorbidities and disease progression.ResultsThe model successfully transformed HS images into SZ images and identified brain volume changes consistent with existing case-control studies. We also applied this model to ASD MRI images, where simulations comparing SZ with and without ASD backgrounds highlighted the differences in brain structures due to comorbidities. Furthermore, simulating disease progression while preserving individual characteristics showcased the model’s ability to reflect realistic disease trajectories.DiscussionThe results suggest that our generative AI model can capture subtle changes in brain structures associated with SZ, providing a novel tool for visualizing brain changes in different diseases. The potential of this model extends beyond clinical diagnosis to advances in the simulation of disease mechanisms, which may ultimately contribute to the refinement of therapeutic strategies.

  18. A

    AI Training Dataset Market Report

    • promarketreports.com
    doc, pdf, ppt
    Updated Feb 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pro Market Reports (2025). AI Training Dataset Market Report [Dataset]. https://www.promarketreports.com/reports/ai-training-dataset-market-18858
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Feb 6, 2025
    Dataset authored and provided by
    Pro Market Reports
    License

    https://www.promarketreports.com/privacy-policyhttps://www.promarketreports.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The AI Training Dataset Market is projected to exhibit a robust CAGR of 17.63% during the forecast period of 2025-2033, growing from a value of USD 8.23 billion in 2025 to USD 30.41 billion by 2033. The market is driven by the increasing demand for high-quality training data to train AI models, as well as the growing adoption of AI in various industries such as healthcare, retail, and manufacturing. Key market trends include the increasing use of unstructured data for training AI models, the development of new AI training techniques such as transfer learning, and the growing popularity of cloud-based AI training platforms. The market is segmented by data type (text, images, audio, video, structured data), algorithm type (supervised learning, unsupervised learning, reinforcement learning, semi-supervised learning, generative adversarial networks), application (natural language processing, computer vision, speech recognition, machine translation, predictive analytics), and vertical (healthcare, retail, manufacturing, financial services, government). North America is the largest regional market, followed by Europe and Asia Pacific. Key drivers for this market are: Evolving Deep Learning Algorithms Growing Adoption in Healthcare Advancement in Computer Vision Increasing Demand for Accurate AI Models Expansion into New Industries. Potential restraints include: Growing AI adoption, increasing data availability; technological advancements; rising demand for personalized AI solutions; and expanding applications in various industries.

  19. G

    Generative Artificial Intelligence (Gen AI) Services Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated May 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Generative Artificial Intelligence (Gen AI) Services Report [Dataset]. https://www.datainsightsmarket.com/reports/generative-artificial-intelligence-gen-ai-services-1966306
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    May 5, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Generative Artificial Intelligence (Gen AI) services market is experiencing explosive growth, driven by advancements in deep learning, natural language processing, and computer vision. The market, estimated at $50 billion in 2025, is projected to achieve a Compound Annual Growth Rate (CAGR) of 35% from 2025 to 2033, reaching an impressive $500 billion by 2033. This surge is fueled by increasing adoption across diverse sectors, including electronics (e.g., automated design and content creation), entertainment (e.g., personalized gaming experiences and AI-generated music), and the rapidly expanding medical field (e.g., drug discovery and personalized medicine). Key trends include the rise of multimodal AI (combining text, image, and audio generation), increased focus on ethical considerations and bias mitigation, and the emergence of specialized Gen AI solutions tailored to specific industry needs. While challenges remain, such as high computational costs and the need for substantial data sets, the overall market trajectory remains exceptionally positive. The major players in the Gen AI services market are a mix of technology giants and specialized consulting firms. Companies like NVIDIA, Google, and OpenAI are at the forefront of developing foundational models and infrastructure, while consulting firms such as McKinsey, Bain & Company, and Accenture are instrumental in integrating Gen AI solutions into business operations. Furthermore, specialized data annotation companies like Clickworker and platform providers such as Microsoft Azure and AWS SageMaker play crucial roles in supporting the ecosystem. The regional distribution is currently dominated by North America, benefiting from strong technological advancements and early adoption, but Asia-Pacific, particularly China and India, is quickly emerging as a significant market due to its burgeoning tech sector and large talent pool. The competitive landscape is dynamic, with continuous innovation and strategic partnerships shaping the market's future. The continued development of more efficient and accessible Gen AI tools will be crucial in driving widespread adoption and unlocking the full potential of this transformative technology.

  20. G

    Generative Artificial Intelligence Technology Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Generative Artificial Intelligence Technology Report [Dataset]. https://www.datainsightsmarket.com/reports/generative-artificial-intelligence-technology-1941969
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Jun 25, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Generative Artificial Intelligence (GenAI) technology market is experiencing explosive growth, projected to reach a market size of $990 million in 2025, exhibiting a remarkable Compound Annual Growth Rate (CAGR) of 20.8%. This rapid expansion is fueled by several key drivers. The increasing availability of large datasets for training sophisticated models, advancements in deep learning algorithms, and the growing demand for automated content creation across various sectors are significantly contributing to this market boom. Furthermore, the integration of GenAI into existing business workflows, enhancing efficiency and productivity, is proving to be a major catalyst. Companies are actively exploring GenAI's potential to personalize customer experiences, optimize operations, and develop innovative products and services. This is reflected in the active participation of major players such as OpenAI, DeepMind, Salesforce, Microsoft, Facebook (Meta), IBM, NVIDIA, and Adobe, who are heavily investing in research and development, strategic partnerships, and acquisitions to solidify their positions within this rapidly evolving landscape. The market's trajectory suggests continued strong growth throughout the forecast period (2025-2033). While challenges such as ethical concerns surrounding biased algorithms and potential job displacement need careful consideration and mitigation, the overall market outlook remains positive. The continuous refinement of GenAI models, coupled with the expanding applications across diverse industries like healthcare, finance, and entertainment, promises sustained market expansion. Furthermore, emerging trends such as the development of more explainable AI (XAI) and the integration of GenAI with other emerging technologies (like the Metaverse) are poised to further accelerate growth in the coming years. The potential for GenAI to revolutionize various industries and create new economic opportunities is undeniable, making it a compelling investment area with significant long-term prospects.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Patrick G Tinsley (2024). Trust, AI, and Synthetic Biometrics [Dataset]. http://doi.org/10.7274/25604631.v1

Data from: Trust, AI, and Synthetic Biometrics

Related Article
Explore at:
pdfAvailable download formats
Dataset updated
Nov 11, 2024
Dataset provided by
University of Notre Dame
Authors
Patrick G Tinsley
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Artificial Intelligence-based image generation has recently seen remarkable advancements, largely driven by deep learning techniques, such as Generative Adversarial Networks (GANs). With the influx and development of generative models, so too have biometric re-identification models and presentation attack detection models seen a surge in discriminative performance. However, despite the impressive photo-realism of generated samples and the additive value to the data augmentation pipeline, the role and usage of machine learning models has received intense scrutiny and criticism, especially in the context of biometrics, often being labeled as untrustworthy. Problems that have garnered attention in modern machine learning include: humans' and machines' shared inability to verify the authenticity of (biometric) data, the inadvertent leaking of private biometric data through the image synthesis process, and racial bias in facial recognition algorithms. Given the arrival of these unwanted side effects, public trust has been shaken in the blind use and ubiquity of machine learning.

However, in tandem with the advancement of generative AI, there are research efforts to re-establish trust in generative and discriminative machine learning models. Explainability methods based on aggregate model salience maps can elucidate the inner workings of a detection model, establishing trust in a post hoc manner. The CYBORG training strategy, originally proposed by Boyd, attempts to actively build trust into discriminative models by incorporating human salience into the training process.

In doing so, CYBORG-trained machine learning models behave more similar to human annotators and generalize well to unseen types of synthetic data. Work in this dissertation also attempts to renew trust in generative models by training generative models on synthetic data in order to avoid identity leakage in models trained on authentic data. In this way, the privacy of individuals whose biometric data was seen during training is not compromised through the image synthesis procedure. Future development of privacy-aware image generation techniques will hopefully achieve the same degree of biometric utility in generative models with added guarantees of trustworthiness.

Search
Clear search
Close search
Google apps
Main menu