100+ datasets found
  1. f

    Data from: Target Population Statistical Inference With Data Integration...

    • tandf.figshare.com
    txt
    Updated Feb 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xihao Li; Yang Song (2024). Target Population Statistical Inference With Data Integration Across Multiple Sources—An Approach to Mitigate Information Shortage in Rare Disease Clinical Trials [Dataset]. http://doi.org/10.6084/m9.figshare.9594392.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 12, 2024
    Dataset provided by
    Taylor & Francis
    Authors
    Xihao Li; Yang Song
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A major challenge for rare disease clinical trials is the limited amount of available information for making robust statistical inference. While external data present information integration opportunities to enhance statistical inference, conventional data combining methods, for example, meta-analysis, usually do not adequately address study population differences. Matching methods, on the other hand, directly account for population characteristics but often lead to inefficient use of data by underutilizing unmatched data points. Aiming at a better bias-variance tradeoff, we propose an intuitive integrated inference framework to borrow information from all relevant data sources and make inference on the response of interest over a target population precisely characterized by the joint distribution of baseline covariates. The method is easily implemented and can be complemented by modern statistical learning or machine learning tools. Statistical inference is facilitated by the bootstrap. We argue that the integrated inference framework not only provides an intuitive and coherent perspective for a variety of clinical trial inference problems but also has broad application areas in clinical trial settings and beyond, as a quantitative data integration tool for making robust inference in a target population precise manner for policy and decision makers.

  2. Genomic data integration systematically biases interactome mapping

    • plos.figshare.com
    pdf
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael A. Skinnider; R. Greg Stacey; Leonard J. Foster (2023). Genomic data integration systematically biases interactome mapping [Dataset]. http://doi.org/10.1371/journal.pcbi.1006474
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Michael A. Skinnider; R. Greg Stacey; Leonard J. Foster
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Elucidating the complete network of protein-protein interactions, or interactome, is a fundamental goal of the post-genomic era, yet existing interactome maps are far from complete. To increase the throughput and resolution of interactome mapping, methods for protein-protein interaction discovery by co-migration have been introduced. However, accurate identification of interacting protein pairs within the resulting large-scale proteomic datasets is challenging. Consequently, most computational pipelines for co-migration data analysis incorporate external genomic datasets to distinguish interacting from non-interacting protein pairs. The effect of this procedure on interactome mapping is poorly understood. Here, we conduct a rigorous analysis of genomic data integration for interactome recovery across a large number of co-migration datasets, spanning diverse experimental and computational methods. We find that genomic data integration leads to an increase in the functional coherence of the resulting interactome maps, but this comes at the expense of a decrease in power to discover novel interactions. Importantly, putative novel interactions predicted by genomic data integration are no more likely to later be experimentally discovered than those predicted from co-migration data alone. Our results reveal a widespread and unappreciated limitation in a methodology that has been widely used to map the interactome of humans and model organisms.

  3. i

    Enterprise Data Integration Market - Global Demand & Analysis

    • imrmarketreports.com
    Updated Apr 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swati Kalagate; Akshay Patil; Vishal Kumbhar (2024). Enterprise Data Integration Market - Global Demand & Analysis [Dataset]. https://www.imrmarketreports.com/reports/enterprise-data-integration-market
    Explore at:
    Dataset updated
    Apr 2024
    Dataset provided by
    IMR Market Reports
    Authors
    Swati Kalagate; Akshay Patil; Vishal Kumbhar
    License

    https://www.imrmarketreports.com/privacy-policy/https://www.imrmarketreports.com/privacy-policy/

    Description

    The report offers Enterprise Data Integration Market Dynamics, Comprises Industry development drivers, challenges, opportunities, threats and limitations. A report also incorporates Cost Trend of products, Mergers & Acquisitions, Expansion, Crucial Suppliers of products, Concentration Rate of Steel Coupling Economy. Global Enterprise Data Integration Market Research Report covers Market Effect Factors investigation chiefly included Technology Progress, Consumer Requires Trend, External Environmental Change.

  4. f

    Table1_Unsupervised Multi-Omics Data Integration Methods: A Comprehensive...

    • frontiersin.figshare.com
    docx
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nasim Vahabi; George Michailidis (2023). Table1_Unsupervised Multi-Omics Data Integration Methods: A Comprehensive Review.DOCX [Dataset]. http://doi.org/10.3389/fgene.2022.854752.s001
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    Frontiers
    Authors
    Nasim Vahabi; George Michailidis
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Through the developments of Omics technologies and dissemination of large-scale datasets, such as those from The Cancer Genome Atlas, Alzheimer’s Disease Neuroimaging Initiative, and Genotype-Tissue Expression, it is becoming increasingly possible to study complex biological processes and disease mechanisms more holistically. However, to obtain a comprehensive view of these complex systems, it is crucial to integrate data across various Omics modalities, and also leverage external knowledge available in biological databases. This review aims to provide an overview of multi-Omics data integration methods with different statistical approaches, focusing on unsupervised learning tasks, including disease onset prediction, biomarker discovery, disease subtyping, module discovery, and network/pathway analysis. We also briefly review feature selection methods, multi-Omics data sets, and resources/tools that constitute critical components for carrying out the integration.

  5. f

    Data from: Integrative Data Analysis Where Partial Covariates Have Complex...

    • tandf.figshare.com
    zip
    Updated Jan 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jia Liang; Shuo Chen; Peter Kochunov; L. Elliot Hong; Chixiang Chen (2025). Integrative Data Analysis Where Partial Covariates Have Complex Nonlinear Effects by Using Summary Information from an External Data [Dataset]. http://doi.org/10.6084/m9.figshare.26053224.v2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 24, 2025
    Dataset provided by
    Taylor & Francis
    Authors
    Jia Liang; Shuo Chen; Peter Kochunov; L. Elliot Hong; Chixiang Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A full parametric and linear specification may be insufficient to capture complicated patterns in studies exploring complex features, such as those investigating age-related changes in brain functional abilities. Alternatively, a partially linear model (PLM) consisting of both parametric and nonparametric elements may have a better fit. This model has been widely applied in economics, environmental science, and biomedical studies. In this article, we introduce a novel statistical inference framework that equips PLM with high estimation efficiency by effectively synthesizing summary information from external data into the main analysis. Such an integrative scheme is versatile in assimilating various types of reduced models from the external study. The proposed method is shown to be theoretically valid and numerically convenient, and it ensures a high-efficiency gain compared to classic methods in PLM. Our method is further validated using two data applications by evaluating the risk factors of brain imaging measures and blood pressure.

  6. D

    Data Quality Tools Industry Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Apr 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Data Quality Tools Industry Report [Dataset]. https://www.marketreportanalytics.com/reports/data-quality-tools-industry-89686
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Apr 21, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Data Quality Tools market is experiencing robust growth, fueled by the increasing volume and complexity of data across diverse industries. The market, currently valued at an estimated $XX million in 2025 (assuming a logically derived value based on a 17.5% CAGR from a 2019 base year), is projected to reach $YY million by 2033. This substantial expansion is driven by several key factors. Firstly, the rising adoption of cloud-based solutions offers enhanced scalability, flexibility, and cost-effectiveness, attracting both small and medium enterprises (SMEs) and large enterprises. Secondly, the growing need for regulatory compliance (e.g., GDPR, CCPA) necessitates robust data quality management, pushing organizations to invest in advanced tools. Further, the increasing reliance on data-driven decision-making across sectors like BFSI, healthcare, and retail necessitates high-quality, reliable data, thus boosting market demand. The preference for software solutions over on-premise deployments and the substantial investments in services aimed at data integration and cleansing contribute to this growth. However, certain challenges restrain market expansion. High initial investment costs, the complexity of implementation, and the need for skilled professionals to manage these tools can act as barriers for some organizations, particularly SMEs. Furthermore, concerns related to data security and privacy continue to impact adoption rates. Despite these challenges, the long-term outlook for the Data Quality Tools market remains positive, driven by the ever-increasing importance of data quality in a rapidly digitalizing world. The market segmentation highlights significant opportunities across different deployment models, organizational sizes, and industry verticals, suggesting diverse avenues for growth and innovation in the coming years. Competition among established players like IBM, Informatica, and Oracle, alongside emerging players, is intensifying, driving innovation and providing diverse solutions to meet varied customer needs. Recent developments include: September 2022: MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) spin-off DataCebo announced the launch of a new tool, dubbed Synthetic Data (SD) Metrics, to help enterprises compare the quality of machine-generated synthetic data by pitching it against real data sets., May 2022: Pyramid Analytics, which developed its flagship platform, Pyramids Decision Intelligence, announced that it raised USD 120 million in a Series E round of funding. The Pyramid Decision Intelligence platform combines business analytics, data preparation, and data science capabilities with AI guidance functionality. It enables governed self-service analytics in a no-code environment.. Key drivers for this market are: Increasing Use of External Data Sources Owing to Mobile Connectivity Growth. Potential restraints include: Increasing Use of External Data Sources Owing to Mobile Connectivity Growth. Notable trends are: Healthcare is Expected to Witness Significant Growth.

  7. Supporting Data from: Physically Constrained Mass Spectrometry Data Binning...

    • zenodo.org
    Updated Apr 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hiu-Lok NGAN; Hiu-Lok NGAN; Jialing Zhang; Jialing Zhang; Kin Leung Kwan; Kin Leung Kwan; Jacinth Wing-Sum Cheu; Jacinth Wing-Sum Cheu; Li ZHONG; Li ZHONG; Yike Guo; Yike Guo; Xian Yang; Xian Yang; Carmen Chak Lui Wong; Carmen Chak Lui Wong; Hong Yan; Hong Yan; Zongwei Cai; Zongwei Cai (2025). Supporting Data from: Physically Constrained Mass Spectrometry Data Binning for Multi-Platform Untargeted Metabolomics Data Integration [Dataset]. http://doi.org/10.5281/zenodo.15251976
    Explore at:
    Dataset updated
    Apr 20, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Hiu-Lok NGAN; Hiu-Lok NGAN; Jialing Zhang; Jialing Zhang; Kin Leung Kwan; Kin Leung Kwan; Jacinth Wing-Sum Cheu; Jacinth Wing-Sum Cheu; Li ZHONG; Li ZHONG; Yike Guo; Yike Guo; Xian Yang; Xian Yang; Carmen Chak Lui Wong; Carmen Chak Lui Wong; Hong Yan; Hong Yan; Zongwei Cai; Zongwei Cai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Mar 2025
    Description

    § Supporting file 1

    A spreadsheet recording numerous iterations for hyperparameter tuning for logistic regression model.

    § Supporting file 2

    A spreadsheet recording numerous iterations for hyperparameter tuning for linear Support Vector Classifier model.

    § Supporting file 3

    A spreadsheet recording numerous iterations for hyperparameter tuning for gradient boosting model.

    § Supporting file 4

    A spreadsheet recording numerous iterations for hyperparameter tuning for eXtreme Gradient Boosting model.

    § Supporting file 5

    A spreadsheet recording numerous iterations for hyperparameter tuning for decision tree model.

    § Supporting file 6

    A spreadsheet recording numerous iterations for hyperparameter tuning for random forest model.

    § Supporting file 7

    A spreadsheet recording binning performance before data integration (batch 1).

    § Supporting file 8

    A spreadsheet recording binning performance before data integration (batch 2).

    § Supporting file 9

    A spreadsheet recording binning performance after inter-batch data integration.

    § Supporting dataset 1

    A zip collection of DESI-MSI raw data for the trainging dataset.

    § Supporting dataset 2

    A zip collection of DESI-MSI raw data for the external dataset used for data ingestion.

    § Supporting dataset 3

    A zip collection of MS raw data for the fine-needle aspiration smear DESI-MSI dataset.

    § Supporting dataset 4

    A zip collection of MS raw data for the direction infusion dataset.

    § Supporting dataset 5

    A zip collection of histograms of the m/z buckets before data integration (batch 1).

    § Supporting dataset 6

    A zip collection of histograms of the m/z buckets before data integration (batch 2).

    § Supporting dataset 7

    A zip collection of histograms of the m/z buckets after inter-batch data integration.

  8. i

    Data Integration Machines Market - Current Analysis by Market Share

    • imrmarketreports.com
    Updated Dec 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swati Kalagate; Akshay Patil; Vishal Kumbhar (2023). Data Integration Machines Market - Current Analysis by Market Share [Dataset]. https://www.imrmarketreports.com/reports/data-integration-machines-market
    Explore at:
    Dataset updated
    Dec 2023
    Dataset provided by
    IMR Market Reports
    Authors
    Swati Kalagate; Akshay Patil; Vishal Kumbhar
    License

    https://www.imrmarketreports.com/privacy-policy/https://www.imrmarketreports.com/privacy-policy/

    Description

    The report offers Data Integration Machines Market Dynamics, Comprises Industry development drivers, challenges, opportunities, threats and limitations. A report also incorporates Cost Trend of products, Mergers & Acquisitions, Expansion, Crucial Suppliers of products, Concentration Rate of Steel Coupling Economy. Global Data Integration Machines Market Research Report covers Market Effect Factors investigation chiefly included Technology Progress, Consumer Requires Trend, External Environmental Change.

  9. CSS.3.2.2.1 NaKnowBase

    • s.cnmilf.com
    • catalog.data.gov
    Updated Sep 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2023). CSS.3.2.2.1 NaKnowBase [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/css-3-2-2-1-naknowbase
    Explore at:
    Dataset updated
    Sep 17, 2023
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    Product CSS.3.2.2 includes three inter-related components, the delivery of which completes this product. This product represents updates to the NaKnowBase database, which has been cleared under STICS Public accessibility for NaKnowBase ORD-043098. The first component is a tool to automate formatting of ENM data into the standard and universally accepted ISO-TAB Nano format. We have written this code to both WRITE (export) NKB data in ISO-TAB Nano format, as well as READ (input) external data already in the ISO-TAB Nano format for potential inclusion into NKB. The code and corresponding documentation for this tool are made available to the public via the EPA Office of Research and Development at: https://gaftp.epa.gov/EPADataCommons/ORD/NaKnowBase/. The second component is an application, entitled “OntoSearcher”, that automates ontological term mapping for a given ENM dataset. We have developed this code to read in external partner ENM data, and map those data to ontological terms with reported diagnostics on speed and accuracy. This is the first step in the development of a common language for ENMs, aims to minimize necessary human curation time and is critical to EPA efforts to integrate across Federal ENM datasets in a FAIR (Findable, Accessible, Interoperable, Accessible) way. The code and corresponding documentation for this application are made available to the public via the EPA Office of Research and Development at: https://gaftp.epa.gov/EPADataCommons/ORD/NaKnowBase/. The third component is the integration of NaKnowBase ENM data with the EPA Chemistry Dashboard. Currently, we have 373 chemical structure mapped on the Dashboard at https://comptox.epa.gov/dashboard/chemical_lists/NAKNOWBASE. This collaborative, intra-Agency effort between CCTE and CPHEA continues as we update NKB ENMs, establish web-services to update NKB-Dashboard integration with the NKB application, build on our EPA standard nomenclature for ENMs (Beach et al.(2021)), and continue our semantic mapping efforts with Federal and International collaborators.

  10. d

    Human-gpDB

    • dknet.org
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Human-gpDB [Dataset]. http://identifiers.org/RRID:SCR_006223
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    A publicly accessible, relational database of human G-Proteins and their interactions with human GPCRs and Effectors. Advanced data integration techniques make Human-gpDB very rich in context since all of the bioentities are linked to a rich variety of external data sources. High quality visualization methods make the networks more informative and the extraction of information easier. Human-gpDB is currently a very useful tool for drug targeting investigation. The sequences of G-Proteins and GPCRs are classified according to a hierarchy of different classes, families and sub-families, whereas the Effectors sequences are classified in families, subfamilies and types, based on extensive literature search. The classification of GPCRs follows the IUPHAR classification, while the Effectors classification is a unique feature and is based on their function. The database currently holds information about 713 human GPCRs, 36 human G-Proteins and 99 human Effectors. The collection of the information about the interactions between these molecules was done manually and the current status of Human-gpDB reveals information about 1663 connections between GPCRs and G-Proteins and 1618 connections between G-Proteins and Effectors.

  11. D

    Data Fusion Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Dec 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Data Fusion Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/data-fusion-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Dec 3, 2024
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Data Fusion Market Outlook




    The global data fusion market size was valued at approximately USD 11 billion in 2023 and is expected to reach around USD 29 billion by 2032, growing at a robust compound annual growth rate (CAGR) of 11.5% during the forecast period. This significant growth is driven by the increasing demand for advanced analytics and decision-making processes across various industries. The need to integrate large volumes of data from multiple sources, including IoT devices, enterprise systems, and external data feeds, has become crucial for businesses aiming to gain competitive advantages and fuel innovation. Furthermore, the growing adoption of artificial intelligence and machine learning technologies enhances the capabilities of data fusion solutions, allowing organizations to extract valuable insights and drive strategic initiatives.




    One of the key growth factors for the data fusion market is the exponential increase in data generation across all sectors. With the proliferation of digital transformation, businesses are generating massive amounts of data from diverse sources, including social media, sensors, and enterprise applications. The ability to effectively integrate and analyze this data is critical for organizations aiming to make data-driven decisions. The rise of the Internet of Things (IoT) has further magnified the need for efficient data fusion solutions. IoT devices generate vast amounts of real-time data, and the capability to aggregate, process, and analyze this data is essential for optimizing operations and enhancing customer experiences. As more devices become interconnected, the demand for data fusion solutions that can handle large-scale, heterogeneous data sets will continue to grow.




    Another significant growth factor is the increasing need for enhanced security and intelligence in defense and security sectors. Governments and defense organizations worldwide are investing heavily in data fusion technologies to improve situational awareness and decision-making capabilities. Data fusion aids in integrating data from various intelligence sources, enabling defense forces to identify potential threats and respond promptly. The ability to combine data from satellite imagery, radar systems, and other intelligence sources provides a comprehensive view, enhancing the effectiveness of defense operations. Moreover, the ongoing advancements in machine learning and artificial intelligence are further augmenting the capabilities of data fusion technologies in predictive analytics and threat detection, driving market growth in the defense and security sector.




    The healthcare and life sciences industry also presents significant growth opportunities for the data fusion market. The integration of data from electronic health records, patient monitoring systems, and genomic data is crucial for improving patient outcomes and advancing precision medicine. Data fusion enables healthcare providers to gain a holistic view of patient information, facilitating personalized treatment plans and early diagnosis of diseases. The COVID-19 pandemic has further accelerated the adoption of data fusion technologies in healthcare, as the need for real-time data integration and analysis became paramount in managing the crisis. As healthcare organizations continue to prioritize data-driven approaches, the demand for robust data fusion solutions that ensure data accuracy, privacy, and interoperability is expected to increase.




    Regionally, North America is anticipated to hold the largest share of the data fusion market, driven by the presence of key market players and the early adoption of advanced technologies. The high demand for data-driven insights across industries, such as defense, healthcare, and finance, is a significant growth driver in this region. Additionally, government initiatives promoting digital transformation and data-driven decision-making further contribute to market expansion. The Asia-Pacific region is expected to witness the highest growth rate during the forecast period, owing to rapid industrialization, increasing investments in smart city projects, and the rising adoption of IoT technologies. The growing focus on enhancing operational efficiency and customer experience is fueling the demand for data fusion solutions in countries like China, India, and Japan, making Asia-Pacific a lucrative market for data fusion providers.



    Component Analysis




    The data fusion market is segmented into three major components: software, hardware, and services. Each of these compon

  12. U

    US Business Intelligence Market Report

    • promarketreports.com
    doc, pdf, ppt
    Updated Jan 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pro Market Reports (2025). US Business Intelligence Market Report [Dataset]. https://www.promarketreports.com/reports/us-business-intelligence-market-8130
    Explore at:
    pdf, ppt, docAvailable download formats
    Dataset updated
    Jan 8, 2025
    Dataset authored and provided by
    Pro Market Reports
    License

    https://www.promarketreports.com/privacy-policyhttps://www.promarketreports.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    United States, Global
    Variables measured
    Market Size
    Description

    The size of the US Business Intelligence Market was valued at USD 19942.01 million in 2023 and is projected to reach USD 38369.43 million by 2032, with an expected CAGR of 9.80% during the forecast period. Business Intelligence (BI) refers to the technologies, processes, and practices used to collect, analyze, and present business data in a meaningful way to support decision-making within an organization. BI involves a wide range of tools and techniques, including data mining, reporting, performance management, analytics, and querying, to convert raw data into actionable insights. By integrating data from various sources such as internal databases, external data providers, and cloud platforms, BI enables companies to gain a comprehensive view of their operations, market trends, customer behavior, and financial performance. This growth is driven by factors such as the increasing adoption of data-driven decision-making, the need for real-time insights, and advancements in artificial intelligence (AI) and machine learning (ML) technologies. The market benefits from the integration of BI with other technologies such as cloud computing, big data, and the Internet of Things (IoT). Additionally, government initiatives promoting data transparency and accountability, as well as rising data security concerns, are contributing to the growth of the US Business Intelligence Market. Recent developments include: In January 2023, Microsoft launched Power Bl in Microsoft Teams to enhance user experiences. The announcements include three new features: rich broadcast cards for Chat in Microsoft Teams, an update for classic Power Bl tabs for Channels 2.0, and listening to and learning from experiences and requirements., In December 2022, Tableau released its improved Tableau 2022.4 for business users and analysts to discover insights. It automates the creation, analysis, and communication of insights through data stories like Data Change Radar, Data Guide, and Explain the Viz., In November 2022, Qlik introduced a new cloud-based data integration platform. The sophisticated platform as a service brings together catalog capabilities and data preparation in one place. The new integration enables firms to do real-time data analysis. The advanced platform includes a number of services that combine to form a data fabric, connecting data sources and providing an organization with an integrated view of its data.. Notable trends are: Increased capital infusion promotes market growth.

  13. c

    ckanext-powerview

    • catalog.civicdataecosystem.org
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). ckanext-powerview [Dataset]. https://catalog.civicdataecosystem.org/dataset/ckanext-powerview
    Explore at:
    Dataset updated
    Jun 4, 2025
    Description

    The PowerView extension for CKAN enables users to configure data sources in order to power views for one or more resources. This extension provides the required infrastructure and actions to define and manage how CKAN visualizes data from various sources. By leveraging this extension, CKAN administrators can define specific configurations tailored to presenting datasets effectively. Key Features: Data Source Configuration: Allows administrators to configure data sources that drive views within CKAN, enabling seamless integration of external data sources. Action API: All actions related to PowerView, facilitating automated management and integration within CKAN workflows are exposed via the CKAN Action API. Extensible View Management: Supports creation of configurable views based on defined data sources for one or more available resources. Technical Integration: The PowerView extension integrates directly with the CKAN Action API, which allows for seamless incorporation into custom workflows and automation processes. By adding powerview to the ckan.plugins setting in the CKAN configuration file, the extension is activated within a CKAN instance. This modification allows for PowerView actions to be managed. Also, this Extension will require SQL tables to be created within CKAN to properly record data mappings and associated data. Benefits & Impact: By using the PowerView extension, users can display different types of views, powered by specific data sources (e.g., databases, APIs, etc.). This facilitates better data presentation and simplifies dataset curation to visualize data from multiple data entities within the ecosystem.

  14. C

    CRM Integration Services Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jan 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). CRM Integration Services Report [Dataset]. https://www.datainsightsmarket.com/reports/crm-integration-services-493676
    Explore at:
    pdf, doc, pptAvailable download formats
    Dataset updated
    Jan 27, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The CRM Integration Services Market is projected to grow from USD 5022 million in 2025 to USD 8469.05 million by 2033, at a CAGR of 6.1%. The growth of this market is attributed to the increasing adoption of CRM systems by businesses of all sizes, the need for better customer relationship management, and the rising demand for data integration and analytics. Large enterprises are the major contributors to the market, owing to their complex business operations and need for robust CRM systems. However, SMEs are also expected to witness a significant growth in the market, as they increasingly recognize the benefits of CRM integration. The market for CRM Integration Services is segmented based on application, type, and region. By application, the market is segmented into large enterprises and SMEs. By type, the market is segmented into internal system integration and external system integration. By region, the market is segmented into North America, South America, Europe, Middle East & Africa, and Asia Pacific. North America is the largest market for CRM Integration Services, followed by Europe and Asia Pacific. The growth in these regions is attributed to the presence of a large number of businesses and the increasing adoption of CRM systems.

  15. D

    Data-As-A-Service (Daas) Market Report

    • marketreportanalytics.com
    doc, pdf, ppt
    Updated Mar 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Report Analytics (2025). Data-As-A-Service (Daas) Market Report [Dataset]. https://www.marketreportanalytics.com/reports/data-as-a-service-daas-market-10653
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Mar 18, 2025
    Dataset authored and provided by
    Market Report Analytics
    License

    https://www.marketreportanalytics.com/privacy-policyhttps://www.marketreportanalytics.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The Data-as-a-Service (DaaS) market is experiencing rapid growth, projected to reach $19.20 billion in 2025 and exhibiting a remarkable Compound Annual Growth Rate (CAGR) of 46.01%. This explosive expansion is fueled by several key factors. The increasing adoption of cloud computing and the rising demand for real-time data analytics across diverse sectors, including BFSI (Banking, Financial Services, and Insurance), retail, and telecommunications, are major drivers. Businesses are increasingly recognizing the strategic advantage of accessing and utilizing external data sources to enhance decision-making, improve operational efficiency, and gain a competitive edge. Furthermore, the shift towards data-driven strategies and the growing need for advanced data management capabilities are significantly contributing to the DaaS market's growth trajectory. The diverse range of deployment models, including cloud and on-premises solutions, caters to the varying needs and preferences of businesses across different sizes and industries. North America currently holds a significant market share, driven by early adoption and robust technological infrastructure. However, the APAC region is projected to witness substantial growth in the coming years, fueled by increasing digitalization and a burgeoning data landscape. While the DaaS market presents significant opportunities, certain challenges remain. Data security and privacy concerns are paramount, requiring robust security protocols and compliance measures. The complexity of data integration and the need for skilled professionals to manage and interpret data also pose potential hurdles for market adoption. However, ongoing technological advancements in data management, analytics, and security are continuously mitigating these concerns. The competitive landscape is also becoming increasingly dynamic, with established technology giants and specialized DaaS providers vying for market share. This competitive intensity is expected to drive innovation and further accelerate market growth, leading to a broader range of solutions and services available to businesses. Strategic partnerships and acquisitions are also becoming more prevalent as companies seek to expand their offerings and strengthen their market positions.

  16. US Enterprise Data Management Market For BFSI Sector - Size and Forecast...

    • technavio.com
    Updated Nov 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2024). US Enterprise Data Management Market For BFSI Sector - Size and Forecast 2024-2028 [Dataset]. https://www.technavio.com/report/enterprise-data-management-market-for-bfsi-sector-market-industry-analysis
    Explore at:
    Dataset updated
    Nov 15, 2024
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    United States
    Description

    Snapshot img

    US Enterprise Data Management Market Size 2024-2028

    The US enterprise data management market size is forecast to increase by USD 5.59 billion at a CAGR of 13.6% between 2023 and 2028.

    The market, including Enterprise Data Management (EDM) software, is experiencing significant growth due to increasing demand for data integration and visual analytics. The BFSI industry's reliance on data warehousing and data security continues to drive market expansion. Technological advancements, such as artificial intelligence and machine learning are revolutionizing EDM solutions, offering enhanced capabilities for data processing and analysis. However, the high cost of implementing these advanced EDM solutions remains a challenge for some organizations. Additionally, data security concerns and the need for regulatory compliance are ongoing challenges that require continuous attention and investment. In the telecom sector, the trend towards digital transformation and the generation of vast amounts of data are fueling the demand for strong EDM solutions. Overall, the EDM software market is expected to continue its growth trajectory, driven by these market trends and challenges.
    

    What will be the size of the US Enterprise Data Management Market during the forecast period?

    Request Free Sample

    The Enterprise Data Management (EDM) market in the BFSI sector is experiencing significant growth due to the industry's expansion and strict regulations. With the increasing volume, velocity, and complexity of data, IT organizations in banks and other financial institutions are prioritizing EDM solutions to handle massive datasets and ensure information accuracy. These systems enable data synchronization, address validation, and single-source reporting, addressing data conflicts and silos that hinder effective business operations. EDM solutions are essential for both internal applications and external communication, allowing for leveraging analytics to gain a competitive edge. In the BFSI sector, where risk control is paramount, EDM plays a crucial role in managing and consuming datasets efficiently.
    The market is characterized by a competitive environment, with IT investments focused on multiuser functionality and Big Data capabilities to meet the diverse needs of various business verticals, including manufacturing and services industries. Overall, EDM is a strategic imperative for businesses seeking to stay competitive and compliant in today's data-driven economy.
    

    How is this market segmented and which is the largest segment?

    The market research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD billion' for the period 2024-2028, as well as historical data from 2018-2022 for the following segments.

    Deployment
    
      On-premises
      Cloud
    
    
    Ownership
    
      Large enterprise
      Small and medium enterprise
    
    
    End-user
    
      Commercial banks
      Savings institutions
    
    
    Geography
    
      US
    

    By Deployment Insights

    The on-premises segment is estimated to witness significant growth during the forecast period. The BFSI sector in the US is witnessing a significant expansion in the enterprise data management market, driven by strict regulations and the competitive environment. Large organizations, including commercial banks, insurance companies, and non-banking financial institutions, are prioritizing data management to ensure information accuracy and risk control. Enterprise Data Management (EDM) solutions are crucial for internal applications and external communication, enabling data synchronization and business operations. Leveraging analytics, IT organizations manage vast datasets and datasets' consumption, addressing data conflicts and ensuring data quality for reporting. EDM encompasses handling massive data through Business Analytics, ETL tools, data pipelines, and data warehouses, as well as data visualization tools.
    

    Get a glance at the market share of various segments Request Free Sample

    The on-premises segment was valued at USD 2.9 billion in 2018 and showed a gradual increase during the forecast period.

    Market Dynamics

    Our researchers analyzed the data with 2023 as the base year, along with the key drivers, trends, and challenges. A holistic analysis of drivers will help companies refine their marketing strategies to gain a competitive advantage.

    What are the key market drivers leading to the rise in adoption of US Enterprise Data Management Market?

    Growing demand for data integration and visual analytics is the key driver of the market. In the BFSI sector, strict regulations necessitate the effective management of large volumes of structured and unstructured data. The industry's expansion and competitive environment necessitate the need for advanced data management solutions. Enterprises are leveraging Enterprise Data Management (EDM) systems to address the challenges of data synchronization, internal
    
  17. m

    Data from: Integration of Meta-Multi-Omics Data Using Probabilistic Graphs...

    • metabolomicsworkbench.org
    zip
    Updated Aug 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sophie Alvarez (2023). Integration of Meta-Multi-Omics Data Using Probabilistic Graphs and External Knowledge [Dataset]. https://www.metabolomicsworkbench.org/data/DRCCMetadata.php?Mode=Study&StudyID=ST002741
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 10, 2023
    Dataset provided by
    University of Nebraska-Lincoln
    Authors
    Sophie Alvarez
    Description

    Multi-omics has the promise to provide a detailed molecular picture for biological systems. Although obtaining multi-omics data is relatively easy, methods that analyze such data have been lagging. In this paper, we present an algorithm that uses probabilistic graph representations and external knowledge to perform optimum structure learning and deduce a multifarious interaction network for multi-omics data from a bacterial community. Kefir grain, a microbial community that ferments milk and creates kefir, represents a self-renewing, stable, natural microbial community. Kefir has been shown to associate with a wide range of health benefits. We obtained a controlled bacterial community using the two most abundant and well-studied species in kefir grains: Lentilactobacillus kefiri and Lactobacillus kefiranofaciens. We applied growth temperatures of 30°C and 37°C, and obtained transcriptomic, metabolomic, and proteomic data for the same 20 samples (10 samples per temperature). We obtained a multi-omics interaction network, which generated insights that would not have been possible with single-omics analysis. We identified interactions among transcripts, proteins, and metabolites suggesting active toxin/antitoxin systems. We also observed multifarious interactions that involved the shikimate pathway. These observations helped explain bacterial adaptation to different stress conditions, co-aggregation, and increased activation of L. kefiranofaciens at 37°C.

  18. c

    Data Monetization market size will be $14.96 Billion by 2030!

    • cognitivemarketresearch.com
    pdf,excel,csv,ppt
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cognitive Market Research, Data Monetization market size will be $14.96 Billion by 2030! [Dataset]. https://www.cognitivemarketresearch.com/data-monetization-market-report
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset authored and provided by
    Cognitive Market Research
    License

    https://www.cognitivemarketresearch.com/privacy-policyhttps://www.cognitivemarketresearch.com/privacy-policy

    Time period covered
    2021 - 2033
    Area covered
    Global
    Description

    According to Cognitive Market Research, the Global Data Monetization market size will be $14.96 Billion by 2030. Data Monetization Industry's Compound Annual Growth Rate will be 21.83% from 2023 to 2030.

    The global Data Monetization market will expand significantly by XX% CAGR between 2023 to 2031.
    North America held the major market of more than XX% of the global revenue with a market size of USD XX million in 2023 and will grow at a compound annual growth rate (CAGR) of XX% from 2023 to 2031.
    Large Enterprises held the highest Data Monetization market revenue share in 2023.
    

    Market Dynamics of the Data Monetization Market

    Key Drivers of the Data Monetization Market

    Rising Demand for Data Integration Drive the Data Monetization Market Further

    The Data Monetization market is buoyed due to factors including growing enterprise data volume, awareness of data monetization, and external data sources. Growth is also anticipated to be fuelled by the use of big analytics innovations, artificial intelligence, data-driven decision-making techniques, and data processing. Optimising data consumption, improving customer loyalty, cutting operating expenses, enhancing compliance, raising profitability, fortifying alliances, and improving customer experience are all benefits of data monetization. In addition, it promotes targeted marketing, simplifies planning, fosters better teamwork, and adds value to goods and services.

    For instance, Cisco states in its Annual Internet Report that by 2023, about two thirds of people on the planet would have access to the Internet. By 2023, there will be 5.3 billion Internet users worldwide, or 66% of the world's population, up from 3.9 billion in 2018 (or 51% of the world's population). As a result, the necessity for data monetization is going to explode.

    https://www.cisco.com/c/en/us/solutions/executive-perspectives/annual-internet-report/

    Maximizing Revenue Potential Propels the Data Monetization Market Growth

    A critical component of company is data monetization, which uses client information to generate extra income. It is essential for customer service, upselling, and churn reduction. The monetization of data provides insights into new business categories and is fuelled by cutting edge technology like as cloud computing, big data, IoT, and artificial intelligence. Many players have chosen to invest in this strategy due to its capacity to synthesise and integrate thousands of data, IoT records, and advanced analytics, which enables companies to establish new business categories and spur market expansion. For businesses to succeed, they must be able to extract significant revenue from the data sources that are readily available.

    For instance, data monetization enables businesses to leverage their artificial intelligence (AI) capabilities and data assets to generate real economic value, according to an IBM article. Data products are used in this value exchange system to improve business performance, obtain a competitive edge, and solve market demands while addressing industry difficulties.

    https://www.ibm.com/blog/unlocking-financial-benefits-through-data-monetization/

    Key Restraints of the Data Monetization Market

    Quality and Standardization Challenges in Data Monetization Restrict the Market Growth

    Data monetization relies on data quality and standardization to ensure accuracy, comprehensiveness, consistency, reliability, and relevance. However, businesses face challenges in ensuring data quality due to factors like data fragmentation, data silos, and inconsistency. This lack of data quality and standards hinders the market for data monetization, negatively impacting the worth and usability of data, and increasing costs and complexity. Addressing these issues is crucial for ensuring the success of data monetization.

    For instance, according to EY's Data Quality Management reports, low response rates and sampling flaws in 2016 tainted the polling data used to forecast the result of the US presidential election, leading to erroneous analysis and forecasts.

    https://assets.ey.com/content/dam/ey-sites/ey-com/en_ca/topics/ai/ey-data-quality-management-discussion-paper.pdf

    Impact of COVID-19 on the Data Monetization Market

    The COVID-19 pandemic caused travel and logistical limitations, which increased the effect on industries. The expansion of the data monetisation business was somewhat hindered by this, as there ...

  19. c

    ckanext-rdfstoreimporter

    • catalog.civicdataecosystem.org
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). ckanext-rdfstoreimporter [Dataset]. https://catalog.civicdataecosystem.org/dataset/ckanext-rdfstoreimporter
    Explore at:
    Dataset updated
    Jun 4, 2025
    Description

    The rdfstoreimporter extension for CKAN facilitates the synchronization of CKAN datasets with external RDF (Resource Description Framework) stores, such as Virtuoso. This synchronization empowers users to link CKAN's data management capabilities with the structured data environment provided by RDF stores. The extension enhances CKAN's ability to work seamlessly with semantic web technologies, providing a bridge between traditional data catalogs and linked data repositories. Key Features: RDF Store Synchronization: Allows automated synchronization of CKAN datasets with an external RDF store, which enables consistent data representation and availability across different platforms. Virtuoso Compatibility: Specifically mentions compatibility with Virtuoso, a popular RDF store, ensuring users can integrate CKAN with a widely used semantic data management system. Command-Line Interface (CLI) Execution: Provides a command-line interface for triggering the RDF store synchronization process, offering flexibility and control in managing the synchronization tasks. Technical Integration: The rdfstoreimporter extension integrates with CKAN by extending its core functionalities through the addition of a plugin. To enable the extension, users must modify the CKAN configuration file (production.ini) by adding rdfstoreimporter to the ckan.plugins setting. After modifying the configuration, a CKAN restart is required to activate the extension to ensure proper functionality. Benefits & Impact: By implementing the rdfstoreimporter extension, CKAN installations can benefit from enhanced data interoperability and semantic enrichment. Synchronizing datasets with RDF stores makes it easier to describe, link, and query data using semantic web standards. This can lead to: Improved Data Discoverability: Representing CKAN datasets in RDF format enhances their discoverability by semantic web crawlers and search engines. Enhanced Data Integration: Linking CKAN datasets to external RDF knowledge graphs can facilitate easier integration of data from different sources. Facilitated Semantic Analysis: Storing CKAN data in RDF stores enables sophisticated semantic analysis and reasoning, leading to new insights and knowledge discovery.

  20. Cloud Data Warehouse Market Analysis, Size, and Forecast 2025-2029: North...

    • technavio.com
    Updated Jun 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2025). Cloud Data Warehouse Market Analysis, Size, and Forecast 2025-2029: North America (US, Canada, and Mexico), Europe (France, Germany, Italy, and UK), APAC (China, India, and Japan), and Rest of World (ROW) [Dataset]. https://www.technavio.com/report/cloud-data-warehouse-market-industry-analysis
    Explore at:
    Dataset updated
    Jun 13, 2025
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Global, United States
    Description

    Snapshot img

    Cloud Data Warehouse Market Size 2025-2029

    The cloud data warehouse market size is forecast to increase by USD 63.91 billion at a CAGR of 43.3% between 2024 and 2029.

    The market is experiencing significant growth, driven by the increasing penetration of IoT-enabled devices generating vast amounts of data. This data requires efficient storage and analysis, making cloud data warehouses an attractive solution due to their scalability and flexibility. Additionally, the growing need for edge computing further fuels market expansion, as organizations seek to process data closer to its source in real-time. However, challenges persist in the form of company lock-in issues, where businesses may find it difficult to migrate their data from one cloud provider to another, potentially limiting their flexibility and strategic options.
    To capitalize on market opportunities and navigate challenges effectively, companies must stay informed of emerging trends and adapt their strategies accordingly. By focusing on interoperability and data portability, they can mitigate lock-in risks and maintain agility in their data management strategies. The market is experiencing significant growth due to several key trends. The increasing penetration of Internet of Things (IoT) devices is driving the need for more efficient data management solutions, leading to the adoption of cloud data warehouses.
    

    What will be the Size of the Cloud Data Warehouse Market during the forecast period?

    Explore in-depth regional segment analysis with market size data - historical 2019-2023 and forecasts 2025-2029 - in the full report.
    Request Free Sample

    In the dynamic market, businesses seek efficient solutions for managing and analyzing their data. Data visualization tools and business intelligence platforms enable users to gain insights through interactive dashboards and reports. Data automation tools streamline data processing, while data enrichment tools enhance data quality by adding external data sources. Data virtualization tools provide a unified view of data from various sources, and data integration tools ensure seamless data flow between systems. NoSQL databases and big data platforms offer scalability and flexibility for handling large volumes of data. Data cleansing tools eliminate errors and inconsistencies, while data encryption tools secure sensitive data.
    Data migration tools facilitate moving data between systems, and data validation tools ensure data accuracy. Real-time analytics platforms and predictive analytics platforms provide insights in near real-time, while prescriptive analytics platforms suggest actions based on data trends. Data deduplication tools eliminate redundant data, and data governance tools ensure compliance with regulations. Data orchestration tools manage workflows, and data science platforms facilitate machine learning and artificial intelligence applications. Data archiving tools store historical data, and data pipeline tools manage data movement between systems. Data fabric and data standardization tools ensure data consistency across the organization, while data replication tools maintain data availability and disaster recovery.
    

    How is this Cloud Data Warehouse Industry segmented?

    The cloud data warehouse industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2025-2029, as well as historical data from 2019-2023 for the following segments.

    Industry Application
    
      Large enterprises
      SMEs
    
    
    Deployment
    
      Public
      Private
    
    
    End-user
    
      Cloud server provider
      IT and ITES
      BFSI
      Retail
      Others
    
    
    Application
    
      Customer analytics
      Business intelligence
      Data modernization
      Operational analytics
      Predictive analytics
    
    
    Geography
    
      North America
    
        US
        Canada
        Mexico
    
    
      Europe
    
        France
        Germany
        Italy
        UK
    
    
      APAC
    
        China
        India
        Japan
    
    
      Rest of World (ROW)
    

    By Industry Application Insights

    The large enterprises segment is estimated to witness significant growth during the forecast period. In today's business landscape, cloud data warehouse solutions have gained significant traction among large enterprises, enabling them to efficiently manage and process data across various industries and geographies. Traditional on-premises data warehouses come with high costs due to the need for expensive hardware and physical space. Cloud-based alternatives offer a more cost-effective and convenient solution, allowing organizations to access tools and information remotely and streamline document sharing between multiple workplaces. Predictive analytics, data cost optimization, and data discovery are key drivers for cloud data warehouse adoption. These technologies offer insights into data trends and patterns, helping businesses make data-driven decisions.

    Data timeliness and data standardiz

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Xihao Li; Yang Song (2024). Target Population Statistical Inference With Data Integration Across Multiple Sources—An Approach to Mitigate Information Shortage in Rare Disease Clinical Trials [Dataset]. http://doi.org/10.6084/m9.figshare.9594392.v2

Data from: Target Population Statistical Inference With Data Integration Across Multiple Sources—An Approach to Mitigate Information Shortage in Rare Disease Clinical Trials

Related Article
Explore at:
txtAvailable download formats
Dataset updated
Feb 12, 2024
Dataset provided by
Taylor & Francis
Authors
Xihao Li; Yang Song
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

A major challenge for rare disease clinical trials is the limited amount of available information for making robust statistical inference. While external data present information integration opportunities to enhance statistical inference, conventional data combining methods, for example, meta-analysis, usually do not adequately address study population differences. Matching methods, on the other hand, directly account for population characteristics but often lead to inefficient use of data by underutilizing unmatched data points. Aiming at a better bias-variance tradeoff, we propose an intuitive integrated inference framework to borrow information from all relevant data sources and make inference on the response of interest over a target population precisely characterized by the joint distribution of baseline covariates. The method is easily implemented and can be complemented by modern statistical learning or machine learning tools. Statistical inference is facilitated by the bootstrap. We argue that the integrated inference framework not only provides an intuitive and coherent perspective for a variety of clinical trial inference problems but also has broad application areas in clinical trial settings and beyond, as a quantitative data integration tool for making robust inference in a target population precise manner for policy and decision makers.

Search
Clear search
Close search
Google apps
Main menu