100+ datasets found
  1. Data from: Improved Wetland Soil Organic Carbon Stocks of the Conterminous...

    • datasets.ai
    • catalog.data.gov
    Updated Aug 6, 2024
    Cite
    U.S. Environmental Protection Agency (2024). Improved Wetland Soil Organic Carbon Stocks of the Conterminous U.S. Through Data Harmonization [Dataset]. https://datasets.ai/datasets/improved-wetland-soil-organic-carbon-stocks-of-the-conterminous-u-s-through-data-harmoniza
    Dataset updated
    Aug 6, 2024
    Dataset authored and provided by
    U.S. Environmental Protection Agency
    Area covered
    Contiguous United States, United States
    Description

    Public data used for data harmonization.

    This dataset is associated with the following publication: Uhran, B., L. Windham-Myers, N. Bliss, A. Nahlik, E. Sundquist, and C. Stagg. Improved Wetland Soil Organic Carbon Stocks of the Conterminous U.S. Through Data Harmonization. Frontiers in Soil Science. Frontiers, Lausanne, SWITZERLAND, 1: 706701, (2021).

  2. Multi-Omics Clinical Data Harmonization Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Cite
    Dataintelo (2025). Multi-Omics Clinical Data Harmonization Market Research Report 2033 [Dataset]. https://dataintelo.com/report/multi-omics-clinical-data-harmonization-market
    Available download formats: pptx, pdf, csv
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Multi-Omics Clinical Data Harmonization Market Outlook



    According to our latest research, the global Multi-Omics Clinical Data Harmonization market size reached USD 1.65 billion in 2024, reflecting robust adoption across healthcare and life sciences. With a compound annual growth rate (CAGR) of 14.2% projected from 2025 to 2033, the market is anticipated to reach USD 4.65 billion by 2033. This growth is primarily driven by the escalating integration of multi-omics approaches in clinical research, the increasing demand for personalized medicine, and the urgent need to standardize complex biological data for actionable insights. The market's expansion is also underpinned by technological advancements and the broadening scope of omics-based applications in diagnostics and therapeutics.
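    The growth figures quoted in such market summaries can be related through the standard compound annual growth rate formula, CAGR = (end / start)^(1 / years) - 1. The sketch below is a minimal Python illustration. The function names are hypothetical, and treating 2024 to 2033 as nine compounding periods is an assumption: the listing does not state the report's base year or rounding conventions, so the implied rate need not equal the quoted one exactly.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Return the compound annual growth rate as a fraction (0.10 means 10%)."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    return (end_value / start_value) ** (1.0 / years) - 1.0


def project(start_value: float, rate: float, years: int) -> float:
    """Project a value forward by compounding `rate` once per year."""
    return start_value * (1.0 + rate) ** years


if __name__ == "__main__":
    # Figures as quoted in the listing (USD billions); nine periods is an assumption.
    implied = cagr(1.65, 4.65, years=9)
    print(f"Implied CAGR over 9 periods: {implied:.1%}")
    print(f"Value after 9 periods at 14.2%: {project(1.65, 0.142, 9):.2f}")
```

    Comparing the implied rate with the quoted one is a quick consistency check on a report summary. A mismatch may only reflect a different base year or rounding.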




    The rapid growth of the Multi-Omics Clinical Data Harmonization market can be attributed to several key factors. One of the most significant drivers is the exponential increase in biological data generated from next-generation sequencing and other high-throughput omics platforms. As researchers and clinicians seek to unravel the complexities of human health and disease, the need to integrate and harmonize disparate data types—such as genomics, proteomics, metabolomics, and transcriptomics—has become paramount. This harmonization enables a more comprehensive understanding of disease mechanisms, facilitating the identification of novel biomarkers and therapeutic targets. Moreover, regulatory bodies and funding agencies are increasingly emphasizing data standardization and interoperability, further fueling demand for robust harmonization solutions.




    Another major growth factor is the accelerating adoption of precision medicine initiatives worldwide. The shift from one-size-fits-all therapies to tailored treatment regimens necessitates the integration of multi-omics data with clinical and phenotypic information. Harmonized data platforms empower clinicians and researchers to draw meaningful correlations between omics signatures and patient outcomes, thereby enhancing diagnostic accuracy and enabling the development of personalized therapeutic strategies. Pharmaceutical and biotechnology companies, in particular, are leveraging multi-omics harmonization to streamline drug discovery pipelines, improve patient stratification, and optimize clinical trial designs, contributing to significant market growth.




    Technological innovation plays a central role in propelling the Multi-Omics Clinical Data Harmonization market forward. Advances in artificial intelligence, machine learning, and cloud computing have revolutionized the way multi-omics data is processed, integrated, and analyzed. Sophisticated software platforms now offer automated data curation, normalization, and annotation, reducing manual errors and accelerating research timelines. Additionally, collaborative efforts between academic institutions, healthcare providers, and industry stakeholders have led to the establishment of large-scale multi-omics databases and consortia, further driving market expansion. The growing focus on data privacy, security, and regulatory compliance also shapes market dynamics, prompting continuous innovation in harmonization technologies.




    Regionally, North America remains the dominant force in the Multi-Omics Clinical Data Harmonization market, accounting for the largest share in 2024. The region's leadership is attributed to its advanced healthcare infrastructure, significant investments in omics research, and a strong presence of key market players. Europe follows closely, leveraging robust public-private partnerships and supportive regulatory frameworks. Meanwhile, the Asia Pacific region is witnessing the fastest growth, fueled by increasing government initiatives, expanding healthcare access, and rising awareness of precision medicine. Latin America and the Middle East & Africa, though currently smaller markets, are expected to demonstrate steady growth as they enhance their research capabilities and digital health ecosystems.



    Solution Analysis



    The Solution segment of the Multi-Omics Clinical Data Harmonization market is bifurcated into software and services, each playing a pivotal role in enabling seamless integration and analysis of diverse omics datasets. Software solutions encompass a wide range of platforms and tools designed to automate data normalization, annotation, and integ

  3. Harmonized LUCAS dataset (ST_LUCAS)

    • data.niaid.nih.gov
    • zenodo.org
    Updated Apr 11, 2025
    Cite
    Martin Landa; Lukáš Brodský; Tomáš Bouček; Lena Halounová; Ondřej Pešek (2025). Harmonized LUCAS dataset (ST_LUCAS) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7777474
    Dataset updated
    Apr 11, 2025
    Dataset provided by
    CTU in Prague
    Authors
    Martin Landa; Lukáš Brodský; Tomáš Bouček; Lena Halounová; Ondřej Pešek
    Description

    ST_LUCAS is a harmonized dataset derived from the LUCAS (Land Use and Coverage Area frame Survey) dataset. LUCAS is a Eurostat activity that has performed repeated in situ surveys over Europe every three years since 2006. Original LUCAS data (https://ec.europa.eu/eurostat/web/lucas/data), starting with the 2006 survey, were harmonized into a common nomenclature based on the 2018 survey. The ST_LUCAS dataset is provided in two versions:

    lucas_points: each LUCAS survey is represented by a single record

    lucas_st_points: each LUCAS point is represented by a single location calculated from multiple surveys and by a set of harmonized attributes for each survey year

    Harmonization and space-aggregation of LUCAS data were performed by the ST_LUCAS system, available from https://geoforall.fsv.cvut.cz/st_lucas. The methodology is described in Landa, M.; Brodský, L.; Halounová, L.; Bouček, T.; Pešek, O. Open Geospatial System for LUCAS In Situ Data Harmonization and Distribution. ISPRS Int. J. Geo-Inf. 2022, 11, 361. https://doi.org/10.3390/ijgi11070361.

    List of harmonized LUCAS attributes: https://geoforall.fsv.cvut.cz/st_lucas/tables/list_of_attributes.html

    The ST_LUCAS dataset is provided under the same conditions (“free of charge”) as the original LUCAS data (https://ec.europa.eu/eurostat/web/lucas/data).

  4. Data Harmonization Procedures

    • odgavaprod.ogopendata.com
    • catalog.data.gov
    html
    Updated Sep 6, 2025
    Cite
    Administration for Children and Families (2025). Data Harmonization Procedures [Dataset]. https://odgavaprod.ogopendata.com/dataset/data-harmonization-procedures
    Available download formats: html
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    Administration for Children and Families
    Description

    ACF Agency Wide resource

    Metadata-only record linking to the original dataset. Open original dataset below.

  5. EO Data Harmonization Pipelines Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Cite
    Dataintelo (2025). EO Data Harmonization Pipelines Market Research Report 2033 [Dataset]. https://dataintelo.com/report/eo-data-harmonization-pipelines-market
    Available download formats: pptx, pdf, csv
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EO Data Harmonization Pipelines Market Outlook



    According to our latest research, the global EO Data Harmonization Pipelines market size reached USD 2.17 billion in 2024, with a robust compound annual growth rate (CAGR) of 13.2% projected through the forecast period. By 2033, the market is expected to attain a value of USD 6.19 billion. This growth is primarily driven by the surging demand for integrated, high-quality Earth Observation (EO) data across various sectors, including environmental monitoring, agriculture, and urban planning, as organizations increasingly seek actionable insights from multi-source geospatial datasets.




    The exponential increase in the volume and diversity of EO data sources has emerged as a primary growth factor for the EO Data Harmonization Pipelines market. Organizations now rely on satellite imagery, aerial photographs, UAV data, and ground-based sensors to monitor and analyze dynamic terrestrial and atmospheric phenomena. However, the heterogeneity and varying formats of these datasets have posed significant challenges for seamless integration and analysis. The development and adoption of sophisticated EO data harmonization pipelines have become essential, enabling the conversion, standardization, and fusion of disparate data streams into coherent, analysis-ready datasets. This capability not only enhances the accuracy and reliability of downstream analytics but also accelerates decision-making processes in critical domains such as disaster management, climate change assessment, and precision agriculture.




    Another pivotal driver is the rapid technological advancement in cloud computing, artificial intelligence, and machine learning, which has revolutionized the EO data harmonization landscape. Cloud-based platforms now offer scalable, on-demand processing power, allowing for real-time harmonization of massive EO datasets. AI-powered algorithms automate data cleansing, normalization, and feature extraction, significantly reducing manual intervention and operational costs. These innovations have democratized access to EO data harmonization solutions, making them accessible to a broader spectrum of end-users, from government agencies and research institutes to commercial enterprises. The integration of these advanced technologies not only improves the efficiency of EO data pipelines but also opens new avenues for developing predictive models and geospatial intelligence solutions.




    The increasing focus on sustainability and environmental stewardship has further amplified the demand for EO data harmonization pipelines. Governments and international organizations are investing heavily in monitoring land use, water resources, and atmospheric conditions to meet regulatory requirements and inform policy decisions. Harmonized EO data enables comprehensive, cross-border analyses that are vital for addressing global challenges such as deforestation, urban sprawl, and natural disasters. As regulatory frameworks around data quality and interoperability become more stringent, organizations are compelled to invest in robust harmonization solutions to ensure compliance and maintain data integrity. This regulatory push, combined with growing public and private sector awareness of the value of harmonized EO data, is expected to sustain market growth over the coming decade.




    Regionally, North America and Europe continue to dominate the EO Data Harmonization Pipelines market, accounting for a combined market share of over 60% in 2024. The United States, in particular, benefits from a mature geospatial technology ecosystem and significant investments in satellite infrastructure. Meanwhile, the Asia Pacific region is witnessing the fastest growth, driven by expanding EO satellite programs in China, India, and Japan, coupled with increasing adoption of cloud-based geospatial solutions. Latin America and the Middle East & Africa are gradually emerging as promising markets, propelled by investments in environmental monitoring and disaster management initiatives. As these regions enhance their EO capabilities, the global market is poised for sustained expansion.



    Component Analysis



    The EO Data Harmonization Pipelines market by component is segmented into software, hardware, and services. Software solutions remain the largest segment, accounting for over 45% of the market share in 2024. These platforms are integral for the automated ingestion, normalization, and fusio

  6. Multi-Omics Clinical Data Harmonization Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 7, 2025
    Cite
    Growth Market Reports (2025). Multi-Omics Clinical Data Harmonization Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/multi-omics-clinical-data-harmonization-market
    Available download formats: csv, pdf, pptx
    Dataset updated
    Oct 7, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Multi-Omics Clinical Data Harmonization Market Outlook



    According to the latest research conducted in 2025, the global Multi-Omics Clinical Data Harmonization market size stands at USD 1.47 billion in 2024. The market is experiencing robust momentum, driven by technological advancements and the growing adoption of precision medicine. With a recorded CAGR of 13.6%, the market is projected to reach USD 4.22 billion by 2033. This substantial growth is primarily fueled by the increasing integration of multi-omics datasets in clinical research and diagnostics, which is enabling more comprehensive and actionable insights into complex diseases and therapeutic responses.




    The primary growth factor propelling the Multi-Omics Clinical Data Harmonization market is the escalating demand for personalized and precision medicine. As healthcare systems globally shift towards individualized treatment regimens, the necessity to harmonize and integrate diverse omics datasets—such as genomics, proteomics, metabolomics, and transcriptomics—has become paramount. These integrated data solutions facilitate a holistic understanding of disease mechanisms, improve diagnostic accuracy, and enable the development of targeted therapies. The proliferation of next-generation sequencing technologies, coupled with the decreasing cost of omics profiling, has further democratized access to multi-omics data, thereby accelerating its utilization across clinical and research settings.




    Another significant driver is the rapid digitization of healthcare and the growing emphasis on interoperability and data standardization. The harmonization of multi-omics clinical data addresses critical challenges related to data silos, heterogeneity, and lack of standardized formats. Advanced data harmonization platforms are leveraging artificial intelligence and machine learning to automate the integration and curation of large-scale omics datasets, ensuring data quality, consistency, and compliance with regulatory standards. This technological evolution is not only enhancing the efficiency of clinical workflows but also fostering collaborations among pharmaceutical companies, research institutions, and healthcare providers.




    Furthermore, the rising investments from both public and private sectors in biomedical research are playing a pivotal role in market expansion. Governments and funding agencies worldwide are supporting large-scale multi-omics projects aimed at deciphering the molecular underpinnings of complex diseases such as cancer, neurodegenerative disorders, and rare genetic conditions. These initiatives are generating vast amounts of clinical omics data that require robust harmonization solutions for effective utilization. Additionally, the growing prevalence of chronic diseases and the increasing adoption of electronic health records (EHRs) are amplifying the demand for integrated data management platforms that can seamlessly harmonize clinical and omics datasets for improved patient outcomes.




    Regionally, North America continues to dominate the Multi-Omics Clinical Data Harmonization market, accounting for the largest share in 2024, followed by Europe and Asia Pacific. The presence of leading biotechnology firms, advanced healthcare infrastructure, and strong government support for precision medicine initiatives have positioned North America at the forefront of innovation. Meanwhile, Asia Pacific is emerging as a high-growth region, driven by expanding research capabilities, rising healthcare expenditures, and increasing adoption of multi-omics technologies in countries like China, Japan, and India. Europe also maintains a significant market presence, supported by collaborative research networks and robust regulatory frameworks for data standardization and interoperability.





    Omics Type Analysis



    The Omics Type segment of the Multi-Omics Clinical Data Harmonization market encompasses genomics, proteomics, transcriptomics, metabolomics, epigenomics, and other emerging omics disciplines. Among these, genomics

  7. EO Data Harmonization Pipelines Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 4, 2025
    Cite
    Growth Market Reports (2025). EO Data Harmonization Pipelines Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/eo-data-harmonization-pipelines-market
    Available download formats: pdf, pptx, csv
    Dataset updated
    Oct 4, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EO Data Harmonization Pipelines Market Outlook



    According to our latest research, the EO Data Harmonization Pipelines market size globally reached USD 1.94 billion in 2024, and is projected to grow at a robust CAGR of 13.2% from 2025 to 2033, culminating in a forecasted market value of USD 5.62 billion by 2033. This dynamic growth is primarily attributed to the surging demand for integrated Earth Observation (EO) data across diverse industries, driven by the need for accurate, real-time, and interoperable geospatial insights for decision-making. The market is experiencing significant advancements in data processing technologies and AI-driven harmonization tools, which are further propelling adoption rates on a global scale. As per our comprehensive analysis, the increasing complexity of EO data sources and the critical need for standardized, high-quality data pipelines remain pivotal growth factors shaping the future of this market.




    One of the primary growth drivers for the EO Data Harmonization Pipelines market is the exponential increase in the volume and variety of EO data generated by satellites, drones, and ground-based sensors. As governments, research institutions, and commercial enterprises deploy more sophisticated EO platforms, the diversity in data formats, resolutions, and temporal frequencies has created a pressing need for harmonization solutions. These pipelines enable seamless integration, cleansing, and transformation of disparate datasets, ensuring consistency and reliability in downstream analytics. The proliferation of AI and machine learning algorithms within these pipelines has further enhanced their ability to automate data normalization, anomaly detection, and metadata enrichment, resulting in more actionable and timely insights for end-users across sectors.




    Another significant factor contributing to market growth is the increasing adoption of EO data for environmental monitoring, agriculture, disaster management, and urban planning. Governments and private organizations are leveraging harmonized EO data to monitor deforestation, predict crop yields, assess disaster risks, and optimize urban infrastructure planning. The ability to harmonize multi-source data streams enables stakeholders to generate comprehensive, cross-temporal analyses that support sustainable development goals and climate resilience strategies. The integration of cloud-based platforms has democratized access to harmonized EO data, allowing even small and medium enterprises to leverage advanced geospatial analytics without substantial upfront investments in hardware or specialized personnel.




    Furthermore, the rising emphasis on interoperability and data sharing among international agencies, research institutions, and commercial providers is fueling the demand for robust EO data harmonization pipelines. Initiatives such as the Global Earth Observation System of Systems (GEOSS) and the European Copernicus program underscore the importance of standardized data frameworks for global collaboration. These trends are driving investments in open-source harmonization tools, API-driven architectures, and scalable cloud infrastructures that can support multi-stakeholder data exchange. As regulatory requirements for data quality and provenance intensify, organizations are increasingly prioritizing investments in harmonization technologies to ensure compliance and maintain competitive advantage in the rapidly evolving EO ecosystem.




    From a regional perspective, North America currently dominates the EO Data Harmonization Pipelines market, accounting for over 38% of the global market share in 2024, followed by Europe and Asia Pacific. The United States, in particular, benefits from a mature EO ecosystem, substantial government funding, and a vibrant commercial space sector. Europe’s growth is propelled by strong policy frameworks and cross-border collaborations, while Asia Pacific is rapidly emerging as a high-growth region, driven by increasing investments in satellite infrastructure and smart city initiatives. Latin America and the Middle East & Africa are also witnessing steady adoption, supported by international development programs and growing awareness of EO’s value in addressing regional challenges such as agriculture productivity and climate adaptation.



  8. PanTool – software for data harmonization and conversion, Version 1

    • dataone.org
    • doi.pangaea.de
    Updated Apr 15, 2018
    Cite
    Sieger, Rainer; Grobe, Hannes; Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research, Bremerhaven (2018). PanTool – software for data harmonization and conversion, Version 1 [Dataset]. http://doi.org/10.1594/PANGAEA.510701
    Dataset updated
    Apr 15, 2018
    Dataset provided by
    PANGAEA Data Publisher for Earth and Environmental Science
    Authors
    Sieger, Rainer; Grobe, Hannes; Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research, Bremerhaven
    Description

    The program PanTool was developed as a toolbox, like a Swiss Army knife, for data conversion and recalculation, written to harmonize individual data collections to the standard import format used by PANGAEA. The input files that PanTool needs are tables saved in plain ASCII. The user can create these files with a spreadsheet program such as MS Excel or with the system text editor. PanTool is distributed as freeware for the operating systems Microsoft Windows, Apple OS X and Linux.

  9. ACF NIEM Human Services Domain Data Harmonization Process

    • catalog.data.gov
    • odgavaprod.ogopendata.com
    Updated Sep 8, 2025
    Cite
    Administration for Children and Families (2025). ACF NIEM Human Services Domain Data Harmonization Process [Dataset]. https://catalog.data.gov/dataset/acf-niem-human-services-domain-data-harmonization-process
    Dataset updated
    Sep 8, 2025
    Dataset provided by
    Administration for Children and Families
    Description

    ACF Agency Wide resource

    Metadata-only record linking to the original dataset. Open original dataset below.

  10. Harmonized Income Dataset

    • dataverse.harvard.edu
    Updated Jan 29, 2019
    Cite
    Marta Kołczyńska; Przemek Powałko (2019). Harmonized Income Dataset [Dataset]. http://doi.org/10.7910/DVN/UE7XIJ
    Explore at:
    Croissant (a format for machine-learning datasets; see mlcommons.org/croissant)
    Dataset updated
    Jan 29, 2019
    Dataset provided by
    Harvard Dataverse
    Authors
    Marta Kołczyńska; Przemek Powałko
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    The Harmonized Income Dataset provides harmonized individual-level survey variables on personal and household income from 19 major cross-national survey projects, as well as technical variables necessary to match them to the Survey Data Recycling Master File version 1 (SDR v.1, DOI:10.7910/DVN/VWGF5Q), which contains harmonized survey items on political participation, political attitudes, as well as their selected correlates.

  11. Data from: Deep Image Harmonization

    • service.tib.eu
    Updated Dec 3, 2024
    Cite
    (2024). Deep Image Harmonization [Dataset]. https://service.tib.eu/ldmservice/dataset/deep-image-harmonization
    Dataset updated
    Dec 3, 2024
    Description

    Deep Image Harmonization.

  12. Description and harmonization strategy for the predictor variables.

    • figshare.com
    xlsx
    Updated Apr 23, 2025
    Cite
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan (2025). Description and harmonization strategy for the predictor variables. [Dataset]. http://doi.org/10.1371/journal.pone.0309572.s001
    Available download formats: xlsx
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description

    Description and harmonization strategy for the predictor variables.

  13. ComBat HarmonizR enables the integrated analysis of independently generated...

    • ebi.ac.uk
    Updated May 23, 2022
    Cite
    Hannah Voß (2022). ComBat HarmonizR enables the integrated analysis of independently generated proteomic datasets through data harmonization with appropriate handling of missing values [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD027467
    Dataset updated
    May 23, 2022
    Authors
    Hannah Voß
    Variables measured
    Proteomics
    Description

    The integration of proteomic datasets generated by non-cooperating laboratories using different LC-MS/MS setups can overcome limitations in statistically underpowered sample cohorts but has not been demonstrated to this day. In proteomics, differences in sample preservation and preparation strategies, chromatography and mass spectrometry approaches, and the quantification strategy used distort protein abundance distributions in integrated datasets. The removal of these technical batch effects requires setup-specific normalization and strategies that can deal with missing at random (MAR) and missing not at random (MNAR) type values at the same time. Algorithms for batch effect removal, such as the ComBat algorithm, commonly used for other omics types, disregard proteins with MNAR missing values and significantly reduce the informational yield and the effect size for combined datasets. Here, we present a strategy for data harmonization across different tissue preservation techniques, LC-MS/MS instrumentation setups and quantification approaches. To enable batch effect removal without the need for data reduction or error-prone imputation, we developed an extension to the ComBat algorithm, ComBat HarmonizR, that performs data harmonization with appropriate handling of MAR and MNAR missing values by matrix dissection. The ComBat HarmonizR based strategy enables the combined analysis of independently generated proteomic datasets for the first time. Furthermore, we found ComBat HarmonizR to be superior for removing batch effects between different Tandem Mass Tag (TMT)-plexes, compared to commonly used internal reference scaling (iRS). Due to the matrix dissection approach without the need of data imputation, the HarmonizR algorithm can be applied to any type of -omics data while assuring minimal data loss.
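    The description mentions matrix dissection without detailing it. One plausible reading, sketched below in Python, is to group proteins by the set of batches in which they have enough observed values, so that each resulting sub-matrix could be batch-corrected without imputation. This is an illustrative assumption, not the HarmonizR implementation: the function name, the `min_values` threshold, and the toy data are all hypothetical, and the batch-correction step itself is omitted.

```python
from collections import defaultdict


def dissect_by_batch_presence(matrix, batches, min_values=2):
    """Group features by the batches in which they have enough observed values.

    matrix:  dict mapping feature name -> list of values (None marks a missing value)
    batches: list of batch labels, one per sample column
    Returns a dict mapping a tuple of batch labels -> list of feature names.
    """
    batch_ids = sorted(set(batches))
    groups = defaultdict(list)
    for feature, values in matrix.items():
        present = tuple(
            b
            for b in batch_ids
            if sum(1 for v, vb in zip(values, batches) if vb == b and v is not None)
            >= min_values
        )
        groups[present].append(feature)
    return dict(groups)


# Toy data: four samples, two per batch (hypothetical values).
toy_batches = ["A", "A", "B", "B"]
toy_matrix = {
    "P1": [1.0, 1.2, 2.0, 2.1],    # observed in both batches
    "P2": [1.0, 1.1, None, None],  # missing entirely in batch B
    "P3": [None, 0.9, 3.0, 3.2],   # too few values in batch A
}
sub_matrices = dissect_by_batch_presence(toy_matrix, toy_batches)
```

    Under this reading, each group shares one missing-value pattern at batch level, so a correction method that requires complete batches could be run on the group restricted to its listed batches.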

  14. Additional file 1 of Conceptual design of a generic data harmonization...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Feb 27, 2024
    Cite
    Zoch, Michele; Peng, Yuan; Reinecke, Ines; Henke, Elisa; Sedlmayr, Martin; Bathelt, Franziska (2024). Additional file 1 of Conceptual design of a generic data harmonization process for OMOP common data model [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001502363
    Dataset updated
    Feb 27, 2024
    Authors
    Zoch, Michele; Peng, Yuan; Reinecke, Ines; Henke, Elisa; Sedlmayr, Martin; Bathelt, Franziska
    Description

    A detailed overview of the results of the literature search, including the data extraction matrix, can be found in Additional file 1.

  15. Data from: LUH2-GCB2019: Land-Use Harmonization 2 Update for the Global...

    • catalog.data.gov
    • data.nasa.gov
    • +3more
    Updated Sep 19, 2025
    Cite
    ORNL_DAAC (2025). LUH2-GCB2019: Land-Use Harmonization 2 Update for the Global Carbon Budget, 850-2019 [Dataset]. https://catalog.data.gov/dataset/luh2-gcb2019-land-use-harmonization-2-update-for-the-global-carbon-budget-850-2019-d4862
    Explore at:
    Dataset updated
    Sep 19, 2025
    Dataset provided by
    Oak Ridge National Laboratory Distributed Active Archive Center
    Description

    This dataset, referred to as LUH2-GCB2019, includes 0.25-degree gridded, global maps of fractional land-use states, transitions, and management practices for the period 0850-2019. The LUH2-GCB2019 dataset is an update to the previous Land-Use Harmonization Version 2 (LUH2-GCB) datasets prepared as required input to land models in the annual Global Carbon Budget (GCB) assessments, including land-use change data relating to agricultural expansion, deforestation, wood harvesting, shifting cultivation, afforestation, and crop rotations. Compared with previous LUH2-GCB datasets, the LUH2-GCB2019 takes advantage of new data inputs that corrected cropland and grazing areas in the globally important region of Brazil, as far back as 1950. LUH2-GCB datasets are used by bookkeeping models and Dynamic Global Vegetation Models (DGVMs) for the GCB.

  16. Predictor variables used in analysis and the methods used to harmonize to...

    • plos.figshare.com
    xls
    Updated Apr 23, 2025
    Cite
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan (2025). Predictor variables used in analysis and the methods used to harmonize to the categorical variables. [Dataset]. http://doi.org/10.1371/journal.pone.0309572.t003
    Explore at:
    Available download formats: xls
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Predictor variables used in analysis and the methods used to harmonize to the categorical variables.

  17. Citation Trends for "Promoting data harmonization to evaluate vaccine...

    • shibatadb.com
    Updated Oct 22, 2022
    Cite
    Yubetsu (2022). Citation Trends for "Promoting data harmonization to evaluate vaccine hesitancy in LMICs: approach and applications" [Dataset]. https://www.shibatadb.com/article/nCj3w3fn
    Explore at:
    Dataset updated
    Oct 22, 2022
    Dataset authored and provided by
    Yubetsu
    License

    https://www.shibatadb.com/license/data/proprietary/v1.0/license.txt

    Time period covered
    2025
    Variables measured
    New Citations per Year
    Description

    Yearly citation counts for the publication titled "Promoting data harmonization to evaluate vaccine hesitancy in LMICs: approach and applications".

  18. Data from: SOils DAta Harmonization database (SoDaH): an open-source...

    • search.dataone.org
    • portal.edirepository.org
    Updated Jul 15, 2020
    Cite
    William R Wieder; Derek Pierson; Stevan R Earl; Kate Lajtha; Sara Baer; Ford Ballantyne; Asmeret A Berhe; Sharon Billings; Laurel M Brigham; Stephany S Chacon; Jennifer Fraterrigo; Serita D Frey; Katerina Georgiou; Marie-Anne de Graaff; A S Grandy; Melannie D Hartman; Sarah E Hobbie; Chris Johnson; Jason Kaye; Emily Snowman; Marcy E Litvak; Michelle C Mack; Avni Malhotra; Jessica A M Moore; Knute Nadelhoffer; Craig Rasmussen; Whendee L Silver; Benjamin N Sulman; Xanthe Walker; Samantha Weintraub (2020). SOils DAta Harmonization database (SoDaH): an open-source synthesis of soil data from research networks [Dataset]. https://search.dataone.org/view/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fmetadata%2Feml%2Fedi%2F521%2F1
    Explore at:
    Dataset updated
    Jul 15, 2020
    Dataset provided by
    Environmental Data Initiative
    Authors
    William R Wieder; Derek Pierson; Stevan R Earl; Kate Lajtha; Sara Baer; Ford Ballantyne; Asmeret A Berhe; Sharon Billings; Laurel M Brigham; Stephany S Chacon; Jennifer Fraterrigo; Serita D Frey; Katerina Georgiou; Marie-Anne de Graaff; A S Grandy; Melannie D Hartman; Sarah E Hobbie; Chris Johnson; Jason Kaye; Emily Snowman; Marcy E Litvak; Michelle C Mack; Avni Malhotra; Jessica A M Moore; Knute Nadelhoffer; Craig Rasmussen; Whendee L Silver; Benjamin N Sulman; Xanthe Walker; Samantha Weintraub
    Variables measured
    K, Ca, L1, L2, L3, L4, L5, Mg, Na, bs, and 147 more
    Description

    This SOils DAta Harmonization (SoDaH) database is designed to bring together soil carbon data from diverse research networks into a harmonized dataset that can be used for synthesis activities and model development. The research network sources for SoDaH span different biomes and climates, encompass multiple ecosystem types, and have collected data across a range of spatial, temporal, and depth gradients. The rich data sets assembled in SoDaH consist of observations from monitoring efforts and long-term ecological experiments. The SoDaH database also incorporates related environmental covariate data pertaining to climate, vegetation, soil chemistry, and soil physical properties. The data are harmonized and aggregated using open-source code that enables a scripted, repeatable approach for soil data synthesis.

  19. Harmonized Database of Forcibly Displaced Populations and Their Hosts...

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Nov 15, 2023
    Cite
    Poverty and Equity Global Practice (2023). Harmonized Database of Forcibly Displaced Populations and Their Hosts 2015-2020 - Ecuador, Peru, Niger...and 7 more [Dataset]. https://microdata.worldbank.org/index.php/catalog/6104
    Explore at:
    Dataset updated
    Nov 15, 2023
    Dataset authored and provided by
    Poverty and Equity Global Practice
    Time period covered
    2015 - 2020
    Area covered
    Niger
    Description

    Abstract

    This multi-country harmonized dataset concerning forcibly displaced populations (FDPs) and their host communities was produced by the World Bank’s Poverty and Equity Global Practice. It incorporates representative surveys conducted in 10 countries across five regions that hosted FDPs in the period 2015 to 2020. The goal of this harmonization exercise is to provide researchers and policymakers with a valuable input for comparative analyses of forced displacement across key developing country settings.

    Geographic coverage

    The datasets included in the harmonization effort cover key recent displacement contexts: the Venezuelan influx in Latin America’s Andean states; the Syrian crisis in the Mashreq; the Rohingya displacement in Bangladesh; and forcible displacement in Sub-Saharan Africa (Sahel and East Africa). The harmonization exercise encompasses 10 different surveys. These include nationally representative surveys with a separate representative stratum for displaced populations; sub-national representative surveys covering displaced populations and their host communities; and surveys designed specifically to provide insights on displacement contexts. Most of the surveys were collected between 2015 and 2020.

    Analysis unit

    Household

    Universe

    Forcibly displaced populations and their host communities.

    Kind of data

    Sample survey data [ssd]

    Mode of data collection

    Computer Assisted Personal Interview [capi]

  20. Religion data harmonization scoping review

    • osf.io
    Updated Apr 17, 2025
    Cite
    Nicholas Gibson; Dominic Johnson; Hillary Lenfesty (2025). Religion data harmonization scoping review [Dataset]. https://osf.io/qnysz
    Explore at:
    Dataset updated
    Apr 17, 2025
    Dataset provided by
    Center for Open Science (https://cos.io/)
    Authors
    Nicholas Gibson; Dominic Johnson; Hillary Lenfesty
    Description

    No description was included in this dataset collected from the OSF.
