100+ datasets found
  1. g

    Data Harmonization Procedures

    • gimi9.com
    • data.virginia.gov
    • +1more
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Data Harmonization Procedures [Dataset]. https://gimi9.com/dataset/data-gov_data-harmonization-procedures/
    Explore at:
    Dataset updated
    Sep 6, 2025
    Description

    ACF Agency Wide resource Metadata-only record linking to the original dataset. Open original dataset below.

  2. Data from: Improved Wetland Soil Organic Carbon Stocks of the Conterminous...

    • catalog.data.gov
    • datasets.ai
    Updated Nov 18, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2022). Improved Wetland Soil Organic Carbon Stocks of the Conterminous U.S. Through Data Harmonization [Dataset]. https://catalog.data.gov/dataset/improved-wetland-soil-organic-carbon-stocks-of-the-conterminous-u-s-through-data-harmoniza
    Explore at:
    Dataset updated
    Nov 18, 2022
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Area covered
    Contiguous United States, United States
    Description

    Public data used for data harmonization. This dataset is associated with the following publication: Uhran, B., L. Windham-Myers, N. Bliss, A. Nahlik, E. Sundquist, and C. Stagg. Improved Wetland Soil Organic Carbon Stocks of the Conterminous U.S. Through Data Harmonization. Frontiers in Soil Science. Frontiers, Lausanne, SWITZERLAND, 1: 706701, (2021).

  3. D

    Multi-Omics Clinical Data Harmonization Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Multi-Omics Clinical Data Harmonization Market Research Report 2033 [Dataset]. https://dataintelo.com/report/multi-omics-clinical-data-harmonization-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Multi-Omics Clinical Data Harmonization Market Outlook



    According to our latest research, the global Multi-Omics Clinical Data Harmonization market size reached USD 1.65 billion in 2024, reflecting robust adoption across healthcare and life sciences. With a strong compound annual growth rate (CAGR) of 14.2% projected from 2025 to 2033, the market is anticipated to reach USD 4.65 billion by 2033. This growth is primarily driven by the escalating integration of multi-omics approaches in clinical research, the increasing demand for personalized medicine, and the urgent need to standardize complex biological data for actionable insights. As per our latest research, the market's expansion is underpinned by technological advancements and the broadening scope of omics-based applications in diagnostics and therapeutics.




    The rapid growth of the Multi-Omics Clinical Data Harmonization market can be attributed to several key factors. One of the most significant drivers is the exponential increase in biological data generated from next-generation sequencing and other high-throughput omics platforms. As researchers and clinicians seek to unravel the complexities of human health and disease, the need to integrate and harmonize disparate data types—such as genomics, proteomics, metabolomics, and transcriptomics—has become paramount. This harmonization enables a more comprehensive understanding of disease mechanisms, facilitating the identification of novel biomarkers and therapeutic targets. Moreover, regulatory bodies and funding agencies are increasingly emphasizing data standardization and interoperability, further fueling demand for robust harmonization solutions.




    Another major growth factor is the accelerating adoption of precision medicine initiatives worldwide. The shift from one-size-fits-all therapies to tailored treatment regimens necessitates the integration of multi-omics data with clinical and phenotypic information. Harmonized data platforms empower clinicians and researchers to draw meaningful correlations between omics signatures and patient outcomes, thereby enhancing diagnostic accuracy and enabling the development of personalized therapeutic strategies. Pharmaceutical and biotechnology companies, in particular, are leveraging multi-omics harmonization to streamline drug discovery pipelines, improve patient stratification, and optimize clinical trial designs, contributing to significant market growth.




    Technological innovation plays a central role in propelling the Multi-Omics Clinical Data Harmonization market forward. Advances in artificial intelligence, machine learning, and cloud computing have revolutionized the way multi-omics data is processed, integrated, and analyzed. Sophisticated software platforms now offer automated data curation, normalization, and annotation, reducing manual errors and accelerating research timelines. Additionally, collaborative efforts between academic institutions, healthcare providers, and industry stakeholders have led to the establishment of large-scale multi-omics databases and consortia, further driving market expansion. The growing focus on data privacy, security, and regulatory compliance also shapes market dynamics, prompting continuous innovation in harmonization technologies.




    Regionally, North America remains the dominant force in the Multi-Omics Clinical Data Harmonization market, accounting for the largest share in 2024. The region's leadership is attributed to its advanced healthcare infrastructure, significant investments in omics research, and a strong presence of key market players. Europe follows closely, leveraging robust public-private partnerships and supportive regulatory frameworks. Meanwhile, the Asia Pacific region is witnessing the fastest growth, fueled by increasing government initiatives, expanding healthcare access, and rising awareness of precision medicine. Latin America and the Middle East & Africa, though currently smaller markets, are expected to demonstrate steady growth as they enhance their research capabilities and digital health ecosystems.



    Solution Analysis



    The Solution segment of the Multi-Omics Clinical Data Harmonization market is bifurcated into software and services, each playing a pivotal role in enabling seamless integration and analysis of diverse omics datasets. Software solutions encompass a wide range of platforms and tools designed to automate data normalization, annotation, and integ

  4. Harmonized LUCAS dataset (ST_LUCAS)

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Martin Landa; Martin Landa; Lukáš Brodský; Lukáš Brodský; Tomáš Bouček; Tomáš Bouček; Lena Halounová; Lena Halounová; Ondřej Pešek; Ondřej Pešek (2025). Harmonized LUCAS dataset (ST_LUCAS) [Dataset]. http://doi.org/10.5281/zenodo.15349226
    Explore at:
    binAvailable download formats
    Dataset updated
    May 6, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Martin Landa; Martin Landa; Lukáš Brodský; Lukáš Brodský; Tomáš Bouček; Tomáš Bouček; Lena Halounová; Lena Halounová; Ondřej Pešek; Ondřej Pešek
    Description

    ST_LUCAS is a harmonized dataset derived from the LUCAS (Land Use and Coverage Area frame Survey) dataset. LUCAS is an Eurostat activity that has performed repeated in situ surveys over Europe every three years since 2006. Original LUCAS data (https://ec.europa.eu/eurostat/web/lucas/data) starting with the 2006 survey were harmonized into common nomenclature based on the 2018 survey. ST_LUCAS dataset is provided in two versions:

    • lucas_points: each LUCAS survey is represented by single record

    • lucas_st_points: each LUCAS point is represented by a single location calculated from multiple surveys and by a set of harmonized attributes for each survey year

    Harmonization and space-aggregation of LUCAS data were performed by ST_LUCAS system available from https://geoforall.fsv.cvut.cz/st_lucas. The methodology is described in Landa, M.; Brodský, L.; Halounová, L.; Bouček, T.; Pešek, O. Open Geospatial System for LUCAS In Situ Data Harmonization and Distribution. ISPRS Int. J. Geo-Inf. 2022, 11, 361. https://doi.org/10.3390/ijgi11070361.

    List of harmonized LUCAS attributes: https://geoforall.fsv.cvut.cz/st_lucas/tables/list_of_attributes.html

    ST_LUCAS dataset is provided under the same conditions (“free of charge”) as the original LUCAS data (https://ec.europa.eu/eurostat/web/lucas/data).

  5. G

    Multi-Omics Clinical Data Harmonization Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Multi-Omics Clinical Data Harmonization Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/multi-omics-clinical-data-harmonization-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Oct 7, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Multi-Omics Clinical Data Harmonization Market Outlook



    According to the latest research conducted in 2025, the global Multi-Omics Clinical Data Harmonization market size stands at USD 1.47 billion in 2024. The market is experiencing robust momentum, driven by technological advancements and the growing adoption of precision medicine. With a recorded CAGR of 13.6%, the market is projected to reach USD 4.22 billion by 2033. This substantial growth is primarily fueled by the increasing integration of multi-omics datasets in clinical research and diagnostics, which is enabling more comprehensive and actionable insights into complex diseases and therapeutic responses.




    The primary growth factor propelling the Multi-Omics Clinical Data Harmonization market is the escalating demand for personalized and precision medicine. As healthcare systems globally shift towards individualized treatment regimens, the necessity to harmonize and integrate diverse omics datasets—such as genomics, proteomics, metabolomics, and transcriptomics—has become paramount. These integrated data solutions facilitate a holistic understanding of disease mechanisms, improve diagnostic accuracy, and enable the development of targeted therapies. The proliferation of next-generation sequencing technologies, coupled with the decreasing cost of omics profiling, has further democratized access to multi-omics data, thereby accelerating its utilization across clinical and research settings.




    Another significant driver is the rapid digitization of healthcare and the growing emphasis on interoperability and data standardization. The harmonization of multi-omics clinical data addresses critical challenges related to data silos, heterogeneity, and lack of standardized formats. Advanced data harmonization platforms are leveraging artificial intelligence and machine learning to automate the integration and curation of large-scale omics datasets, ensuring data quality, consistency, and compliance with regulatory standards. This technological evolution is not only enhancing the efficiency of clinical workflows but also fostering collaborations among pharmaceutical companies, research institutions, and healthcare providers.




    Furthermore, the rising investments from both public and private sectors in biomedical research are playing a pivotal role in market expansion. Governments and funding agencies worldwide are supporting large-scale multi-omics projects aimed at deciphering the molecular underpinnings of complex diseases such as cancer, neurodegenerative disorders, and rare genetic conditions. These initiatives are generating vast amounts of clinical omics data that require robust harmonization solutions for effective utilization. Additionally, the growing prevalence of chronic diseases and the increasing adoption of electronic health records (EHRs) are amplifying the demand for integrated data management platforms that can seamlessly harmonize clinical and omics datasets for improved patient outcomes.




    Regionally, North America continues to dominate the Multi-Omics Clinical Data Harmonization market, accounting for the largest share in 2024, followed by Europe and Asia Pacific. The presence of leading biotechnology firms, advanced healthcare infrastructure, and strong government support for precision medicine initiatives have positioned North America at the forefront of innovation. Meanwhile, Asia Pacific is emerging as a high-growth region, driven by expanding research capabilities, rising healthcare expenditures, and increasing adoption of multi-omics technologies in countries like China, Japan, and India. Europe also maintains a significant market presence, supported by collaborative research networks and robust regulatory frameworks for data standardization and interoperability.





    Omics Type Analysis



    The Omics Type segment of the Multi-Omics Clinical Data Harmonization market encompasses genomics, proteomics, transcriptomics, metabolomics, epigenomics, and other emerging omics disciplines. Among these, genomics

  6. ACF NIEM Human Services Domain Data Harmonization Process

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Administration for Children and Families (2025). ACF NIEM Human Services Domain Data Harmonization Process [Dataset]. https://catalog.data.gov/dataset/acf-niem-human-services-domain-data-harmonization-process
    Explore at:
    Dataset updated
    Sep 8, 2025
    Dataset provided by
    Administration for Children and Families
    Description

    ACF Agency Wide resource Metadata-only record linking to the original dataset. Open original dataset below.

  7. G

    EO Data Harmonization Pipelines Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Oct 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). EO Data Harmonization Pipelines Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/eo-data-harmonization-pipelines-market
    Explore at:
    pdf, pptx, csvAvailable download formats
    Dataset updated
    Oct 4, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EO Data Harmonization Pipelines Market Outlook



    According to our latest research, the EO Data Harmonization Pipelines market size globally reached USD 1.94 billion in 2024, and is projected to grow at a robust CAGR of 13.2% from 2025 to 2033, culminating in a forecasted market value of USD 5.62 billion by 2033. This dynamic growth is primarily attributed to the surging demand for integrated Earth Observation (EO) data across diverse industries, driven by the need for accurate, real-time, and interoperable geospatial insights for decision-making. The market is experiencing significant advancements in data processing technologies and AI-driven harmonization tools, which are further propelling adoption rates on a global scale. As per our comprehensive analysis, the increasing complexity of EO data sources and the critical need for standardized, high-quality data pipelines remain pivotal growth factors shaping the future of this market.




    One of the primary growth drivers for the EO Data Harmonization Pipelines market is the exponential increase in the volume and variety of EO data generated by satellites, drones, and ground-based sensors. As governments, research institutions, and commercial enterprises deploy more sophisticated EO platforms, the diversity in data formats, resolutions, and temporal frequencies has created a pressing need for harmonization solutions. These pipelines enable seamless integration, cleansing, and transformation of disparate datasets, ensuring consistency and reliability in downstream analytics. The proliferation of AI and machine learning algorithms within these pipelines has further enhanced their ability to automate data normalization, anomaly detection, and metadata enrichment, resulting in more actionable and timely insights for end-users across sectors.




    Another significant factor contributing to market growth is the increasing adoption of EO data for environmental monitoring, agriculture, disaster management, and urban planning. Governments and private organizations are leveraging harmonized EO data to monitor deforestation, predict crop yields, assess disaster risks, and optimize urban infrastructure planning. The ability to harmonize multi-source data streams enables stakeholders to generate comprehensive, cross-temporal analyses that support sustainable development goals and climate resilience strategies. The integration of cloud-based platforms has democratized access to harmonized EO data, allowing even small and medium enterprises to leverage advanced geospatial analytics without substantial upfront investments in hardware or specialized personnel.




    Furthermore, the rising emphasis on interoperability and data sharing among international agencies, research institutions, and commercial providers is fueling the demand for robust EO data harmonization pipelines. Initiatives such as the Global Earth Observation System of Systems (GEOSS) and the European Copernicus program underscore the importance of standardized data frameworks for global collaboration. These trends are driving investments in open-source harmonization tools, API-driven architectures, and scalable cloud infrastructures that can support multi-stakeholder data exchange. As regulatory requirements for data quality and provenance intensify, organizations are increasingly prioritizing investments in harmonization technologies to ensure compliance and maintain competitive advantage in the rapidly evolving EO ecosystem.




    From a regional perspective, North America currently dominates the EO Data Harmonization Pipelines market, accounting for over 38% of the global market share in 2024, followed by Europe and Asia Pacific. The United States, in particular, benefits from a mature EO ecosystem, substantial government funding, and a vibrant commercial space sector. Europe’s growth is propelled by strong policy frameworks and cross-border collaborations, while Asia Pacific is rapidly emerging as a high-growth region, driven by increasing investments in satellite infrastructure and smart city initiatives. Latin America and the Middle East & Africa are also witnessing steady adoption, supported by international development programs and growing awareness of EO’s value in addressing regional challenges such as agriculture productivity and climate adaptation.



  8. D

    EO Data Harmonization Pipelines Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). EO Data Harmonization Pipelines Market Research Report 2033 [Dataset]. https://dataintelo.com/report/eo-data-harmonization-pipelines-market
    Explore at:
    pptx, pdf, csvAvailable download formats
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    EO Data Harmonization Pipelines Market Outlook



    According to our latest research, the global EO Data Harmonization Pipelines market size reached USD 2.17 billion in 2024, with a robust compound annual growth rate (CAGR) of 13.2% projected through the forecast period. By 2033, the market is expected to attain a value of USD 6.19 billion. This growth is primarily driven by the surging demand for integrated, high-quality Earth Observation (EO) data across various sectors, including environmental monitoring, agriculture, and urban planning, as organizations increasingly seek actionable insights from multi-source geospatial datasets.




    The exponential increase in the volume and diversity of EO data sources has emerged as a primary growth factor for the EO Data Harmonization Pipelines market. Organizations now rely on satellite imagery, aerial photographs, UAV data, and ground-based sensors to monitor and analyze dynamic terrestrial and atmospheric phenomena. However, the heterogeneity and varying formats of these datasets have posed significant challenges for seamless integration and analysis. The development and adoption of sophisticated EO data harmonization pipelines have become essential, enabling the conversion, standardization, and fusion of disparate data streams into coherent, analysis-ready datasets. This capability not only enhances the accuracy and reliability of downstream analytics but also accelerates decision-making processes in critical domains such as disaster management, climate change assessment, and precision agriculture.




    Another pivotal driver is the rapid technological advancement in cloud computing, artificial intelligence, and machine learning, which has revolutionized the EO data harmonization landscape. Cloud-based platforms now offer scalable, on-demand processing power, allowing for real-time harmonization of massive EO datasets. AI-powered algorithms automate data cleansing, normalization, and feature extraction, significantly reducing manual intervention and operational costs. These innovations have democratized access to EO data harmonization solutions, making them accessible to a broader spectrum of end-users, from government agencies and research institutes to commercial enterprises. The integration of these advanced technologies not only improves the efficiency of EO data pipelines but also opens new avenues for developing predictive models and geospatial intelligence solutions.




    The increasing focus on sustainability and environmental stewardship has further amplified the demand for EO data harmonization pipelines. Governments and international organizations are investing heavily in monitoring land use, water resources, and atmospheric conditions to meet regulatory requirements and inform policy decisions. Harmonized EO data enables comprehensive, cross-border analyses that are vital for addressing global challenges such as deforestation, urban sprawl, and natural disasters. As regulatory frameworks around data quality and interoperability become more stringent, organizations are compelled to invest in robust harmonization solutions to ensure compliance and maintain data integrity. This regulatory push, combined with growing public and private sector awareness of the value of harmonized EO data, is expected to sustain market growth over the coming decade.




    Regionally, North America and Europe continue to dominate the EO Data Harmonization Pipelines market, accounting for a combined market share of over 60% in 2024. The United States, in particular, benefits from a mature geospatial technology ecosystem and significant investments in satellite infrastructure. Meanwhile, the Asia Pacific region is witnessing the fastest growth, driven by expanding EO satellite programs in China, India, and Japan, coupled with increasing adoption of cloud-based geospatial solutions. Latin America and the Middle East & Africa are gradually emerging as promising markets, propelled by investments in environmental monitoring and disaster management initiatives. As these regions enhance their EO capabilities, the global market is poised for sustained expansion.



    Component Analysis



    The EO Data Harmonization Pipelines market by component is segmented into software, hardware, and services. Software solutions remain the largest segment, accounting for over 45% of the market share in 2024. These platforms are integral for the automated ingestion, normalization, and fusio

  9. f

    Description and harmonization strategy for the predictor variables.

    • figshare.com
    xlsx
    Updated Apr 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan (2025). Description and harmonization strategy for the predictor variables. [Dataset]. http://doi.org/10.1371/journal.pone.0309572.s001
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description and harmonization strategy for the predictor variables.

  10. f

    Predictor variables used in analysis and the methods used to harmonize to...

    • plos.figshare.com
    xls
    Updated Apr 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan (2025). Predictor variables used in analysis and the methods used to harmonize to the categorical variables. [Dataset]. http://doi.org/10.1371/journal.pone.0309572.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Apr 23, 2025
    Dataset provided by
    PLOS ONE
    Authors
    Xin Wu; Jeran Stratford; Karen Kesler; Cataia Ives; Tabitha Hendershot; Barbara Kroner; Ying Qin; Huaqin Pan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Predictor variables used in analysis and the methods used to harmonize to the categorical variables.

  11. e

    ComBat HarmonizR enables the integrated analysis of independently generated...

    • ebi.ac.uk
    Updated May 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hannah Voß (2022). ComBat HarmonizR enables the integrated analysis of independently generated proteomic datasets through data harmonization with appropriate handling of missing values [Dataset]. https://www.ebi.ac.uk/pride/archive/projects/PXD027467
    Explore at:
    Dataset updated
    May 23, 2022
    Authors
    Hannah Voß
    Variables measured
    Proteomics
    Description

    The integration of proteomic datasets, generated by non-cooperating laboratories using different LC-MS/MS setups can overcome limitations in statistically underpowered sample cohorts but has not been demonstrated to this day. In proteomics, differences in sample preservation and preparation strategies, chromatography and mass spectrometry approaches and the used quantification strategy distort protein abundance distributions in integrated datasets. The Removal of these technical batch effects requires setup-specific normalization and strategies that can deal with missing at random (MAR) and missing not at random (MNAR) type values at a time. Algorithms for batch effect removal, such as the ComBat-algorithm, commonly used for other omics types, disregard proteins with MNAR missing values and reduce the informational yield and the effect size for combined datasets significantly. Here, we present a strategy for data harmonization across different tissue preservation techniques, LC-MS/MS instrumentation setups and quantification approaches. To enable batch effect removal without the need for data reduction or error-prone imputation we developed an extension to the ComBat algorithm, ´ComBat HarmonizR, that performs data harmonization with appropriate handling of MAR and MNAR missing values by matrix dissection The ComBat HarmonizR based strategy enables the combined analysis of independently generated proteomic datasets for the first time. Furthermore, we found ComBat HarmonizR to be superior for removing batch effects between different Tandem Mass Tag (TMT)-plexes, compared to commonly used internal reference scaling (iRS). Due to the matrix dissection approach without the need of data imputation, the HarmonizR algorithm can be applied to any type of -omics data while assuring minimal data loss

  12. f

    Additional file 1 of Conceptual design of a generic data harmonization...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Feb 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zoch, Michele; Peng, Yuan; Reinecke, Ines; Henke, Elisa; Sedlmayr, Martin; Bathelt, Franziska (2024). Additional file 1 of Conceptual design of a generic data harmonization process for OMOP common data model [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001502363
    Explore at:
    Dataset updated
    Feb 27, 2024
    Authors
    Zoch, Michele; Peng, Yuan; Reinecke, Ines; Henke, Elisa; Sedlmayr, Martin; Bathelt, Franziska
    Description

    A detailed overview of the results of the literature search, including the data extraction matrix can be found in the Additional file 1.

  13. Data from: LUH2-GCB2019: Land-Use Harmonization 2 Update for the Global...

    • data.nasa.gov
    • cmr.earthdata.nasa.gov
    • +3more
    Updated Apr 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). LUH2-GCB2019: Land-Use Harmonization 2 Update for the Global Carbon Budget, 850-2019 [Dataset]. https://data.nasa.gov/dataset/luh2-gcb2019-land-use-harmonization-2-update-for-the-global-carbon-budget-850-2019-92ae8
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    This dataset, referred to as LUH2-GCB2019, includes 0.25-degree gridded, global maps of fractional land-use states, transitions, and management practices for the period 0850-2019. The LUH2-GCB2019 dataset is an update to the previous Land-Use Harmonization Version 2 (LUH2-GCB) datasets prepared as required input to land models in the annual Global Carbon Budget (GCB) assessments, including land-use change data relating to agricultural expansion, deforestation, wood harvesting, shifting cultivation, afforestation, and crop rotations. Compared with previous LUH2-GCB datasets, the LUH2-GCB2019 takes advantage of new data inputs that corrected cropland and grazing areas in the globally important region of Brazil, as far back as 1950. LUH2-GCB datasets are used by bookkeeping models and Dynamic Global Vegetation Models (DGVMs) for the GCB.

  14. s

    Citation Trends for "Promoting data harmonization to evaluate vaccine...

    • shibatadb.com
    Updated Oct 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yubetsu (2022). Citation Trends for "Promoting data harmonization to evaluate vaccine hesitancy in LMICs: approach and applications" [Dataset]. https://www.shibatadb.com/article/nCj3w3fn
    Explore at:
    Dataset updated
    Oct 22, 2022
    Dataset authored and provided by
    Yubetsu
    License

    https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt

    Time period covered
    2025
    Variables measured
    New Citations per Year
    Description

    Yearly citation counts for the publication titled "Promoting data harmonization to evaluate vaccine hesitancy in LMICs: approach and applications".

  15. e

    SOils DAta Harmonization database (SoDaH): an open-source synthesis of soil...

    • portal.edirepository.org
    • search.dataone.org
    csv
    Updated Jul 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    William Wieder; Derek Pierson; Stevan Earl; Kate Lajtha; Sara Baer; Ford Ballantyne; Asmeret Berhe; Sharon Billings; Laurel Brigham; Stephany Chacon; Jennifer Fraterrigo; Serita Frey; Katerina Georgiou; Marie-Anne de Graaff; A Grandy; Melannie Hartman; Sarah Hobbie; Chris Johnson; Jason Kaye; Emily Snowman; Marcy Litvak; Michelle Mack; Avni Malhotra; Jessica Moore; Knute Nadelhoffer; Craig Rasmussen; Whendee Silver; Benjamin Sulman; Xanthe Walker; Samantha Weintraub (2020). SOils DAta Harmonization database (SoDaH): an open-source synthesis of soil data from research networks [Dataset]. http://doi.org/10.6073/pasta/9733f6b6d2ffd12bf126dc36a763e0b4
    Explore at:
    csv(318301060 bytes)Available download formats
    Dataset updated
    Jul 15, 2020
    Dataset provided by
    EDI
    Authors
    William Wieder; Derek Pierson; Stevan Earl; Kate Lajtha; Sara Baer; Ford Ballantyne; Asmeret Berhe; Sharon Billings; Laurel Brigham; Stephany Chacon; Jennifer Fraterrigo; Serita Frey; Katerina Georgiou; Marie-Anne de Graaff; A Grandy; Melannie Hartman; Sarah Hobbie; Chris Johnson; Jason Kaye; Emily Snowman; Marcy Litvak; Michelle Mack; Avni Malhotra; Jessica Moore; Knute Nadelhoffer; Craig Rasmussen; Whendee Silver; Benjamin Sulman; Xanthe Walker; Samantha Weintraub
    Area covered
    Variables measured
    K, Ca, L1, L2, L3, L4, L5, Mg, Na, bs, and 147 more
    Description

    This SOils DAta Harmonization (SoDaH) database is designed to bring together soil carbon data from diverse research networks into a harmonized dataset that can be used for synthesis activities and model development. The research network sources for SoDaH span different biomes and climates, encompass multiple ecosystem types, and have collected data across a range of spatial, temporal, and depth gradients. The rich data sets assembled in SoDaH consist of observations from monitoring efforts and long-term ecological experiments. The SoDaH database also incorporates related environmental covariate data pertaining to climate, vegetation, soil chemistry, and soil physical properties. The data are harmonized and aggregated using open-source code that enables a scripted, repeatable approach for soil data synthesis.

  16. d

    SDR 2.0 Cotton File: Cumulative List of Variables in the Surveys of the SDR...

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Mar 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Powałko, Przemek (2024). SDR 2.0 Cotton File: Cumulative List of Variables in the Surveys of the SDR Database [Dataset]. http://doi.org/10.7910/DVN/6QBGNF
    Explore at:
    Dataset updated
    Mar 6, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Powałko, Przemek
    Time period covered
    Jan 1, 1966 - Jan 1, 2017
    Description

    SDR 2.0 Cotton File: Cumulative List of Variables in the Surveys of the SDR Database is a comprehensive data dictionary, in Microsoft Excel format. Its main purpose is to facilitate the overview of 88118 variables (i.e. variable names, values, and labels) available in the original (source) data files that we retrieved automatically for harmonization purposes in the SDR Project. Information in the Cotton File comes from 215 source data files that comprise ca. 3500 national surveys administered between 1966 and 2017 in 169 countries or territories, as part of 23 international survey projects. The COTTON FILE SDR2 is a product of the project Survey Data Recycling: New Analytic Framework, Integrated Database, and Tools for Cross-national Social, Behavioral and Economic Research, financed by the US National Science Foundation (PTE Federal award 1738502). We thank the Ohio State University and the Institute of Philosophy and Sociology, Polish Academy of Sciences, for organizational support.

  17. Z

    D3.3_20230919_ProbeField_Aligned_Spectra_V1

    • data.niaid.nih.gov
    Updated Sep 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Castaldi, Fabio (2024). D3.3_20230919_ProbeField_Aligned_Spectra_V1 [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_13757028
    Explore at:
    Dataset updated
    Sep 16, 2024
    Dataset provided by
    National Research Council
    Authors
    Castaldi, Fabio
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A dataset composed of 470 soil laboratory spectral data has been aligned using the white Lucky Bay sands as internal soil standard (ISS). Soil samples were collected in Sweden by SLU, in Italy by CNR, in France by INRAE and in Poland by IUNG, and the spectra were acquired in the lab on dry samples. Each partners scanned the ISS using the same instrument as the soil samples, this allowed to compute the correction factor for each instrument.

    Along with the main dataset, an explanation document and a R script are provided-

    The R script can be used to align spectral data acquired by using different nstruments. Within the provided file "CF_lb" you can find 5 correction factors for 5 instruments: The correction factors were computed using the Lucky bay spectra scanned by each of the 5 instrument (ISS) and the master lucky bay spectrum acquired in the CSIRO lab.

  18. PanTool – software for data harmonization and conversion, Version 1

    • doi.pangaea.de
    html, tsv
    Updated Aug 28, 2006
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rainer Sieger; Hannes Grobe (2006). PanTool – software for data harmonization and conversion, Version 1 [Dataset]. http://doi.org/10.1594/PANGAEA.510701
    Explore at:
    tsv, htmlAvailable download formats
    Dataset updated
    Aug 28, 2006
    Dataset provided by
    PANGAEA
    Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, Bremerhaven
    Authors
    Rainer Sieger; Hannes Grobe
    License

    https://www.gnu.org/licenses/gpl-3.0https://www.gnu.org/licenses/gpl-3.0

    Variables measured
    File size, File content, Uniform resource locator/link to file
    Description

    The program PanTool was developed as a tool box like a Swiss Army Knife for data conversion and recalculation, written to harmonize individual data collections to standard import format used by PANGAEA. The format of input files the program PanTool needs is a tabular saved in plain ASCII. The user can create this files with a spread sheet program like MS-Excel or with the system text editor. PanTool is distributed as freeware for the operating systems Microsoft Windows, Apple OS X and Linux.

  19. Z

    Harmonized Cultural Access & Participation Dataset for Music

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    Updated Jun 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Antal (2022). Harmonized Cultural Access & Participation Dataset for Music [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_5917741
    Explore at:
    Dataset updated
    Jun 4, 2022
    Authors
    Daniel Antal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Changes since the last version: in the .csv export there was a naming problem.

    • visit_concert: This is a standard CAP variables about visiting frequencies, in numeric form.
    • fct_visit_concert: This is a standard CAP variables about visiting frequencies, in categorical form.
    • is_visit_concert: binary variable, 0 if the person had not visited concerts in the previous 12 months.
    • artistic_activity_played_music: A variable of the frequency of playing music as an amateur or professional practice, in some surveys we have only a binary variable (played in the last 12 months or not) in other we have frequencies. We will convert this into a binary variable.
    • fct_artistic_activity_played_music: The artistic_activity_played_music in categorical representation.
    • artistic_activity_sung: A variable of the frequency of singing as an amateur or professional practice, like played_muisc. Because of the liturgical use of singing, and the differences of religious practices among countries and gender, this is a significantly different variable from played_music.
    • fct_artistic_activity_sung: The artistic_activity_sung variable in categorical representation.
    • age_exact: The respondent’s age as an integer number.
    • country_code: an ISO country code
    • geo: an ISO code that separates Germany to the former East and West Germany, and the United Kingdom to Great Britain and Northern Ireland, and Cyprus to Cyprus and the Turiksh Cypriot community.[we may leave Turkish Cyprus out for practical reasons.]
    • age_education: This is a harmonized education proxy. Because we work with the data of more than 30 countries, education levels are difficult to harmonize, and we use the Eurobarometer standard proxy, age of leaving education. It is a specially coded variable, and we will re-code them into two variables, age_education and is_student.
    • is_student: is a dummy variable for the special coding in age_education for “still studying”, i.e. the person does not have yet a school leaving age. It would be tempting to impute age in this case to age_education, but we will show why this is not a good strategy.
    • w, w1: Post-stratification weights for the 15+ years old population of each country. Use w1 for averages of geo entities treating Northern Ireland, Great Britain, the United Kingdom, the former GDR, the former West Germany, and Germany as geographical areas. Use w when treating the United Kingdom and Germany as one territory.
    • wex: Projected weight variable. For weighted average values, use w, w1, for projections on the population size, i.e., use with sums, use wex.
    • id: The identifier of the original survey.
    • rowid`: A new unique identifier that is unique in all harmonized surveys, i.e., remains unique in the harmonized dataset.
  20. Additional file 1 of Common data models and data standards for tabular...

    • springernature.figshare.com
    zip
    Updated Nov 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Melissa Finster; Markus Wenzel; Elham Taghizadeh (2025). Additional file 1 of Common data models and data standards for tabular health data: a systematic review [Dataset]. http://doi.org/10.6084/m9.figshare.30616392.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 14, 2025
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Melissa Finster; Markus Wenzel; Elham Taghizadeh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary material 1

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2025). Data Harmonization Procedures [Dataset]. https://gimi9.com/dataset/data-gov_data-harmonization-procedures/

Data Harmonization Procedures

Explore at:
138 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Sep 6, 2025
Description

ACF Agency Wide resource Metadata-only record linking to the original dataset. Open original dataset below.

Search
Clear search
Close search
Google apps
Main menu