37 datasets found
  1. f

    Table 6 -

    • plos.figshare.com
    xls
    Updated Dec 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu (2024). Table 6 - [Dataset]. http://doi.org/10.1371/journal.pone.0288300.t006
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 31, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Multicenter and multi-scanner imaging studies may be necessary to ensure sufficiently large sample sizes for developing accurate predictive models. However, multicenter studies, incorporating varying research participant characteristics, MRI scanners, and imaging acquisition protocols, may introduce confounding factors, potentially hindering the creation of generalizable machine learning models. Models developed using one dataset may not readily apply to another, emphasizing the importance of classification model generalizability in multi-scanner and multicenter studies for producing reproducible results. This study focuses on enhancing generalizability in classifying individual migraine patients and healthy controls using brain MRI data through a data harmonization strategy. We propose identifying a ’healthy core’—a group of homogeneous healthy controls with similar characteristics—from multicenter studies. The Maximum Mean Discrepancy (MMD) in Geodesic Flow Kernel (GFK) space is employed to compare two datasets, capturing data variabilities and facilitating the identification of this ‘healthy core’. Homogeneous healthy controls play a vital role in mitigating unwanted heterogeneity, enabling the development of highly accurate classification models with improved performance on new datasets. Extensive experimental results underscore the benefits of leveraging a ’healthy core’. We utilized two datasets: one comprising 120 individuals (66 with migraine and 54 healthy controls), and another comprising 76 individuals (34 with migraine and 42 healthy controls). Notably, a homogeneous dataset derived from a cohort of healthy controls yielded a significant 25% accuracy improvement for both episodic and chronic migraineurs.

  2. Data from: Integrated Approach to Global Land Use and Land Cover Reference...

    • zenodo.org
    bin, zip
    Updated Oct 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bernard Silva de Oliveira; Bernard Silva de Oliveira; Nathália Monteiro Teles; Vinícius Vieira Mesquita; Leandro Leal Parente; Laerte Guimarães Ferreira; Nathália Monteiro Teles; Vinícius Vieira Mesquita; Leandro Leal Parente; Laerte Guimarães Ferreira (2024). Integrated Approach to Global Land Use and Land Cover Reference Data Harmonization [Dataset]. http://doi.org/10.5281/zenodo.11285561
    Explore at:
    bin, zipAvailable download formats
    Dataset updated
    Oct 18, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bernard Silva de Oliveira; Bernard Silva de Oliveira; Nathália Monteiro Teles; Vinícius Vieira Mesquita; Leandro Leal Parente; Laerte Guimarães Ferreira; Nathália Monteiro Teles; Vinícius Vieira Mesquita; Leandro Leal Parente; Laerte Guimarães Ferreira
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    INTRODUCTION

    This document outlines the creation of a global inventory of reference samples and Earth Observation (EO) / gridded datasets for the Global Pasture Watch (GPW) initiative. This inventory supports the training and validation of machine-learning models for GPW grassland mapping. This documentation outlines methodology, data sources, workflow, and results.

    Keywords: Grassland, Land Use, Land Cover, Gridded Datasets, Harmonization

    OBJECTIVES

    • Create a global inventory of existing reference samples for land use and land cover (LULC);

    • Compile global EO / gridded datasets that capture LULC classes and harmonize them to match the GPW classes;

    • Develop automated scripts for data harmonization and integration.

    DATA COLLECTION

    Datasets incorporated:

    Datasets

    Spatial distribution

    Time periodNumber of individual samples
    WorldCerealGlobal2016-202138,267,911
    Global Land Cover Mapping and Estimation (GLanCE)Global1985-202131,061,694
    EuroCropsEurope2015-202214,742,648
    GeoWiki G-GLOPS training datasetGlobal202111,394,623
    MapBiomas BrazilBrazil1985-20183,234,370
    Land Use/Land Cover
    Area Frame Survey (LUCAS)
    Europe2006-20181,351,293
    Dynamic WorldGlobal2019-20201,249,983
    Land Change Monitoring,
    Assessment, and Projection (LCMap)
    U.S. (CONUS)1984-2018874,836
    GeoWiki 2012Global2011-2012151,942
    PREDICTSGlobal1984-201316,627
    CropHarvestGlobal2018-20219,714

    Total: 102,355,642 samples

    WORKFLOW

    Harmonization Process

    We harmonized global reference samples and EO/gridded datasets to align with GPW classes, optimizing their integration into the GPW machine-learning workflow.

    We considered reference samples derived by visual interpretation with spatial support of at least 30 m (Landsat and Sentinel), that could represent LULC classes for a point or region.

    Each dataset was processed using automated Python scripts to download vector files and convert the original LULC classes into the following GPW classes:

    0. Other land cover

    1. Natural and Semi-natural grassland

    2. Cultivated grassland

    3. Crops and other related agricultural practices

    We empirically assigned a weight to each sample based on the original dataset's class description, reflecting the level of mixture within the class. The weights range from 1 (Low) to 3 (High), with higher weights indicating greater mixture. Samples with low mixture levels are more accurate and effective for differentiating typologies and for validation purposes.

    The harmonized dataset includes these columns:

    Attribute NameDefinition
    dataset_nameOriginal dataset name
    reference_yearReference year of samples from the original dataset
    original_lulc_classLULC class from the original dataset
    gpw_lulc_classGlobal Pasture Watch LULC class
    sample_weightSample's weight based on the mixture level within the original LULC class

    ACKNOWLEDGMENTS

    The development of this global inventory of reference samples and EO/gridded datasets relied on valuable contributions from various sources. We would like to express our sincere gratitude to the creators and maintainers of all datasets used in this project.

    REFERENCES

    • Brown, C.F., Brumby, S.P., Guzder-Williams, B. et al. Dynamic World, Near real-time global 10 m land use land cover mapping. Sci Data 9, 251 (2022). https://doi.org/10.1038/s41597-022-01307-4Van Tricht, K. et al. Worldcereal: a dynamic open-source system for global-scale, seasonal, and reproducible crop and irrigation mapping. Earth Syst. Sci. Data 15, 5491–5515, 10.5194/essd-15-5491-2023 (2023)

    • Buchhorn, M.; Smets, B.; Bertels, L.; De Roo, B.; Lesiv, M.; Tsendbazar, N.E., Linlin, L., Tarko, A. (2020): Copernicus Global Land Service: Land Cover 100m: Version 3 Globe 2015-2019: Product User Manual; Zenodo, Geneve, Switzerland, September 2020; doi: 10.5281/zenodo.3938963

    • d’Andrimont, R. et al. Harmonised lucas in-situ land cover and use database for field surveys from 2006 to 2018 in the european union. Sci. data 7, 352, 10.1038/s41597-019-0340-y (2020)

    • Fritz, S. et al. Geo-Wiki: An online platform for improving global land cover, Environmental Modelling & Software, 31, https://doi.org/10.1016/j.envsoft.2011.11.015 (2012)

    • Fritz, S., See, L., Perger, C. et al. A global dataset of crowdsourced land cover and land use reference data. Sci Data 4, 170075 https://doi.org/10.1038/sdata.2017.75 (2017)

    • Schneider, M., Schelte, T., Schmitz, F. & Körner, M. Eurocrops: The largest harmonized open crop dataset across the european union. Sci. Data 10, 612, 10.1038/s41597-023-02517-0 (2023)

    • Souza, C. M. et al. Reconstructing Three Decades of Land Use and Land Cover Changes in Brazilian Biomes with Landsat Archive and Earth Engine. Remote. Sens. 12, 2735, 10.3390/rs12172735 (2020)

    • Stanimirova, R. et al. A global land cover training dataset from 1984 to 2020. Sci. Data 10, 879 (2023)

    • Stehman, S. V., Pengra, B. W., Horton, J. A. & Wellington, D. F. Validation of the us geological survey’s land change monitoring, assessment and projection (lcmap) collection 1.0 annual land cover products 1985–2017. Remot Sensing environment 265, 112646, 10.1016/j.rse.2021.112646 (2021).
    • Tsendbazar, N. et al. Product validation report (d12-pvr) v 1.1 (2021).

    • Tseng, G., Zvonkov, I., Nakalembe, C. L., & Kerner, H. (2021). CropHarvest: A global dataset for crop-type classification. In Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track.
  3. o

    Data from: HarDWR - Harmonized Water Rights Records

    • osti.gov
    Updated Oct 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Caccese, Robert; Fisher-Vanden, Karen; Fowler, Lara; Grogan, Danielle; Lammers, Richard; Lisk, Matthew; Olmstead, Sheila; Peklak, Darrah; Zheng, Jiameng; Zuidema, Shan (2024). HarDWR - Harmonized Water Rights Records [Dataset]. https://www.osti.gov/dataexplorer/biblio/dataset/2475306
    Explore at:
    Dataset updated
    Oct 31, 2024
    Dataset provided by
    USDOE Office of Science (SC), Biological and Environmental Research (BER)
    MultiSector Dynamics - Living, Intuitive, Value-adding, Environment
    Authors
    Caccese, Robert; Fisher-Vanden, Karen; Fowler, Lara; Grogan, Danielle; Lammers, Richard; Lisk, Matthew; Olmstead, Sheila; Peklak, Darrah; Zheng, Jiameng; Zuidema, Shan
    Description

    A dataset within the Harmonized Database of Western U.S. Water Rights (HarDWR). For a detailed description of the database, please see the meta-record v2.0. Changelog v2.0 - Recalculated based on data sourced from WestDAAT - Changed using a Site ID column to identify unique records to using aa combination of Site ID and Allocation ID - Removed the Water Management Area (WMA) column from the harmonized records. The replacement is a separate file which stores the relationship between allocations and WMAs. This allows for allocations to contribute to water right amounts to multiple WMAs during the subsequent cumulative process. - Added a column describing a water rights legal status - Added "Unspecified" was a water source category - Added an acre-foot (AF) column - Added a column for the classification of the right's owner v1.02 - Added a .RData file to the dataset as a convenience for anyone exploring our code. This is an internal file, and the one referenced in analysis scripts as the data objects are already in R data objects. v1.01 - Updated the names of each file with an ID number less than 3 digits to include leading 0s v1.0 - Initial public release Description Heremore » we present an updated database of Western U.S. water right records. This database provides consistent unique identifiers for each water right record, and a consistent categorization scheme that puts each water right record into one of seven broad use categories. These data were instrumental in conducting a study of the multi-sector dynamics of inter-sectoral water allocation changes though water markets (Grogan et al., in review). Specifically, the data were formatted for use as input to a process-based hydrologic model, Water Balance Model (WBM), with a water rights module (Grogan et al., in review). While this specific study motivated the development of the database presented here, water management in the U.S. West is a rich area of study (e.g., Anderson and Woosly, 2005; Tidwell, 2014; Null and Prudencio, 2016; Carney et al., 2021) so releasing this database publicly with documentation and usage notes will enable other researchers to do further work on water management in the U.S. West. We produced the water rights database presented here in four main steps: (1) data collection, (2) data quality control, (3) data harmonization, and (4) generation of cumulative water rights curves. Each of steps (1)-(3) had to be completed in order to produce (4), the final product that was used in the modeling exercise in Grogan et al. (in review). All data in each step is associated with a spatial unit called a Water Management Area (WMA), which is the unit of water right administration utilized by the state in which the right came from. Steps (2) and (3) required use to make assumptions and interpretation, and to remove records from the raw data collection. We describe each of these assumptions and interpretations below so that other researchers can choose to implement alternative assumptions an interpretation as fits their research aims. Motivation for Changing Data Sources The most significant change has been a switch from collecting the raw water rights directly from each state to using the water rights records presented in WestDAAT, a product of the Water Data Exchange (WaDE) Program under the Western States Water Council (WSWC). One of the main reasons for this is that each state of interest is a member of the WSWC, meaning that WaDE is partially funded by these states, as well as many universities. As WestDAAT is also a database with consistent categorization, it has allowed us to spend less time on data collection and quality control and more time on answering research questions. This has included records from water right sources we had previously not known about when creating v1.0 of this database. The only major downside to utilizing the WestDAAT records as our raw data is that further updates are tied to when WestDAAT is updated, as some states update their public water right records daily. However, as our focus is on cumulative water amounts at the regional scale, it is unlikely most records updates would have a significant effect on our results. The structure of WestDAAT led to several important changes to how HarWR is formatted. The most significant change is that WaDE has calculated a field known as SiteUUID, which is a unique identifier for the Point of Diversion (POD), or where the water is drawn from. This separate from AllocationNativeID, which is the identifier for the allocation of water, or the amount of water associated with the water right. It should be noted that it is possible for a single site to have multiple allocations associated with it and for an allocation to be able to be extracted from multiple sites. The site-allocation structure has allowed us to adapt a more consistent, and hopefully more realistic, approach in organizing the water right records than we had with HarDWR v1.0. This was incredibly helpful as the raw data from many states had multiple water uses within a single field within a single row of their raw data, and it was not always clear if the first water use was the most important, or simply first alphabetically. WestDAAT has already addressed this data quality issue. Furthermore, with v1.0, when there were multiple records with the same water right ID, we selected the largest volume or flow amount and disregarded the rest. As WestDAAT was already a common structure for disparate data formats, we were better able to identify sites with multiple allocations and, perhaps more importantly, allocations with multiple sites. This is particularly helpful when an allocation has sites which cross WMA boundaries, instead of just assigning the full water amount to a single WMA we are now able to divide the amount of water between the number of relevant WMAs. As it is now possible to identify allocations with water used in multiple WMAs, it is no longer practical to store this information within a single column. Instead the stAllocationToWMATab.csv file was created, which is an allocation by WMA matrix containing the percent Place of Use area overlap with each WMA. We then use this percentage to divide the allocation's flow amount between the given WMAs during the cumulation process to hopefully provide more realistic totals of water use in each area. However, not every state provides areas of water use, so like HarDWR v1.0, a hierarchical decision tree was used to assign each allocation to a WMA. First, if a WMA could be identified based on the allocation ID, then that WMA was used; typically, when available, this applied to the entire state and no further steps were needed. Second was the spatial analysis of Place of Use to WMAs. Third was a spatial analysis of the POD locations to WMAs, with the assumption that allocation's POD is within the WMA it should belong to; if an allocation still had multiple WMAs based on its POD locations, then the allocation's flow amount would be divided equally between all WMAs. The fourth, and final, process was to include water allocations which spatially fell outside of the state WMA boundaries. This could be due to several reasons, such as coordinate errors / imprecision in the POD location, imprecision in the WMA boundaries, or rights attached with features, such as a reservoir, which crosses state boundaries. To include these records, we decided for any POD which was within one kilometer of the state's edge would be assigned to the nearest WMA. Other Changes WestDAAT has Allowed In addition to a more nuanced and consistent method of assigning water right's data to WMAs, there are other benefits gained from using the WestDAAT dataset. Among those is a consistent categorization of a water right's legal status. In HarDWR v1.0, legal status was effectively ignored, which led to many valid concerns about the quality of the database related to the amounts of water the rights allowed to be claimed. The main issue was that rights with legal status' such as "application withdrawn", "non-active", or "cancelled" were included within HarDWR v1.0. These, and other water rights status' which were deemed to not be in use have been removed from this version of the database. Another major change has been the addition of the "unspecified water source category. This is water that can come from either surface water or groundwater, or the source of which is unknown. The addition of this source category brings the total number of categories to three. Due to reviewer feedback, we decided to add the acre-foot (AF) column so that the data may be more applicable to a wider audience. We added the ownerClassification column so that the data may be more applicable to a wider audience. File Descriptions The dataset is a series of various files organized by state sub-directories. In addition, each file begins with the state's name, in case the file is separate from its sub-directory for some reason. After the state name is the text which describes the contents of the file. Here is each file described in detail. Note that st is a placeholder for the state's name. stFullRecords_HarmonizedRights.csv: A file of the complete water records for each state. The column headers for each of this type of file are: state - The name of the state to which the allocations belong to. FIPS - The two digit numeric state ID code. siteID - The site location ID for POD locations. A site may have multiple allocations, which are the actual amount of water which can be drawn. In a simplified hypothetical, a farm stead may have an allocation for "irrigation" and an allocation for "domestic" water use, but the water is drawn from the same pumping equipment. It should be noted that many of the site ID appear to have been added by WaDE, and therefore may not be recognized by a given state's water rights database. allocationID - The allocation ID for the water right. For most states this is the water right ID, and what is

  4. f

    Data from: Calibration Using a Single-Point External Reference Material...

    • acs.figshare.com
    xlsx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lindsay K. Pino; Brian C. Searle; Eric L. Huang; William Stafford Noble; Andrew N. Hoofnagle; Michael J. MacCoss (2023). Calibration Using a Single-Point External Reference Material Harmonizes Quantitative Mass Spectrometry Proteomics Data between Platforms and Laboratories [Dataset]. http://doi.org/10.1021/acs.analchem.8b04581.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    ACS Publications
    Authors
    Lindsay K. Pino; Brian C. Searle; Eric L. Huang; William Stafford Noble; Andrew N. Hoofnagle; Michael J. MacCoss
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Mass spectrometry (MS) measurements are not inherently calibrated. Researchers use various calibration methods to assign meaning to arbitrary signal intensities and improve precision. Internal calibration (IC) methods use internal standards (IS) such as synthesized or recombinant proteins or peptides to calibrate MS measurements by comparing endogenous analyte signal to the signal from known IS concentrations spiked into the same sample. However, recent work suggests that using IS as IC introduces quantitative biases that affect comparison across studies because of the inability of IS to capture all sources of variation present throughout an MS workflow. Here, we describe a single-point external calibration strategy to calibrate signal intensity measurements to a common reference material, placing MS measurements on the same scale and harmonizing signal intensities between instruments, acquisition methods, and sites. We demonstrate data harmonization between laboratories and methodologies using this generalizable approach.

  5. D

    Linking past and present: augmenting historical municipality characteristics...

    • ssh.datastations.nl
    bin, bz2, ods, pdf +7
    Updated Nov 19, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    A. Ashkpour; A. Ashkpour (2017). Linking past and present: augmenting historical municipality characteristics through harmonization and linkage with contemporary data [Dataset]. http://doi.org/10.17026/DANS-ZMS-H2S6
    Explore at:
    bz2(2641), ttl(6370218), ttl(12010366), ods(230879), xls(31232), bz2(100583), bz2(17049), bz2(174075), bz2(82975), bz2(4298), ttl(1083508), ttl(4937266), ttl(1430689), bz2(104122), ods(165969), bz2(196951), bz2(6669), ttl(656504), bz2(208252), bz2(55583), bz2(69248), tsv(250777), ttl(6876761), xls(107520), ttl(15), bz2(10282), ttl(5858857), bz2(1089973), bz2(1639), xls(29184), ods(77981), ttl(1016324), ttl(24422097), ttl(8754131), ttl(112777), tsv(63511), bz2(135742), bz2(189107), ttl(525511), bz2(209548), bz2(217035), bz2(5657), bz2(11659), ttl(6621439), tsv(104), ttl(2656799), ods(212942), ttl(248825), pdf(611366), bz2(160846), bz2(158798), ttl(240463), ttl(6641113), bz2(7110), ttl(40347), tsv(46504), ttl(1320590), tsv(288750), bz2(140959), ods(227237), tsv(834046), ttl(6106788), tsv(191523), ttl(2704662), bz2(15172), bz2(66423), ttl(573517), tsv(651), bz2(15832), rdf(28966), ods(155102), ods(1009856), ods(105635), rdf(8901), ttl(75737), ttl(414712), tsv(426631), bz2(101234), xls(33280), ods(1105219), ods(118493), ttl(39767611), bz2(7587), tsv(21516), tsv(183020), ttl(3685288), bz2(169942), ods(103892), ttl(5464355), ods(231164), bz2(280081), ttl(1517247), ttl(2843675), bz2(50619), ods(108176), ttl(1395866), bz2(27054), bz2(92017), ttl(156644), xls(1489408), bz2(899939), ttl(92159), bz2(192471), bz2(21882), bz2(176464), ttl(2191693), bz2(160665), ods(73172), tsv(1321), ttl(479324), bz2(14684), bz2(173877), ttl(5201137), ttl(7016029), bz2(183137), bz2(2305), ods(92302), tsv(923741), ods(1148667), bz2(69892), ods(129369), ttl(9305150), tsv(611548), bz2(262786), bz2(3190), ttl(3555417), ttl(8786083), ttl(34269924), xls(956928), ttl(3357643), ods(1379158), ods(943172), bz2(6144), ttl(1779804), ods(112535), bin(2062), bz2(222746), tsv(338807), bz2(4770), ods(113420), ods(173912), ttl(567239), ods(264984), ttl(2917074), ttl(25167593), bz2(71255), ttl(295433), bz2(85886), ttl(899914), ttl(23994), ttl(3500591), ods(222960), zip(280433), tsv(261495), tsv(885), ods(207975), bz2(15105), xls(35840), ods(143242), ods(162665), bz2(55427), ttl(1372527), ods(182771), ttl(3985397), bz2(6743), bz2(7181), ods(120901), bz2(13315), ods(145095), bz2(257835), ods(189669), tsv(461486), ttl(386059), tsv(286208), bz2(236643), ttl(3947), ttl(7672201), ttl(2211858), ods(91183), tsv(5721), ttl(2336956), ods(186280), bz2(109505), ttl(5612800), bz2(124936), bz2(71287), bz2(8986), tsv(291518), xls(29696), ttl(265107), ttl(4262884), tsv(143821), tsv(17472), ttl(1619414), tsv(186830), ttl(11446334), ods(228893), bz2(73070), ods(83365), bz2(98251), ods(189774), tsv(120700), bz2(117391), ods(147402), ttl(6746190), ttl(54408853), bz2(148031), ttl(950581), bz2(93778), ttl(1077), tsv(923785), bz2(206883), bz2(1883), bz2(44216), tsv(291617), tsv(179717), bz2(3752), ods(172981), ttl(733174), ttl(3063699), bz2(7684), bz2(13325), ods(114435), ttl(4981946), ttl(3166411), bz2(7666), tsv(196488), ttl(5830596), ttl(1485263), ods(127259), ttl(4500804), ods(69768), bz2(13562), ods(86039), tsv(135), tsv(120160), bz2(80166), bz2(3515), tsv(3795), ttl(449957), ttl(8838782), bz2(9045), ttl(9059635), bin(87677), bz2(21304), tsv(236901), bz2(7310), bz2(6869), ods(109694), bz2(140588), bz2(1067342), bz2(33463), bz2(172361), ttl(1815465), ods(85762), bz2(4946), ods(174491), ttl(4011976), ods(115654), bz2(278996), tsv(186940), bz2(265333), bz2(70907), bz2(16651), ods(95964), bz2(62001), ods(210211), ods(148098), ttl(4923413), ttl(6123608), ods(142622), ttl(2536500), ods(160756), bz2(1059965), tsv(1981150), ttl(142935), pdf(58381), ods(530592), ttl(447797), ttl(361032), ttl(2716230), ttl(48842), xls(3568640), ods(1059469), ttl(2192863), ttl(9136788), ttl(1321891), ods(1063434), bz2(8155), ods(84630), tsv(169041), ttl(2912929), ttl(577264), bz2(16265), bz2(9194), ods(133438), ttl(3991490), ttl(1289251), ttl(281197), bz2(9645), bz2(186104), ttl(1177704), ttl(15076), ods(34088), ttl(285199), ods(167551), bz2(6516), bz2(3359), tsv(34087), ttl(42759), ods(198703), bz2(160072), ods(129035), ttl(4292950), ttl(7636975), bz2(2661942), ttl(2576515), ttl(1530097), tsv(133556), ttl(49415), ttl(1484543), ods(180217), ods(65672), tsv(49658), ttl(9145115), ttl(528051), tsv(153936), ttl(60276), ttl(33449), ttl(3520092), ttl(3973424), ttl(284798), ods(105120), ttl(810683), bz2(14634), ttl(387661), bz2(155453), bz2(149234), tsv(199237), tsv(288662), bz2(3230), ttl(7380384), bz2(71932), ttl(7917356), bz2(10255), ttl(4835353), tsv(49768), ttl(332674), bz2(73997), ttl(2612999), tsv(28139), ttl(31645641), bz2(5725), ods(246357), ttl(3752156), text/markdown(68), ttl(1287140), ods(186258), bz2(84649), ttl(500633), ods(77685), bz2(94184), ttl(684839), tsv(340147), bz2(4383), bz2(3325), bz2(7908), ttl(890288), ttl(3621521), ttl(24979), ttl(148251), bz2(25791), ttl(952285), ttl(2351165), ttl(8618085), tsv(2700), ttl(5468521), tsv(158811), ods(69678), bz2(150244), tsv(250649), txt(2043), ttl(775558), bz2(11823), bz2(95521), bz2(157538), tsv(107551), tsv(187654), bz2(77388), ttl(5951386), bz2(18439), bz2(17620), bz2(4360), ods(32742), ttl(6777472), ttl(3142066), tsv(469), ttl(854033), tsv(47601), bz2(160161), ttl(3812), ods(90091), bz2(5121), tsv(1282), ttl(4297437), bz2(125625), bz2(8356), ods(159112), bz2(189862), ttl(2020763), ttl(2292890), ttl(4624), ods(183602), ods(289147), tsv(32620), ttl(6902817), bz2(99033), ods(172814), tsv(385), txt(98496), ttl(5321724), tsv(2380), ttl(46439), ttl(7265734), tsv(30154), bz2(14997), bz2(174402), bz2(829588), ttl(530553), bz2(86804), bz2(7106), ttl(7564844), bz2(220856), ttl(2643924), ods(198529), tsv(177572), bz2(30604), ttl(3746691), ttl(8000164), bz2(8294), tsv(30610), ttl(2937717), bz2(81018), ods(117298), tsv(3064), bz2(114689), bz2(68357), bz2(126123), ttl(8129890), ttl(3967465), bz2(185604), ttl(264639), bz2(59), bz2(921775), ttl(6381473), bz2(17649), bz2(928), bz2(78829), bz2(185153), bz2(263403), tsv(130507), bz2(204579), ttl(798586), ttl(877012), ttl(42464302), bz2(153590), bz2(64453), ttl(2374147), tsv(50317), bz2(13782), ttl(150791), tsv(37226), ttl(163112), tsv(289559), bz2(276532), tsv(50319), ods(246478), ttl(1268674), ttl(5169361), ttl(5897222), bz2(1647), ttl(2595787), bz2(159307), bin(10), tsv(191521), bz2(200974), ttl(5868364), ttl(233693), tsv(5099), ttl(6704863), bz2(222822), bz2(5084), ttl(9306859), bz2(115149), ttl(580201), ttl(7248111), bz2(61045), ttl(5253), ttl(5039415)Available download formats
    Dataset updated
    Nov 19, 2017
    Dataset provided by
    DANS Data Station Social Sciences and Humanities
    Authors
    A. Ashkpour; A. Ashkpour
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Goal: Setting up a pipeline for extending, improving and visualizing time series of municipality characteristics by means of data harmonization and linkage of historical and contemporary dataseries using Linked Data technologies (RDF).This project focused on increasing the data availability, data quality and visualization of characteristics of Dutch municipalities for the period 1795-2010. We did so by (1) combining data from historical and contemporary time series, (2) evaluating and improving on the quality of these time series, and (3) extending the availability of NLGIS maps for the last two decades in order to visualize municipality characteristics for two centuries.

  6. d

    Multisource surface-water-quality data for the Delaware River Basin

    • catalog.data.gov
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Multisource surface-water-quality data for the Delaware River Basin [Dataset]. https://catalog.data.gov/dataset/multisource-surface-water-quality-data-for-the-delaware-river-basin
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    U.S. Geological Survey
    Area covered
    Delaware River
    Description

    Jointly managed by multiple states and the federal government, there are many ongoing efforts to characterize and understand water quality in the Delaware River Basin (DRB). Many State, Federal and non-profit organizations have collected surface-water-quality samples across the DRB for decades and many of these data are available through the National Water Quality Monitoring Council's Water Quality Portal (WQP). In this data release, WQP data in the DRB were harmonized, meaning that they were processed to create a clean and readily usable dataset. This harmonization processing included the synthesis of parameter names and fractions, the condensation of remarks and other data qualifiers, the resolution of duplicate records, an initial quality control check of the data, and other processing steps described in the metadata. This data set provides harmonized discrete multisource surface-water-quality data pulled from the WQP for nutrients, sediment, salinity, major ions, bacteria, temperature, dissolved oxygen, pH, and turbidity in the DRB, for all available years.

  7. f

    Average mean difference after mean and variance adjustments under random...

    • figshare.com
    xls
    Updated Dec 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu (2024). Average mean difference after mean and variance adjustments under random subsets within dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0288300.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 31, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Average mean difference after mean and variance adjustments under random subsets within dataset.

  8. i

    Employment and Unemployment Survey 2010, Economic Research Forum (ERF)...

    • catalog.ihsn.org
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Statistics (2019). Employment and Unemployment Survey 2010, Economic Research Forum (ERF) Harmonization Data - Jordan [Dataset]. https://catalog.ihsn.org/index.php/catalog/4074
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset provided by
    Department of Statistics
    Economic Research Forum
    Time period covered
    2010
    Area covered
    Jordan
    Description

    Abstract

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE DEPARTMENT OF STATISTICS OF THE HASHEMITE KINGDOM OF JORDAN

    The Department of Statistics (DOS) carried out four rounds of the 2010 Employment and Unemployment Survey (EUS). The survey rounds covered a total sample of about fifty three thousand households Nation-wide. The sampled households were selected using a stratified multi-stage cluster sampling design. It is noteworthy that the sample represents the national level (Kingdom), governorates, and the urban/rural areas.

    The importance of this survey lies in that it provides a comprehensive data base on employment and unemployment that serves decision makers, researchers as well as other parties concerned with policies related to the organization of the Jordanian labor market.

    It is worthy to mention that the DOS employed new technology in data collection and data processing. Data was collected using electronic questionnaire instead of a hard copy, namely a hand held device (PDA).

    The survey main objectives are:

    • To identify the demographic, social and economic characteristics of the population and manpower.
    • To identify the occupational structure and economic activity of the employed persons, as well as their employment status.
    • To identify the reasons behind the desire of the employed persons to search for a new or additional job.
    • To measure the economic activity participation rates (the number of economically active population divided by the population of 15+ years old).
    • To identify the different characteristics of the unemployed persons.
    • To measure unemployment rates (the number of unemployed persons divided by the number of economically active population of 15+ years old) according to the various characteristics of the unemployed, and the changes that might take place in this regard.
    • To identify the most important ways and means used by the unemployed persons to get a job, in addition to measuring durations of unemployment for such persons.
    • To identify the changes overtime that might take place regarding the above-mentioned variables.

    The raw survey data provided by the Statistical Agency were cleaned and harmonized by the Economic Research Forum, in the context of a major project that started in 2009. During which extensive efforts have been exerted to acquire, clean, harmonize, preserve and disseminate micro data of existing labor force surveys in several Arab countries.

    Geographic coverage

    Covering a sample representative on the national level (Kingdom), governorates, the three Regions (Central, North and South), and the urban/rural areas.

    Analysis unit

    1- Household/family. 2- Individual/person.

    Universe

    The survey covered a national sample of households and all individuals permanently residing in surveyed households.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE DEPARTMENT OF STATISTICS OF THE HASHEMITE KINGDOM OF JORDAN

    Survey Frame

    The sample of this survey is based on the frame provided by the data of the Population and Housing Census, 2004. The Kingdom was divided into strata, where each city with a population of 100,000 persons or more was considered as a large city. The total number of these cities is 6. Each governorate (except for the 6 large cities) was divided into rural and urban areas. The rest of the urban areas in each governorate was considered as an independent stratum. The same was applied to rural areas where it was considered as an independent stratum. The total number of strata was 30.

    In view of the existing significant variation in the socio-economic characteristics in large cities in particular and in urban in general, each stratum of the large cities and urban strata was divided into four sub-stratum according to the socio- economic characteristics provided by the population and housing census with the purpose of providing homogeneous strata.

    The frame excludes the population living in remote areas (most of whom are nomads), In addition to that, the frame does not include collective dwellings, such as hotels, hospitals, work camps, prisons and alike.

    Sample Design

    The sample of this survey was designed using the cluster stratified sampling method. It is representative at the Kingdom, rural and urban areas, regions and governorates levels. The Primary Sampling Units (clusters) were distributed to governorates, urban and rural areas and large cities in each governorate according to the weight of persons/households and according to the variance within each stratum. Slight modifications regarding the number of these units were made. The Primary Sampling Units (PSUs) were ordered within each stratum according to geographic characteristics and then according to socio-economic characteristics in order to ensure good spread of the sample. Then, the sample were selected on two stages. In the first stage, the PSUs were selected using the Probability Proportionate to Size with systematic selection procedure. The number of households, in each PSU served as its weight or size. In the second stage, the blocks of the PSUs which were selected in the first stage have been updated. Then a constant number of households was selected, using the random systematic sampling method as final PSUs from each PSU (cluster).

    Sampling notes

    It is noteworthy that the sample of the present survey does not represent the non-Jordanian population, due to the fact that it is based on households living in conventional dwellings. In other words, it does not cover the collective households living in collective dwellings. Therefore, the non-Jordanian households covered in the present survey are either private households or collective households living in conventional dwellings. In Jordan, it is well known that a large number of non-Jordanian workers live as groups and spend most of their time at workplaces. Hence, it is more unlikely to find them at their residences during daytime (i.e. the time when the data of the survey is collected). Furthermore, most of them live in their workplaces, such as: workshops, sales stores, guard places, or under construction building's sites. Such places are not classified as occupied dwellings for household sampling purposes. Due to all of the above, the coverage of such population would not be complete in household surveys.

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    The questionnaire was designed electronically on the PDA and revised by the DOS technical staff. It was finalized upon completion of the training program. The questionnaire is divided into main topics, each containing a clear and consistent group of questions, and designed in a way that facilitates the electronic data entry and verification. The questionnaire includes the characteristics of household members in addition to the identification information, which reflects the administrative as well as the statistical divisions of the Kingdom.

    Cleaning operations

    Raw Data

    PDAs were used to input and transfer data from the interviewees to the database. The plan of the tabulation of survey results was guided by former Employment and Unemployment Surveys which were previously prepared and tested. When all data processing procedures were completed, the actual survey results were tabulated using an ORACLE package. The tabulations were then thoroughly checked for consistency of data such as titles, inputs, concepts, as well as the figures. The final survey report was then prepared to include all detailed tabulations as well as the methodology of the survey.

    Harmonized Data

    • STATA package is used to clean and harmonize the datasets.
    • The harmonization process starts with a cleaning process for all raw data files received from the Statistical Agency.
    • All cleaned data files are then merged to produce one data file on the individual level containing all variables subject to harmonization.
    • A country-specific program is generated for each dataset to generate/ compute/ recode/ rename/ format/ label harmonized variables.
    • A post-harmonization cleaning process is then conducted on the data.
    • Harmonized data is saved on the household as well as the individual level, in STATA and then converted to SPSS, to be disseminated.
  9. E

    Data from: Integration and harmonization of trait data from plant...

    • live.european-language-grid.eu
    • zenodo.org
    • +1more
    csv
    Updated Dec 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Data from: Integration and harmonization of trait data from plant individuals across heterogeneous sources [Dataset]. https://live.european-language-grid.eu/catalogue/lcr/7662
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 13, 2023
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Trait data represent the basis for ecological and evolutionary research and have relevance for biodiversity conservation, ecosystem management and earth system modelling. The collection and mobilization of trait data has strongly increased over the last decade, but many trait databases still provide only species-level, aggregated trait values (e.g. ranges, means) and lack the direct observations on which those data are based. Thus, the vast majority of trait data measured directly from individuals remains hidden and highly heterogeneous, impeding their discoverability, semantic interoperability, digital accessibility and (re-)use. Here, we integrate quantitative measurements of verbatim trait information from plant individuals (e.g. lengths, widths, counts and angles of stems, leaves, fruits and inflorescence parts) from multiple sources such as field observations and herbarium collections. We develop a workflow to harmonize heterogeneous trait measurements (e.g. trait names and their values and units) as well as additional information related to taxonomy, measurement or fact and occurrence. This data integration and harmonization builds on vocabularies and terminology from existing metadata standards and ontologies such as the Ecological Trait-data Standard (ETS), the Darwin Core (DwC), the Thesaurus Of Plant characteristics (TOP) and the Plant Trait Ontology (TO). A metadata form filled out by data providers enables the automated integration of trait information from heterogeneous datasets. We illustrate our tools with data from palms (family Arecaceae), a globally distributed (pantropical), diverse plant family that is considered a good model system for understanding the ecology and evolution of tropical rainforests. We mobilize nearly 140,000 individual palm trait measurements in an interoperable format, identify semantic gaps in existing plant trait terminology and provide suggestions for the future development of a thesaurus of plant characteristics. Our work thereby promotes the semantic integration of plant trait data in a machine-readable way and shows how large amounts of small trait data sets and their metadata can be integrated into standardized data products.

  10. ENV-16 Harmonised Monitoring Scheme datasets

    • gov.uk
    Updated Oct 15, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Environment, Food & Rural Affairs (2014). ENV-16 Harmonised Monitoring Scheme datasets [Dataset]. https://www.gov.uk/government/statistical-data-sets/env-16-harmonised-monitoring-scheme-datasets
    Explore at:
    Dataset updated
    Oct 15, 2014
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Environment, Food & Rural Affairs
    Description

    Additional data sets on river water quality in Great Britain collected by the Environment Agency under the Harmonised Monitoring Scheme. This provides information about nutrient and heavy metal loads entering the marine environment and contributes to our commitment to report figures to the OSPAR Convention for the Protection of the North Atlantic. Full details available at: OSPAR - Riverine Inputs and Direct Discharges

    The full dataset will be available from the Environment Agency http://www.geostore.com/environment-agency/WebStore?xml=environment-agency/xml/dataLayers.xml" class="govuk-link">datashare site. From November 2013

    Release statement - Following a review by Defra and the Environment Agency on reducing the monitoring programme where it is not required under the present regulatory regime, it has been decided that the monitoring under the Harmonised Monitoring Scheme will be discontinued. For further information please contact enviro.statistics Inbox.

    https://assets.publishing.service.gov.uk/media/5a75ab19e5274a545822d4bb/DissolvedOxygen1980_2013.csv">Annual average, highest and lowest mean concentrations of Dissolved oxygen by region: 1980 to 2013, Great Britain

     <p class="gem-c-attachment_metadata"><span class="gem-c-attachment_attribute">MS Excel Spreadsheet</span>, <span class="gem-c-attachment_attribute">12.8 KB</span></p>
    
     <p class="gem-c-attachment_metadata"><a class="govuk-link" aria-label="View Annual average, highest and lowest mean concentrations of Dissolved oxygen by region: 1980 to 2013, Great Britain online" href="/media/5a75ab19e5274a545822d4bb/DissolvedOxygen1980_2013.csv/preview">View online</a></p>
    
    
    
     <p class="gem-c-attachment_metadata">This file may not be suitable for users of assistive technology.</p>
     <details data-module="ga4-event-tracker" data-ga4-event='{"event_name":"select_content","type":"detail","text":"Request an accessible format.","section":"Request an accessible format.","index_section":1}' class="gem-c-details govuk-details govuk-!-margin-bottom-0" title="Request an accessible format.">
    

    Request an accessible format.

      If you use assistive technology (such as a screen reader) and need a version of this document in a more accessible format, please email <a href="mailto:defra.helpline@defra.gov.uk" target="_blank" class="govuk-link">defra.helpline@defra.gov.uk</a>. Please tell us what format you need. It will help us if you say what assistive technology you use.
    

  11. f

    Codes that could not be found in the OMOP concept dictionary.

    • plos.figshare.com
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeffrey G. Klann; Matthew A. H. Joss; Kevin Embree; Shawn N. Murphy (2023). Codes that could not be found in the OMOP concept dictionary. [Dataset]. http://doi.org/10.1371/journal.pone.0212463.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Jeffrey G. Klann; Matthew A. H. Joss; Kevin Embree; Shawn N. Murphy
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Codes that could not be found in the OMOP concept dictionary.

  12. c

    Data from: A Multi-Centric Anthropomorphic 3D CT Phantom-Based Benchmark...

    • cancerimagingarchive.net
    dicom, n/a
    Updated May 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Cancer Imaging Archive (2025). A Multi-Centric Anthropomorphic 3D CT Phantom-Based Benchmark Dataset for Harmonization [Dataset]. http://doi.org/10.7937/m0pb-bh69
    Explore at:
    dicom, n/aAvailable download formats
    Dataset updated
    May 7, 2025
    Dataset authored and provided by
    The Cancer Imaging Archive
    License

    https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/https://www.cancerimagingarchive.net/data-usage-policies-and-restrictions/

    Time period covered
    Jun 9, 2025
    Dataset funded by
    National Cancer Institutehttp://www.cancer.gov/
    Description

    Abstract

    This collection introduces an open-source, anthropomorphic phantom-based dataset of CT scans for developing harmonization methods for deep learning based models. The phantom mimics human anatomy, allowing repeated scans without radiation delivery to real patients and isolating scanner effects by removing inter- and intra-patient variations. The dataset includes 268 image series from 13 scanners, 4 manufacturers, and 8 institutions, repeated 18-30 times at a 10 mGy dose using a harmonized protocol. An additional 1,378 image series were acquired with the same 13 scanners and harmonized protocol but including additional acquisition doses. The presented phantom scans consist of three compartments from thorax, liver and test patterns. The 3D-printed liver includes three types of abnormal regions of interest, including two cysts, a metastasis, and a hemangioma, with ground truth segmentation masks that could be used for classification and segmentation.

    Introduction

    Recent breakthroughs in data-driven algorithms and artificial intelligence (AI) applications in medical information processing have introduced tremendous potential for AI-assisted image-based personalized medicine that addresses tasks such as segmentation, diagnosis, and prognosis. However, these opportunities come with two challenges: large data requirements and consistency in data distribution. Machine and deep learning algorithms have extreme data demand, which is coupled with the high costs of data acquisition and annotation for a single observation (e.g., one event corresponds to one patient in a survival study). These challenges encourage pooling of data collected from multiple centers and scanners to achieve a critical mass of data for training models. However, pooling data from multiple centers introduces significant variability in the acquisition parameters and specifics of image reconstruction algorithms, leading to data domain shifts and inconsistencies in the collected data. The domain shift introduced by this variability in scanners reduces the value of merging data from multiple centers, reducing performance of predictive tasks such as segmentation, diagnosis, and prognosis, as well as in federated scenarios. Furthermore, domain shifts between training and test or inference data entails high risks of incorrect and uncontrolled predictions for treatment planning and personalized medicine when the inference is based on a scanner (and/or acquisition setting) that was not represented in the training data. Although this challenge applies to all medical imaging modalities, it is particularly important for computed tomography (CT) images due to the wide range of variability in manufacturers, acquisition parameters and dose, reconstruction algorithms, and customized parameter tunings in different centers.

    This dataset provides the material to reproduce several different research works conducted in conjunction with it. Researchers can use this dataset for developing their own harmonization methods at both the image and feature levels to tackle the data drift problem from one scanner to another and across different manufacturers. We also release baseline performance metrics for the similarity of scans in the image domain and feature space without harmonization. This will set a baseline to evaluate the effectiveness of various harmonization techniques in the image and feature domains.

    Methods

    The following subsections provide information about how the data were selected, acquired and prepared for publication, approximate date range of imaging studies.

    Data Acquisition

    Before the CT scans of the phantom were acquired, a survey was carried out to collect realistic acquisition and reconstruction parameter settings that are used in clinical thoracoabdominal CT scans for oncological staging, tumor search, and infectious focus detection in the portal venous contrast phase. The survey included 21 CT scanners from 9 centers across Switzerland. This translates to a tube voltage of 120 kV, a tube current-time product of 148 mAs, a pitch of 1.000, and a rotation time of 0.5 seconds for the Siemens SOMATOM Definition Edge scanner. The collimation was set to 38.4 mm, with a slice thickness/increment of 2.0 mm, and a pixel spacing of 1.367 mm. Due to vendor-specific limitations, the parameters mentioned above were slightly adapted to the closest possible parameters for each given scanner. The scans were repeated for 13 scanners from 4 manufacturers—Siemens, Philips, General Electric (GE), and Toshiba—at five dose levels (1 mGy, 3 mGy, 6 mGy, 10 mGy, 14 mGy). Only the tube current-time product (in mAs) was adjusted to set the various dose levels; all other parameters were kept the same. For each CT scanner and each dose level, 10 repeated scans (identified in the image series as #1 to #10) with identical settings were performed, except inadvertently for the Toshiba Aquilion Prime SP scanner at 10 mGy (9 repeated scans). Thus, a total of 649 CT scans were performed.

    Images were reconstructed using two or three different reconstruction algorithms per CT scan, resulting in two or three CT image series per CT scan. For all CT scans, a vendor-specific iterative reconstruction (IR) algorithm with a standard soft tissue kernel was used, resulting in 649 IR CT series. In addition, filtered backprojection (FBP) reconstruction with a standard soft tissue kernel was used for all CT scans, resulting in another 649 FBP CT series. For 2 of the 13 CT scanners, a DL based reconstruction algorithm was available. For one of these scanners, it was used for three dose levels (1 mGy, 3 mGy, 6 mGy), resulting in 30 additional CT series. For the second scanner, DL reconstruction was used for all five dose levels, resulting in 50 additional CT series. In summary, the dataset presented in this work consists of 1378 series reconstructed from 649 CT scans.

    Data Analysis

    The DICOM data files presented in conjunction with this repository did not undergo any preprocessing steps, in order to preserve all sources of variation—such as spatial shifts and voxel spacing differences introduced by various scanners. However, this repository is linked to a data descriptor paper where we thoroughly analyzed the data, as well as a Git repository that provides the code for resampling the scans to a uniform voxel spacing and performing registration.

    Usage Notes

    The dataset includes original DICOM files with all acquisition parameters stored in the DICOM tags, without any special pre-processing. For each DICOM study, the Study Description tags contain scanner IDs (e.g. "A1" or "H2") which represent 8 institutions. Each DICOM study contains multiple image series reconstructed with different reconstruction methods, plus a series containing the mask related to the various regions of interest in the liver tissue. When downloading these data, the directory and file names will follow the format described in this FAQ entry.

  13. VOTP Dataset

    • kaggle.com
    Updated Apr 10, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    sdorius (2017). VOTP Dataset [Dataset]. https://www.kaggle.com/datasets/sdorius/votpharm/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 10, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    sdorius
    Description

    This is an integration of 10 independent multi-country, multi-region, multi-cultural social surveys fielded by Gallup International between 2000 and 2013. The integrated data file contains responses from 535,159 adults living in 103 countries. In total, the harmonization project combined 571 social surveys.

    These data have value in a number of longitudinal multi-country, multi-regional, and multi-cultural (L3M) research designs. Understood as independent, though non-random, L3M samples containing a number of multiple indicator ASQ (ask same questions) and ADQ (ask different questions) measures of human development, the environment, international relations, gender equality, security, international organizations, and democracy, to name a few [see full list below].

    The data can be used for exploratory and descriptive analysis, with greatest utility at low levels of resolution (e.g. nation-states, supranational groupings). Level of resolution in analysis of these data should be sufficiently low to approximate confidence intervals.

    These data can be used for teaching 3M methods, including data harmonization in L3M, 3M research design, survey design, 3M measurement invariance, analysis, and visualization, and reporting. Opportunities to teach about para data, meta data, and data management in L3M designs.

    The country units are an unbalanced panel derived from non-probability samples of countries and respondents> Panels (countries) have left and right censorship and are thusly unbalanced. This design limitation can be overcome to the extent that VOTP panels are harmonized with public measurements from other 3M surveys to establish balance in terms of panels and occasions of measurement. Should L3M harmonization occur, these data can be assigned confidence weights to reflect the amount of error in these surveys.

    Pooled public opinion surveys (country means), when combine with higher quality country measurements of the same concepts (ASQ, ADQ), can be leveraged to increase the statistical power of pooled publics opinion research designs (multiple L3M datasets)…that is, in studies of public, rather than personal, beliefs.

    The Gallup Voice of the People survey data are based on uncertain sampling methods based on underspecified methods. Country sampling is non-random. The sampling method appears be primarily probability and quota sampling, with occasional oversample of urban populations in difficult to survey populations. The sampling units (countries and individuals) are poorly defined, suggesting these data have more value in research designs calling for independent samples replication and repeated-measures frameworks.

    **The Voice of the People Survey Series is WIN/Gallup International Association's End of Year survey and is a global study that collects the public's view on the challenges that the world faces today. Ongoing since 1977, the purpose of WIN/Gallup International's End of Year survey is to provide a platform for respondents to speak out concerning government and corporate policies. The Voice of the People, End of Year Surveys for 2012, fielded June 2012 to February 2013, were conducted in 56 countries to solicit public opinion on social and political issues. Respondents were asked whether their country was governed by the will of the people, as well as their attitudes about their society. Additional questions addressed respondents' living conditions and feelings of safety around their living area, as well as personal happiness. Respondents' opinions were also gathered in relation to business development and their views on the effectiveness of the World Health Organization. Respondents were also surveyed on ownership and use of mobile devices. Demographic information includes sex, age, income, education level, employment status, and type of living area.

  14. u

    Harmonized definition of occupational burnout. A systematic review, semantic...

    • data.unisante.ch
    Updated Oct 6, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guseva-Canu, Irina (2020). Harmonized definition of occupational burnout. A systematic review, semantic analysis, and Delphi consensus in 29 countries - Switzerland [Dataset]. https://data.unisante.ch/catalog/25
    Explore at:
    Dataset updated
    Oct 6, 2020
    Dataset authored and provided by
    Guseva-Canu, Irina
    Time period covered
    2018 - 2020
    Area covered
    Switzerland
    Description

    Abstract

    Background: A consensual definition of occupational burnout is currently lacking. We aimed to harmonize the definition of occupational burnout as a health outcome in medical research and to reach a consensus on this definition within the Network on the Coordination and Harmonisation of European Occupational Cohorts (OMEGA-NET). Methods: First, we performed a systematic review in MEDLINE, PsycINFO and EMBASE (January 1990 to August 2018) and a semantic analysis of the available definitions. We used the definitions of burnout and burnout-related concepts from the Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT) to formulate a consistent harmonized definition of the concept. Second, we sought to obtain consensus on the proposed definition using the Delphi technique. Results: We identified 88 unique definitions of burnout and assigned each of them to one of the 11 original definitions. The semantic analysis yielded a semantic proposal, formulated in accordance with SNOMED-CT as follows: “In a worker, occupational burnout or occupational physical AND emotional exhaustion state is an exhaustion due to prolonged exposure to work-related problems”. A panel of 50 experts (researchers and healthcare professionals with an interest for occupational burnout) reached consensus on this proposal at the second round of the Delphi, with 82% of experts agreeing on it. Conclusion: This study resulted in a harmonized definition of occupational burnout approved by experts from 29 countries within the OMEGA-NET. Future research should address the reproducibility of the Delphi consensus in a larger panel of experts, representing more countries, and examine the practicability of the definition.

    Geographic coverage

    International

    Analysis unit

    Number of citations per original and secondary definition of occupational burnout among studies included in the systematic review

    Kind of data

    Three csv files. The first one (ResearchStrings.csv) presents the literature research strings applied to MEDLINE, EMBASE, and PsychINFO, respectively. The second file (DefinitionsIndexation&Citation_OriginaVsUniqueDef.csv) presents the statements of different definitions of occupational burnout identified within the systematic review, their references and the references of studies citing them. Finally the third file (DefinitionsIndexation&Citation_UniqueDefinitionSummary.csv) presents the correspondence between these “unique” definitions and their “original” definitions.

  15. Harmonized Sentinel-2 MSI: MultiSpectral Instrument, Level-2A (SR)

    • developers.google.com
    Updated Jan 30, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Union/ESA/Copernicus (2020). Harmonized Sentinel-2 MSI: MultiSpectral Instrument, Level-2A (SR) [Dataset]. https://developers.google.com/earth-engine/datasets/catalog/COPERNICUS_S2_SR_HARMONIZED
    Explore at:
    Dataset updated
    Jan 30, 2020
    Dataset provided by
    European Space Agencyhttp://www.esa.int/
    Time period covered
    Mar 28, 2017 - Jul 22, 2025
    Area covered
    Description

    After 2022-01-25, Sentinel-2 scenes with PROCESSING_BASELINE '04.00' or above have their DN (value) range shifted by 1000. The HARMONIZED collection shifts data in newer scenes to be in the same range as in older scenes. Sentinel-2 is a wide-swath, high-resolution, multi-spectral imaging mission supporting Copernicus Land Monitoring studies, including the …

  16. f

    Data_Sheet_1_Multisite Harmonization of Structural DTI Networks in Children:...

    • frontiersin.figshare.com
    pdf
    Updated Jun 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adrian I. Onicas; Ashley L. Ware; Ashley D. Harris; Miriam H. Beauchamp; Christian Beaulieu; William Craig; Quynh Doan; Stephen B. Freedman; Bradley G. Goodyear; Roger Zemek; Keith Owen Yeates; Catherine Lebel (2023). Data_Sheet_1_Multisite Harmonization of Structural DTI Networks in Children: An A-CAP Study.pdf [Dataset]. http://doi.org/10.3389/fneur.2022.850642.s001
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 14, 2023
    Dataset provided by
    Frontiers
    Authors
    Adrian I. Onicas; Ashley L. Ware; Ashley D. Harris; Miriam H. Beauchamp; Christian Beaulieu; William Craig; Quynh Doan; Stephen B. Freedman; Bradley G. Goodyear; Roger Zemek; Keith Owen Yeates; Catherine Lebel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The analysis of large, multisite neuroimaging datasets provides a promising means for robust characterization of brain networks that can reduce false positives and improve reproducibility. However, the use of different MRI scanners introduces variability to the data. Managing those sources of variability is increasingly important for the generation of accurate group-level inferences. ComBat is one of the most promising tools for multisite (multiscanner) harmonization of structural neuroimaging data, but no study has examined its application to graph theory metrics derived from the structural brain connectome. The present work evaluates the use of ComBat for multisite harmonization in the context of structural network analysis of diffusion-weighted scans from the Advancing Concussion Assessment in Pediatrics (A-CAP) study. Scans were acquired on six different scanners from 484 children aged 8.00–16.99 years [Mean = 12.37 ± 2.34 years; 289 (59.7%) Male] ~10 days following mild traumatic brain injury (n = 313) or orthopedic injury (n = 171). Whole brain deterministic diffusion tensor tractography was conducted and used to construct a 90 x 90 weighted (average fractional anisotropy) adjacency matrix for each scan. ComBat harmonization was applied separately at one of two different stages during data processing, either on the (i) weighted adjacency matrices (matrix harmonization) or (ii) global network metrics derived using unharmonized weighted adjacency matrices (parameter harmonization). Global network metrics based on unharmonized adjacency matrices and each harmonization approach were derived. Robust scanner effects were found for unharmonized metrics. Some scanner effects remained significant for matrix harmonized metrics, but effect sizes were less robust. Parameter harmonized metrics did not differ by scanner. Intraclass correlations (ICC) indicated good to excellent within-scanner consistency between metrics calculated before and after both harmonization approaches. Age correlated with unharmonized network metrics, but was more strongly correlated with network metrics based on both harmonization approaches. Parameter harmonization successfully controlled for scanner variability while preserving network topology and connectivity weights, indicating that harmonization of global network parameters based on unharmonized adjacency matrices may provide optimal results. The current work supports the use of ComBat for removing multiscanner effects on global network topology.

  17. d

    Harmonized Tariff Schedule of the United States (2025)

    • catalog.data.gov
    Updated Jul 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of Tariff Affairs and Trade Agreements (2025). Harmonized Tariff Schedule of the United States (2025) [Dataset]. https://catalog.data.gov/dataset/harmonized-tariff-schedule-of-the-united-states-2024
    Explore at:
    Dataset updated
    Jul 11, 2025
    Dataset provided by
    Office of Tariff Affairs and Trade Agreements
    Description

    This dataset is the current 2025 Harmonized Tariff Schedule plus all revisions for the current year. It provides the applicable tariff rates and statistical categories for all merchandise imported into the United States; it is based on the international Harmonized System, the global system of nomenclature that is used to describe most world trade in goods.

  18. National Survey on Population and Employment, ENPE 2012 - Tunisia

    • erfdataportal.com
    Updated Apr 11, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Statistics - Tunisia (2017). National Survey on Population and Employment, ENPE 2012 - Tunisia [Dataset]. http://www.erfdataportal.com/index.php/catalog/123
    Explore at:
    Dataset updated
    Apr 11, 2017
    Dataset provided by
    National institute of statisticshttp://www.ins.tn/en/
    Economic Research Forum
    Time period covered
    2012
    Area covered
    Tunisia
    Description

    Abstract

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS (INS) - TUNISIA

    The survey aims at estimating the demographic and educational characteristics of the population. It also calculates the economic indicators of the population such as the number of active individuals, the additional demand for jobs, the number of employed and their characteristics, the number of jobs created, the characteristics of the unemployed and the unemployment rate. Furthermore, this survey estimates these indicators on the household level and their living conditions.

    The results of this survey were compared with the results of the second quarter of the national survey on population and employment 2011. It should also be noted that the National Institute of Statistics -Tunisia uses the unemployment definition and concepts adopted by the International Labour Organization. This definition implies that, the individual did not work during the week preceding the day of the interview, was looking for a job in the month preceding the date of the interview, is available to work within two weeks after the day of the interview.

    In 2010, the National Institute of Statistics has adopted a strict ILO definition for unemployment, by conditioning that the person must perform effective approaches to search for a job in the month preceding the day of the interview.

    Geographic coverage

    Covering a representative sample at the national and regional level (governorates).

    Analysis unit

    1- Household/family. 2- Individual/person.

    Universe

    The survey covered a national sample of households and all individuals permanently residing in surveyed households.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS - TUNISIA (INS)

    The sample is drawn from the frame of the 2004 General Census of Population and Housing.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Three modules were designed for data collection:

    • Household Questionnaire (Module 1): Includes questions regarding household characteristics, living conditions, individuals and their demographic, educational and economic characteristics. This module also provides information on internal and external migration.

    • Active Employed Questionnaire (Module 2): Includes questions regarding the characteristics of the employed individuals as occupation, industry and wages for employees.

    • Active Unemployed Questionnaire (Module 3): Includes questions regarding the characteristics of the unemployed as unemployment duration, the last occupation, activity, and the number of days worked during the last year...etc.

    Cleaning operations

    Harmonized Data

    • SPSS software is used to clean and harmonize the datasets.
    • The harmonization process starts with cleaning all raw data files received from the Statistical Agency.
    • Cleaned data files are then all merged to produce one data file on the individual level containing all variables subject to harmonization.
    • A country-specific program is generated for each dataset to generate/ compute/ recode/ rename/ format/ label harmonized variables.
    • A post-harmonization cleaning process is then conducted on the data.
    • Harmonized data is saved on the household as well as the individual level, in SPSS and converted to STATA format.
  19. Harmonized IACS inventory

    • zenodo.org
    zip
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Clemens Jänicke; Clemens Jänicke; Kristoffer Ansbak Petersen; Kristoffer Ansbak Petersen; Phillip Schmidts; Phillip Schmidts; Daniel Müller; Daniel Müller; Martin Rudbeck Jepsen; Martin Rudbeck Jepsen (2024). Harmonized IACS inventory [Dataset]. http://doi.org/10.5281/zenodo.14230621
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Clemens Jänicke; Clemens Jänicke; Kristoffer Ansbak Petersen; Kristoffer Ansbak Petersen; Phillip Schmidts; Phillip Schmidts; Daniel Müller; Daniel Müller; Martin Rudbeck Jepsen; Martin Rudbeck Jepsen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Nov 29, 2024
    Description

    This version contains errors; please use the data from version 1.1 or later.

    Inventory description

    The Harmonized IACS inventory of Europe-LAND is a harmonized collection of data from the Geospatial Aid (GSA) system of the Integrated Control and Administration System (IACS), which manages and controls agricultural subsidies in the European Union (EU). The GSA data are a unique data source with field-levels of land use information that are annually generated. The data carry information on crops grown per field, a unique identifier of the subsidy applicants that allows to aggregate fields to farms, information on organic cultivation and animal numbers per farm.

    Due to General Data Protection Regulations (GDPR), we are not allowed to share all data that we collected and harmonized. Therefore, there are two versions of the inventory, a public version and an internal version. The internal version contains more information and covers more countries and years.

    The public version contains all data that can be shared following the GDPR of the data providers. It covers 18 countries with time series up to 17 years. For most countries, only the crop information can be shared. However, for 6 countries also the applicant identifier and for two of them also the organic management information can be shared. If you use the data, please also cite the original sources of the data. You can find the references in the provided documentation that is in the "_Documentation.zip".

    The crop information were harmonized using the Hierarchical Crop and Agriculture Taxonomy (HCAT) of the EuroCrops project (Schneider et al., 2023). To allow for interoperability with EuroCrops, the harmonized Europe-LAND data come with the same column names that relate to the crop information. All crop mapping tables can be found in our GitHub repository.

    Column names:

    • field_id (mandatory): Unique identifier for each field per country, state or region
    • farm_id (optional): Unique identifier for each applicant per country, state, or region
    • crop_code (mandatory): The original, country-specific crop code
    • crop_name (mandatory): The original, country-specific crop name
    • EC_trans_n (mandatory): The original crop name translated into English
    • EC_hcat_n (mandatory): The machine-readable HCAT name of the crop
    • EC_hcat_c (mandatory): The 10-digit HCAT code indicating the hierarchy of the crop
    • organic (optional): Indicates whether a field was organically cultivated or not. Values: [0,1]
    • field_size (mandatory): Size of field in hectares
    • crop_area (optional): Area in hectares of crop reported in crop column. This occurs only if multiple crops are reported per parcel and refers to the area of the main crop.

    More detailed information for all countries in our harmonized inventory (including those that are not publicly available) can also be found in the documentation.

    The inventory will be updated at least annually. In future versions, we will add a new crop classification, harmonized animal data, and harmonized agri-environmental measures/eco-schemes.

    Information on data provision

    All files come as .geoparquets to stay within the space limitations of Zenodo. Geoparquets can simply be opened in QGIS via drag and drop. Additionally, various libraries from different porgramming languages are able to handle geoparquets, e.g. geoarrow and sgarrwo in R, GDAL/OGR in C++, GeoParquet.jl in Julia or Fiona in Python.

    We bundled multiple years of each country to stay below the file number limitation of Zenodo. Each zip file name indicates the country, region, or federal state and the years covered. The meaning of the abbreviations of the countries, regions, and federal states can be found in the "country_region_codes.xlsx" in the "_Documentation.zip".

    The Spanish data are also bundled across regions, as they are separated into more than 50 regions. See the country_regions_codes.xlsx tables for the meaning of the abbreviations:

    • ES_Bundle1 (Northeast): BAL, BAR, CAS, GIR, HEC, LLE, NAV, TAR, TER, ZAR
    • ES_Bundel2 (Northwest): ACO, ALA, AST, BUR, CAN, GUI, LEO, LRI, LUG, OUR, PAL, PON, VIZ, VLD, ZAM
    • ES_Bundle3 (West): ALB, AVI, CAC, CIU, CUE, GUA, MAD, SAL, SEG, SOR, TOL
    • ES_Bundle4 (Southwest): ALI, ALM, BAD, CAD, CDB, GRA, HEV, JAE, LAP, MAL, MUR, SAN, SEV, VLC
  20. f

    Demographics and clinical characteristics of study participants, including...

    • plos.figshare.com
    xls
    Updated Dec 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu (2024). Demographics and clinical characteristics of study participants, including healthy controls (HC), episodic migraine (EM), and chronic migraine (CM) patients from two datasets (DS1 and DS2). [Dataset]. http://doi.org/10.1371/journal.pone.0288300.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Dec 31, 2024
    Dataset provided by
    PLOS ONE
    Authors
    Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The table includes sample size, mean age with standard deviation, gender distribution (female/male), and the presence of aura.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu (2024). Table 6 - [Dataset]. http://doi.org/10.1371/journal.pone.0288300.t006

Table 6 -

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Dec 31, 2024
Dataset provided by
PLOS ONE
Authors
Hyunsoo Yoon; Todd J. Schwedt; Catherine D. Chong; Oyekanmi Olatunde; Teresa Wu
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Multicenter and multi-scanner imaging studies may be necessary to ensure sufficiently large sample sizes for developing accurate predictive models. However, multicenter studies, incorporating varying research participant characteristics, MRI scanners, and imaging acquisition protocols, may introduce confounding factors, potentially hindering the creation of generalizable machine learning models. Models developed using one dataset may not readily apply to another, emphasizing the importance of classification model generalizability in multi-scanner and multicenter studies for producing reproducible results. This study focuses on enhancing generalizability in classifying individual migraine patients and healthy controls using brain MRI data through a data harmonization strategy. We propose identifying a ’healthy core’—a group of homogeneous healthy controls with similar characteristics—from multicenter studies. The Maximum Mean Discrepancy (MMD) in Geodesic Flow Kernel (GFK) space is employed to compare two datasets, capturing data variabilities and facilitating the identification of this ‘healthy core’. Homogeneous healthy controls play a vital role in mitigating unwanted heterogeneity, enabling the development of highly accurate classification models with improved performance on new datasets. Extensive experimental results underscore the benefits of leveraging a ’healthy core’. We utilized two datasets: one comprising 120 individuals (66 with migraine and 54 healthy controls), and another comprising 76 individuals (34 with migraine and 42 healthy controls). Notably, a homogeneous dataset derived from a cohort of healthy controls yielded a significant 25% accuracy improvement for both episodic and chronic migraineurs.

Search
Clear search
Close search
Google apps
Main menu