100+ datasets found
  1. Quality and completeness scores for curated and non-curated datasets

    • springernature.figshare.com
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Graham Smith; Iain Hrynaszkiewicz; Rebecca Grant (2023). Quality and completeness scores for curated and non-curated datasets [Dataset]. http://doi.org/10.6084/m9.figshare.6200357.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Graham Smith; Iain Hrynaszkiewicz; Rebecca Grant
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These data contain aggregated survey responses assessing the quality and completeness of metadata for datasets deposited in public repositories and for the same datasets after professional curation.Responses were provided by 10 professional editors representing life, social and physical sciences. Each were randomly assigned four datasets to assess, half (20) of which had been curated according to the standards of Springer Nature's Research Data Support service and half (20) which had not.Curated datasets were shared privately with research participants. The versions that did not receive curation via Springer Nature's Research Data Support are openly accessible.Single-blind testing was employed; the researchers were not made aware which datasets had been curated and which had not, and it was ensured that no participant assessed the same dataset before and after curation. Responses were collected via an online survey. The relevant question and scoring is provided below:Rate the overall quality and completeness of the metadata for the dataset (with regards to finding and accessing and citing the data, not reusing the data)1 = not complete, 5 = very complete

  2. d

    Data Rescue & Curation Best Practices Guide

    • search.dataone.org
    • borealisdata.ca
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OCUL Data Community (ODC) Data Rescue Group (2023). Data Rescue & Curation Best Practices Guide [Dataset]. http://doi.org/10.5683/SP2/Y8MQXV
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    OCUL Data Community (ODC) Data Rescue Group
    Description

    The aim of the Data Rescue & Curation Best Practices Guide is to provide an accessible and hands-on approach to handling data rescue and digital curation of at-risk data for use in secondary research. We provide a set of examples and workflows for addressing common challenges with social science survey data that can be applied to other social and behavioural research data. The goal of this guide and set of workflows presented is to improve librarians’ and data curators’ skills in providing access to high-quality, well-documented, and reusable research data. The aspects of data curation that are addressed throughout this guide are adopted from long-standing data library and archiving practices, including: documenting data using standard metadata, file and data organization; using open and software-agnostic formats; and curating research data for reuse.

  3. Dataset: FAIR Biomedical Research Software (FAIR-BioRS) manuscript

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jul 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bhavesh Patel; Bhavesh Patel; Sanjay Soundarajan; Sanjay Soundarajan; Hervé Ménager; Hervé Ménager; Zicheng Hu; Zicheng Hu (2023). Dataset: FAIR Biomedical Research Software (FAIR-BioRS) manuscript [Dataset]. http://doi.org/10.5281/zenodo.8112100
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 5, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bhavesh Patel; Bhavesh Patel; Sanjay Soundarajan; Sanjay Soundarajan; Hervé Ménager; Hervé Ménager; Zicheng Hu; Zicheng Hu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data related to our FAIR-BioRS manuscript. More details are available at the associated GitHub repository: https://github.com/FAIR-BioRS/Data.

  4. o

    Research Software Workshop: Guidelines and Metrics for Metadata Curation

    • explore.openaire.eu
    Updated Jun 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Morane Gruenpeter; Neil Chue Hong; Sabrina Granger; Elena Breitmoser; Mike Priddy; Mario Antonioletti; Paula Andrea Martinez; Tom Honeyman; Julia A. Collins; Rita Meneses (2023). Research Software Workshop: Guidelines and Metrics for Metadata Curation [Dataset]. http://doi.org/10.5281/zenodo.8075097
    Explore at:
    Dataset updated
    Jun 23, 2023
    Authors
    Morane Gruenpeter; Neil Chue Hong; Sabrina Granger; Elena Breitmoser; Mike Priddy; Mario Antonioletti; Paula Andrea Martinez; Tom Honeyman; Julia A. Collins; Rita Meneses
    Description

    The Research Software workshop, was a co-located event with the RDA plenary. Organized by the FAIR-IMPACT European project, with the contribution of the FAIRCORE4EOSC European project. The workshop goals were: Identifying what we have - what is the current landscape of guidelines and metrics for Research Software; Discussing what we need - what are the needs that can be provided by infrastructures and research support staff; Planning how we create adapted guidelines and metrics to the current reality and how we improve the scholarly ecosystem. During the workshop we received valuable feedback which will impact the expected deliverables by the two European projects: FAIR-IMPACT: D4.4 - Guidelines for recommended metadata standard for research software within EOSC FAIR-IMPACT: D5.2 - Metrics for automated FAIR software assessment in a disciplinary context We strive that this process will be as transparent as possible and will yield a result which will be useful for the community and for the service providers that handle research software or act in the scholarly ecosystem, this is why the full planning of the workshop is now made available on Zenodo. We invited the participants to prepare for the session by reviewing the following publications: the SIRS report recommendations, the FAIR4RS principles the CodeMeta initiative vocabulary The authors of this repository are the organizing committee of the workshop and the contributors in this repository are the participants of the workshop onsite and online.

  5. A Standard Metadata Template For Representing Rock Specimens

    • data.csiro.au
    • researchdata.edu.au
    Updated May 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anusuriya Devaraju; Tina Shelton; Tenten Pinchand; Jacob Walmsley; Anusree Ramachandran Menon; Kirsten Fenselau; Louise Schoneveld (2025). A Standard Metadata Template For Representing Rock Specimens [Dataset]. http://doi.org/10.25919/myem-sf84
    Explore at:
    Dataset updated
    May 28, 2025
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    Anusuriya Devaraju; Tina Shelton; Tenten Pinchand; Jacob Walmsley; Anusree Ramachandran Menon; Kirsten Fenselau; Louise Schoneveld
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Dataset funded by
    CSIROhttp://www.csiro.au/
    Description

    This dataset contains a standard template for representing the metadata of rock specimens (e.g., core, microanalysis, hand grab) in the CSIRO Mineral Resources Discovery program. The template includes core properties of samples such as their name, identifier, type, and location, as well as associated metadata such as project, drilling contexts, hazard declaration and physical storage. The template will be used to catalogue legacy and specimens systematically collected through mineral exploration projects. It has been developed iteratively, revised, and improved based on feedback from researchers and lab technicians. This standardized template can prevent duplicate sample metadata entry and lower metadata redundancy, thereby improving the program's physical sample curation and discovery. Lineage: The template includes a readme section summarising all the metadata fields, including their requirements and definitions. The template incorporates several established controlled terms representing, e.g., sample type, rock type, drill type, EPSG and hazard information to ensure consistency in metadata entry.

  6. f

    Additional file 1 of OMD Curation Toolkit: a workflow for in-house curation...

    • springernature.figshare.com
    xls
    Updated Aug 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Piquer-Esteban; Vicente Arnau; Wladimiro Diaz; Andrés Moya (2024). Additional file 1 of OMD Curation Toolkit: a workflow for in-house curation of public omics datasets [Dataset]. http://doi.org/10.6084/m9.figshare.26718447.v1
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Aug 15, 2024
    Dataset provided by
    figshare
    Authors
    Samuel Piquer-Esteban; Vicente Arnau; Wladimiro Diaz; Andrés Moya
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 1. Supplementary Table 1. Comparison of different omics data tools.

  7. Metadata record for: Data-driven curation process for describing the blood...

    • springernature.figshare.com
    txt
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scientific Data Curation Team (2023). Metadata record for: Data-driven curation process for describing the blood glucose management in the intensive care unit [Dataset]. http://doi.org/10.6084/m9.figshare.13564187.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Scientific Data Curation Team
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains key characteristics about the data described in the Data Descriptor Data-driven curation process for describing the blood glucose management in the intensive care unit. Contents:

        1. human readable metadata summary table in CSV format
    
    
        2. machine readable metadata file in JSON format
    
  8. u

    Data from: OPENBIB: Selected curated open metadata based on OpenAlex

    • pub.uni-bielefeld.de
    • zenodo.org
    Updated May 14, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nick Haupka; Jack Culbert; Paul Donner; Najko Jahn; Christopher Lenke; Philipp Mayr; Andreas Meier; Bernhard Mittermaier; Barbara Scheidt; Stephan Stahlschmidt; Niels Christian Taubert (2025). OPENBIB: Selected curated open metadata based on OpenAlex [Dataset]. https://pub.uni-bielefeld.de/record/3003455
    Explore at:
    Dataset updated
    May 14, 2025
    Authors
    Nick Haupka; Jack Culbert; Paul Donner; Najko Jahn; Christopher Lenke; Philipp Mayr; Andreas Meier; Bernhard Mittermaier; Barbara Scheidt; Stephan Stahlschmidt; Niels Christian Taubert
    Description

    This dataset, compiled by the German Kompetenznetzwerk Bibliometrie, provides access to curated bibliometric data in OpenAlex focussing on the German research landscape.

    Curated data is provided for following entities:

    - Address information
    - Publishers
    - Funding information
    - Document types
    - Transformative agreements
    - Authors (tba)

    For an overview about the tables included, see data-overview.md

    This release is based on the August 2024 snapshot of OpenAlex. The OPENBIB snapshot is offered in both CSV and JSONL format.

    This is a initial release to demonstrate the current state of metadata curation. The aim is to continue these efforts and improve the curation together with the community and data providers.

    Data is made available under the CC0 license.

    Github repository: https://github.com/kbopenbib/kbopenbib_data/

  9. COVID-19 Configurable Data Curation System (COVID-19 CDCS)

    • catalog.data.gov
    • cloud.csiss.gmu.edu
    • +1more
    Updated Sep 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2024). COVID-19 Configurable Data Curation System (COVID-19 CDCS) [Dataset]. https://catalog.data.gov/dataset/covid-19-configurable-data-curation-system-covid-19-cdcs-d9dbe
    Explore at:
    Dataset updated
    Sep 11, 2024
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    The COVID-19 CDCS represents a metadata repository that provides a catalog of COVID-19 related research literature and data.

  10. n

    Data for: Sustainable connectivity in a community repository

    • data.niaid.nih.gov
    • search.dataone.org
    • +2more
    zip
    Updated Dec 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ted Habermann (2023). Data for: Sustainable connectivity in a community repository [Dataset]. http://doi.org/10.5061/dryad.nzs7h44xr
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 7, 2023
    Dataset provided by
    Metadata Game Changers (United States)
    Authors
    Ted Habermann
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Identifiers of many kinds are the key to creating unambiguous and persistent connections between research objects and other items in the global research infrastructure (GRI). Many repositories are implementing mechanisms to collect and integrate these identifiers into their submission and record curation processes. This bodes well for a well-connected future, but many existing resources submitted in the past are missing these identifiers, thus missing the connections required for inclusion in the connected infrastructure. Re-curation of these metadata is required to make these connections. The Dryad Data Repository has existed since 2008 and has successfully re-curated the repository metadata several times, adding identifiers for research organizations, funders, and researchers. Understanding and quantifying these successes depends on measuring repository and identifier connectivity. Metrics are described and applied to the entire repository here. Identifiers for papers (DOIs) connected to datasets in Dryad have long been a critical part of the Dryad metadata creation and curation processes. Since 2019, the % of datasets with connected papers has decreased from 100% to less than 40%. This decrease has significant ramifications for the re-curation efforts described above as connected papers are an important source of metadata. In addition, missing connections to papers make understanding and re-using datasets more difficult. Connections between datasets and papers are many times difficult to make because of time lags between submission and publication, lack of clear mechanisms for citing datasets and other research objects from papers, changing focus of researchers, and other obstacles. The Dryad community of members, i.e. users, research institutions, publishers, and funders have vested interests in identifying these connections and critical roles in the curation and re-curation efforts. Their engagement will be critical in building on the successes Dryad has already achieved and ensuring sustainable connectivity in the future. Methods These data are Dryad metadata retrieved from https://datadryad.org and translated into csv files. There are two datasets: 1. DryadJournalDataset was retrieved from Dryad using the ISSNs in the file DryadJournalDataset_ISSNs.txt, although some had no data. 2. DryadOrganizationDataset was retrieved from Dryad using the RORs in the file DryadOrganizationDataset_RORs.txt, although some had no data. Each dataset includes four types of metadata: identifiers, funders, keywords, and related works, each in a separate comma (.csv) or tab (.tsv) delimited files. There are also Microsoft Excel files (.xlsx) for the identifier metadata and connectivity summaries for each dataset (*.html). The connectivity summaries include summaries of each parameter in all four data files with definitions, counts, unique counts, most frequent values, and completeness. These data formed the basis for an analysis of the connectivity of the Dryad repository for organizations, funders, and people.

  11. D

    Harvester-Curator, a tool to elevate metadata provision in data and/or...

    • darus.uni-stuttgart.de
    Updated Mar 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarbani Roy; Fangfang Wang; Dennis Gläser (2024). Harvester-Curator, a tool to elevate metadata provision in data and/or software repositories [Dataset]. http://doi.org/10.18419/DARUS-3785
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 27, 2024
    Dataset provided by
    DaRUS
    Authors
    Sarbani Roy; Fangfang Wang; Dennis Gläser
    License

    https://spdx.org/licenses/MIT.htmlhttps://spdx.org/licenses/MIT.html

    Dataset funded by
    DFG
    Description

    Harvester-Curator is a tool, designed to elevate metadata provision in data repositories. In the first phase, Harvester-Curator acts as a scanner, navigating through user code and/or data repositories to identify suitable parsers for different file types. It collects metadata from each of the files by applying corresponding parsers and then compiles this information into a structured JSON file, providing researchers with a seamless and automated solution for metadata collection. Moving to the second phase, Harvester-Curator transforms into a curator, leveraging the harvested metadata to populate metadata fields in a target repository. By automating this process, it not only relieves researchers of the manual burden but also ensures the accuracy and comprehensiveness of the metadata. Beyond its role in streamlining the intricate task of metadata collection, this tool contributes to the broader objective of elevating data accessibility and interoperability within repositories.

  12. Z

    Data from: HVG

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 21, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Palkó, Gábor (2022). HVG [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6546176
    Explore at:
    Dataset updated
    Jun 21, 2022
    Dataset provided by
    Palkó, Gábor
    Sárközi-Lindner, Zsófia
    Fellegi, Zsófia
    Indig, Balázs
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This object has been created as a part of the web harvesting project of the Eötvös Loránd University Department of Digital Humanities ELTE DH. Learn more about the workflow HERE about the software used HERE.The aim of the project is to make online news articles and their metadata suitable for research purposes. The archiving workflow is designed to prevent modification or manipulation of the downloaded content. The current version of the curated content with normalized formatting in standard TEI XML format with Schema.org encoded metadata is available HERE. The detailed description of the raw content is the following:

    The portal's archived content (from 2000-09-10 to 2021-10-16) in WARC format available HERE (crawled: 2021-09-02T19:51:04.673039 - 2021-10-16T16:25:30.654675).Please fill in the following form before requesting access to this dataset: ACCESS FORM

  13. A Standard Metadata Template for Representing Mineral Spectral Reference...

    • researchdata.edu.au
    • data.csiro.au
    datadownload
    Updated May 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carsten Laukamp; Tina Shelton; Anusuriya Devaraju; Anusree Ramachandran Menon; Laukamp, C. (2025). A Standard Metadata Template for Representing Mineral Spectral Reference Samples [Dataset]. http://doi.org/10.25919/NEMV-ER15
    Explore at:
    datadownloadAvailable download formats
    Dataset updated
    May 30, 2025
    Dataset provided by
    CSIROhttp://www.csiro.au/
    Authors
    Carsten Laukamp; Tina Shelton; Anusuriya Devaraju; Anusree Ramachandran Menon; Laukamp, C.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a standard template for representing the metadata of mineral spectral reference specimens in the CSIRO Mineral Resources Discovery program. The template includes core properties of samples such as their name, identifier, type, and location, as well as associated metadata such as project, hazard declaration and physical storage. The template will be used to catalogue reference samples used for mineral spectral analysis (NVCL). It has been developed iteratively, revised, and improved based on feedback from researchers and lab technicians. This standardized template can prevent duplicate sample metadata entry and lower metadata redundancy, thereby improving the program's physical sample curation and discovery. Lineage: This template was built on the CMR rock metadata template (https://doi.org/10.25919/2prf-dk88). The template includes a readme section summarising all the metadata fields, including their requirements and definitions. The template incorporates several established controlled terms representing, e.g., sample type, mineral type, EPSG and hazard information to ensure consistency in metadata entry. The template also contains few metadata fields that are specific to mineral spectra samples like different analysis conducted for the samples (XRD, Whole-rock geochemical analysis, etc).

  14. Data curation materials in "Daily life in the Open Biologist's second job,...

    • zenodo.org
    bin, tiff, txt
    Updated Sep 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Livia C T Scorza; Livia C T Scorza; Tomasz Zieliński; Tomasz Zieliński; Andrew J Millar; Andrew J Millar (2024). Data curation materials in "Daily life in the Open Biologist's second job, as a Data Curator" [Dataset]. http://doi.org/10.5281/zenodo.13321937
    Explore at:
    tiff, txt, binAvailable download formats
    Dataset updated
    Sep 18, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Livia C T Scorza; Livia C T Scorza; Tomasz Zieliński; Tomasz Zieliński; Andrew J Millar; Andrew J Millar
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This is the supplementary material accompanying the manuscript "Daily life in the Open Biologist’s second job, as a Data Curator", published in Wellcome Open Research.

    It contains:

    - Python_scripts.zip: Python scripts used for data cleaning and organization:

    -add_headers.py: adds specified headers automatically to a list of csv files, creating new output files containing a "_with_headers" suffix.

    -count_NaN_values.py: counts the total number of rows containing null values in a csv file and prints the location of null values in the (row, column) format.

    -remove_rowsNaN_file.py: removes rows containing null values in a single csv file and saves the modified file with a "_dropNaN" suffix.

    -remove_rowsNaN_list.py: removes rows containing null values in list of csv files and saves the modified files with a "_dropNaN" suffix.

    - README_template.txt: a template for a README file to be used to describe and accompany a dataset.

    - template_for_source_data_information.xlsx: a spreadsheet to help manuscript authors to keep track of data used for each figure (e.g., information about data location and links to dataset description).

    - Supplementary_Figure_1.tif: Example of a dataset shared by us on Zenodo. The elements that make the dataset FAIR are indicated by the respective letters. Findability (F) is achieved by the dataset unique and persistent identifier (DOI), as well as by the related identifiers for the publication and dataset on GitHub. Additionally, the dataset is described with rich metadata, (e.g., keywords). Accessibility (A) is achieved by the ease of visualization and downloading using a standardised communications protocol (https). Also, the metadata are publicly accessible and licensed under the public domain. Interoperability (I) is achieved by the open formats used (CSV; R), and metadata are harvestable using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), a low-barrier mechanism for repository interoperability. Reusability (R) is achieved by the complete description of the data with metadata in README files and links to the related publication (which contains more detailed information, as well as links to protocols on protocols.io). The dataset has a clear and accessible data usage license (CC-BY 4.0).

  15. Z

    XRef Dataset: ICBO_2020 paper

    • data.niaid.nih.gov
    Updated Jul 10, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amir LAADHAR (2020). XRef Dataset: ICBO_2020 paper [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3842749
    Explore at:
    Dataset updated
    Jul 10, 2020
    Dataset provided by
    Clement JONQUET
    Amir LAADHAR
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A curated dataset of XRefs extracted from agri-food ontologies and curated using OMHT (Ontology Mapping Harvester Tool), which is a script in Java language designed to automatically extract and semi-automatically curate declared mappings from ontologies and reify them into specific objects with metadata and provenance information.

  16. o

    Enabling Interdisciplinary Research in the European Science Cloud through...

    • explore.openaire.eu
    Updated Apr 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vaidas Morkevicius; Patricia Anne Nugent (2023). Enabling Interdisciplinary Research in the European Science Cloud through Metadata Standards. [Dataset]. http://doi.org/10.5281/zenodo.7827905
    Explore at:
    Dataset updated
    Apr 14, 2023
    Authors
    Vaidas Morkevicius; Patricia Anne Nugent
    Description

    The Lithuanian Data Archive for Social Sciences and Humanities (LiDA) developed a framework for harvesting and delivering rich metadata of social science data (SSD) objects for the EOSC Portal Service Catalogue, with the aim of meeting the FAIR F2 principle of providing rich metadata in a machine-readable format. The project identified and recommended the vocabulary for standardized description of major data types for SSD objects, which allows for more detailed descriptions of data and increases possibilities for secondary analysis on generic data portals. The framework is extensible to other disciplines, which benefits developers of software, infrastructures, and data curation standards in other communities. Scientists and government bodies could benefit from the resources produced during the project, such as the recommendations and milestones, which may help them develop their own data curation software, data repositories, and controlled vocabularies. The project’s impact lies in benefiting the wider research community by enabling better data discovery, reuse, and interoperability in the EOSC Portal Service Catalogue.

  17. Data from: Székelyhon

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Apr 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gábor Palkó; Gábor Palkó; Balázs Indig; Balázs Indig; Zsófia Fellegi; Zsófia Fellegi; Zsófia Sárközi-Lindner; Zsófia Sárközi-Lindner (2022). Székelyhon [Dataset]. http://doi.org/10.5281/zenodo.5849138
    Explore at:
    binAvailable download formats
    Dataset updated
    Apr 13, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gábor Palkó; Gábor Palkó; Balázs Indig; Balázs Indig; Zsófia Fellegi; Zsófia Fellegi; Zsófia Sárközi-Lindner; Zsófia Sárközi-Lindner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 29, 2017 - May 21, 2021
    Description

    This object has been created as a part of the web harvesting project of the Eötvös Loránd University Department of Digital Humanities ELTE DH. Learn more about the workflow HERE about the software used HERE.The aim of the project is to make online news articles and their metadata suitable for research purposes. The archiving workflow is designed to prevent modification or manipulation of the downloaded content. The current version of the curated content with normalized formatting in standard TEI XML format with Schema.org encoded metadata is available HERE. The detailed description of the raw content is the following:

    • The portal's archived content (from 2017-01-29 to 2021-05-21) in WARC format available HERE (crawled: 2021-05-21T09:51:12.531750 - 2021-05-21T18:38:24.961226).

    Please fill in the following form before requesting access to this dataset:ACCES FORM

  18. Data from: Transindex

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Apr 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gábor Palkó; Gábor Palkó; Balázs Indig; Balázs Indig; Zsófia Fellegi; Zsófia Fellegi; Zsófia Sárközi-Lindner; Zsófia Sárközi-Lindner (2022). Transindex [Dataset]. http://doi.org/10.5281/zenodo.4899444
    Explore at:
    binAvailable download formats
    Dataset updated
    Apr 13, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gábor Palkó; Gábor Palkó; Balázs Indig; Balázs Indig; Zsófia Fellegi; Zsófia Fellegi; Zsófia Sárközi-Lindner; Zsófia Sárközi-Lindner
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2001 - May 22, 2021
    Description

    This object has been created as a part of the web harvesting project of the Eötvös Loránd University Department of Digital Humanities ELTE DH. Learn more about the workflow HERE about the software used HERE.The aim of the project is to make online news articles and their metadata suitable for research purposes. The archiving workflow is designed to prevent modification or manipulation of the downloaded content. The current version of the curated content with normalized formatting in standard TEI XML format with Schema.org encoded metadata is available HERE. The detailed description of the raw content is the following:

    • The portal's archived content (from 2001-01-01 to 2021-05-22) in WARC format available HERE (crawled: 2021-05-21T10:01:38.592950 - 2021-05-22T20:50:22.079445).
  19. f

    Metadata review - Manually curated dataset

    • figshare.com
    xlsx
    Updated Feb 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mathew Harris (2023). Metadata review - Manually curated dataset [Dataset]. http://doi.org/10.6084/m9.figshare.21387351.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 8, 2023
    Dataset provided by
    figshare
    Authors
    Mathew Harris
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is a manually curated dataset of metadata downloaded off GenBank. The metadata is information pertaining to plant-associated fungal communities from marker gene sequences.

  20. w

    New&Notable

    • data.wu.ac.at
    • datadiscoverystudio.org
    csv, json, rdf, xml
    Updated Sep 7, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State of Washington (2017). New&Notable [Dataset]. https://data.wu.ac.at/schema/data_gov/NjIzNWVmZTctNTc5Yi00ZDg1LTgzN2ItNTUxYjVkMjVmYmI3
    Explore at:
    rdf, csv, xml, jsonAvailable download formats
    Dataset updated
    Sep 7, 2017
    Dataset provided by
    State of Washington
    Description

    Curated list of new and notable datasets, visualizations etc.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Graham Smith; Iain Hrynaszkiewicz; Rebecca Grant (2023). Quality and completeness scores for curated and non-curated datasets [Dataset]. http://doi.org/10.6084/m9.figshare.6200357.v1
Organization logo

Quality and completeness scores for curated and non-curated datasets

Related Article
Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
xlsxAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Graham Smith; Iain Hrynaszkiewicz; Rebecca Grant
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

These data contain aggregated survey responses assessing the quality and completeness of metadata for datasets deposited in public repositories and for the same datasets after professional curation.Responses were provided by 10 professional editors representing life, social and physical sciences. Each were randomly assigned four datasets to assess, half (20) of which had been curated according to the standards of Springer Nature's Research Data Support service and half (20) which had not.Curated datasets were shared privately with research participants. The versions that did not receive curation via Springer Nature's Research Data Support are openly accessible.Single-blind testing was employed; the researchers were not made aware which datasets had been curated and which had not, and it was ensured that no participant assessed the same dataset before and after curation. Responses were collected via an online survey. The relevant question and scoring is provided below:Rate the overall quality and completeness of the metadata for the dataset (with regards to finding and accessing and citing the data, not reusing the data)1 = not complete, 5 = very complete

Search
Clear search
Close search
Google apps
Main menu