100+ datasets found
  1. Quality and completeness scores for curated and non-curated datasets

    • springernature.figshare.com
    xlsx
    Updated Jun 1, 2023
    Cite
    Graham Smith; Iain Hrynaszkiewicz; Rebecca Taylor-Grant (2023). Quality and completeness scores for curated and non-curated datasets [Dataset]. http://doi.org/10.6084/m9.figshare.6200357.v1
    Available download formats: xlsx
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Graham Smith; Iain Hrynaszkiewicz; Rebecca Taylor-Grant
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These data contain aggregated survey responses assessing the quality and completeness of metadata for datasets deposited in public repositories and for the same datasets after professional curation. Responses were provided by 10 professional editors representing life, social and physical sciences. Each was randomly assigned four datasets to assess, half (20) of which had been curated according to the standards of Springer Nature's Research Data Support service and half (20) of which had not. Curated datasets were shared privately with research participants. The versions that did not receive curation via Springer Nature's Research Data Support are openly accessible. Single-blind testing was employed; the researchers were not made aware which datasets had been curated and which had not, and it was ensured that no participant assessed the same dataset before and after curation. Responses were collected via an online survey. The relevant question and scoring are provided below:

    Rate the overall quality and completeness of the metadata for the dataset (with regards to finding and accessing and citing the data, not reusing the data)
    1 = not complete, 5 = very complete

  2. Data from: A Novel Curated Scholarly Graph Connecting Textual and Data...

    • zenodo.org
    • resodate.org
    zip
    Updated Dec 21, 2022
    Cite
    Ornella Irrera; Andrea Mannocci; Paolo Manghi; Gianmaria Silvello (2022). A Novel Curated Scholarly Graph Connecting Textual and Data Publications [Dataset]. http://doi.org/10.5281/zenodo.7464120
    Available download formats: zip
    Dataset updated
    Dec 21, 2022
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Ornella Irrera; Andrea Mannocci; Paolo Manghi; Gianmaria Silvello
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains an open and curated scholarly graph we built as a training and test set for data discovery, data connection, author disambiguation, and link prediction tasks. This graph represents the European Marine Science community included in the OpenAIRE Graph. The nodes of the graph we release represent publications, datasets, software, and authors respectively; edges interconnecting research products always have the publication as source, and the dataset/software as target. In addition, edges are labeled with semantics that outline whether the publication is referencing, citing, documenting, or supplementing the related outcome. To curate and enrich node metadata and edge semantics, we relied on the information extracted from the PDF of the publications and the datasets/software webpages respectively. We curated the authors so as to remove duplicated nodes representing the same person.

    The resource we release counts 4,047 publications, 5,488 datasets, 22 software, 21,561 authors, and 9,692 edges connecting publications to datasets/software. This graph is in the curated_MES folder. We provide this resource as:

    1. a property graph: we provide the dump that can be imported into Neo4j
    2. 5 jsonl files containing publications, datasets, software, authors, and relationships respectively. Each line of a jsonl file contains a JSON object representing a node and contains the metadata of that node (or a relationship).
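
    The JSON Lines layout described above (one JSON object per line) can be read with the standard library alone. The following is a minimal sketch, not the authors' loader: the helper name `read_jsonl` and the sample field names (`id`, `title`) are illustrative assumptions and do not reflect the dataset's actual schema.

    ```python
    import json
    from io import StringIO


    def read_jsonl(lines):
        """Yield one dict per non-empty line of a JSON Lines stream."""
        for line in lines:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)


    # Hypothetical sample standing in for one of the jsonl files;
    # the field names are assumptions made for illustration only.
    sample = StringIO(
        '{"id": "pub-1", "title": "Example"}\n'
        "\n"
        '{"id": "pub-2", "title": "Another"}\n'
    )
    records = list(read_jsonl(sample))
    ```

    In practice the same function could be given an open file handle, since a text file object also iterates line by line.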

    We provide two additional scholarly graphs:

    • The curated MES graph with the removed edges. During the curation we removed some edges since they were labeled with inconsistent or imprecise semantics. This graph includes the same nodes and edges as the previous one, and, in addition, it contains the edges removed during the curation pipeline; these edges are marked as Removed. This graph is in the curated_MES_with_removed_semantics folder.
    • The original MES community of OpenAIRE. It represents the MES community extracted from the OpenAIRE Research Graph. This graph has not been curated, and the metadata and semantics are those of the OpenAIRE Research Graph. This graph is in the original_MES_community folder.

  3. Data curation materials in "Daily life in the Open Biologist's second job,...

    • dtechtive.com
    • zenodo.org
    Updated Aug 14, 2024
    Cite
    Zenodo (2024). Data curation materials in "Daily life in the Open Biologist's second job, as a Data Curator" [Dataset]. https://dtechtive.com/datasets/49296
    Dataset updated
    Aug 14, 2024
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Area covered
    Scotland
    Description

    This is the supplementary material accompanying the manuscript "Daily life in the Open Biologist’s second job, as a Data Curator", published in Wellcome Open Research. It contains:

    - Python_scripts.zip: Python scripts used for data cleaning and organization:
        - add_headers.py: adds specified headers automatically to a list of csv files, creating new output files containing a "_with_headers" suffix.
        - count_NaN_values.py: counts the total number of rows containing null values in a csv file and prints the location of null values in the (row, column) format.
        - remove_rowsNaN_file.py: removes rows containing null values in a single csv file and saves the modified file with a "_dropNaN" suffix.
        - remove_rowsNaN_list.py: removes rows containing null values in a list of csv files and saves the modified files with a "_dropNaN" suffix.
    - README_template.txt: a template for a README file to be used to describe and accompany a dataset.
    - template_for_source_data_information.xlsx: a spreadsheet to help manuscript authors to keep track of data used for each figure (e.g., information about data location and links to dataset description).
    - Supplementary_Figure_1.tif: Example of a dataset shared by us on Zenodo. The elements that make the dataset FAIR are indicated by the respective letters:
        - Findability (F) is achieved by the dataset's unique and persistent identifier (DOI), as well as by the related identifiers for the publication and dataset on GitHub. Additionally, the dataset is described with rich metadata (e.g., keywords).
        - Accessibility (A) is achieved by the ease of visualization and downloading using a standardised communications protocol (https). Also, the metadata are publicly accessible and licensed under the public domain.
        - Interoperability (I) is achieved by the open formats used (CSV; R), and metadata are harvestable using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH), a low-barrier mechanism for repository interoperability.
        - Reusability (R) is achieved by the complete description of the data with metadata in README files and links to the related publication (which contains more detailed information, as well as links to protocols on protocols.io). The dataset has a clear and accessible data usage license (CC-BY 4.0).
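
    The null-value handling described for the cleaning scripts can be sketched as follows. This is not the code shipped in Python_scripts.zip: it is an independent illustration using only the standard `csv` module, and the set of strings treated as null (`NULL_TOKENS`), the function names, and the sample table are all assumptions.

    ```python
    import csv
    from io import StringIO

    # Assumption: these strings (case-insensitive) count as null values.
    NULL_TOKENS = {"", "nan", "na", "null"}


    def is_null(cell):
        """Return True if a CSV cell should be treated as a null value."""
        return cell.strip().lower() in NULL_TOKENS


    def find_nulls(rows):
        """Return (row, column) index pairs of null cells in the data rows."""
        return [
            (r, c)
            for r, row in enumerate(rows)
            for c, cell in enumerate(row)
            if is_null(cell)
        ]


    def drop_null_rows(rows):
        """Keep only the rows that contain no null cell."""
        return [row for row in rows if not any(is_null(cell) for cell in row)]


    # Hypothetical three-row table with two incomplete rows.
    raw = "sample,length,mass\na,1.2,3.4\nb,,2.0\nc,0.9,NaN\n"
    header, *data = list(csv.reader(StringIO(raw)))
    null_positions = find_nulls(data)
    clean = drop_null_rows(data)
    ```

    Row and column indices here are zero-based and exclude the header row; the original scripts may use a different convention.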

  4. A Standard Metadata Template For Representing Rock Specimens

    • researchdata.edu.au
    datadownload
    Updated Sep 10, 2025
    Cite
    Louise Schoneveld; Kirsten Fenselau; Anusree Ramachandran Menon; Jacob Walmsley; Tenten Pinchand; Tina Shelton; Anusuriya Devaraju; Guillerma Tenten Pinchand (2025). A Standard Metadata Template For Representing Rock Specimens [Dataset]. http://doi.org/10.25919/MSH6-KT43
    Available download formats: datadownload
    Dataset updated
    Sep 10, 2025
    Dataset provided by
    CSIRO (https://www.csiro.au/)
    Authors
    Louise Schoneveld; Kirsten Fenselau; Anusree Ramachandran Menon; Jacob Walmsley; Tenten Pinchand; Tina Shelton; Anusuriya Devaraju; Guillerma Tenten Pinchand
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a standard template for representing the metadata of rock specimens (e.g., core, microanalysis, hand grab) in the CSIRO Mineral Resources Discovery program. The template includes core properties of samples such as their name, identifier, type, and location, as well as associated metadata such as project, drilling contexts, hazard declaration and physical storage. The template will be used to catalogue legacy and specimens systematically collected through mineral exploration projects. It has been developed iteratively, revised, and improved based on feedback from researchers and lab technicians. This standardized template can prevent duplicate sample metadata entry and lower metadata redundancy, thereby improving the program's physical sample curation and discovery. Lineage: The template includes a readme section summarising all the metadata fields, including their requirements and definitions. The template incorporates several established controlled terms representing, e.g., sample type, rock type, drill type, EPSG and hazard information to ensure consistency in metadata entry.

  5. Data from: Software Metadata Extraction and Curation Software (SMECS)

    • meta4ds.fokus.fraunhofer.de
    unknown, zip
    Updated Aug 18, 2025
    Cite
    Zenodo (2025). Software Metadata Extraction and Curation Software (SMECS) [Dataset]. https://meta4ds.fokus.fraunhofer.de/datasets/oai-zenodo-org-16892288?locale=en
    Available download formats: zip (241955), unknown
    Dataset updated
    Aug 18, 2025
    Dataset authored and provided by
    Zenodo (http://zenodo.org/)
    Description

    If you use this software, please cite it using the metadata from this file.

  6. Data Rescue & Curation Best Practices Guide

    • borealisdata.ca
    Updated Apr 19, 2023
    Cite
    OCUL Data Community (ODC) Data Rescue Group (2023). Data Rescue & Curation Best Practices Guide [Dataset]. http://doi.org/10.5683/SP2/Y8MQXV
    Dataset updated
    Apr 19, 2023
    Dataset provided by
    Borealis
    Authors
    OCUL Data Community (ODC) Data Rescue Group
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The aim of the Data Rescue & Curation Best Practices Guide is to provide an accessible and hands-on approach to handling data rescue and digital curation of at-risk data for use in secondary research. We provide a set of examples and workflows for addressing common challenges with social science survey data that can be applied to other social and behavioural research data. The goal of this guide and set of workflows presented is to improve librarians’ and data curators’ skills in providing access to high-quality, well-documented, and reusable research data. The aspects of data curation that are addressed throughout this guide are adopted from long-standing data library and archiving practices, including: documenting data using standard metadata, file and data organization; using open and software-agnostic formats; and curating research data for reuse.

  7. Metadata record for: Data-driven curation process for describing the blood...

    • springernature.figshare.com
    txt
    Updated Jun 1, 2023
    Cite
    Scientific Data Curation Team (2023). Metadata record for: Data-driven curation process for describing the blood glucose management in the intensive care unit [Dataset]. http://doi.org/10.6084/m9.figshare.13564187.v1
    Available download formats: txt
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Scientific Data Curation Team
    License

    CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains key characteristics about the data described in the Data Descriptor "Data-driven curation process for describing the blood glucose management in the intensive care unit". Contents:

    1. human-readable metadata summary table in CSV format
    2. machine-readable metadata file in JSON format
    
  8. COVID-19 Configurable Data Curation System (COVID-19 CDCS)

    • gimi9.com
    Cite
    COVID-19 Configurable Data Curation System (COVID-19 CDCS) [Dataset]. https://gimi9.com/dataset/data-gov_covid-19-configurable-data-curation-system-covid-19-cdcs/
    Description

    The COVID-19 CDCS represents a metadata repository that provides a catalog of COVID-19 related research literature and data.

  9. A Standard Metadata Template for Representing Mineral Spectral Reference...

    • researchdata.edu.au
    • data.csiro.au
    datadownload
    Updated Sep 15, 2025
    Cite
    Carsten Laukamp; Tina Shelton; Anusuriya Devaraju; Anusree Ramachandran Menon; Laukamp, C. (2025). A Standard Metadata Template for Representing Mineral Spectral Reference Samples [Dataset]. http://doi.org/10.25919/WQA1-3R64
    Available download formats: datadownload
    Dataset updated
    Sep 15, 2025
    Dataset provided by
    CSIRO (https://www.csiro.au/)
    Authors
    Carsten Laukamp; Tina Shelton; Anusuriya Devaraju; Anusree Ramachandran Menon; Laukamp, C.
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a standard template for representing the metadata of mineral spectral reference specimens in the CSIRO Mineral Resources Discovery program. The template includes core properties of samples such as their name, identifier, type, and location, as well as associated metadata such as project, hazard declaration and physical storage. The template will be used to catalogue reference samples used for mineral spectral analysis (NVCL). It has been developed iteratively, revised, and improved based on feedback from researchers and lab technicians. This standardized template can prevent duplicate sample metadata entry and lower metadata redundancy, thereby improving the program's physical sample curation and discovery. Lineage: This template was built on the CMR rock metadata template (https://doi.org/10.25919/2prf-dk88). The template includes a readme section summarising all the metadata fields, including their requirements and definitions. The template incorporates several established controlled terms representing, e.g., sample type, mineral type, EPSG and hazard information to ensure consistency in metadata entry. The template also contains a few metadata fields that are specific to mineral spectra samples, such as the different analyses conducted for the samples (XRD, whole-rock geochemical analysis, etc.).

  10. Would DMPs be helpful to our researchers? Yes and YES

    • search.dataone.org
    Updated Dec 28, 2023
    Cite
    Eugene Barsky (2023). Would DMPs be helpful to our researchers? Yes and YES [Dataset]. http://doi.org/10.5683/SP3/QMCO3T
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Eugene Barsky
    Description

    Why DMPs are helpful to our researchers. Visit https://dataone.org/datasets/sha256%3A20f68cc9df1e285bd047214421264aaa881de9d21bcde4a6fceb1d48867076da for complete metadata about this dataset.

  11. 3D Metadata Schema

    • data.lib.vt.edu
    xlsx
    Updated May 30, 2023
    Cite
    Wen Nie Ng; Alex Kinnaman; Nathan Hall (2023). 3D Metadata Schema [Dataset]. http://doi.org/10.7294/21230690.v2
    Available download formats: xlsx
    Dataset updated
    May 30, 2023
    Dataset provided by
    University Libraries, Virginia Tech
    Authors
    Wen Nie Ng; Alex Kinnaman; Nathan Hall
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplemental materials for Web3D paper - Levels of Representation and Data Infrastructures in Entomo-3D: An applied research approach to addressing metadata curation issues to support preservation and access of 3D.

    One 3D metadata schema and three dataset pipeline figures

  12. XRef Dataset: ICBO_2020 paper

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Jul 10, 2020
    Cite
    Amir LAADHAR; Clement JONQUET (2020). XRef Dataset: ICBO_2020 paper [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3842749
    Dataset updated
    Jul 10, 2020
    Dataset provided by
    LIRMM
    Authors
    Amir LAADHAR; Clement JONQUET
    License

    Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A curated dataset of XRefs extracted from agri-food ontologies and curated using OMHT (Ontology Mapping Harvester Tool), a script written in Java that is designed to automatically extract and semi-automatically curate declared mappings from ontologies and reify them into specific objects with metadata and provenance information.

  13. Data for: Sustainable connectivity in a community repository

    • search.dataone.org
    • data.niaid.nih.gov
    • +1more
    Updated Dec 8, 2023
    Cite
    Ted Habermann (2023). Data for: Sustainable connectivity in a community repository [Dataset]. http://doi.org/10.5061/dryad.nzs7h44xr
    Dataset updated
    Dec 8, 2023
    Dataset provided by
    Dryad Digital Repository
    Authors
    Ted Habermann
    Time period covered
    Jan 1, 2023
    Description

    Identifiers of many kinds are the key to creating unambiguous and persistent connections between research objects and other items in the global research infrastructure (GRI). Many repositories are implementing mechanisms to collect and integrate these identifiers into their submission and record curation processes. This bodes well for a well-connected future, but many existing resources submitted in the past are missing these identifiers, thus missing the connections required for inclusion in the connected infrastructure. Re-curation of these metadata is required to make these connections. The Dryad Data Repository has existed since 2008 and has successfully re-curated the repository metadata several times, adding identifiers for research organizations, funders, and researchers. Understanding and quantifying these successes depends on measuring repository and identifier connectivity. Metrics are described and applied to the entire repository here. Identifiers for papers (DOIs) connected...

    These data are Dryad metadata retrieved from https://datadryad.org and translated into csv files. There are two datasets:

    1. DryadJournalDataset was retrieved from Dryad using the ISSNs in the file DryadJournalDataset_ISSNs.txt, although some had no data.
    2. DryadOrganizationDataset was retrieved from Dryad using the RORs in the file DryadOrganizationDataset_RORs.txt, although some had no data.

    Each dataset includes four types of metadata: identifiers, funders, keywords, and related works, each in separate comma-delimited (.csv) or tab-delimited (.tsv) files. There are also Microsoft Excel files (.xlsx) for the identifier metadata and connectivity summaries for each dataset (*.html). The connectivity summaries include summaries of each parameter in all four data files with definitions, counts, unique counts, most frequent values, and completeness.

    These data formed the basis for an analysis of the connectivity of the Dryad repository for organizations, funders, and people.

    # Data For: Sustainable Connectivity in a Community Repository

    GENERAL INFORMATION

    This readme.txt file was generated on 20231110 by Ted Habermann

    Title of Dataset

    Data For: Sustainable Connectivity in a Community Repository

    Author Information

    Principal Investigator Contact Information
    Name: Ted Habermann (0000-0003-3585-6733)
    Institution: Metadata Game Changers
    Email:
    ORCID: 0000-0003-3585-6733

    Date published or finalized for release:

    November 10, 2023

    Date of data collection (single date, range, approximate date)

    May and June 2023

    Information about funding sources that supported the collection of the data:

    National Science Foundation (Crossref Funder ID: 100000001) Award 2134956.

    Overview of the data (abstract):

    These data are Dryad metadata retrieved from https://datadryad.org and translated into csv files. There are two datasets:

    1. DryadJournalDataset was retrieved from Dryad using the ISSNs in the file DryadJournalDataset_ISSNs.txt, although some had n...
  14. YouTube Video Curation (Metadata and URLs)

    • kaggle.com
    zip
    Updated Aug 11, 2023
    Cite
    Kaleemullah Qasim (2023). YouTube Video Curation (Metadata and URLs) [Dataset]. https://www.kaggle.com/datasets/kaleemqasim/youtube-video-curation-metadata-and-urls
    Available download formats: zip (202311083 bytes)
    Dataset updated
    Aug 11, 2023
    Authors
    Kaleemullah Qasim
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    Title: YouTube Video Curation (Metadata and URLs)😇
    Subtitle: Analyzing YouTube Content: From Video Descriptions to Viewer Engagement Metrics

    Introduction

    The YouTube Video Metadata Explorer dataset is a comprehensive collection of metadata related to YouTube videos, encompassing a wide range of information including video IDs, content details, statistical data, descriptions, and associated URLs. This rich dataset provides a unique opportunity to explore, analyze, and understand the digital media landscape on one of the world's largest video-sharing platforms.

    Content

    The dataset consists of 307,623 entries and six main attributes, detailed as follows:

    • ID: Unique identifier for each video.
    • Snippet: Contains detailed information, including:
        - Category ID: YouTube video category identifier
        - Channel ID: Unique identifier for the channel hosting the video
        - Channel Title: Name of the channel hosting the video
        - Default Audio Language: The default audio language of the video
        - Default Language: The default language of the video
        - Live Broadcast Content: Indicator for Live Broadcast Content
        - Localized: Information related to localization
        - Title: Title of the video
        - Published At: Publication date and time
        - Tags: Associated tags for the video
        - Thumbnails: Different resolution thumbnails, including Default (90x120 pixels), High (360x480 pixels), Maxres (720x1280 pixels), Medium (180x320 pixels), and Standard (480x640 pixels)
    • Content Details: Includes information about the video's technical specifications and features:
        - Caption: Indicates whether captions are available (true or false).
        - Content Rating: YouTube content rating (e.g., 'ytRating': None).
        - Definition: Video definition quality (e.g., 'hd' for high definition).
        - Dimension: Video dimension (e.g., '2d' for 2-dimensional).
        - Duration: Duration of the video (e.g., 'PT16M34S' for 16 minutes and 34 seconds).
        - Licensed Content: Indicates whether the content is licensed (true or false).
        - Projection: Type of video projection (e.g., 'rectangular').
        - Region Restriction: Any region restrictions applied to the video
    • Statistics: Features video engagement metrics:
        - Comment Count: Number of comments on the video
        - Favorite Count: Number of times the video has been marked as a favorite (e.g., '0').
        - Like Count: Number of likes on the video (e.g., '29942').
        - View Count: Number of views for the video (e.g., '704710').
    • Description: A brief description or summary of the video content
    • URLs: Links associated with the video's description
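
    The Duration values quoted above (e.g., 'PT16M34S') follow the ISO 8601 duration notation, so they need converting before any numeric analysis. A minimal sketch of such a conversion is shown here; the function name `duration_seconds` is hypothetical, and the pattern is assumed to cover only the day, hour, minute, and second components.

    ```python
    import re

    # Assumption: durations use only D, H, M and S components (no weeks,
    # months, years, or fractional seconds).
    _DURATION = re.compile(
        r"^P(?:(\d+)D)?(?:T(?:(\d+)H)?(?:(\d+)M)?(?:(\d+)S)?)?$"
    )


    def duration_seconds(value):
        """Convert an ISO 8601 duration such as 'PT16M34S' to whole seconds."""
        match = _DURATION.match(value)
        if match is None:
            raise ValueError(f"unsupported duration: {value!r}")
        days, hours, minutes, seconds = (int(g) if g else 0 for g in match.groups())
        return days * 86400 + hours * 3600 + minutes * 60 + seconds


    example = duration_seconds("PT16M34S")  # 16 * 60 + 34 = 994 seconds
    ```

    Values that fall outside the assumed pattern raise a `ValueError` rather than being silently converted to zero.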

  15. Replication Data for: Research data management need assessment of LIS...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Nov 20, 2025
    Cite
    Tamanna Hossain (2025). Replication Data for: Research data management need assessment of LIS graduate students [Dataset]. http://doi.org/10.7910/DVN/HWGQWO
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Tamanna Hossain
    License

    CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset comprises survey data collected from graduate students at Simmons University's School of Library and Information Science (LIS), focusing on their Research Data Management (RDM) awareness, experience, preparedness and need for professional development. The files in this dataset include the raw survey responses (in CSV format).

  16. Metadata record for: Creation and validation of a chest X-ray dataset with...

    • springernature.figshare.com
    txt
    Updated May 31, 2023
    Cite
    Scientific Data Curation Team (2023). Metadata record for: Creation and validation of a chest X-ray dataset with eye-tracking and report dictation for AI development [Dataset]. http://doi.org/10.6084/m9.figshare.14035613.v1
    Available download formats: txt
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Scientific Data Curation Team
    License

    CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains key characteristics about the data described in the Data Descriptor "Creation and validation of a chest X-ray dataset with eye-tracking and report dictation for AI development". Contents:

    1. human-readable metadata summary table in CSV format
    2. machine-readable metadata file in JSON format
    
  17. Metadata record for: A dataset describing data discovery and reuse practices...

    • springernature.figshare.com
    txt
    Updated Jun 1, 2023
    Cite
    Scientific Data Curation Team (2023). Metadata record for: A dataset describing data discovery and reuse practices in research [Dataset]. http://doi.org/10.6084/m9.figshare.12445034.v1
    Available download formats: txt
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Scientific Data Curation Team
    License

    CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains key characteristics about the data described in the Data Descriptor "A dataset describing data discovery and reuse practices in research". Contents:

    1. human-readable metadata summary table in CSV format
    2. machine-readable metadata file in JSON format
    
  18. Metadata record for: A DICOM dataset for evaluation of medical image...

    • springernature.figshare.com
    txt
    Updated May 31, 2023
    Cite
    Scientific Data Curation Team (2023). Metadata record for: A DICOM dataset for evaluation of medical image de-identification [Dataset]. http://doi.org/10.6084/m9.figshare.14802774.v1
    Available download formats: txt
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Scientific Data Curation Team
    License

    CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset contains key characteristics about the data described in the Data Descriptor "A DICOM dataset for evaluation of medical image de-identification". Contents:

    1. human-readable metadata summary table in CSV format
    2. machine-readable metadata file in JSON format
    
  19. Data from: Székelyhon

    • explore.openaire.eu
    • data.niaid.nih.gov
    • +1more
    Updated Jan 12, 2022
    Cite
    Gábor Palkó; Balázs Indig; Zsófia Fellegi; Zsófia Sárközi-Lindner (2022). Székelyhon [Dataset]. http://doi.org/10.5281/zenodo.5849138
    Dataset updated
    Jan 12, 2022
    Authors
    Gábor Palkó; Balázs Indig; Zsófia Fellegi; Zsófia Sárközi-Lindner
    Description

    This object has been created as a part of the web harvesting project of the Eötvös Loránd University Department of Digital Humanities (ELTE DH). Learn more about the workflow HERE and about the software used HERE. The aim of the project is to make online news articles and their metadata suitable for research purposes. The archiving workflow is designed to prevent modification or manipulation of the downloaded content. The current version of the curated content with normalized formatting in standard TEI XML format with Schema.org encoded metadata is available HERE. The detailed description of the raw content is the following: The portal's archived content (from 2017-01-29 to 2021-05-21) in WARC format is available HERE (crawled: 2021-05-21T09:51:12.531750 - 2021-05-21T18:38:24.961226). Please fill in the following form before requesting access to this dataset: ACCESS FORM

    {"references": ["https://doi.org/10.5281/zenodo.3755323"]}

  20. Metadata record for: A global database of Holocene paleotemperature records

    • explore.openaire.eu
    • springernature.figshare.com
    Updated Jan 1, 2020
    Cite
    Scientific Data Curation Team (2020). Metadata record for: A global database of Holocene paleotemperature records [Dataset]. http://doi.org/10.6084/m9.figshare.11967924.v1
    Dataset updated
    Jan 1, 2020
    Authors
    Scientific Data Curation Team
    Description

    This dataset contains key characteristics about the data described in the Data Descriptor "A global database of Holocene paleotemperature records". Contents:

    1. human-readable metadata summary table in CSV format
    2. machine-readable metadata file in JSON format
