72 datasets found
  1. Listing of data repositories that embed schema.org metadata in dataset...

    • zenodo.org
    • explore.openaire.eu
    • +1 more
    csv
    Updated Jan 24, 2020
    Cite
    Martin Fenner; Merce Crosas; Gustavo Durand; Sarala Wimalaratne; Florian Gräf; Richard Hallett; Manuel Bernal Llinares; Uwe Schindler; Tim Clark (2020). Listing of data repositories that embed schema.org metadata in dataset landing pages [Dataset]. http://doi.org/10.5281/zenodo.1262598
    Available download formats: csv
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Martin Fenner; Merce Crosas; Gustavo Durand; Sarala Wimalaratne; Florian Gräf; Richard Hallett; Manuel Bernal Llinares; Uwe Schindler; Tim Clark
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Machine-readable metadata available from landing pages for datasets facilitate data citation by enabling easy integration with reference managers and other tools used in a data citation workflow. Embedding these metadata using the schema.org standard with JSON-LD is emerging as the community standard. This dataset is a listing of data repositories that have implemented this approach or are in the process of doing so.

    This is the first version of this dataset and was generated via community consultation. We expect to update this dataset, as an increasing number of data repositories adopt this approach, and we hope to see this information added to registries of data repositories such as re3data and FAIRsharing.

    In addition to the listing of data repositories we provide information on the schema.org properties supported by these data repositories, focussing on the required and recommended properties from the "Data Citation Roadmap for Scholarly Data Repositories".
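
    The embedding approach described above can be illustrated with a minimal sketch that reads schema.org JSON-LD out of a landing page. It assumes the metadata sit in a `<script type="application/ld+json">` element; the sample page, the example DOI and the names `JsonLdExtractor` and `extract_datasets` are hypothetical and are not taken from the dataset itself.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of <script type="application/ld+json"> elements."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.documents.append(json.loads("".join(self._buffer)))


def extract_datasets(html):
    """Return the JSON-LD objects on the page whose @type is "Dataset"."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return [
        doc for doc in parser.documents
        if isinstance(doc, dict) and doc.get("@type") == "Dataset"
    ]


# Hypothetical landing page; the DOI below is a placeholder, not a real record.
SAMPLE_PAGE = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Dataset",
 "name": "Example dataset", "identifier": "https://doi.org/10.1234/example"}
</script>
</head><body></body></html>
"""

datasets = extract_datasets(SAMPLE_PAGE)
print(datasets[0]["name"])
```

    A reference manager or harvester would typically go on to read properties such as `name`, `identifier` and `creator` from each returned object; which properties a given repository actually supplies is what the listing above records.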

  2. Leveraging the Schema.org Vocabulary to Create an Actionable Metadata...

    • dataone.org
    • hydroshare.org
    • +1 more
    Updated Dec 30, 2023
    Cite
    Irene Garousi-Nejad; Anthony M. Castronova; Jeffery S. Horsburgh; Scott Black; Pabitra Dash; Mauriel Ramirez (2023). Leveraging the Schema.org Vocabulary to Create an Actionable Metadata Representation for Geospatial Data and Computing Resources [Dataset]. https://dataone.org/datasets/sha256%3A7f06cf7a9c31b5c545a85694bdb862cbbeb2255f86b0b6f0c88e852e04ad8038
    Dataset updated
    Dec 30, 2023
    Dataset provided by
    Hydroshare
    Authors
    Irene Garousi-Nejad; Anthony M. Castronova; Jeffery S. Horsburgh; Scott Black; Pabitra Dash; Mauriel Ramirez
    Description

    This resource contains slides for the AGU Fall Meeting 2023 presentation (#IN23A-07) in San Francisco on Dec 12. Session: IN23A: Advancing Open Science: Emerging Techniques in Knowledge Management and Discovery II Oral

    Effective response to global crises relies on universal access to scientific data and models, understanding their attributes, and representing their interconnectivity to facilitate collaborative research and decision making. In the age of distributed data, geospatial researchers frequently invest significant time searching for, accessing, and working to understand scientific data. This often leads to the recreation of existing datasets as well as challenges in determining methods for accessing, using, and ultimately establishing connections between resources. In recent years, following FAIR and CARE principles, there is an emerging practice to leverage structured and robust metadata to accelerate the discovery of web-based scientific resources and products. This practice assists users in not only discovery, but also in understanding the context, quality, and provenance of data, as well as the rights and responsibilities of data owners and consumers. It also empowers organizations to leverage their data more effectively and derive meaningful insights from them. Doing so, however, can be difficult, especially when diverse resources needed for scientific applications may be spread across multiple repositories or locations. We present a solution for leveraging the Schema.org vocabulary along with various web encodings such as the Resource Description Framework (RDF) with JSON-LD to create an actionable, curated catalog of scientific resources ranging from spatio-temporal data to software source code. We explore how resources of various types and common scientific formats, such as multidimensional, software containers, source code, and spatial features, which are stored across various repositories and distributed cloud storage, can be described and cataloged. 
    Recognizing the impracticality of manually cataloging metadata, we have developed generic capabilities to automatically extract metadata for such resources, while empowering scientists to provide additional context. By incorporating comprehensive metadata, the exploration of diverse data relationships can be realized to gain insight into gaps and opportunities to improve the connectivity between science communities.

  3. RKD schema.org Knowledge Graph

    • rkd.triply.cc
    application/n-quads +5
    Updated Jun 1, 2025
    Cite
    RKD (2025). RKD schema.org Knowledge Graph [Dataset]. https://rkd.triply.cc/rkd/RKD-SDO-Knowledge-Graph
    Available download formats: jsonld, ttl, application/trig, application/n-triples, application/n-quads, application/sparql-results+json
    Dataset updated
    Jun 1, 2025
    Dataset authored and provided by
    RKD
    License

    Open Data Commons Attribution License (ODC-By) v1.0: https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Combines all individual instances, models (shapes) and metadata from RKD schema.org datasets into one unified dataset.

  4. Crosswalks among metadata schemas for data cube descriptions in RELIANCE

    • data.niaid.nih.gov
    Updated May 10, 2021
    Cite
    González-Guardia, Esteban (2021). Crosswalks among metadata schemas for data cube descriptions in RELIANCE [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4744767
    Dataset updated
    May 10, 2021
    Dataset provided by
    Corcho, Oscar
    Garijo, Daniel
    González-Guardia, Esteban
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This Excel file contains crosswalks among different metadata schemas that can be used for the description of data cubes in the areas of Marine Science, Earth Sciences and Climate Research. These data cubes commonly contain observations of some variables in some feature of interest, taken by Earth Observation systems (e.g., satellites) or as in-situ observations.

  5. EOU Product, Service and Event Reviews

    • earth.org.uk
    html
    Updated Jun 2025
    Cite
    Damon Hart-Davis (2025). EOU Product, Service and Event Reviews [Dataset]. https://www.earth.org.uk/SECTION_review.html
    Available download formats: html
    Dataset updated
    Jun 2025
    Authors
    Damon Hart-Davis
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    Mar 9, 2008 - Present
    Description

    Earth.Org.UK (EOU) Product, Service and Event Review metadata for schema.org as key/value pairs in plain-text files.

  6. Example Aggregation Metadata in Schema Org Format

    • hydroshare.org
    • beta.hydroshare.org
    zip
    Updated Mar 27, 2024
    + more versions
    Cite
    Pabitra Dash; Jamy (2024). Example Aggregation Metadata in Schema Org Format [Dataset]. https://www.hydroshare.org/resource/e2319a79641047bebaee534cfc250e2a
    Available download formats: zip (7.3 MB)
    Dataset updated
    Mar 27, 2024
    Dataset provided by
    HydroShare
    Authors
    Pabitra Dash; Jamy
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Oct 1, 2009 - May 30, 2010
    Description

    This netCDF data is the simulation output from the Utah Energy Balance (UEB) model. It includes the simulation result of snow water equivalent during the period Oct. 2009 to June 2010 for the TWDEF site in Utah.

  7. Mapping CDC to OpenAIRE, B2find, schema.org and Dublin Core

    • explore.openaire.eu
    Updated Oct 29, 2021
    Cite
    Esra Akdeniz; Morten Jakobsen; Silje Storviken (2021). Mapping CDC to OpenAIRE, B2find, schema.org and Dublin Core [Dataset]. http://doi.org/10.5281/zenodo.5614657
    Dataset updated
    Oct 29, 2021
    Authors
    Esra Akdeniz; Morten Jakobsen; Silje Storviken
    Description

    The document contains a mapping of metadata elements of the CESSDA Data Catalogue to the CESSDA Metadata Model (CMM), to OpenAire, to B2Find, to schema.org and to Dublin Core. It also provides definitions, information on requirements and notes for every metadata element.

  8. The NIST Extensible Resource Data Model (NERDm): JSON schemas for rich...

    • catalog.data.gov
    • data.nist.gov
    Updated Mar 14, 2025
    Cite
    National Institute of Standards and Technology (2025). The NIST Extensible Resource Data Model (NERDm): JSON schemas for rich description of data resources [Dataset]. https://catalog.data.gov/dataset/the-nist-extensible-resource-data-model-nerdm-json-schemas-for-rich-description-of-data-re
    Dataset updated
    Mar 14, 2025
    Dataset provided by
    National Institute of Standards and Technology (http://www.nist.gov/)
    Description

    The NIST Extensible Resource Data Model (NERDm) is a set of schemas for encoding in JSON format metadata that describe digital resources. The variety of digital resources it can describe includes not only digital data sets and collections, but also software, digital services, web sites and portals, and digital twins. It was created to serve as the internal metadata format used by the NIST Public Data Repository and Science Portal to drive rich presentations on the web and to enable discovery; however, it was also designed to enable programmatic access to resources and their metadata by external users. Interoperability was also a key design aim: the schemas are defined using the JSON Schema standard, metadata are encoded as JSON-LD, and their semantics are tied to community ontologies, with an emphasis on DCAT and the US federal Project Open Data (POD) models. Finally, extensibility is also central to its design: the schemas are composed of a central core schema and various extension schemas. New extensions to support richer metadata concepts can be added over time without breaking existing applications.

    Validation is central to NERDm's extensibility model. Consuming applications should be able to choose which metadata extensions they care to support and ignore terms and extensions they don't support. Furthermore, they should not fail when a NERDm document leverages extensions they don't recognize, even when on-the-fly validation is required. To support this flexibility, the NERDm framework allows documents to declare what extensions are being used and where. We have developed an optional extension to the standard JSON Schema validation (see ejsonschema below) to support flexible validation: while a standard JSON Schema validator can validate a NERDm document against the NERDm core schema, our extension will validate a NERDm document against any recognized extensions and ignore those that are not recognized.

    The NERDm data model is based around the concept of resource, semantically equivalent to a schema.org Resource, and as in schema.org, there can be different types of resources, such as data sets and software. A NERDm document indicates what types the resource qualifies as via the JSON-LD "@type" property. All NERDm Resources are described by metadata terms from the core NERDm schema; however, different resource types can be described by additional metadata properties (often drawing on particular NERDm extension schemas). A Resource contains Components of various types (including DCAT-defined Distributions) that are considered part of the Resource; specifically, these can include downloadable data files, hierarchical data collections, links to web sites (like software repositories), software tools, or other NERDm Resources. Through the NERDm extension system, domain-specific metadata can be included at either the resource or component level. The direct semantic and syntactic connections to the DCAT, POD, and schema.org schemas are intended to ensure unambiguous conversion of NERDm documents into those schemas.

    As of this writing, the Core NERDm schema and its framework stand at version 0.7 and are compatible with the "draft-04" version of JSON Schema. Version 1.0 is projected to be released in 2025. In that release, the NERDm schemas will be updated to the "draft2020" version of JSON Schema. Other improvements will include stronger support for RDF and the Linked Data Platform through its support of JSON-LD.
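
    The idea that a consumer acts on the resource types it recognizes and ignores the rest can be sketched in a few lines. This is only an illustration of reading the JSON-LD "@type" property, not the NERDm validation extension itself; the type labels, the `SUPPORTED_TYPES` set and the sample document are hypothetical and are not taken from the NERDm schemas.

```python
import json

# Hypothetical set of type labels this consumer understands.
SUPPORTED_TYPES = {"Dataset", "SoftwareSourceCode"}


def recognized_types(document):
    """Return the declared @type values that this consumer supports.

    JSON-LD allows @type to be a single string or a list of strings,
    so both forms are normalized to a list first.
    """
    declared = document.get("@type", [])
    if isinstance(declared, str):
        declared = [declared]
    return [label for label in declared if label in SUPPORTED_TYPES]


# Hypothetical document declaring one supported and one unknown type.
doc = json.loads(
    '{"@type": ["Dataset", "ex:UnknownExtensionType"], "title": "Example resource"}'
)
print(recognized_types(doc))
```

    Unknown labels are skipped rather than treated as errors, which mirrors the stated goal that applications should not fail on extensions they do not recognize.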

  9. Crosswalk of most used metadata schemes and guidelines for metadata...

    • zenodo.org
    Updated Jan 7, 2021
    + more versions
    Cite
    Ojsteršek (2021). Crosswalk of most used metadata schemes and guidelines for metadata interoperability [Dataset]. http://doi.org/10.5281/zenodo.4420116
    Dataset updated
    Jan 7, 2021
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Ojsteršek
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This resource provides crosswalks among the most commonly used metadata schemes and guidelines to describe digital objects in Open Science, including:

    • RDA metadata IG recommendation of the metadata element set,
    • EOSC Pilot - EDMI metadata set,
    • Dublin Core Metadata Terms,
    • DataCite 4.3 metadata schema,
    • DCAT 2.0 metadata schema and DCAT 2.0 application profile,
    • EUDAT B2Find metadata recommendation,
    • OpenAIRE Guidelines for Data Archives,
    • OpenAIRE Guidelines for literature repositories 4.0,
    • OpenAIRE Guidelines for Other Research Products,
    • OpenAIRE Guidelines for Software Repository Managers,
    • OpenAIRE Guidelines for CRIS Managers,
    • Crossref 4.4.2 metadata XML schema,
    • Harvard Dataverse metadata schema,
    • DDI Codebook 2.5 metadata XML schema,
    • Europeana EDM metadata schema,
    • Schema.org,
    • Bioschemas,
    • The PROV Ontology.
  10. Metadata for Datasets and Relationships

    • figshare.com
    bin
    Updated Aug 23, 2024
    Cite
    Kate Lin; Tarfah Alrashed; Natasha Noy (2024). Metadata for Datasets and Relationships [Dataset]. http://doi.org/10.6084/m9.figshare.22790810.v6
    Available download formats: bin
    Dataset updated
    Aug 23, 2024
    Dataset provided by
    figshare
    Authors
    Kate Lin; Tarfah Alrashed; Natasha Noy
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains two tables. One table contains metadata for "citable" datasets (datasets that have either a DOI or a compact identifier). The other table contains the relationships between each pair of datasets in the first table. We generated this corpus of dataset metadata by crawling the Web to find pages with schema.org or DCAT metadata indicating that the page contains a dataset. The metadata for datasets includes information such as the dataset’s name, description, provider, creation date, Digital Object Identifiers (DOI), and more. Out of the 46 million dataset pages that have schema.org, we publish this subset of 4.3 million dataset-metadata entries that are citable. We also include an additional table on relationships between these datasets. Please contact dataset-search@googlegroups.com if you have any questions or requests to remove a dataset that you own from this collection.

  11. Semantify.it details page for tirol.at

    • pages.semantify.it
    jsonld, zip
    Updated Sep 13, 2014
    Cite
    (2014). Semantify.it details page for tirol.at [Dataset]. https://pages.semantify.it/tirol
    Available download formats: jsonld, zip
    Dataset updated
    Sep 13, 2014
    Time period covered
    Jun 19, 2017 - Nov 19, 2019
    Area covered
    Tyrol
    Description

    Metadata and Annotations (structured data) of tirol.at

  12. Samples for different file types used for designing the IGUIDE data catalog...

    • search.dataone.org
    Updated Dec 30, 2023
    Cite
    Irene Garousi-Nejad (2023). Samples for different file types used for designing the IGUIDE data catalog schemas [Dataset]. https://search.dataone.org/view/sha256%3A2ecabae3776ff8f80cae0ff8f833935d7333f257d5726ae1efac82b89637850b
    Dataset updated
    Dec 30, 2023
    Dataset provided by
    Hydroshare
    Authors
    Irene Garousi-Nejad
    Time period covered
    Oct 1, 1989 - Aug 15, 2023
    Description

    This resource contains several different file types to help identify appropriate properties for each file type when designing the metadata schema based on Schema.org.

  13. Samples of various file types used for designing file-type metadata schemas

    • beta.hydroshare.org
    • hydroshare.org
    zip
    Updated Mar 11, 2025
    Cite
    Irene Garousi-Nejad (2025). Samples of various file types used for designing file-type metadata schemas [Dataset]. https://beta.hydroshare.org/resource/fed970c19b9c41928f2591adf5b64dd1/
    Available download formats: zip (256.1 MB)
    Dataset updated
    Mar 11, 2025
    Dataset provided by
    HydroShare
    Authors
    Irene Garousi-Nejad
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Oct 1, 1989 - Aug 15, 2023
    Description

    This resource contains several different file types to help identify appropriate properties for each file type when designing the metadata schema based on Schema.org.

  14. The Red Queen in the Repository: metadata quality in an ever-changing...

    • zenodo.org
    • researchdata.se
    bin, csv, zip
    Updated Jul 25, 2024
    Cite
    Joakim Philipson (2024). The Red Queen in the Repository: metadata quality in an ever-changing environment (preprint of paper, presentation slides and dataset collection with validation schemas to IDCC2019 conference paper) [Dataset]. http://doi.org/10.5281/zenodo.2276777
    Available download formats: zip, bin, csv
    Dataset updated
    Jul 25, 2024
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Joakim Philipson
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This fileset contains a preprint version of the conference paper (.pdf), presentation slides (as .pptx) and the dataset(s) and validation schema(s) for the IDCC 2019 (Melbourne) conference paper: The Red Queen in the Repository: metadata quality in an ever-changing environment. Datasets and schemas are in .xml, .xsd, Excel (.xlsx) and .csv (two files representing two different sheets in the .xlsx file). The validationSchemas.zip holds the additional validation schemas (.xsd) that were not found in the schemaLocations of the metadata xml-files to be validated. The schemas must all be placed in the same folder, and are to be used for validating the Dataverse dcterms records (with metadataDCT.xsd) and the Zenodo oai_datacite feeds respectively (schema.datacite.org_oai_oai-1.0_oai.xsd). In the latter case, a simpler way of doing it might be to replace the incorrect URL "http://schema.datacite.org/oai/oai-1.0/ oai_datacite.xsd" in the schemaLocation of these xml-files by the CORRECT: schemaLocation="http://schema.datacite.org/oai/oai-1.0/ http://schema.datacite.org/oai/oai-1.0/oai.xsd" as has been done already in the sample files here. The sample file folders testDVNcoll.zip (Dataverse), testFigColl.zip (Figshare) and testZenColl.zip (Zenodo) contain all the metadata files tested and validated that are registered in the spreadsheet with objectIDs.
    In the case of Zenodo, one original file feed, zen2018oai_datacite3orig-https%20_zenodo.org_oai2d%20verb=ListRecords%26metadataPrefix=oai_datacite%26from=2018-11-29%26until=2018-11-30.xml, is also supplied to show what was necessary to change in order to perform validation as indicated in the paper.

    For Dataverse, a corrected version of a file, dvn2014ddi-27595Corr_https%20_dataverse.harvard.edu_api_datasets_export%20exporter=ddi%26persistentId=doi%253A10.7910_DVN_27595Corr.xml, is also supplied in order to show the changes it would take to make the file validate without error.
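
    The schemaLocation replacement mentioned in this description can be sketched as a plain string substitution. The two URL pairs are copied from the text above; the function name `fix_schema_location` and the one-element sample document are hypothetical, and the sketch assumes the incorrect pair appears verbatim in the file.

```python
# Incorrect and corrected schemaLocation pairs, as quoted in the description.
INCORRECT = "http://schema.datacite.org/oai/oai-1.0/ oai_datacite.xsd"
CORRECT = (
    "http://schema.datacite.org/oai/oai-1.0/ "
    "http://schema.datacite.org/oai/oai-1.0/oai.xsd"
)


def fix_schema_location(xml_text):
    """Replace the incorrect schemaLocation pair with the corrected one."""
    return xml_text.replace(INCORRECT, CORRECT)


# Hypothetical minimal element, used only to exercise the function.
sample = (
    '<oai_datacite xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" '
    'xsi:schemaLocation="http://schema.datacite.org/oai/oai-1.0/ oai_datacite.xsd"/>'
)

fixed = fix_schema_location(sample)
print(fixed)
```

    Because the corrected pair does not contain the incorrect one, running the function on an already corrected file leaves it unchanged.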

  15. Data associated with "Developing a standardized but extendable framework to...

    • zenodo.org
    json
    Updated Jan 12, 2023
    Cite
    Ginger Tsueng; Marco A. Alvarado Cano; José Bento; Candice Czech; Lars Pache; Tor C. Savidge; Justin Starren; Luke V. Rasmussen; Mengjia (Marjorie) Kang; Qinglong Wu; Jiwen Xin; Xinghua Zhou; Andrew I. Su; Chunlei Wu; Reed S. Shabman; Laura D. Hughes (2023). Data associated with "Developing a standardized but extendable framework to increase the findability of infectious disease datasets" [Dataset]. http://doi.org/10.5281/zenodo.7032896
    Available download formats: json
    Dataset updated
    Jan 12, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Ginger Tsueng; Marco A. Alvarado Cano; José Bento; Candice Czech; Lars Pache; Tor C. Savidge; Justin Starren; Luke V. Rasmussen; Mengjia (Marjorie) Kang; Qinglong Wu; Jiwen Xin; Xinghua Zhou; Andrew I. Su; Chunlei Wu; Reed S. Shabman; Laura D. Hughes
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data associated with "Developing a standardized but extendable framework to increase the findability of infectious disease datasets"

    Includes:

    • NIAID Dataset schema
    • NIAID ComputationalTool schema
    • Crosswalk between NIAID schemas and common schemas
    • Survey of Schema.org-compliant repositories


    The open access movement and scientific reproducibility concerns have led the biomedical research community to embrace efforts to make scientific datasets openly accessible. While many datasets are now available, there are still challenges in ensuring that they are Findable, Accessible, Interoperable, and Reusable (FAIR). To improve the FAIRness of datasets, we evaluated dataset repositories for compliance with Schema.org standards – a collection of standards developed to increase metadata searchability across the internet. Adoption of the Schema.org Dataset standard was highly variable in biomedical research datasets, and the standard omitted many desirable metadata fields. We customized the Schema.org Dataset standard to catalog datasets collected across a Systems Biology research consortium consisting of 15 Centers. We developed a reusable process for creating a schema which is interoperable with other standards, but still extendable and customizable to a particular context. Here, we describe our process along with the associated gains in FAIRness, and discuss ongoing challenges with dataset discoverability – the first step to ensure that the vast amount of open data published by the research community is reused to its maximum value.

  16. Data from: Evolution of an application profile: advancing metadata best...

    • zenodo.org
    • researchdiscovery.drexel.edu
    • +2 more
    bin
    Updated May 30, 2022
    Cite
    Edward M. Krause; Erin Clary; Adrian Ogletree; Jane Greenberg (2022). Data from: Evolution of an application profile: advancing metadata best practices through the Dryad data repository [Dataset]. http://doi.org/10.5061/dryad.f0n35
    Available download formats: bin
    Dataset updated
    May 30, 2022
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Edward M. Krause; Erin Clary; Adrian Ogletree; Jane Greenberg
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Dryad is a general-purpose curated repository for data underlying scholarly publications. Dryad's metadata framework is supported by a Dublin Core Application Profile (DCAP, hereafter referred to as application profile). This paper examines the evolution of Dryad's application profile, which has been revised over time, in an operational system, serving day-to-day needs of stakeholders. We model the relationships between data packages and data files over time, from its initial implementation in 2007 to its current practice, version 3.2, and present a crosswalk analysis. Results covering versions 1.0 to 3.0 show an increase in the number of metadata elements used to describe Dryad's data objects in Dryad. Results also confirm that Version 3.0, which envisioned separate metadata element sets for data package, data files, and publication metadata, was never fully realized due to constraints in Dryad system architecture. Version 3.1 subsequently reduced the number of metadata elements captured by recombining the publication and data package element sets. This paper documents a real world application profile implemented in an operational system, noting practical system and infrastructure constraints. Finally, the analysis presented informs an ongoing effort to update the application profile to support Dryad's diverse and expanding community of stakeholders.

  17. Data from: LCA Domain Metadata Schema Inventory

    • catalog.data.gov
    • datadiscoverystudio.org
    Updated Aug 5, 2019
    Cite
    Agricultural Research Service (2019). LCA Domain Metadata Schema Inventory [Dataset]. https://catalog.data.gov/it/dataset/lca-domain-metadata-schema-inventory
    Dataset updated
    Aug 5, 2019
    Dataset provided by
    Agricultural Research Service (https://www.ars.usda.gov/)
    Description

    This Excel workbook is a compilation of the major metadata schemas for life cycle assessment.

  18. ClaimsKG - A Knowledge Graph of Fact-Checked Claims (August, 2022)

    • search.gesis.org
    • datacatalogue.cessda.eu
    Updated Aug 15, 2022
    + more versions
    Cite
    Gangopadhyay, Susmita; Boland, Katarina; Schüller, Sascha; Todorov, Konstantin; Tchechmedjiev, Andon; Zapilko, Benjamin; Fafalios, Pavlos; Jabeen, Hajira; Dietze, Stefan (2022). ClaimsKG - A Knowledge Graph of Fact-Checked Claims (August, 2022) [Dataset]. http://doi.org/10.7802/2620
    Dataset updated
    Aug 15, 2022
    Dataset provided by
    GESIS, Köln
    GESIS search
    Authors
    Gangopadhyay, Susmita; Boland, Katarina; Schüller, Sascha; Todorov, Konstantin; Tchechmedjiev, Andon; Zapilko, Benjamin; Fafalios, Pavlos; Jabeen, Hajira; Dietze, Stefan
    License

    https://www.gesis.org/en/institute/data-usage-terms

    Description

    ClaimsKG is a knowledge graph of metadata information for 59580 fact-checked claims scraped from 13 fact-checking sites. In addition to providing a single dataset of claims and associated metadata, truth ratings are harmonised and additional information is provided for each claim, e.g., about mentioned entities. Please see (https://data.gesis.org/claimskg/) for further details about the data model and statistics.

    The dataset facilitates structured queries about claims, their truth values, involved entities, authors, dates, and other kinds of metadata. ClaimsKG is generated through a (semi-)automated pipeline, which harvests claim-related data from popular fact-checking web sites, annotates them with related entities from DBpedia/Wikipedia, and lifts all data to RDF using established vocabularies (such as schema.org). 

    The latest release of ClaimsKG covers 59580 claims. The data was scraped through August 2022 and contains claims published between 1996 and 2022 on 13 fact-checking websites. The claim-review (fact-checking) period likewise ranges from 1996 to 2022. The entity-fishing Python client (https://github.com/hirmeos/entity-fishing-client-python) was used for entity linking and disambiguation in this release. The dataset contains a total of 1371271 entities detected and referenced with DBpedia. More information, such as detailed statistics, query examples and a user-friendly interface to explore the knowledge graph, is available at: https://data.gesis.org/claimskg/ .

    The first two releases of ClaimsKG, ClaimsKGV1.0 (published on 04.04.2019) and ClaimsKGV2.0 (published on 01.09.2019), are hosted at Zenodo (https://doi.org/10.5281/zenodo.3518960). This latest release supersedes the previous versions: it contains all of their claims together with additional claims and improved entity annotations.

  19. Zenodo Open Metadata snapshot - Training dataset for records and communities...

    • data.niaid.nih.gov
    • explore.openaire.eu
    Updated Dec 15, 2022
    Cite
    Zenodo team (2022). Zenodo Open Metadata snapshot - Training dataset for records and communities classifier building [Dataset]. https://data.niaid.nih.gov/resources?id=ZENODO_787062
    Dataset updated
    Dec 15, 2022
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Zenodo team
    License

    Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains Zenodo's published open access records and communities metadata, including entries marked by the Zenodo staff as spam and deleted.

    The datasets are gzip-compressed JSON Lines files, where each line is a JSON object representing a Zenodo record or community.

    Records dataset

    Filename: zenodo_open_metadata_{ date of export }.jsonl.gz

    Each object contains the terms: part_of, thesis, description, doi, meeting, imprint, references, recid, alternate_identifiers, resource_type, journal, related_identifiers, title, subjects, notes, creators, communities, access_right, keywords, contributors, publication_date

    which correspond to the fields with the same name available in Zenodo's record JSON Schema at https://zenodo.org/schemas/records/record-v1.0.0.json.

    In addition, some terms have been altered:

    The term files contains a list of dictionaries containing filetype, size, and filename only.

    The term license contains a short Zenodo ID of the license (e.g. "cc-by").

    Communities dataset

    Filename: zenodo_community_metadata_{ date of export }.jsonl.gz

    Each object contains the terms: id, title, description, curation_policy, page

    which correspond to the fields with the same name available in Zenodo's community creation form.

    Notes for all datasets

    For each object, the term spam contains a boolean value indicating whether the record or community was marked as spam by Zenodo staff.

    Top-level terms that were missing in the metadata may contain a null value.

    A smaller uncompressed random sample of 200 JSON lines is also included for each dataset to test and get familiar with the format without having to download the entire dataset.
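
The format described above can be read with the Python standard library alone. The following is a minimal sketch, not the Zenodo team's own tooling: the function names `iter_records` and `spam_counts` are hypothetical, and the only assumptions about the data are the ones stated in the description (one JSON object per line, gzip compression, a boolean `spam` term that may be null or absent).

```python
import gzip
import json
from collections import Counter


def iter_records(path):
    """Yield one JSON object per non-empty line of a gzip-compressed JSON Lines file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines instead of failing on them
                yield json.loads(line)


def spam_counts(path):
    """Count objects by the value of their `spam` term.

    Keys are True, False, or None (None when the term is null or missing).
    """
    return Counter(record.get("spam") for record in iter_records(path))
```

Calling `spam_counts` with the path of a downloaded records or communities file returns a `Counter` keyed by `True`, `False`, and possibly `None`. Because `iter_records` is a generator, the file is processed line by line rather than loaded into memory at once. For the uncompressed 200-line samples, the built-in `open` can stand in for `gzip.open`.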

  20. Audiovisual collection provided as LOD - Datasets - CLARIAH Labs Dataset...

    • mediasuitedata.clariah.nl
    Updated Nov 29, 2023
    Cite
    clariah.nl (2023). Audiovisual collection provided as LOD - Datasets - CLARIAH Labs Dataset Registry [Dataset]. https://mediasuitedata.clariah.nl/dataset/nisv-catalogue-lod
    Dataset updated
    Nov 29, 2023
    Dataset provided by
    clariah.nl
    License

    CC0 1.0 Universal Public Domain Dedication, https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Metadata for the Sound & Vision catalogue items has been transformed to RDF and mapped to schema.org. Descriptive metadata that is not subject to copyright law is included. All available metadata is loaded into a single S&V knowledge graph, which is accessible via a SPARQL endpoint.

Cite
Martin Fenner; Merce Crosas; Gustavo Durand; Sarala Wimalaratne; Florian Gräf; Richard Hallett; Manuel Bernal Llinares; Uwe Schindler; Tim Clark (2020). Listing of data repositories that embed schema.org metadata in dataset landing pages [Dataset]. http://doi.org/10.5281/zenodo.1262598

Listing of data repositories that embed schema.org metadata in dataset landing pages

Available download formats: csv
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
Martin Fenner; Merce Crosas; Gustavo Durand; Sarala Wimalaratne; Florian Gräf; Richard Hallett; Manuel Bernal Llinares; Uwe Schindler; Tim Clark
License

Attribution 4.0 (CC BY 4.0), https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Machine-readable metadata available from landing pages for datasets facilitate data citation by enabling easy integration with reference managers and other tools used in a data citation workflow. Embedding these metadata using the schema.org standard with JSON-LD is emerging as the community standard. This dataset is a listing of data repositories that have implemented this approach or are in the process of doing so.

This is the first version of this dataset and was generated via community consultation. We expect to update this dataset, as an increasing number of data repositories adopt this approach, and we hope to see this information added to registries of data repositories such as re3data and FAIRsharing.

In addition to the listing of data repositories, we provide information on the schema.org properties supported by these data repositories, focussing on the required and recommended properties from the "Data Citation Roadmap for Scholarly Data Repositories".
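
To illustrate what embedding schema.org metadata with JSON-LD means for a consuming tool, here is a minimal sketch that pulls `Dataset` descriptions out of a landing page using only the Python standard library. It is an assumption-laden illustration rather than any repository's actual implementation: `JsonLdExtractor`, `extract_datasets`, and `SAMPLE_PAGE` are hypothetical names, and the sample markup is hand-written for this example, not copied from a real landing page.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            # json.loads raises ValueError on malformed JSON; a real tool
            # would probably want to catch and log that.
            self.documents.append(json.loads("".join(self._buffer)))


def extract_datasets(html):
    """Return JSON-LD objects whose @type is exactly the string "Dataset"."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return [
        doc for doc in parser.documents
        if isinstance(doc, dict) and doc.get("@type") == "Dataset"
    ]


# Hypothetical landing page fragment, written by hand for this example.
SAMPLE_PAGE = """
<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Dataset",
 "name": "Listing of data repositories that embed schema.org metadata in dataset landing pages",
 "identifier": "http://doi.org/10.5281/zenodo.1262598"}
</script>
</head><body></body></html>
"""

datasets = extract_datasets(SAMPLE_PAGE)
```

A reference manager could then read properties such as `name` and `identifier` from each returned object. The sketch deliberately ignores pages where `@type` is a list or where several descriptions are wrapped in `@graph`; handling those would need extra cases.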
