Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Machine-readable metadata available from dataset landing pages facilitates data citation by enabling easy integration with reference managers and other tools used in a data citation workflow. Embedding these metadata using the schema.org standard serialized as JSON-LD is emerging as the community standard. This dataset is a listing of data repositories that have implemented this approach or are in the process of doing so.
This is the first version of this dataset and was generated via community consultation. We expect to update this dataset, as an increasing number of data repositories adopt this approach, and we hope to see this information added to registries of data repositories such as re3data and FAIRsharing.
In addition to the listing of data repositories, we provide information on the schema.org properties supported by these data repositories, focusing on the required and recommended properties from the "Data Citation Roadmap for Scholarly Data Repositories".
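To make this concrete, the following sketch shows the kind of schema.org Dataset description a repository might embed in a dataset landing page inside a <script type="application/ld+json"> element. All names, identifiers, and values below are hypothetical placeholders, not taken from the listed repositories:

```python
import json

# Minimal schema.org Dataset description, serialized as JSON-LD.
# Every value here is a hypothetical placeholder.
dataset = {
    "@context": "https://schema.org/",
    "@type": "Dataset",
    "name": "Example Ocean Temperature Observations",
    "description": "Hypothetical example of a landing-page description.",
    "identifier": "https://doi.org/10.5072/example",  # DOI as resolvable URL
    "creator": {"@type": "Person", "name": "Jane Researcher"},
    "publisher": {"@type": "Organization", "name": "Example Data Repository"},
    "datePublished": "2024-01-15",
    "license": "https://creativecommons.org/licenses/by/4.0/",
}

# This JSON would be placed in a <script type="application/ld+json">
# element in the HTML of the dataset's landing page.
jsonld = json.dumps(dataset, indent=2)
print(jsonld)
```

Reference managers and harvesters can then parse this block directly from the page HTML, without needing a repository-specific API.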
This resource contains slides for the AGU Fall Meeting 2023 presentation (#IN23A-07), given in San Francisco on Dec. 12 in session IN23A: Advancing Open Science: Emerging Techniques in Knowledge Management and Discovery II (Oral).
Effective response to global crises relies on universal access to scientific data and models, understanding their attributes, and representing their interconnectivity to facilitate collaborative research and decision-making. In the age of distributed data, geospatial researchers frequently invest significant time searching for, accessing, and working to understand scientific data. This often leads to the recreation of existing datasets, as well as challenges in determining methods for accessing, using, and ultimately establishing connections between resources. In recent years, following the FAIR and CARE principles, an emerging practice is to leverage structured and robust metadata to accelerate the discovery of web-based scientific resources and products. This practice assists users not only in discovery, but also in understanding the context, quality, and provenance of data, as well as the rights and responsibilities of data owners and consumers. It also empowers organizations to leverage their data more effectively and derive meaningful insights from them. Doing so, however, can be difficult, especially when diverse resources needed for scientific applications may be spread across multiple repositories or locations. We present a solution for leveraging the Schema.org vocabulary along with various web encodings, such as the Resource Description Framework (RDF) with JSON-LD, to create an actionable, curated catalog of scientific resources ranging from spatio-temporal data to software source code. We explore how resources of various types and common scientific formats, such as multidimensional data, software containers, source code, and spatial features, which are stored across various repositories and distributed cloud storage, can be described and cataloged.
Recognizing the impracticality of manually cataloging metadata, we have developed generic capabilities to automatically extract metadata for such resources, while empowering scientists to provide additional context. By incorporating comprehensive metadata, the exploration of diverse data relationships can be realized to gain insight into gaps and opportunities to improve the connectivity between science communities.
Open Data Commons Attribution License (ODC-By) v1.0 https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Combines all individual instances, models (shapes) and metadata from RKD schema.org datasets into one unified dataset.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This Excel file contains crosswalks among different metadata schemas that can be used to describe data cubes in the areas of Marine Science, Earth Sciences and Climate Research. These data cubes commonly contain observations of variables in some feature of interest, taken by Earth Observation systems (e.g., satellites) or as in-situ observations.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Earth.Org.UK (EOU) Product, Service and Event Review metadata for schema.org as key/value pairs in plain-text files.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This netCDF dataset contains simulation output from the Utah Energy Balance (UEB) model. It includes simulated snow water equivalent for the TWDEF site in Utah over the period Oct. 2009 to June 2010.
The document contains a mapping of metadata elements of the CESSDA Data Catalogue to the CESSDA Metadata Model (CMM), OpenAIRE, B2FIND, schema.org and Dublin Core. It also provides definitions, information on requirements, and notes for every metadata element.
The NIST Extensible Resource Data Model (NERDm) is a set of schemas for encoding, in JSON format, metadata that describe digital resources. The variety of digital resources it can describe includes not only digital data sets and collections, but also software, digital services, web sites and portals, and digital twins. It was created to serve as the internal metadata format used by the NIST Public Data Repository and Science Portal to drive rich presentations on the web and to enable discovery; however, it was also designed to enable programmatic access to resources and their metadata by external users. Interoperability was also a key design aim: the schemas are defined using the JSON Schema standard, metadata are encoded as JSON-LD, and their semantics are tied to community ontologies, with an emphasis on DCAT and the US federal Project Open Data (POD) models. Finally, extensibility is central to its design: the schemas are composed of a central core schema and various extension schemas, and new extensions to support richer metadata concepts can be added over time without breaking existing applications.
Validation is central to NERDm's extensibility model. Consuming applications should be able to choose which metadata extensions they care to support and ignore terms and extensions they don't support. Furthermore, they should not fail when a NERDm document leverages extensions they don't recognize, even when on-the-fly validation is required. To support this flexibility, the NERDm framework allows documents to declare which extensions are being used and where.
We have developed an optional extension to standard JSON Schema validation (see ejsonschema below) to support flexible validation: while a standard JSON Schema validator can validate a NERDm document against the NERDm core schema, our extension will validate a NERDm document against any recognized extensions and ignore those that are not recognized.
The NERDm data model is based around the concept of a resource, semantically equivalent to a schema.org Resource, and, as in schema.org, there can be different types of resources, such as data sets and software. A NERDm document indicates which types the resource qualifies as via the JSON-LD "@type" property. All NERDm Resources are described by metadata terms from the core NERDm schema; however, different resource types can be described by additional metadata properties (often drawing on particular NERDm extension schemas). A Resource contains Components of various types (including DCAT-defined Distributions) that are considered part of the Resource; specifically, these can include downloadable data files, hierarchical data collections, links to web sites (like software repositories), software tools, or other NERDm Resources. Through the NERDm extension system, domain-specific metadata can be included at either the resource or component level. The direct semantic and syntactic connections to the DCAT, POD, and schema.org schemas are intended to ensure unambiguous conversion of NERDm documents into those schemas.
As of this writing, the core NERDm schema and its framework stand at version 0.7 and are compatible with the "draft-04" version of JSON Schema. Version 1.0 is projected to be released in 2025. In that release, the NERDm schemas will be updated to the "draft2020" version of JSON Schema. Other improvements will include stronger support for RDF and the Linked Data Platform through its support of JSON-LD.
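The validate-what-you-recognize behavior described above can be sketched in plain Python. This is not the actual ejsonschema API: the "_extensionSchemas" property name, the helper function, and the validator callables below are illustrative assumptions only.

```python
# A plain-Python sketch of flexible validation: a consumer validates a
# document against the declared extensions it recognizes and silently
# skips those it does not. Hypothetical names throughout; the real
# implementation is the ejsonschema extension to JSON Schema validation.

def validate_flexibly(doc, validators):
    """Run every recognized extension validator over `doc`;
    unrecognized extensions are ignored rather than treated as errors."""
    errors = []
    for uri in doc.get("_extensionSchemas", []):  # assumed property name
        check = validators.get(uri)
        if check is None:
            continue  # unrecognized extension: skip, do not fail
        errors.extend(check(doc))
    return errors

# A toy validator standing in for a core schema: require a title.
CORE = "https://example.nist.gov/nerdm/core"  # hypothetical schema URI
validators = {CORE: lambda d: [] if "title" in d else ["missing title"]}

doc = {
    "_extensionSchemas": [CORE, "https://example.org/unknown-ext"],
    "title": "An example resource",
}
print(validate_flexibly(doc, validators))  # prints []
```

The unknown extension URI produces no error, which is the key design point: consuming applications keep working as new extension schemas appear.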
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This resource provides crosswalks among the most commonly used metadata schemes and guidelines to describe digital objects in Open Science, including:
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains two tables. One table contains metadata for "citable" datasets (datasets that have either a DOI or a compact identifier). The other table contains the relationships between each pair of datasets in the first table. We generated this corpus of dataset metadata by crawling the Web to find pages with schema.org or DCAT metadata indicating that the page contains a dataset. The metadata for datasets includes information such as the dataset's name, description, provider, creation date, Digital Object Identifiers (DOI), and more. Out of the 46 million dataset pages that have schema.org, we publish this subset of 4.3 million dataset-metadata entries that are citable. We also include an additional table on relationships between these datasets. Please contact dataset-search@googlegroups.com if you have any questions or requests to remove a dataset that you own from this collection.
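The crawling step described above hinges on detecting embedded schema.org metadata in page HTML. A minimal Python sketch of that detection, using only the standard library (class and variable names are our own, and the sample page is hypothetical):

```python
import json
from html.parser import HTMLParser

# Collect the contents of <script type="application/ld+json"> blocks
# and keep only objects whose @type is Dataset.
class JSONLDDatasetExtractor(HTMLParser):
    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buf = []
        self.datasets = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and ("type", "application/ld+json") in attrs:
            self._in_jsonld = True
            self._buf = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                obj = json.loads("".join(self._buf))
            except ValueError:
                return  # not valid JSON: ignore this block
            if obj.get("@type") == "Dataset":
                self.datasets.append(obj)

# Hypothetical landing-page snippet with embedded schema.org metadata.
page = """<html><head><script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Dataset",
 "name": "Example dataset", "identifier": "https://doi.org/10.5072/example"}
</script></head><body>...</body></html>"""

extractor = JSONLDDatasetExtractor()
extractor.feed(page)
print([d["name"] for d in extractor.datasets])  # prints ['Example dataset']
```

A production crawler would additionally handle @graph containers, arrays of top-level objects, and DCAT serializations, which this sketch omits.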
Metadata and Annotations (structured data) of tirol.at
This resource contains several different file types to help identify appropriate properties for each file type when designing the metadata schema based on Schema.org.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This resource contains several different file types to help identify appropriate properties for each file type when designing the metadata schema based on Schema.org.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This fileset contains a preprint version of the conference paper (.pdf), presentation slides (.pptx), and the dataset(s) and validation schema(s) for the IDCC 2019 (Melbourne) conference paper: The Red Queen in the Repository: metadata quality in an ever-changing environment. Datasets and schemas are in .xml, .xsd, Excel (.xlsx) and .csv formats (two .csv files representing two different sheets in the .xlsx file). The validationSchemas.zip holds the additional validation schemas (.xsd) that were not found in the schemaLocations of the metadata xml-files to be validated. The schemas must all be placed in the same folder, and are to be used for validating the Dataverse dcterms records (with metadataDCT.xsd) and the Zenodo oai_datacite feeds respectively (schema.datacite.org_oai_oai-1.0_oai.xsd). In the latter case, a simpler approach might be to replace the incorrect URL "http://schema.datacite.org/oai/oai-1.0/ oai_datacite.xsd" in the schemaLocation of these xml-files with the correct schemaLocation="http://schema.datacite.org/oai/oai-1.0/ http://schema.datacite.org/oai/oai-1.0/oai.xsd", as has been done already in the sample files here. The sample file folders testDVNcoll.zip (Dataverse), testFigColl.zip (Figshare) and testZenColl.zip (Zenodo) contain all the metadata files tested and validated that are registered in the spreadsheet with objectIDs.
In the case of Zenodo, one original file feed, zen2018oai_datacite3orig-https%20_zenodo.org_oai2d%20verb=ListRecords%26metadataPrefix=oai_datacite%26from=2018-11-29%26until=2018-11-30.xml, is also supplied to show what was necessary to change in order to perform validation as indicated in the paper.
For Dataverse, a corrected version of a file, dvn2014ddi-27595Corr_https%20_dataverse.harvard.edu_api_datasets_export%20exporter=ddi%26persistentId=doi%253A10.7910_DVN_27595Corr.xml, is also supplied to show the changes it would take to make the file validate without error.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data associated with "Developing a standardized but extendable framework to increase the findability of infectious disease datasets"
Includes:
The open access movement and scientific reproducibility concerns have led the biomedical research community to embrace efforts to make scientific datasets openly accessible. While many datasets are now available, there are still challenges in ensuring that they are Findable, Accessible, Interoperable, and Reusable (FAIR). To improve the FAIRness of datasets, we evaluated dataset repositories for compliance with Schema.org standards – a collection of standards developed to increase metadata searchability across the internet. Adoption of the Schema.org Dataset standard was highly variable in biomedical research datasets, and the standard omitted many desirable metadata fields. We customized the Schema.org Dataset standard to catalog datasets collected across a Systems Biology research consortium consisting of 15 Centers. We developed a reusable process for creating a schema which is interoperable with other standards, but still extendable and customizable to a particular context. Here, we describe our process along with the associated gains in FAIRness, and discuss ongoing challenges with dataset discoverability – the first step to ensure that the vast amount of open data published by the research community is reused to its maximum value.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Dryad is a general-purpose curated repository for data underlying scholarly publications. Dryad's metadata framework is supported by a Dublin Core Application Profile (DCAP, hereafter referred to as application profile). This paper examines the evolution of Dryad's application profile, which has been revised over time, in an operational system, serving day-to-day needs of stakeholders. We model the relationships between data packages and data files over time, from its initial implementation in 2007 to its current practice, version 3.2, and present a crosswalk analysis. Results covering versions 1.0 to 3.0 show an increase in the number of metadata elements used to describe Dryad's data objects. Results also confirm that Version 3.0, which envisioned separate metadata element sets for data package, data files, and publication metadata, was never fully realized due to constraints in Dryad system architecture. Version 3.1 subsequently reduced the number of metadata elements captured by recombining the publication and data package element sets. This paper documents a real-world application profile implemented in an operational system, noting practical system and infrastructure constraints. Finally, the analysis presented informs an ongoing effort to update the application profile to support Dryad's diverse and expanding community of stakeholders.
This Excel workbook is a compilation of the major metadata schemas for life cycle assessment.
https://www.gesis.org/en/institute/data-usage-terms
ClaimsKG is a knowledge graph of metadata for 59580 fact-checked claims scraped from 13 fact-checking sites. In addition to providing a single dataset of claims and associated metadata, truth ratings are harmonised and additional information is provided for each claim, e.g., about mentioned entities. Please see https://data.gesis.org/claimskg/ for further details about the data model and statistics.
The dataset facilitates structured queries about claims, their truth values, involved entities, authors, dates, and other kinds of metadata. ClaimsKG is generated through a (semi-)automated pipeline, which harvests claim-related data from popular fact-checking web sites, annotates them with related entities from DBpedia/Wikipedia, and lifts all data to RDF using established vocabularies (such as schema.org).
The latest release of ClaimsKG covers 59580 claims. The data was scraped through August 2022 and contains claims published between 1996 and 2022 on 13 fact-checking websites; the claim-review (fact-checking) dates likewise range from 1996 to 2022. The entity-fishing Python client (https://github.com/hirmeos/entity-fishing-client-python) was used for entity linking and disambiguation in this release. The dataset contains a total of 1371271 entities detected and referenced with DBpedia. More information, such as detailed statistics, query examples and a user-friendly interface to explore the knowledge graph, is available at https://data.gesis.org/claimskg/.
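As an illustration of the structured queries the graph enables, here is a hypothetical SPARQL sketch assuming a schema.org ClaimReview/Claim modeling; the actual ClaimsKG predicates and graph layout may differ, so consult the data-model documentation at the link above before querying:

```sparql
# Hypothetical sketch: claims with their harmonised truth ratings and
# mentioned entities, assuming schema.org-based modeling as described
# above. Predicate names are illustrative, not confirmed.
PREFIX schema: <http://schema.org/>

SELECT ?claim ?ratingName ?entity
WHERE {
  ?review a schema:ClaimReview ;
          schema:itemReviewed ?claim ;
          schema:reviewRating ?rating .
  ?rating schema:alternateName ?ratingName .
  ?claim  schema:mentions ?entity .
}
LIMIT 10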
The first two releases of ClaimsKG are hosted at Zenodo (https://doi.org/10.5281/zenodo.3518960), ClaimsKGV1.0 (published on 04.04.2019), ClaimsKGV2.0 (published on 01.09.2019). This latest release of ClaimsKG supersedes the previous versions as it contains all the claims from the previous versions together with additional claims as well as improved entity annotations.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains metadata for Zenodo's published open-access records and communities, including entries that were marked as spam by Zenodo staff and deleted.
The datasets are gzip-compressed JSON Lines files, where each line is a JSON object representing a Zenodo record or community.
Records dataset
Filename: zenodo_open_metadata_{ date of export }.jsonl.gz
Each object contains the terms: part_of, thesis, description, doi, meeting, imprint, references, recid, alternate_identifiers, resource_type, journal, related_identifiers, title, subjects, notes, creators, communities, access_right, keywords, contributors, publication_date
which correspond to the fields with the same name available in Zenodo's record JSON Schema at https://zenodo.org/schemas/records/record-v1.0.0.json.
In addition, some terms have been altered:
The term files contains a list of dictionaries containing filetype, size, and filename only.
The term license contains a short Zenodo ID of the license (e.g. "cc-by").
Communities dataset
Filename: zenodo_community_metadata_{ date of export }.jsonl.gz
Each object contains the terms: id, title, description, curation_policy, page
which correspond to the fields with the same name available in Zenodo's community creation form.
Notes for all datasets
For each object, the term spam contains a boolean value indicating whether the given record/community was marked as spam content by Zenodo staff.
Top-level terms whose values were missing in the original metadata may contain a null value.
A smaller uncompressed random sample of 200 JSON lines is also included for each dataset to test and get familiar with the format without having to download the entire dataset.
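Given the JSON Lines format described above, a dump can be streamed record by record without loading the whole file into memory. A minimal Python sketch (the function names are our own, and the filename follows the pattern given above with a hypothetical export date):

```python
import gzip
import json

# Stream a gzip-compressed JSON Lines dump one record at a time.
def iter_records(path):
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        for line in fh:
            yield json.loads(line)

# Example use: count records flagged as spam by Zenodo staff.
def count_spam(path):
    return sum(1 for rec in iter_records(path) if rec.get("spam"))

# e.g. count_spam("zenodo_open_metadata_2023-01-01.jsonl.gz")
```

The smaller uncompressed sample files can be read the same way by swapping gzip.open for the built-in open.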
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Metadata for the Sound & Vision catalogue items have been transformed to RDF and mapped to schema.org. Only descriptive metadata not subject to copyright law is included. All available metadata are loaded into one S&V knowledge graph, which is accessible via a SPARQL endpoint.