100+ datasets found

Data from: Data Collections Explorer – An Easy-to-Use Tool for Sharing and...
meta4ds.fokus.fraunhofer.de
pdf, unknown
Updated Sep 15, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Zenodo (2023). Data Collections Explorer – An Easy-to-Use Tool for Sharing and Discovering Research Data [Dataset]. https://meta4ds.fokus.fraunhofer.de/datasets/oai-zenodo-org-8399407~~1?locale=en
Explore at:
pdf(4797020), unknownAvailable download formats
Dataset updated
Sep 15, 2023
Dataset authored and provided by
Zenodohttp://zenodo.org/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
There is a wide variety of archives, databases, and repositories currently available that pro-vide access to research data. However, basic information about these systems is often diffi-cult to gather, such as whether there are limits to the size of data sets that can be published or whether there is any publication fee that applies. In addition to that, there are plenty of re-search groups publishing their research data sets independently of these infrastructures, making it difficult for scientists to find them since they are not centrally registered. Research data must be easily discoverable and accessible for scientists to use it effectively. The Data Collections Explorer, developed within the national research data infrastructure for the engineering sciences NFDI4Ing, is an easy-to-use information system addressing these needs. It is a low threshold information system that provides an overview of research data repositories, archives, databases as well as individually published data sets. Similar systems exist in other subject areas, for example the Data Repository Finder focusing on the medi-cal, life and social sciences. Contrary to the Data Collections Explorer, the Data Repository Finder only lists repositories. This is the slide set for the talk as part of the "Engineering Sciences" track at the 1st Conference on Research Data Infrastructures.
Inventory of Online Agricultural Data Repositories
kaggle.com
zip
Updated Jul 22, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abdelaziz Sami (2024). Inventory of Online Agricultural Data Repositories [Dataset]. https://www.kaggle.com/datasets/abdelazizsami/inventory-of-online-agricultural-data-repositories
Explore at:
zip(819512 bytes)Available download formats
Dataset updated
Jul 22, 2024
Authors
Abdelaziz Sami
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Inventory of Online Public Databases and Repositories Holding Agricultural Data in 2017

Metadata Updated: March 30, 2024

United States agricultural researchers have many options for making their data available online. This dataset aggregates the primary sources of ag-related data and determines where researchers are likely to deposit their agricultural data. These data serve as both a current landscape analysis and a baseline for future studies of ag research data.

Purpose As sources of agricultural data become more numerous and disparate, and collaboration and open data become more expected if not required, this research provides a landscape inventory of online sources of open agricultural data. An inventory of current agricultural data-sharing options will help assess how the Ag Data Commons, a platform for USDA-funded data cataloging and publication, can best support data-intensive and multidisciplinary research. It will also help agricultural librarians assist their researchers in data management and publication. The goals of this study were to:

Establish where agricultural researchers in the United States—land grant and USDA researchers, primarily ARS, NRCS, USFS, and other agencies—currently publish their data, including general research data repositories, domain-specific databases, and the top journals.

Compare how much data is in institutional vs. domain-specific vs. federal platforms.

Determine which repositories are recommended by top journals that require or recommend the publication of supporting data.

Ascertain where researchers not affiliated with funding or initiatives possessing a designated open data repository can publish data.

Approach The National Agricultural Library team focused on Agricultural Research Service (ARS), Natural Resources Conservation Service (NRCS), and United States Forest Service (USFS) style research data, rather than ag economics, statistics, and social sciences data. To find domain-specific, general, institutional, and federal agency repositories and databases that are open to US research submissions and have some amount of ag data, resources including re3data, libguides, and ARS lists were analyzed. Primarily environmental or public health databases were not included, but places where ag grantees would publish data were considered.

Search Methods - We first compiled a list of known domain-specific USDA/ARS datasets/databases represented in the Ag Data Commons, including ARS Image Gallery, ARS Nutrition Databases (sub-components), SoyBase, PeanutBase, National Fungus Collection, i5K Workspace @ NAL, and GRIN. - We then searched using search engines such as Bing and Google for non-USDA/federal ag databases, using Boolean variations of “agricultural data” /“ag data” / “scientific data” + NOT + USDA (to filter out the federal/USDA results). Most of these results were domain-specific, though some contained a mix of data subjects. - We searched using search engines such as Bing and Google to find top agricultural university repositories using variations of “agriculture”, “ag data” and “university” to find schools with agriculture programs. Using that list of universities, we searched each university website to see if their institution had a repository for their unique, independent research data if not apparent in the initial web browser search. - We found both ag-specific university repositories and general university repositories that housed a portion of agricultural data. Ag-specific university repositories are included in the list of domain-specific repositories. Results included Columbia University – International Research Institute for Climate and Society, UC Davis – Cover Crops Database, etc. If a general university repository existed, we determined whether that repository could filter to include only data results after our chosen ag search terms were applied. General university databases that contain ag data included Colorado State University Digital Collections, University of Michigan ICPSR (Inter-university Consortium for Political and Social Research), and University of Minnesota DRUM (Digital Repository of the University of Minnesota). - We then split out NCBI (National Center for Biotechnology Information) repositories. - Next, we searched the internet for open general data repositories using a variety of search engines, and repositories containing a mix of data, journals, books, and other types of records were tested to determine whether that repository could filter for data results after search terms were applied. General subject data repositories include Figshare, Open Science Framework, PANGEA, Protein Data Bank, and Zenodo. - Finally, we compared scholarly journal suggestions for data repositories against our list to fill in any missing repositories that might contain agricultural data. Extensive lists of journals were compi...
Locating Restricted Data Repositories
osf.io
Updated Mar 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mary Oberlies; Megan Potterbusch (2025). Locating Restricted Data Repositories [Dataset]. https://osf.io/k9u5x
Explore at:
Dataset updated
Mar 11, 2025
Dataset provided by
Center for Open Sciencehttps://cos.io/
Authors
Mary Oberlies; Megan Potterbusch
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the modern era, the near impossibility of true anonymization means we must provide tangible recommendations for researchers who need to share de-identified, person-level data that could potentially be re-identified due to the presence of quasi-identifiers. While various repository aggregators like Re3data and DataCite Repository Finder provide lists of data repositories, navigating these can be cumbersome when trying to locate options for depositing restricted data. These listings rarely include certain necessary details, making the process of recommending third-party repositories to researchers time-consuming – or even limited, and we often end up relying on a short list of well-known repositories. An additional challenge is the difficulty of identifying repositories that mediate access via data usage agreements, where the repository handles access requests to ensure potential users meet established security and privacy requirements and have taken the necessary steps to protect confidentiality and commit to appropriate data use. As part of a capstone project for the Data Services Continuing Education Program, we identified and created a spreadsheet of restricted data repositories with mediated access processes for researchers. While our project scope was limited to the social sciences and US based repositories, in sharing this work, we hope others will continue to contribute to this work and expand on it.
Open Reading Frame Finder (ORF Finder) - tud7-8pud - Archive Repository
healthdata.gov
csv, xlsx, xml
Updated Jul 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Open Reading Frame Finder (ORF Finder) - tud7-8pud - Archive Repository [Dataset]. https://healthdata.gov/w/jfid-e939/default?cur=P-TbBfwSVkc&from=iJ4kX3Wb2Jf
Explore at:
csv, xml, xlsxAvailable download formats
Dataset updated
Jul 16, 2025
Description
This dataset tracks the updates made on the dataset "Open Reading Frame Finder (ORF Finder)" as a repository for previous versions of the data and metadata.
d
Data from: Scientific production on data repositories and open science...
search.dataone.org
dataverse.harvard.edu
Updated Sep 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rodrigues-Junior, Sinval (2024). Scientific production on data repositories and open science published in the Web of Science database – Bibliometric conceptual analysis [Dataset]. http://doi.org/10.7910/DVN/MZ1EUP
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/MZ1EUP
Dataset updated
Sep 24, 2024
Dataset provided by
Harvard Dataverse
Authors
Rodrigues-Junior, Sinval
Description
This document describes data collected from the Main Collection of the Web of Science database. Records of published studies addressing the intersection of Open Science and data repository were searched up to January 15th, 2024, and the final dataset was comprised of 545 records for bibliometric analysis.
Project MILDRED Research Data Repository Survey, University of Helsinki
figshare.com
resodate.org
txt
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Salmi, Anna; Ojanen, Mikko; Kuusniemi, Mari Elisa (2023). Project MILDRED Research Data Repository Survey, University of Helsinki [Dataset]. http://doi.org/10.6084/m9.figshare.3806394.v4
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.3806394.v4
Dataset updated
May 31, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Salmi, Anna; Ojanen, Mikko; Kuusniemi, Mari Elisa
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is part of Project MILDRED, Development Project of Research Data Infrastructure at University of Helsinki. The project started on April 29, 2016. Project aim is to provide University of Helsinki with state-of-the-art research data management service infrastructure. To gain knowledge about researchers' data storage and preservation practices in 2016, an e-survey was sent to the UH research staff about 1) what data repositories they use for depositing their research data; 2) what reasons they had for not depositing data and 3) what alternative storage devices and repository services they used for their data.The dataset consists of e-survey report master file and analysis of the original master file. The files have been anonymized. A readme.rtf file is included to provide full project and data level documentation.
d
Bear Lake Data Repository
search.dataone.org
Updated Sep 14, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jeff Nielson; Katie Wadsworth; Bear Lake Watch (2024). Bear Lake Data Repository [Dataset]. https://search.dataone.org/view/sha256%3A72e21ae4eb9367bfc7595b0a4a27bf5a246e0b20e0609c9f169620b8ed7b6182
Explore at:
Dataset updated
Sep 14, 2024
Dataset provided by
Hydroshare
Authors
Jeff Nielson; Katie Wadsworth; Bear Lake Watch
Description
The Bear Lake Data Repository (BLDR) is an active archive, containing a growing compilation of biological, chemical, and physical datasets collected from Bear Lake and its surrounding watershed. The datasets herein have been digitized from historical records and reports, extracted from papers and theses, and obtained from public and private entities, including the United States Geological Survey, PacifiCorp, and, inter alia, Ecosystems Research Institute.

Contributions are welcome. The BLDR accepts biological, chemical, or physical datasets obtained at Bear Lake, irrespective of funding source. There is no submission size limit at present—workarounds will be found if submissions exceed Hydroshare limits (20 GB). Contributions are published with an open access license and will serve many use cases. The current repository steward, Bear Lake Watch, will advise on submissions and make accepted contributions available promptly.

Metadata files are provided for each dataset, however, contact with original contributor(s) is encouraged for questions and additional details prior to data usage. The BLDR and its contributors shall not be liable for any damages resulting from misinterpretation or misuse of the data or metadata.
d
Data from: Common Metadata Framework for Research Data Repository: Necessity...
search.dataone.org
dataverse.harvard.edu
Updated Mar 5, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Asok, Kavya; Dandpat, Snigdha; Gupta, Dinesh K.; Shrivastava, Prashant (2024). Common Metadata Framework for Research Data Repository: Necessity to Support Open Science [Dataset]. http://doi.org/10.7910/DVN/JK6HBB
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/JK6HBB
Dataset updated
Mar 5, 2024
Dataset provided by
Harvard Dataverse
Authors
Asok, Kavya; Dandpat, Snigdha; Gupta, Dinesh K.; Shrivastava, Prashant
Description
These research datasets are the updated version of the conference poster "Research data repositories and their metadata: A comparative study," presented by Ms. Kavya Asok and Ms. Snigdha Dandpat in a Conference on Open and FAIR Data Ecosystem: Principles, Policies, and Platforms scheduled from 11th -13th September 2023, at IIC, New Delhi. The study describes the features of a select number of RDRs and analyzes their metadata practices.
Total number of datasets parsed per data repository and metadata standards...
plos.figshare.com
xls
Updated Jun 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Felicitas Löffler; Valentin Wesp; Birgitta König-Ries; Friederike Klan (2023). Total number of datasets parsed per data repository and metadata standards and schemata. [Dataset]. http://doi.org/10.1371/journal.pone.0246099.t006
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0246099.t006
Dataset updated
Jun 11, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Felicitas Löffler; Valentin Wesp; Birgitta König-Ries; Friederike Klan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The numbers in brackets denote the number of datasets used for the analysis. All datasets were harvested and parsed in May 2019.
d
Digital Collections of Colorado, DSpace Repository, Long Term Ecological...
catalog.data.gov
geodata.nal.usda.gov
+1more
Updated Dec 2, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Agricultural Research Service (2025). Digital Collections of Colorado, DSpace Repository, Long Term Ecological Research (LTER) datasets [Dataset]. https://catalog.data.gov/dataset/digital-collections-of-colorado-dspace-repository-long-term-ecological-research-lter-datas
Explore at:
Dataset updated
Dec 2, 2025
Dataset provided by
Agricultural Research Service
Area covered
Colorado
Description
Dataset links to the Digital Collections of Colorado, DSpace Repository. From the homepage, you can search the 1240 datasets hosted there, or browse using a list of filters on the right. DSpace is a digital service that collects, preserves, and distributes digital material. Resources in this dataset: Resource Title: GeoData catalog record. File Name: Web Page, url: https://geodata.nal.usda.gov/geonetwork/srv/eng/catalog.search#/metadata/ShortgrassSteppe_eaa_2015_March_19_1220
Living Document I: Belgian mental health (care) data repository
repository.uantwerpen.be
explore.openaire.eu
Updated 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Van Hoof, Elk; De Laet, Hannah; Résibois, Maxime; Gérard, Sylvie; Dekeyser, Sarah; Loix, Ellen; Philips, Evelien; Snoeck, Sylvia; Maratovna Safiullina, Zamira; De Koker, Benedicte; De Witte, Nico; Lambotte, Deborah; Cruyt, Ellen; Van de Velde, Dominique; Godderis, Lode; Blavier, Adelaïde; Van den Broeck, Kris; Luminet, Olivier; Van den Cruyce, Nele (2022). Living Document I: Belgian mental health (care) data repository [Dataset]. http://doi.org/10.5281/zenodo.4415442
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.4415442
Dataset updated
2022
Dataset provided by
Zenodohttp://zenodo.org/
Faculty of Medicine and Health Sciences
University of Antwerp
Authors
Van Hoof, Elk; De Laet, Hannah; Résibois, Maxime; Gérard, Sylvie; Dekeyser, Sarah; Loix, Ellen; Philips, Evelien; Snoeck, Sylvia; Maratovna Safiullina, Zamira; De Koker, Benedicte; De Witte, Nico; Lambotte, Deborah; Cruyt, Ellen; Van de Velde, Dominique; Godderis, Lode; Blavier, Adelaïde; Van den Broeck, Kris; Luminet, Olivier; Van den Cruyce, Nele
Area covered
Belgium
Description
Covid-19 has had a big impact on many aspects of our life, including mental health. Since the start of the pandemic, a whole body of Belgian research has been performed on the relation between covid-19 and mental health (care). The mental health & covid-19 working group of the superior health council lists these studies in order to provide advice to policy makers and the general public. The first advisory report focused on international literature on contagious outbreaks, since not many studies on covid-19 and, especially, not many Belgian studies were published yet. This advice can be found here: https://www.health.belgium.be/en/report-9589-mental-health-and-covid-19 As part of the work performed in the first advisory report, the Policy Coordination Working group has asked the Superior Health Council to list all Belgian studies investigating the relation between covid-19 and mental health and/or mental health care and to provide regular updates. The superior Health Council, therefore, started the project of the Belgian mental health data repository. This repository will consist of ongoing studies, preliminary results, accepted and published articles with a Belgian population. For each study, an overview will be given of the authors (including contact details), level of evidence and a short description of the study. The Belgian Mental Health Data Repository will allow for other researchers, policy makers, health care providers and the general public to have a better idea of and easier access to the mental health studies in Belgium. Additionally, more in-depth analyses across studies can be facilitated leading to better insights into the impact of covid-19 on mental health. An update of the living document will be published weekly.
ALW Assisted Living Facility Finder App - 8fah-cj2v - Archive Repository
healthdata.gov
csv, xlsx, xml
Updated Nov 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). ALW Assisted Living Facility Finder App - 8fah-cj2v - Archive Repository [Dataset]. https://healthdata.gov/dataset/ALW-Assisted-Living-Facility-Finder-App-8fah-cj2v-/m5uh-yzt7
Explore at:
xml, xlsx, csvAvailable download formats
Dataset updated
Nov 17, 2025
Description
This dataset tracks the updates made on the dataset "ALW Assisted Living Facility Finder App" as a repository for previous versions of the data and metadata.
o
Data Repository for "A Matheuristic for Complex Pricing Problems: An...
explore.openaire.eu
Updated Jan 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kristina Bayer; Robert Klein (2025). Data Repository for "A Matheuristic for Complex Pricing Problems: An Application to Rentable Resources" [Dataset]. http://doi.org/10.5281/zenodo.14250619
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.14250619
Dataset updated
Jan 1, 2025
Authors
Kristina Bayer; Robert Klein
Description
Data Repository for "A Matheuristic for Complex Pricing Problems: An Application to Rentable Resources"
d
Migrating ODESI to the national Borealis data repository
search.dataone.org
borealisdata.ca
Updated Dec 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Leahey, Amber; Sabina, Pagotto; Urquidi Diaz, Alicia (2023). Migrating ODESI to the national Borealis data repository [Dataset]. http://doi.org/10.5683/SP3/BRRYSV
Explore at:
Unique identifier
https://doi.org/10.5683/SP3/BRRYSV
Dataset updated
Dec 28, 2023
Dataset provided by
Borealis
Authors
Leahey, Amber; Sabina, Pagotto; Urquidi Diaz, Alicia
Description
Since the launch of ODESI in 2008, academic libraries have supported the development of shared infrastructure for open discovery and access to important collections of Canadian social science survey data. With the current migration of all metadata and data collections to the new national Borealis data repository, collaborative curation and best practices are migrating as well, leading to the development of new approaches, training, policies, and documentation, to support the ongoing deposit, curation, and preservation of data in the repository. This presentation will provide an overview of the repository migration project, updates to the search interface to support further integration with Borealis, and steps taken by the technical team and community-led committee to ensure a smooth transition for all ODESI end-users and library data stewards.
Supporting dataset for: Repository optimisation & techniques to improve...
zenodo.org
data.niaid.nih.gov
+1more
bin, txt
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
George Macgregor; George Macgregor (2020). Supporting dataset for: Repository optimisation & techniques to improve discoverability and web impact : an evaluation [Dataset]. http://doi.org/10.5281/zenodo.1411207
Explore at:
txt, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.1411207
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
George Macgregor; George Macgregor
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset supports the working paper, "Repository optimisation & techniques to improve discoverability and web impact : an evaluation", currently under review for publication and available as a preprint at: https://doi.org/10.17868/65389/.

Macgregor, G. (2018). Repository optimisation techniques to improve discoverability and web impact: an evaluation. (pp. 1-13). Glasgow: University of Strathclyde [Strathprints repository]. Available: https://doi.org/10.17868/65389/

The dataset comprises a single OpenDocument Spreadsheet (.ods) format file containing seven data sheets of data pertaining to COUNTER compliant usage statistics, search query traffic from Google Search Console, web traffic data for Google Analytics and Google Scholar, and usage statistics from IRStats2. All data relate to the EPrints repository, Strathprints, based at the University of Strathclyde.
n
Data from: Repository Analytics and Metrics Portal (RAMP) 2021 data
data.niaid.nih.gov
search.dataone.org
+1more
zip
Updated May 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jonathan Wheeler; Kenning Arlitsch (2023). Repository Analytics and Metrics Portal (RAMP) 2021 data [Dataset]. http://doi.org/10.5061/dryad.1rn8pk0tz
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.1rn8pk0tz
Dataset updated
May 23, 2023
Dataset provided by
University of New Mexico
Montana State University
Authors
Jonathan Wheeler; Kenning Arlitsch
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
The Repository Analytics and Metrics Portal (RAMP) is a web service that aggregates use and performance use data of institutional repositories. The data are a subset of data from RAMP, the Repository Analytics and Metrics Portal (http://rampanalytics.org), consisting of data from all participating repositories for the calendar year 2021. For a description of the data collection, processing, and output methods, please see the "methods" section below.

The record will be revised periodically to make new data available through the remainder of 2021.

Methods

Data Collection

RAMP data are downloaded for participating IR from Google Search Console (GSC) via the Search Console API. The data consist of aggregated information about IR pages which appeared in search result pages (SERP) within Google properties (including web search and Google Scholar).

Data are downloaded in two sets per participating IR. The first set includes page level statistics about URLs pointing to IR pages and content files. The following fields are downloaded for each URL, with one row per URL:

url: This is returned as a 'page' by the GSC API, and is the URL of the page which was included in an SERP for a Google property. impressions: The number of times the URL appears within the SERP. clicks: The number of clicks on a URL which took users to a page outside of the SERP. clickThrough: Calculated as the number of clicks divided by the number of impressions. position: The position of the URL within the SERP. date: The date of the search.

Following data processing describe below, on ingest into RAMP a additional field, citableContent, is added to the page level data.

The second set includes similar information, but instead of being aggregated at the page level, the data are grouped based on the country from which the user submitted the corresponding search, and the type of device used. The following fields are downloaded for combination of country and device, with one row per country/device combination:

country: The country from which the corresponding search originated. device: The device used for the search. impressions: The number of times the URL appears within the SERP. clicks: The number of clicks on a URL which took users to a page outside of the SERP. clickThrough: Calculated as the number of clicks divided by the number of impressions. position: The position of the URL within the SERP. date: The date of the search.

Note that no personally identifiable information is downloaded by RAMP. Google does not make such information available.

More information about click-through rates, impressions, and position is available from Google's Search Console API documentation: https://developers.google.com/webmaster-tools/search-console-api-original/v3/searchanalytics/query and https://support.google.com/webmasters/answer/7042828?hl=en

Data Processing

Upon download from GSC, the page level data described above are processed to identify URLs that point to citable content. Citable content is defined within RAMP as any URL which points to any type of non-HTML content file (PDF, CSV, etc.). As part of the daily download of page level statistics from Google Search Console (GSC), URLs are analyzed to determine whether they point to HTML pages or actual content files. URLs that point to content files are flagged as "citable content." In addition to the fields downloaded from GSC described above, following this brief analysis one more field, citableContent, is added to the page level data which records whether each page/URL in the GSC data points to citable content. Possible values for the citableContent field are "Yes" and "No."

The data aggregated by the search country of origin and device type do not include URLs. No additional processing is done on these data. Harvested data are passed directly into Elasticsearch.

Processed data are then saved in a series of Elasticsearch indices. Currently, RAMP stores data in two indices per participating IR. One index includes the page level data, the second index includes the country of origin and device type data.

About Citable Content Downloads

Data visualizations and aggregations in RAMP dashboards present information about citable content downloads, or CCD. As a measure of use of institutional repository content, CCD represent click activity on IR content that may correspond to research use.

CCD information is summary data calculated on the fly within the RAMP web application. As noted above, data provided by GSC include whether and how many times a URL was clicked by users. Within RAMP, a "click" is counted as a potential download, so a CCD is calculated as the sum of clicks on pages/URLs that are determined to point to citable content (as defined above).

For any specified date range, the steps to calculate CCD are:

Filter data to only include rows where "citableContent" is set to "Yes." Sum the value of the "clicks" field on these rows.

Output to CSV

Published RAMP data are exported from the production Elasticsearch instance and converted to CSV format. The CSV data consist of one "row" for each page or URL from a specific IR which appeared in search result pages (SERP) within Google properties as described above. Also as noted above, daily data are downloaded for each IR in two sets which cannot be combined. One dataset includes the URLs of items that appear in SERP. The second dataset is aggregated by combination of the country from which a search was conducted and the device used.

As a result, two CSV datasets are provided for each month of published data:

page-clicks:

The data in these CSV files correspond to the page-level data, and include the following fields:

url: This is returned as a 'page' by the GSC API, and is the URL of the page which was included in an SERP for a Google property. impressions: The number of times the URL appears within the SERP. clicks: The number of clicks on a URL which took users to a page outside of the SERP. clickThrough: Calculated as the number of clicks divided by the number of impressions. position: The position of the URL within the SERP. date: The date of the search. citableContent: Whether or not the URL points to a content file (ending with pdf, csv, etc.) rather than HTML wrapper pages. Possible values are Yes or No. index: The Elasticsearch index corresponding to page click data for a single IR. repository_id: This is a human readable alias for the index and identifies the participating repository corresponding to each row. As RAMP has undergone platform and version migrations over time, index names as defined for the previous field have not remained consistent. That is, a single participating repository may have multiple corresponding Elasticsearch index names over time. The repository_id is a canonical identifier that has been added to the data to provide an identifier that can be used to reference a single participating repository across all datasets. Filtering and aggregation for individual repositories or groups of repositories should be done using this field.

Filenames for files containing these data end with “page-clicks”. For example, the file named 2021-01_RAMP_all_page-clicks.csv contains page level click data for all RAMP participating IR for the month of January, 2021.

country-device-info:

The data in these CSV files correspond to the data aggregated by country from which a search was conducted and the device used. These include the following fields:

country: The country from which the corresponding search originated. device: The device used for the search. impressions: The number of times the URL appears within the SERP. clicks: The number of clicks on a URL which took users to a page outside of the SERP. clickThrough: Calculated as the number of clicks divided by the number of impressions. position: The position of the URL within the SERP. date: The date of the search. index: The Elasticsearch index corresponding to country and device access information data for a single IR. repository_id: This is a human readable alias for the index and identifies the participating repository corresponding to each row. As RAMP has undergone platform and version migrations over time, index names as defined for the previous field have not remained consistent. That is, a single participating repository may have multiple corresponding Elasticsearch index names over time. The repository_id is a canonical identifier that has been added to the data to provide an identifier that can be used to reference a single participating repository across all datasets. Filtering and aggregation for individual repositories or groups of repositories should be done using this field.

Filenames for files containing these data end with “country-device-info”. For example, the file named 2021-01_RAMP_all_country-device-info.csv contains country and device data for all participating IR for the month of January, 2021.

References

Google, Inc. (2021). Search Console APIs. Retrieved from https://developers.google.com/webmaster-tools/search-console-api-original.
SIDS - Gender Spatial Data Repository
datacatalog.worldbank.org
Updated May 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
World Bank Group (2025). SIDS - Gender Spatial Data Repository [Dataset]. https://datacatalog.worldbank.org/search/dataset/0064127/sids-gender-spatial-data-repository
Explore at:
Dataset updated
May 6, 2025
Dataset authored and provided by
World Bank Grouphttp://www.worldbank.org/
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Sao Tome and Principe, Maldives, Mauritius, Comoros, Guinea-Bissau, Cabo Verde
Description
As part of the “Geospatial Assessment of Women Employment and Business Opportunities in the Energy Sector” project, open-source Gender-related spatial data was collected for 31 Small Island Developing States (SIDS) across the globe, resulting in curated and thoroughly documented geodatabases (GDBs) that are now ready to be explored! Fifty-nine spatial layers were identified and then researched for each country, covering the following categories: Demographics and Population | Renewable Energy | Energy Access | Education | Jobs and Finance | Digital Inclusion | Transportation | Safety | Amenities | Climate/Earth | Law/Policy/Government. However, not every country GDB contains all 59 data layers, as this was dependent on the availability of open-source data in each SIDS. Users are encouraged to check the accompanying metadata excel file for more information on the datasets in each GDB, the vintage, and the source utilized. | This dataset contains important information and resources. For comprehensive details, documentation, and inquiries, please contact data@worldbank.org. Additional metadata and related resources are available on this page.
Facility Finder Detail Map - wmqk-akgw - Archive Repository
healthdata.gov
csv, xlsx, xml
Updated Nov 7, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). Facility Finder Detail Map - wmqk-akgw - Archive Repository [Dataset]. https://healthdata.gov/dataset/Facility-Finder-Detail-Map-wmqk-akgw-Archive-Repos/yhpq-63vq
Explore at:
xml, xlsx, csvAvailable download formats
Dataset updated
Nov 7, 2025
Description
This dataset tracks the updates made on the dataset "Facility Finder Detail Map" as a repository for previous versions of the data and metadata.
d
Data repository for PELE experiments
search.dataone.org
dataverse.harvard.edu
Updated Nov 25, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Perttu, Anna (2025). Data repository for PELE experiments [Dataset]. http://doi.org/10.7910/DVN/QAX1RU
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/QAX1RU
Dataset updated
Nov 25, 2025
Dataset provided by
Harvard Dataverse
Authors
Perttu, Anna
Description
Data repository for the data used in the thesis of A. Perttu from the experimental PDCs at the PELE facility.
d
ESS-DIVE Reporting Format for Dataset Package Metadata
search.dataone.org
knb.ecoinformatics.org
+1more
Updated Jun 11, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan (2022). ESS-DIVE Reporting Format for Dataset Package Metadata [Dataset]. http://doi.org/10.15485/1866026
Explore at:
Unique identifier
https://doi.org/10.15485/1866026
Dataset updated
Jun 11, 2022
Dataset provided by
ESS-DIVE
Authors
Deb Agarwal; Shreyas Cholia; Valerie C. Hendrix; Robert Crystal-Ornelas; Cory Snavely; Joan Damerow; Charuleka Varadharajan
Time period covered
Jan 1, 2017
Description
ESS-DIVE’s (Environmental Systems Science Data Infrastructure for a Virtual Ecosystem) dataset metadata reporting format is intended to compile information about a dataset (e.g., title, description, funding sources) that can enable reuse of data submitted to the ESS-DIVE data repository. The files contained in this dataset include instructions (dataset_metadata_guide.md and README.md) that can be used to understand the types of metadata ESS-DIVE collects. The data dictionary (dd.csv) follows ESS-DIVE’s file-level metadata reporting format and includes brief descriptions about each element of the dataset metadata reporting format. This dataset also includes a terminology crosswalk (dataset_metadata_crosswalk.csv) that shows how ESS-DIVE’s metadata reporting format maps onto other existing metadata standards and reporting formats. Data contributors to ESS-DIVE can provide this metadata by manual entry using a web form or programmatically via ESS-DIVE’s API (Application Programming Interface). A metadata template (dataset_metadata_template.docx or dataset_metadata_template.pdf) can be used to collaboratively compile metadata before providing it to ESS-DIVE. Since being incorporated into ESS-DIVE’s data submission user interface, ESS-DIVE’s dataset metadata reporting format, has enabled features like automated metadata quality checks, and dissemination of ESS-DIVE datasets onto other data platforms including Google Dataset Search and DataCite.

Facebook

Twitter

Click to copy link

Link copied

Cite

Zenodo (2023). Data Collections Explorer – An Easy-to-Use Tool for Sharing and Discovering Research Data [Dataset]. https://meta4ds.fokus.fraunhofer.de/datasets/oai-zenodo-org-8399407~~1?locale=en

Data from: Data Collections Explorer – An Easy-to-Use Tool for Sharing and Discovering Research Data

Explore at:

pdf(4797020), unknownAvailable download formats

Dataset updated

Sep 15, 2023

Dataset authored and provided by

Zenodohttp://zenodo.org/

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

There is a wide variety of archives, databases, and repositories currently available that pro-vide access to research data. However, basic information about these systems is often diffi-cult to gather, such as whether there are limits to the size of data sets that can be published or whether there is any publication fee that applies. In addition to that, there are plenty of re-search groups publishing their research data sets independently of these infrastructures, making it difficult for scientists to find them since they are not centrally registered. Research data must be easily discoverable and accessible for scientists to use it effectively. The Data Collections Explorer, developed within the national research data infrastructure for the engineering sciences NFDI4Ing, is an easy-to-use information system addressing these needs. It is a low threshold information system that provides an overview of research data repositories, archives, databases as well as individually published data sets. Similar systems exist in other subject areas, for example the Data Repository Finder focusing on the medi-cal, life and social sciences. Contrary to the Data Collections Explorer, the Data Repository Finder only lists repositories. This is the slide set for the talk as part of the "Engineering Sciences" track at the 1st Conference on Research Data Infrastructures.

Clear search

Close search

Google apps

Main menu

Data from: Data Collections Explorer – An Easy-to-Use Tool for Sharing and...

Inventory of Online Agricultural Data Repositories

Inventory of Online Public Databases and Repositories Holding Agricultural Data in 2017

Metadata Updated: March 30, 2024

Locating Restricted Data Repositories

Open Reading Frame Finder (ORF Finder) - tud7-8pud - Archive Repository

Data from: Scientific production on data repositories and open science...

Project MILDRED Research Data Repository Survey, University of Helsinki

Bear Lake Data Repository

Data from: Common Metadata Framework for Research Data Repository: Necessity...

Total number of datasets parsed per data repository and metadata standards...

Digital Collections of Colorado, DSpace Repository, Long Term Ecological...

Living Document I: Belgian mental health (care) data repository

ALW Assisted Living Facility Finder App - 8fah-cj2v - Archive Repository

Data Repository for "A Matheuristic for Complex Pricing Problems: An...

Migrating ODESI to the national Borealis data repository

Supporting dataset for: Repository optimisation & techniques to improve...

Data from: Repository Analytics and Metrics Portal (RAMP) 2021 data

SIDS - Gender Spatial Data Repository

Facility Finder Detail Map - wmqk-akgw - Archive Repository

Data repository for PELE experiments

ESS-DIVE Reporting Format for Dataset Package Metadata

Data from: Data Collections Explorer – An Easy-to-Use Tool for Sharing and Discovering Research Data