Facebook
TwitterA list of NIH-supported repositories that accept submissions of appropriate scientific research data from biomedical researchers. It includes resources that aggregate information about biomedical data and information sharing systems. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of contact for further information or inquiries can be found on the websites of the individual repositories.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This table includes repositories with at least 10 shared datasets on the site.
Facebook
TwitterA listing of NIH supported data sharing repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that aggregate information about biomedical data and information sharing systems. The table can be sorted according by name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites of the individual repositories.
Facebook
TwitterThe VHA Data Sharing Agreement Repository serves as a centralized location to collect and report on agreements that share VHA data with entities outside of VA. It provides senior management an overall view of existing data sharing agreements; fosters productive sharing of health information with VHA's external partners; and streamlines data acquisition to improve data management responsibilities overall. Agreements that VHA has established with entities within the VA are not candidates for this Repository.
Facebook
Twitterhttps://borealisdata.ca/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.5683/SP3/UPABVHhttps://borealisdata.ca/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.5683/SP3/UPABVH
Data collected from major Canadian and international research data repositories cover data storage, preservation, metadata, interchange, data file types, and other standard features used in the retention and sharing of research data. The outputs of this project primarily aim to assist in the establishment of recommended minimum requirements for a Canadian research data infrastructure. The committee also aims to further develop guidelines and criteria for the assessment and selection o f repositories for deposit of Canadian research data by researchers, data managers, librarians, archivists etc.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
United States agricultural researchers have many options for making their data available online. This dataset aggregates the primary sources of ag-related data and determines where researchers are likely to deposit their agricultural data. These data serve as both a current landscape analysis and also as a baseline for future studies of ag research data. Purpose As sources of agricultural data become more numerous and disparate, and collaboration and open data become more expected if not required, this research provides a landscape inventory of online sources of open agricultural data. An inventory of current agricultural data sharing options will help assess how the Ag Data Commons, a platform for USDA-funded data cataloging and publication, can best support data-intensive and multi-disciplinary research. It will also help agricultural librarians assist their researchers in data management and publication. The goals of this study were to
establish where agricultural researchers in the United States-- land grant and USDA researchers, primarily ARS, NRCS, USFS and other agencies -- currently publish their data, including general research data repositories, domain-specific databases, and the top journals compare how much data is in institutional vs. domain-specific vs. federal platforms determine which repositories are recommended by top journals that require or recommend the publication of supporting data ascertain where researchers not affiliated with funding or initiatives possessing a designated open data repository can publish data
Approach
The National Agricultural Library team focused on Agricultural Research Service (ARS), Natural Resources Conservation Service (NRCS), and United States Forest Service (USFS) style research data, rather than ag economics, statistics, and social sciences data. To find domain-specific, general, institutional, and federal agency repositories and databases that are open to US research submissions and have some amount of ag data, resources including re3data, libguides, and ARS lists were analysed. Primarily environmental or public health databases were not included, but places where ag grantees would publish data were considered.
Search methods
We first compiled a list of known domain specific USDA / ARS datasets / databases that are represented in the Ag Data Commons, including ARS Image Gallery, ARS Nutrition Databases (sub-components), SoyBase, PeanutBase, National Fungus Collection, i5K Workspace @ NAL, and GRIN. We then searched using search engines such as Bing and Google for non-USDA / federal ag databases, using Boolean variations of “agricultural data” /“ag data” / “scientific data” + NOT + USDA (to filter out the federal / USDA results). Most of these results were domain specific, though some contained a mix of data subjects.
We then used search engines such as Bing and Google to find top agricultural university repositories using variations of “agriculture”, “ag data” and “university” to find schools with agriculture programs. Using that list of universities, we searched each university web site to see if their institution had a repository for their unique, independent research data if not apparent in the initial web browser search. We found both ag specific university repositories and general university repositories that housed a portion of agricultural data. Ag specific university repositories are included in the list of domain-specific repositories. Results included Columbia University – International Research Institute for Climate and Society, UC Davis – Cover Crops Database, etc. If a general university repository existed, we determined whether that repository could filter to include only data results after our chosen ag search terms were applied. General university databases that contain ag data included Colorado State University Digital Collections, University of Michigan ICPSR (Inter-university Consortium for Political and Social Research), and University of Minnesota DRUM (Digital Repository of the University of Minnesota). We then split out NCBI (National Center for Biotechnology Information) repositories.
Next we searched the internet for open general data repositories using a variety of search engines, and repositories containing a mix of data, journals, books, and other types of records were tested to determine whether that repository could filter for data results after search terms were applied. General subject data repositories include Figshare, Open Science Framework, PANGEA, Protein Data Bank, and Zenodo.
Finally, we compared scholarly journal suggestions for data repositories against our list to fill in any missing repositories that might contain agricultural data. Extensive lists of journals were compiled, in which USDA published in 2012 and 2016, combining search results in ARIS, Scopus, and the Forest Service's TreeSearch, plus the USDA web sites Economic Research Service (ERS), National Agricultural Statistics Service (NASS), Natural Resources and Conservation Service (NRCS), Food and Nutrition Service (FNS), Rural Development (RD), and Agricultural Marketing Service (AMS). The top 50 journals' author instructions were consulted to see if they (a) ask or require submitters to provide supplemental data, or (b) require submitters to submit data to open repositories.
Data are provided for Journals based on a 2012 and 2016 study of where USDA employees publish their research studies, ranked by number of articles, including 2015/2016 Impact Factor, Author guidelines, Supplemental Data?, Supplemental Data reviewed?, Open Data (Supplemental or in Repository) Required? and Recommended data repositories, as provided in the online author guidelines for each the top 50 journals.
Evaluation
We ran a series of searches on all resulting general subject databases with the designated search terms. From the results, we noted the total number of datasets in the repository, type of resource searched (datasets, data, images, components, etc.), percentage of the total database that each term comprised, any dataset with a search term that comprised at least 1% and 5% of the total collection, and any search term that returned greater than 100 and greater than 500 results.
We compared domain-specific databases and repositories based on parent organization, type of institution, and whether data submissions were dependent on conditions such as funding or affiliation of some kind.
Results
A summary of the major findings from our data review:
Over half of the top 50 ag-related journals from our profile require or encourage open data for their published authors.
There are few general repositories that are both large AND contain a significant portion of ag data in their collection. GBIF (Global Biodiversity Information Facility), ICPSR, and ORNL DAAC were among those that had over 500 datasets returned with at least one ag search term and had that result comprise at least 5% of the total collection.
Not even one quarter of the domain-specific repositories and datasets reviewed allow open submission by any researcher regardless of funding or affiliation.
See included README file for descriptions of each individual data file in this dataset. Resources in this dataset:Resource Title: Journals. File Name: Journals.csvResource Title: Journals - Recommended repositories. File Name: Repos_from_journals.csvResource Title: TDWG presentation. File Name: TDWG_Presentation.pptxResource Title: Domain Specific ag data sources. File Name: domain_specific_ag_databases.csvResource Title: Data Dictionary for Ag Data Repository Inventory. File Name: Ag_Data_Repo_DD.csvResource Title: General repositories containing ag data. File Name: general_repos_1.csvResource Title: README and file inventory. File Name: README_InventoryPublicDBandREepAgData.txt
Facebook
TwitterBRADS is a repository for data and biospecimens from population health research initiatives and clinical or interventional trials designed and implemented by NICHD’s Division of Intramural Population Health Research (DIPHR). Topics include human reproduction and development, pregnancy, child health and development, and women’s health. The website is maintained by DIPHR.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This poster is for RDA P19 poster exhibition
Although there is yet a holistic national level support in Taiwan on the pursue of excellence in research data management, however, a culture of research data management is starting to take shape. As a research data repository operating in Taiwan, we report in this poster our work in helping the advance of good research data practices in Taiwan.
The depositar is a general-purpose data repository open to all for the deposit, discovery, and reuse of research data. It has been in service since early 2018. Its development has been supported by Academia Sinica and, in part, by a grant from Taiwan’s Ministry of Science and Technology. In addition to developing and operating the repository, since early 2019 the depositar team has been active in advocating good research data practices in Taiwan. From the perspective of depositar, researchers in Taiwan will be more likely to share data—hence to deposit data to depositar or to any other data repositories—when their data is well managed and in a state ready to be reused and shared. The funding we receive from the Ministry of Science and Technology also has a focus on facilitating better research data management in Taiwan (though initially only applied to grants awarded in the area of sustainable development research).
For the last few years, the depositar team has been working to cultivate a culture of research data management in Taiwan. We hold co-learning workshops where domain experts share their practices in managing research data. We work closely with several research projects about implementing data management plans. Above all, we strive to produce and make available guidelines and toolkits on research data management and on using research data repositories. At the same time we constantly improve the functionalities of depositar in response to the feedback we received from our users and from the above activities.
This poster will report on these activities and the lessons we have learned. We will also reflect on the strategy aspects of advocating for good research data practices, especially in the settings of limited resources and/or missing policies.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset (anonymized transcripts that were coded based on themes) is from focus groups with scientists (n=25) from five disciplines (atmospheric and earth science, computer science, chemistry, ecology, and neuroscience), where we asked questions about data management to lead into a discussion of what features they think are necessary to include in data repository systems and services to help them implement the data sharing and preservation parts of their data management plans. Participants identified metadata quality control and training as problem areas in data management. Additionally, participants discussed several desired repository features, including: metadata control, data traceability, security, stable infrastructure, and data use restrictions. The dataset was created using MAXQDA and has the .mx20 file format extent.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains article metadata and information about Open Science Indicators for approximately 139,000 research articles published in PLOS journals from 1 January 2018 to 30 March 2025 and a set of approximately 28,000 comparator articles published in non-PLOS journals. This is the tenth release of this dataset, which will be updated with new versions on an annual basis.This version of the Open Science Indicators dataset shares the indicators seen in the previous versions as well as fully operationalised protocols and study registration indicators, which were previously only shared in preliminary forms. The v10 dataset focuses on detection of five Open Science practices by analysing the XML of published research articles:Sharing of research data, in particular data shared in data repositoriesSharing of codePosting of preprintsSharing of protocolsSharing of study registrationsThe dataset provides data and code generation and sharing rates, the location of shared data and code (whether in Supporting Information or in an online repository). It also provides preprint, protocol and study registration sharing rates as well as details of the shared output, such as publication date, URL/DOI/Registration Identifier and platform used. Additional data fields are also provided for each article analysed. This release has been run using an updated preprint detection method (see OSI-Methods-Statement_v10_Jul25.pdf for details). Further information on the methods used to collect and analyse the data can be found in Documentation.Further information on the principles and requirements for developing Open Science Indicators is available in https://doi.org/10.6084/m9.figshare.21640889.Data folders/filesData Files folderThis folder contains the main OSI dataset files PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv, which containdescriptive metadata, e.g. article title, publication data, author countries, is taken from the article .xml filesadditional information around the Open Science Indicators derived algorithmicallyand the OSI-Summary-statistics_v10_Jul25.xlsx file contains the summary data for both PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv.Documentation folderThis file contains documentation related to the main data files. The file OSI-Methods-Statement_v10_Jul25.pdf describes the methods underlying the data collection and analysis. OSI-Column-Descriptions_v10_Jul25.pdf describes the fields used in PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv. OSI-Repository-List_v1_Dec22.xlsx lists the repositories and their characteristics used to identify specific repositories in the PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv repository fields.The folder also contains documentation originally shared alongside the preliminary versions of the protocols and study registration indicators in order to give fuller details of their detection methods.Note on the accuracy of the protocols indicator (January 2026)It has come to our attention that the accuracy rates for the protocols indicator as reported in the Protocols_Methods_Statement_Sep23.pdf file are incorrect. The groundtruth exercise for protocols incorrectly identified 19 articles as having protocols in the PLOS corpus and 21 in the Comparator corpus during the manual exercise. These have been corrected in the groundtruth dataset. The accuracy rates for the PLOS and Comparator corpus are therefore 94% and 97% respectively. Documentation will be updated in the next scheduled release (anticipated in July 2026).Contact details for further information:Iain Hrynaszkiewicz, Director, Open Research Solutions, PLOS, ihrynaszkiewicz@plos.org / plos@plos.orgLauren Cadwallader, Open Research Manager, PLOS, lcadwallader@plos.org / plos@plos.orgAcknowledgements:Thanks to Allegra Pearce, Tim Vines, Asura Enkhbayar, Scott Kerr and parth sarin of DataSeer for contributing to data acquisition and supporting information.
Facebook
TwitterADR provides an authoritative data store for shared administrative, demographic, enrollment, and eligibility information which is managed as a corporate asset. This administrative database system offers mission-critical database support for all VA Medical 21st Century Core applications such as Enrollment Systems, Identity Management System, Community Care Program, Veterans's Choice program, President's Affordable Care Act project, Patient Advocacy Tracking System, Veterans 360, and others.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
An Effective Research data management (RDM) practices associated with more benefits particularly in increasing research impact, enabling the data reuse, data preservation and archive for future use. To maintain research integrity, researcher hold the responsivity to practice the effective research data management with data which they have been generate while pursuing research activities, but researcher needs the supports from either libraries or universities to establish proper data management practices. Therefore authors of aimed to conduct an assessment to understand the researcher perceptions, current practices on research data management and their willingness to share data if libraries or universities offer the some support in managing and storing research data. The study undertaken the online survey method to collect the data and it includes 34 researchers’ perceptions on research data generation, various types and formats of research data generated, infrastructure for data storage and archive, practices of research data sharing, awareness on research data repositories etc. The results provided most of researcher generate the research data irrespective of various types & formats, but they are not more aware about proper research data management and sharing though researcher have been preserving research data using offline infrastructure like desktop, pen drives, hard disks, CD/DVDs etc. and most of researcher very much keen to avail under the library support in managing research data. The study helps a better understanding of the current practices and needs of researcher at Central University of Gujarat about research data management and sharing. From the study results author provides suggestions for improve the researcher awareness on research data management by conducting training, consulting, special lectures, seminars and workshops of research data management, sharing, and storage.A survey with simple online questionnaire method was used for this study to collecting the data regarding researchers’ perceptions on research data management in the Central University of Gujarat (CUG). The authors set out to the following questions in online questionnaire:
· Have you generated any research data while you are pursuing research activities?
· Roughly what kind/ type of research data generated or created while you are pursuing research activities?
·
What format and volume of research data
generated?
·
Where do you store your research data for
future use?
·
Does the practice of research data sharing
exist in your discipline/school/centre/ department?
·
How important is research data sharing and
make available for free access?
·
Are you aware of any research data
repositories?
·
Do you have submitted your research data
to any following research data repositories?
·
Do you feel that the libraries should
offer some support in managing, storing and archiving your research data for
future use?
Mail communication channel was used to send online questionnaire to the researchers and there were three attempts to get the responses. The study involves the Thirty Four (34) researchers’ responses from various schools and department of Central University of Gujarat. Statistical tools were used to analyse & interpret the collected data from survey method.
Facebook
TwitterThe journals’ author guidelines and/or editorial policies were examined on whether they take a stance with regard to the availability of the underlying data of the submitted article. The mere explicated possibility of providing supplementary material along with the submitted article was not considered as a research data policy in the present study. Furthermore, the present article excluded source codes or algorithms from the scope of the paper and thus policies related to them are not included in the analysis of the present article.
For selection of journals within the field of neurosciences, Clarivate Analytics’ InCites Journal Citation Reports database was searched using categories of neurosciences and neuroimaging. From the results, journals with the 40 highest Impact Factor (for the year 2017) indicators were extracted for scrutiny of research data policies. Respectively, the selection journals within the field of physics was created by performing a similar search with the categories of physics, applied; physics, atomic, molecular & chemical; physics, condensed matter; physics, fluids & plasmas; physics, mathematical; physics, multidisciplinary; physics, nuclear and physics, particles & fields. From the results, journals with the 40 highest Impact Factor indicators were again extracted for scrutiny. Similarly, the 40 journals representing the field of operations research were extracted by using the search category of operations research and management.
Journal-specific data policies were sought from journal specific websites providing journal specific author guidelines or editorial policies. Within the present study, the examination of journal data policies was done in May 2019. The primary data source was journal-specific author guidelines. If journal guidelines explicitly linked to the publisher’s general policy with regard to research data, these were used in the analyses of the present article. If journal-specific research data policy, or lack of, was inconsistent with the publisher’s general policies, the journal-specific policies and guidelines were prioritized and used in the present article’s data. If journals’ author guidelines were not openly available online due to, e.g., accepting submissions on an invite-only basis, the journal was not included in the data of the present article. Also journals that exclusively publish review articles were excluded and replaced with the journal having the next highest Impact Factor indicator so that each set representing the three field of sciences consisted of 40 journals. The final data thus consisted of 120 journals in total.
‘Public deposition’ refers to a scenario where researcher deposits data to a public repository and thus gives the administrative role of the data to the receiving repository. ‘Scientific sharing’ refers to a scenario where researcher administers his or her data locally and by request provides it to interested reader. Note that none of the journals examined in the present article required that all data types underlying a submitted work should be deposited into a public data repositories. However, some journals required public deposition of data of specific types. Within the journal research data policies examined in the present article, these data types are well presented by the Springer Nature policy on “Availability of data, materials, code and protocols” (Springer Nature, 2018), that is, DNA and RNA data; protein sequences and DNA and RNA sequencing data; genetic polymorphisms data; linked phenotype and genotype data; gene expression microarray data; proteomics data; macromolecular structures and crystallographic data for small molecules. Furthermore, the registration of clinical trials in a public repository was also considered as a data type in this study. The term specific data types used in the custom coding framework of the present study thus refers to both life sciences data and public registration of clinical trials. These data types have community-endorsed public repositories where deposition was most often mandated within the journals’ research data policies.
The term ‘location’ refers to whether the journal’s data policy provides suggestions or requirements for the repositories or services used to share the underlying data of the submitted works. A mere general reference to ‘public repositories’ was not considered a location suggestion, but only references to individual repositories and services. The category of ‘immediate release of data’ examines whether the journals’ research data policy addresses the timing of publication of the underlying data of submitted works. Note that even though the journals may only encourage public deposition of the data, the editorial processes could be set up so that it leads to either publication of the research data or the research data metadata in conjunction to publishing of the submitted work.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Repositories with at least 2 DOIs that fail to resolve.
Facebook
TwitterData curation and data deposit workflows for the University of Guelph Research Data Repositories. These documents describe the internal workflows for the University of Guelph Library's data repository service. Please note that these are dynamic documents and are updated as required.
Facebook
TwitterOpen data sharing improves the quality and reproducibility of scholarly research. Open data are defined as data that can be freely used, reused, and redistributed by anyone. Open data sharing democratizes science by making data more equitably available throughout the world regardless of funding or access to other resources necessary for generating cutting-edge data. For an interdisciplinary field like biological anthropology, data sharing is critical since one person cannot easily collect data across multiple domains. The goal of this paper is to encourage broader data sharing by exploring the state of data sharing in the field of biological anthropology. Our paper is divided into four parts: the first section describes the benefits, challenges, and emerging solutions to open data sharing; the second section presents the results of our data archiving and sharing survey that was completed by over 700 researchers; the third section presents personal experiences of data sharing by the auth...
Facebook
TwitterObjective To characterise experiences using clinical research data shared through the National Institutes of Health (NIH)'s Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC) clinical research data repository, along with data recipients’ perceptions of the value, importance and challenges with using BioLINCC data. Design and setting Cross-sectional web-based survey. Participants All investigators who requested and received access to clinical research data from BioLINCC between 2007 and 2014. Main outcome measures Reasons for BioLINCC data request, research project plans, interactions with original study investigators, BioLINCC experience and other project details. Results There were 536 investigators who requested and received access to clinical research data from BioLINCC between 2007 and 2014. Of 441 potential respondents, 195 completed the survey (response rate=44%); 89% (n=174) requested data for an independent study, 17% (n=33) for pilot/preliminary analysis. Commonly cited reasons for requesting data through BioLINCC were feasibility of collecting data of similar size and scope (n=122) and insufficient financial resources for primary data collection (n=76). For 95% of respondents (n=186), a primary research objective was to complete new research, as opposed to replicate prior analyses. Prior to requesting data from BioLINCC, 18% (n=36) of respondents had contacted the original study investigators to obtain data, whereas 24% (n=47) had done so to request collaboration. Nearly all (n=176; 90%) respondents found the data to be suitable for their proposed project; among those who found the data unsuitable (n=19; 10%), cited reasons were data too complicated to use (n=5) and data poorly organised (n=5). Half (n=98) of respondents had completed their proposed projects, of which 67% (n=66) have been published. Conclusions Investigators were primarily using clinical research data from BioLINCC for independent research, making use of data that would otherwise have not been feasible to collect.
Facebook
TwitterArchive for storing and sharing digital data (and accompanying documentation) generated or collected through qualitative and multi method research in social sciences. QDR provides data management consulting services and actively curates all data projects, maintaining value and usefulness of data over time, and ensuring their availability and findability for re-use.
Facebook
TwitterA listing of NIH supported data sharing repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that aggregate information about biomedical data and information sharing systems. The table can be sorted according by name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites of the individual repositories.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
• Exploring the factors affecting the bike-sharing demand on an urban area using multiscale geographically weighted regression (MGWR) • Establishing the bike sharing demand prediction model incorporating the machine learning techniques
Facebook
TwitterA list of NIH-supported repositories that accept submissions of appropriate scientific research data from biomedical researchers. It includes resources that aggregate information about biomedical data and information sharing systems. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of contact for further information or inquiries can be found on the websites of the individual repositories.