100+ datasets found
  1. h

    Repository-Dataset

    • huggingface.co
    Updated Feb 14, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vicky A (2025). Repository-Dataset [Dataset]. https://huggingface.co/datasets/Mr-Vicky-01/Repository-Dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 14, 2025
    Authors
    vicky A
    Description

    Mr-Vicky-01/Repository-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  2. NSF Public Access Repository

    • catalog.data.gov
    Updated Sep 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Science Foundation (2021). NSF Public Access Repository [Dataset]. https://catalog.data.gov/dataset/nsf-public-access-repository
    Explore at:
    Dataset updated
    Sep 19, 2021
    Dataset provided by
    National Science Foundationhttp://www.nsf.gov/
    Description

    The NSF Public Access Repository contains an initial collection of journal publications and the final accepted version of the peer-reviewed manuscript or the version of record. To do this, NSF draws upon services provided by the publisher community including the Clearinghouse of Open Research for the United States, CrossRef, and International Standard Serial Number. When clicking on a Digital Object Identifier number, you will be taken to an external site maintained by the publisher. Some full text articles may not be available without a charge during the embargo, or administrative interval. Some links on this page may take you to non-federal websites. Their policies may differ from this website.

  3. d

    Research Data Repository Requirements and Features Review

    • dataone.org
    • borealisdata.ca
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leahey, Amber; Webster, Peter; Austin, Claire; Fong, Nancy; Friddell, Julie; Humphrey, Chuck; Brown, Susan; Stewart, Walter (2023). Research Data Repository Requirements and Features Review [Dataset]. https://dataone.org/datasets/sha256%3A62d6afa13f8bbcc2db9a10232ee31799621432d36e61ba14e43e33ef0d004c92
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Leahey, Amber; Webster, Peter; Austin, Claire; Fong, Nancy; Friddell, Julie; Humphrey, Chuck; Brown, Susan; Stewart, Walter
    Time period covered
    Sep 1, 2014 - Feb 1, 2015
    Description

    Data collected from major Canadian and international research data repositories cover data storage, preservation, metadata, interchange, data file types, and other standard features used in the retention and sharing of research data. The outputs of this project primarily aim to assist in the establishment of recommended minimum requirements for a Canadian research data infrastructure. The committee also aims to further develop guidelines and criteria for the assessment and selection o f repositories for deposit of Canadian research data by researchers, data managers, librarians, archivists etc.

  4. How to choose a research data repository software? Experience report. Table...

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Feb 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nina Buck; Nina Buck; Volodymyr Kushnarenko; Volodymyr Kushnarenko; Björn Schembera; Björn Schembera; Mona Ulrich; Mona Ulrich; Heinz Werner Kramski; Heinz Werner Kramski; Andreas Ganzenmüller; Jan Hess; Jan Hess; Alexander Holz; Alexander Holz; André Blessing; André Blessing; Pascal Hein; Kerstin Jung; Kerstin Jung; Nicolas Schenk; Nicolas Schenk; Claus-Michael Schlesinger; Claus-Michael Schlesinger; Thomas Bönisch; Thomas Bönisch; Roland S. Kamzelak; Roland S. Kamzelak; Jonas Kuhn; Jonas Kuhn; Gabriel Viehhauser; Gabriel Viehhauser; Andreas Ganzenmüller; Pascal Hein (2023). How to choose a research data repository software? Experience report. Table of requirements. [Dataset]. http://doi.org/10.5281/zenodo.7656574
    Explore at:
    binAvailable download formats
    Dataset updated
    Feb 22, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nina Buck; Nina Buck; Volodymyr Kushnarenko; Volodymyr Kushnarenko; Björn Schembera; Björn Schembera; Mona Ulrich; Mona Ulrich; Heinz Werner Kramski; Heinz Werner Kramski; Andreas Ganzenmüller; Jan Hess; Jan Hess; Alexander Holz; Alexander Holz; André Blessing; André Blessing; Pascal Hein; Kerstin Jung; Kerstin Jung; Nicolas Schenk; Nicolas Schenk; Claus-Michael Schlesinger; Claus-Michael Schlesinger; Thomas Bönisch; Thomas Bönisch; Roland S. Kamzelak; Roland S. Kamzelak; Jonas Kuhn; Jonas Kuhn; Gabriel Viehhauser; Gabriel Viehhauser; Andreas Ganzenmüller; Pascal Hein
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the age of digital transformation, scientific and social interest for data and data products is constantly on the rise. The quantity as well as the variety of digital research data is increasing significantly. This raises the question about the governance of this data. For example, how to store the data so that it is presented transparently, freely accessible and subsequently available for re-use in the context of good scientific practice. Research data repositories provide solutions to these issues.

    Considering the variety of repository software, it is sometimes difficult to identify a fitting solution for a specific use case. For this purpose a detailed analysis of existing software is needed. Presented table of requirements can serve as a starting point and decision-making guide for choosing the most suitable for your purposes repository software. This table is dealing as a supplementary material for the paper "How to choose a research data repository software? Experience report." (persistent identifier to the paper will be added as soon as paper is published).

  5. d

    Directory of Public Repositories of Geological Materials

    • catalog.data.gov
    • data.usgs.gov
    Updated Jul 6, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2024). Directory of Public Repositories of Geological Materials [Dataset]. https://catalog.data.gov/dataset/directory-of-public-repositories-of-geological-materials
    Explore at:
    Dataset updated
    Jul 6, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    Overview This directory was developed to provide discovery information for anyone looking for publicly accessible repositories that house geological materials in the U.S. and Canada. In addition, this resource is intended to be a tool to facilitate a community of practice. The need for the directory was identified during planning for and follow-up from a drill core repository webinar series in Spring 2020 for public repository curators and staff in the U.S. and Canada hosted by the Minnesota Geological Survey and the Minnesota Department of Natural Resources. Additional supporting sponsors included the U.S. Geological Survey National Geological and Geophysical Data Preservation Program and the Association of American State Geologists Data Preservation Committee. The 10-part webinar series provided overviews of state, provincial, territorial, and national repositories that house drill core, other geoscience materials, and data. When the series concluded a small working group of the participants continued to meet to facilitate the development and production of a directory of repositories that maintain publicly-accessible geological materials throughout the U.S. and Canada. The group used previous directory efforts described in the next section, Summary of Historical Repository Directory Compilation Efforts, as guides for content during development. The working group prepared and compiled responses from a call for repository information and characterization. This directory is planned to be a living resource for the geoscience community with updates every other year to accommodate changes. The updates will facilitated through versioned updates of this data release. Summary of Historical Repository Directory Compilation Efforts 1957 – Sample and Core Repositories of the United States, Alaska, and Canada. Published by AAPG. Committee on Preservation of Samples and Cores. 13 members from industry, academia, and government. 1977 – Well-Sample and Core Repositories of the Unites States and Canada, C.K. Fisher; M.P. Krupa, USGS Open file report 77-567.USGS wanted to update the original index. Includes a map showing core repositories by “State” “University” “Commercial” and “Federal”. Also includes a “Brief Statement of Requirements for the Preservation of Subsurface Material and Data” and referral to state regulations for details on preserved materials. 1984 - Nonprofit Sample and Core Repositories Open to the Public in the United States – USGS Circular 942. James Schmoker, Thomas Michalski, Patricia Worl. The survey was conducted by a questionnaire mailed to repository curators. Information on additions, corrections, and deletions to earlier (1957,1977) directories from state geologists, each state office of the Water Resources Division of the U.S. Geological Survey, additional government agencies and colleagues were also used. 1997 - The National Directory of Geoscience Data Repositories, edited by Nicholas H. Claudy – American Geologic Institute. To prepare the directory, questionnaires were mailed to state geologists, more than 60 geological societies, private-sector data centers selected from oil and gas directories, and to the membership committee of the American Association of Petroleum Geologists, one of AGI's member societies. The directory contains 124 repository listings, organized alphabetically by state. 2002 – National Research Council 2002. Geoscience Data and Collections: National resources in Peril. Washington, D.C.: The National Academies Press 2005 – The National Geological and Geophysical Data Preservation Program (NGGDPP) of the United States Geological Survey (USGS) was established by The Energy Policy Act of 2005, and reauthorized in the Consolidated Appropriations Act, 2021, “to preserve and expose the Nation’s geoscience collections (samples, logs, maps, data) to promote their discovery and use for research and resource development”. The Program provides “technical and financial assistance to state geological surveys and U.S. Department of the Interior (DOI) bureaus” to archive “geological, geophysical, and engineering data, maps, photographs, samples, and other physical specimens”. Metadata records describing the preserved assets are cataloged in the National Digital Catalog (NDC). References American Association of Petroleum Geologists, 1957, Sample and core repositories of the United States, Alaska, and Canada: American Association of Petroleum Geologists, Committee on Preservation of Samples and Cores, 29 p. American Association of Petroleum Geologists, 2018, US Geological Sample and Data Repositories: American Association of Petroleum Geologists, Preservation of Geoscience Data Committee, Unpublished, (Contact: AAPG Preservation of Geoscience Data Committee) American Geological Institute, 1997, National Geoscience Data Repository System, Phase II. Final report, January 30, 1995--January 28, 1997. United States. https://doi.org/10.2172/598388 American Geological Institute, 1997, National Directory of Geoscience Data Repositories, Claudy, N. H., (ed.), 91pp. Claudy N., Stevens D., 1997, AGI Publishes first edition of national directory of geoscience data repositories. American Geological Institute Spotlight, https://www.agiweb.org/news/datarep2.html Consolidated Appropriations Act, 2021 (Public Law 116-260, Sec.7002) Davidson, E. D., Jr., 1981, A look at core and sample libraries: Bureau of Economic Geology, The University of Texas at Austin, 4 p. and Appendix. Deep Carbon Observatory (DCO) Data Portal, Scientific Collections, https://info.deepcarbon.net/vivo/scientific-collections; Keyword Search: sample repository, https://info.deepcarbon.net/vivo/scientific-collections?source=%7B%22query%22%3A%7B%22query_string%22%3A%7B%22query%22%3A%22sample%20repository%20%22%2C%22default_operator%22%3A%22OR%22%7D%7D%2C%22sort%22%3A%5B%7B%22_score%22%3A%7B%22order%22%3A%22asc%22%7D%7D%5D%2C%22from%22%3A0%2C%22size%22%3A200%7D: Accessed September 29, 2021 Fisher, C. K., and Krupa, M. P., 1977, Well-sample and core repositories of the United States and Canada: U.S. Geological Survey Open-File Report 77-567, 73 p. https://doi.org/10.3133/ofr77567 Fogwill, W.D., 1985, Drill Core Collection and Storage Systems in Canada, Manitoba Energy & Mines. https://www.ngsc-cptgs.com/files/PGJSpecialReport_1985_V03b.pdf Goff, S., and Heiken, G., eds., 1982, Workshop on core and sample curation for the National Continental Scientific Drilling Program: Los Alamos National Laboratory, May 5-6, 1981, LA-9308-C, 31 p. https://www.osti.gov/servlets/purl/5235532 Lonsdale, J. T., 1953, On the preservation of well samples and cores: Oklahoma City Geological Society Shale Shaker, v. 3, no. 7, p. 4. National Geological and Geophysical Data Preservation Program. https://www.usgs.gov/core-science-systems/national-geological-and-geophysical-data-preservation-program National Research Council. 2002. Geoscience Data and Collections: National Resources in Peril. Washington, DC: The National Academies Press, 107 p. https://doi.org/10.17226/10348 Pow, J. R., 1969, Core and sample storage in western Canada: Bulletin of Canadian Petroleum Geology, v. 17, no. 4, p. 362-369. DOI: 10.35767/gscpgbull.17.4.362 Ramdeen, S., 2015. Preservation challenges for geological data at state geological surveys, GeoResJ 6 (2015) 213-220, https://doi.org/10.1016/j.grj.2015.04.002 Schmoker, J. W., Michalski, T. C., and Worl, P. B., 1984, Nonprofit sample and core repositories of the United States: U.S. Geological Survey Circular 942. https://doi.org/10.3133/cir942 Schmoker, J. W., Michalski, T. C., and Worl, P. B., 1984, Addresses, telephone numbers, and brief descriptions of publicly available, nonprofit sample and core repositories of the United States: U.S. Geological Survey Open-File Report 84-333, 13 p. (Superseded by USGS Circular 942) https://doi.org/10.3133/ofr84333 The Energy Policy Act of 2005 (Public Law 109-58, Sec. 351) The National Digital Catalog (NDC). https://www.usgs.gov/core-science-systems/national-geological-and-geophysical-data-preservation-program/national-digital U.S. Bureau of Mines, 1978, CORES Operations Manual: Bureau of Mines Core Repository System: U.S. Bureau of Mines Information Circular IC 8784, 118 p. https://digital.library.unt.edu/ark:/67531/metadc170848/

  6. Z

    GitHub developer behavior and repository evolution dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Feb 7, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ShengyuZhao (2020). GitHub developer behavior and repository evolution dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3648083
    Explore at:
    Dataset updated
    Feb 7, 2020
    Dataset provided by
    TianyiZhou
    ShengyuZhao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In this work, based on GitHub Archive project and repository mining tools, we process all available data into concise and structured format to generate GitHub developer behavior and repository evolution dataset. With the self-configurable interactive analysis tool provided by us, it will give us a macroscopic view of open source ecosystem evolution.

  7. H

    Bear Lake Data Repository

    • hydroshare.org
    • search.dataone.org
    zip
    Updated Sep 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeff Nielson; Katie Wadsworth (2024). Bear Lake Data Repository [Dataset]. https://www.hydroshare.org/resource/444e4bd2940e47e6bcab5e7966a929fe
    Explore at:
    zip(154.6 MB)Available download formats
    Dataset updated
    Sep 9, 2024
    Dataset provided by
    HydroShare
    Authors
    Jeff Nielson; Katie Wadsworth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Bear Lake
    Description

    The Bear Lake Data Repository (BLDR) is an active archive, containing a growing compilation of biological, chemical, and physical datasets collected from Bear Lake and its surrounding watershed. The datasets herein have been digitized from historical records and reports, extracted from papers and theses, and obtained from public and private entities, including the United States Geological Survey, PacifiCorp, and, inter alia, Ecosystems Research Institute.

    Contributions are welcome. The BLDR accepts biological, chemical, or physical datasets obtained at Bear Lake, irrespective of funding source. There is no submission size limit at present—workarounds will be found if submissions exceed Hydroshare limits (20 GB). Contributions are published with an open access license and will serve many use cases. The current repository steward, Bear Lake Watch, will advise on submissions and make accepted contributions available promptly.

    Metadata files are provided for each dataset, however, contact with original contributor(s) is encouraged for questions and additional details prior to data usage. The BLDR and its contributors shall not be liable for any damages resulting from misinterpretation or misuse of the data or metadata.

  8. i

    Cellular Data Repository

    • ieee-dataport.org
    Updated Jun 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stefano Savazzi (2025). Cellular Data Repository [Dataset]. https://ieee-dataport.org/documents/cellular-data-repository
    Explore at:
    Dataset updated
    Jun 17, 2025
    Authors
    Stefano Savazzi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    and how these can be affected by the presence of human body nearby.

  9. Dataset supporting "Are data repositories fettered? A survey of current...

    • figshare.com
    txt
    Updated Mar 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nushrat Khan; Mike Thelwall; Kayvan Kousha (2022). Dataset supporting "Are data repositories fettered? A survey of current practices, challenges and future technologies" [Dataset]. http://doi.org/10.6084/m9.figshare.14191739.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Mar 1, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Nushrat Khan; Mike Thelwall; Kayvan Kousha
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains 189 survey responses from a respository/ data managers' survey where we explored the current status, needs and challenges of research data repositories.

  10. Administrative Data Repository (ADR)

    • catalog.data.gov
    • datahub.va.gov
    Updated Aug 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Veterans Affairs (2025). Administrative Data Repository (ADR) [Dataset]. https://catalog.data.gov/dataset/administrative-data-repository-adr
    Explore at:
    Dataset updated
    Aug 2, 2025
    Dataset provided by
    United States Department of Veterans Affairshttp://va.gov/
    Description

    The Administrative Data Repository (ADR) was established to provide support for the administrative data elements relative to multiple categories of a person entity such as demographic and eligibility information. Although initially focused on the computing needs of the Veterans Health Administration, the ADR is positioned to provide identity management and demographics support for all IT systems within the Department of Veterans Affairs.

  11. Smart network repository based on Neo4j native graph database

    • catalog.data.gov
    • s.cnmilf.com
    Updated Feb 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2024). Smart network repository based on Neo4j native graph database [Dataset]. https://catalog.data.gov/dataset/smart-network-repository-based-on-neo4j-native-graph-database-f6ffc
    Explore at:
    Dataset updated
    Feb 1, 2024
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    Description

    To address the increasing complexity of network management and the limitations of data repositories in handling the various network operational data, this paper proposes a novel repository design that uniformly represents network operational data while allowing for a multiple abstractions access to the information. This smart repository simplifies network management functions by enabling network verification directly within the repository. The data is organized in a knowledge graph compatible with any general-purpose graph database, offering a comprehensive and extensible network repository. Performance evaluations confirm the feasibility of the proposed design. The repository's ability to natively support 'what-if' scenario evaluation is demonstrated by verifying Border Gateway Protocol (BGP) route policies and analyzing forwarding behavior with virtual Traceroute.

  12. I

    Language values for DataCite dataset records

    • databank.illinois.edu
    Updated Jun 23, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elizabeth Wickes (2016). Language values for DataCite dataset records [Dataset]. http://doi.org/10.13012/B2IDB-1065549_V1
    Explore at:
    Dataset updated
    Jun 23, 2016
    Authors
    Elizabeth Wickes
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset was extracted from a set of metadata files harvested from the DataCite metadata store (http://search.datacite.org/ui) during December 2015. Metadata records for items with a resourceType of dataset were collected. 1,647,949 total records were collected. This dataset contains four files: 1) readme.txt: a readme file. 2) language-results.csv: A CSV file containing three columns: DOI, DOI prefix, and language text contents 3) language-counts.csv: A CSV file containing counts for unique language text content values. 4) language-grouped-counts.txt: A text file containing the results of manually grouping these language codes.

  13. PLOS Open Science Indicators

    • plos.figshare.com
    zip
    Updated Jul 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Public Library of Science (2025). PLOS Open Science Indicators [Dataset]. http://doi.org/10.6084/m9.figshare.21687686.v10
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 10, 2025
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Public Library of Science
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains article metadata and information about Open Science Indicators for approximately 139,000 research articles published in PLOS journals from 1 January 2018 to 30 March 2025 and a set of approximately 28,000 comparator articles published in non-PLOS journals. This is the tenth release of this dataset, which will be updated with new versions on an annual basis.This version of the Open Science Indicators dataset shares the indicators seen in the previous versions as well as fully operationalised protocols and study registration indicators, which were previously only shared in preliminary forms. The v10 dataset focuses on detection of five Open Science practices by analysing the XML of published research articles:Sharing of research data, in particular data shared in data repositoriesSharing of codePosting of preprintsSharing of protocolsSharing of study registrationsThe dataset provides data and code generation and sharing rates, the location of shared data and code (whether in Supporting Information or in an online repository). It also provides preprint, protocol and study registration sharing rates as well as details of the shared output, such as publication date, URL/DOI/Registration Identifier and platform used. Additional data fields are also provided for each article analysed. This release has been run using an updated preprint detection method (see OSI-Methods-Statement_v10_Jul25.pdf for details). Further information on the methods used to collect and analyse the data can be found in Documentation.Further information on the principles and requirements for developing Open Science Indicators is available in https://doi.org/10.6084/m9.figshare.21640889.Data folders/filesData Files folderThis folder contains the main OSI dataset files PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv, which containdescriptive metadata, e.g. article title, publication data, author countries, is taken from the article .xml filesadditional information around the Open Science Indicators derived algorithmicallyand the OSI-Summary-statistics_v10_Jul25.xlsx file contains the summary data for both PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv.Documentation folderThis file contains documentation related to the main data files. The file OSI-Methods-Statement_v10_Jul25.pdf describes the methods underlying the data collection and analysis. OSI-Column-Descriptions_v10_Jul25.pdf describes the fields used in PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv. OSI-Repository-List_v1_Dec22.xlsx lists the repositories and their characteristics used to identify specific repositories in the PLOS-Dataset_v10_Jul25.csv and Comparator-Dataset_v10_Jul25.csv repository fields.The folder also contains documentation originally shared alongside the preliminary versions of the protocols and study registration indicators in order to give fuller details of their detection methods.Contact details for further information:Iain Hrynaszkiewicz, Director, Open Research Solutions, PLOS, ihrynaszkiewicz@plos.org / plos@plos.orgLauren Cadwallader, Open Research Manager, PLOS, lcadwallader@plos.org / plos@plos.orgAcknowledgements:Thanks to Allegra Pearce, Tim Vines, Asura Enkhbayar, Scott Kerr and parth sarin of DataSeer for contributing to data acquisition and supporting information.

  14. Privacy Impact Assessment (PIA) Repository

    • catalog.data.gov
    • data.va.gov
    • +3more
    Updated Aug 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Veterans Affairs (2025). Privacy Impact Assessment (PIA) Repository [Dataset]. https://catalog.data.gov/dataset/privacy-impact-assessment-pia-repository
    Explore at:
    Dataset updated
    Aug 2, 2025
    Dataset provided by
    United States Department of Veterans Affairshttp://va.gov/
    Description

    This repository contains Privacy Impact Assessments (PIA) that have been vetted/approved. Section 208 of the Electronic Government Act of 2002 (E-Gov Act) requires federal agencies to conduct a Privacy Impact Assessment (PIA) on information technology systems that either (1) collect, maintain, and/or disseminate Personally Identifiable Information (PII); or (2) make substantial changes to existing technology that manages PII.. A PIA is an analysis of how PII is collected, stored, protected, shared, and managed. Its purpose is to demonstrate that the information system owners and/or developers have incorporated privacy protections throughout the lifecycle of the system. Absent a legitimate security or sensitivity concern demonstrated by the agency, the E-Gov Act requires agencies to make PIAs publicly available.The Department of Veterans Affairs (VA) Office of Privacy, FOIA, and Records Management strive to be the leader among Federal Information Technology (IT) organizations in protecting the privacy of Veterans' personal data. Our goal is to respond to mandates to meet the needs of Veterans, while improving the protection of their personal information.To view a PIA for a particular system, go to the heading labeled 'System Operating with a Current/Valid FY2013 PIA' and select the name of the program/project of interest.

  15. d

    Biospecimen Repository Access and Data Sharing (BRADS)

    • catalog.data.gov
    • healthdata.gov
    • +3more
    Updated Jul 26, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). Biospecimen Repository Access and Data Sharing (BRADS) [Dataset]. https://catalog.data.gov/dataset/biospecimen-repository-access-and-data-sharing-brads
    Explore at:
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    BRADS is a repository for data and biospecimens from population health research initiatives and clinical or interventional trials designed and implemented by NICHD’s Division of Intramural Population Health Research (DIPHR). Topics include human reproduction and development, pregnancy, child health and development, and women’s health. The website is maintained by DIPHR.

  16. Data from: Towards an Ideal Methodological Data Repository: Lessons and...

    • zenodo.org
    csv
    Updated Oct 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rachel Longjohn; Rachel Longjohn; Markelle Kelly; Markelle Kelly; Padhraic Smyth; Sameer Singh; Padhraic Smyth; Sameer Singh (2023). Towards an Ideal Methodological Data Repository: Lessons and Recommendations [Dataset]. http://doi.org/10.5281/zenodo.6645277
    Explore at:
    csvAvailable download formats
    Dataset updated
    Oct 1, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Rachel Longjohn; Rachel Longjohn; Markelle Kelly; Markelle Kelly; Padhraic Smyth; Sameer Singh; Padhraic Smyth; Sameer Singh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Our dataset "repository_survey" summarizes a comprehensive survey of over 150 data repositories, characterizing their metadata documentation and standardization, data curation and validation, and tracking of dataset use in the literature. In addition, "survey_model_evaluation" includes our findings on model evaluation for four methodological repositories. The data are associated with our paper "Towards an Ideological Data Repository: Lessons and Recommendations," where individual columns and survey methods are described.

  17. Selected facets for DataCite Repositories

    • zenodo.org
    csv
    Updated May 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ted Habermann; Ted Habermann (2025). Selected facets for DataCite Repositories [Dataset]. http://doi.org/10.5281/zenodo.15549013
    Explore at:
    csvAvailable download formats
    Dataset updated
    May 29, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ted Habermann; Ted Habermann
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    May 26, 2025
    Description

    What is a facet?

    A facet is a metadata element, usually from a controlled list, that provides counts of records in a query result with particular values for the metadata element. The DataCite JSON Response includes data on a variety of facets for each query done using the DataCite API.

    Repository Facets

    DataCite Commons uses facets on repository pages to provide an overview of repositories. For example, the Metadata Game Changers Commons page shows publication year, work types, licenses, creators and contributors, and some other facets as graphics and lists.

    The facets provided by DataCite can be used to 1) understand characteristics of DataCite metadata, 2) understand some aspects of repository completeness, and 3) provide overviews of repositories.

    Using Facets to Understand DataCite Metadata

    DataCite includes facets and facet values in all query results, so they are a useful tool for answering some "big picture" questions about DataCite metadata. Some of these questions were explored during 2022 in DataCite Facets: Understanding DataCite Usage using a tool called DataCite Facets.

    DataCite Facets and Repository Overviews

    DataCite facets can be used to provide overviews of any DataCite Repository and understand some characteristics of the repositories. They can also be used, in some cases, to provide insights into some aspects of repository completeness.

    Repository Facets and Metadata Completeness

    Many useful repository measures focus on completeness of the metadata, i.e., the portion of records in the repository that include some metadata element. The DataCite facet data can provide some insight into completeness, but we must keep in mind that the facet data are limited to top ten values for most facets (except for published and resourceTypes, which can be > 10). The blog DataCite Facets and Metadata Completeness describes how some facets can be used to provide insights into metadata completeness.

    This dataset provides selected facets downloaded using the DataCite API and associated statistics as a comma-separated-value (CSV) file.

    Column definitions:

    The dataset includes a number of columns for the selected facets:

    Statistic

    Description

    number

    The number of facet values

    max

    The number of occurrences of the most common facet value

    common

    The most common facet value

    total

    The total number of records in the top 10, i.e. the total listed in the facets

    homogeneity (HI)

    An indicator of homogeneity of the facet: maximum count / total count (0.1 = uniform, 1.0 = single item)

    coverage

    The % of all records covered by the top 10 (numbers close to 100% are good)

  18. D

    Data Repository for a Systematic Mapping Study on Warm-Starting and Quantum...

    • darus.uni-stuttgart.de
    Updated Mar 15, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Felix Truger; Johanna Barzen; Marvin Bechtold; Martin Beisel; Frank Leymann; Alexander Mandl; Vladimir Yussupov (2023). Data Repository for a Systematic Mapping Study on Warm-Starting and Quantum Computing [Dataset]. http://doi.org/10.18419/DARUS-3367
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 15, 2023
    Dataset provided by
    DaRUS
    Authors
    Felix Truger; Johanna Barzen; Marvin Bechtold; Martin Beisel; Frank Leymann; Alexander Mandl; Vladimir Yussupov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Oct 1, 2018 - Sep 30, 2022
    Dataset funded by
    BMWK
    Ministry of Economic Affairs, Labour and Tourism of Baden-Württemberg
    Description

    Data representing intermediate and final results of the systematic mapping study entitled "Warm-Starting and Quantum Computing: A Systematic Mapping Study". The data is prepared in a folder structure. It is recommended to change the files view to "Tree" to get a better overview of the files. This comprises (file prefix: 01) the exact search queries executed in the database search phase and details on the execution of these queries, (02) the search results of the database search, (03) the list of included publications from the database search, (04) the list of included publications from the snowballing activity, (05) the consolidated final list of included publications, (06) the information extracted from these publications, including the properties of identified techniques, (07) the analysis of these techniques based on their properties and the resulting categorization.

  19. d

    NIH Common Data Elements Repository

    • catalog.data.gov
    • datadiscovery.nlm.nih.gov
    • +4more
    Updated Jun 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). NIH Common Data Elements Repository [Dataset]. https://catalog.data.gov/dataset/nih-common-data-elements-repository-f6b3a
    Explore at:
    Dataset updated
    Jun 19, 2025
    Dataset provided by
    National Library of Medicine
    Description

    The NIH Common Data Elements (CDE) Repository has been designed to provide access to structured human and machine-readable definitions of data elements that have been recommended or required by NIH Institutes and Centers and other organizations for use in research and for other purposes. Visit the NIH CDE Resource Portal for contextual information about the repository.

  20. Data from: Beyond the Repository Curatorial Toolkit

    • osf.io
    Updated Jun 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laura Alagna; Lauren Work (2021). Beyond the Repository Curatorial Toolkit [Dataset]. http://doi.org/10.17605/OSF.IO/GEJQS
    Explore at:
    Dataset updated
    Jun 28, 2021
    Dataset provided by
    Center for Open Sciencehttps://cos.io/
    Authors
    Laura Alagna; Lauren Work
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This toolkit is intended to assist cultural heritage organizations in choosing materials to send to distributed digital preservation (DDP) systems, networks in geographically-dispersed locations designed to perform preservation actions.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
vicky A (2025). Repository-Dataset [Dataset]. https://huggingface.co/datasets/Mr-Vicky-01/Repository-Dataset

Repository-Dataset

Mr-Vicky-01/Repository-Dataset

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 14, 2025
Authors
vicky A
Description

Mr-Vicky-01/Repository-Dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

Search
Clear search
Close search
Google apps
Main menu