100+ datasets found
  1. H

    Bear Lake Data Repository

    • hydroshare.org
    • search.dataone.org
    zip
    Updated Sep 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeff Nielson; Katie Wadsworth (2024). Bear Lake Data Repository [Dataset]. https://www.hydroshare.org/resource/444e4bd2940e47e6bcab5e7966a929fe
    Explore at:
    zip(154.6 MB)Available download formats
    Dataset updated
    Sep 9, 2024
    Dataset provided by
    HydroShare
    Authors
    Jeff Nielson; Katie Wadsworth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The Bear Lake Data Repository (BLDR) is an active archive, containing a growing compilation of biological, chemical, and physical datasets collected from Bear Lake and its surrounding watershed. The datasets herein have been digitized from historical records and reports, extracted from papers and theses, and obtained from public and private entities, including the United States Geological Survey, PacifiCorp, and, inter alia, Ecosystems Research Institute.

    Contributions are welcome. The BLDR accepts biological, chemical, or physical datasets obtained at Bear Lake, irrespective of funding source. There is no submission size limit at present—workarounds will be found if submissions exceed Hydroshare limits (20 GB). Contributions are published with an open access license and will serve many use cases. The current repository steward, Bear Lake Watch, will advise on submissions and make accepted contributions available promptly.

    Metadata files are provided for each dataset, however, contact with original contributor(s) is encouraged for questions and additional details prior to data usage. The BLDR and its contributors shall not be liable for any damages resulting from misinterpretation or misuse of the data or metadata.

  2. NSF Public Access Repository

    • catalog.data.gov
    Updated Sep 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Science Foundation (2021). NSF Public Access Repository [Dataset]. https://catalog.data.gov/dataset/nsf-public-access-repository
    Explore at:
    Dataset updated
    Sep 19, 2021
    Dataset provided by
    National Science Foundationhttp://www.nsf.gov/
    Description

    The NSF Public Access Repository contains an initial collection of journal publications and the final accepted version of the peer-reviewed manuscript or the version of record. To do this, NSF draws upon services provided by the publisher community including the Clearinghouse of Open Research for the United States, CrossRef, and International Standard Serial Number. When clicking on a Digital Object Identifier number, you will be taken to an external site maintained by the publisher. Some full text articles may not be available without a charge during the embargo, or administrative interval. Some links on this page may take you to non-federal websites. Their policies may differ from this website.

  3. p

    Building Information Modelling (BIM) data repository with labels

    • purr.purdue.edu
    Updated Oct 3, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiansong Zhang; Jin Wu (2019). Building Information Modelling (BIM) data repository with labels [Dataset]. http://doi.org/10.4231/60V2-PJ72
    Explore at:
    Dataset updated
    Oct 3, 2019
    Dataset provided by
    PURR
    Authors
    Jiansong Zhang; Jin Wu
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The dataset contains five models with extracted elements for each model. The elements are manually labeled by researchers with construction/civil engineering backgrounds based on discussion and majority vote.

  4. B

    FishSounds Website Data Repository

    • borealisdata.ca
    • search.dataone.org
    Updated Oct 28, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Audrey Looby; Amalis Riera; Sarah Vela; Kieran Cox; Santiago Bravo; Rodney Rountree; Francis Juanes; Laura K. Reynolds; Charles W. Martin (2024). FishSounds Website Data Repository [Dataset]. http://doi.org/10.5683/SP2/TACOUX
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 28, 2024
    Dataset provided by
    Borealis
    Authors
    Audrey Looby; Amalis Riera; Sarah Vela; Kieran Cox; Santiago Bravo; Rodney Rountree; Francis Juanes; Laura K. Reynolds; Charles W. Martin
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Area covered
    global, underwater ecosystems
    Description

    FishSounds presents a compilation of acoustic recordings and published information on sound production across all extant fish species globally. We hope this information can be used to advance research into fish behavior, passive acoustic monitoring, and human impacts on underwater soundscapes as well as serve as a public resource for anyone interested in learning more about fish sounds. This work is the product of an international collaboration between researchers and developers from five organizations. We have taken a cross-disciplinary approach, combining expertise in fish ecology, bioacoustics, and data management to produce a website that we hope will serve the wider marine research community. This Dataverse dataset serves as a permanent repository for all versions of the FishSounds website and associated publications and products. Please see the latest version for the most detailed methodology and data, though the other versions are available for reference. All of the data provided here may be more easily viewed and searched at FishSounds.net. We will be continuing to update and add to FishSounds.net and this repository, so if you would like to suggest an edit or contribute a reference or associated fish sound recording, please contact us at fishsoundscontact@gmail.com.

  5. JSON Repository

    • data.amerigeoss.org
    • cloud.csiss.gmu.edu
    • +2more
    csv, geojson, json +1
    Updated Feb 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    UN Humanitarian Data Exchange (2025). JSON Repository [Dataset]. https://data.amerigeoss.org/dataset/json-repository
    Explore at:
    geojson(9124), csv(335), csv(242), json(707249), geojson(886086), geojson(953043), json(1132925), json(3478518), csv(9901), csv(177073), csv(536), csv(669568), geojson(365288), csv(462610), geojson(545299), csv(85982), json(559095), json(876253), csv(6789), json(1975854), json(520472), geojson(2396630), geojson(709673), json(640845), json(457832), csv(177), geojson(178718), csv(845984), geojson(219728), geojson(543777), geojson(162605), csv(358964), csv(4907), geojson(222216), geojson(74470), geojson(1324722), json(632081), json(3411081), geojson(164379), geojson(366788), geojson(54889), csv(779), topojson(2728099), geojson(135805), json(2064743), csv(9980), json(461423), json(327649), json(3401512)Available download formats
    Dataset updated
    Feb 26, 2025
    Dataset provided by
    United Nationshttp://un.org/
    Description

    This dataset contains resources transformed from other datasets on HDX. They exist here only in a format modified to support visualization on HDX and may not be as up to date as the source datasets from which they are derived.

    Source datasets: https://data.hdx.rwlabs.org/dataset/idps-data-by-region-in-mali

  6. Administrative Data Repository (ADR)

    • catalog.data.gov
    • datahub.va.gov
    Updated May 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Veterans Affairs (2021). Administrative Data Repository (ADR) [Dataset]. https://catalog.data.gov/dataset/administrative-data-repository-adr
    Explore at:
    Dataset updated
    May 1, 2021
    Dataset provided by
    United States Department of Veterans Affairshttp://va.gov/
    Description

    The Administrative Data Repository (ADR) was established to provide support for the administrative data elements relative to multiple categories of a person entity such as demographic and eligibility information. Although initially focused on the computing needs of the Veterans Health Administration, the ADR is positioned to provide identity management and demographics support for all IT systems within the Department of Veterans Affairs.

  7. O

    EdSight (State education data repository)

    • data.ct.gov
    • cloud.csiss.gmu.edu
    • +4more
    application/rdfxml +5
    Updated May 3, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    State Department of Education (2017). EdSight (State education data repository) [Dataset]. https://data.ct.gov/Education/EdSight-State-education-data-repository-/7uts-qap4
    Explore at:
    xml, application/rdfxml, tsv, json, csv, application/rssxmlAvailable download formats
    Dataset updated
    May 3, 2017
    Dataset authored and provided by
    State Department of Education
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    EdSight is an education data portal that integrates information from over 30 different sources – some reported by districts and others from external sources. The portal can be accessed here: http://edsight.ct.gov/.

    Information is available on key performance measures that make up the Next Generation Accountability System, as well as dozens of other topics, including school finance, special education, staffing levels and school enrollment.

  8. repository-metadata

    • huggingface.co
    Updated Oct 18, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amazon SageMaker (2023). repository-metadata [Dataset]. https://huggingface.co/datasets/amazon-sagemaker/repository-metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 18, 2023
    Dataset provided by
    Amazon.comhttp://amazon.com/
    Authors
    Amazon SageMaker
    Description

    amazon-sagemaker/repository-metadata dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. w

    Global Code Repository Software Market Research Report: By Deployment Type...

    • wiseguyreports.com
    Updated Jul 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    wWiseguy Research Consultants Pvt Ltd (2024). Global Code Repository Software Market Research Report: By Deployment Type (Cloud-Based, On-Premises), By Enterprise Size (Small and Medium-Sized Enterprises (SMEs), Large Enterprises), By Industry Vertical (IT & Telecommunications, Manufacturing, Financial Services, Healthcare, Education, Government), By Application (Version Control, Issue Tracking, Code Review, Project Management, Continuous Integration/Continuous Delivery (CI/CD)) and By Regional (North America, Europe, South America, Asia Pacific, Middle East and Africa) - Forecast to 2032. [Dataset]. https://www.wiseguyreports.com/reports/code-repository-software-market
    Explore at:
    Dataset updated
    Jul 19, 2024
    Dataset authored and provided by
    wWiseguy Research Consultants Pvt Ltd
    License

    https://www.wiseguyreports.com/pages/privacy-policyhttps://www.wiseguyreports.com/pages/privacy-policy

    Time period covered
    Jan 7, 2024
    Area covered
    Global
    Description
    BASE YEAR2024
    HISTORICAL DATA2019 - 2024
    REPORT COVERAGERevenue Forecast, Competitive Landscape, Growth Factors, and Trends
    MARKET SIZE 20233.63(USD Billion)
    MARKET SIZE 20244.11(USD Billion)
    MARKET SIZE 203211.2(USD Billion)
    SEGMENTS COVEREDDeployment Type ,Enterprise Size ,Industry Vertical ,Application ,Regional
    COUNTRIES COVEREDNorth America, Europe, APAC, South America, MEA
    KEY MARKET DYNAMICSIncreased adoption of cloudbased code repositories Growing demand for collaborative development tools Rising popularity of open source software Need for improved code security and compliance Integration of code repositories with other DevOps tools
    MARKET FORECAST UNITSUSD Billion
    KEY COMPANIES PROFILEDVersionDog ,Plastic SCM ,SourceForge ,Beanstalk ,GitLab ,Google ,Microsoft ,Assembla ,Atlassian ,SmartGit ,CodebaseHQ ,RhodeCode ,CollabNet VersionOne ,Perforce Software
    MARKET FORECAST PERIOD2024 - 2032
    KEY MARKET OPPORTUNITIESCloudbased adoption Growing cloud adoption driving demand for cloudbased code repository solutions DevOps integration Increasing adoption of DevOps practices necessitating integrated code repository solutions AIpowered code management AIbased features for code quality security and collaboration Remote and distributed teams Remote work trends increasing the need for collaborative code repositories Open source code repositories Rising popularity of open source software and associated code repositories
    COMPOUND ANNUAL GROWTH RATE (CAGR) 13.34% (2024 - 2032)
  10. B

    Research Data Repository Requirements and Features Review

    • borealisdata.ca
    Updated Aug 24, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amber Leahey; Peter Webster; Claire Austin; Nancy Fong; Julie Friddell; Chuck Humphrey; Susan Brown; Walter Stewart (2015). Research Data Repository Requirements and Features Review [Dataset]. http://doi.org/10.5683/SP3/UPABVH
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2015
    Dataset provided by
    Borealis
    Authors
    Amber Leahey; Peter Webster; Claire Austin; Nancy Fong; Julie Friddell; Chuck Humphrey; Susan Brown; Walter Stewart
    License

    https://borealisdata.ca/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.5683/SP3/UPABVHhttps://borealisdata.ca/api/datasets/:persistentId/versions/4.0/customlicense?persistentId=doi:10.5683/SP3/UPABVH

    Time period covered
    Sep 2014 - Feb 2015
    Area covered
    Europe, United Kingdom, United States, Canada, International
    Description

    Data collected from major Canadian and international research data repositories cover data storage, preservation, metadata, interchange, data file types, and other standard features used in the retention and sharing of research data. The outputs of this project primarily aim to assist in the establishment of recommended minimum requirements for a Canadian research data infrastructure. The committee also aims to further develop guidelines and criteria for the assessment and selection o f repositories for deposit of Canadian research data by researchers, data managers, librarians, archivists etc.

  11. d

    Biologic Specimen and Data Repository Information Coordinating Center...

    • dknet.org
    • scicrunch.org
    • +2more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Biologic Specimen and Data Repository Information Coordinating Center (BioLINCC) [Dataset]. http://identifiers.org/RRID:SCR_013142
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    Repository that serves to coordinate searches across data and biospecimen collections from participants in numerous clinical trials and epidemiologic studies and to provide an electronic means for requests for additional information and the submission of requests for collections. The collections, comprising data from more than 80 trials or studies and millions of biospecimens, are available to qualified investigators under specific terms and conditions consistent with the informed consents provided by the individual study participants. Some datasets are presented with studies and supporting materials to facilitate their use in reuse and teaching. Datasets support basic research, clinical studies, observational studies, and demonstrations. Researchers wishing to apply to submit biospecimen collections to the NHLBI Biorepository for sharing with qualified investigators may also use this website to initiate that process.

  12. e

    Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset

    • swissmodel.expasy.org
    gz
    Updated Aug 2, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2014). Proteome UP000001584 - (Mycobacterium tuberculosis) SWISS-MODEL dataset [Dataset]. https://swissmodel.expasy.org/repository
    Explore at:
    gzAvailable download formats
    Dataset updated
    Aug 2, 2014
    Description

    SWISS-MODEL homology models mapping to UniProtKB Proteome UP000001584 (Mycobacterium tuberculosis)

  13. H

    Scientific production on data repositories and open science published in the...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated May 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sinval Rodrigues-Junior (2024). Scientific production on data repositories and open science published in the Web of Science database – Bibliometric conceptual analysis [Dataset]. http://doi.org/10.7910/DVN/MZ1EUP
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 2, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Sinval Rodrigues-Junior
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This document describes data collected from the Main Collection of the Web of Science database. Records of published studies addressing the intersection of Open Science and data repository were searched up to January 15th, 2024, and the final dataset was comprised of 545 records for bibliometric analysis.

  14. Data for: Sustainable connectivity in a community repository

    • data.niaid.nih.gov
    • datadryad.org
    zip
    Updated Dec 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ted Habermann (2023). Data for: Sustainable connectivity in a community repository [Dataset]. http://doi.org/10.5061/dryad.nzs7h44xr
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 7, 2023
    Dataset provided by
    Authors
    Ted Habermann
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Description

    Identifiers of many kinds are the key to creating unambiguous and persistent connections between research objects and other items in the global research infrastructure (GRI). Many repositories are implementing mechanisms to collect and integrate these identifiers into their submission and record curation processes. This bodes well for a well-connected future, but many existing resources submitted in the past are missing these identifiers, thus missing the connections required for inclusion in the connected infrastructure. Re-curation of these metadata is required to make these connections. The Dryad Data Repository has existed since 2008 and has successfully re-curated the repository metadata several times, adding identifiers for research organizations, funders, and researchers. Understanding and quantifying these successes depends on measuring repository and identifier connectivity. Metrics are described and applied to the entire repository here. Identifiers for papers (DOIs) connected to datasets in Dryad have long been a critical part of the Dryad metadata creation and curation processes. Since 2019, the % of datasets with connected papers has decreased from 100% to less than 40%. This decrease has significant ramifications for the re-curation efforts described above as connected papers are an important source of metadata. In addition, missing connections to papers make understanding and re-using datasets more difficult. Connections between datasets and papers are many times difficult to make because of time lags between submission and publication, lack of clear mechanisms for citing datasets and other research objects from papers, changing focus of researchers, and other obstacles. The Dryad community of members, i.e. users, research institutions, publishers, and funders have vested interests in identifying these connections and critical roles in the curation and re-curation efforts. Their engagement will be critical in building on the successes Dryad has already achieved and ensuring sustainable connectivity in the future. Methods These data are Dryad metadata retrieved from https://datadryad.org and translated into csv files. There are two datasets: 1. DryadJournalDataset was retrieved from Dryad using the ISSNs in the file DryadJournalDataset_ISSNs.txt, although some had no data. 2. DryadOrganizationDataset was retrieved from Dryad using the RORs in the file DryadOrganizationDataset_RORs.txt, although some had no data. Each dataset includes four types of metadata: identifiers, funders, keywords, and related works, each in a separate comma (.csv) or tab (.tsv) delimited files. There are also Microsoft Excel files (.xlsx) for the identifier metadata and connectivity summaries for each dataset (*.html). The connectivity summaries include summaries of each parameter in all four data files with definitions, counts, unique counts, most frequent values, and completeness. These data formed the basis for an analysis of the connectivity of the Dryad repository for organizations, funders, and people.

  15. Z

    urbisphere-Berlin campaign BAMS data repository

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    urbisphere-Berlin campaign BAMS data repository [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_11235441
    Explore at:
    Dataset updated
    Jul 26, 2024
    Dataset provided by
    Ruhtz, Thomas
    Van Hove, Melania
    Kittner, Jonas
    McGrory, Megan
    Grimmond, Sue
    Mitraka, Zina
    Barlow, Janet
    Tsirantonakis, Dimitris
    Feigel, Gregor
    Fenner, Daniel
    Iqbal, Nimra
    Benjamin, Kit
    Looschelders, Dana
    Trachte, Katja
    Meier, Fred
    Chrysoulakis, Nektarios
    Zeeman, Matthias
    Ravan, Marvin
    Liu, Yiqing
    Scherer, Dieter
    Lean, Humphrey
    Smith, Stefan Thor
    Metzger, Swen
    Kotthaus, Simone
    Beyrich, Frank
    Blunn, Lewis
    Morrison, William
    Hertwig, Denise
    Birkmann, Joern
    Glazer, Russell
    Christen, Andreas
    Poursanidis, Dimitris
    Gertsen, Carlotta
    Saunders, Bethany
    Bechtel, Benjamin
    Paskin, Matthew
    Luo, Zhiwen
    Briegel, Ferdinand
    König, Kai
    Stretton, Megan
    Clements, Matthew
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Berlin
    Description

    This data set accompanies Fenner et al. (2024) and contains the data (and references to data sources) of the plots and tables therein.

    Data are organized by Figure and Table in the article, each located in a separate (zip-)folder.

    See README.pdf for additional information and data descriptions.

    Detailed data processing details are given in the Appendices of the article.

    RAW measurement data are accessible via the Zenodo “urbisphere” community.

    Fenner, D., Christen, A., Grimmond, S., Meier, F., Morrison, W., Zeeman, M., Barlow, J., Birkmann, J., Blunn, L., Chrysoulakis, N., Clements, M., Glazer, R., Hertwig, D., Kotthaus, S., König, K., Looschelders, D., Mitraka, Z., Poursanidis, D., Tsirantonakis, D., Bechtel, B., Benjamin, K., Beyrich, F., Briegel, F., Feigel, G., Gertsen, C., Iqbal, N., Kittner, J., Lean, H., Liu, Y., Luo, Z., McGrory, M., Metzger, S., Paskin, M., Ravan, M., Ruhtz, T., Saunders, B., Scherer, D., Smith, S. T., Stretton, M., Trachte, K. and Van Hove, M., 2024: urbisphere-Berlin campaign: Investigating multi-scale urban impacts on the atmospheric boundary layer. Bull. Am. Meteorol. Soc. DOI: 10.1175/BAMS-D-23-0030.1

  16. d

    Stowers Original Data Repository

    • dknet.org
    • neuinfo.org
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Stowers Original Data Repository [Dataset]. http://identifiers.org/RRID:SCR_002640/resolver?q=&i=rrid
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    THIS RESOURCE IS NO LONGER IN SERVICE, documented January 13, 2022. Open access repository of original, unprocessed data underlying work published by Stowers researchers to allow the scientific community to validate and extend the findings made by Stowers researchers. For papers first submitted for publication after November 1, 2011, the Stowers Institute requires its members to deposit original data files into the Stowers Original Data Repository or to repositories maintained by third parties at the time of publication. Access to the Stowers Original Data Repository is free, but you will be asked to register before you can download data.

  17. d

    Toward a Reproducible Research Data Repository

    • data.depositar.io
    mp4, pdf
    Updated Jan 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Toward a Reproducible Research Data Repository [Dataset]. https://data.depositar.io/dataset/reproducible-research-data-repository
    Explore at:
    pdf(212638), pdf(2586248), pdf(627064), mp4(22141307)Available download formats
    Dataset updated
    Jan 26, 2024
    Dataset provided by
    depositar
    Description

    Collected in this dataset are the slideset and abstract for a presentation on Toward a Reproducible Research Data Repository by the depositar team at International Symposium on Data Science 2023 (DSWS 2023), hosted by the Science Council of Japan in Tokyo on December 13-15, 2023. The conference was organized by the Joint Support-Center for Data Science Research (DS), Research Organization of Information and Systems (ROIS) and the Committee of International Collaborations on Data Science, Science Council of Japan. The conference programme is also included as a reference.

    Title

    Toward a Reproducible Research Data Repository

    Author(s)

    Cheng-Jen Lee, Chia-Hsun Ally Wang, Ming-Syuan Ho, and Tyng-Ruey Chuang

    Affiliation of presenter

    Institute of Information Science, Academia Sinica, Taiwan

    Summary of Abstract

    The depositar (https://data.depositar.io/) is a research data repository at Academia Sinica (Taiwan) open to researhers worldwide for the deposit, discovery, and reuse of datasets. The depositar software itself is open source and builds on top of CKAN. CKAN, an open source project initiated by the Open Knowledge Foundation and sustained by an active user community, is a leading data management system for building data hubs and portals. In addition to CKAN's out-of-the-box features such as JSON data API and in-browser preview of uploaded data, we have added several features to the depositar, including sourcing from Wikidata as dataset keywords, a citation snippet for datasets, in-browser Shapefile preview, and a persistent identifier system based on ARK (Archival Resource Keys). At the same time, the depositar team faces an increasing demand for interactive computing (e.g. Jupyter Notebook) which facilitates not just data analysis, but also for the replication and demonstration of scientific studies. Recently, we have provided a JupyterHub service (a multi-tenancy JupyterLab) to some of the depositar's users. However, it still requires users to first download the data files (or copy the URLs of the files) from the depositar, then upload the data files (or paste the URLs) to the Jupyter notebooks for analysis. Furthermore, a JupyterHub deployed on a single server is limited by its processing power which may lower the service level to the users. To address the above issues, we are integrating the BinderHub into the depositar. BinderHub (https://binderhub.readthedocs.io/) is a kubernetes-based service that allows users to create interactive computing environments from code repositories. Once the integration is completed, users will be able to launch Jupyter Notebooks to perform data analysis and vsualization without leaving the depositar by clicking the BinderHub buttons on the datasets. In this presentation, we will first make a brief introduction to the depositar and BinderHub along with their relationship, then we will share our experiences in incorporating interactive computation in a data repository. We shall also evaluate the possibility of integrating the depositar with other automation frameworks (e.g. the Snakemake workflow management system) in order to enable users to reproduce data analysis.

    Keywords

    BinderHub, CKAN, Data Repositories, Interactive Computing, Reproducible Research

  18. r

    NIH Data Sharing Repositories

    • rrid.site
    • dknet.org
    • +2more
    Updated Mar 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). NIH Data Sharing Repositories [Dataset]. http://identifiers.org/RRID:SCR_003551
    Explore at:
    Dataset updated
    Mar 7, 2025
    Description

    A listing of NIH supported data sharing repositories that make data accessible for reuse. Most accept submissions of appropriate data from NIH-funded investigators (and others), but some restrict data submission to only those researchers involved in a specific research network. Also included are resources that aggregate information about biomedical data and information sharing systems. The table can be sorted according by name and by NIH Institute or Center and may be searched using keywords so that you can find repositories more relevant to your data. Links are provided to information about submitting data to and accessing data from the listed repositories. Additional information about the repositories and points-of-contact for further information or inquiries can be found on the websites of the individual repositories.

  19. Data from: What factors influence where researchers deposit their data? A...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    csv, txt
    Updated May 30, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shea Swauger; Todd J. Vision; Shea Swauger; Todd J. Vision (2022). Data from: What factors influence where researchers deposit their data? A survey of researchers submitting to data repositories [Dataset]. http://doi.org/10.5061/dryad.51vs3
    Explore at:
    txt, csvAvailable download formats
    Dataset updated
    May 30, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Shea Swauger; Todd J. Vision; Shea Swauger; Todd J. Vision
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    In order to better understand the factors that most influence where researchers deposit their data when they have a choice, we collected survey data from researchers who deposited phylogenetic data in either the TreeBASE or Dryad data repositories. Respondents were asked to rank the relative importance of eight possible factors. We found that factors differed in importance for both TreeBASE and Dryad, and that the rankings differed subtly but significantly between TreeBASE and Dryad users. On average, TreeBASE users ranked the domain specialization of the repository highest, while Dryad users ranked as equal highest their trust in the persistence of the repository and the ease of its data submission process. Interestingly, respondents (particularly Dryad users) were strongly divided as to whether being directed to choose a particular repository by a journal policy or funding agency was among the most or least important factors. Some users reported depositing their data in multiple repositories and archiving their data voluntarily.

  20. Securities Finance: Repo Data Analytics Dataset | S&P Global Marketplace

    • marketplace.spglobal.com
    Updated Mar 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    S&P Global (2024). Securities Finance: Repo Data Analytics Dataset | S&P Global Marketplace [Dataset]. https://www.marketplace.spglobal.com/en/datasets/securities-finance-repo-data-analytics-(1707231856)
    Explore at:
    Dataset updated
    Mar 2, 2024
    Dataset authored and provided by
    S&P Globalhttp://www.spglobal.com/
    Description

    The Securities Finance: Repo Data Analytics dataset equips users with essential metrics and comprehensive data to make informed decisions in global repo trading.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jeff Nielson; Katie Wadsworth (2024). Bear Lake Data Repository [Dataset]. https://www.hydroshare.org/resource/444e4bd2940e47e6bcab5e7966a929fe

Bear Lake Data Repository

Explore at:
zip(154.6 MB)Available download formats
Dataset updated
Sep 9, 2024
Dataset provided by
HydroShare
Authors
Jeff Nielson; Katie Wadsworth
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Bear Lake Data Repository (BLDR) is an active archive, containing a growing compilation of biological, chemical, and physical datasets collected from Bear Lake and its surrounding watershed. The datasets herein have been digitized from historical records and reports, extracted from papers and theses, and obtained from public and private entities, including the United States Geological Survey, PacifiCorp, and, inter alia, Ecosystems Research Institute.

Contributions are welcome. The BLDR accepts biological, chemical, or physical datasets obtained at Bear Lake, irrespective of funding source. There is no submission size limit at present—workarounds will be found if submissions exceed Hydroshare limits (20 GB). Contributions are published with an open access license and will serve many use cases. The current repository steward, Bear Lake Watch, will advise on submissions and make accepted contributions available promptly.

Metadata files are provided for each dataset, however, contact with original contributor(s) is encouraged for questions and additional details prior to data usage. The BLDR and its contributors shall not be liable for any damages resulting from misinterpretation or misuse of the data or metadata.

Search
Clear search
Close search
Google apps
Main menu