100+ datasets found

d
Data Collaborations Across Boundaries (Slides)
data.depositar.io
pdf
Updated Jun 27, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
depositar (2025). Data Collaborations Across Boundaries (Slides) [Dataset]. https://data.depositar.io/dataset/data-collaborations-across-boundaries
Explore at:
pdf(4440122), pdf(10713394), pdf(1792282), pdf(1296859), pdf(3112569)Available download formats
Dataset updated
Jun 27, 2025
Dataset provided by
depositar
Description
This dataset collects the slides that were presented at the Data Collaborations Across Boundaries session in SciDataCon 2022, part of the International Data Week.

The following session proposal was prepared by Tyng-Ruey Chuang and submitted to SciDataCon 2022 organizers for consideration on 2022-02-28. The proposal was accepted on 2022-03-28. Six abstracts were submitted and accepted to this session. Five presentations were delivered online in a virtual session on 2022-06-21.

Data Collaborations Across Boundaries

There are many good stories about data collaborations across boundaries. We need more. We also need to share the lessons each of us has learned from collaborating with parties and communities not in our familiar circles.

By boundaries, we mean not just the regulatory borders in between the nation states about data sharing but the various barriers, readily conceivable or not, that hinder collaboration in aggregating, sharing, and reusing data for social good. These barriers to collaboration exist between the academic disciplines, between the economic players, and between the many user communities, just to name a few. There are also cross-domain barriers, for example those that lay among data practitioners, public administrators, and policy makers when they are articulating the why, what, and how of "open data" and debating its economic significance and fair distribution. This session aims to bring together experiences and thoughts on good data practices in facilitating collaborations across boundaries and domains.

The success of Wikipedia proves that collaborative content production and service, by ways of copyleft licenses, can be sustainable when coordinated by a non-profit and funded by the general public. Collaborative code repositories like GitHub and GitLab demonstrate the enormous value and mass scale of systems-facilitated integration of user contributions that run across multiple programming languages and developer communities. Research data aggregators and repositories such as GBIF, GISAID, and Zenodo have served numerous researchers across academic disciplines. Citizen science projects and platforms, for instance eBird, Galaxy Zoo, and Taiwan Roadkill Observation Network (TaiRON), not only collect data from diverse communities but also manage and release datasets for research use and public benefit (e.g. TaiRON datasets being used to improve road design and reduce animal mortality). At the same time large scale data collaborations depend on standards, protocols, and tools for building registries (e.g. Archival Resource Key), ontologies (e.g. Wikidata and schema.org), repositories (e.g. CKAN and Omeka), and computing services (e.g. Jupyter Notebook). There are many types of data collaborations. The above lists only a few.

This session proposal calls for contributions to bring forward lessons learned from collaborative data projects and platforms, especially about those that involve multiple communities and/or across organizational boundaries. Presentations focusing on the following (non-exclusive) topics are sought after:

Support mechanisms and governance structures for data collaborations across organizations/communities.

Data policies --- such as data sharing agreements, memorandum of understanding, terms of use, privacy policies, etc. --- for facilitating collaborations across organizations/communities.

Traditional and non-traditional funding sources for data collaborations across multiple parties; sustainability of data collaboration projects, platforms, and communities.

Data workflows --- collection, processing, aggregation, archiving, and publishing, etc. --- designed with considerations of (external) collaboration.

Collaborative web platforms for data acquisition, curation, analysis, visualization, and education.

Examples and insights from data trusts, data coops, as well as other formal and informal forms of data stewardship.

Debates on the pros and cons of centralized, distributed, and/or federated data services.

Practical lessons learned from data collaboration stories: failure, success, incidence, unexpected turn of event, aftermath, etc. (no story is too small!).
f
Table_1_Data Management in Health-Related Research Involving Indigenous...
frontiersin.figshare.com
docx
Updated Jun 1, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
R. Brian Woodbury; Julie A. Beans; Vanessa Y. Hiratsuka; Wylie Burke (2023). Table_1_Data Management in Health-Related Research Involving Indigenous Communities in the United States and Canada: A Scoping Review.docx [Dataset]. http://doi.org/10.3389/fgene.2019.00942.s001
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2019.00942.s001
Dataset updated
Jun 1, 2023
Dataset provided by
Frontiers
Authors
R. Brian Woodbury; Julie A. Beans; Vanessa Y. Hiratsuka; Wylie Burke
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Canada, United States
Description
Background: Multiple factors, including experiences with unethical research practices, have made some Indigenous groups in the United States and Canada reticent to participate in potentially beneficial health-related research. Yet, Indigenous peoples have also expressed a willingness to participate in research when certain conditions related to the components of data management—including data collection, analysis, security and storage, sharing, dissemination, and withdrawal—are met. A scoping review was conducted to better understand the terms of data management employed in health-related research involving Indigenous communities in the United States and Canada.Methods: PubMed, Embase, PsychINFO, and Web of Science were searched using terms related to the populations and topics of interest. Results were screened and articles deemed eligible for inclusion were extracted for content on data management, community engagement, and community-level research governance.Results: The search strategy returned 734 articles. 31 total articles were extracted, of which nine contained in-depth information on data management and underwent detailed extraction. All nine articles reported the development and implementation of data management tools, including research ethics codes, data-sharing agreements, and biobank access policies.These articles reported that communities were involved in activities and decisions related to data collection (n=7), data analysis (n=5), data-sharing (n=9), dissemination (n=7), withdrawal (n=4), and development of data management tools (n=9). The articles also reported that communities had full or shared ownership of (n=5), control over (n=9), access to (n=1), and possession of data (n=5).All nine articles discussed the role of community engagement in research and community-level research governance as means for aligning the terms of data management with the values, needs, and interests of communities.Conclusions: There is need for more research and improved reporting on data management in health-related research involving Indigenous peoples in the United States and Canada. Findings from this review can provide guidance for the identification of data management terms and practices that may be acceptable to Indigenous communities considering participation in health-related research.
Most common data sharing systems used in health information networks Japan...
statista.com
Updated Jan 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Most common data sharing systems used in health information networks Japan FY 2019 [Dataset]. https://www.statista.com/statistics/1121884/japan-leading-data-sharing-systems-health-information-networks/
Explore at:
Dataset updated
Jan 9, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Japan
Description
According to a survey conducted in the fiscal year 2019, 80 community health information networks in Japan were using the system ID-Link. ID-Link is a cloud network system for medical institutions developed by Japanese technology and electronics company NEC. The community health information network is a system in which medical information of patients can be shared within the regions. The Japanese government has been promoting the system to improve comprehensive medical care offered in local areas.
o
Collaboratory Data on Community Engagement & Public Service in Higher...
openicpsr.org
Updated Mar 30, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kristin D. Medlin; Matthew Seto (2021). Collaboratory Data on Community Engagement & Public Service in Higher Education [Dataset]. http://doi.org/10.3886/E136322V1
Explore at:
Unique identifier
https://doi.org/10.3886/E136322V1
Dataset updated
Mar 30, 2021
Dataset provided by
Collaboratory
Authors
Kristin D. Medlin; Matthew Seto
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Area covered
United States
Description
Collaboratory is a software product developed and maintained by HandsOn Connect Cloud Solutions. It is intended to help higher education institutions accurately and comprehensively track their relationships with the community through engagement and service activities. Institutions that use Collaboratory are given the option to opt-in to a data sharing initiative at the time of onboarding, which grants us permission to de-identify their data and make it publicly available for research purposes. HandsOn Connect is committed to making Collaboratory data accessible to scholars for research, toward the goal of advancing the field of community engagement and social impact.Collaboratory is not a survey, but is instead a dynamic software tool designed to facilitate comprehensive, longitudinal data collection on community engagement and public service activities conducted by faculty, staff, and students in higher education. We provide a standard questionnaire that was developed by Collaboratory’s co-founders (Janke, Medlin, and Holland) in the Institute for Community and Economic Engagement at UNC Greensboro, which continues to be closely monitored and adapted by staff at HandsOn Connect and academic colleagues. It includes descriptive characteristics (what, where, when, with whom, to what end) of activities and invites participants to periodically update their information in accordance with activity progress over time. Examples of individual questions include the focus areas addressed, populations served, on- and off-campus collaborators, connections to teaching and research, and location information, among others.The Collaboratory dataset contains data from 37 institutions beginning in March 2016and continues to grow as more institutions adopt Collaboratory and continue to expand its use. The data represent over 3,600 published activities (and additional associated content) across our user base.Please cite this data as:Medlin, Kristin and Seto, Matthew. Dataset on Higher Education Community Engagement and Public Service Activities, 2016-2021. Collaboratory [producer], 2021. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2021-11-01. doi: _v1.When you cite this data, please also include: ORIGINS PAPER CITATION
A
AmeriGEOSS Data Sharing
data.amerigeoss.org
pdf
Updated Sep 3, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
AmeriGEOSS (2019). AmeriGEOSS Data Sharing [Dataset]. https://data.amerigeoss.org/es/dataset/amerigeoss-data-sharing
Explore at:
pdfAvailable download formats
Dataset updated
Sep 3, 2019
Dataset provided by
AmeriGEOSS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Provides resources related to Data Sharing for the AmeriGEOSS community.
u
HydroShare and the Water Community, removing perceived barriers to data...
verso.uidaho.edu
mp4, pdf, txt, xml
Updated May 11, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Clara Cogswell (2023). HydroShare and the Water Community, removing perceived barriers to data sharing [Dataset]. https://verso.uidaho.edu/esploro/outputs/dataset/HydroShare-and-the-Water-Community-removing/996762907901851
Explore at:
mp4(138660928 bytes), txt(5304 bytes), pdf(313033 bytes), xml(5240 bytes)Available download formats
Dataset updated
May 11, 2023
Dataset provided by
Idaho EPSCoR
Authors
Clara Cogswell
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
May 11, 2023
Description
HydroShare is an open-source general purpose data repository for the water community, created with the goal of increasing data sharing and lowering the barrier to entry for FAIR data practices. We approach this goal by removing both traditional and perceived barriers to data sharing through an intuitive and general purpose, file agnostic repository. We have found that abundant and easy to access instruction in various formats, one on one interactions, and a flexible interface eases the burden of learning to use a data sharing platform.

Resource Use
License
Creative Commons Attribution (CC-BY 4.0)
Recommended Citation
Cogswell C. 2023. HydroShare and the Water Community, removing perceived barriers to data sharing. University of Idaho. https://doi.org/10.7923/2kae-fr20

Funding
National Science Foundation, EngageINFEWS Research Coordination Network: 1856059, 1856084
US National Science Foundation (NSF), Award numbers: 1148453, 1148090, 1664018, 1664061, 1338606, 1664119, 1849458
d
Data from: Sharing the Sun Community Solar Project Data
catalog.data.gov
data.openei.org
Updated Jun 17, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Renewable Energy Laboratory (2025). Sharing the Sun Community Solar Project Data [Dataset]. https://catalog.data.gov/dataset/sharing-the-sun-community-solar-project-data-d86b6
Explore at:
Dataset updated
Jun 17, 2025
Dataset provided by
National Renewable Energy Laboratory
Description
This database represents a list of community solar projects, as well as the low-income (LI) and low- and moderate-income (LMI) provisions for both complete and pending projects identified through various sources. The dataset is updated multiple times per year. The current version is the first file located below. Previous versions of the dataset published before June of 2024 can be found in the dataset below labeled “ARCHIVE_Sharing the Sun Community Solar Project Data_Before 06.24.“ The list has been reviewed but errors may exist and the list may not be comprehensive. Errors in the sources e.g. press releases may be duplicated in the list. Blank spaces represent missing information. NREL invites input to improve the database including to - correct erroneous information - add missing projects - fill in missing information - remove inactive projects.Updated information can be submitted to Sudha Kannan (sudha.kannan@nrel.gov).
D
Replication Data for: The time efficiency gain in sharing and reuse of...
dataverse.nl
test.dataverse.nl
type/x-r-syntax
Updated Feb 11, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tessa Pronk; Tessa Pronk (2019). Replication Data for: The time efficiency gain in sharing and reuse of research data [Dataset]. http://doi.org/10.34894/5QC75H
Explore at:
type/x-r-syntax(8198), type/x-r-syntax(3954)Available download formats
Unique identifier
https://doi.org/10.34894/5QC75H
Dataset updated
Feb 11, 2019
Dataset provided by
DataverseNL
Authors
Tessa Pronk; Tessa Pronk
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset contains supplementary data and R scripts to generate figures for the paper 'The time efficiency gain in sharing and reuse of research data'. This version contains new R scripts to generate Figures of the revised manuscript. Abstract: Among the frequently stated benefits of sharing research data are time efficiency or increased productivity. The assumption is that reuse or secondary use of research data saves researchers time in not having to produce data for a publication themselves. This can make science more efficient and productive. However, if there is no reuse, time costs in making data available for reuse will have been made with no return on this investment. In this paper a mathematical model is used to calculate the break-even point for time spent sharing in a scientific community, versus time gain by reuse. This is done for several scenarios; from simple to complex datasets to share and reuse, and at different sharing rates. The results indicate that sharing research data can indeed cause an efficiency revenue for the scientific community. However, this is not a given in all modeled scenarios. The most efficient scientific community is one that has few sharing researchers, a high reuse rate, and low time investments for sharing and reuse. This suggests it would be beneficial to have a critical selection of datasets that are worth the effort to prepare for reuse in other scientific studies. In addition, stimulating reuse of datasets in itself would be beneficial to increase efficiency in scientific communities.
o
Scientists' Data Sharing Behaviors
openicpsr.org
Updated Aug 19, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Youngseek Kim (2016). Scientists' Data Sharing Behaviors [Dataset]. http://doi.org/10.3886/E100087V7
Explore at:
Unique identifier
https://doi.org/10.3886/E100087V7
Dataset updated
Aug 19, 2016
Authors
Youngseek Kim
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
United States
Description
The objective of this research is to investigate the factors influencing scientists’ data sharing behaviors in different scientific communities by examining both discipline and individual level predictors together. The target population of this research included faculty members and post-doctoral researchers in U.S. academic institutions who belong to STEM disciplines. The sampling frame of this research was identified from the scholar list in the Community of Science’s (CoS) Scholar Database (http://pivot.cos.com), which provides a researcher profile directory in the world mainly from universities and colleges. The final field survey instrument was distributed to the 16,165 potential survey participants in 56 STEM disciplines. From November 19, 2012 to February 15, 2013, a total of 2,470 valid responses were received for the initial data analysis (15.28% of response rate).
Sharing research data and findings relevant to the novel coronavirus...
zenodo.org
data.niaid.nih.gov
Updated Jun 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eleanor Cox; Eleanor Cox; Lucia Loffreda; Lucia Loffreda; Andrea Chiarelli; Andrea Chiarelli (2023). Sharing research data and findings relevant to the novel coronavirus (COVID-19) outbreak - Survey responses [Dataset]. http://doi.org/10.5281/zenodo.6620689
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.6620689
Dataset updated
Jun 21, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Eleanor Cox; Eleanor Cox; Lucia Loffreda; Lucia Loffreda; Andrea Chiarelli; Andrea Chiarelli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The spreadsheets in the present dataset (CSV format) include the anonymised responses to our online survey of signatories of the Joint Statement on open research and data sharing. Responses have been split into quantitative responses (i.e., closed survey questions) and qualitative responses (i.e., free text survey questions).

This data has been used to inform our final report, which is available in our Zenodo Project Community.
d
SciCrunch
dknet.org
scicrunch.org
Updated Oct 16, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2019). SciCrunch [Dataset]. http://identifiers.org/RRID:SCR_003115
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_003115 https://identifiers.org/RRID:SCR_003115/resolver/mentions
Dataset updated
Oct 16, 2019
Description
Community portal for researchers and content management system for data and databases. Intended to provide common source of data to research community and data about Research Resource Identifiers (RRIDs), which can be used in scientific publications. Central service where RRIDs can be searched and created. Designed to help communities of researchers create their own portals to provide access to resources, databases and tools of relevance to their research areas. Adds value to existing scientific resources by increasing their discoverability, accessibility, visibility, utility and interoperability, regardless of their current design or capabilities and without need for extensive redesign of their components or information models. Resources can be searched and discovered at multiple levels of integration, from superficial discovery based on limited description of resource at SciCrunch Registry, to deep content query at SciCrunch Data Federation.
H
CIROH: Enabling collaboration through data and model sharing with CUAHSI...
beta.hydroshare.org
hydroshare.org
+1more
zip
Updated Aug 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Tarboton; Jeffery S. Horsburgh; Shaowen Wang; Jordan Stuart Read; Irene Garousi-Nejad; Anthony M. Castronova; Clara Cogswell (2024). CIROH: Enabling collaboration through data and model sharing with CUAHSI HydroShare [Dataset]. http://doi.org/10.4211/hs.23b51cb99b5445a1af740a21c5acaecb
Explore at:
zip(2.0 MB)Available download formats
Unique identifier
https://doi.org/10.4211/hs.23b51cb99b5445a1af740a21c5acaecb
Dataset updated
Aug 26, 2024
Dataset provided by
HydroShare
Authors
David Tarboton; Jeffery S. Horsburgh; Shaowen Wang; Jordan Stuart Read; Irene Garousi-Nejad; Anthony M. Castronova; Clara Cogswell
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Collaboration is central to CIROH (https://ciroh.ua.edu/). Advancing the knowledge needed to support research to operations in hydrology depends on collaboration around model and data sharing. It requires open data supporting the integration of information from multiple sources; easy to use, generally accessible, shareable computing; and working together as a team and community. The CUAHSI HydroShare platform was developed to advance water research by enabling communities of researchers to more easily and freely share digital products resulting from their research, not just the scientific publications summarizing a study, but the data, models and workflows used to produce the results, consistent with Findable, Accessible, Interoperable, and Reusable (FAIR) principles of present-day research. HydroShare supports and enables private (e.g., social science) and open data sharing, transparent workflows, and computational reproducibility, thereby improving reliability and trust in research findings. These are crucial as research is transferred into operations.

The goal of this project is to enhance the performance, reliability, usability, and scalability of HydroShare’s linkages with cloud storage and computational systems to fulfill CIROH’s community collaboration and linked computing needs and enable CIROH researchers to easily integrate and analyze national scale datasets required for their research using high-performance and cloud computing systems.

The objectives are to (1) enhance community data access; (2) establish interoperability with scalable computing; (3) demonstrate computational reproducibility; and (4) establish and grow a CIROH Community on HydroShare. Work under objective (1) will use community input to identify, prioritize, and establish easy to use access to multiple high-value community datasets. Work under objective (2) will establish or extend interfaces to high performance computing, leveraging tools for model input preparation such as the CUAHSI Domain Subsetter and I-GUIDE (the Institute for Geospatial Understanding through an enhanced Discovery Environment, https://iguide.illinois.edu). Work under objective (3) will establish and document CIROH community best practices for enhancing the reproducibility of high-performance computing and analysis workflows so that CIROH modeling workflows can be accessed, re-executed, and analyzed by multiple researchers. Work under objective (4) will establish a CIROH “Community” within the HydroShare repository to support collaboration around and sharing of CIROH research products.

Forecasting operations will benefit from the transparency of research products hosted in HydroShare and linked to computing platforms for reproducibility and evaluation. Linking publications, data, and code (often in GitHub), with methods and findings that are well documented and tested will support their evaluation by the National Water Center for operational adoption.

This project runs 6/1/2023 to 5/31/2025.
n
Data from: Public sharing of research datasets: a pilot study of...
data.niaid.nih.gov
datadryad.org
+1more
zip
Updated May 26, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Heather A. Piwowar; Wendy W. Chapman (2011). Public sharing of research datasets: a pilot study of associations [Dataset]. http://doi.org/10.5061/dryad.3td2f
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.3td2f
Dataset updated
May 26, 2011
Dataset provided by
University of Pittsburgh
Authors
Heather A. Piwowar; Wendy W. Chapman
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
The public sharing of primary research datasets potentially benefits the research community but is not yet common practice. In this pilot study, we analyzed whether data sharing frequency was associated with funder and publisher requirements, journal impact factor, or investigator experience and impact. Across 397 recent biomedical microarray studies, we found investigators were more likely to publicly share their raw dataset when their study was published in a high-impact journal and when the first or last authors had high levels of career experience and impact. We estimate the USA's National Institutes of Health (NIH) data sharing policy applied to 19% of the studies in our cohort; being subject to the NIH data sharing plan requirement was not found to correlate with increased data sharing behavior in multivariate logistic regression analysis. Studies published in journals that required a database submission accession number as a condition of publication were more likely to share their data, but this trend was not statistically significant. These early results will inform our ongoing larger analysis, and hopefully contribute to the development of more effective data sharing initiatives. Earlier version presented at ASIS&T and ISSI Pre-Conference: Symposium on Informetrics and Scientometrics 2009
d
DEPRECATED - Sharing the Sun Community Solar Project Data (June 2022)
catalog.data.gov
data.openei.org
+1more
Updated May 9, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Renewable Energy Laboratory (2025). DEPRECATED - Sharing the Sun Community Solar Project Data (June 2022) [Dataset]. https://catalog.data.gov/dataset/deprecated-sharing-the-sun-community-solar-project-data-june-2022-a50d7
Explore at:
Dataset updated
May 9, 2025
Dataset provided by
National Renewable Energy Laboratory
Description
This data set is no longer current – The most current data and all historical data sets can be found at https://data.nrel.gov/submissions/244 This database represents a list of community solar projects identified through various sources as of June 2022. The list has been reviewed but errors may exist and the list may not be comprehensive. Errors in the sources e.g. press releases may be duplicated in the list. Blank spaces represent missing information. NREL invites input to improve the database including to - correct erroneous information - add missing projects - fill in missing information - remove inactive projects. Updated information can be submitted to the contact(s) located on the current data set page linked at the top.
d
Data from: Research data sharing: practices and attitudes of geophysicists
dataone.org
datadryad.org
Updated Jun 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carol Tenopir; Lisa Christian; Suzie Allard; Josh Borycz (2025). Research data sharing: practices and attitudes of geophysicists [Dataset]. http://doi.org/10.5061/dryad.sv6t740
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.sv6t740
Dataset updated
Jun 9, 2025
Dataset provided by
Dryad Digital Repository
Authors
Carol Tenopir; Lisa Christian; Suzie Allard; Josh Borycz
Time period covered
Jan 1, 2019
Description
Open data policies have been introduced by governments, funders, and publishers over the past decade. Previous research showed a growing recognition by scientists of the benefits of data-sharing and reuse, but actual practices lag and are not always compliant with new regulations. The goal of this study is to investigate motives, attitudes, and data practices of the community of earth and planetary geophysicists, a discipline believed to have accepting attitudes towards data sharing and reuse. A better understanding of the attitudes and current data-sharing practices of this scientific community could enable funders, publishers, data managers, and librarians to design systems and services that help scientists understand and adhere to mandates and to create practices, tools, and services that are scientist-focused. An online survey was distributed to the members of the American Geophysical Union (AGU), producing 1372 responses from 116 countries. The attitudes of researchers to data shar...
f
Changes in Data Sharing and Data Reuse Practices and Perceptions among...
plos.figshare.com
data.niaid.nih.gov
+2more
docx
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Carol Tenopir; Elizabeth D. Dalton; Suzie Allard; Mike Frame; Ivanka Pjesivac; Ben Birch; Danielle Pollock; Kristina Dorsett (2023). Changes in Data Sharing and Data Reuse Practices and Perceptions among Scientists Worldwide [Dataset]. http://doi.org/10.1371/journal.pone.0134826
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0134826
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS ONE
Authors
Carol Tenopir; Elizabeth D. Dalton; Suzie Allard; Mike Frame; Ivanka Pjesivac; Ben Birch; Danielle Pollock; Kristina Dorsett
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The incorporation of data sharing into the research lifecycle is an important part of modern scholarly debate. In this study, the DataONE Usability and Assessment working group addresses two primary goals: To examine the current state of data sharing and reuse perceptions and practices among research scientists as they compare to the 2009/2010 baseline study, and to examine differences in practices and perceptions across age groups, geographic regions, and subject disciplines. We distributed surveys to a multinational sample of scientific researchers at two different time periods (October 2009 to July 2010 and October 2013 to March 2014) to observe current states of data sharing and to see what, if any, changes have occurred in the past 3–4 years. We also looked at differences across age, geographic, and discipline-based groups as they currently exist in the 2013/2014 survey. Results point to increased acceptance of and willingness to engage in data sharing, as well as an increase in actual data sharing behaviors. However, there is also increased perceived risk associated with data sharing, and specific barriers to data sharing persist. There are also differences across age groups, with younger respondents feeling more favorably toward data sharing and reuse, yet making less of their data available than older respondents. Geographic differences exist as well, which can in part be understood in terms of collectivist and individualist cultural differences. An examination of subject disciplines shows that the constraints and enablers of data sharing and reuse manifest differently across disciplines. Implications of these findings include the continued need to build infrastructure that promotes data sharing while recognizing the needs of different research communities. Moving into the future, organizations such as DataONE will continue to assess, monitor, educate, and provide the infrastructure necessary to support such complex grand science challenges.
H
Hydroshare and community data sharing tools
hydroshare.org
search.dataone.org
zip
Updated Aug 28, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Danielle Tijerina (2019). Hydroshare and community data sharing tools [Dataset]. https://www.hydroshare.org/resource/d5f62e3458464ade84910dfdffd16c7f
Explore at:
zip(179.7 MB)Available download formats
Dataset updated
Aug 28, 2019
Dataset provided by
HydroShare
Authors
Danielle Tijerina
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered

Description
Data management, sharing, and publication are integral parts of a robust data management plan, a core requirement of all NSF funded research grants and many other funding agencies. This seminar will discuss some common challenges and present solutions for managing and sharing data using CUAHSI tools, specifically utilizing HydroShare. HydroShare is an online repository system for water data and models that aims to advance hydrologic science through enabling users to manage, share, and publish products resulting from their research and data collection. We will introduce attendees to approaches for managing current and archived data, collaboration within a research group, documentation of metadata, and publication. The webinar will center around tools and techniques within HydroShare to facilitate these activities, employing both discussions and demos.
r
FCP Classic Data Sharing Samples
rrid.site
dknet.org
Updated Jul 20, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2025). FCP Classic Data Sharing Samples [Dataset]. http://identifiers.org/RRID:SCR_005362
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_005362
Dataset updated
Jul 20, 2025
Description
1200+ ''resting state'' functional MRI (R-fMRI) datasets independently collected at 33 sites and donated by the principal investigators for the purpose of providing the broader imaging community complete access to a large-scale functional imaging dataset. Age, sex and imaging center information are provided for each of the datasets. In accordance with HIPAA guidelines, all datasets are anonymous, with no protected health information included. We anticipate this data-sharing effort will equip researchers with a means of exploring and refining R-fMRI approaches, and facilitate the growing ethos of sharing and collaboration. Disclaimer: The ''1000 Functional Connectomes Project'' datasets are provided freely without assurance of quality or appropriateness for usage.
w
Data from: Building a Community-based Data Network for Geothermal Energy
data.wu.ac.at
pdf
Updated Dec 5, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2017). Building a Community-based Data Network for Geothermal Energy [Dataset]. https://data.wu.ac.at/schema/geothermaldata_org/MDMzZTUxNDQtZDQxZS00NjkxLWE4NTMtODNmNGM2ZTZkMjQz
Explore at:
pdfAvailable download formats
Dataset updated
Dec 5, 2017
Area covered
fc79960b226c9b9b5279fda5ac037c5e31d9d087
Description
In this paper, we address the issues associated with the development and sustainability of a network of data sites and databases hosted by academic-based groups. These are the groups who, over the years, have conducted the majority of research on geothermal systems while educating the next generation of geothermal professionals and researchers. A network can be envisioned as an Internet-connected series of nodes (data sites), that allow for a common approach to finding data among the linked sites. Each of the co-authors' groups have for years collected and provided data to researchers, industry, state and federal agencies, and the public. Over the last ten years, the better management of and access to data have become an increasingly important part of not only the research process, but also for industry as they push forward with delineating and producing geothermal resources, for state and federal agencies to help them meet their missions and mandates, and to inform the public on the importance of geothermal energy. Indeed both Congress and the White House continue to strengthen the bipartisan goal of free and open access to all data created by the federal dollar. For academic-based data sites, the challenges are many, but not insurmountable. For example, we need to provide seamless linkages of data to analysis and visualization tools, in particular high-level modeling programs and required computational resources. When dealing with research results and industry partnerships, moratorium and proprietary data must be handled carefully and securely. At the same time, we have to make it as easy as possible for users to discover, aggregate and synthesize data in ways that allow them to focus on the analysis of these data rather than on finding and compiling them. We also must better utilize these research-level data within the education enterprise to train and attract our next generation of geoscientists and geoengineers. Data discovery and sharing among multiple data sites is a persistent issue. Each of the sites in the network hosts some unique data but a substantial number of data types may overlap among some of the sites, complicating the process. The notion of the "semantic web" as the solution for data integration has sparked continued debate because of its underlying requirement for ontologies and single-definition vocabularies. A hybrid approach is now evolving that utilizes some tools of the semantic web but recognizes the investment in current data sites and the different needs of the various user communities. This approach also recognizes that much "knowledge" (the ultimate value of a data network) is wrapped up in differences among vocabularies, languages and concepts and that forcing singularity decreases the knowledge value of data and their resulting data products. This approach makes data sharing more difficult, but not impossible, and importantly opens up the system for broader participation and collaboration. Sustainability will always remain an issue for an academic-based data network. Self-funding from the home institution is not feasible. However as agencies continue to increase their efforts to manage and provide access to data generated by projects and activities they fund, a long-term agency-academic partnership will evolve that includes both funding academic-based data networks and relying on these networks to provide some of that public access to federally-funded data. Finally, it is important to note that these academic-based data networks have to be self-governing for them to work at all, much less be sustainabl
Data from: Populating the Data Ark: An attempt to retrieve, preserve, and...
osf.io
Updated Aug 9, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tom E Hardwicke; John P. A. Ioannidis (2023). Populating the Data Ark: An attempt to retrieve, preserve, and liberate data from the most highly-cited psychology and psychiatry articles [Dataset]. https://osf.io/7t3qv/
Explore at:
Dataset updated
Aug 9, 2023
Dataset provided by
Center for Open Sciencehttps://cos.io/
Authors
Tom E Hardwicke; John P. A. Ioannidis
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The vast majority of scientific articles published to-date have not been accompanied by concomitant publication of the underlying research data upon which they are based. This state of affairs precludes the routine re-use and re-analysis of research data, undermining the efficiency of the scientific enterprise, and compromising the credibility of claims that cannot be independently verified. It may be especially important to make data available for the most influential studies that have provided a foundation for subsequent research and theory development. Therefore, we launched an initiative—the Data Ark—to examine whether we could retrospectively enhance the preservation and accessibility of important scientific data. Here we report the outcome of our efforts to retrieve, preserve, and liberate data from 111 of the most highly-cited articles published in psychology and psychiatry between 2006–2011 (n = 48) and 2014–2016 (n = 63). Most data sets were not made available (76/111, 68%, 95% CI [60, 77]), some were only made available with restrictions (20/111, 18%, 95% CI [10, 27]), and few were made available in a completely unrestricted form (15/111, 14%, 95% CI [5, 22]). Where extant data sharing systems were in place, they usually (17/22, 77%, 95% CI [54, 91]) did not allow unrestricted access. Authors reported several barriers to data sharing, including issues related to data ownership and ethical concerns. The Data Ark initiative could help preserve and liberate important scientific data, surface barriers to data sharing, and advance community discussions on data stewardship.

Facebook

Twitter

Click to copy link

Link copied

Cite

depositar (2025). Data Collaborations Across Boundaries (Slides) [Dataset]. https://data.depositar.io/dataset/data-collaborations-across-boundaries

Data Collaborations Across Boundaries (Slides)

Explore at:

pdf(4440122), pdf(10713394), pdf(1792282), pdf(1296859), pdf(3112569)Available download formats

Dataset updated

Jun 27, 2025

Dataset provided by

depositar

Description

This dataset collects the slides that were presented at the Data Collaborations Across Boundaries session in SciDataCon 2022, part of the International Data Week.

The following session proposal was prepared by Tyng-Ruey Chuang and submitted to SciDataCon 2022 organizers for consideration on 2022-02-28. The proposal was accepted on 2022-03-28. Six abstracts were submitted and accepted to this session. Five presentations were delivered online in a virtual session on 2022-06-21.

Data Collaborations Across Boundaries

There are many good stories about data collaborations across boundaries. We need more. We also need to share the lessons each of us has learned from collaborating with parties and communities not in our familiar circles.

By boundaries, we mean not just the regulatory borders in between the nation states about data sharing but the various barriers, readily conceivable or not, that hinder collaboration in aggregating, sharing, and reusing data for social good. These barriers to collaboration exist between the academic disciplines, between the economic players, and between the many user communities, just to name a few. There are also cross-domain barriers, for example those that lay among data practitioners, public administrators, and policy makers when they are articulating the why, what, and how of "open data" and debating its economic significance and fair distribution. This session aims to bring together experiences and thoughts on good data practices in facilitating collaborations across boundaries and domains.

The success of Wikipedia proves that collaborative content production and service, by ways of copyleft licenses, can be sustainable when coordinated by a non-profit and funded by the general public. Collaborative code repositories like GitHub and GitLab demonstrate the enormous value and mass scale of systems-facilitated integration of user contributions that run across multiple programming languages and developer communities. Research data aggregators and repositories such as GBIF, GISAID, and Zenodo have served numerous researchers across academic disciplines. Citizen science projects and platforms, for instance eBird, Galaxy Zoo, and Taiwan Roadkill Observation Network (TaiRON), not only collect data from diverse communities but also manage and release datasets for research use and public benefit (e.g. TaiRON datasets being used to improve road design and reduce animal mortality). At the same time large scale data collaborations depend on standards, protocols, and tools for building registries (e.g. Archival Resource Key), ontologies (e.g. Wikidata and schema.org), repositories (e.g. CKAN and Omeka), and computing services (e.g. Jupyter Notebook). There are many types of data collaborations. The above lists only a few.

This session proposal calls for contributions to bring forward lessons learned from collaborative data projects and platforms, especially about those that involve multiple communities and/or across organizational boundaries. Presentations focusing on the following (non-exclusive) topics are sought after:

Support mechanisms and governance structures for data collaborations across organizations/communities.
Data policies --- such as data sharing agreements, memorandum of understanding, terms of use, privacy policies, etc. --- for facilitating collaborations across organizations/communities.
Traditional and non-traditional funding sources for data collaborations across multiple parties; sustainability of data collaboration projects, platforms, and communities.
Data workflows --- collection, processing, aggregation, archiving, and publishing, etc. --- designed with considerations of (external) collaboration.
Collaborative web platforms for data acquisition, curation, analysis, visualization, and education.
Examples and insights from data trusts, data coops, as well as other formal and informal forms of data stewardship.
Debates on the pros and cons of centralized, distributed, and/or federated data services.
Practical lessons learned from data collaboration stories: failure, success, incidence, unexpected turn of event, aftermath, etc. (no story is too small!).

Clear search

Close search

Google apps

Main menu

Data Collaborations Across Boundaries (Slides)

Table_1_Data Management in Health-Related Research Involving Indigenous...

Most common data sharing systems used in health information networks Japan...

Collaboratory Data on Community Engagement & Public Service in Higher...

AmeriGEOSS Data Sharing

HydroShare and the Water Community, removing perceived barriers to data...

Data from: Sharing the Sun Community Solar Project Data

Replication Data for: The time efficiency gain in sharing and reuse of...

Scientists' Data Sharing Behaviors

Sharing research data and findings relevant to the novel coronavirus...

SciCrunch

CIROH: Enabling collaboration through data and model sharing with CUAHSI...

Data from: Public sharing of research datasets: a pilot study of...

DEPRECATED - Sharing the Sun Community Solar Project Data (June 2022)

Data from: Research data sharing: practices and attitudes of geophysicists

Changes in Data Sharing and Data Reuse Practices and Perceptions among...

Hydroshare and community data sharing tools

FCP Classic Data Sharing Samples

Data from: Building a Community-based Data Network for Geothermal Energy

Data from: Populating the Data Ark: An attempt to retrieve, preserve, and...

Data Collaborations Across Boundaries (Slides)