Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains Data Availability Statements from 47,593 papers published in PLOS ONE between March 2014 (when the policy went into effect) and May 2016, analyzed for type of statement.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
General description
This dataset contains some markers of Open Science in the publications of the Chemical Biology Consortium Sweden (CBCS) between 2010 and July 2023. The sample of CBCS publications during this period consists of 188 articles. Every publication was visited manually at its DOI URL to answer the following questions.
1. Is the research article an Open Access publication?
2. Does the research article have a Creative Commons license or a similar license?
3. Does the research article contain a data availability statement?
4. Did the authors submit data of their study to a repository such as EMBL, Genbank, Protein Data Bank PDB, Cambridge Crystallographic Data Centre CCDC, Dryad or a similar repository?
5. Does the research article contain supplementary data?
6. Do the supplementary data have a persistent identifier that makes them citable as a defined research output?
Variables
The data were compiled in a Microsoft Excel 365 document that includes the following variables.
1. DOI URL of research article
2. Year of publication
3. Research article published with Open Access
4. License for research article
5. Data availability statement in article
6. Supplementary data added to article
7. Persistent identifier for supplementary data
8. Authors submitted data to NCBI or EMBL or PDB or Dryad or CCDC
Visualization
Parts of the data were visualized in two figures as bar diagrams using Microsoft Excel 365. The first figure displays the number of publications per year, the number of publications published with open access and the number of publications that contain a data availability statement (Figure 1). The second figure shows the number of publications per year and how many publications contain supplementary data. This figure also shows how many of the supplementary datasets have a persistent identifier (Figure 2).
File formats and software
The file formats used in this dataset are:
.csv (Text file)
.docx (Microsoft Word 365 file)
.jpg (JPEG image file)
.pdf/A (Portable Document Format for archiving)
.png (Portable Network Graphics image file)
.pptx (Microsoft Power Point 365 file)
.txt (Text file)
.xlsx (Microsoft Excel 365 file)
All files can be opened with Microsoft Office 365 and likely also work with the older versions Office 2019 and 2016.
MD5 checksums
Here is a list of all files of this dataset and of their MD5 checksums.
1. Readme.txt (MD5: 795f171be340c13d78ba8608dafb3e76)
2. Manifest.txt (MD5: 46787888019a87bb9d897effdf719b71)
3. Materials_and_methods.docx (MD5: 0eedaebf5c88982896bd1e0fe57849c2)
4. Materials_and_methods.pdf (MD5: d314bf2bdff866f827741d7a746f063b)
5. Materials_and_methods.txt (MD5: 26e7319de89285fc5c1a503d0b01d08a)
6. CBCS_publications_until_date_2023_07_05.xlsx (MD5: 532fec0bd177844ac0410b98de13ca7c)
7. CBCS_publications_until_date_2023_07_05.csv (MD5: 2580410623f79959c488fdfefe8b4c7b)
8. Data_from_CBCS_publications_until_date_2023_07_05_obtained_by_manual_collection.xlsx (MD5: 9c67dd84a6b56a45e1f50a28419930e5)
9. Data_from_CBCS_publications_until_date_2023_07_05_obtained_by_manual_collection.csv (MD5: fb3ac69476bfc57a8adc734b4d48ea2b)
10. Aggregated_data_from_CBCS_publications_until_2023_07_05.xlsx (MD5: 6b6cbf3b9617fa8960ff15834869f793)
11. Aggregated_data_from_CBCS_publications_until_2023_07_05.csv (MD5: b2b8dd36ba86629ed455ae5ad2489d6e)
12. Figure_1_CBCS_publications_until_2023_07_05_Open_Access_and_data_availablitiy_statement.xlsx (MD5: 9c0422cf1bbd63ac0709324cb128410e)
13. Figure_1.pptx (MD5: 55a1d12b2a9a81dca4bb7f333002f7fe)
14. Image_of_figure_1.jpg (MD5: 5179f69297fbbf2eaaf7b641784617d7)
15. Image_of_figure_1.png (MD5: 8ec94efc07417d69115200529b359698)
16. Figure_2_CBCS_publications_until_2023_07_05_supplementary_data_and_PID_for_supplementary_data.xlsx (MD5: f5f0d6e4218e390169c7409870227a0a)
17. Figure_2.pptx (MD5: 0fd4c622dc0474549df88cf37d0e9d72)
18. Image_of_figure_2.jpg (MD5: c6c68b63b7320597b239316a1c15e00d)
19. Image_of_figure_2.png (MD5: 24413cc7d292f468bec0ac60cbaa7809)
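The published MD5 checksums can be used to confirm that downloaded files are intact. The following is a minimal sketch in Python, not part of the dataset itself: the helper names `md5_of_file` and `verify_checksums` are hypothetical, the files are assumed to sit in the current directory, and only the first two entries of the list above are filled in.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_checksums(expected, directory="."):
    """Compare each file's MD5 digest with its expected value.

    Returns a dict mapping file name to "ok", "mismatch" or "missing".
    """
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum.lower():
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


# Two entries copied from the checksum list; extend with the remaining files.
EXPECTED = {
    "Readme.txt": "795f171be340c13d78ba8608dafb3e76",
    "Manifest.txt": "46787888019a87bb9d897effdf719b71",
}

if __name__ == "__main__":
    for name, status in verify_checksums(EXPECTED).items():
        print(f"{name}: {status}")
```

A file reported as "mismatch" was probably altered or truncated during download and should be fetched again. MD5 is adequate for detecting accidental corruption but is not a safeguard against deliberate tampering.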
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Categories used to classify the data availability statements.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
For this dataset, peer-reviewed scientific articles by Tampere University researchers from the years 2020 and 2021 were extracted from TUNICRIS. A random sample of 40 percent was taken from the 4,922 listed publications, stratified by faculty and year. In total, 2,085 articles were analyzed, i.e. more than 42 percent of the total number.
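A sample drawn "according to faculties and years" can be read as a stratified random sample. The sketch below illustrates that idea under stated assumptions and is not the authors' procedure: the function name `stratified_sample`, the record fields `faculty` and `year`, the fixed seed and the rounding up of each stratum size are all hypothetical, and the records are made up.

```python
import random
from collections import defaultdict


def stratified_sample(records, percent, key, seed=0):
    """Draw about `percent` percent of the records from every stratum.

    `key` maps a record to its stratum, e.g. (faculty, year). Stratum sizes
    are rounded up (an assumption), so the overall share is at least `percent`.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:
        strata[key(record)].append(record)

    sample = []
    for stratum in sorted(strata):
        group = strata[stratum]
        size = (len(group) * percent + 99) // 100  # integer ceiling
        sample.extend(rng.sample(group, size))
    return sample


# Made-up records for illustration only; not taken from TUNICRIS.
records = [
    {"id": i, "faculty": faculty, "year": year}
    for i, (faculty, year) in enumerate(
        (f, y) for f in ("A", "B") for y in (2020, 2021) for _ in range(10)
    )
]
sample = stratified_sample(records, 40, key=lambda r: (r["faculty"], r["year"]))
print(len(sample), "of", len(records))
```

For the figures reported here, 2,085 of 4,922 publications is roughly 42.4 percent. The source does not say how the per-stratum sizes were determined.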
To find Data Availability Statements (DAS), the articles were opened one by one and searched for mentions of research data and its availability. For each article, it was recorded whether a DAS existed and where in the article it was located. From the contents of each DAS, information about data availability, location, openness and possible restrictions on use was recorded.
The dataset also includes information about the journals and publications taken from TUNICRIS.
The prevalence of DAS and data openness were examined in relation to different variables. Tampere University faculty information has been removed from the dataset.
Related slides: https://doi.org/10.5281/zenodo.7655892
Related article (in Finnish): Toikko, T., & Kylmälä, K. (2023). Tutkimusdatan saatavuustiedot tieteellisissä artikkeleissa: Raportti Data Availability Statementien käytöstä Tampereen yliopistossa. Informaatiotutkimus, 42(1-2), 31–50. https://doi.org/10.23978/inf.126098
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Availability of resources by age of the associated paper.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The datasets generated and analyzed in the present study are subject to non-disclosure agreements signed with our collaborative institutions. These agreements explicitly restrict the public release of raw data to protect the proprietary information, technical details, and confidential research interests of the collaborating parties. Therefore, the full raw dataset cannot be made publicly available via open repositories. To support research reproducibility to the maximum extent permitted by the NDAs, de-identified summary data or aggregated statistical results may be provided to qualified researchers upon reasonable request. Interested parties should first contact the corresponding author and submit a formal application, which will then be reviewed jointly by our research team and the collaborative institutions to ensure compliance with the terms of the non-disclosure agreements.
Data Availability Statement for Article
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Assessment of whether researchers promising to make data available on publication actually do so, and whether this differs if researchers included a link to an embargoed repository or not.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Content and data source
This dataset contains the results of a manual analysis of Open Science markers in the publications of the Swedish Metabolomics Centre (SMC) between 2016 and 2024. It contains variables similar to those in the "Analysis of CBCS publications for Open Access, data availability statements and persistent identifiers for supplementary data" (Kieselbach, 2023).
The sample of these publications was fetched from SciLifeLab on 5 May 2025 at the URL: https://publications.scilifelab.se/label/Swedish Metabolomics Centre (SMC)
The sample contains 285 articles, which are the source data for this dataset. Every publication was manually visited at its DOI URL and checked for 23 variables.
Questions studied
Some of the questions that were addressed in the collection of the data are:
Does the article have an open license and what kind of license does it have?
Does the article contain research data that may have restricted access such as personal data and health data?
Does the article contain a data availability statement?
Does the article contain supplementary material that the authors added to it?
Does the supplementary material contain research data?
Does the supplementary material contain metabolomics data such as, for instance, summaries and visualizations?
Did the authors submit metabolomics data to MetaboLights at the EBI or to other repositories?
Did the authors submit other data to other repositories?
Is data available on request from the authors?
Visualization of data
The data was compiled and visualized using Microsoft Excel 365. The visualization includes one table that gives a general overview of the dataset, and four figures that show some results of the analysis.
Figure 1. Percentage of publications between 2016 and 2024 with an Open Access License and with a data availability statement.
Figure 2. Submissions to repositories between 2016 and 2024.
Figure 3. Percentage of publications that contained supplementary material and if this supplementary material contained research data and metabolomics data.
Figure 4. Repositories used by the authors between 2016 and 2024.
List of variables
1. Year of Publication (answer: year)
2. Date of Publication (answer: date)
3. DOI (answer: DOI)
4. DOI URL (answer: DOI URL)
5. Research article (answer: Yes or No)
6. Access to article without paywall (answer: Yes or No)
7. License for research article (answer: Name of the license or No)
8. Data with restricted access (answer: Yes or No)
9. Data availability statement in article (answer: Yes or No)
10. Supplementary material added to article (answer: Yes or No)
11. Access to supplementary material without paywall (answer: Yes or No)
12. Supplementary material contains research data (answer: Yes or No)
13. Supplementary data contains metabolomics data (answer: Yes or No)
14. Persistent identifier for supplementary data (answer: Yes or No)
15. Source data added to the article (answer: Yes or No)
16. Source data contain metabolomics data (answer: Yes or No)
17. Authors submitted metabolomics data to MetaboLights (answer: Yes or No)
18. Authors submitted metabolomics data to another repository (answer: name of the repository or No)
19. Authors submitted other data to a repository (answer: name of the repository or No)
20. Authors submitted other data to a second repository (answer: name of the repository or No)
21. Authors submitted other data to a third repository (answer: name of the repository or No)
22. Authors submitted code to a repository (answer: name of the repository or No)
23. Data available on request from the authors (answer: Yes or No)
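Yearly percentages such as those in Figure 1 can be derived from the Yes/No variables listed above. The sketch below shows one way to do this in Python rather than Excel. It assumes that the CSV column headers match the variable names exactly and that the file is comma-separated, neither of which is confirmed by the source; the function name `percent_yes_by_year` is hypothetical and the four sample rows are made up.

```python
import csv
import io
from collections import defaultdict


def percent_yes_by_year(rows, year_column, flag_column):
    """Return {year: percentage of rows whose flag column equals "Yes"}."""
    totals = defaultdict(int)
    yes_counts = defaultdict(int)
    for row in rows:
        year = row[year_column].strip()
        totals[year] += 1
        if row[flag_column].strip().lower() == "yes":
            yes_counts[year] += 1
    return {
        year: round(100.0 * yes_counts[year] / totals[year], 1)
        for year in sorted(totals)
    }


# Tiny made-up sample, NOT taken from the dataset.
SAMPLE_CSV = """Year of Publication,Data availability statement in article
2016,No
2016,Yes
2017,Yes
2017,Yes
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
shares = percent_yes_by_year(
    rows, "Year of Publication", "Data availability statement in article"
)
print(shares)  # {'2016': 50.0, '2017': 100.0}
```

To run this against the real analysis file, replace the `io.StringIO` sample with an opened file handle and check the header row first, since the actual column names may differ.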
Variables that are available in the source data
1. Title of article
2. Authors
3. Journal
4. Year
5. (Date) Published
6. (Date) E-published
7. Volume
8. Issue
9. Pages
10. DOI
11. PMID
12. Labels
13. Qualifiers
14. IUID
15. URL
16. DOI URL of research article
17. PubMed URL of research article
File formats and software
The file formats used in this dataset are:
.csv (Text file)
.jpg (JPEG image file)
.pdf/A (Portable Document Format for archiving)
.txt (Text file)
.xlsx (Microsoft Excel 365 file)
All files can be opened with Microsoft Office 365.
Reference
Kieselbach, Theresa (2023). Analysis of CBCS publications for Open Access, data availability statements and persistent identifiers for supplementary data. Umeå University. Dataset. https://doi.org/10.17044/scilifelab.23641749.v1
Abbreviations
CC BY 4.0: Creative Commons Attribution 4.0 International Public License
CC BY-NC 4.0: Creative Commons Attribution-NonCommercial 4.0 International Public License
CC BY-NC 3.0: Creative Commons Attribution-NonCommercial 3.0 International Public License
CC BY-NC-ND 4.0: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International Public License
DOI: Digital Object Identifier
EBI: European Bioinformatics Institute
EBI-ArrayExpress: The ArrayExpress collection of functional genomics data at the EBI
EBI-ENA: European Nucleotide Archive at the EBI
EBI-Pride: Proteomics Identification Database at the EBI
e!DAL: electronic Data Archive Library at the Leibniz Institute for Plant Genetics and Crop Plant Research
IUID: Item Unique identification
LUDC: Lund University Diabetes Centre
LUDC repository: data repository at the Lund University Diabetes Centre
NCBI: National Center for Biotechnology Information
NCBI-GEO: The Gene Expression Omnibus database repository at the NCBI
NCBI-SRA: The Sequence Read Archive at the NCBI
PMID: PubMed Identifier
URL: Uniform Resource Locator
MD5 Checksums of the files
Manifest.txt (2 KB): 89f32a728fb74ebecef0aef4633130b0
README.txt (6 KB): 34ea4ad9cb9bdea54755fa87f2d0b913
Analysis_SMC_publications_2016_2024_Open_Access_publication_and_access_to_data_status_2025_06_24.csv (46 KB): 9719df26381901bc6aabfd34fdbfab81
Analysis_SMC_publications_2016_2024_Open_Access_publication_and_access_to_data_status_2025_06_24.xlsx (49 KB): 1ec95dc29262645240e7d8714967bcfc
Table_1_Overview_SMC_publications_2016_2024_status_2025_06_11.csv (391 Bytes): 1fd723dc6f52f18251d41c0d343a4f0f
Table_1_Overview_SMC_publications_2016_2024_status_2025_06_11.xlsx (9 KB): 38622a9681c6f1057a6e1a4be56b0285
Figure_1_SMC_publications_2016_2024_open_access_license_and_data_availability_status_2025_06_11.csv (468 Bytes): 9f9156f8d52603ccdec968f626bc002a
Figure_1_SMC_publications_2016_2024_open_access_license_and_data_availability_status_2025_06_11.jpg (119 KB): dc9a4d7de4c789e8aea46ce66e007301
Figure_1_SMC_publications_2016_2024_open_access_license_and_data_availability_status_2025_06_11.xlsx (15 KB): 6527d1ebd0069ef3757bd1b049f0fc74
Figure_2_SMC_publications_2016_2024_metabolomics_data_and_other_data_to_repositories_status_2024_06_12.csv (300 Bytes): 5abc4a0fcf776f8dc4745f41deddacbc
Figure_2_SMC_publications_2016_2024_metabolomics_data_and_other_data_to_repositories_status_2024_06_12.jpg (126 KB): e03e5bf4ba2d942c3b022aebb0a59033
Figure_2_SMC_publications_2016_2024_metabolomics_data_and_other_data_to_repositories_status_2024_06_12.xlsx (15 KB): a80f977c051d4798db221b07733c694b
Figure_3_SMC_publications_2016_2024_overview_supplementary_data_status_2025_06_11.csv (670 Bytes): a694a3defa98aa52fcdec8ff9e9e3316
Figure_3_SMC_publications_2016_2024_overview_supplementary_data_status_2025_06_11.jpg (153 KB): 3928bdc1f046ca9b6f66bdbcdf936ca8
Figure_3_SMC_publications_2016_2024_overview_supplementary_data_status_2025_06_11.xlsx (15 KB): 46dfda56b116b571b4bf8e3674b44512
Figure_4_SMC_publications_2016_2024_submission_of_data_to_repositories_status_2025_06_12.csv (498 Bytes): 8963a412cc9e458ced2e80883bb93e1a
Figure_4_SMC_publications_2016_2024_submission_of_data_to_repositories_status_2025_06_12.jpg (137 KB): c9ba447225e99431f24732128a754b7e
Figure_4_SMC_publications_2016_2024_submission_of_data_to_repositories_status_2025_06_12.xlsx (16 KB): 1e2813d3ccb0ee14991b276947c21b8a
Materials_and_methods_SMC_publications_2016_2024.docx (19 KB): 71776ffc1e530e1b40255763403b2f40
Materials_and_methods_SMC_publications_2016_2024.txt (4 KB): 26c4b91b958b9e33d93d13dc52b25da9
Materials_and_methods_SMC_publications_2026_2024.pdf (172 KB): eee564f452ef4f3cf57bb81a6874fcd4
SMC_publications_2016_2024_status_2025_05_05.csv (143 KB): 5e61d09244ca90b1e5b057a7afdfe5e7
SMC_publications_2016_2024_status_2025_05_05.xlsx (106 KB): 6977fbcac21ff5a12763e40de90c0a91
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Change in openness of data availability statements from preprint to published article, grouped by journal data-sharing policy.
Data from field work (crop vegetation in the urban gardens) and the data used for the development of the ANNs, which support the reported results
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This is a Data Availability Statement for a paper whose results are being published.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Data Availability Statement of paper (Increasing landslide deformation and activity in a changing environment).
https://spdx.org/licenses/CC0-1.0.html
The data underlying scientific papers should be accessible to researchers both now and in the future, but how best can we ensure that these data are available? Here we examine the effectiveness of four approaches to data archiving: no stated archiving policy, recommending (but not requiring) archiving, and two versions of mandating data deposition at acceptance. We control for differences between data types by trying to obtain data from papers that use a single, widespread population genetic analysis, STRUCTURE. At one extreme, we found that mandated data archiving policies that require the inclusion of a data availability statement in the manuscript improve the odds of finding the data online almost 1000-fold compared to having no policy. However, archiving rates at journals with less stringent policies were only very slightly higher than those with no policy at all. We also assessed the effectiveness of asking for data directly from authors and obtained over half of the requested datasets, albeit with ∼8 d delay and some disagreement with authors. Given the long-term benefits of data accessibility to the academic community, we believe that journal-based mandatory data archiving policies and mandatory data availability statements should be more widely adopted.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Data availability statement for the article "Practical Considerations for Quantitative Light Sheet Fluorescence Microscopy".
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
These data were used to identify the sediment source areas following rainfall events, primarily based on observations from the Chishui River hydrological station, the Linkou meteorological station, and ERA5 rainfall data.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The data, Excel files, and information and results derived from the Arena Simulation software used in this study, including the database, tools, and engineering methodologies, are available for review. A set of files has been compiled containing all the information necessary to replicate the analyses and results presented in this article. These files are available to promote academic transparency and facilitate verification of the methods employed.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Most frequently appearing URLs with resource name and count.
DataDoP
Data Availability Statement
We are committed to maintaining transparency and compliance in our data collection and sharing methods. Please note the following:
Publicly Available Data: The data utilized in our studies is publicly available. We do not use any exclusive or private data sources.
Data Sharing Policy: Our data sharing policy aligns with precedents set by prior works, such as InternVid, Panda-70M, and Miradata. Rather than providing the original raw… See the full description on the dataset page: https://huggingface.co/datasets/Dubhe-zmc/DataDoP.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This FAIRsharing record describes: The Type 2 policy encourages data sharing and evidence of data sharing. The policy text includes information on the Springer Nature list of recommended repositories; introductory text on data citations; information on writing a data availability statement; and a link to the Springer Nature Research Data Helpdesk.