13 datasets found
  1. State of Open Data 2024: Springer Nature DAS analysis quantitative data

    • figshare.com
    xlsx
    Updated Nov 28, 2024
    Cite
    Graham Smith (2024). State of Open Data 2024: Springer Nature DAS analysis quantitative data [Dataset]. http://doi.org/10.6084/m9.figshare.27886320.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Graham Smith
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Raw data supporting the Springer Nature Data Availability Statement (DAS) analysis in the State of Open Data 2024. SOOD_2024_special_analysis_DAS_SN.xlsx contains the DAS, DOI, publication date, DAS categories and related country by institution of any author. SOOD 2024_DAS_analysis_sharing.xlsx contains the summary data by country and data sharing type.

    Using the Dimensions database, we identified articles containing key DAS identifiers such as “Data Availability Statement” or “Availability of Data and Materials” within their full text. Digital Object Identifiers (DOIs) of these articles were collected and matched against Springer Nature’s XML database to extract the DAS for each article. The extracted DAS were categorized into specific sharing types using text and data matching terms. For statements indicating that data are publicly available in a repository, we matched against a predefined list of repository identifiers, names, and URLs. The DAS were classified into the following categories:

    1. Data are available from the author on request.
    2. Data are included in the manuscript or its supplementary material.
    3. Some or all of the data are publicly available, for example in a repository.
    4. Figure source data are included with the manuscript.
    5. Data availability is not applicable.
    6. Data are declared as not available by the author.
    7. Data available online but not in a repository.

    These categories are non-exclusive: more than one can apply to any one article. Publications outside the 2019–2023 range and non-article publication types (e.g., book chapters) that were initially included in the Dimensions search results were excluded from the final dataset. Articles were included in the final analysis after applying the exclusion criteria. Upon processing, it was found that only 370 results were returned for Botswana across the five-year period; due to this low number, Botswana was not included in the DAS-focused country-level analysis.

    This analysis does not assess the accuracy of the DAS in the context of each individual article. There was no manual verification of the categories applied; as a result, terms used out of context could have led to misclassification. Approximately 5% of articles remained unclassified following text and data matching due to these limitations.
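The text-matching step described above can be sketched roughly as follows. This is only an illustration of non-exclusive, keyword-based categorization: the category names and regular expressions are hypothetical placeholders, since the actual matching terms and repository list used for the dataset are not given in this record.

```python
import re

# Hypothetical patterns for a few of the categories; the real matching
# terms used in the analysis are not published in this description.
CATEGORY_PATTERNS = {
    "on_request": r"\b(up)?on (reasonable )?request\b",
    "in_manuscript": r"\bsupplementary (material|information|files?)\b"
                     r"|\bwithin the (article|manuscript|paper)\b",
    "repository": r"\b(zenodo|figshare|dryad|osf\.io)\b|doi\.org/",
    "not_applicable": r"\bnot applicable\b",
}


def classify_das(statement: str) -> list[str]:
    """Return every category whose pattern occurs in the statement.

    Categories are non-exclusive, so the result may hold zero, one,
    or several labels; an empty list means the statement is unclassified.
    """
    text = statement.lower()
    return [name for name, pattern in CATEGORY_PATTERNS.items()
            if re.search(pattern, text)]
```

Because nothing is verified manually, a sketch like this shares the limitation noted above: a phrase such as "not applicable" used in another context would still trigger a match.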

  2. Data from: Data sharing in PLOS ONE: An analysis of Data Availability...

    • figshare.com
    txt
    Updated Feb 9, 2018
    Cite
    Lisa Federer (2018). Data sharing in PLOS ONE: An analysis of Data Availability Statements [Dataset]. http://doi.org/10.6084/m9.figshare.5690878.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Lisa Federer
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains Data Availability Statements from 47,593 papers published in PLOS ONE between March 2014 (when the policy went into effect) and May 2016, analyzed for type of statement.

  3. Data Availability Statement.

    • figshare.com
    docx
    Updated Feb 12, 2021
    Cite
    Olga Barros (2021). Data Availability Statement. [Dataset]. http://doi.org/10.6084/m9.figshare.13951607.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Olga Barros
    License

    CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    We analyzed all the samples using a stereomicroscope, an Olympus C011 trinocular microscope, coupled with a CCD camera. All the samples were measured and photographed with the Infinity Capture software. The drawing was improved with a drawing tablet (Parblo A610 graphic tablet) using the program ImageJ (public domain). The geographical location of the Araripe Basin was mapped using the software QGIS Geographic Information System (version 3.12, QGIS.org, public domain) with the coordinate system datum SIRGAS 2000, from Instituto Brasileiro de Geografia e Estatística (IBGE, Brazil) and Companhia de Pesquisa de Recursos Minerais (CPRM, Brazil). The stratigraphy of the Santana Group was drawn with ImageJ (public domain) according to the stratigraphy in Neumann & Cabreira, 1999 and Valença et al., 2003.

  4. Input datasets for UKRN Open Research Indicators Pilot 4 - Data Availability...

    • figshare.le.ac.uk
    xlsx
    Updated Mar 27, 2025
    Cite
    Radoslaw Pajor; Laurian Williamson; Valerie McCutcheon; Michael Eadie (2025). Input datasets for UKRN Open Research Indicators Pilot 4 - Data Availability Statements [Dataset]. http://doi.org/10.25392/leicester.data.28675934.v1
    Dataset provided by
    University of Leicester
    Authors
    Radoslaw Pajor; Laurian Williamson; Valerie McCutcheon; Michael Eadie
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Two examples of input data from the Universities of Glasgow and Leicester for the UKRN-led Open Research Indicators pilot 4. The overall aim of the pilot was to explore the co-creation of practical methods to monitor the prevalence of DAS in research articles and to assess the quality and usefulness of DAS.

  5. Data access statements: building trust through transparency - presentation...

    • data.ncl.ac.uk
    Updated May 16, 2025
    Cite
    Bogdan Metes; Beth Houlis (2025). Data access statements: building trust through transparency - presentation slides [Dataset]. http://doi.org/10.25405/data.ncl.29076068.v1
    Dataset provided by
    Newcastle University
    Authors
    Bogdan Metes; Beth Houlis
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This presentation was delivered on Thursday 13th February for Love Data Week 2025. It was presented by the research data librarians at Newcastle University and Northumbria University. Access the presentation slides in Northumbria University's repository.

    Data access statements are a cornerstone of responsible research, providing users with clear guidance on whether and how they can access the underlying research data that supports research findings. In the age of open science, these statements are more than just a funder requirement; they are an important tool for facilitating data sharing and ensuring reproducible research. By including a well-crafted data access statement in your publications, you demonstrate a commitment to transparency and rigour, helping to enhance your research profile and boost citations by fostering trust in your work.

    The session explored:

    • The principles and importance of data access statements in research.
    • Practical guidance on writing clear and impactful statements.
    • Real-world examples and common pitfalls to avoid.
    • Where to share research data.
    • Resources and support to simplify the process.

  6. Dataset #2: Experimental study

    • figshare.com
    docx
    Updated Jul 19, 2023
    Cite
    Adam Baimel (2023). Dataset #2: Experimental study [Dataset]. http://doi.org/10.6084/m9.figshare.23708766.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Adam Baimel
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Project Title: Add title here

    Project Team: Add contact information for research project team members

    Summary: Provide a descriptive summary of the nature of your research project and its aims/focal research questions.

    Relevant publications/outputs: When available, add links to the related publications/outputs from this data.

    Data availability statement: If your data is not linked on figshare directly, provide links to where it is being hosted here (i.e., Open Science Framework, Github, etc.). If your data is not going to be made publicly available, please provide details here as to the conditions under which interested individuals could gain access to the data and how to go about doing so.

    Data collection details: 1. When was your data collected? 2. How were your participants sampled/recruited?

    Sample information: How many and who are your participants? Demographic summaries are helpful additions to this section.

    Research Project Materials: What materials are necessary to fully reproduce the contents of your dataset? Include a list of all relevant materials (e.g., surveys, interview questions) with a brief description of what is included in each file that should be uploaded alongside your datasets.

    List of relevant datafile(s): If your project produces data that cannot be contained in a single file, list the names of each of the files here with a brief description of what parts of your research project each file is related to.

    Data codebook: What is in each column of your dataset? Provide variable names as they are encoded in your data files, verbatim question associated with each response, response options, details of any post-collection coding that has been done on the raw-response (and whether that's encoded in a separate column).

    Examples available at: https://www.thearda.com/data-archive?fid=PEWMU17 https://www.thearda.com/data-archive?fid=RELLAND14

  7. Referenzierung von Forschungsdatenpublikationen in RADAR

    • radar-service.eu
    tar
    Updated Mar 21, 2025
    Cite
    Dorothea Strecker (2025). Referenzierung von Forschungsdatenpublikationen in RADAR [Dataset]. http://doi.org/10.22000/fbhfgzy8d43r3tjw
    Available download formats: tar (78336 bytes)
    Dataset provided by
    Humboldt-Universität zu Berlin
    Authors
    Dorothea Strecker
    Description

    This dataset describes how datasets published in the research data repository RADAR are referenced, combining references extracted from Google Scholar, DataCite Event Data and the Data Citation Corpus.

    DOIs assigned to RADAR datasets were retrieved from the RADAR API 2025-01-27. References in the three data sources were then identified using these DOIs. Each research output referencing a RADAR dataset was accessed to determine where the reference occurred in the full text. Author names and publication dates for datasets and referencing objects were added from OpenAlex and DataCite on 2025-02-10. Author names of datasets and referencing objects were compared to determine if data reuse occurred.

    Columns

    • from: DOI of the referencing object
    • to: DOI of the RADAR dataset
    • from_date: publication date of the referencing object
    • to_date: publication date of the RADAR dataset
    • source_gs: boolean indicating if the reference was found in Google Scholar
    • source_dcc: boolean indicating if the reference was found in the Data Citation Corpus
    • source_ded: boolean indicating if the reference was found in DataCite Event Data
    • method_rl: boolean indicating if the dataset was referenced in the reference list
    • method_das: boolean indicating if the dataset was referenced in the data availability statement
    • method_fn: boolean indicating if the dataset was referenced in a footnote
    • method_ft: boolean indicating if the dataset was referenced in other parts of the full text, for example in the methods section
    • reuse_author: variable indicating whether the reference reflects data use (overlap in the author names of dataset and referencing object) or data reuse (no overlap)
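The author comparison behind reuse_author can be illustrated with a minimal sketch. It assumes that names are compared after simple lowercasing and whitespace normalization and that the result is encoded as the strings "use" and "reuse"; neither the normalization nor the encoding is stated in the dataset description, so both are hypothetical.

```python
def normalize(name: str) -> str:
    """Lowercase a name and collapse whitespace so trivial variants match."""
    return " ".join(name.lower().split())


def classify_reference(dataset_authors, referencing_authors) -> str:
    """Return "use" if any author appears on both objects, else "reuse".

    The labels are illustrative; the dataset may encode reuse_author
    differently.
    """
    dataset = {normalize(a) for a in dataset_authors}
    referencing = {normalize(a) for a in referencing_authors}
    return "use" if dataset & referencing else "reuse"
```

Exact string matching of this kind would miss name variants such as initials or transliterations, so a real comparison would likely need a more tolerant matching step.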
  8. Global suicide mortality rates (2000-2019) and bibliographic data

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jun 22, 2024
    Cite
    Erinija Pranckeviciene (2024). Global suicide mortality rates (2000-2019) and bibliographic data [Dataset]. http://doi.org/10.5281/zenodo.12267302
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Erinija Pranckeviciene
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jun 22, 2024
    Description

    The dataset contains World Bank Suicide mortality rate WDI (world development indicator) (2000-2019) world-wide data in original and processed form. In addition to the statistical data this dataset also contains bibliographic records of articles published on the topic of suicide in relation to individual countries during (2000-2019) in original and processed form.

    The data consists of six archives:

    1. World development indicator suicide mortality rate SH.STA.SUIC.P5. This archive contains suicide mortality rate of 159 countries during the period of 2000-2019 per 100,000 population including males and females as of November, 2023.
    2. Web of science records country and suicide. This archive contains bibliographic records organized by country on the topic of suicide related to that country published during 2000-2019 as of November, 2023.
    3. Suicide mortality rate statistics and keywords. This archive contains processed data from archives 1 and 2 in three files. The 'Countries suicide rates and WOS records' file contains temporal suicide mortality rate data for each country and each year for males and females, including counts of articles on suicide related to that country. The 'words and countries matrix' file records how many times author and paper keywords from suicide-related publications were seen in articles associated with each country. The data are organized as a matrix in which rows are keywords, columns are countries and cells are keyword counts. The 'words and countries pairs' file contains the same information organized as keyword-country pairs.
    4. Suicide mortality rate clusters countries keywords titles. This archive contains bibliographic data organized by country clusters. These clusters group countries with similar suicide mortality rate dynamics in males and females, shown in two included figures. Each cluster folder contains a section with bibliographic records; a section with keywords associated with each country; and a section in which each publication associated with the country has a separate file containing its title and keywords.
    5. Suicide keywords embedding data. This archive contains word embedding vectors and metadata learned by a recurrent neural network trained to classify countries from suicide-related keywords of articles associated with those countries. Folder 'trained with keywords' contains embeddings learned in classifying countries in which training samples are keyword strings of publications. Folder 'trained with titles' contains embeddings learned in classifying countries in which training samples are strings containing publication titles plus keywords.
    6. Suicide keywords association rule mining. This archive contains files of subsets of keywords frequently mentioned together in suicide-related publications. Folder 'Mining in clusters' has frequent keyword itemsets in country clusters. Folder 'Mining in individual countries' has frequent keyword itemsets in countries. Examples of keyword networks connecting clusters and networks connecting countries in individual clusters are included, which help to identify keywords that are specific to or shared by country clusters and by countries within individual clusters.
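The relationship between the 'words and countries matrix' and 'words and countries pairs' files in archive 3 can be sketched as below. The function name, the in-memory list layout, and the choice to skip zero counts are assumptions made for illustration; the actual file formats are not described here.

```python
def matrix_to_pairs(keywords, countries, counts):
    """Flatten a keyword-by-country count matrix into (keyword, country, count) rows.

    `counts[i][j]` is the number of times `keywords[i]` was seen in
    articles associated with `countries[j]`. Zero counts are skipped,
    which is an assumption about how the pairs file is laid out.
    """
    pairs = []
    for keyword, row in zip(keywords, counts):
        for country, count in zip(countries, row):
            if count > 0:
                pairs.append((keyword, country, count))
    return pairs
```

For a toy matrix with rows `prevention` and `risk` and columns `A` and `B`, the counts `[[2, 0], [1, 3]]` flatten to three pairs, one per non-zero cell.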

    These datasets support data availability statements for upcoming articles.

  9. Description of coding categories and example statements.

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Cite
    Lisa M. Federer; Christopher W. Belter; Douglas J. Joubert; Alicia Livinski; Ya-Ling Lu; Lissa N. Snyders; Holly Thompson (2023). Description of coding categories and example statements. [Dataset]. http://doi.org/10.1371/journal.pone.0194768.t001
    Dataset provided by
    PLOS (http://plos.org/)
    Authors
    Lisa M. Federer; Christopher W. Belter; Douglas J. Joubert; Alicia Livinski; Ya-Ling Lu; Lissa N. Snyders; Holly Thompson
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description of coding categories and example statements.

  10. Data from: A successful short-term volcanic eruption forecasting using...

    • zenodo.org
    • produccioncientifica.ugr.es
    bin, txt
    Updated Jul 25, 2022
    Cite
    Rey-Devesa, Pablo (1,2); Benitez Carmen (3); Prudencio, Janire (1,2); Gutiérrez, Ligdamis (1,2); Cortés, Guillermo (1,2); Manuel (3) Títos; Koulakov, Iván (4,5); Luciano (6) Zuccarello; Ibáñez, Jesús (1,2) (2022). A successful short-term volcanic eruption forecasting using seismic features: datasets and Sotware [Dataset]. http://doi.org/10.5281/zenodo.6821530
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Rey-Devesa, Pablo (1,2); Benitez Carmen (3); Prudencio, Janire (1,2); Gutiérrez, Ligdamis (1,2); Cortés, Guillermo (1,2); Manuel (3) Títos; Koulakov, Iván (4,5); Luciano (6) Zuccarello; Ibáñez, Jesús (1,2)
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Successful Short-Term Volcanic Eruption Forecasting Using Seismic Features, Supplementary Material

    by Rey-Devesa (1,2), Benítez (3), Prudencio, Ligdamis Gutiérrez (1,2), Cortés (1,2), Titos (3), Koulakov (4,5), Zuccarello (6) and Ibáñez (1,2).


    Institutions associated:

    (1) Department of Theoretical Physics and Cosmos. Science Faculty. Avd. Fuentenueva s/n. University of Granada. 18071. Granada. Spain.

    (2) Andalusian Institute of Geophysics. Campus de Cartuja. University of Granada. C/Profesor Clavera 12. 18071. Granada. Spain.

    (3) Department of Signal Theory, Telematics and Communication. University of Granada. Informatics and Telecommunication School. 18071. Granada. Spain.

    (4) Trofimuk Institute of Petroleum Geology and Geophysics SB RAS, Prospekt Koptyuga, 3, 630090 Novosibirsk, Russia

    (5) Institute of the Earth’s Crust SB RAS, Lermontova 128, Irkutsk, Russia

    (6) Istituto Nazionale di Geofisica e Vulcanologia, Sezione di Pisa (INGV-Pisa), via Cesare Battisti, 53, 56125, Pisa, Italy.


    Acknowledgment:

    This study was partially supported by the Spanish FEMALE project (PID2019-106260GB-I00).
    P. Rey-Devesa was funded by the Ministerio de Ciencia e Innovación del Gobierno de España (MCIN),
    Agencia Estatal de Investigación (AEI), Fondo Social Europeo (FSE),
    and Programa Estatal de Promoción del Talento y su Empleabilidad en I+D+I Ayudas para contratos predoctorales para la formación de doctores 2020 (PRE2020-092719).
    Ivan Koulakov was supported by the Russian Science Foundation (Grant No. 20-17-00075).
    Luciano Zuccarello was supported by the INGV Pianeta Dinamico 2021 Tema 8 SOME project (grant no. CUP D53J1900017001)
    funded by the Italian Ministry of University and Research
    “Fondo finalizzato al rilancio degli investimenti delle amministrazioni centrali dello Stato e allo sviluppo del Paese, legge 145/2018”.
    English language editing was performed by Tornillo Scientific, UK.


    Data availability statement:

    1.- Seismic data from Kilauea, Augustine, Bezymianny (2007), and Mount St. Helens are available from the IRIS data repository (http://ds.iris.edu/seismon/index.phtml).
    (An example of the Python code to access the data is described below.)
    2.- Seismic data from Bezymianny (2017-2018) are available from Ivan Koulakov (ivan.science@gmail.com) upon request.
    3.- Seismic data from Mt. Etna are available from INGV-Italy upon request (http://terremoti.ingv.it/en/help),
    also available from the Zenodo data repository (https://doi.org/10.5281/zenodo.6849621).

    Access code in Python to download the records of Kilauea, Augustine and Mount St. Helens volcanoes, from the IRIS data repository.

    To access the raw signals, first install ObsPy and then execute the following commands in a Python console.

    Example:

    from obspy.core import UTCDateTime
    from obspy.clients.fdsn import Client

    client = Client('IRIS')
    t1 = UTCDateTime('2006-01-10T00:00:00')
    t2 = UTCDateTime('2006-01-12T00:00:00')
    raw_data = client.get_waveforms(
        network='AV',
        station='AUH',
        location='',
        channel='HHZ',
        starttime=t1,
        endtime=t2)

    # To further download station information, execute:
    xml = client.get_stations(network='AV', station='AUH',
                              channel='HHZ', starttime=t1, endtime=t2, level='response')

    # To scale the data using the station's metadata:
    data = raw_data.remove_response(inventory=xml)

    # Save the response-corrected data as a MiniSEED file:
    data.write("Augustine.mseed", format="MSEED")

    # To filter, trim and plot the data, execute:
    data.filter('bandpass', freqmin=1.0, freqmax=20)
    data.trim(t1 + 60, t2 - 60)
    data.plot()

    Contents:

    6 different Matlab codes. The principal code is called FeatureExtraction.
    The codes rsac.m and ReadMSEEDFast.m are for reading different data formats. (Not developed by the group.)
    Seismic data from Mt. Etna for use as an example.

  11. Data Template for UKRN Research Indicators Pilot 4

    • figshare.com
    xlsx
    Updated Jul 3, 2024
    Cite
    Mick Eadie; Valerie McCutcheon; Radoslaw Pajor; Laurian Williamson (2024). Data Template for UKRN Research Indicators Pilot 4 [Dataset]. http://doi.org/10.6084/m9.figshare.26165794.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Mick Eadie; Valerie McCutcheon; Radoslaw Pajor; Laurian Williamson
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the template for datasets analysed as part of the United Kingdom Reproducibility Network (UKRN) Research Indicators Project, pilot 4: the prevalence and quality of data availability statements.

  12. Metadata and data files supporting the published article: The therapeutic...

    • springernature.figshare.com
    txt
    Updated Jun 1, 2023
    Cite
    François BERTUCCI; Pascal Finetti; Anthony Goncalves; Daniel Birnbaum (2023). Metadata and data files supporting the published article: The therapeutic response of ER+/HER2- breast cancers differs according to the molecular Basal or Luminal subtype [Dataset]. http://doi.org/10.6084/m9.figshare.11558676.v1
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    François BERTUCCI; Pascal Finetti; Anthony Goncalves; Daniel Birnbaum
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Here, the authors performed an in-silico analysis on a meta-dataset including gene-expression data from 5,342 clinically defined estrogen receptor-positive/human epidermal growth factor receptor 2-negative (ER+/HER2-) breast cancers (BC), and DNA copy number/mutational and proteomic data, to determine whether the therapeutic response of ER+/HER2- breast cancers differs according to the molecular basal or luminal subtype.

    Data access: The dataset Breast_cancer_classifications.csv supporting figure 1, table 1, and supplementary tables 1-3 is publicly available in the figshare repository as part of this data record. This study used and analysed 36 publicly available datasets that are all listed in Supplementary table 8 and are cited from the data availability statement of the published article.

    Study aims and methodology: To evaluate the response and/or potential vulnerability to hormone treatment (HT) and other systemic therapies of BC, and to assess the degree of difference between basal and luminal breast cancer subtypes, the authors performed an in-silico analysis of a meta-dataset including gene expression data from 8,982 non-redundant BCs and DNA copy number/mutational and proteomic data from TCGA. The aim was to compare the Basal versus Luminal samples. Out of the 8,982 samples of the database, 6,563 were defined as ER+ (5,342 according to immunohistochemistry (IHC) and 1,221 according to inferred status). The authors analysed breast cancer gene expression data pooled from 36 public datasets (the publicly available datasets are listed in supplementary table 8), comprising 8,982 invasive primary BCs. The pre-analytic data processing was done as described previously in https://doi.org/10.1038/s41416-018-0309-1. Please refer to the published article for more details on the methodology and statistical analysis.

    Data supporting the figures, tables and supplementary tables in the published article:

    Data supporting figure 1, table 1, and supplementary tables 1-3: Dataset Breast_cancer_classifications.csv is in .csv file format. The dataset includes histo-clinical and molecular data of the tumors analysed in the study, and is part of this data record.

    Data supporting supplementary table 4: Dataset genome.wustl.edu_BRCA.IlluminaGA_DNASeq.Level_2.3.2.0.tar.gz.1 is a gzip-compressed tar archive of MAF-format files. This dataset was accessed through the Genomic Data Commons (GDC) Data Portal and can be downloaded directly here: https://api.gdc.cancer.gov/data/afaf2790-04d4-453a-8c1b-75cf42ffd35f.

    Data supporting supplementary table 5: Dataset gdc_manifest.txt consists of gz archives of txt format files. The file was accessed through the GDC Data Portal here: https://portal.gdc.cancer.gov/repository?facetTab=files&filters={"op":"and","content":[{"op":"in","content":{"field":"cases.project.project_id","value":["TCGA-BRCA"]}},{"op":"in","content":{"field":"files.access","value":["open"]}},{"op":"in","content":{"field":"files.analysis.workflow_type","value":["HTSeq - Counts"]}},{"op":"in","content":{"field":"files.experimental_strategy","value":["RNA-Seq"]}}]}&searchTableTab=files

    Data supporting supplementary table 6: Dataset Table S5_Revised.xlsx is in .xlsx file format and is part of the supplementary information files of the published article.

    Data supporting supplementary table 7: Dataset BRCA.RPPA.Level_3.tar is a tar archive of txt format files. The file was accessed through the GDC Data Portal and can be downloaded directly here: https://api.gdc.cancer.gov/data/85988e1b-4f7d-493e-96ae-9eee61ac2833.
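The GDC Data Portal link for supplementary table 5 carries its file selection as a JSON filter in the query string. As a rough sketch of how that selection could be read back out, the snippet below parses such a URL with the Python standard library only. The function name is hypothetical, and the example URL is a shortened form of the one above, keeping two of its four filter clauses.

```python
import json
from urllib.parse import parse_qs, urlsplit


def extract_filter_values(portal_url: str) -> dict:
    """Map each filtered field in a GDC portal URL to its allowed values.

    Assumes the `filters` query parameter holds a top-level "and" clause
    whose content is a list of {"op": "in", "content": {...}} clauses,
    as in the URL quoted in this record.
    """
    query = parse_qs(urlsplit(portal_url).query)
    filters = json.loads(query["filters"][0])
    return {clause["content"]["field"]: clause["content"]["value"]
            for clause in filters["content"]}


# Shortened example: two of the four clauses from the URL in this record.
url = ('https://portal.gdc.cancer.gov/repository?facetTab=files&filters='
       '{"op":"and","content":['
       '{"op":"in","content":{"field":"cases.project.project_id","value":["TCGA-BRCA"]}},'
       '{"op":"in","content":{"field":"files.experimental_strategy","value":["RNA-Seq"]}}'
       ']}&searchTableTab=files')
selection = extract_filter_values(url)
```

Reading the filter this way makes it easier to record which project and data types a manifest was built from, independent of the portal interface.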

  13. EATRIS-Plus multi-omics data of a human reference cohort

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Mar 18, 2024
    Peter-Bram 't Hoen; Casper de Visser (2024). EATRIS-Plus multi-omics data of a human reference cohort [Dataset]. http://doi.org/10.5281/zenodo.10782800
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Peter-Bram 't Hoen; Casper de Visser
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Time period covered
    Mar 12, 2024
    Description

    In this reference study, blood samples of 127 healthy individuals were analyzed with a wide range of -omics technologies, resulting in the most comprehensive -omics profiling data set that is publicly available. The molecular measurements that are available here can be used as reference values for any future (multi-)omics studies. Along with phenotypic information (sex, age, BMI, etc., and measured cell type levels) on the healthy subjects, the following data types are included:

    • Targeted metabolomics (acylcarnitines, amino acids and very long chain fatty acids)
    • Lipidomics (negative and positive ionization modes)
    • Proteomics
    • mRNA-seq
    • miRNA-seq
    • miRNA qRT-PCR
    • Enzymatic methylation sequencing

    The pre-processed multi-omics data can be accessed here in the shape of a MultiAssayExperiment object (Ramos et al. 2017). Instructions on how to read the object into R can be found here: Read_MultiAssayExperiment.

    A similar object for Python (MuData) including the same data will be added later.

    DATA AVAILABILITY STATEMENT:

    Full data related to the EATRIS-Plus multiomic cohort are available in the ClinData repository (https://clindata.imtm.cz) and include full phenotypic information, physical and laboratory examinations, multiomic data from white blood cells (whole genome sequencing, enzymatic methylation DNA sequencing, mRNA sequencing, miRNA sequencing) or plasma (miRNA qPCR profiling, proteomics, targeted metabolomics, untargeted lipidomics, Raman spectroscopy profiling). However, access is restricted due to legal, ethical, scientific and/or commercial reasons. Access to the data is subject to approval and a data sharing transfer agreement. For data access please contact data.access@imtm.cz.
