2 datasets found
  1. f

    Data_Sheet_2_STATegra: Multi-Omics Data Integration – A Conceptual Scheme...

    • frontiersin.figshare.com
    pdf
    Updated Jun 1, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nuria Planell; Vincenzo Lagani; Patricia Sebastian-Leon; Frans van der Kloet; Ewoud Ewing; Nestoras Karathanasis; Arantxa Urdangarin; Imanol Arozarena; Maja Jagodic; Ioannis Tsamardinos; Sonia Tarazona; Ana Conesa; Jesper Tegner; David Gomez-Cabrero (2023). Data_Sheet_2_STATegra: Multi-Omics Data Integration – A Conceptual Scheme With a Bioinformatics Pipeline.pdf [Dataset]. http://doi.org/10.3389/fgene.2021.620453.s002
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    Frontiers
    Authors
    Nuria Planell; Vincenzo Lagani; Patricia Sebastian-Leon; Frans van der Kloet; Ewoud Ewing; Nestoras Karathanasis; Arantxa Urdangarin; Imanol Arozarena; Maja Jagodic; Ioannis Tsamardinos; Sonia Tarazona; Ana Conesa; Jesper Tegner; David Gomez-Cabrero
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Technologies for profiling samples using different omics platforms have been at the forefront since the human genome project. Large-scale multi-omics data hold the promise of deciphering different regulatory layers. Yet, while there is a myriad of bioinformatics tools, each multi-omics analysis appears to start from scratch with an arbitrary decision over which tools to use and how to combine them. Therefore, it is an unmet need to conceptualize how to integrate such data and implement and validate pipelines in different cases. We have designed a conceptual framework (STATegra), aiming it to be as generic as possible for multi-omics analysis, combining available multi-omic anlaysis tools (machine learning component analysis, non-parametric data combination, and a multi-omics exploratory analysis) in a step-wise manner. While in several studies, we have previously combined those integrative tools, here, we provide a systematic description of the STATegra framework and its validation using two The Cancer Genome Atlas (TCGA) case studies. For both, the Glioblastoma and the Skin Cutaneous Melanoma (SKCM) cases, we demonstrate an enhanced capacity of the framework (and beyond the individual tools) to identify features and pathways compared to single-omics analysis. Such an integrative multi-omics analysis framework for identifying features and components facilitates the discovery of new biology. Finally, we provide several options for applying the STATegra framework when parametric assumptions are fulfilled and for the case when not all the samples are profiled for all omics. The STATegra framework is built using several tools, which are being integrated step-by-step as OpenSource in the STATegRa Bioconductor package.1

  2. Datasets for the integration of MT data with magnetic inversion:...

    • zenodo.org
    • researchdata.edu.au
    zip
    Updated Oct 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jérémie Giraud; Jérémie Giraud; Hoël Seillé; Hoël Seillé (2023). Datasets for the integration of MT data with magnetic inversion: proof-of-concept using synthetic data and field application [Dataset]. http://doi.org/10.5281/zenodo.6340159
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 28, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jérémie Giraud; Jérémie Giraud; Hoël Seillé; Hoël Seillé
    Description

    This dataset is a companion dataset to the manuscript "Utilisation of probabilistic MT inversions to constrain magnetic data inversion: proof-of-concept and field application", by Jérémie Giraud, Hoël Seillé, Mark D. Lindsay, Gerhard Visser, Vitaliy Ogarko, and Mark W. Jessel.

    It contains models and data shown in the paper.

    The document was submitted for publication in Solid Earth:
    https://se.copernicus.org/preprints/se-2021-124/se-2021-124-manuscript-version2.pdf

    The folder organisation is as follows, where bold refers to folders and subfolders, and text in italic corresponds to a succinct description of the contents.

    |-- Dataset_synth > synthetic dataset and results
    | |-- Mag > magnetic data and models: inversion and results
    | | |-- domains > contains the files used to define domains for inversion
    | | |-- inversion results > contains subfolders with inversion results for different cases
    | | | |-- case a
    | | | |-- case b
    | | | |-- case c
    | | | |-- case d
    | | | |-- case e
    | | | |-- case f
    | | |-- membership values > contain files with membership values for the cases (a)-(f)
    | | |-- responses > simulated data with and without noise
    | | |-- true model > true model used for simulation
    | |-- MT > MT probabilities, sites information and models
    | | |-- probabilities > MT-derived probabilities
    | | |-- responses > simulated MT data
    | | | |-- edi_noise_5p > 5% noise-contaminated data (*.edi files)
    | | | |-- MansfieldMT_fwd.dat > uncontaminated (ModEM format *.dat file)
    | | |-- model > model in ModEM format (*.mod) and WinGLink format (.out) formats
    | | |-- coordinates.txt > location of MT sites
    | |-- Rock units > contains the file with indices of the rock unit model, in 3D, of the modified Mansfield model. The indices are stored as a column vector.

    |-- Dataset_field > field dataset
    | |-- Mag > magnetic data and models: inversion and results
    | | |-- admm constraints > file with bound constraints used in cases 3, 4, and adjusted case 4.
    | | |-- data > magnetic data for inversion, x, y, z, data column format.
    | | |-- inverted models > folders containing inversion results for the different cases tested
    | | | |-- case 1
    | | | |-- case 2
    | | | |-- case 3
    | | | |-- case 4
    | | | |-- case 4 adjusted
    | |-- MT > MT probabilities and sites information
    | | |-- probabilities > MT probabilities of interface and rock units (*.txt files)
    | | |-- coord_L26_sites > File containing the location of sites along ligne L26

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Nuria Planell; Vincenzo Lagani; Patricia Sebastian-Leon; Frans van der Kloet; Ewoud Ewing; Nestoras Karathanasis; Arantxa Urdangarin; Imanol Arozarena; Maja Jagodic; Ioannis Tsamardinos; Sonia Tarazona; Ana Conesa; Jesper Tegner; David Gomez-Cabrero (2023). Data_Sheet_2_STATegra: Multi-Omics Data Integration – A Conceptual Scheme With a Bioinformatics Pipeline.pdf [Dataset]. http://doi.org/10.3389/fgene.2021.620453.s002

Data_Sheet_2_STATegra: Multi-Omics Data Integration – A Conceptual Scheme With a Bioinformatics Pipeline.pdf

Related Article
Explore at:
pdfAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
Frontiers
Authors
Nuria Planell; Vincenzo Lagani; Patricia Sebastian-Leon; Frans van der Kloet; Ewoud Ewing; Nestoras Karathanasis; Arantxa Urdangarin; Imanol Arozarena; Maja Jagodic; Ioannis Tsamardinos; Sonia Tarazona; Ana Conesa; Jesper Tegner; David Gomez-Cabrero
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Technologies for profiling samples using different omics platforms have been at the forefront since the human genome project. Large-scale multi-omics data hold the promise of deciphering different regulatory layers. Yet, while there is a myriad of bioinformatics tools, each multi-omics analysis appears to start from scratch with an arbitrary decision over which tools to use and how to combine them. Therefore, it is an unmet need to conceptualize how to integrate such data and implement and validate pipelines in different cases. We have designed a conceptual framework (STATegra), aiming it to be as generic as possible for multi-omics analysis, combining available multi-omic anlaysis tools (machine learning component analysis, non-parametric data combination, and a multi-omics exploratory analysis) in a step-wise manner. While in several studies, we have previously combined those integrative tools, here, we provide a systematic description of the STATegra framework and its validation using two The Cancer Genome Atlas (TCGA) case studies. For both, the Glioblastoma and the Skin Cutaneous Melanoma (SKCM) cases, we demonstrate an enhanced capacity of the framework (and beyond the individual tools) to identify features and pathways compared to single-omics analysis. Such an integrative multi-omics analysis framework for identifying features and components facilitates the discovery of new biology. Finally, we provide several options for applying the STATegra framework when parametric assumptions are fulfilled and for the case when not all the samples are profiled for all omics. The STATegra framework is built using several tools, which are being integrated step-by-step as OpenSource in the STATegRa Bioconductor package.1

Search
Clear search
Close search
Google apps
Main menu