100+ datasets found
  1. Downstream Analysis

    • figshare.com
    zip
    Updated Dec 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lisa Blankenhagen (2024). Downstream Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.27619956.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 6, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Lisa Blankenhagen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data for the downstream analysis. One zip file contains the results of the TF-Prioritizer run, and the other contains the input files for the ORA. These are cancer-related GO terms, tissue-related GO terms for the cell lines used, and mutated gene lists for the different tissues.

  2. Impact of Library Preparation on Downstream Analysis and Interpretation of...

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    tiff
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhifu Sun; Yan W. Asmann; Asha Nair; Yuji Zhang; Liguo Wang; Krishna R. Kalari; Aditya V. Bhagwate; Tiffany R. Baker; Jennifer M. Carr; Jean-Pierre A. Kocher; Edith A. Perez; E. Aubrey Thompson (2023). Impact of Library Preparation on Downstream Analysis and Interpretation of RNA-Seq Data: Comparison between Illumina PolyA and NuGEN Ovation Protocol [Dataset]. http://doi.org/10.1371/journal.pone.0071745
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zhifu Sun; Yan W. Asmann; Asha Nair; Yuji Zhang; Liguo Wang; Krishna R. Kalari; Aditya V. Bhagwate; Tiffany R. Baker; Jennifer M. Carr; Jean-Pierre A. Kocher; Edith A. Perez; E. Aubrey Thompson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ObjectivesThe sequencing by the PolyA selection is the most common approach for library preparation. With limited amount or degraded RNA, alternative protocols such as the NuGEN have been developed. However, it is not yet clear how the different library preparations affect the downstream analyses of the broad applications of RNA sequencing.Methods and MaterialsEight human mammary epithelial cell (HMEC) lines with high quality RNA were sequenced by Illumina’s mRNA-Seq PolyA selection and NuGEN ENCORE library preparation. The following analyses and comparisons were conducted: 1) the numbers of genes captured by each protocol; 2) the impact of protocols on differentially expressed gene detection between biological replicates; 3) expressed single nucleotide variant (SNV) detection; 4) non-coding RNAs, particularly lincRNA detection; and 5) intragenic gene expression.ResultsSequences from the NuGEN protocol had lower (75%) alignment rate than the PolyA (over 90%). The NuGEN protocol detected fewer genes (12–20% less) with a significant portion of reads mapped to non-coding regions. A large number of genes were differentially detected between the two protocols. About 17–20% of the differentially expressed genes between biological replicates were commonly detected between the two protocols. Significantly higher numbers of SNVs (5–6 times) were detected in the NuGEN samples, which were largely from intragenic and intergenic regions. The NuGEN captured fewer exons (25% less) and had higher base level coverage variance. While 6.3% of reads were mapped to intragenic regions in the PolyA samples, the percentages were much higher (20–25%) for the NuGEN samples. The NuGEN protocol did not detect more known non-coding RNAs such as lincRNAs, but targeted small and “novel” lincRNAs.ConclusionDifferent library preparations can have significant impacts on downstream analysis and interpretation of RNA-seq data. The NuGEN provides an alternative for limited or degraded RNA but it has limitations for some RNA-seq applications.

  3. Z

    [Dataset] Advanced Single Cell Analysis tutorial - Complete downstream...

    • data.niaid.nih.gov
    Updated Mar 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Soraggi, Samuele; Andersen, Stig Uggerhøj; Fechete, Lavinia Ioana; Tedeschi, Francesca; Frank, Manuel (2024). [Dataset] Advanced Single Cell Analysis tutorial - Complete downstream analysis across conditions [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10782589
    Explore at:
    Dataset updated
    Mar 7, 2024
    Dataset provided by
    Aarhus University
    BiRC (Bioinformatics Research Center, Aarhus University)
    Authors
    Soraggi, Samuele; Andersen, Stig Uggerhøj; Fechete, Lavinia Ioana; Tedeschi, Francesca; Frank, Manuel
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Datasets and metadata used for the full streamline analysis of plant data under different conditions of infection. The tutorial is an example of analysis which can be useful in multiple scenario where comparisons are needed (healthy and sick patients, for example). You can find the tutorial at our website https://hds-sandbox.github.io/AdvancedSingleCell

    Usage notes:

    all files are ready to use, except for control1.tar.gz which is a folder that needs to be decompressed

  4. Downstream analysis results.

    • figshare.com
    bin
    Updated Apr 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jiaqi Zhang (2024). Downstream analysis results. [Dataset]. http://doi.org/10.6084/m9.figshare.25602672.v1
    Explore at:
    binAvailable download formats
    Dataset updated
    Apr 15, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jiaqi Zhang
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Model predictions and experiment results for downstream analysis in the updated scNODE paper (https://www.biorxiv.org/content/10.1101/2023.11.22.568346v2).

  5. f

    Data from: MS-DAP Platform for Downstream Data Analysis of Label-Free...

    • datasetcatalog.nlm.nih.gov
    • acs.figshare.com
    Updated Dec 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Klaassen, Remco V.; Li, Ka Wan; Koopmans, Frank; Smit, August B. (2022). MS-DAP Platform for Downstream Data Analysis of Label-Free Proteomics Uncovers Optimal Workflows in Benchmark Data Sets and Increased Sensitivity in Analysis of Alzheimer’s Biomarker Data [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000274261
    Explore at:
    Dataset updated
    Dec 21, 2022
    Authors
    Klaassen, Remco V.; Li, Ka Wan; Koopmans, Frank; Smit, August B.
    Description

    In the rapidly moving proteomics field, a diverse patchwork of data analysis pipelines and algorithms for data normalization and differential expression analysis is used by the community. We generated a mass spectrometry downstream analysis pipeline (MS-DAP) that integrates both popular and recently developed algorithms for normalization and statistical analyses. Additional algorithms can be easily added in the future as plugins. MS-DAP is open-source and facilitates transparent and reproducible proteome science by generating extensive data visualizations and quality reporting, provided as standardized PDF reports. Second, we performed a systematic evaluation of methods for normalization and statistical analysis on a large variety of data sets, including additional data generated in this study, which revealed key differences. Commonly used approaches for differential testing based on moderated t-statistics were consistently outperformed by more recent statistical models, all integrated in MS-DAP. Third, we introduced a novel normalization algorithm that rescues deficiencies observed in commonly used normalization methods. Finally, we used the MS-DAP platform to reanalyze a recently published large-scale proteomics data set of CSF from AD patients. This revealed increased sensitivity, resulting in additional significant target proteins which improved overlap with results reported in related studies and includes a large set of new potential AD biomarkers in addition to previously reported.

  6. f

    Downstream Analysis Scripts and Metadata Amplicon Sequencing - Grey Box...

    • open.flinders.edu.au
    • researchdata.edu.au
    txt
    Updated Aug 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nicole Fickling (2025). Downstream Analysis Scripts and Metadata Amplicon Sequencing - Grey Box Grassy Woodlands [Dataset]. http://doi.org/10.25451/flinders.29848643.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 8, 2025
    Dataset provided by
    Flinders University
    Authors
    Nicole Fickling
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Downstream analysis scripts and metadata for the paper titled "Habitat fragmentation shifts soil microbial composition but not richness". All analysis done in R v4.4.2.See "variable descriptions" tab in the metadata file for details.

  7. D

    Downstream Bioprocessing Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Downstream Bioprocessing Report [Dataset]. https://www.datainsightsmarket.com/reports/downstream-bioprocessing-589799
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Feb 10, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The downstream bioprocessing market is projected to grow from USD 18.7 billion in 2023 to USD 32.7 billion by 2028, at a CAGR of 14.3% during the forecast period. The growth of the market is primarily attributed to the increasing demand for biopharmaceuticals, the rising prevalence of chronic diseases, and the growing adoption of advanced technologies in downstream bioprocessing. Additionally, supportive government initiatives, such as tax incentives and grants, are creating a conducive environment for the growth of the market. Key drivers of the downstream bioprocessing market include the rising demand for biopharmaceuticals, the increasing prevalence of chronic diseases, the growing adoption of advanced technologies, and supportive government initiatives. However, the market is also facing certain challenges, such as the high cost of downstream bioprocessing and the regulatory complexities associated with biopharmaceutical production. The largest segment of the downstream bioprocessing market is purification, which accounted for over 40% of the market in 2023. This segment is expected to continue to grow at a steady pace due to the increasing demand for high-purity biopharmaceuticals. The major players in the downstream bioprocessing market include Danaher, Eppendorf, GE Healthcare, Parker Hannifin, and Thermo Fisher Scientific. These companies offer a wide range of products and services for downstream bioprocessing, including equipment, consumables, and software.

  8. Summary of single-cell sequencing datasets.

    • plos.figshare.com
    xls
    Updated Jun 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jun Seo Ha; Hyundoo Jeong (2023). Summary of single-cell sequencing datasets. [Dataset]. http://doi.org/10.1371/journal.pone.0284527.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 21, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jun Seo Ha; Hyundoo Jeong
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Recent advances in single-cell sequencing techniques have enabled gene expression profiling of individual cells in tissue samples so that it can accelerate biomedical research to develop novel therapeutic methods and effective drugs for complex disease. The typical first step in the downstream analysis pipeline is classifying cell types through accurate single-cell clustering algorithms. Here, we describe a novel single-cell clustering algorithm, called GRACE (GRaph Autoencoder based single-cell Clustering through Ensemble similarity larning), that can yield highly consistent groups of cells. We construct the cell-to-cell similarity network through the ensemble similarity learning framework, and employ a low-dimensional vector representation for each cell through a graph autoencoder. Through performance assessments using real-world single-cell sequencing datasets, we show that the proposed method can yield accurate single-cell clustering results by achieving higher assessment metric scores.

  9. f

    Additional file 5 of circRNAprofiler: an R-based computational framework for...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Apr 30, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aufiero, Simona; Creemers, Esther E.; Pinto, Yigal M.; Tijsen, Anke J.; Reckman, Yolan J. (2020). Additional file 5 of circRNAprofiler: an R-based computational framework for the downstream analysis of circular RNAs [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000462790
    Explore at:
    Dataset updated
    Apr 30, 2020
    Authors
    Aufiero, Simona; Creemers, Esther E.; Pinto, Yigal M.; Tijsen, Anke J.; Reckman, Yolan J.
    Description

    Additional file 5. GWAS SNPs analysis. This file contains the results of GWAS SNPs analysis on the intron flanking BSJs.

  10. f

    Additional file 4 of circRNAprofiler: an R-based computational framework for...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Apr 30, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pinto, Yigal M.; Creemers, Esther E.; Tijsen, Anke J.; Aufiero, Simona; Reckman, Yolan J. (2020). Additional file 4 of circRNAprofiler: an R-based computational framework for the downstream analysis of circular RNAs [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000462789
    Explore at:
    Dataset updated
    Apr 30, 2020
    Authors
    Pinto, Yigal M.; Creemers, Esther E.; Tijsen, Anke J.; Aufiero, Simona; Reckman, Yolan J.
    Description

    Additional file 4. miRNA binding sites analysis. This file contains the results of the miRNA binding sites analysis in the ALPK2 circRNA sequence.

  11. 4

    Scripts and data for the paper: Consequences and opportunities arising due...

    • data.4tu.nl
    Updated Oct 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerard Bouland; Marcel Reinders; Ahmed Mahfouz (2024). Scripts and data for the paper: Consequences and opportunities arising due to sparser single-cell RNA-seq datasets [Dataset]. http://doi.org/10.4121/424eea7a-cce9-4dbb-b6ef-e5b47e132410.v1
    Explore at:
    Dataset updated
    Oct 15, 2024
    Dataset provided by
    4TU.ResearchData
    Authors
    Gerard Bouland; Marcel Reinders; Ahmed Mahfouz
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Scripts and data for the paper: Consequences and opportunities arising due to sparser single-cell RNA-seq datasets


    With the number of cells measured in single-cell RNA sequencing (scRNA-seq) datasets increasing exponentially and concurrent increased sparsity due to more zero counts being measured for many genes, we demonstrate here that downstream analyses on binary-based gene expression give similar results as count-based analyses. Moreover, a binary representation scales up to ~ 50-fold more cells that can be analyzed using the same computational resources. We also highlight the possibilities provided by binarized scRNA-seq data. Development of specialized tools for bit-aware implementations of downstream analytical tasks will enable a more fine-grained resolution of biological heterogeneity.

  12. Comparison of the methods’ differentially expressed genes’ AUC score for the...

    • plos.figshare.com
    xls
    Updated Mar 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lily Monnier; Paul-Henry Cournède (2024). Comparison of the methods’ differentially expressed genes’ AUC score for the simulated datasets (all versions of Dataset 3 and Dataset 4). [Dataset]. http://doi.org/10.1371/journal.pcbi.1011880.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Mar 5, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Lily Monnier; Paul-Henry Cournède
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of the methods’ differentially expressed genes’ AUC score for the simulated datasets (all versions of Dataset 3 and Dataset 4).

  13. Nanopolish eventalign outputs for downstream analysis of yeast RNA and...

    • zenodo.org
    txt
    Updated Feb 5, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laura K. White; Laura K. White (2023). Nanopolish eventalign outputs for downstream analysis of yeast RNA and synthetic oligonucleotides [Dataset]. http://doi.org/10.5281/zenodo.7604091
    Explore at:
    txtAvailable download formats
    Dataset updated
    Feb 5, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Laura K. White; Laura K. White
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains a set of text files from running the tool Nanopolish eventalign on several nanopore direct RNA sequencing data sets produced by Jay Hesselberth's lab at the University of Colorado (BioProject accession number PRJNA910992), as well as external sequencing data sets from PMID: 34893601 (synthetic oligonucleotides from Leger et al) and PMID: 35252946 (yeast rRNA data from Stephenson et al). These files can be used as inputs to the R markdown documents at https://github.com/hesselberthlab/RNARePore to reproduce the figures in the associated manuscript.

  14. Data from: Preprocessing of Public RNA-sequencing Datasets to Facilitate...

    • zenodo.org
    • data.niaid.nih.gov
    bin, zip
    Updated May 14, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Naomi Rapier Sharman; John Krapohl; Ethan Beausoleil; Kennedy Gifford; Ben Hinatsu; Curtis Hoffman; Makayla Komer; Tiana M. Scott; Brett E. Pickett; Naomi Rapier Sharman; John Krapohl; Ethan Beausoleil; Kennedy Gifford; Ben Hinatsu; Curtis Hoffman; Makayla Komer; Tiana M. Scott; Brett E. Pickett (2021). Preprocessing of Public RNA-sequencing Datasets to Facilitate Downstream Analyses of Human Diseases: Dataset [Dataset]. http://doi.org/10.5281/zenodo.4757764
    Explore at:
    zip, binAvailable download formats
    Dataset updated
    May 14, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Naomi Rapier Sharman; John Krapohl; Ethan Beausoleil; Kennedy Gifford; Ben Hinatsu; Curtis Hoffman; Makayla Komer; Tiana M. Scott; Brett E. Pickett; Naomi Rapier Sharman; John Krapohl; Ethan Beausoleil; Kennedy Gifford; Ben Hinatsu; Curtis Hoffman; Makayla Komer; Tiana M. Scott; Brett E. Pickett
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Publicly available RNA-sequencing (RNA-seq) data are a rich resource for elucidating the mechanisms of human disease; however, preprocessing these data requires considerable bioinformatic expertise and computational infrastructure. Analyzing multiple datasets with a consistent computational workflow increases the accuracy of downstream meta-analyses. This collection of datasets represents the human intracellular transcriptional response to disorders and diseases such as acute lymphoblastic leukemia (ALL), B-cell lymphomas, chronic obstructive pulmonary disease (COPD), colorectal cancer, lupus erythematosus; as well as infection with pathogens including Borrelia burgdorferi, hantavirus, influenza A virus, Middle East respiratory syndrome coronavirus (MERS-CoV), Streptococcus pneumoniae, respiratory syncytial virus (RSV), severe acute respiratory syndrome coronavirus (SARS-CoV), and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). We calculated the statistically significant differentially expressed genes and Gene Ontology (GO) terms for all datasets. In addition, a subset of the datasets also include results from splice variant analyses, intracellular signaling pathway enrichments as well as read mapping and quantification. All analyses were performed using well-established algorithms and are provided to facilitate future data mining activities, wet lab studies, and to accelerate collaboration and discovery.

  15. r

    Single-use Downstream Market Size, Share, Trends, Forecast (2035)

    • rootsanalysis.com
    Updated Aug 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roots Analysis (2025). Single-use Downstream Market Size, Share, Trends, Forecast (2035) [Dataset]. https://www.rootsanalysis.com/reports/single-use-downstream-bioprocessing-market.html
    Explore at:
    Dataset updated
    Aug 1, 2025
    Dataset authored and provided by
    Roots Analysis
    License

    https://www.rootsanalysis.com/privacy.htmlhttps://www.rootsanalysis.com/privacy.html

    Description

    The single use downstream bioprocessing market is likely to grow from USD 1.54 bn in 2024 to USD 1.79 bn in 2025 and USD 6.75 bn by 2035, representing a CAGR of 14.2%

  16. e

    Downstream Processing Market Size, Trend, Demand Analysis till 2032

    • emergenresearch.com
    pdf,excel,csv,ppt
    Updated Feb 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Emergen Research (2024). Downstream Processing Market Size, Trend, Demand Analysis till 2032 [Dataset]. https://www.emergenresearch.com/industry-report/downstream-processing-market
    Explore at:
    pdf,excel,csv,pptAvailable download formats
    Dataset updated
    Feb 13, 2024
    Dataset authored and provided by
    Emergen Research
    License

    https://www.emergenresearch.com/privacy-policyhttps://www.emergenresearch.com/privacy-policy

    Area covered
    Global
    Variables measured
    Base Year, No. of Pages, Growth Drivers, Forecast Period, Segments covered, Historical Data for, Pitfalls Challenges, 2032 Value Projection, Tables, Charts, and Figures, Forecast Period 2023 - 2032 CAGR, and 1 more
    Description

    The global Downstream Processing Market size is expected to reach USD 121.51 Billion in 2032 registering a CAGR of 15.0%. Our report provides a comprehensive overview of the industry, including key players, market share, growth opportunities and more.

  17. Comparison of the methods based on the clustering metrics computed on the...

    • plos.figshare.com
    xls
    Updated Mar 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lily Monnier; Paul-Henry Cournède (2024). Comparison of the methods based on the clustering metrics computed on the full datasets. [Dataset]. http://doi.org/10.1371/journal.pcbi.1011880.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Mar 5, 2024
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Lily Monnier; Paul-Henry Cournède
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of the methods based on the clustering metrics computed on the full datasets.

  18. c

    Data from: Effects of transcriptional noise on estimates of gene and...

    • datacommons.cyverse.org
    Updated 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ales Varabyou (2020). Effects of transcriptional noise on estimates of gene and transcript expression in RNA sequencing experiments [Dataset]. http://doi.org/10.25739/v903-wd86
    Explore at:
    Dataset updated
    2020
    Dataset provided by
    CyVerse Data Commons
    Authors
    Ales Varabyou
    License

    ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
    License information was derived automatically

    Description

    RNA sequencing is widely used to measure gene expression across a vast range of animal and plant tissues and conditions. Most studies of computational methods for gene expression analysis use simulated data to evaluate the accuracy of these methods. In this work we present a dataset of 3 tissues each containing 10 samples of simulated short RNA-seq reads across 4 types of transcription characterized from the GTEx dataset. For each type of transcription: 1) known isoforms; 2) splicing noise; 3) intronic noise; 4) intergenic noise -we provide sets of reads in CRAM format along with corresponding expression matrices and annotations in the GTF format for downstream analysis. A copy of GRCh.38 used in our analysis is also provided along with the simulated data. Further details on the structure of each file is provided in the accompanying README document.

  19. scRNA-seq human embryonic stem H1, H9 cell lines

    • kaggle.com
    zip
    Updated Jul 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Chervov (2021). scRNA-seq human embryonic stem H1, H9 cell lines [Dataset]. https://www.kaggle.com/alexandervc/scrnaseq-human-embryonic-stem-h1-h9-cell-lines
    Explore at:
    zip(65064956 bytes)Available download formats
    Dataset updated
    Jul 28, 2021
    Authors
    Alexander Chervov
    Description

    Remark: for cell cycle analysis - see paper https://arxiv.org/abs/2208.05229 "Computational challenges of cell cycle analysis using single cell transcriptomics" Alexander Chervov, Andrei Zinovyev

    Data and Context

    Data - results of single cell RNA sequencing, i.e. rows - correspond to cells, columns to genes (csv file is vice versa). value of the matrix shows how strong is "expression" of the corresponding gene in the corresponding cell. https://en.wikipedia.org/wiki/Single-cell_transcriptomics

    Particular data from: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE76381 There are original TXT files and reconversion to *.h5ad format which is more easy to work with. There are several subdatasets human/mouse/different cell types.

    Paper: SCnorm: robust normalization of single-cell RNA-seq data https://pubmed.ncbi.nlm.nih.gov/28418000/ Bacher R, Chu LF, Leng N, Gasch AP et al. SCnorm: robust normalization of single-cell RNA-seq data. Nat Methods 2017 Jun;14(6):584-586

    Abstract: The normalization of RNA-seq data is essential for accurate downstream inference, but the assumptions upon which most normalization methods are based are not applicable in the single-cell setting. Consequently, applying existing normalization methods to single-cell RNA-seq data introduces artifacts that bias downstream analyses. To address this, we introduce SCnorm for accurate and efficient normalization of single-cell RNA-seq data.

    Total 183 single cells (92 H1 cells, 91 H9 cells), sequenced twice, were used to evaluate SCnorm in normalizing single cell RNA-seq experiments. Total 48 bulk H1 samples were used to compare bulk and single cell properties. For single-cell RNA-seq, the identical single-cell indexed and fragmented cDNA were pooled at 96 cells per lane or at 24 cells per lane to test the effects of sequencing depth, resulting in approximately 1 million and 4 million mapped reads per cell in the two pooling groups, respectively.

    Inspiration

    Single cell RNA sequencing is important technology in modern biology, see e.g. "Eleven grand challenges in single-cell data science" https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-1926-6

    Also see review : Nature. P. Kharchenko: "The triumphs and limitations of computational methods for scRNA-seq" https://www.nature.com/articles/s41592-021-01171-x

  20. d

    Global Genome Analysis of the Downstream Binding Targets of Testis...

    • datamed.org
    Updated Apr 16, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Global Genome Analysis of the Downstream Binding Targets of Testis Determining Factor SRY and SOX9 [SRY ChIP] [Dataset]. https://datamed.org/display-item.php?repository=0008&id=5914e4775152c67771b558df&query=SRY
    Explore at:
    Dataset updated
    Apr 16, 2019
    Description

    A major event in mammalian male sex determination is the induction of the testis determining factor Sry and its downstream gene Sox9. The current study provides one of the first genome wide analyses of the downstream gene binding targets for SRY and SOX9 to help elucidate the molecular control of Sertoli cell differentiation and testis development. A modified ChIP-Chip analysis using a comparative hybridization was used to identify 71 direct downstream binding targets for SRY and 109 binding targets for SOX9. Interestingly, only 5 gene targets overlapped between SRY and SOX9. In addition to the direct response element binding gene targets, a large number of atypical binding gene targets were identified for both SRY and SOX9. Bioinformatic analysis of the downstream binding targets identified gene networks and cellular pathways potentially involved in the induction of Sertoli cell differentiation and testis development. The specific DNA sequence binding site motifs for both SRY and SOX9 were identified. Observations provide insights into the molecular control of male gonadal sex determination. Overall design: The current study provides one of the first genome wide analyses of the downstream gene binding targets for SRY and SOX9 to help elucidate the molecular control of Sertoli cell differentiation and testis development. At embryonic day 13 (E13) of pregnancy rats were euthanized and embryonic gonads were collected for chromatin. A modified ChIP-Chip analysis using a comparative hybridization was used to identify direct downstream binding targets for SRY and for SOX9. Then, bioinformatic analysis of the downstream binding targets was done to identify gene networks and cellular pathways that are potentially involved in the induction of Sertoli cell differentiation and testis development.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Lisa Blankenhagen (2024). Downstream Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.27619956.v1
Organization logoOrganization logo

Downstream Analysis

Explore at:
zipAvailable download formats
Dataset updated
Dec 6, 2024
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Lisa Blankenhagen
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Data for the downstream analysis. One zip file contains the results of the TF-Prioritizer run, and the other contains the input files for the ORA. These are cancer-related GO terms, tissue-related GO terms for the cell lines used, and mutated gene lists for the different tissues.

Search
Clear search
Close search
Google apps
Main menu