100+ datasets found
  1. RNA-Seq data files

    • figshare.com
    application/gzip
    Updated Mar 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jo Lynne Rokita (2019). RNA-Seq data files [Dataset]. http://doi.org/10.6084/m9.figshare.7751825.v5
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Mar 24, 2019
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jo Lynne Rokita
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RNA expression matrices - FPKM from STAR, cufflinks and TPM from STAR, RSEM, Toil. Fusion results from deFuse, SOAPfuse, fusioncatcher, STAR-Fusion.

  2. f

    RNA-seq data.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Sep 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jerome, Keith R.; Vallejo-Gracia, Albert; Ott, Melanie; Soveg, Frank W.; Schilling, Birgit; Shah, Samah; Hayashi, Jennifer M.; Gross, John D.; Cruz, Andrew; Verdin, Eric; Krogan, Nevan J.; Chen, Irene P.; Kim, Ik-Jung; Lam, Victor L.; Bielska, Olga; Walter, Marius (2022). RNA-seq data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000386244
    Explore at:
    Dataset updated
    Sep 12, 2022
    Authors
    Jerome, Keith R.; Vallejo-Gracia, Albert; Ott, Melanie; Soveg, Frank W.; Schilling, Birgit; Shah, Samah; Hayashi, Jennifer M.; Gross, John D.; Cruz, Andrew; Verdin, Eric; Krogan, Nevan J.; Chen, Irene P.; Kim, Ik-Jung; Lam, Victor L.; Bielska, Olga; Walter, Marius
    Description

    RNA-seq results of WT and SIRT5-KO A549-ACE2 cells infected or mock-infected with SARS-CoV-2 for 3 days at MOI = 0.1. First tab show normalized counts and log2 fold change (l2fc) between the different condition. Tabs 2–4 show results of gene ontology analysis between the different conditions. (XLSX)

  3. RNA-Seq-data

    • zenodo.org
    bin
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anna Syme; Anna Syme (2020). RNA-Seq-data [Dataset]. http://doi.org/10.5281/zenodo.1311269
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anna Syme; Anna Syme
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RNA-seq data for analysing differential gene expression. Data from bacteria (E. coli) and subsampled to 1% or original data size. Six FASTQ files and two reference files (genome sequence and annotations).

  4. f

    Data from: Collection of pearl millet RNA sequencing data

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated May 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tsugama, Daisuke; Kambara, Kota (2024). Collection of pearl millet RNA sequencing data [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001306716
    Explore at:
    Dataset updated
    May 23, 2024
    Authors
    Tsugama, Daisuke; Kambara, Kota
    Description

    Pearl millet (Pennisetum glaucum, also known as Cenchrus americanus) is a C4 cereal crop that can tolerate stressed conditions including drought-stressed, high temperature-stressed and nutrient-poor conditions. Transcriptomes of pearl millet were studied by RNA sequencing (RNA-Seq) to understand mechanisms regulating its development and tolerance to such stressed conditions in previous studies. We collected RNA-Seq reads from as many of such studies in the NCBI (National Center for Biotechnology Information) BioProject database as popssible, and mapped them to the pearl millet reference genome to obtain read counts and transcripts per million (TPM) for each pearl millet gene. Here, the resulting count and TPM data as well as the attributes of the samples used for the RNA-Seq are provided. These data can be updated when a new study with RNA-Seq of pearl millet samples has become available.

  5. Comparison of alternative approaches for analysing multi-level RNA-seq data

    • plos.figshare.com
    pdf
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Irina Mohorianu; Amanda Bretman; Damian T. Smith; Emily K. Fowler; Tamas Dalmay; Tracey Chapman (2023). Comparison of alternative approaches for analysing multi-level RNA-seq data [Dataset]. http://doi.org/10.1371/journal.pone.0182694
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Irina Mohorianu; Amanda Bretman; Damian T. Smith; Emily K. Fowler; Tamas Dalmay; Tracey Chapman
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RNA sequencing (RNA-seq) is widely used for RNA quantification in the environmental, biological and medical sciences. It enables the description of genome-wide patterns of expression and the identification of regulatory interactions and networks. The aim of RNA-seq data analyses is to achieve rigorous quantification of genes/transcripts to allow a reliable prediction of differential expression (DE), despite variation in levels of noise and inherent biases in sequencing data. This can be especially challenging for datasets in which gene expression differences are subtle, as in the behavioural transcriptomics test dataset from D. melanogaster that we used here. We investigated the power of existing approaches for quality checking mRNA-seq data and explored additional, quantitative quality checks. To accommodate nested, multi-level experimental designs, we incorporated sample layout into our analyses. We employed a subsampling without replacement-based normalization and an identification of DE that accounted for the hierarchy and amplitude of effect sizes within samples, then evaluated the resulting differential expression call in comparison to existing approaches. In a final step to test for broader applicability, we applied our approaches to a published set of H. sapiens mRNA-seq samples, The dataset-tailored methods improved sample comparability and delivered a robust prediction of subtle gene expression changes. The proposed approaches have the potential to improve key steps in the analysis of RNA-seq data by incorporating the structure and characteristics of biological experiments.

  6. u

    Data from: Reference transcriptomics of porcine peripheral immune cells...

    • agdatacommons.nal.usda.gov
    • datasets.ai
    • +2more
    zip
    Updated Nov 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juber Herrera-Uribe; Jayne Wiarda; Sathesh K. Sivasankaran; Lance Daharsh; Haibo Liu; Kristen A. Byrne; Timothy P. L. Smith; Joan K. Lunney; Crystal L. Loving; Christopher K. Tuggle (2025). Data from: Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing [Dataset]. http://doi.org/10.15482/USDA.ADC/1522411
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    Ag Data Commons
    Authors
    Juber Herrera-Uribe; Jayne Wiarda; Sathesh K. Sivasankaran; Lance Daharsh; Haibo Liu; Kristen A. Byrne; Timothy P. L. Smith; Joan K. Lunney; Crystal L. Loving; Christopher K. Tuggle
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset contains files reconstructing single-cell data presented in 'Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing' by Herrera-Uribe & Wiarda et al. 2021. Samples of peripheral blood mononuclear cells (PBMCs) were collected from seven pigs and processed for single-cell RNA sequencing (scRNA-seq) in order to provide a reference annotation of porcine immune cell transcriptomics at enhanced, single-cell resolution. Analysis of single-cell data allowed identification of 36 cell clusters that were further classified into 13 cell types, including monocytes, dendritic cells, B cells, antibody-secreting cells, numerous populations of T cells, NK cells, and erythrocytes. Files may be used to reconstruct the data as presented in the manuscript, allowing for individual query by other users. Scripts for original data analysis are available at https://github.com/USDA-FSEPRU/PorcinePBMCs_bulkRNAseq_scRNAseq. Raw data are available at https://www.ebi.ac.uk/ena/browser/view/PRJEB43826. Funding for this dataset was also provided by NRSP8: National Animal Genome Research Program (https://www.nimss.org/projects/view/mrp/outline/18464). Resources in this dataset:Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells 10X Format. File Name: PBMC7_AllCells.zipResource Description: Zipped folder containing PBMC counts matrix, gene names, and cell IDs. Files are as follows:

    matrix of gene counts* (matrix.mtx.gx) gene names (features.tsv.gz) cell IDs (barcodes.tsv.gz)

    *The ‘raw’ count matrix is actually gene counts obtained following ambient RNA removal. During ambient RNA removal, we specified to calculate non-integer count estimations, so most gene counts are actually non-integer values in this matrix but should still be treated as raw/unnormalized data that requires further normalization/transformation. Data can be read into R using the function Read10X().Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells Metadata. File Name: PBMC7_AllCells_meta.csvResource Description: .csv file containing metadata for cells included in the final dataset. Metadata columns include:

    nCount_RNA = the number of transcripts detected in a cell nFeature_RNA = the number of genes detected in a cell Loupe = cell barcodes; correspond to the cell IDs found in the .h5Seurat and 10X formatted objects for all cells prcntMito = percent mitochondrial reads in a cell Scrublet = doublet probability score assigned to a cell seurat_clusters = cluster ID assigned to a cell PaperIDs = sample ID for a cell celltypes = cell type ID assigned to a cellResource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells PCA Coordinates. File Name: PBMC7_AllCells_PCAcoord.csvResource Description: .csv file containing first 100 PCA coordinates for cells. Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells t-SNE Coordinates. File Name: PBMC7_AllCells_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for all cells.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells UMAP Coordinates. File Name: PBMC7_AllCells_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for all cells.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells t-SNE Coordinates. File Name: PBMC7_CD4only_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells UMAP Coordinates. File Name: PBMC7_CD4only_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells UMAP Coordinates. File Name: PBMC7_GDonly_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells t-SNE Coordinates. File Name: PBMC7_GDonly_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gene Annotation Information. File Name: UnfilteredGeneInfo.txtResource Description: .txt file containing gene nomenclature information used to assign gene names in the dataset. 'Name' column corresponds to the name assigned to a feature in the dataset.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells H5Seurat. File Name: PBMC7.tarResource Description: .h5Seurat object of all cells in PBMC dataset. File needs to be untarred, then read into R using function LoadH5Seurat().

  7. n

    BioXpress

    • neuinfo.org
    • dknet.org
    • +1more
    Updated Oct 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). BioXpress [Dataset]. http://identifiers.org/RRID:SCR_014191
    Explore at:
    Dataset updated
    Oct 4, 2024
    Description

    BioXpress is a gene expression and cancer association database in which the expression levels are mapped to genes using RNA-seq data obtained from The Cancer Genome Atlas, International Cancer Genome Consortium, Expression Atlas and publications. BioXpress can be searched by gene name or cancer type. To search the database by gene name, select the appropriate identifier type from the dropdown menu and type in the corresponding identifier in the adjacent text box. The results are computed and presented to the user with information such as variable expression levels and tumor expression. To search by cancer type, select the desired type from the dropdown menu, such as "Cancer Type", "Significant", "Expression", "Adjusted p-value" and "p-value". Results are shown in a graph displaying the top 10 differentially expressed genes for the specified cancer type in terms of the frequency of significant altered expression between the tumor and normal pairs.

  8. S

    Small RNA-seq and RNA-seq data

    • scidb.cn
    Updated Apr 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Qiu (2024). Small RNA-seq and RNA-seq data [Dataset]. http://doi.org/10.57760/sciencedb.18146
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 17, 2024
    Dataset provided by
    Science Data Bank
    Authors
    Yang Qiu
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Small RNA-seqLibraries of small RNAs were constructed using TruSeq Small RNA Library Preparation Kits (Illumina) according to manufacturer’s protocols, and then sequenced by Illumina HiSeq 2000 at Bioacme (Wuhan, China). Quality control and adapter trimming process were performed using Trim Galore (v0.6.4) with default parameters, and reads length between 17 and 70 were used for subsequent analyses. The sequenced reads were aligned to the human genome (GRCh38) by using Bowtie2 (v2.3.5.1). Reads that cannot be aligned to the human genome (GRCh38) were then aligned to agshRNAs. The sequences and lengths of reads aligned to agshRNA were then determined using an in-house script. Reads mapped to genome were further categorized as miRNA, tRNA, snoRNA etc. by using htseq-count (v0.11.2). Annotation file was downloaded from DASHR 2.0 (https://dashr2.lisanwanglab.org/). RNA-seqLibraries of total RNAs were constructed using MGIEasy RNA Library Preparation Kits (BGI) according to manufacturer’s protocols, and then sequenced by MGISEQ2000 (BGI) at Wuhan Institute of Virology, CAS. Quality control and adapter trimming were performed using Trim Galore (v0.6.4) with default parameter. All reads were mapped to the human genome (GRCh38) using Hisat2 (v2.1.0) and then annotated using htseq-count (v0.11.2). Annotation file was downloaded from NCBI. Annotated reads number were transformed into count per million reads (CPM). Statistical significance was evaluated via using unpaired t test and adjusted using Bonferroni correction. Different expression genes (DEGs) were defined by |log2FC| > 1 and P.adj < 0.05.

  9. m

    Investigating Highly Variable Genes in Single-cell RNA-seq Data across...

    • data.mendeley.com
    Updated May 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jantarika Kumar Arora (2023). Investigating Highly Variable Genes in Single-cell RNA-seq Data across Multiple Cell Types and Conditions [Dataset]. http://doi.org/10.17632/6ry3x7r8hf.3
    Explore at:
    Dataset updated
    May 16, 2023
    Authors
    Jantarika Kumar Arora
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The peripheral blood immune cell (PBMC) samples were collected from patients infected with dengue virus (DENV) at four time points: two and one day(s) before defervescence (febrile phase), at defervescence (critical phase), and two-week convalescence. The raw and filtered matrix files were generated using CellRanger version 3.0.2 (10x Genomics, USA) with the reference human genome GRCh38 1.2.0. Potential contamination of ambient RNAs was corrected using SoupX. Low quality cells, including cells expressing mitochondrial genes higher than 10% and doublets/multiplets, were excluded using Seurat and doubletFinder, respectively. The individual samples were then integrated using the SCTransform method with 3,000 gene features. Principal component analysis (PCA) and clustering were performed with the Louvain algorithm applying multi-level refinement algorithm. The gene expression level of each cell was normalized using the LogNormalize method in Seurat. Cell types were annotated using the canonical marker genes described in the original paper, see related link below.

  10. scRNA-seq + scATAC-seq Challenge at NeurIPS 2021

    • kaggle.com
    zip
    Updated Sep 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Chervov (2022). scRNA-seq + scATAC-seq Challenge at NeurIPS 2021 [Dataset]. https://www.kaggle.com/datasets/alexandervc/scrnaseq-scatacseq-challenge-at-neurips-2021
    Explore at:
    zip(2917180928 bytes)Available download formats
    Dataset updated
    Sep 16, 2022
    Authors
    Alexander Chervov
    Description

    Context

    Dataset from NeurIPS2021 challenge similar to Kaggle 2022 competition: https://www.kaggle.com/competitions/open-problems-multimodal "Open Problems - Multimodal Single-Cell Integration Predict how DNA, RNA & protein measurements co-vary in single cells"

    It is https://en.wikipedia.org/wiki/ATAC-seq#Single-cell_ATAC-seq single cell ATAC-seq data. And single cell RNA-seq data: https://en.wikipedia.org/wiki/Single-cell_transcriptomics#Single-cell_RNA-seq

    Single cell RNA sequencing, i.e. rows - correspond to cells, columns to genes (or vice versa). value of the matrix shows how strong is "expression" of the corresponding gene in the corresponding cell. https://en.wikipedia.org/wiki/Single-cell_transcriptomics

    See tutorials: https://scanpy.readthedocs.io/en/stable/tutorials.html ("Scanpy" - main Python package to work with scRNA-seq data). Or https://satijalab.org/seurat/ "Seurat" - "R" package

    (For companion dataset on CITE-seq = scRNA-seq + Proteomics, see: https://www.kaggle.com/datasets/alexandervc/citeseqscrnaseqproteins-challenge-neurips2021)

    Particular data

    https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE194122

    Expression profiling by high throughput sequencing Genome binding/occupancy profiling by high throughput sequencing Summary Single-cell multiomics data collected from bone marrow mononuclear cells of 12 healthy human donors. Half the samples were measured using the 10X Multiome Gene Expression and Chromatin Accessability kit and half were measured using the 10X 3' Single-Cell Gene Expression kit with Feature Barcoding in combination with the BioLegend TotalSeq B Universal Human Panel v1.0. The dataset was generated to support Multimodal Single-Cell Data Integration Challenge at NeurIPS 2021. Samples were prepared using a standard protocol at four sites. The resulting data was then annotated to identify cell types and remove doublets. The dataset was designed with a nested batch layout such that some donor samples were measured at multiple sites with some donors measured at a single site. In the competition, participants were tasked with challenges including modality prediction, matching profiles from different modalities, and learning a joint embedding from multiple modalities.

    Overall design Single-cell multiomics data collected from bone marrow mononuclear cells of 12 healthy human donors.

    Contributor(s) Burkhardt DB, Lücken MD, Lance C, Cannoodt R, Pisco AO, Krishnaswamy S, Theis FJ, Bloom JM Citation https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/158f3069a435b314a80bdcb024f8e422-Abstract-round2.html

    Related datasets:

    Other single cell RNA seq datasets can be found on kaggle: Look here: https://www.kaggle.com/alexandervc/datasets Or search kaggle for "scRNA-seq"

    Inspiration

    Single cell RNA sequencing is important technology in modern biology, see e.g. "Eleven grand challenges in single-cell data science" https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-1926-6

    Also see review : Nature. P. Kharchenko: "The triumphs and limitations of computational methods for scRNA-seq" https://www.nature.com/articles/s41592-021-01171-x

    Search scholar.google "challenges in single cell rna sequencing" https://scholar.google.fr/scholar?q=challenges+in+single+cell+rna+sequencing&hl=en&as_sdt=0&as_vis=1&oi=scholart gives many interesting and highly cited articles

    (Cited 968) Computational and analytical challenges in single-cell transcriptomics Oliver Stegle, Sarah A. Teichmann, John C. Marioni Nat. Rev. Genet., 16 (3) (2015), pp. 133-145 https://www.nature.com/articles/nrg3833

  11. Gene Expression Cancer RNA-Seq

    • kaggle.com
    zip
    Updated May 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alban NYANTUDRE (2025). Gene Expression Cancer RNA-Seq [Dataset]. https://www.kaggle.com/datasets/waalbannyantudre/gene-expression-cancer-rna-seq-donated-on-682016
    Explore at:
    zip(73984306 bytes)Available download formats
    Dataset updated
    May 27, 2025
    Authors
    Alban NYANTUDRE
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This collection of data is part of the RNA-Seq (HiSeq) PANCAN dataset. It is a random extraction of gene expressions of patients having different types of tumor: BRCA, KIRC, COAD, LUAD, and PRAD. Each sample contains the expression of 20,531 genes for a patient diagnosed with one of the following cancers:

    CodeTumor Name
    BRCABreast invasive carcinoma (breast cancer)
    KIRCKidney renal clear cell carcinoma (kidney)
    COADColon adenocarcinoma (colon)
    LUADLung adenocarcinoma (lung)
    PRADProstate adenocarcinoma (prostate)

    Files:

    • data.csv: Gene expression matrix X (881 samples × 20,531 genes)
    • label.csv: True class label for each sample y (881 labels)

    Source: UCI ML Repository – Gene Expression Cancer RNA-Seq Data

  12. Data from: Comparing RNA-Seq and microarray gene expression data in two...

    • data.nasa.gov
    • osdr.nasa.gov
    • +1more
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). Comparing RNA-Seq and microarray gene expression data in two zones of the Arabidopsis root apex relevant to spaceflight. [Dataset]. https://data.nasa.gov/dataset/comparing-rna-seq-and-microarray-gene-expression-data-in-two-zones-of-the-arabidopsis-root
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    Premise of the study: The root apex is an important region involved in environmental sensing, but comprises a very small part of the root. Obtaining root apex transcriptomes is therefore challenging when the samples are limited. The feasibility of using tiny root sections for transcriptome analysis was examined, comparing RNA sequencing (RNA-Seq) to microarrays in characterizing genes that are relevant to spaceflight.Methods:Arabidopsis thaliana Columbia ecotype (Col-0) roots were sectioned into Zone 1 (0.5 mm; root cap and meristematic zone) and Zone 2 (1.5 mm; transition, elongation, and growth-terminating zone). Differential gene expression in each was compared.Results: Both microarrays and RNA-Seq proved applicable to the small samples. A total of 4180 genes were differentially expressed (with fold changes of 2 or greater) between Zone 1 and Zone 2. In addition, 771 unique genes and 19 novel transcriptionally active regions were identified by RNA-Seq that were not detected in microarrays. However, microarrays detected spaceflight-relevant genes that were missed in RNA-Seq. Discussion: Single root tip subsections can be used for transcriptome analysis using either RNA-Seq or microarrays. Both RNA-Seq and microarrays provided novel information. These data suggest that techniques for dealing with small, rare samples from spaceflight can be further enhanced, and that RNA-Seq may miss some spaceflight-relevant changes in gene expression.

  13. l

    Human RNA-Seq data set GSM2819712 stored in NCBI (GEO)

    • seek.lisym.org
    Updated Aug 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mario Brosch (2022). Human RNA-Seq data set GSM2819712 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/685
    Explore at:
    Dataset updated
    Aug 29, 2022
    Authors
    Mario Brosch
    License

    https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/

    Description

    Human RNA-Seq data set GSM2819712 stored in NCBI (GEO)

  14. 4

    Scripts and data for the paper: Consequences and opportunities arising due...

    • data.4tu.nl
    Updated Oct 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gerard Bouland; Marcel Reinders; Ahmed Mahfouz (2024). Scripts and data for the paper: Consequences and opportunities arising due to sparser single-cell RNA-seq datasets [Dataset]. http://doi.org/10.4121/424eea7a-cce9-4dbb-b6ef-e5b47e132410.v1
    Explore at:
    Dataset updated
    Oct 15, 2024
    Dataset provided by
    4TU.ResearchData
    Authors
    Gerard Bouland; Marcel Reinders; Ahmed Mahfouz
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Scripts and data for the paper: Consequences and opportunities arising due to sparser single-cell RNA-seq datasets


    With the number of cells measured in single-cell RNA sequencing (scRNA-seq) datasets increasing exponentially and concurrent increased sparsity due to more zero counts being measured for many genes, we demonstrate here that downstream analyses on binary-based gene expression give similar results as count-based analyses. Moreover, a binary representation scales up to ~ 50-fold more cells that can be analyzed using the same computational resources. We also highlight the possibilities provided by binarized scRNA-seq data. Development of specialized tools for bit-aware implementations of downstream analytical tasks will enable a more fine-grained resolution of biological heterogeneity.

  15. f

    RNA-sequencing data.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Jan 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Davidson, Bradley; Pickett, C. J.; Gruner, Hannah N. (2024). RNA-sequencing data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001449690
    Explore at:
    Dataset updated
    Jan 25, 2024
    Authors
    Davidson, Bradley; Pickett, C. J.; Gruner, Hannah N.
    Description

    (Tab 1) DESeq2 statistics for all genes. (Tab 2–6) Lists of up- and down-regulated genes from Fig 6. First column lists the KH gene IDs for all genes that were selected for a particular analysis shown in each figure panel. The KH gene IDs for all relevant genes that were significantly up- or down-regulated are shown in blue or red font. Boxes indicate the most highly impact subset of genes as shown in the Fig 6 tables. For these genes KH IDs were paired with KY IDs and human homologs. (Tab 7) Referenced Foxf target genes. (XLSX)

  16. f

    RNA-seq data analysis summary.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Oct 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Klemm, Paul; Becker, Stephan; Biedenkopf, Nadine; Lechner, Marcus; Weber, Friedemann; Schlereth, Julia; Hartmann, Roland K.; Schoen, Andreas; Kämper, Lennart; Bach, Simone; Demper, Jana-Christin (2021). RNA-seq data analysis summary. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000808954
    Explore at:
    Dataset updated
    Oct 26, 2021
    Authors
    Klemm, Paul; Becker, Stephan; Biedenkopf, Nadine; Lechner, Marcus; Weber, Friedemann; Schlereth, Julia; Hartmann, Roland K.; Schoen, Andreas; Kämper, Lennart; Bach, Simone; Demper, Jana-Christin
    Description

    For methodological details, see S1 Text, paragraph "RNA-Seq Analysis". (XLSX)

  17. Reference-based RNA-seq data analysis (training data)

    • zenodo.org
    bin
    Updated Apr 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bérénice Batut; Pavankumar Videm; Anika Erxleben; Björn Grüning; Bérénice Batut; Pavankumar Videm; Anika Erxleben; Björn Grüning (2023). Reference-based RNA-seq data analysis (training data) [Dataset]. http://doi.org/10.5281/zenodo.1185122
    Explore at:
    binAvailable download formats
    Dataset updated
    Apr 26, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bérénice Batut; Pavankumar Videm; Anika Erxleben; Björn Grüning; Bérénice Batut; Pavankumar Videm; Anika Erxleben; Björn Grüning
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data provided here are part of a Galaxy Training Network tutorial that analyzes RNA-Seq data from a study published by Brooks et al. 2011 to identify genes and exons that are regulated by Pasilla gene.

  18. m

    Summary of RNA-seq data with statistics.

    • data.mendeley.com
    Updated Mar 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anton Maximov (2024). Summary of RNA-seq data with statistics. [Dataset]. http://doi.org/10.17632/5p3whbjbry.1
    Explore at:
    Dataset updated
    Mar 13, 2024
    Authors
    Anton Maximov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The spreadsheets include pairwise comparisons of gene expression levels under specified conditions. Abbreviations used are as follows: PV for Parvalbumin; Sst for Somatostatin; CFC for Contextual Fear Conditioning; PTZ for Pentylenetetrazol; WT for wild type; cKO for conditional Nr4a1 knockouts; and HC for home cage.

  19. E

    Vcf RNA-Seq data

    • ega-archive.org
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vcf RNA-Seq data [Dataset]. https://ega-archive.org/datasets/EGAD00001001941
    Explore at:
    License

    https://ega-archive.org/dacs/EGAC00001000145https://ega-archive.org/dacs/EGAC00001000145

    Description

    Variants derived from mapped whole transcriptome RNA-Seq data from 476 human samples of early stage urothelial carcinoma.

  20. Z

    Data Repository: Single-cell mapper (scMappR): using scRNA-seq to infer...

    • data.niaid.nih.gov
    Updated Feb 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dustin Sokolowski; Mariela Faykoo-Martinez; Lauren Erdman; Huayun Hou; Cadia Chan; Helen Zhu; Melissa M. Holmes; Anna Goldenberg; Michael D Wilson (2021). Data Repository: Single-cell mapper (scMappR): using scRNA-seq to infer cell-type specificities of differentially expressed genes [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4278129
    Explore at:
    Dataset updated
    Feb 12, 2021
    Dataset provided by
    Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, M5G 0A4, Canada; Department of Cell and Systems Biology, University of Toronto, Toronto
    Department of Cell and Systems Biology, University of Toronto, Toronto, ON, M5S 3G5, Canada; Department of Psychology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6
    Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 1L7, Canada; Princess Margaret Cancer Center, University Health Network, Toronto, ON, M5G 2C1, Canada
    Department of Molecular Genetics, 2Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, M5G 0A4, CanadaUniversity of Toronto, Toronto, ON, M5S 1A8, Canada,
    Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, M5G 0A4, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
    Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, M5G 0A4, Canada; Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada; Vector Institute for Artificial Intelligence, MaRS Centre, Toronto, ON, M5G 1M1; CIFAR, MaRS Centre, Toronto, ON, M5G 1M1
    Authors
    Dustin Sokolowski; Mariela Faykoo-Martinez; Lauren Erdman; Huayun Hou; Cadia Chan; Helen Zhu; Melissa M. Holmes; Anna Goldenberg; Michael D Wilson
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data repository for the scMappR manuscript:

    Abstract from biorXiv (https://www.biorxiv.org/content/10.1101/2020.08.24.265298v1.full).

    RNA sequencing (RNA-seq) is widely used to identify differentially expressed genes (DEGs) and reveal biological mechanisms underlying complex biological processes. RNA-seq is often performed on heterogeneous samples and the resulting DEGs do not necessarily indicate the cell types where the differential expression occurred. While single-cell RNA-seq (scRNA-seq) methods solve this problem, technical and cost constraints currently limit its widespread use. Here we present single cell Mapper (scMappR), a method that assigns cell-type specificity scores to DEGs obtained from bulk RNA-seq by integrating cell-type expression data generated by scRNA-seq and existing deconvolution methods. After benchmarking scMappR using RNA-seq data obtained from sorted blood cells, we asked if scMappR could reveal known cell-type specific changes that occur during kidney regeneration. We found that scMappR appropriately assigned DEGs to cell-types involved in kidney regeneration, including a relatively small proportion of immune cells. While scMappR can work with any user supplied scRNA-seq data, we curated scRNA-seq expression matrices for ∼100 human and mouse tissues to facilitate its use with bulk RNA-seq data alone. Overall, scMappR is a user-friendly R package that complements traditional differential expression analysis available at CRAN.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jo Lynne Rokita (2019). RNA-Seq data files [Dataset]. http://doi.org/10.6084/m9.figshare.7751825.v5
Organization logoOrganization logo

RNA-Seq data files

Explore at:
application/gzipAvailable download formats
Dataset updated
Mar 24, 2019
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Jo Lynne Rokita
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

RNA expression matrices - FPKM from STAR, cufflinks and TPM from STAR, RSEM, Toil. Fusion results from deFuse, SOAPfuse, fusioncatcher, STAR-Fusion.

Search
Clear search
Close search
Google apps
Main menu