100+ datasets found
  1. Z

    Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset

    • data.niaid.nih.gov
    Updated Nov 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hsu, Jonathan; Stoop, Allart (2023). Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10011621
    Explore at:
    Dataset updated
    Nov 20, 2023
    Authors
    Hsu, Jonathan; Stoop, Allart
    Description

    Table of Contents

    Main Description File Descriptions Linked Files Installation and Instructions

    1. Main Description

    This is the Zenodo repository for the manuscript titled "A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity.". The code included in the file titled marengo_code_for_paper_jan_2023.R was used to generate the figures from the single-cell RNA sequencing data. The following libraries are required for script execution:

    Seurat scReportoire ggplot2 stringr dplyr ggridges ggrepel ComplexHeatmap

    File Descriptions

    The code can be downloaded and opened in RStudios. The "marengo_code_for_paper_jan_2023.R" contains all the code needed to reproduce the figues in the paper The "Marengo_newID_March242023.rds" file is available at the following address: https://zenodo.org/badge/DOI/10.5281/zenodo.7566113.svg (Zenodo DOI: 10.5281/zenodo.7566113). The "all_res_deg_for_heat_updated_march2023.txt" file contains the unfiltered results from DGE anlaysis, also used to create the heatmap with DGE and volcano plots. The "genes_for_heatmap_fig5F.xlsx" contains the genes included in the heatmap in figure 5F.

    Linked Files

    This repository contains code for the analysis of single cell RNA-seq dataset. The dataset contains raw FASTQ files, as well as, the aligned files that were deposited in GEO. The "Rdata" or "Rds" file was deposited in Zenodo. Provided below are descriptions of the linked datasets:

    Gene Expression Omnibus (GEO) ID: GSE223311(https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE223311)

    Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment. Description: This submission contains the "matrix.mtx", "barcodes.tsv", and "genes.tsv" files for each replicate and condition, corresponding to the aligned files for single cell sequencing data. Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

    Sequence read archive (SRA) repository ID: SRX19088718 and SRX19088719

    Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment. Description: This submission contains the raw sequencing or .fastq.gz files, which are tab delimited text files. Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

    Zenodo DOI: 10.5281/zenodo.7566113(https://zenodo.org/record/7566113#.ZCcmvC2cbrJ)

    Title: A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity. Description: This submission contains the "Rdata" or ".Rds" file, which is an R object file. This is a necessary file to use the code. Submission type: Restricted Acess. In order to gain access to the repository, you must contact the author.

    Installation and Instructions

    The code included in this submission requires several essential packages, as listed above. Please follow these instructions for installation:

    Ensure you have R version 4.1.2 or higher for compatibility.

    Although it is not essential, you can use R-Studios (Version 2022.12.0+353 (2022.12.0+353)) for accessing and executing the code.

    1. Download the *"Rdata" or ".Rds" file from Zenodo (https://zenodo.org/record/7566113#.ZCcmvC2cbrJ) (Zenodo DOI: 10.5281/zenodo.7566113).
    2. Open R-Studios (https://www.rstudio.com/tags/rstudio-ide/) or a similar integrated development environment (IDE) for R.
    3. Set your working directory to where the following files are located:

    marengo_code_for_paper_jan_2023.R Install_Packages.R Marengo_newID_March242023.rds genes_for_heatmap_fig5F.xlsx all_res_deg_for_heat_updated_march2023.txt

    You can use the following code to set the working directory in R:

    setwd(directory)

    1. Open the file titled "Install_Packages.R" and execute it in R IDE. This script will attempt to install all the necessary pacakges, and its dependencies in order to set up an environment where the code in "marengo_code_for_paper_jan_2023.R" can be executed.
    2. Once the "Install_Packages.R" script has been successfully executed, re-start R-Studios or your IDE of choice.
    3. Open the file "marengo_code_for_paper_jan_2023.R" file in R-studios or your IDE of choice.
    4. Execute commands in the file titled "marengo_code_for_paper_jan_2023.R" in R-Studios or your IDE of choice to generate the plots.
  2. Single cell RNASeq Visualization GSE115189

    • kaggle.com
    zip
    Updated Dec 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Nagendra (2025). Single cell RNASeq Visualization GSE115189 [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/single-cell-rnaseq-visualization-gse115189
    Explore at:
    zip(25859992 bytes)Available download formats
    Dataset updated
    Dec 4, 2025
    Authors
    Dr. Nagendra
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains single-cell RNA sequencing (scRNA-seq) data for human cells.

    The data is derived from the publicly available GSE115189 dataset.

    It provides preprocessed gene expression matrices suitable for downstream analysis.

    The dataset is ideal for exploring cellular heterogeneity and gene expression patterns.

    Includes quality control (QC) metrics and visualizations to assess data reliability.

    Data can be used for dimensionality reduction, clustering, and visualization analyses.

    Researchers can study differential gene expression across distinct cell populations.

    The dataset supports the development and benchmarking of single-cell analysis tools.

    Contains visual outputs such as violin plots to illustrate QC and gene expression.

    Suitable for bioinformaticians, computational biologists, and single-cell researchers.

    Enables exploration of transcriptional programs in various cell types.

    Offers opportunities for educational purposes, tutorials, and hands-on analysis.

    Provides insights into single-cell RNA-seq data preprocessing and quality assessment.

  3. o

    Repository for the single cell RNA sequencing data analysis for the human...

    • explore.openaire.eu
    Updated Aug 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan; Andrew; Pierre; Allart; Adrian (2023). Repository for the single cell RNA sequencing data analysis for the human manuscript. [Dataset]. http://doi.org/10.5281/zenodo.8286134
    Explore at:
    Dataset updated
    Aug 26, 2023
    Authors
    Jonathan; Andrew; Pierre; Allart; Adrian
    Description

    This is the GitHub repository for the single cell RNA sequencing data analysis for the human manuscript. The following essential libraries are required for script execution: Seurat scReportoire ggplot2 dplyr ggridges ggrepel ComplexHeatmap Linked File: -------------------------------------- This repository contains code for the analysis of single cell RNA-seq dataset. The dataset contains raw FASTQ files, as well as, the aligned files that were deposited in GEO. Provided below are descriptions of the linked datasets: 1. Gene Expression Omnibus (GEO) ID: GSE229626 - Title: Gene expression profile at single cell level of human T cells stimulated via antibodies against the T Cell Receptor (TCR) - Description: This submission contains the matrix.mtx, barcodes.tsv, and genes.tsv files for each replicate and condition, corresponding to the aligned files for single cell sequencing data. - Submission type: Private. In order to gain access to the repository, you must use a "reviewer token"(https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html). 2. Sequence read archive (SRA) repository - Title: Gene expression profile at single cell level of human T cells stimulated via antibodies against the T Cell Receptor (TCR) - Description: This submission contains the "raw sequencing" or .fastq.gz files, which are tab delimited text files. - Submission type: Private. In order to gain access to the repository, you must use a "reviewer token" (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html). Please note that since the GSE submission is private, the raw data deposited at SRA may not be accessible until the embargo on GSE229626 has been lifted. Installation and Instructions -------------------------------------- The code included in this submission requires several essential packages, as listed above. Please follow these instructions for installation: > Ensure you have R version 4.1.2 or higher for compatibility. > Although it is not essential, you can use R-Studios (Version 2022.12.0+353 (2022.12.0+353)) for accessing and executing the code. The following code can be used to set working directory in R: > setwd(directory) Steps: 1. Download the "Human_code_April2023.R" and "Install_Packages.R" R scripts, and the processed data from GSE229626. 2. Open "R-Studios"(https://www.rstudio.com/tags/rstudio-ide/) or a similar integrated development environment (IDE) for R. 3. Set your working directory to where the following files are located: - Human_code_April2023.R - Install_Packages.R 4. Open the file titled Install_Packages.R and execute it in R IDE. This script will attempt to install all the necessary pacakges, and its dependencies. 5. Open the Human_code_April2023.R R script and execute commands as necessary.

  4. l

    cellCounts

    • opal.latrobe.edu.au
    • researchdata.edu.au
    bin
    Updated Dec 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yang Liao; Dinesh Raghu; Bhupinder Pal; Lisa Mielke; Wei Shi (2022). cellCounts [Dataset]. http://doi.org/10.26181/21588276.v3
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 19, 2022
    Dataset provided by
    La Trobe
    Authors
    Yang Liao; Dinesh Raghu; Bhupinder Pal; Lisa Mielke; Wei Shi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This page includes the data and code necessary to reproduce the results of the following paper: Yang Liao, Dinesh Raghu, Bhupinder Pal, Lisa Mielke and Wei Shi. cellCounts: fast and accurate quantification of 10x Chromium single-cell RNA sequencing data. Under review. A Linux computer running an operating system of CentOS 7 (or later) or Ubuntu 20.04 (or later) is recommended for running this analysis. The computer should have >2 TB of disk space and >64 GB of RAM. The following software packages need to be installed before running the analysis. Software executables generated after installation should be included in the $PATH environment variable.

    R (v4.0.0 or newer) https://www.r-project.org/ Rsubread (v2.12.2 or newer) http://bioconductor.org/packages/3.16/bioc/html/Rsubread.html CellRanger (v6.0.1) https://support.10xgenomics.com/single-cell-gene-expression/software/overview/welcome STARsolo (v2.7.10a) https://github.com/alexdobin/STAR sra-tools (v2.10.0 or newer) https://github.com/ncbi/sra-tools Seurat (v3.0.0 or newer) https://satijalab.org/seurat/ edgeR (v3.30.0 or newer) https://bioconductor.org/packages/edgeR/ limma (v3.44.0 or newer) https://bioconductor.org/packages/limma/ mltools (v0.3.5 or newer) https://cran.r-project.org/web/packages/mltools/index.html

    Reference packages generated by 10x Genomics are also required for this analysis and they can be downloaded from the following link (2020-A version for individual human and mouse reference packages should be selected): https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest After all these are done, you can simply run the shell script ‘test-all-new.bash’ to perform all the analyses carried out in the paper. This script will automatically download the mixture scRNA-seq data from the SRA database, and it will output a text file called ‘test-all.log’ that contains all the screen outputs and speed/accuracy results of CellRanger, STARsolo and cellCounts.

  5. scRNA-seq Human Pluripotent Stem Cells Messmer2019

    • kaggle.com
    zip
    Updated May 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Chervov (2022). scRNA-seq Human Pluripotent Stem Cells Messmer2019 [Dataset]. https://www.kaggle.com/datasets/alexandervc/scrnaseq-human-pluripotent-stem-cells-messmer2019
    Explore at:
    zip(57267380 bytes)Available download formats
    Dataset updated
    May 1, 2022
    Authors
    Alexander Chervov
    Description

    Remark: for cell cycle analysis - see paper https://arxiv.org/abs/2208.05229 "Computational challenges of cell cycle analysis using single cell transcriptomics" Alexander Chervov, Andrei Zinovyev

    Data and Context

    Data - results of single cell RNA sequencing, i.e. rows - correspond to cells, columns to genes (or vice versa). value of the matrix shows how strong is "expression" of the corresponding gene in the corresponding cell. https://en.wikipedia.org/wiki/Single-cell_transcriptomics

    Particular data: https://pubmed.ncbi.nlm.nih.gov/30673604/ Cell Rep. 2019 Jan 22;26(4):815-824.e4. doi: 10.1016/j.celrep.2018.12.099. Transcriptional Heterogeneity in Naive and Primed Human Pluripotent Stem Cells at Single-Cell Resolution Tobias Messmer 1, Ferdinand von Meyenn 2, Aurora Savino 3, Fátima Santos 3, Hisham Mohammed 3, Aaron Tin Long Lun 4, John C Marioni 5, Wolf Reik 6

    Data in two variants: 1) scRNA-seq count matrix, downloaded from database of R-package "scRNAseq", see script: https://www.kaggle.com/alexandervc/rpackage-scrnaseq-downloads-datasets 2) Directly uploaded from E-MTAB-6819 https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-6819/

    Related datasets:

    Other single cell RNA seq datasets can be found on kaggle: Look here: https://www.kaggle.com/alexandervc/datasets Or search kaggle for "scRNA-seq"

    Inspiration

    Single cell RNA sequencing is important technology in modern biology, see e.g. "Eleven grand challenges in single-cell data science" https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-1926-6

    Also see review : Nature. P. Kharchenko: "The triumphs and limitations of computational methods for scRNA-seq" https://www.nature.com/articles/s41592-021-01171-x

    Search scholar.google "challenges in single cell rna sequencing" https://scholar.google.fr/scholar?q=challenges+in+single+cell+rna+sequencing&hl=en&as_sdt=0&as_vis=1&oi=scholart gives many interesting and highly cited articles

    (Cited 968) Computational and analytical challenges in single-cell transcriptomics Oliver Stegle, Sarah A. Teichmann, John C. Marioni Nat. Rev. Genet., 16 (3) (2015), pp. 133-145 https://www.nature.com/articles/nrg3833

    Challenges in unsupervised clustering of single-cell RNA-seq data https://www.nature.com/articles/s41576-018-0088-9 Review Article 07 January 2019 Vladimir Yu Kiselev, Tallulah S. Andrews & Martin Hemberg Nature Reviews Genetics volume 20, pages273–282 (2019)

    Challenges and emerging directions in single-cell analysis https://link.springer.com/article/10.1186/s13059-017-1218-y Published: 08 May 2017 Guo-Cheng Yuan, Long Cai, Michael Elowitz, Tariq Enver, Guoping Fan, Guoji Guo, Rafael Irizarry, Peter Kharchenko, Junhyong Kim, Stuart Orkin, John Quackenbush, Assieh Saadatpour, Timm Schroeder, Ramesh Shivdasani & Itay Tirosh Genome Biology volume 18, Article number: 84 (2017)

    Single-Cell RNA Sequencing in Cancer: Lessons Learned and Emerging Challenges https://www.sciencedirect.com/science/article/pii/S1097276519303569 Molecular Cell Volume 75, Issue 1, 11 July 2019, Pages 7-12 Journal home page for Molecular Cell

  6. Data, R code and output Seurat Objects for single cell RNA-seq analysis of...

    • figshare.com
    application/gzip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yunshun Chen; Gordon Smyth (2023). Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues [Dataset]. http://doi.org/10.6084/m9.figshare.17058077.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Yunshun Chen; Gordon Smyth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains all the Seurat objects that were used for generating all the figures in Pal et al. 2021 (https://doi.org/10.15252/embj.2020107333). All the Seurat objects were created under R v3.6.1 using the Seurat package v3.1.1. The detailed information of each object is listed in a table in Chen et al. 2021.

  7. Source data for Manuscript (R result)

    • figshare.com
    application/gzip
    Updated Feb 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hongzhou Lin (2023). Source data for Manuscript (R result) [Dataset]. http://doi.org/10.6084/m9.figshare.22139771.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Feb 22, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Hongzhou Lin
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    singlecell seq result.rds : The Seurat Object which contains the single cell seq result in R. cellchat result.csv : The Ligand and receptor pairs by cellchat

    molecular dock.pdb : The molecular dock result of BMP7 and adriamycin

    Quantitative results for WB and qPCR.pzfx :

    Quantitative results for WB and qPCR in Prism

    Original Images for Westernblot.pdf:

    Original Images for Westernblot (PDF Version)

  8. scRNA-seq Kolodziejczyk et al. (2015)

    • kaggle.com
    zip
    Updated Apr 30, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Chervov (2022). scRNA-seq Kolodziejczyk et al. (2015) [Dataset]. https://www.kaggle.com/datasets/alexandervc/scrnaseq-kolodziejczyk-et-al-2015
    Explore at:
    zip(13439744 bytes)Available download formats
    Dataset updated
    Apr 30, 2022
    Authors
    Alexander Chervov
    Description

    Remark: for cell cycle analysis - see paper https://arxiv.org/abs/2208.05229 "Computational challenges of cell cycle analysis using single cell transcriptomics" Alexander Chervov, Andrei Zinovyev

    Data and Context

    Data - results of single cell RNA sequencing, i.e. rows - correspond to cells, columns to genes (or vice versa). value of the matrix shows how strong is "expression" of the corresponding gene in the corresponding cell. https://en.wikipedia.org/wiki/Single-cell_transcriptomics

    Particular data: Data from the paper: Kolodziejczyk, A. A., J. K. Kim, J. C. Tsang, T. Ilicic, J. Henriksson, K. N. Natarajan, A. C. Tuck, et al. 2015. “Single cell RNA-Sequencing of pluripotent states unlocks modular transcriptional variation.” Cell Stem Cell 17 (4): 471–85. https://pubmed.ncbi.nlm.nih.gov/26431182/

    scRNA-seq count matrix, downloaded from database of R-package "scRNAseq", see script: https://www.kaggle.com/alexandervc/rpackage-scrnaseq-downloads-datasets

    Related datasets:

    Other single cell RNA seq datasets can be found on kaggle: Look here: https://www.kaggle.com/alexandervc/datasets Or search kaggle for "scRNA-seq"

    Inspiration

    Single cell RNA sequencing is important technology in modern biology, see e.g. "Eleven grand challenges in single-cell data science" https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-1926-6

    Also see review : Nature. P. Kharchenko: "The triumphs and limitations of computational methods for scRNA-seq" https://www.nature.com/articles/s41592-021-01171-x

    Search scholar.google "challenges in single cell rna sequencing" https://scholar.google.fr/scholar?q=challenges+in+single+cell+rna+sequencing&hl=en&as_sdt=0&as_vis=1&oi=scholart gives many interesting and highly cited articles

    (Cited 968) Computational and analytical challenges in single-cell transcriptomics Oliver Stegle, Sarah A. Teichmann, John C. Marioni Nat. Rev. Genet., 16 (3) (2015), pp. 133-145 https://www.nature.com/articles/nrg3833

    Challenges in unsupervised clustering of single-cell RNA-seq data https://www.nature.com/articles/s41576-018-0088-9 Review Article Published: 07 January 2019 Vladimir Yu Kiselev, Tallulah S. Andrews & Martin Hemberg Nature Reviews Genetics volume 20, pages273–282 (2019)

    Challenges and emerging directions in single-cell analysis https://link.springer.com/article/10.1186/s13059-017-1218-y Published: 08 May 2017 Guo-Cheng Yuan, Long Cai, Michael Elowitz, Tariq Enver, Guoping Fan, Guoji Guo, Rafael Irizarry, Peter Kharchenko, Junhyong Kim, Stuart Orkin, John Quackenbush, Assieh Saadatpour, Timm Schroeder, Ramesh Shivdasani & Itay Tirosh Genome Biology volume 18, Article number: 84 (2017)

    Single-Cell RNA Sequencing in Cancer: Lessons Learned and Emerging Challenges https://www.sciencedirect.com/science/article/pii/S1097276519303569 Molecular Cell Volume 75, Issue 1, 11 July 2019, Pages 7-12 Journal home page for Molecular Cell

  9. o

    Introduction to single cell RNAseq analysis: supplementary material

    • explore.openaire.eu
    Updated Apr 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jose Alejandro Romero Herrera; Samuele Soraggi (2023). Introduction to single cell RNAseq analysis: supplementary material [Dataset]. http://doi.org/10.5281/zenodo.7920686
    Explore at:
    Dataset updated
    Apr 14, 2023
    Authors
    Jose Alejandro Romero Herrera; Samuele Soraggi
    Description

    This archive contains supplementary material used in the workshop "Introduction to single cell RNAseq analysis" taught by the Danish National Sandbox for Health Data Science. The course repo can be found on Github. Data.zip contains 6 10x runs on Spermatogonia development. 3 from healthy individuals and 3 from azoospermic individuals. Data has been already preprocessed using cellranger and can be loaded using Seurat (R) or scanpy (python). Slides.zip contains slides explaning theory regarding single cell RNAseq data analysis Notebooks.zip contains Rmarkdown files to follow the course in using R in Rstudio. Updated version of the notebooks.

  10. Single Cell RNA Seq Analysis QC Clustering PBMC 3k

    • kaggle.com
    zip
    Updated Dec 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Nagendra (2025). Single Cell RNA Seq Analysis QC Clustering PBMC 3k [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/single-cell-rna-seq-analysis-qc-clustering-pbmc-3k
    Explore at:
    zip(29203448 bytes)Available download formats
    Dataset updated
    Dec 4, 2025
    Authors
    Dr. Nagendra
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset contains processed single-cell RNA-sequencing (scRNA-seq) data from the PBMC 3K experiment.

    It includes quality-control (QC) visualizations, cell-level metrics, clustering outputs, and exploratory analysis plots.

    The dataset is designed to guide beginners and intermediate users through the essential steps of scRNA-seq preprocessing and analysis.

    The PBMC 3K dataset represents human peripheral blood mononuclear cells sequenced using the 10x Genomics platform.

    Included QC metrics help identify low-quality cells, doublets, stressed cells, and outliers based on standard thresholds.

    The dataset covers filtering based on mitochondrial gene percentage, total UMIs, and number of detected genes.

    All plots follow widely accepted scRNA-seq workflows commonly used in tools like Seurat, Scanpy, and SingleCellExperiment.

    The QC violin plots illustrate distributions of nFeature_RNA, nCount_RNA, percent.mt, and other metrics used to assess cell quality.

    The data also highlights the effect of filtering on overall dataset structure and variability.

    Clustering-related files provide a visual understanding of how cells segregate into biologically meaningful groups.

    Dimensionality-reduction plots also show patterns such as immune-cell diversity present in PBMC populations.

    This dataset is suitable for hands-on learning, tutorial creation, classroom instruction, or benchmarking workflows.

    It serves as a ready reference for researchers who wish to practice QC interpretation and cluster inspection.

    The dataset allows quick reproduction of PBMC 3K quality-control visualizations without running the entire analysis pipeline.

    It provides an accessible introduction to scRNA-seq analysis concepts for students, data scientists, and bioinformaticians.

  11. MOESM11 of Benchmarking principal component analysis for large-scale...

    • springernature.figshare.com
    application/x-gzip
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido (2023). MOESM11 of Benchmarking principal component analysis for large-scale single-cell RNA-sequencing [Dataset]. http://doi.org/10.6084/m9.figshare.11662101.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 11 Pair plots of all the pCA (Brain) implementations.

  12. u

    Data from: Reference transcriptomics of porcine peripheral immune cells...

    • agdatacommons.nal.usda.gov
    • datasets.ai
    • +1more
    zip
    Updated Nov 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juber Herrera-Uribe; Jayne Wiarda; Sathesh K. Sivasankaran; Lance Daharsh; Haibo Liu; Kristen A. Byrne; Timothy P. L. Smith; Joan K. Lunney; Crystal L. Loving; Christopher K. Tuggle (2025). Data from: Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing [Dataset]. http://doi.org/10.15482/USDA.ADC/1522411
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 21, 2025
    Dataset provided by
    Ag Data Commons
    Authors
    Juber Herrera-Uribe; Jayne Wiarda; Sathesh K. Sivasankaran; Lance Daharsh; Haibo Liu; Kristen A. Byrne; Timothy P. L. Smith; Joan K. Lunney; Crystal L. Loving; Christopher K. Tuggle
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    This dataset contains files reconstructing single-cell data presented in 'Reference transcriptomics of porcine peripheral immune cells created through bulk and single-cell RNA sequencing' by Herrera-Uribe & Wiarda et al. 2021. Samples of peripheral blood mononuclear cells (PBMCs) were collected from seven pigs and processed for single-cell RNA sequencing (scRNA-seq) in order to provide a reference annotation of porcine immune cell transcriptomics at enhanced, single-cell resolution. Analysis of single-cell data allowed identification of 36 cell clusters that were further classified into 13 cell types, including monocytes, dendritic cells, B cells, antibody-secreting cells, numerous populations of T cells, NK cells, and erythrocytes. Files may be used to reconstruct the data as presented in the manuscript, allowing for individual query by other users. Scripts for original data analysis are available at https://github.com/USDA-FSEPRU/PorcinePBMCs_bulkRNAseq_scRNAseq. Raw data are available at https://www.ebi.ac.uk/ena/browser/view/PRJEB43826. Funding for this dataset was also provided by NRSP8: National Animal Genome Research Program (https://www.nimss.org/projects/view/mrp/outline/18464). Resources in this dataset:Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells 10X Format. File Name: PBMC7_AllCells.zipResource Description: Zipped folder containing PBMC counts matrix, gene names, and cell IDs. Files are as follows:

    matrix of gene counts* (matrix.mtx.gx) gene names (features.tsv.gz) cell IDs (barcodes.tsv.gz)

    *The ‘raw’ count matrix is actually gene counts obtained following ambient RNA removal. During ambient RNA removal, we specified to calculate non-integer count estimations, so most gene counts are actually non-integer values in this matrix but should still be treated as raw/unnormalized data that requires further normalization/transformation. Data can be read into R using the function Read10X().Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells Metadata. File Name: PBMC7_AllCells_meta.csvResource Description: .csv file containing metadata for cells included in the final dataset. Metadata columns include:

    nCount_RNA = the number of transcripts detected in a cell nFeature_RNA = the number of genes detected in a cell Loupe = cell barcodes; correspond to the cell IDs found in the .h5Seurat and 10X formatted objects for all cells prcntMito = percent mitochondrial reads in a cell Scrublet = doublet probability score assigned to a cell seurat_clusters = cluster ID assigned to a cell PaperIDs = sample ID for a cell celltypes = cell type ID assigned to a cellResource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells PCA Coordinates. File Name: PBMC7_AllCells_PCAcoord.csvResource Description: .csv file containing first 100 PCA coordinates for cells. Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells t-SNE Coordinates. File Name: PBMC7_AllCells_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for all cells.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells UMAP Coordinates. File Name: PBMC7_AllCells_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for all cells.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells t-SNE Coordinates. File Name: PBMC7_CD4only_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - CD4 T Cells UMAP Coordinates. File Name: PBMC7_CD4only_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for only CD4 T cells (clusters 0, 3, 4, 28). A dataset of only CD4 T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells UMAP Coordinates. File Name: PBMC7_GDonly_UMAPcoord.csvResource Description: .csv file containing UMAP coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and UMAP coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gamma Delta T Cells t-SNE Coordinates. File Name: PBMC7_GDonly_tSNEcoord.csvResource Description: .csv file containing t-SNE coordinates for only gamma delta T cells (clusters 6, 21, 24, 31). A dataset of only gamma delta T cells can be re-created from the PBMC7_AllCells.h5Seurat, and t-SNE coordinates used in publication can be re-assigned using this .csv file.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - Gene Annotation Information. File Name: UnfilteredGeneInfo.txtResource Description: .txt file containing gene nomenclature information used to assign gene names in the dataset. 'Name' column corresponds to the name assigned to a feature in the dataset.Resource Title: Herrera-Uribe & Wiarda et al. PBMCs - All Cells H5Seurat. File Name: PBMC7.tarResource Description: .h5Seurat object of all cells in PBMC dataset. File needs to be untarred, then read into R using function LoadH5Seurat().

  13. f

    Table2_Single-cell RNA sequencing analysis identifies acute changes in the...

    • datasetcatalog.nlm.nih.gov
    Updated Nov 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steinmetz, Alexis R.; Parker, Nigel R.; Manyam, Ganiraju; Yla-Herttuala, Seppo; Chen, Yan; Tholomier, Come; McConkey, David J.; Czerniak, Bogdan A.; Pierce, Morgan; Dinney, Colin P.; Martini, Alberto; Sood, Akshay; Mokkapati, Sharada; Jagannath, Chinnaswamy; Lee, Byron H.; Johnson, Burles A.; Duplisea, Jonathan J. (2024). Table2_Single-cell RNA sequencing analysis identifies acute changes in the tumor microenvironment induced by interferon α gene therapy in a murine bladder cancer model.xls [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001407839
    Explore at:
    Dataset updated
    Nov 4, 2024
    Authors
    Steinmetz, Alexis R.; Parker, Nigel R.; Manyam, Ganiraju; Yla-Herttuala, Seppo; Chen, Yan; Tholomier, Come; McConkey, David J.; Czerniak, Bogdan A.; Pierce, Morgan; Dinney, Colin P.; Martini, Alberto; Sood, Akshay; Mokkapati, Sharada; Jagannath, Chinnaswamy; Lee, Byron H.; Johnson, Burles A.; Duplisea, Jonathan J.
    Description

    IntroductionNadofaragene firadenovec (Ad-IFNα/Syn3) is now approved for BCG-unresponsive bladder cancer (BLCA). IFNα is a pleiotropic cytokine that causes direct tumor cell killing via TRAIL-mediated apoptosis, angiogenesis inhibition, and activation of the innate and adaptive immune system. We established an immunocompetent murine BLCA model to study the effects of murine adenoviral IFNα (muAd-Ifnα) gene therapy on cancer cells and the tumor microenvironment using a novel murine equivalent of Nadofaragene firadenovec (muAd-Ifnα).MethodsTumors were induced by instilling MB49 cells into the bladders of mice; luciferase imaging confirmed tumor development. Mice were treated with adenovirus control (Ad-Ctrl; empty vector), or muAd-Ifnα (3x1011 VP/mL), and survival analysis was performed. For single-cell sequencing (scRNAseq) analysis (72h), bladders were harvested and treated with collagenase/hyaluronidase and TrypLE for cell dissociation. Single cells were suspended in PBS/1% FBS buffer; viability was assessed with Vicell cell counter. scRNAseq analysis was performed using 10X genomics 3’ sequencing. Raw RNAseq data were pre-processed using Cell Ranger single-cell software. Seurat (R package) was used to normalize and cluster the scRNA data. Pooled differential gene expression analysis in specific cell clusters was performed with DESeq2.ResultsWe identified 16 cell clusters based on marker expression which were grouped into epithelial (tumor), uroplakin-enriched, endothelial, T-cells, neutrophils, and macrophage clusters. Top differentially expressed genes between muAd-Ifnα and Ad-Ctrl were identified. Within the specific cell clusters, IPA analysis revealed significant differences between muAd-Ifnα and control. IFNα signaling and hypercytokinemia/chemokinemia were upregulated in all clusters. Cell death pathways were upregulated in tumor and endothelial clusters. T-cells demonstrated upregulation of the immunogenic cell death signaling pathway and a decrease in the Th2 pathway genes. Macrophages showed upregulation of PD1/PD-L1 pathways along with downregulation of macrophage activation pathways (alternate and classical). Multiplex immunofluorescence confirmed increased infiltration with macrophages in muAd-Ifnα treated tumors compared to controls. PD1/PD-L1 expression was reduced at 72h.DiscussionThis single-cell analysis builds upon our understanding of the impact of Ad-IFNα on tumor cells and other compartments of the microenvironment. These data will help identify mechanisms to improve patient selection and therapeutic efficacy of Nadofaragene firadenovec.

  14. Z

    Datasets associated with the publication of the "satuRn" R package

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jul 13, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gilis, Jeroen; Vitting-Seerup, Kristoffer; Van den Berge, Koen; Clement, Lieven (2022). Datasets associated with the publication of the "satuRn" R package [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4438473
    Explore at:
    Dataset updated
    Jul 13, 2022
    Dataset provided by
    Ghent University
    University of Copenhagen
    Authors
    Gilis, Jeroen; Vitting-Seerup, Kristoffer; Van den Berge, Koen; Clement, Lieven
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    On this Zenodo link, we share the data that is required to reproduce all the analyses from our publication "satuRn: Scalable Analysis of differential Transcript Usage for bulk and single-cell RNA-sequencing applications".

    This repository includes input transcript-level expression matrices and metadata for all datasets, as well as intermediate results and final outputs of the respective DTU analyses. For a more elaborate description of the data, we refer to the companion GitHub for our publications; https://github.com/statOmics/satuRnPaper. Note that this is version 1.0.3 of the data (uploaded on 2022-07-08). If any changes were to be made to the datasets in the future, this will also be communicated on our companion GitHub page.

  15. u

    Single cell T-cell Atlas (V3)

    • repository.uantwerpen.be
    Updated 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mullan, Kerry A. (2024). Single cell T-cell Atlas (V3) [Dataset]. http://doi.org/10.5281/ZENODO.12606320
    Explore at:
    Dataset updated
    2024
    Dataset provided by
    Faculty of Sciences. Mathematics and Computer Science
    University of Antwerp
    Zenodo
    Authors
    Mullan, Kerry A.
    Description

    The attached datasets comprised of the merging of 21 high quality single cell T cell based dataset that had both the TCR-seq and GEx. The object contains ~1.3 paired TCR-seq with GEx in the Seurat Object (supercluster_added_ID-240531.rds). We also included the original identifiers in the Sup_Update_labels.csv a. See our https://stegor.readthedocs.io/en/latest/ for how we processed the 12 datasets (V2) and decided on the current 47 T cell annotation models using scGate (TcellFunction). Additionally, based on collaborator recommendataion, we have also now included a simpler T cell annotion model in STEGO.R process (Tsimplefunctions). This is the accompanying data set for the paper entitled ‘T cell receptor-centric approach to streamline multimodal single-cell data analysis.’, which is currently available as a preprint (https://www.biorxiv.org/content/10.1101/2023.09.27.559702v2). Details on the origin of the datasets, and processing steps can be found there. The purpose of this atlas both the full dataset and down sampling version is to aid in improving the interpretability of other T cell based datasets. This can be done by adding in the down sampled object that contains up to 500 cells per annotation model. This dataset aims to improve the capacity to identify TCR-specific signature by ensuring a well covered background, which will improve the robustness of the FindMarker Function in Seurat package.

  16. MOESM10 of Benchmarking principal component analysis for large-scale...

    • springernature.figshare.com
    application/x-gzip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido (2023). MOESM10 of Benchmarking principal component analysis for large-scale single-cell RNA-sequencing [Dataset]. http://doi.org/10.6084/m9.figshare.11662095.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 10 Pair plots of all the pCA (BrainSpinalCord) implementations.

  17. Data from:...

    • osdr.nasa.gov
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fei Wu; David Furman; Daniel Winer; Christopher E Mason; Huixun Du; Eliah Overbey; JangKeun Kim; Priya Makhijani; Nicolas Martin; Chad A Lerner; Khiem Nguyen; Jordan Baechle; Taylor R Valentino; Matias Fuentealba; Juliet M Bartleson; Heather Halaweh; Shawn Winer; Cem Meydan; Francine Garrett-Bakelman; Nazish Sayed; Simon Melov; Masafumi Muratani; Akos A Gerencser; Herbert G Kasler; Afshin Beheshti (2025). Single-cell-analysis-identifies-conserved-features-of-immune-dysfunction-in-simulated-microgravity-and-spaceflight [Dataset]. https://osdr.nasa.gov/bio/repo/data/studies/OSD-689
    Explore at:
    Dataset updated
    Oct 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Authors
    Fei Wu; David Furman; Daniel Winer; Christopher E Mason; Huixun Du; Eliah Overbey; JangKeun Kim; Priya Makhijani; Nicolas Martin; Chad A Lerner; Khiem Nguyen; Jordan Baechle; Taylor R Valentino; Matias Fuentealba; Juliet M Bartleson; Heather Halaweh; Shawn Winer; Cem Meydan; Francine Garrett-Bakelman; Nazish Sayed; Simon Melov; Masafumi Muratani; Akos A Gerencser; Herbert G Kasler; Afshin Beheshti
    License

    Attribution 1.0 (CC BY 1.0)https://creativecommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Microgravity is associated with immunological dysfunction, though the mechanisms are poorly understood. Here, using single-cell analysis of human peripheral blood mononuclear cells (PBMCs) exposed to short term (25 hours) simulated microgravity, we characterize altered genes and pathways at basal and stimulated states with a Toll-like Receptor-7/8 agonist. We validate single-cell analysis by RNA sequencing and super-resolution microscopy, and against data from the Inspiration-4 (I4) mission, JAXA (Cell-Free Epigenome) mission, Twins study, and spleens from mice on the International Space Station. Overall, microgravity alters specific pathways for optimal immunity, including the cytoskeleton, interferon signaling, pyroptosis, temperature-shock, innate inflammation (e.g., Coronavirus pathogenesis pathway and IL-6 signaling), nuclear receptors, and sirtuin signaling. Microgravity directs monocyte inflammatory parameters, and impairs T cell and NK cell functionality. Using machine learning, we identify numerous compounds linking microgravity to immune cell transcription, and demonstrate that the flavonol, quercetin, can reverse most abnormal pathways. These results define immune cell alterations in microgravity, and provide opportunities for countermeasures to maintain normal immunity in space.

  18. MOESM9 of Benchmarking principal component analysis for large-scale...

    • springernature.figshare.com
    • figshare.com
    application/x-gzip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido (2023). MOESM9 of Benchmarking principal component analysis for large-scale single-cell RNA-sequencing [Dataset]. http://doi.org/10.6084/m9.figshare.11662173.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Koki Tsuyuzaki; Hiroyuki Sato; Kenta Sato; Itoshi Nikaido
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 9 Pair plots of all the pCA (Pancreas) implementations.

  19. Single-cell and spatial transcriptomics of stricturing Crohn's disease

    • zenodo.org
    application/gzip, bin
    Updated Sep 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lingjia Kong; Sathish Subramanian; Asa Segerstolpe; Vy Tran; Angela Shih; Grace Carter; Hiroko Kunitake; Shaina Tardus; Jasmine Li; Marco Kaper; Christy Cauley; Shivam Gandhi; Eric Chen; Caroline Porter; Toni Delorey; Liliana Bordeianou; Rocco Ricciardi; Ashwin Ananthakrishnan; Helena Lau; Richard Hodin; Jacques Deguine; Chris Smillie; Ramnik Xavier; Lingjia Kong; Sathish Subramanian; Asa Segerstolpe; Vy Tran; Angela Shih; Grace Carter; Hiroko Kunitake; Shaina Tardus; Jasmine Li; Marco Kaper; Christy Cauley; Shivam Gandhi; Eric Chen; Caroline Porter; Toni Delorey; Liliana Bordeianou; Rocco Ricciardi; Ashwin Ananthakrishnan; Helena Lau; Richard Hodin; Jacques Deguine; Chris Smillie; Ramnik Xavier (2025). Single-cell and spatial transcriptomics of stricturing Crohn's disease [Dataset]. http://doi.org/10.5281/zenodo.14509802
    Explore at:
    application/gzip, binAvailable download formats
    Dataset updated
    Sep 1, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Lingjia Kong; Sathish Subramanian; Asa Segerstolpe; Vy Tran; Angela Shih; Grace Carter; Hiroko Kunitake; Shaina Tardus; Jasmine Li; Marco Kaper; Christy Cauley; Shivam Gandhi; Eric Chen; Caroline Porter; Toni Delorey; Liliana Bordeianou; Rocco Ricciardi; Ashwin Ananthakrishnan; Helena Lau; Richard Hodin; Jacques Deguine; Chris Smillie; Ramnik Xavier; Lingjia Kong; Sathish Subramanian; Asa Segerstolpe; Vy Tran; Angela Shih; Grace Carter; Hiroko Kunitake; Shaina Tardus; Jasmine Li; Marco Kaper; Christy Cauley; Shivam Gandhi; Eric Chen; Caroline Porter; Toni Delorey; Liliana Bordeianou; Rocco Ricciardi; Ashwin Ananthakrishnan; Helena Lau; Richard Hodin; Jacques Deguine; Chris Smillie; Ramnik Xavier
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This folder contains the spatial transcriptomics data + code. This code was generated by members of the Smillie Lab @ MGH and Harvard Medical School.

    • github.tar.gz: spatial analysis code and data
    • anndata.h5ad: anndata object (scanpy)
    • V*tar.gz: raw spatial transcriptomics files

    The github.tar.gz folder contains everything you need to reproduce the spatial transcriptomics figures. It is structured as follows:

    • 1.BayesPrism: code for running BayesPrism on spatial data
    • 2.SparCC: code for running SparCC on spatial data
    • 3.Lasso: code for running lasso regression on spatial data
    • 4.Analysis: code for reproducing all figures in the paper
    • 4.Analysis/1.analysis.r: script to reproduce all figures in the paper ***
    • code: code library containing all necessary functions
    • load_data.r: code to load the single-cell and spatial datasets
    • sco.rds: single-cell analysis object (10X Chromium) formatted as an R list
    • vis.rds: spatial analysis object (10X Visium) formatted as an R list

    All scripts are numbered. You need to run everything in order. For convenience, we include the output files for 1.BayesPrism, 2.SparCC, and 3.Lasso, allowing you to skip straight to the analysis code in 4.Analysis.

    To reproduce all figures in the paper, you need to do the following:

    1. Edit your PROJECT_FOLDER in the header of load_data.r
    2. Install the packages listed at the top of load_data.r
    3. Go to the 4.Analysis directory, start an interactive R session, and type:
      > source('1.analysis.r')

    This will load the beginning of the 1.analysis.r script (until the stop() statement on line 68). You can run the code in two different ways:

    1. You can step through the code line by line in your interactive R session (starting at line 68)
    2. Alternatively, remove the stop() statement from the script, then run the code start to finish

    If you encounter any errors, try to debug them using a combination of Google+ChatGPT. If you still have trouble, please contact the Smillie Lab.

    Note: the single-cell and spatial code are also available on GitHub. However, the spatial analysis requires large files that cannot be hosted on GitHub. Therefore, it is better to download the code + files from Zenodo. The GitHub link is provided below:

    https://github.com/LJ-Kong/fibrosis_scRNA_stRNA

  20. s

    Exploring a pico-well based scRNA-seq method (HIVETM) for simplified...

    • figshare.scilifelab.se
    • researchdata.se
    application/gzip
    Updated Jan 15, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kim Fegraeus; Miia Riihimäki; Jessica Nordlund; Srinivas Akula; Sara Wernersson; Amanda Raine (2025). Exploring a pico-well based scRNA-seq method (HIVETM) for simplified processing of equine bronchoalveolar lavage cells [Dataset]. http://doi.org/10.17044/scilifelab.25471861.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jan 15, 2025
    Dataset provided by
    Uppsala University
    Authors
    Kim Fegraeus; Miia Riihimäki; Jessica Nordlund; Srinivas Akula; Sara Wernersson; Amanda Raine
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract Single-cell RNA sequencing (scRNA-seq) is a valuable tool for investigating cellular heterogeneity in diseases such as equine asthma (EA). This study evaluates the HIVE™ scRNA-seq method, a pico-well-based technology, for processing bronchoalveolar lavage (BAL) cells from horses with EA. The HIVE method offers practical advantages, including compatibility with both field and clinical settings, as well as a gentle workflow suited for handling sensitive cells.Our results show that the major cell types in equine BAL were successfully identified; however, the proportions of T cells and macrophages deviated from cytological expectations, with macrophages being overrepresented and T cells underrepresented. Despite these limitations, the HIVE method confirmed previously identified T cell and macrophage subpopulations and defined other BAL cell subsets. However, compared to previous studies, T helper subsets were less clearly defined. Additionally, consistent with previous scRNA-seq studies, the HIVE method detected fewer granulocytes and mast cells than anticipated in the total BAL samples. Nevertheless, applying the method to purified mast cells recovered an expected number of cells. A small set of eosinophils were also detected which have not been characterized in earlier studies. In summary these findings suggest that while the HIVE method shows promise for certain applications, further optimization is needed to improve the accuracy of cell type representation, particularly for granulocytes and mast cells, in BAL samples. The HIVE™ (name, not acronym) libraries were prepared with the single-cell RNA-seq Processing Kit v1 (Honeycomb Biotechnologies) and sequenced on a NovaSeq 6000 v 1.5. Raw reads were processed and converted to count matrices of gene expression values using the custom software BeeNetTM (Honeycomb Biotechnologies). The R code herein and associated Seurat-objects (.rds files) may be used to reproduce the data analysis in the manuscript.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Hsu, Jonathan; Stoop, Allart (2023). Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10011621

Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset

Explore at:
Dataset updated
Nov 20, 2023
Authors
Hsu, Jonathan; Stoop, Allart
Description

Table of Contents

Main Description File Descriptions Linked Files Installation and Instructions

1. Main Description

This is the Zenodo repository for the manuscript titled "A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity.". The code included in the file titled marengo_code_for_paper_jan_2023.R was used to generate the figures from the single-cell RNA sequencing data. The following libraries are required for script execution:

Seurat scReportoire ggplot2 stringr dplyr ggridges ggrepel ComplexHeatmap

File Descriptions

The code can be downloaded and opened in RStudios. The "marengo_code_for_paper_jan_2023.R" contains all the code needed to reproduce the figues in the paper The "Marengo_newID_March242023.rds" file is available at the following address: https://zenodo.org/badge/DOI/10.5281/zenodo.7566113.svg (Zenodo DOI: 10.5281/zenodo.7566113). The "all_res_deg_for_heat_updated_march2023.txt" file contains the unfiltered results from DGE anlaysis, also used to create the heatmap with DGE and volcano plots. The "genes_for_heatmap_fig5F.xlsx" contains the genes included in the heatmap in figure 5F.

Linked Files

This repository contains code for the analysis of single cell RNA-seq dataset. The dataset contains raw FASTQ files, as well as, the aligned files that were deposited in GEO. The "Rdata" or "Rds" file was deposited in Zenodo. Provided below are descriptions of the linked datasets:

Gene Expression Omnibus (GEO) ID: GSE223311(https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE223311)

Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment. Description: This submission contains the "matrix.mtx", "barcodes.tsv", and "genes.tsv" files for each replicate and condition, corresponding to the aligned files for single cell sequencing data. Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

Sequence read archive (SRA) repository ID: SRX19088718 and SRX19088719

Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment. Description: This submission contains the raw sequencing or .fastq.gz files, which are tab delimited text files. Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

Zenodo DOI: 10.5281/zenodo.7566113(https://zenodo.org/record/7566113#.ZCcmvC2cbrJ)

Title: A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity. Description: This submission contains the "Rdata" or ".Rds" file, which is an R object file. This is a necessary file to use the code. Submission type: Restricted Acess. In order to gain access to the repository, you must contact the author.

Installation and Instructions

The code included in this submission requires several essential packages, as listed above. Please follow these instructions for installation:

Ensure you have R version 4.1.2 or higher for compatibility.

Although it is not essential, you can use R-Studios (Version 2022.12.0+353 (2022.12.0+353)) for accessing and executing the code.

  1. Download the *"Rdata" or ".Rds" file from Zenodo (https://zenodo.org/record/7566113#.ZCcmvC2cbrJ) (Zenodo DOI: 10.5281/zenodo.7566113).
  2. Open R-Studios (https://www.rstudio.com/tags/rstudio-ide/) or a similar integrated development environment (IDE) for R.
  3. Set your working directory to where the following files are located:

marengo_code_for_paper_jan_2023.R Install_Packages.R Marengo_newID_March242023.rds genes_for_heatmap_fig5F.xlsx all_res_deg_for_heat_updated_march2023.txt

You can use the following code to set the working directory in R:

setwd(directory)

  1. Open the file titled "Install_Packages.R" and execute it in R IDE. This script will attempt to install all the necessary pacakges, and its dependencies in order to set up an environment where the code in "marengo_code_for_paper_jan_2023.R" can be executed.
  2. Once the "Install_Packages.R" script has been successfully executed, re-start R-Studios or your IDE of choice.
  3. Open the file "marengo_code_for_paper_jan_2023.R" file in R-studios or your IDE of choice.
  4. Execute commands in the file titled "marengo_code_for_paper_jan_2023.R" in R-Studios or your IDE of choice to generate the plots.
Search
Clear search
Close search
Google apps
Main menu