17 datasets found

n
Data from: Large-scale integration of single-cell transcriptomic data...
data.niaid.nih.gov
data-staging.niaid.nih.gov
+2more
zip
Updated Dec 14, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David McKellar; Iwijn De Vlaminck; Benjamin Cosgrove (2021). Large-scale integration of single-cell transcriptomic data captures transitional progenitor states in mouse skeletal muscle regeneration [Dataset]. http://doi.org/10.5061/dryad.t4b8gtj34
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.t4b8gtj34
Dataset updated
Dec 14, 2021
Dataset provided by
Cornell University
Authors
David McKellar; Iwijn De Vlaminck; Benjamin Cosgrove
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Skeletal muscle repair is driven by the coordinated self-renewal and fusion of myogenic stem and progenitor cells. Single-cell gene expression analyses of myogenesis have been hampered by the poor sampling of rare and transient cell states that are critical for muscle repair, and do not inform the spatial context that is important for myogenic differentiation. Here, we demonstrate how large-scale integration of single-cell and spatial transcriptomic data can overcome these limitations. We created a single-cell transcriptomic dataset of mouse skeletal muscle by integration, consensus annotation, and analysis of 23 newly collected scRNAseq datasets and 88 publicly available single-cell (scRNAseq) and single-nucleus (snRNAseq) RNA-sequencing datasets. The resulting dataset includes more than 365,000 cells and spans a wide range of ages, injury, and repair conditions. Together, these data enabled identification of the predominant cell types in skeletal muscle, and resolved cell subtypes, including endothelial subtypes distinguished by vessel-type of origin, fibro/adipogenic progenitors defined by functional roles, and many distinct immune populations. The representation of different experimental conditions and the depth of transcriptome coverage enabled robust profiling of sparsely expressed genes. We built a densely sampled transcriptomic model of myogenesis, from stem cell quiescence to myofiber maturation and identified rare, transitional states of progenitor commitment and fusion that are poorly represented in individual datasets. We performed spatial RNA sequencing of mouse muscle at three time points after injury and used the integrated dataset as a reference to achieve a high-resolution, local deconvolution of cell subtypes. We also used the integrated dataset to explore ligand-receptor co-expression patterns and identify dynamic cell-cell interactions in muscle injury response. We provide a public web tool to enable interactive exploration and visualization of the data. Our work supports the utility of large-scale integration of single-cell transcriptomic data as a tool for biological discovery.

Methods Mice. The Cornell University Institutional Animal Care and Use Committee (IACUC) approved all animal protocols, and experiments were performed in compliance with its institutional guidelines. Adult C57BL/6J mice (mus musculus) were obtained from Jackson Laboratories (#000664; Bar Harbor, ME) and were used at 4-7 months of age. Aged C57BL/6J mice were obtained from the National Institute of Aging (NIA) Rodent Aging Colony and were used at 20 months of age. For new scRNAseq experiments, female mice were used in each experiment.

Mouse injuries and single-cell isolation. To induce muscle injury, both tibialis anterior (TA) muscles of old (20 months) C57BL/6J mice were injected with 10 µl of notexin (10 µg/ml; Latoxan; France). At 0, 1, 2, 3.5, 5, or 7 days post-injury (dpi), mice were sacrificed and TA muscles were collected and processed independently to generate single-cell suspensions. Muscles were digested with 8 mg/ml Collagenase D (Roche; Switzerland) and 10 U/ml Dispase II (Roche; Switzerland), followed by manual dissociation to generate cell suspensions. Cell suspensions were sequentially filtered through 100 and 40 μm filters (Corning Cellgro #431752 and #431750) to remove debris. Erythrocytes were removed through incubation in erythrocyte lysis buffer (IBI Scientific #89135-030).

Single-cell RNA-sequencing library preparation. After digestion, single-cell suspensions were washed and resuspended in 0.04% BSA in PBS at a concentration of 106 cells/ml. Cells were counted manually with a hemocytometer to determine their concentration. Single-cell RNA-sequencing libraries were prepared using the Chromium Single Cell 3’ reagent kit v3 (10x Genomics, PN-1000075; Pleasanton, CA) following the manufacturer’s protocol. Cells were diluted into the Chromium Single Cell A Chip to yield a recovery of 6,000 single-cell transcriptomes. After preparation, libraries were sequenced using on a NextSeq 500 (Illumina; San Diego, CA) using 75 cycle high output kits (Index 1 = 8, Read 1 = 26, and Read 2 = 58). Details on estimated sequencing saturation and the number of reads per sample are shown in Sup. Data 1.

Spatial RNA sequencing library preparation. Tibialis anterior muscles of adult (5 mo) C57BL6/J mice were injected with 10µl notexin (10 µg/ml) at 2, 5, and 7 days prior to collection. Upon collection, tibialis anterior muscles were isolated, embedded in OCT, and frozen fresh in liquid nitrogen. Spatially tagged cDNA libraries were built using the Visium Spatial Gene Expression 3’ Library Construction v1 Kit (10x Genomics, PN-1000187; Pleasanton, CA) (Fig. S7). Optimal tissue permeabilization time for 10 µm thick sections was found to be 15 minutes using the 10x Genomics Visium Tissue Optimization Kit (PN-1000193). H&E stained tissue sections were imaged using Zeiss PALM MicroBeam laser capture microdissection system and the images were stitched and processed using Fiji ImageJ software. cDNA libraries were sequenced on an Illumina NextSeq 500 using 150 cycle high output kits (Read 1=28bp, Read 2=120bp, Index 1=10bp, and Index 2=10bp). Frames around the capture area on the Visium slide were aligned manually and spots covering the tissue were selected using Loop Browser v4.0.0 software (10x Genomics). Sequencing data was then aligned to the mouse reference genome (mm10) using the spaceranger v1.0.0 pipeline to generate a feature-by-spot-barcode expression matrix (10x Genomics).

Download and alignment of single-cell RNA sequencing data. For all samples available via SRA, parallel-fastq-dump (github.com/rvalieris/parallel-fastq-dump) was used to download raw .fastq files. Samples which were only available as .bam files were converted to .fastq format using bamtofastq from 10x Genomics (github.com/10XGenomics/bamtofastq). Raw reads were aligned to the mm10 reference using cellranger (v3.1.0).

Preprocessing and batch correction of single-cell RNA sequencing datasets. First, ambient RNA signal was removed using the default SoupX (v1.4.5) workflow (autoEstCounts and adjustCounts; github.com/constantAmateur/SoupX). Samples were then preprocessed using the standard Seurat (v3.2.1) workflow (NormalizeData, ScaleData, FindVariableFeatures, RunPCA, FindNeighbors, FindClusters, and RunUMAP; github.com/satijalab/seurat). Cells with fewer than 750 features, fewer than 1000 transcripts, or more than 30% of unique transcripts derived from mitochondrial genes were removed. After preprocessing, DoubletFinder (v2.0) was used to identify putative doublets in each dataset, individually. BCmvn optimization was used for PK parameterization. Estimated doublet rates were computed by fitting the total number of cells after quality filtering to a linear regression of the expected doublet rates published in the 10x Chromium handbook. Estimated homotypic doublet rates were also accounted for using the modelHomotypic function. The default PN value (0.25) was used. Putative doublets were then removed from each individual dataset. After preprocessing and quality filtering, we merged the datasets and performed batch-correction with three tools, independently- Harmony (github.com/immunogenomics/harmony) (v1.0), Scanorama (github.com/brianhie/scanorama) (v1.3), and BBKNN (github.com/Teichlab/bbknn) (v1.3.12). We then used Seurat to process the integrated data. After initial integration, we removed the noisy cluster and re-integrated the data using each of the three batch-correction tools.

Cell type annotation. Cell types were determined for each integration method independently. For Harmony and Scanorama, dimensions accounting for 95% of the total variance were used to generate SNN graphs (Seurat::FindNeighbors). Louvain clustering was then performed on the output graphs (including the corrected graph output by BBKNN) using Seurat::FindClusters. A clustering resolution of 1.2 was used for Harmony (25 initial clusters), BBKNN (28 initial clusters), and Scanorama (38 initial clusters). Cell types were determined based on expression of canonical genes (Fig. S3). Clusters which had similar canonical marker gene expression patterns were merged.

Pseudotime workflow. Cells were subset based on the consensus cell types between all three integration methods. Harmony embedding values from the dimensions accounting for 95% of the total variance were used for further dimensional reduction with PHATE, using phateR (v1.0.4) (github.com/KrishnaswamyLab/phateR).

Deconvolution of spatial RNA sequencing spots. Spot deconvolution was performed using the deconvolution module in BayesPrism (previously known as “Tumor microEnvironment Deconvolution”, TED, v1.0; github.com/Danko-Lab/TED). First, myogenic cells were re-labeled, according to binning along the first PHATE dimension, as “Quiescent MuSCs” (bins 4-5), “Activated MuSCs” (bins 6-7), “Committed Myoblasts” (bins 8-10), and “Fusing Myoctes” (bins 11-18). Culture-associated muscle stem cells were ignored and myonuclei labels were retained as “Myonuclei (Type IIb)” and “Myonuclei (Type IIx)”. Next, highly and differentially expressed genes across the 25 groups of cells were identified with differential gene expression analysis using Seurat (FindAllMarkers, using Wilcoxon Rank Sum Test; results in Sup. Data 2). The resulting genes were filtered based on average log2-fold change (avg_logFC > 1) and the percentage of cells within the cluster which express each gene (pct.expressed > 0.5), yielding 1,069 genes. Mitochondrial and ribosomal protein genes were also removed from this list, in line with recommendations in the BayesPrism vignette. For each of the cell types, mean raw counts were calculated across the 1,069 genes to generate a gene expression profile for BayesPrism. Raw counts for each spot were then passed to the run.Ted function, using
Marker genes of each MDS-based cluster of PBMCs.
plos.figshare.com
figshare.com
xlsx
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yutaro Kumagai (2024). Marker genes of each MDS-based cluster of PBMCs. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012480.s003
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1012480.s003
Dataset updated
Oct 10, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Yutaro Kumagai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The output of FindAllMarkers function in Seurat was printed as a table. (XLSX)
f
Human tonsillar stromal cells and immune cells
datasetcatalog.nlm.nih.gov
figshare.com
Updated Mar 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
De Martin, Angelina; Lab, Ludewig (2023). Human tonsillar stromal cells and immune cells [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001110067
Explore at:
Dataset updated
Mar 30, 2023
Authors
De Martin, Angelina; Lab, Ludewig
Description
Hematopoietic cells were stained with fluorochrome-conjugated antibodies against human CD45, CD3, CD19 and CD14 and stromal cells with fluorochrome-conjugated antibodies against human CD45 and CD235a. Live/dead cell discrimination was performed by adding 7-amino-actinomycin D (7AAD; Calbiochem) prior to acquisition. CD45– CD235a– stromal cells, CD45+ CD3+ T cells and CD45+ CD19+ B cells were sorted with a BD FACS Melody cell sorter (BD Biosciences) and run on the 10x Chromium analyzer (10X Genomics). cDNA library generation was performed following the established commercial protocol for Chromium Single Cell 3’ Reagent Kit (v3 Chemistry). Libraries were run via Novaseq 6000 for Illumina sequencing at the Functional Genomic Center Zurich. A total of 20 samples were collected from 9 patients and processed in 6 batches. All samples from the same patient were processed in the same batch. Gene expression estimation from sequencing files was done using CellRanger (v3.0.2) count with Ensembl GRCh38.9 release as reference to build the index for human samples. Next, quality control was performed in R v.4.0.0 using the R/Bioconductor package scater (v.1.16.0) and included removal of damaged and contaminating cells based on (1) very high or low UMI counts (>2.5 median absolute deviation from the median across all cells), (2) very high or low total number of detected genes (>2.5 median absolute deviation from the median across all cells) and (3) high mitochondrial gene content (> 2.5 median absolute deviations above the median across all cells). In addition, contaminating cells expressing any of the markers CD3E, PTPRC, CD79A or GYPA were removed from stromal cell samples. Downstream analysis was performed using the Seurat R package (v.4.0.1) and included normalization, scaling, dimensionality reduction with PCA and UMAP, graph-based clustering and calculation of unbiased cluster markers as well as dimensionality reduction with diffusionmap as implemented in the scater R/Bioconductor package (v.1.16.0). Clusters were characterized based on the expression of calculated cluster markers and canonical marker genes as reported in previous publications. For the extended stromal cell analysis, two contaminating clusters with 50 cycling cells and 150 cells expressing both fibroblast and endothelial marker genes (indicative of doublets) were removed. For high resolution FRC analysis, FRC subsets were re-embedded and two clusters containing 256 cells with high levels of endothelial or mitochondrial/non-coding genes, respectively, were excluded. Comparative analysis included determination of cell type-, subset- and condition-specific gene signatures. Thereby differentially expressed genes were calculated running the FindAllMarkers function from Seurat R package.
n
Data from: Dermomyotome-derived endothelial cells migrate to the dorsal...
data.niaid.nih.gov
data-staging.niaid.nih.gov
+1more
zip
Updated Oct 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Traver; Pankaj Sahai-Hernandez; Claire Pouget; Shai Eyal; Ondrej Svoboda; Jose Chacon; Lin Grimm; Tor Gjøen (2023). Dermomyotome-derived endothelial cells migrate to the dorsal aorta to support hematopoietic stem cell emergence [Dataset]. http://doi.org/10.6075/J0GB22J0
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6075/J0GB22J0
Dataset updated
Oct 4, 2023
Dataset provided by
University of California, San Diego
University of Oslo
Authors
David Traver; Pankaj Sahai-Hernandez; Claire Pouget; Shai Eyal; Ondrej Svoboda; Jose Chacon; Lin Grimm; Tor Gjøen
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Development of the dorsal aorta is a key step in the establishment of the adult blood-forming system since hematopoietic stem and progenitor cells (HSPCs) arise from ventral aortic endothelium in all vertebrate animals studied. Work in zebrafish has demonstrated that arterial and venous endothelial precursors arise from distinct subsets of lateral plate mesoderm. Here, we profile the transcriptome of the earliest detectable endothelial cells (ECs) during zebrafish embryogenesis to demonstrate that tissue-specific EC programs initiate much earlier than previously appreciated, by the end of gastrulation. Classic studies in the chick embryo showed that paraxial mesoderm generates a subset of somite-derived endothelial cells (SDECs) that incorporate into the dorsal aorta to replace HSPCs as they exit the aorta and enter circulation. We describe a conserved program in the zebrafish, where a rare population of endothelial precursors delaminates from the dermomyotome to incorporate exclusively into the developing dorsal aorta. Although SDECs lack hematopoietic potential, they act as a local niche to support the emergence of HSPCs from neighboring hemogenic endothelium. Thus, at least three subsets of ECs contribute to the developing dorsal aorta: vascular ECs, hemogenic ECs, and SDECs. Taken together, our findings indicate that the distinct spatial origins of endothelial precursors dictate different cellular potentials within the developing dorsal aorta. Methods Single-cell RNA sample preparation After FACS, total cell concentration and viability were ascertained using a TC20 Automated Cell Counter (Bio-Rad). Samples were then resuspended in 1XPBS with 10% BSA at a concentration between 800-3000 per ml. Samples were loaded on the 10X Chromium system and processed as per manufacturer’s instructions (10X Genomics). Single cell libraries were prepared as per the manufacturer’s instructions using the Single Cell 3’ Reagent Kit v2 (10X Genomics). Single cell RNA-seq libraries and barcode amplicons were sequenced on an Illumina HiSeq platform. Single-cell RNA sequencing analysis The Chromium 3’ sequencing libraries were generated using Chromium Single Cell 3’ Chip kit v3 and sequenced with (actually, I don’t know:( what instrument was used?). The Ilumina FASTQ files were used to generate filtered matrices using CellRanger (10X Genomics) with default parameters and imported into R for exploration and statistical analysis using a Seurat package (La Manno et al., 2018). Counts were normalized according to total expression, multiplied by a scale factor (10,000), and log-transformed. For cell cluster identification and visualization, gene expression values were also scaled according to highly variable genes after controlling for unwanted variation generated by sample identity. Cell clusters were identified based on UMAP of the first 14 principal components of PCA using Seurat’s method, Find Clusters, with an original Louvain algorithm and resolution parameter value 0.5. To find cluster marker genes, Seurat’s method, FindAllMarkers. Only genes exhibiting significant (adjusted p-value < 0.05) a minimal average absolute log2-fold change of 0.2 between each of the clusters and the rest of the dataset were considered as differentially expressed. To merge individual datasets and to remove batch effects, Seurat v3 Integration and Label Transfer standard workflow (Stuart et al., 2019)
Marker genes of each Louvain clustering results of PBMCs based on CITE-seq.
figshare.com
xlsx
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yutaro Kumagai (2024). Marker genes of each Louvain clustering results of PBMCs based on CITE-seq. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012480.s012
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1012480.s012
Dataset updated
Oct 10, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Yutaro Kumagai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The output of FindAllMarkers function in Seurat was printed as a table. (XLSX)
Marker genes of each Louvain clustering results of PBMCs based on CITE-seq.
plos.figshare.com
xlsx
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yutaro Kumagai (2024). Marker genes of each Louvain clustering results of PBMCs based on CITE-seq. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012480.s012
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1012480.s012
Dataset updated
Oct 10, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Yutaro Kumagai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The output of FindAllMarkers function in Seurat was printed as a table. (XLSX)
Marker genes of each BootCellNet2 clustering results of PBMCs based on...
plos.figshare.com
xlsx
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yutaro Kumagai (2024). Marker genes of each BootCellNet2 clustering results of PBMCs based on CITE-seq. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012480.s010
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1012480.s010
Dataset updated
Oct 10, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Yutaro Kumagai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The output of FindAllMarkers function in Seurat was printed as a table. (XLSX)
Marker genes of each BootCellNet2 cluster of BAL cells from COVID-19...
plos.figshare.com
figshare.com
xlsx
Updated Oct 10, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yutaro Kumagai (2024). Marker genes of each BootCellNet2 cluster of BAL cells from COVID-19 patients. [Dataset]. http://doi.org/10.1371/journal.pcbi.1012480.s014
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1012480.s014
Dataset updated
Oct 10, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Yutaro Kumagai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The output of FindAllMarkers function in Seurat was printed as a table. (XLSX)
Additional file 10 of Single-cell transcriptomics highlights immunological...
springernature.figshare.com
xlsx
Updated Feb 7, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Qiqing Huang; Yuanyuan Wang; Lili Zhang; Wei Qian; Shaoran Shen; Jingshen Wang; Shuangshuang Wu; Wei Xu; Bo Chen; Mingyan Lin; Jianqing Wu (2024). Additional file 10 of Single-cell transcriptomics highlights immunological dysregulations of monocytes in the pathobiology of COPD [Dataset]. http://doi.org/10.6084/m9.figshare.22601786.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.22601786.v1
Dataset updated
Feb 7, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Qiqing Huang; Yuanyuan Wang; Lili Zhang; Wei Qian; Shaoran Shen; Jingshen Wang; Shuangshuang Wu; Wei Xu; Bo Chen; Mingyan Lin; Jianqing Wu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Additional file 10: Dataset 8. Lists of markers in 2 sub-clusters of club cells, namely autoimmune-prone sub-cluster of club cells and mix sub-cluster of club cells, related to Fig. 5D and Fig. S5M. The resulting sub-cluster markers were identified by the seurat FindAllMarkers, including the p-value, the average log2(fold change) of the gene in the sub-cluster compared to all other sub-clusters, the percent of cells expressing the gene in the sub-cluster (pct.1), the percent of cells expressing the gene in all other sub-clusters (pct.2), and the adjusted p-value.
f
Skin sc-RNASeq from seven body sites (face, scalp, axilla, palmoplantar,...
plus.figshare.com
bin
Updated Mar 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Lam C Tsoi; Rachael Bogle; Johann Gudjonsson; Meri Oliva; Bridget Riley-Gillis (2025). Skin sc-RNASeq from seven body sites (face, scalp, axilla, palmoplantar, arm, leg, and back) [Dataset]. http://doi.org/10.25452/figshare.plus.25696620.v2
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.25452/figshare.plus.25696620.v2
Dataset updated
Mar 11, 2025
Dataset provided by
Figshare+
Authors
Lam C Tsoi; Rachael Bogle; Johann Gudjonsson; Meri Oliva; Bridget Riley-Gillis
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This sc-RNAseq dataset is composed of disease-unaffected epidermal samples from 96 skin biopsies: 18 from published datasets - GSE173706, GSE249279 – and 78 newly generated ones. Biopsy sample and protocol details, and curated cell-type signature genes, are available in the scRNASeq_source_info_FigShare spreadsheet of this dataset. Processed Seurat object are provided herein. Raw data are available in SRA (id PRJNA1054546). Biopsies originated from seven body sites (face, scalp, axilla, palmoplantar, arm, leg, and back). The skin biopsies were separated into epidermis and dermis before dissociated and enriched for various cell fractions (keratinocytes, fibroblasts, and endothelial cells) and immune cells (myeloid and lymphoid cells) to up sample rare cell types. In total, across body sites, 274,834 cells were profiled, including 96,194 keratinocytes. Seurat v3.0. was utilized to normalize, scale, and reduce the dimensionality of the data. Low quality cells containing less than 200 genes per cell as well as greater than 5,000 genes per cell were filtered out. Cells containing more mitochondrial genes than the permitted quantile of 0.05 were removed. Ambient RNA was removed using R package SoupX v1.6.2. Doublets were removed using scDblFinder v1.12.0. Principal components (PC) were obtained from the topmost 2,000 variable genes, and the Uniform Manifold Approximation and Projection (UMAP) dimensional reduction technique was applied to the 30 topmost variable PC-reduced dataset. Batch effect correction was performed utilizing harmony v1.0, using donor as batch. After batch correction, cells were clustered using shared nearest neighbor modularity optimization-based clustering. Cluster marker genes were identified with FindAllMarkers; cluster corresponding cell type was identified by comparing marker genes to curated cell-type signature genes. Differential expression by keratinocyte subtype was performed with Seurat (v4.3.0) FindMarkers function by comparing keratinocyte subtype to non-keratinocyte clusters. The log fold-change of the average expression between a keratinocyte subtype cluster compared to the rest of clusters is utilized as keratinocyte-subtype gene expression statistic.
f
Table_1_In-depth single-cell and bulk-RNA sequencing developed a...
frontiersin.figshare.com
xlsx
Updated Oct 19, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Liangyu Zhang; Xun Zhang; Maohao Guan; Fengqiang Yu; Fancai Lai (2023). Table_1_In-depth single-cell and bulk-RNA sequencing developed a NETosis-related gene signature affects non-small-cell lung cancer prognosis and tumor microenvironment: results from over 3,000 patients.xlsx [Dataset]. http://doi.org/10.3389/fonc.2023.1282335.s001
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fonc.2023.1282335.s001
Dataset updated
Oct 19, 2023
Dataset provided by
Frontiers
Authors
Liangyu Zhang; Xun Zhang; Maohao Guan; Fengqiang Yu; Fancai Lai
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
BackgroundCell death caused by neutrophil extracellular traps (NETs) is known as NETosis. Despite the increasing importance of NETosis in cancer diagnosis and treatment, its role in Non-Small-Cell Lung Cancer (NSCLC) remains unclear.MethodsA total of 3298 NSCLC patients from different cohorts were included. The AUCell method was used to compute cells’ NETosis scores from single-cell RNA-sequencing data. DEGs in sc-RNA dataset were obtained by the Seurat’s “FindAllMarkers” function, and DEGs in bulk-RNA dataset were acquired by the DESeq2 package. ConsensusClusterPlus package was used to group patients into different NETosis subtypes, and the Enet algorithm was used to construct the NETosis-Related Riskscore (NETRS). Enrichment analyses were conducted using the GSVA and ClusterProfiler packages. Six distinct algorithms were utilized to evaluate patients’ immune cell infiltration level. Patients’ SNV and CNV data were analyzed by maftools and GISTIC2.0, respectively. Drug information was obtained from the GDSC1, and predicted by the Oncopredict package. Patient response to immunotherapy was evaluated by the TIDE algorithm in conjunction with the phs000452 immunotherapy cohort. Six NRGs’ differential expression was verified using qRT-PCR and immunohistochemistry.ResultsAmong all cell types, neutrophils had the highest AUCell score. By Intersecting the DEGs between high and low NETosis classes, DEGs between normal and LUAD tissues, and prognostic related genes, 61 prognostic related NRGs were identified. Based on the 61 NRGs, all LUAD patients can be divided into two clusters, showing different prognostic and TME characteristics. Enet regression identified the NETRS composed of 18 NRGs. NETRS significantly associated with LUAD patients’ clinical characteristics, and patients at different NETRS groups showed significant differences on prognosis, TME characteristics, immune-related molecules’ expression levels, gene mutation frequencies, response to immunotherapy, and drug sensitivity. Besides, NETRS was more powerful than 20 published gene signatures in predicting LUAD patients’ survival. Nine independent cohorts confirmed that NETRS is also valuable in predicting the prognosis of all NSCLC patients. Finally, six NRGs’ expression was confirmed using three independent datasets, qRT-PCR and immunohistochemistry.ConclusionNETRS can serves as a valuable prognostic indicator for patients with NSCLC, providing insights into the tumor microenvironment and predicting the response to cancer therapy.
Marker genes for cell clusters in the integrated scRNA-seq dataset.
figshare.com
plos.figshare.com
csv
Updated Oct 24, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexander Ferrena; Xusheng Zhang; Rupendra Shrestha; Deyou Zheng; Wei Liu (2024). Marker genes for cell clusters in the integrated scRNA-seq dataset. [Dataset]. http://doi.org/10.1371/journal.pone.0308839.s008
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0308839.s008
Dataset updated
Oct 24, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Alexander Ferrena; Xusheng Zhang; Rupendra Shrestha; Deyou Zheng; Wei Liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Related to Fig 1. These cluster markers were identified when one cluster was compared with all other clusters in the integrated scRNA-seq dataset using the function FindAllMarkers in Seurat. (CSV)
Cardiac cells isolated from inflamed murine and human myocardial tissues
figshare.com
application/gzip
Updated Feb 13, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ludewig Lab (2024). Cardiac cells isolated from inflamed murine and human myocardial tissues [Dataset]. http://doi.org/10.6084/m9.figshare.24994478.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.24994478.v1
Dataset updated
Feb 13, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Ludewig Lab
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Droplet-based single-cell and single nucleus RNA sequencing analysis of murine heartsTo obtain sufficient numbers of cells from all cardiac cell types, a total of n=14 samples (WT: 4 samples; TCRM: 6 samples; TCRM isotype: 2 samples; TCRM 14-D10-2: 2 samples) from mouse hearts were processed for single cell RNA sequencing, and n=9 samples (WT: 2 samples; TCRM: 3 samples; TCRM isotype: 2 samples; TCRM 14-D10-2: 2 samples) were prepared for single nucleus RNA sequencing analysis. Samples were processed and sequenced in n=3 batches with all batches spanning multiple conditions. Single cell suspensions were run using the 10x Chromium (10x Genomics) system. The cDNA libraries were generated according to the established commercial protocols for Chromium Single Cell 3’ Reagent Kit (NextGem Chemistry) and Chromium Nuclei Isolation Kit. All libraries were sequenced by NovaSeq 6000 Illumina sequencing at the Functional Genomic Center Zurich. Gene expression was analyzed from sequencing data using CellRanger (v.5.0.1) count, with Ensembl GRCm38.9 as reference. Next, quality control was carried out in R (v.4.2.1) using the R/Bioconductor packages scater (v.1.24.0) and SingleCellExperiment (v.1.18.0) packages. This involved the identification and removal of damaged cells/nuclei or doublets, based on criteria including unusual UMI or gene counts (>2.5 median absolute deviation from the median across all cells) and high mitochondrial gene content (> 2.5 median absolute deviations above the median across all cells). After performing quality control, the final dataset included 31 078 cells and 24 995 nuclei.Downstream analysis was performed using the Seurat R package (v.4.1.1). First, all samples were merged and integrated across data type (single cell or single nucleus data) using the IntegrateData function from the Seurat R package to account for differences between single cell and single nucleus data. Downstream analysis further included normalization, scaling, dimensional reduction with PCA and UMAP, graph-based clustering and calculation of unbiased cluster markers. Clusters were characterized based on the expression of calculated cluster markers and canonical marker genes as reported in previous publications (refs). In order to examine expression signatures of Fibroblasts in more detail cells assigned as Fibroblasts were re-embedded and re-analysed individually.Droplet-based single nucleus RNA sequencing analysis of human heart biopsiesAs for murine samples, isolated nuclei from human heart biopsies were run using the 10x Chromium (10x Genomics) system and cDNA libraries were generated according to the established commercial protocols for Chromium Single Cell 3’ Reagent Kit (NextGem Chemistry) and Chromium Nuclei Isolation Kit. Libraries were sequenced by NovaSeq 6000 Illumina sequencing at the Functional Genomic Center Zurich and gene expression was estimated using CellRanger (v.5.0.1) count, with Ensembl GRCh38.103 as reference. Quality control included the removal of nuclei with unusual UMI or gene counts (>2.5 median absolute deviation from the median across all cells) and was performed in R v.4.2.1 using the R/Bioconductor packages scater (v.1.24.0) and SingleCellExperiment (v.1.18.0).For downstream analysis with the Seurat R package (v.4.3.0) all samples were merged and integrated across patient ID using the IntegrateData function. Integrated data was further processed running normalization, scaling, dimensional reduction with PCA and UMAP, graph-based clustering and calculation of unbiased cluster markers. Clusters were characterized based on the expression of calculated cluster markers and canonical marker genes as reported in previous publications. Following cluster assignments samples were grouped based on their T cell proportions and groups were compared by calculating differentially expressed genes using the FindAllMarkers function from the Seurat R package.
Shared equine BAL cell-type marker genes across HIVE and Drop-seq data.
figshare.com
xls
Updated Jan 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kim Fegraeus; Miia Riihimäki; Jessica Nordlund; Srinivas Akula; Sara Wernersson; Amanda Raine (2025). Shared equine BAL cell-type marker genes across HIVE and Drop-seq data. [Dataset]. http://doi.org/10.1371/journal.pone.0317343.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0317343.t003
Dataset updated
Jan 24, 2025
Dataset provided by
PLOShttp://plos.org/
Authors
Kim Fegraeus; Miia Riihimäki; Jessica Nordlund; Srinivas Akula; Sara Wernersson; Amanda Raine
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The table lists common cluster markers among the top 25 genes with the highest log2FC in each respective dataset, as identified by Seurat’s (v.4.3) FindAllMarkers function.
Additional Data
figshare.com
application/gzip
Updated Aug 5, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jennifer Nguyen (2021). Additional Data [Dataset]. http://doi.org/10.6084/m9.figshare.15109422.v7
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.15109422.v7
Dataset updated
Aug 5, 2021
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Jennifer Nguyen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Here, we provide:Robjects pertaining to scRNA-seq (Seurat) and snATAC-seq (Signac) analysis. These contain the single-cell and single-nuclei used in downstream analyses. Tables containing information about the gene markers identified for each cluster in scRNA-seq, peak markers identified for each cluster in snATAC-seq, and motif enrichment analyses using chromVAR motif scores. Differential gene expression and motif enrichment analyses was performed using Wilcoxon rank sum test comparing the distribution of gene expression or chromVAR motif scores between cells in the cluster and all other cells. Differential peak analyses was performed using FindAllMarkers in Signac.
Novel markers.
plos.figshare.com
xlsx
Updated Sep 18, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peter D. Price; Sylvie M. Parkus; Victoria J. Lloyd; Ben T. Alston; Sasha L. Bradshaw; Sadé Bates; Margaret A. Hughes; Steve Paterson; Terry Burke; Iulia Darolti; Andrew Pomiankowski; Alison E. Wright (2025). Novel markers. [Dataset]. http://doi.org/10.1371/journal.pgen.1011816.s019
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pgen.1011816.s019
Dataset updated
Sep 18, 2025
Dataset provided by
PLOShttp://plos.org/
Authors
Peter D. Price; Sylvie M. Parkus; Victoria J. Lloyd; Ben T. Alston; Sasha L. Bradshaw; Sadé Bates; Margaret A. Hughes; Steve Paterson; Terry Burke; Iulia Darolti; Andrew Pomiankowski; Alison E. Wright
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Novel marker genes identified using FindAllMarkers from Seurat. In brief, for each cluster, differentially expressed genes were identified between the cluster and all other clusters in the dataset. (XLSX)
Top 10 Cluster Markers.
plos.figshare.com
xlsx
Updated Jun 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aparna Jorapur; Lisa A. Marshall; Scott Jacobson; Mengshu Xu; Sachie Marubayashi; Mikhail Zibinsky; Dennis X. Hu; Omar Robles; Jeffrey J. Jackson; Valentin Baloche; Pierre Busson; David Wustrow; Dirk G. Brockstedt; Oezcan Talay; Paul D. Kassner; Gene Cutler (2023). Top 10 Cluster Markers. [Dataset]. http://doi.org/10.1371/journal.ppat.1010200.s016
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.ppat.1010200.s016
Dataset updated
Jun 13, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Aparna Jorapur; Lisa A. Marshall; Scott Jacobson; Mengshu Xu; Sachie Marubayashi; Mikhail Zibinsky; Dennis X. Hu; Omar Robles; Jeffrey J. Jackson; Valentin Baloche; Pierre Busson; David Wustrow; Dirk G. Brockstedt; Oezcan Talay; Paul D. Kassner; Gene Cutler
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Gene markers for each cell cluster in the single cell RNA-Sequencing data, as identified by the “FindAllMarkers” function of the Seurat analysis package. Additional details are given at the top of the table. (XLSX)
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

David McKellar; Iwijn De Vlaminck; Benjamin Cosgrove (2021). Large-scale integration of single-cell transcriptomic data captures transitional progenitor states in mouse skeletal muscle regeneration [Dataset]. http://doi.org/10.5061/dryad.t4b8gtj34

Data from: Large-scale integration of single-cell transcriptomic data captures transitional progenitor states in mouse skeletal muscle regeneration

Explore at:

zipAvailable download formats

Unique identifier

https://doi.org/10.5061/dryad.t4b8gtj34

Dataset updated

Dec 14, 2021

Dataset provided by

Cornell University

Authors

David McKellar; Iwijn De Vlaminck; Benjamin Cosgrove

License

https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

Description

Skeletal muscle repair is driven by the coordinated self-renewal and fusion of myogenic stem and progenitor cells. Single-cell gene expression analyses of myogenesis have been hampered by the poor sampling of rare and transient cell states that are critical for muscle repair, and do not inform the spatial context that is important for myogenic differentiation. Here, we demonstrate how large-scale integration of single-cell and spatial transcriptomic data can overcome these limitations. We created a single-cell transcriptomic dataset of mouse skeletal muscle by integration, consensus annotation, and analysis of 23 newly collected scRNAseq datasets and 88 publicly available single-cell (scRNAseq) and single-nucleus (snRNAseq) RNA-sequencing datasets. The resulting dataset includes more than 365,000 cells and spans a wide range of ages, injury, and repair conditions. Together, these data enabled identification of the predominant cell types in skeletal muscle, and resolved cell subtypes, including endothelial subtypes distinguished by vessel-type of origin, fibro/adipogenic progenitors defined by functional roles, and many distinct immune populations. The representation of different experimental conditions and the depth of transcriptome coverage enabled robust profiling of sparsely expressed genes. We built a densely sampled transcriptomic model of myogenesis, from stem cell quiescence to myofiber maturation and identified rare, transitional states of progenitor commitment and fusion that are poorly represented in individual datasets. We performed spatial RNA sequencing of mouse muscle at three time points after injury and used the integrated dataset as a reference to achieve a high-resolution, local deconvolution of cell subtypes. We also used the integrated dataset to explore ligand-receptor co-expression patterns and identify dynamic cell-cell interactions in muscle injury response. We provide a public web tool to enable interactive exploration and visualization of the data. Our work supports the utility of large-scale integration of single-cell transcriptomic data as a tool for biological discovery.

Methods Mice. The Cornell University Institutional Animal Care and Use Committee (IACUC) approved all animal protocols, and experiments were performed in compliance with its institutional guidelines. Adult C57BL/6J mice (mus musculus) were obtained from Jackson Laboratories (#000664; Bar Harbor, ME) and were used at 4-7 months of age. Aged C57BL/6J mice were obtained from the National Institute of Aging (NIA) Rodent Aging Colony and were used at 20 months of age. For new scRNAseq experiments, female mice were used in each experiment.

Mouse injuries and single-cell isolation. To induce muscle injury, both tibialis anterior (TA) muscles of old (20 months) C57BL/6J mice were injected with 10 µl of notexin (10 µg/ml; Latoxan; France). At 0, 1, 2, 3.5, 5, or 7 days post-injury (dpi), mice were sacrificed and TA muscles were collected and processed independently to generate single-cell suspensions. Muscles were digested with 8 mg/ml Collagenase D (Roche; Switzerland) and 10 U/ml Dispase II (Roche; Switzerland), followed by manual dissociation to generate cell suspensions. Cell suspensions were sequentially filtered through 100 and 40 μm filters (Corning Cellgro #431752 and #431750) to remove debris. Erythrocytes were removed through incubation in erythrocyte lysis buffer (IBI Scientific #89135-030).

Single-cell RNA-sequencing library preparation. After digestion, single-cell suspensions were washed and resuspended in 0.04% BSA in PBS at a concentration of 106 cells/ml. Cells were counted manually with a hemocytometer to determine their concentration. Single-cell RNA-sequencing libraries were prepared using the Chromium Single Cell 3’ reagent kit v3 (10x Genomics, PN-1000075; Pleasanton, CA) following the manufacturer’s protocol. Cells were diluted into the Chromium Single Cell A Chip to yield a recovery of 6,000 single-cell transcriptomes. After preparation, libraries were sequenced using on a NextSeq 500 (Illumina; San Diego, CA) using 75 cycle high output kits (Index 1 = 8, Read 1 = 26, and Read 2 = 58). Details on estimated sequencing saturation and the number of reads per sample are shown in Sup. Data 1.

Spatial RNA sequencing library preparation. Tibialis anterior muscles of adult (5 mo) C57BL6/J mice were injected with 10µl notexin (10 µg/ml) at 2, 5, and 7 days prior to collection. Upon collection, tibialis anterior muscles were isolated, embedded in OCT, and frozen fresh in liquid nitrogen. Spatially tagged cDNA libraries were built using the Visium Spatial Gene Expression 3’ Library Construction v1 Kit (10x Genomics, PN-1000187; Pleasanton, CA) (Fig. S7). Optimal tissue permeabilization time for 10 µm thick sections was found to be 15 minutes using the 10x Genomics Visium Tissue Optimization Kit (PN-1000193). H&E stained tissue sections were imaged using Zeiss PALM MicroBeam laser capture microdissection system and the images were stitched and processed using Fiji ImageJ software. cDNA libraries were sequenced on an Illumina NextSeq 500 using 150 cycle high output kits (Read 1=28bp, Read 2=120bp, Index 1=10bp, and Index 2=10bp). Frames around the capture area on the Visium slide were aligned manually and spots covering the tissue were selected using Loop Browser v4.0.0 software (10x Genomics). Sequencing data was then aligned to the mouse reference genome (mm10) using the spaceranger v1.0.0 pipeline to generate a feature-by-spot-barcode expression matrix (10x Genomics).

Download and alignment of single-cell RNA sequencing data. For all samples available via SRA, parallel-fastq-dump (github.com/rvalieris/parallel-fastq-dump) was used to download raw .fastq files. Samples which were only available as .bam files were converted to .fastq format using bamtofastq from 10x Genomics (github.com/10XGenomics/bamtofastq). Raw reads were aligned to the mm10 reference using cellranger (v3.1.0).

Preprocessing and batch correction of single-cell RNA sequencing datasets. First, ambient RNA signal was removed using the default SoupX (v1.4.5) workflow (autoEstCounts and adjustCounts; github.com/constantAmateur/SoupX). Samples were then preprocessed using the standard Seurat (v3.2.1) workflow (NormalizeData, ScaleData, FindVariableFeatures, RunPCA, FindNeighbors, FindClusters, and RunUMAP; github.com/satijalab/seurat). Cells with fewer than 750 features, fewer than 1000 transcripts, or more than 30% of unique transcripts derived from mitochondrial genes were removed. After preprocessing, DoubletFinder (v2.0) was used to identify putative doublets in each dataset, individually. BCmvn optimization was used for PK parameterization. Estimated doublet rates were computed by fitting the total number of cells after quality filtering to a linear regression of the expected doublet rates published in the 10x Chromium handbook. Estimated homotypic doublet rates were also accounted for using the modelHomotypic function. The default PN value (0.25) was used. Putative doublets were then removed from each individual dataset. After preprocessing and quality filtering, we merged the datasets and performed batch-correction with three tools, independently- Harmony (github.com/immunogenomics/harmony) (v1.0), Scanorama (github.com/brianhie/scanorama) (v1.3), and BBKNN (github.com/Teichlab/bbknn) (v1.3.12). We then used Seurat to process the integrated data. After initial integration, we removed the noisy cluster and re-integrated the data using each of the three batch-correction tools.

Cell type annotation. Cell types were determined for each integration method independently. For Harmony and Scanorama, dimensions accounting for 95% of the total variance were used to generate SNN graphs (Seurat::FindNeighbors). Louvain clustering was then performed on the output graphs (including the corrected graph output by BBKNN) using Seurat::FindClusters. A clustering resolution of 1.2 was used for Harmony (25 initial clusters), BBKNN (28 initial clusters), and Scanorama (38 initial clusters). Cell types were determined based on expression of canonical genes (Fig. S3). Clusters which had similar canonical marker gene expression patterns were merged.

Pseudotime workflow. Cells were subset based on the consensus cell types between all three integration methods. Harmony embedding values from the dimensions accounting for 95% of the total variance were used for further dimensional reduction with PHATE, using phateR (v1.0.4) (github.com/KrishnaswamyLab/phateR).

Deconvolution of spatial RNA sequencing spots. Spot deconvolution was performed using the deconvolution module in BayesPrism (previously known as “Tumor microEnvironment Deconvolution”, TED, v1.0; github.com/Danko-Lab/TED). First, myogenic cells were re-labeled, according to binning along the first PHATE dimension, as “Quiescent MuSCs” (bins 4-5), “Activated MuSCs” (bins 6-7), “Committed Myoblasts” (bins 8-10), and “Fusing Myoctes” (bins 11-18). Culture-associated muscle stem cells were ignored and myonuclei labels were retained as “Myonuclei (Type IIb)” and “Myonuclei (Type IIx)”. Next, highly and differentially expressed genes across the 25 groups of cells were identified with differential gene expression analysis using Seurat (FindAllMarkers, using Wilcoxon Rank Sum Test; results in Sup. Data 2). The resulting genes were filtered based on average log2-fold change (avg_logFC > 1) and the percentage of cells within the cluster which express each gene (pct.expressed > 0.5), yielding 1,069 genes. Mitochondrial and ribosomal protein genes were also removed from this list, in line with recommendations in the BayesPrism vignette. For each of the cell types, mean raw counts were calculated across the 1,069 genes to generate a gene expression profile for BayesPrism. Raw counts for each spot were then passed to the run.Ted function, using

Clear search

Close search

Google apps

Main menu

Data from: Large-scale integration of single-cell transcriptomic data...

Marker genes of each MDS-based cluster of PBMCs.

Human tonsillar stromal cells and immune cells

Data from: Dermomyotome-derived endothelial cells migrate to the dorsal...

Marker genes of each Louvain clustering results of PBMCs based on CITE-seq.

Marker genes of each Louvain clustering results of PBMCs based on CITE-seq.

Marker genes of each BootCellNet2 clustering results of PBMCs based on...

Marker genes of each BootCellNet2 cluster of BAL cells from COVID-19...

Additional file 10 of Single-cell transcriptomics highlights immunological...

Skin sc-RNASeq from seven body sites (face, scalp, axilla, palmoplantar,...

Table_1_In-depth single-cell and bulk-RNA sequencing developed a...

Marker genes for cell clusters in the integrated scRNA-seq dataset.

Cardiac cells isolated from inflamed murine and human myocardial tissues

Shared equine BAL cell-type marker genes across HIVE and Drop-seq data.

Additional Data

Novel markers.

Top 10 Cluster Markers.

Data from: Large-scale integration of single-cell transcriptomic data captures transitional progenitor states in mouse skeletal muscle regeneration