Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
H5AD file created by following the Scanpy PBMC3K tutorial
https://www.gnu.org/licenses/agpl.txthttps://www.gnu.org/licenses/agpl.txt
This is a single cell transcriptomics dataset containing roughly 3,000 PBMCs. The original data was downloaded from the Seurat 3k PBMC tutorial: https://satijalab.org/seurat/v3.0/pbmc3k_tutorial.html. We reprocessed the dataset using the Besca package (https://github.com/bedapub/besca).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The data is derived from the 3k PBMC data used in scanpy & Seurat tutorials. In comes in the AnnData h5ad format.
Processed 3k PBMCs from a Healthy Donor from 10x Genomics, available at https://scanpy.readthedocs.io/en/stable/generated/scanpy.datasets.pbmc3k_processed.html Original 10X data available at http://cf.10xgenomics.com/samples/cell-exp/1.1.0/pbmc3k/pbmc3k_filtered_gene_bc_matrices.tar.gz from this website: https://support.10xgenomics.com/single-cell-gene-expression/datasets/1.1.0/pbmc3k
The changes made to the original scanpy.datasets.pbmc3k_processed()
data are described in this github issue: https://github.com/scverse/scverse-tutorials/issues/51
See jupyter notebook for details.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Cell type labels for the pbmc3k dataset.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository includes two datasets. The first one named pbmc3k.RData is a scRNA-seq dataset of 3,000 human PBMCs generated by 10x Genomics. The second one named immuno_navigator_human_expression.RData is a wrapper of Immuno-Navigator database.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository hosts example data for reproducible analysis of intra- and intercellular signaling in single cell RNA sequencing (scRNAseq) data based on transcription factor (TF) activation. We demonstrate analysis using domino2 on the 10X Genomics Peripheral Blood Mononuclear Cells (PBMC) data set of 2,700 cells PBMC3K. scRNA-seq data is preprocessed following the Satija Lab's Guided Clustering Tutorial. Quantification of TF activation is conducted using pySCENIC. For more details on how this analysis is conducted, please refer to the vignettes in the domino2 package.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Single-cell RNA sequencing (scRNA-seq) technologies have precipitated the development of bioinformatic tools to reconstruct cell lineage specification and differentiation processes with single-cell precision. However, current start-up costs and recommended data volumes for statistical analysis remain prohibitively expensive, preventing scRNA-seq technologies from becoming mainstream. Here, we introduce single-cell amalgamation by latent semantic analysis (SALSA), a versatile workflow that combines measurement reliability metrics with latent variable extraction to infer robust expression profiles from ultra-sparse sc-RNAseq data. SALSA uses a matrix focusing approach that starts by identifying facultative genes with expression levels greater than experimental measurement precision and ends with cell clustering based on a minimal set of Profiler genes, each one a putative biomarker of cluster-specific expression profiles. To benchmark how SALSA performs in experimental settings, we used the publicly available 10X Genomics PBMC 3K dataset, a pre-curated silver standard from human frozen peripheral blood comprising 2,700 single-cell barcodes, and identified 7 major cell groups matching transcriptional profiles of peripheral blood cell types and driven agnostically by < 500 Profiler genes. Finally, we demonstrate successful implementation of SALSA in a replicative scRNA-seq scenario by using previously published DropSeq data from a multi-batch mouse retina experimental design, thereby identifying 10 transcriptionally distinct cell types from > 64,000 single cells across 7 independent biological replicates based on < 630 Profiler genes. With these results, SALSA demonstrates that robust pattern detection from scRNA-seq expression matrices only requires a fraction of the accrued data, suggesting that single-cell sequencing technologies can become affordable and widespread if meant as hypothesis-generation tools to extract large-scale differential expression effects.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This repository contains example datasets specifically curated for the SeuratExtend tutorial, aimed at facilitating advanced analyses and visualization techniques in single-cell genomics. The datasets have been derived from publicly available data obtained from the 10X Genomics website and have undergone careful preprocessing to serve specific tutorial goals.
The collection includes the following datasets:
Myeloid Subset from PBMC 10k Dataset: This subset focuses on myeloid cells extracted from the larger PBMC 10k dataset, showcasing a preprocessed SeuratObject stored as an RDS file. The data serve as a primary example for demonstrating the capabilities of SeuratExtend differentiation trajectory analysis.
Velocyto LOOM File of Myeloid Subset from PBMC 10k Dataset: Accompanying the first dataset, this Velocyto-generated LOOM file represents a subset of the same myeloid cells, focusing on RNA velocity analyses. It provides a dynamic perspective on gene expression changes over time, enriching the tutorial with advanced single-cell transcriptomics insights.
SCENIC-Processed PBMC 3k Dataset: An outcome of running the SCENIC workflow on the PBMC 3k dataset, this LOOM file represents a refined dataset highlighting gene regulation networks. It serves as an advanced example for users interested in exploring gene regulatory mechanisms using SeuratExtend.
Each dataset has been subsetted and processed, making them ideal for users ranging from beginners to advanced researchers in the field of single-cell genomics. The provided data are intended for educational and tutorial purposes, allowing users to gain hands-on experience with real-world single-cell analysis scenarios.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
H5AD file created by following the Scanpy PBMC3K tutorial