8 datasets found

f
scanpy-pbmc3k.h5ad
figshare.com
Updated Aug 26, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Luke Zappia (2021). scanpy-pbmc3k.h5ad [Dataset]. http://doi.org/10.6084/m9.figshare.16447278.v1
Explore at:
Unique identifier
https://doi.org/10.6084/m9.figshare.16447278.v1
Dataset updated
Aug 26, 2021
Dataset provided by
figshare
Authors
Luke Zappia
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
H5AD file created by following the Scanpy PBMC3K tutorial
PBMC 3k test datasets for besca
zenodo.org
bin
Updated Jan 18, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Klas Hatje; Klas Hatje; Alice Julien-Laferrière; Alice Julien-Laferrière (2021). PBMC 3k test datasets for besca [Dataset]. http://doi.org/10.5281/zenodo.3948150
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3948150
Dataset updated
Jan 18, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Klas Hatje; Klas Hatje; Alice Julien-Laferrière; Alice Julien-Laferrière
License
https://www.gnu.org/licenses/agpl.txthttps://www.gnu.org/licenses/agpl.txt
Description
This is a single cell transcriptomics dataset containing roughly 3,000 PBMCs. The original data was downloaded from the Seurat 3k PBMC tutorial: https://satijalab.org/seurat/v3.0/pbmc3k_tutorial.html. We reprocessed the dataset using the Besca package (https://github.com/bedapub/besca).
scverse tutorial data: Getting started with AnnData
figshare.com
hdf
Updated Apr 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jan Lause (2023). scverse tutorial data: Getting started with AnnData [Dataset]. http://doi.org/10.6084/m9.figshare.22577536.v2
Explore at:
hdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.22577536.v2
Dataset updated
Apr 7, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Jan Lause
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data is derived from the 3k PBMC data used in scanpy & Seurat tutorials. In comes in the AnnData h5ad format.

Processed 3k PBMCs from a Healthy Donor from 10x Genomics, available at https://scanpy.readthedocs.io/en/stable/generated/scanpy.datasets.pbmc3k_processed.html Original 10X data available at http://cf.10xgenomics.com/samples/cell-exp/1.1.0/pbmc3k/pbmc3k_filtered_gene_bc_matrices.tar.gz from this website: https://support.10xgenomics.com/single-cell-gene-expression/datasets/1.1.0/pbmc3k

The changes made to the original scanpy.datasets.pbmc3k_processed() data are described in this github issue: https://github.com/scverse/scverse-tutorials/issues/51

See jupyter notebook for details.
PBMC_3k_labels
zenodo.org
bin
Updated Dec 13, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rishab Munjal; Rishab Munjal (2021). PBMC_3k_labels [Dataset]. http://doi.org/10.5281/zenodo.5775898
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5775898
Dataset updated
Dec 13, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rishab Munjal; Rishab Munjal
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Cell type labels for the pbmc3k dataset.
f
scClassifier's tutorial datasets
figshare.com
application/gzip
Updated Dec 19, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Feng Zeng (2019). scClassifier's tutorial datasets [Dataset]. http://doi.org/10.6084/m9.figshare.11407743.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.11407743.v1
Dataset updated
Dec 19, 2019
Dataset provided by
figshare
Authors
Feng Zeng
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository includes two datasets. The first one named pbmc3k.RData is a scRNA-seq dataset of 3,000 human PBMCs generated by 10x Genomics. The second one named immuno_navigator_human_expression.RData is a wrapper of Immuno-Navigator database.
domino2: data for a reproducible example
zenodo.org
bin, csv, tsv
Updated Nov 14, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jacob T. Mitchell; Jacob T. Mitchell (2023). domino2: data for a reproducible example [Dataset]. http://doi.org/10.5281/zenodo.10124866
Explore at:
csv, bin, tsvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.10124866
Dataset updated
Nov 14, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Jacob T. Mitchell; Jacob T. Mitchell
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository hosts example data for reproducible analysis of intra- and intercellular signaling in single cell RNA sequencing (scRNAseq) data based on transcription factor (TF) activation. We demonstrate analysis using domino2 on the 10X Genomics Peripheral Blood Mononuclear Cells (PBMC) data set of 2,700 cells PBMC3K. scRNA-seq data is preprocessed following the Satija Lab's Guided Clustering Tutorial. Quantification of TF activation is conducted using pySCENIC. For more details on how this analysis is conducted, please refer to the vignettes in the domino2 package.
f
Table_1_Patterns, Profiles, and Parsimony: Dissecting Transcriptional...
frontiersin.figshare.com
xlsx
Updated Jun 3, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Oswaldo A. Lozoya; Kathryn S. McClelland; Brian N. Papas; Jian-Liang Li; Humphrey H.-C. Yao (2023). Table_1_Patterns, Profiles, and Parsimony: Dissecting Transcriptional Signatures From Minimal Single-Cell RNA-Seq Output With SALSA.XLSX [Dataset]. http://doi.org/10.3389/fgene.2020.511286.s005
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2020.511286.s005
Dataset updated
Jun 3, 2023
Dataset provided by
Frontiers
Authors
Oswaldo A. Lozoya; Kathryn S. McClelland; Brian N. Papas; Jian-Liang Li; Humphrey H.-C. Yao
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Single-cell RNA sequencing (scRNA-seq) technologies have precipitated the development of bioinformatic tools to reconstruct cell lineage specification and differentiation processes with single-cell precision. However, current start-up costs and recommended data volumes for statistical analysis remain prohibitively expensive, preventing scRNA-seq technologies from becoming mainstream. Here, we introduce single-cell amalgamation by latent semantic analysis (SALSA), a versatile workflow that combines measurement reliability metrics with latent variable extraction to infer robust expression profiles from ultra-sparse sc-RNAseq data. SALSA uses a matrix focusing approach that starts by identifying facultative genes with expression levels greater than experimental measurement precision and ends with cell clustering based on a minimal set of Profiler genes, each one a putative biomarker of cluster-specific expression profiles. To benchmark how SALSA performs in experimental settings, we used the publicly available 10X Genomics PBMC 3K dataset, a pre-curated silver standard from human frozen peripheral blood comprising 2,700 single-cell barcodes, and identified 7 major cell groups matching transcriptional profiles of peripheral blood cell types and driven agnostically by < 500 Profiler genes. Finally, we demonstrate successful implementation of SALSA in a replicative scRNA-seq scenario by using previously published DropSeq data from a multi-batch mouse retina experimental design, thereby identifying 10 transcriptionally distinct cell types from > 64,000 single cells across 7 independent biological replicates based on < 630 Profiler genes. With these results, SALSA demonstrates that robust pattern detection from scRNA-seq expression matrices only requires a fraction of the accrued data, suggesting that single-cell sequencing technologies can become affordable and widespread if meant as hypothesis-generation tools to extract large-scale differential expression effects.
SeuratExtend Tutorial: Curated Example Datasets for Single-Cell Analysis
zenodo.org
bin
Updated Dec 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yichao Hua; Yichao Hua (2024). SeuratExtend Tutorial: Curated Example Datasets for Single-Cell Analysis [Dataset]. http://doi.org/10.5281/zenodo.10944066
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.10944066
Dataset updated
Dec 12, 2024
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Yichao Hua; Yichao Hua
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This repository contains example datasets specifically curated for the SeuratExtend tutorial, aimed at facilitating advanced analyses and visualization techniques in single-cell genomics. The datasets have been derived from publicly available data obtained from the 10X Genomics website and have undergone careful preprocessing to serve specific tutorial goals.

The collection includes the following datasets:

Myeloid Subset from PBMC 10k Dataset: This subset focuses on myeloid cells extracted from the larger PBMC 10k dataset, showcasing a preprocessed SeuratObject stored as an RDS file. The data serve as a primary example for demonstrating the capabilities of SeuratExtend differentiation trajectory analysis.

Velocyto LOOM File of Myeloid Subset from PBMC 10k Dataset: Accompanying the first dataset, this Velocyto-generated LOOM file represents a subset of the same myeloid cells, focusing on RNA velocity analyses. It provides a dynamic perspective on gene expression changes over time, enriching the tutorial with advanced single-cell transcriptomics insights.

SCENIC-Processed PBMC 3k Dataset: An outcome of running the SCENIC workflow on the PBMC 3k dataset, this LOOM file represents a refined dataset highlighting gene regulation networks. It serves as an advanced example for users interested in exploring gene regulatory mechanisms using SeuratExtend.

Each dataset has been subsetted and processed, making them ideal for users ranging from beginners to advanced researchers in the field of single-cell genomics. The provided data are intended for educational and tutorial purposes, allowing users to gain hands-on experience with real-world single-cell analysis scenarios.
Not seeing a result you expected?
Learn how you can add new datasets to our index.