8 datasets found
  1. f

    scanpy-pbmc3k.h5ad

    • figshare.com
    Updated Aug 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luke Zappia (2021). scanpy-pbmc3k.h5ad [Dataset]. http://doi.org/10.6084/m9.figshare.16447278.v1
    Explore at:
    Dataset updated
    Aug 26, 2021
    Dataset provided by
    figshare
    Authors
    Luke Zappia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    H5AD file created by following the Scanpy PBMC3K tutorial

  2. PBMC 3k test datasets for besca

    • zenodo.org
    bin
    Updated Jan 18, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Klas Hatje; Klas Hatje; Alice Julien-Laferrière; Alice Julien-Laferrière (2021). PBMC 3k test datasets for besca [Dataset]. http://doi.org/10.5281/zenodo.3948150
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 18, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Klas Hatje; Klas Hatje; Alice Julien-Laferrière; Alice Julien-Laferrière
    License

    https://www.gnu.org/licenses/agpl.txthttps://www.gnu.org/licenses/agpl.txt

    Description

    This is a single cell transcriptomics dataset containing roughly 3,000 PBMCs. The original data was downloaded from the Seurat 3k PBMC tutorial: https://satijalab.org/seurat/v3.0/pbmc3k_tutorial.html. We reprocessed the dataset using the Besca package (https://github.com/bedapub/besca).

  3. scverse tutorial data: Getting started with AnnData

    • figshare.com
    hdf
    Updated Apr 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jan Lause (2023). scverse tutorial data: Getting started with AnnData [Dataset]. http://doi.org/10.6084/m9.figshare.22577536.v2
    Explore at:
    hdfAvailable download formats
    Dataset updated
    Apr 7, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Jan Lause
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data is derived from the 3k PBMC data used in scanpy & Seurat tutorials. In comes in the AnnData h5ad format.

    Processed 3k PBMCs from a Healthy Donor from 10x Genomics, available at https://scanpy.readthedocs.io/en/stable/generated/scanpy.datasets.pbmc3k_processed.html Original 10X data available at http://cf.10xgenomics.com/samples/cell-exp/1.1.0/pbmc3k/pbmc3k_filtered_gene_bc_matrices.tar.gz from this website: https://support.10xgenomics.com/single-cell-gene-expression/datasets/1.1.0/pbmc3k

    The changes made to the original scanpy.datasets.pbmc3k_processed() data are described in this github issue: https://github.com/scverse/scverse-tutorials/issues/51

    See jupyter notebook for details.

  4. PBMC_3k_labels

    • zenodo.org
    bin
    Updated Dec 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rishab Munjal; Rishab Munjal (2021). PBMC_3k_labels [Dataset]. http://doi.org/10.5281/zenodo.5775898
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 13, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Rishab Munjal; Rishab Munjal
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cell type labels for the pbmc3k dataset.

  5. f

    scClassifier's tutorial datasets

    • figshare.com
    application/gzip
    Updated Dec 19, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Feng Zeng (2019). scClassifier's tutorial datasets [Dataset]. http://doi.org/10.6084/m9.figshare.11407743.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Dec 19, 2019
    Dataset provided by
    figshare
    Authors
    Feng Zeng
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository includes two datasets. The first one named pbmc3k.RData is a scRNA-seq dataset of 3,000 human PBMCs generated by 10x Genomics. The second one named immuno_navigator_human_expression.RData is a wrapper of Immuno-Navigator database.

  6. domino2: data for a reproducible example

    • zenodo.org
    bin, csv, tsv
    Updated Nov 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jacob T. Mitchell; Jacob T. Mitchell (2023). domino2: data for a reproducible example [Dataset]. http://doi.org/10.5281/zenodo.10124866
    Explore at:
    csv, bin, tsvAvailable download formats
    Dataset updated
    Nov 14, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jacob T. Mitchell; Jacob T. Mitchell
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository hosts example data for reproducible analysis of intra- and intercellular signaling in single cell RNA sequencing (scRNAseq) data based on transcription factor (TF) activation. We demonstrate analysis using domino2 on the 10X Genomics Peripheral Blood Mononuclear Cells (PBMC) data set of 2,700 cells PBMC3K. scRNA-seq data is preprocessed following the Satija Lab's Guided Clustering Tutorial. Quantification of TF activation is conducted using pySCENIC. For more details on how this analysis is conducted, please refer to the vignettes in the domino2 package.

  7. f

    Table_1_Patterns, Profiles, and Parsimony: Dissecting Transcriptional...

    • frontiersin.figshare.com
    xlsx
    Updated Jun 3, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oswaldo A. Lozoya; Kathryn S. McClelland; Brian N. Papas; Jian-Liang Li; Humphrey H.-C. Yao (2023). Table_1_Patterns, Profiles, and Parsimony: Dissecting Transcriptional Signatures From Minimal Single-Cell RNA-Seq Output With SALSA.XLSX [Dataset]. http://doi.org/10.3389/fgene.2020.511286.s005
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Frontiers
    Authors
    Oswaldo A. Lozoya; Kathryn S. McClelland; Brian N. Papas; Jian-Liang Li; Humphrey H.-C. Yao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Single-cell RNA sequencing (scRNA-seq) technologies have precipitated the development of bioinformatic tools to reconstruct cell lineage specification and differentiation processes with single-cell precision. However, current start-up costs and recommended data volumes for statistical analysis remain prohibitively expensive, preventing scRNA-seq technologies from becoming mainstream. Here, we introduce single-cell amalgamation by latent semantic analysis (SALSA), a versatile workflow that combines measurement reliability metrics with latent variable extraction to infer robust expression profiles from ultra-sparse sc-RNAseq data. SALSA uses a matrix focusing approach that starts by identifying facultative genes with expression levels greater than experimental measurement precision and ends with cell clustering based on a minimal set of Profiler genes, each one a putative biomarker of cluster-specific expression profiles. To benchmark how SALSA performs in experimental settings, we used the publicly available 10X Genomics PBMC 3K dataset, a pre-curated silver standard from human frozen peripheral blood comprising 2,700 single-cell barcodes, and identified 7 major cell groups matching transcriptional profiles of peripheral blood cell types and driven agnostically by < 500 Profiler genes. Finally, we demonstrate successful implementation of SALSA in a replicative scRNA-seq scenario by using previously published DropSeq data from a multi-batch mouse retina experimental design, thereby identifying 10 transcriptionally distinct cell types from > 64,000 single cells across 7 independent biological replicates based on < 630 Profiler genes. With these results, SALSA demonstrates that robust pattern detection from scRNA-seq expression matrices only requires a fraction of the accrued data, suggesting that single-cell sequencing technologies can become affordable and widespread if meant as hypothesis-generation tools to extract large-scale differential expression effects.

  8. SeuratExtend Tutorial: Curated Example Datasets for Single-Cell Analysis

    • zenodo.org
    bin
    Updated Dec 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yichao Hua; Yichao Hua (2024). SeuratExtend Tutorial: Curated Example Datasets for Single-Cell Analysis [Dataset]. http://doi.org/10.5281/zenodo.10944066
    Explore at:
    binAvailable download formats
    Dataset updated
    Dec 12, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Yichao Hua; Yichao Hua
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This repository contains example datasets specifically curated for the SeuratExtend tutorial, aimed at facilitating advanced analyses and visualization techniques in single-cell genomics. The datasets have been derived from publicly available data obtained from the 10X Genomics website and have undergone careful preprocessing to serve specific tutorial goals.

    The collection includes the following datasets:

    1. Myeloid Subset from PBMC 10k Dataset: This subset focuses on myeloid cells extracted from the larger PBMC 10k dataset, showcasing a preprocessed SeuratObject stored as an RDS file. The data serve as a primary example for demonstrating the capabilities of SeuratExtend differentiation trajectory analysis.

    2. Velocyto LOOM File of Myeloid Subset from PBMC 10k Dataset: Accompanying the first dataset, this Velocyto-generated LOOM file represents a subset of the same myeloid cells, focusing on RNA velocity analyses. It provides a dynamic perspective on gene expression changes over time, enriching the tutorial with advanced single-cell transcriptomics insights.

    3. SCENIC-Processed PBMC 3k Dataset: An outcome of running the SCENIC workflow on the PBMC 3k dataset, this LOOM file represents a refined dataset highlighting gene regulation networks. It serves as an advanced example for users interested in exploring gene regulatory mechanisms using SeuratExtend.

    Each dataset has been subsetted and processed, making them ideal for users ranging from beginners to advanced researchers in the field of single-cell genomics. The provided data are intended for educational and tutorial purposes, allowing users to gain hands-on experience with real-world single-cell analysis scenarios.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Luke Zappia (2021). scanpy-pbmc3k.h5ad [Dataset]. http://doi.org/10.6084/m9.figshare.16447278.v1

scanpy-pbmc3k.h5ad

Explore at:
Dataset updated
Aug 26, 2021
Dataset provided by
figshare
Authors
Luke Zappia
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

H5AD file created by following the Scanpy PBMC3K tutorial

Search
Clear search
Close search
Google apps
Main menu