75 datasets found

Data, R code and output Seurat Objects for single cell RNA-seq analysis of...
figshare.com
application/gzip
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yunshun Chen; Gordon Smyth (2023). Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues [Dataset]. http://doi.org/10.6084/m9.figshare.17058077.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.17058077.v1
Dataset updated
May 31, 2023
Dataset provided by
Figsharehttp://figshare.com/
Authors
Yunshun Chen; Gordon Smyth
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset contains all the Seurat objects that were used for generating all the figures in Pal et al. 2021 (https://doi.org/10.15252/embj.2020107333). All the Seurat objects were created under R v3.6.1 using the Seurat package v3.1.1. The detailed information of each object is listed in a table in Chen et al. 2021.
f
Scripts for Analysis
figshare.com
txt
Updated Jul 18, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sneddon Lab UCSF (2018). Scripts for Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.6783569.v2
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.6783569.v2
Dataset updated
Jul 18, 2018
Dataset provided by
figshare
Authors
Sneddon Lab UCSF
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Scripts used for analysis of V1 and V2 Datasets.seurat_v1.R - initialize seurat object from 10X Genomics cellranger outputs. Includes filtering, normalization, regression, variable gene identification, PCA analysis, clustering, tSNE visualization. Used for v1 datasets. merge_seurat.R - merge two or more seurat objects into one seurat object. Perform linear regression to remove batch effects from separate objects. Used for v1 datasets. subcluster_seurat_v1.R - subcluster clusters of interest from Seurat object. Determine variable genes, perform regression and PCA. Used for v1 datasets.seurat_v2.R - initialize seurat object from 10X Genomics cellranger outputs. Includes filtering, normalization, regression, variable gene identification, and PCA analysis. Used for v2 datasets. clustering_markers_v2.R - clustering and tSNE visualization for v2 datasets. subcluster_seurat_v2.R - subcluster clusters of interest from Seurat object. Determine variable genes, perform regression and PCA analysis. Used for v2 datasets.seurat_object_analysis_v1_and_v2.R - downstream analysis and plotting functions for seurat object created by seurat_v1.R or seurat_v2.R. merge_clusters.R - merge clusters that do not meet gene threshold. Used for both v1 and v2 datasets. prepare_for_monocle_v1.R - subcluster cells of interest and perform linear regression, but not scaling in order to input normalized, regressed values into monocle with monocle_seurat_input_v1.R monocle_seurat_input_v1.R - monocle script using seurat batch corrected values as input for v1 merged timecourse datasets. monocle_lineage_trace.R - monocle script using nUMI as input for v2 lineage traced dataset. monocle_object_analysis.R - downstream analysis for monocle object - BEAM and plotting. CCA_merging_v2.R - script for merging v2 endocrine datasets with canonical correlation analysis and determining the number of CCs to include in downstream analysis. CCA_alignment_v2.R - script for downstream alignment, clustering, tSNE visualization, and differential gene expression analysis.
Z
Processed, annotated, seurat object
data.niaid.nih.gov
Updated Nov 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Guillaume Thibault (2023). Processed, annotated, seurat object [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7608211
Explore at:
Dataset updated
Nov 16, 2023
Dataset provided by
Cenk Celik
Guillaume Thibault
Description
The dataset contains an integrated, annotated Seurat v4 object. One can load the dataset into the R environment using the code below:

seurat_obj <- readRDS('PATH/TO/DOWNLOAD/seurat.rds')

The object has three assays: (I) RNA, (II) SCT and (III) integrated.
f
Seurat
figshare.com
txt
Updated Feb 22, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yanqiang Ding (2024). Seurat [Dataset]. http://doi.org/10.6084/m9.figshare.25263802.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.25263802.v1
Dataset updated
Feb 22, 2024
Dataset provided by
figshare
Authors
Yanqiang Ding
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Seurat R script.
Seurat objects associated with the tonsil cell atlas
zenodo.org
application/gzip, bin +1
Updated Sep 28, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ramon Massoni-Badosa; Ramon Massoni-Badosa (2023). Seurat objects associated with the tonsil cell atlas [Dataset]. http://doi.org/10.5281/zenodo.8373756
Explore at:
bin, application/gzip, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.8373756
Dataset updated
Sep 28, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Ramon Massoni-Badosa; Ramon Massoni-Badosa
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In the context of the Human Cell Atlas, we have created a single-cell-driven taxonomy of cell types and states in human tonsils. This repository contains the Seurat objects derived from this effort. In particular, we have datasets for each modality (scRNA-seq, scATAC-seq, CITE-seq, spatial transcriptomics), as well as cell type-specific datasets. Most importantly, this is the input that we used to create the HCATonsilData package, which allows programmatic access to all this datasets within R.

Version 2 of this repository includes cells from 7 additional donors, which we used as a validation cohort to validate the cell types and states defined in the atlas. In addition, in this version we also provide the Seurat object associated with the spatial transcriptomics data (10X Visium), as well as the fragments files for scATAC-seq and Multiome
Integrated Breast Cancer Atlas Seurat Object
zenodo.org
bin
Updated Jan 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Cenk Celik; Cenk Celik (2025). Integrated Breast Cancer Atlas Seurat Object [Dataset]. http://doi.org/10.5281/zenodo.14001194
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.14001194
Dataset updated
Jan 10, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Cenk Celik; Cenk Celik
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The integrated Seurat object that can be imported into R environment using the code below:

library(Seurat)
seurat_obj <- readRDS("integrated_with_quiescence.rds")
Z
Single-cell Atlas Reveals Diagnostic Features Predicting Progressive Drug...
data.niaid.nih.gov
explore.openaire.eu
+1more
Updated Sep 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
John Ouyang (2023). Single-cell Atlas Reveals Diagnostic Features Predicting Progressive Drug Resistance in Chronic Myeloid Leukemia [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5118610
Explore at:
Dataset updated
Sep 7, 2023
Dataset provided by
Zahid Nawaz
Shyam Prabhakar
Alice Man Sze Cheung
Pavanish Kumar
Vaidehi Krishnan
Sudipto Bari
Salvatore Albani
John Ouyang
Prasanna Nori Venkatesh
Florian Schmidt
Owen Rackham
Sin Tiong Ong
William Ying Khee Hwang
Ahmad Lajam
Chan Zhu En
Charles Chuah
Lee Kian Leong
Meera Makheja
Description
This archive contains data of scRNAseq and CyTOF in form of Seurat objects, txt and csv files as well as R scripts for data analysis and Figure generation.

A summary of the content is provided in the following.

R scripts

Script to run Machine learning models predicting group specific marker genes: CML_Find_Markers_Zenodo.R Script to reproduce the majority of Main and Supplementary Figures shown in the manuscript: CML_Paper_Figures_Zenodo.R Script to run inferCNV analysis: inferCNV_Zenodo.R Script to plot NATMI analysis results:NATMI_CvsA_FC0.32_Updown_Column_plot_Zenodo.R Script to conduct sub-clustering and filtering of NK cells NK_Marker_Detection_Zenodo.R

Helper scripts for plotting and DEG calculation:ComputePairWiseDE_v2.R, Seurat_DE_Heatmap_RCA_Style.R

RDS files

General scRNA-seq Seurat objects:

scRNA-seq seurat object after QC, and cell type annotation used for most analysis in the manuscript: DUKE_DataSet_Doublets_Removed_Relabeled.RDS

scRNA-seq including findings e.g. from NK analysis used in the shiny app: DUKE_final_for_Shiny_App.rds

Neighborhood enrichment score computed for group A across all HSPCs: Enrichment_score_global_groupA.RDS

UMAP coordinates used in the article: Layout_2D_nNeighbours_25_Metric_cosine_TCU_removed.RDS

SCENIC files:

Regulon set used in SCENIC: 2.6_regulons_asGeneSet.Rds

AUC values computed for regulons: 3.4_regulonAUC.Rds

MetaData used in SCENIC cellInfo.Rds

Group specific regulons for LCS: groupSpecificRegulonsBCRAblP.RDS

Patient specific regulons for LSC: patientSpecificRegulonsBCRAblP.RDS

Patient specificity score for LSC: PatientSpecificRegulonSpecificityScoreBCRAblP.RDS

Regulon specificty score for LSC: RegulonSpecificityScoreBCRAblP.RDS

BCR-ABL1 inference:

HSC with inferred BCR-ABL1 label: HSCs_CML_with_BCR-Abl_label.RDS

UMAP for HSC with inferred BCR-ABL1 label: HSCs_CML_with_BCR-Abl_label_UMAP.RDS

HSPCs with BCR-ABL1 module scores: HSPC_metacluster_74K_with_modscore_27thmay.RDS

NK sub-clustering and filtering:

NK object with module scores: NK_8617cells_with_modscore_1stjune.RDS

Feature genes for NK cells computed with DubStepR: NK_Cells_DubStepR

NK cells Seurat object excluding contaminating T and B cells: NK_cells_T_B_17_removed.RDS

NK Seurat object including neighbourhood enrichment score calculations: NK_seurat_object_with_enrichment_labels_V2.RDS

txt and csv files:

Proportions per cluster calculated from CyTOF: CyTOF_Proportions.txt

Correlation between scRNAseq and CyTOF cell type abundance: scRNAseq_Cor_Cytof.txt

Correlation between manual gating and FlowSOM clustering: Manual_vs_FlowSOM.txt

GSEA results:

HSPC, HSC and LSC results: FINAL_GSEA_DATA_For_GGPLOT.txt

NK: NK_For_Plotting.txt

TFRC and HLA expression: TFRC_and_HLA_Values.txt

NATMI result files:

UP-regulated_mean.csv

DOWN-regulated_mean.csv

Gene position file used in inferCNV: inferCNV_gene_positions_hg38.txt

Module scores for NK subclusters per cell: NK_Supplementary_Module_Scores.csv

Compressed folders:

All CyTOF raw data files: CyTOF_Data_raw.zip

Results of the patient-based classifier: PatientwiseClassifier.zip

Results of the single-cell based classifier: SingleCellClassifierResults.zip

For general new data analysis approaches, we recommend the readers to use the Seruat object stored in DUKE_final_for_Shiny_App.rds or to use the shiny app(http://scdbm.ddnetbio.com/) and perform further analysis from there.

RAW data is available at EGA upon request using Study ID: EGAS00001005509

Revision

The for_CML_manuscript_revision.tar.gz folder contains scripts and data for the paper revision including 1) Detection of the BCR-ABL fusion with long read sequencing; 2) Identification of BCR-ABL junction reads with scRNAseq; 3) Detection of expressed mutations using scRNAseq.
H
Dan R Laks Code of Seurat analysis integration of 8PDX scRNA-Seq...
dataverse.harvard.edu
Updated Nov 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dan Laks (2021). Dan R Laks Code of Seurat analysis integration of 8PDX scRNA-Seq datasets_Xie-Laks-Parada et al., 2021 [Dataset]. http://doi.org/10.7910/DVN/J5MVOR
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/J5MVOR
Dataset updated
Nov 21, 2021
Dataset provided by
Harvard Dataverse
Authors
Dan Laks
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Dan R Laks Code of Seurat analysis integration of 8PDX scRNA-Seq datasets_Xie-Laks-Parada et al., 2021
Data from: A single-cell atlas characterizes dysregulation of the bone...
zenodo.org
Updated Jan 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
William Pilcher; William Pilcher (2025). A single-cell atlas characterizes dysregulation of the bone marrow immune microenvironment associated with outcomes in multiple myeloma [Dataset]. http://doi.org/10.5281/zenodo.14624955
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.14624955
Dataset updated
Jan 14, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
William Pilcher; William Pilcher
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
May 8, 2024
Description
This repository contains R Seurat objects associated with our study titled "A single-cell atlas characterizes dysregulation of the bone marrow immune microenvironment associated with outcomes in multiple myeloma".

Single cell data contained within this object comes from MMRF Immune Atlas Consortium work.

The .rds files contains a Seurat object saved with version 4.3. This can be loaded in R with the readRDS command.

Two .RDS files are included in this version of the release.

Discovery object: MMRF_ImmuneAtlas_Full_With_Corrected_Censored_Metadata.rds contains all aliquots belonging to the 'discovery' cohort as used in the initial paper. This represents the dataset used for initial clustering, cell annotation, and analysis.

Discovery + Validation object: COMBINED_VALIDATION_MMRF_ImmuneAtlas_Full_Censored_Metadata.rds contains both aliquots belonging to the initial 'discovery' cohort, and aliquots belonging to the 'validation' cohort. The group each cell is derived from is listed under the 'cohort' variable. Labels related to cell annotation, including doublet status, are derived from a label transfer process as described in the paper. Labels for the original 'discovery' cohort are unchanged. UMAPs have been reconstructed with both the discovery and validation cohorts integrated.

--

The discovery object contains two assays:

"RNA" - The raw count matrix

"RNA_Batch_Corrected" - Counts adjusted for the combination of 'Study_Site' and 'Batch'.

Analysis should prefer the original RNA assay, unless using pipelines which does not support adjusting for technical covariates.

Currently, the validation object only includes the uncorrected RNA assay.

--

The object contains two umaps in the reduction slot:

umap - will render the UMAP for the full object with all cells.

umap.sub -contains the UMAP embeddings for individual 'compartments', as indicated by 'subcluster_V03072023_compartment'

--

Each sample has three different identifiers:

public_id

Indicates a specific patient (n=263).

MMRF_####

This is a standard identifier which is used across all MMRF CoMMpass datasets

public_ids can map to multiple d_visit_specimen_ids and aliquot_ids

As of now, all public_ids have a single sample collected at Baseline.

This can be accessed by filtering for 'collection_event' %in% c("Baseline", "Screening") or VJ_INTERVAL == 'Baseline'

d_visit_specimen_id

Indicates a specific visit by a patient (n=358)

MMRF_####_Y

Y is a number indicate that this is the 'Y' sample obtained from said patient. This does not correspond to a specific timepoint.

This is a standard identifier, which is used across all MMRF CoMMpass datasets

The purpose of the visit is indicated in 'collection_event' (Baseline, Relapse, Remmission, etc.). The approximate interval the visit corresponds to is in "VJ_INTERVAL"

d_visit_specimen_id uniquely maps to one public_id

d_visit_specimen_id can map to multiple aliquot_ids

aliquot_id

Refers to the specific bone marrow aliquot sample processed (n=361)

MMRFA-######

This is a unique identifier for each processed scRNA-seq sample.

As of now, this uniquely maps to a combination of d_visit_specimen_id, Study_Site, and Batch

As of now, is an identifier specific to the MMRF ImmuneAtlas

Each cell has the following annotation information:

subcluster_V03072023

These refer to an individual cluster derived from 'Seurat'.

Format is 'Compartment'.'Compartment-cluster'.'Compartment-subcluster'

'NkT.2.2', indicates this cell is in the 'Natural Killer + T Cell compartment', was originally part of 'Cluster 2', and then was further separated into a refined subcluster 2.2'

If a parent cluster did not need to be further seprated, the 'Compartment-subcluster' part is omitted (e.g., 'NkT.6')

As of now, this uniquely maps to a specific cellID_short annotation.

Clustering was done on a per compartment basis

For most immune cell types, clustering was based on embeddings corrected for 'siteXbatch'. For Plasma, clustering was performed on embeddings corrected on a per-sample basis.

In the combined validation object, DISCOVERY.subcluster_V03072023 will contain values only for the discovery cohort, and have NA values for validation samples.

subcluster_V03072023_compartment

These refer to one of five major compartments as identified roughly on the original UMAP. Clustering was performed on a per-compartment basis following a first pass rough annotation.

The possible compartments are

NkT (T cell + Natural Killer Cells)

Myeloid (Monocytes, Macrophages, Dendritic cells, Neutrophil/Granulocyte populations)

BEry (B Cell, Erythroblasts, bone marrow progenitor populations, pDCs)

Ery (Erythrocyte population)

Plasma (Plasma cell populations)

Each compartment has it's own UMAP generated, which can be accessed in the 'umap.sub' reduction

One cluster was isolated from all other populations, and was not assigned to a compartment. This cluster is labeled as 'Full.23'.

In the combined validation object, DISCOVERY.subcluster_V03072023_compartment will contain values only for the discovery cohort, and have NA values for validation samples.

cellID_short

This is the individual annotation for each cluster.

Please see the 'Cell Population Annotation Dictionary' for further details.

If different seurat clusters were assigned similar annotations, the celltype annotation will be appended with a distinct cluster gene, or with '_b', '_c'

lineage_group

This is an annotation driven grouping of clusters into major immune populations, as shown in Figure 2.

This includes "CD8", "CD4", "M" (Myeloid), "B" (B cell), "E" (Erythroid), "P" (Plasma), "Other" (HSC, Fibro, pDC_a), "LQ" (Doublet)

isDoublet

This is a binary 'True' or 'False' derived from manual review of clusters following doublet analysis, as described in the paper.

True indicates the cluster was determined to be a doublet population.

This is derived from 'doublet_pred', in which 'dblet_cluster' and 'poss_dblet_cluster' were flagged as doublet populations for subsequent analysis.

In the validation object, the doublet status of new samples were inferred by if label transfer from the discovery cohort mapped the cell from the new sample as one of the previously identified doublet populations. The raw doublet scores from doublet finder, pegasus, or scrublet, are not included in this release.

--

Each sample has the following information indicating shipment batches, for batch correction

Study_Site

The center which processed a specific aliquot_id

EMORY, MSSM, WashU, MAYO

Batch

The shipment batch the sample was associated with

Valued 1 to 3 for EMORY, MSSM, MAYO, and 1 to 4 for WashU

siteXbatch

A combination of the above to variables, to be used for batch correction

(Combined Validation Object only): cohort

Indicates if the sample was involved in the 'discovery' cohort, or 'validation' cohort. Samples in the 'validation' cohort will have labels inferred from label mapping

--

Each public_id has limited demographic information based on publicly available information in the MMRF CoMMpass study.

d_pt_sex

Patient sex (not self-identified). Male or Female

d_pt_race_1

Patient self-identified race

d_pt_ethnicity

Patient self-identified ethnicity

d_dx_amm_age

Patient age at diagnosis.

Not reported for patients above 90 at diagnosis

d_dx_amm_bmi

Patient BMI at diagnosis

d_pt_height_cm

Patient height at diagnosis, in centimeters.

d_dx_amm_weight_kg

Patient weight at diagnosis, in kilograms

d_specimen_visit_id contains two data points providing limited information about the visit

collection_event

Description of why the sample was collected

e.g., 'Baseline' and 'Screening' indicates the sample was obtained prior to therapy

'Relapse/Progression' indicates the sample was collected due to disease progression based on clinical assessment

'Remission/Response' indicates the sample was collected due to patient entering remission based on clinical assessment

Samples may be collected for reasons independent of the above, such as 'Pre' or 'Post' ASCT, or for other unspecified reasons

VJ_INTERVAL

Indicates the rough interval following start of therapy the sample is assigned to

"Baseline", "Month 3", "Year 2", etc.

All the single-cell raw data, along with outcome and cytogenetic information, is available at MMRF’s VLAB shared resource. Requests to access these data will be reviewed by data access committee at MMRF and any data shared will be released under a data transfer agreement that will protect the identities of patients involved in the study. Other information from the CoMMpass trial can also generally be
pbmc single cell RNA-seq matrix
zenodo.org
csv
Updated May 4, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Samuel Buchet; Samuel Buchet; Francesco Carbone; Morgan Magnin; Morgan Magnin; Mickaël Ménager; Olivier Roux; Olivier Roux; Francesco Carbone; Mickaël Ménager (2021). pbmc single cell RNA-seq matrix [Dataset]. http://doi.org/10.5281/zenodo.4730807
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4730807
Dataset updated
May 4, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Samuel Buchet; Samuel Buchet; Francesco Carbone; Morgan Magnin; Morgan Magnin; Mickaël Ménager; Olivier Roux; Olivier Roux; Francesco Carbone; Mickaël Ménager
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Single cell RNA-sequencing dataset of peripheral blood mononuclear cells (pbmc: T, B, NK and monocytes) extracted from two healthy donors.

Cells labeled as C26 come from a 30 years old female and cells labeled as C27 come from a 53 years old male. Cells have been isolated from blood using ficoll. Samples were sequenced using standard 3' v3 chemistry protocols by 10x genomics. Cellranger v4.0.0 was used for the processing, and reads were aligned to the ensembl GRCg38 human genome (GRCg38_r98-ensembl_Sept2019). QC metrics were calculated on the count matrix generated by cellranger (filtered_feature_bc_matrix). Cells with less than 3 genes per cells, less than 500 reads per cell and more than 20% of mithocondrial genes were discarded.

The processing steps was performed with the R package Seurat (https://satijalab.org/seurat/), including sample integration, data normalisation and scaling, dimensional reduction, and clustering. SCTransform method was adopted for the normalisation and scaling steps. The clustered cells were manually annotated using known cell type markers.

Files content:

- raw_dataset.csv: raw gene counts

- normalized_dataset.csv: normalized gene counts (single cell matrix)

- cell_types.csv: cell types identified from annotated cell clusters

- cell_types_macro.csv: cell macro types

- UMAP_coordinates.csv: 2d cell coordinates computed with UMAP algorithm in Seurat
f
Processed naive T cell single-cell RNA-seq, Seurat object
figshare.com
application/gzip
Updated Jan 5, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Daniel Bunis (2021). Processed naive T cell single-cell RNA-seq, Seurat object [Dataset]. http://doi.org/10.6084/m9.figshare.11886891.v2
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.11886891.v2
Dataset updated
Jan 5, 2021
Dataset provided by
figshare
Authors
Daniel Bunis
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Processed naive CD4 and CD8 T cell single-cell RNAseq data from human samples. The file contains a Seurat object stored as an .rds file which can be read into R with the readRDS() function. It was generated using the raw data of similar name in this project, as well as the code stored here: https://github.com/dtm2451/ProgressiveHematopoiesis
f
R script for snRNAseq analysis
fairdomhub.org
text/x-r-source
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Markus Wolfien; Anne-Marie Galow (2020). R script for snRNAseq analysis [Dataset]. https://fairdomhub.org/data_files/3308
Explore at:
text/x-r-source(20.2 KB)Available download formats
Dataset updated
Jan 24, 2020
Authors
Markus Wolfien; Anne-Marie Galow
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This file contains the R-script to analyse single nuclei data previously processed with kallisto and bustools. The analyses utilizes the Seurat, harmony and RNAvelocity package.
E
Single-cell transcriptomics uncovers zonation of function in the mesenchyme...
find.data.gov.scot
dtechtive.com
txt
Updated Feb 12, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
University of Edinburgh Centre for Inflammation Research (2020). Single-cell transcriptomics uncovers zonation of function in the mesenchyme during liver fibrosis - Seurat objects [Dataset]. http://doi.org/10.7488/ds/2769
Explore at:
txt(0.0166 MB), txt(0.0013 MB)Available download formats
Unique identifier
https://doi.org/10.7488/ds/2769
Dataset updated
Feb 12, 2020
Dataset provided by
University of Edinburgh Centre for Inflammation Research
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
UNITED KINGDOM
Description
We profile the transcriptomes of ~30,000 mouse single cells to deconvolve the hepatic mesenchyme in healthy and fibrotic liver at high resolution. We reveal spatial zonation of hepatic stellate cells across the liver lobule, designated portal vein-associated HSC and central vein-associated HSC, and uncover an equivalent functional zonation in a mouse model of centrilobular fibrosis. Our work illustrates the power of single-cell transcriptomics to resolve key collagen-producing cells driving liver fibrosis with high precision. We provide the contents of these data as Seurat R objects.
l
cellCounts
opal.latrobe.edu.au
researchdata.edu.au
bin
Updated Dec 19, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yang Liao; Dinesh Raghu; Bhupinder Pal; Lisa Mielke; Wei Shi (2022). cellCounts [Dataset]. http://doi.org/10.26181/21588276.v3
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.26181/21588276.v3
Dataset updated
Dec 19, 2022
Dataset provided by
La Trobe
Authors
Yang Liao; Dinesh Raghu; Bhupinder Pal; Lisa Mielke; Wei Shi
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This page includes the data and code necessary to reproduce the results of the following paper: Yang Liao, Dinesh Raghu, Bhupinder Pal, Lisa Mielke and Wei Shi. cellCounts: fast and accurate quantification of 10x Chromium single-cell RNA sequencing data. Under review. A Linux computer running an operating system of CentOS 7 (or later) or Ubuntu 20.04 (or later) is recommended for running this analysis. The computer should have >2 TB of disk space and >64 GB of RAM. The following software packages need to be installed before running the analysis. Software executables generated after installation should be included in the $PATH environment variable.

R (v4.0.0 or newer) https://www.r-project.org/ Rsubread (v2.12.2 or newer) http://bioconductor.org/packages/3.16/bioc/html/Rsubread.html CellRanger (v6.0.1) https://support.10xgenomics.com/single-cell-gene-expression/software/overview/welcome STARsolo (v2.7.10a) https://github.com/alexdobin/STAR sra-tools (v2.10.0 or newer) https://github.com/ncbi/sra-tools Seurat (v3.0.0 or newer) https://satijalab.org/seurat/ edgeR (v3.30.0 or newer) https://bioconductor.org/packages/edgeR/ limma (v3.44.0 or newer) https://bioconductor.org/packages/limma/ mltools (v0.3.5 or newer) https://cran.r-project.org/web/packages/mltools/index.html

Reference packages generated by 10x Genomics are also required for this analysis and they can be downloaded from the following link (2020-A version for individual human and mouse reference packages should be selected): https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest After all these are done, you can simply run the shell script ‘test-all-new.bash’ to perform all the analyses carried out in the paper. This script will automatically download the mixture scRNA-seq data from the SRA database, and it will output a text file called ‘test-all.log’ that contains all the screen outputs and speed/accuracy results of CellRanger, STARsolo and cellCounts.
f
ProjecTILs murine reference atlas of tumor-infiltrating T cells, version 1
figshare.com
application/gzip
Updated Jun 29, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Massimo Andreatta; Santiago Carmona (2023). ProjecTILs murine reference atlas of tumor-infiltrating T cells, version 1 [Dataset]. http://doi.org/10.6084/m9.figshare.12478571.v2
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.12478571.v2
Dataset updated
Jun 29, 2023
Dataset provided by
figshare
Authors
Massimo Andreatta; Santiago Carmona
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We have developed ProjecTILs, a computational approach to project new data sets into a reference map of T cells, enabling their direct comparison in a stable, annotated system of coordinates. Because new cells are embedded in the same space of the reference, ProjecTILs enables the classification of query cells into annotated, discrete states, but also over a continuous space of intermediate states. By comparing multiple samples over the same map, and across alternative embeddings, the method allows exploring the effect of cellular perturbations (e.g. as the result of therapy or genetic engineering) and identifying genetic programs significantly altered in the query compared to a control set or to the reference map. We illustrate the projection of several data sets from recent publications over two cross-study murine T cell reference atlases: the first describing tumor-infiltrating T lymphocytes (TILs), the second characterizing acute and chronic viral infection.To construct the reference TIL atlas, we obtained single-cell gene expression matrices from the following GEO entries: GSE124691, GSE116390, GSE121478, GSE86028; and entry E-MTAB-7919 from Array-Express. Data from GSE124691 contained samples from tumor and from tumor-draining lymph nodes, and were therefore treated as two separate datasets. For the TIL projection examples (OVA Tet+, miR-155 KO and Regnase-KO), we obtained the gene expression counts from entries GSE122713, GSE121478 and GSE137015, respectively.Prior to dataset integration, single-cell data from individual studies were filtered using TILPRED-1.0 (https://github.com/carmonalab/TILPRED), which removes cells not enriched in T cell markers (e.g. Cd2, Cd3d, Cd3e, Cd3g, Cd4, Cd8a, Cd8b1) and cells enriched in non T cell genes (e.g. Spi1, Fcer1g, Csf1r, Cd19). Dataset integration was performed using STACAS (https://github.com/carmonalab/STACAS), a batch-correction algorithm based on Seurat 3. For the TIL reference map, we specified 600 variable genes per dataset, excluding cell cycling genes, mitochondrial, ribosomal and non-coding genes, as well as genes expressed in less than 0.1% or more than 90% of the cells of a given dataset. For integration, a total of 800 variable genes were derived as the intersection of the 600 variable genes of individual datasets, prioritizing genes found in multiple datasets and, in case of draws, those derived from the largest datasets. We determined pairwise dataset anchors using STACAS with default parameters, and filtered anchors using an anchor score threshold of 0.8. Integration was performed using the IntegrateData function in Seurat3, providing the anchor set determined by STACAS, and a custom integration tree to initiate alignment from the largest and most heterogeneous datasets.Next, we performed unsupervised clustering of the integrated cell embeddings using the Shared Nearest Neighbor (SNN) clustering method implemented in Seurat 3 with parameters {resolution=0.6, reduction=”umap”, k.param=20}. We then manually annotated individual clusters (merging clusters when necessary) based on several criteria: i) average expression of key marker genes in individual clusters; ii) gradients of gene expression over the UMAP representation of the reference map; iii) gene-set enrichment analysis to determine over- and under- expressed genes per cluster using MAST. In order to have access to predictive methods for UMAP, we recomputed PCA and UMAP embeddings independently of Seurat3 using respectively the prcomp function from basic R package “stats”, and the “umap” R package (https://github.com/tkonopka/umap).
scRNA-seq for article: Giardia intestinalis-induced Type 2 mucosal immunity...
zenodo.org
bin
Updated May 12, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aline Sardinha-Silva; Pedro Gazzinelli-Guimaraes; Oluwaremilekun Ajakaye; Fabricio Oliveira; Tiago Rodrigues Ferreira; Tiago Rodrigues Ferreira; Eliza Alves-Ferreira; Erick Tjhin; Beth Gregg; Marc Fink; Camila Coelho; Steven Singer; Michael Grigg; Aline Sardinha-Silva; Pedro Gazzinelli-Guimaraes; Oluwaremilekun Ajakaye; Fabricio Oliveira; Eliza Alves-Ferreira; Erick Tjhin; Beth Gregg; Marc Fink; Camila Coelho; Steven Singer; Michael Grigg (2025). scRNA-seq for article: Giardia intestinalis-induced Type 2 mucosal immunity attenuates bystander intestinal inflammation - filtered and annotated data [Dataset]. http://doi.org/10.5281/zenodo.15390320
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.15390320
Dataset updated
May 12, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Aline Sardinha-Silva; Pedro Gazzinelli-Guimaraes; Oluwaremilekun Ajakaye; Fabricio Oliveira; Tiago Rodrigues Ferreira; Tiago Rodrigues Ferreira; Eliza Alves-Ferreira; Erick Tjhin; Beth Gregg; Marc Fink; Camila Coelho; Steven Singer; Michael Grigg; Aline Sardinha-Silva; Pedro Gazzinelli-Guimaraes; Oluwaremilekun Ajakaye; Fabricio Oliveira; Eliza Alves-Ferreira; Erick Tjhin; Beth Gregg; Marc Fink; Camila Coelho; Steven Singer; Michael Grigg
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Single-cell RNA-seq dataset from sorted IL-10+, TCRb+, CD4+ mouse small intestine lamina propria cells in naive or Giardia intestinalis-infected animals at 7 d.p.i. Data analyses and results are described in Sardinha-Silva et al., Nature Microbiology, 2025: "Giardia intestinalis-induced Type 2 mucosal immunity attenuates bystander intestinal inflammation". Data are Seurat objects in RDS format. Filtered-out potential doublets, low quality cells and dying cells (excluded cells with <800 genes detected, cells with >5000 genes detected and cells with mitochondrial gene expression > 10%). Data normalization, scaling and integration performed using Seurat v 4.4.0.

Full filtered dataset in the "alineGiardia.combined_v4.rds" file. Related R code is found in "giardia_mouse_integration.R".

T cells of interest only in the "T.seurat.rds" file. Related R code is in "TcellSubsets_sc_analysis.R".

Dataset was also mapped to a reference dataset by Kiner et al., Nature Immunology, 2021. The post-mapping data is found in the "refmap_kiner.seurat.rds" file. Related R code is in "referenceMapping_Kineretal2021.R".
u
Dawnn benchmarking dataset: Simulated branching trajectories processing and...
rdr.ucl.ac.uk
txt
Updated May 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
George Hall; Sergi Castellano Hereza (2023). Dawnn benchmarking dataset: Simulated branching trajectories processing and label simulation [Dataset]. http://doi.org/10.5522/04/22619851.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.5522/04/22619851.v1
Dataset updated
May 4, 2023
Dataset provided by
University College London
Authors
George Hall; Sergi Castellano Hereza
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This project is a collection of files to allow users to reproduce the model development and benchmarking in "Dawnn: single-cell differential abundance with neural networks" (Hall and Castellano, under review). Dawnn is a tool for detecting differential abundance in single-cell RNAseq datasets. It is available as an R package here. Please contact us if you are unable to reproduce any of the analysis in our paper. The files in this collection correspond to the benchmarking dataset based on simulated branching trajectories.

FILES: Data processing code

adapted_traj_sim_milo_paper.R Lightly adapted code from Dann et al. to simulate single-cell RNAseq datasets that form branching trajectories . generate_test_data_branching_traj_sim_milo_paper.R R code to assign simulated labels to datatsets generated from adapted_traj_sim_milo_paper.R. Seurat objects saved as cells_sim_branching_traj_gex_seed_*.rds. Simulated labels saved as benchmark_dataset_sim_branching_traj.csv.

Resulting datasets

cells_sim_branching_traj_gex_seed_*.rds Seurat objects generated by generate_test_data_branching_traj_sim_milo_paper.R. benchmark_dataset_sim_branching_traj.csv Cell labels generated by generate_test_data_branching_traj_sim_milo_paper.R.
Processed Seurat objects for GeneTrajectory inference (Gene Trajectory...
figshare.com
application/gzip
Updated Feb 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rihao Qu; Peggy Myung (2024). Processed Seurat objects for GeneTrajectory inference (Gene Trajectory Inference for Single-cell Data by Optimal Transport Metrics) [Dataset]. http://doi.org/10.6084/m9.figshare.25243225.v1
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.25243225.v1
Dataset updated
Feb 19, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Rihao Qu; Peggy Myung
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
These are processed Seurat objects for the two biological datasets in GeneTrajectory inference (https://github.com/KlugerLab/GeneTrajectory/):Human myeloid dataset analysisMyeloid cells were extracted from a publicly available 10x scRNA-seq dataset (https:// support.10xgenomics.com/single-cell-gene-expression/datasets/3.0.0/pbmc 10k v3). QC was performed using the same workflow in (https://github.com/satijalab/ Integration2019/blob/master/preprocessing scripts/pbmc 10k v3.R). After standard normalization, highly-variable gene selection and scaling using the Seurat R package, we applied PCA and retained the top 30 principal components. Four sub-clusters of myeloid cells were identified based on Louvian clustering with a resolution of 0.3. Wilcoxon rank-sum test was employed to find cluster-specific gene markers for cell type annotation.For gene trajectory inference, we first applied Diffusion Map on the cell PC embedding (using a local-adaptive kernel, each bandwidth is determined by the distance to its k-nearest neighbor, k = 10) to generate a spectral embedding of cells. We constructed a cell-cell kNN (k = 10) graph based on their coordinates of the top 5 non-trivial Diffusion Map eigenvectors. Among the top 2,000 variable genes, genes expressed by 0.5% − 75% of cells were retained for pairwise gene-gene Wasserstein distance computation. The original cell graph was coarse-grained into a graph of size 1,000. We then built a gene-gene graph where the affinity between genes is transformed from the Wasserstein distance using a Gaussian kernel (local-adaptive, k = 5). Diffusion Map was employed to visualize the embedding of gene graph. For trajectory identification, we used a series of time steps (11,21,8) to extract three gene trajectories. Mouse embryo skin data analysisWe separated out dermal cell populations from the newly collected mouse embryo skin samples. Cells from the wildtype and the Wls mutant were pooled for analyses. After standard normalization, highly-variable gene selection and scaling using Seurat, we applied PCA and retained the top 30 principal components. Three dermal celltypes were stratified based on the expression of canonical dermal markers, including Sox2, Dkk1, and Dkk2. For gene trajectory inference, we first applied Diffusion Map on the cell PC embedding (using a local-adaptive kernel bandwidth, k = 10) to generate a spectral embedding of cells. We constructed a cell-cell kNN (k = 10) graph based on their coordinates of the top 10 non-trivial Diffusion Map eigenvectors. Among the top 2,000 variable genes, genes expressed by 1% − 50% of cells were retained for pairwise gene-gene Wasserstein distance computation. The original cell graph was coarse-grained into a graph of size 1,000. We then built a gene-gene graph where the affinity between genes is transformed from the Wasserstein distance using a Gaussian kernel (local-adaptive, k = 5). Diffusion Map was employed to visualize the embedding of gene graph. For trajectory identification, we used a series of time steps (9,16,5) to sequentially extract three gene trajectories. To compare the differences between the wiltype and the Wls mutant, we stratified Wnt-active UD cells into seven stages according to their expression profiles of the genes binned along the DC gene trajectory.
Data from: A Single-Cell Tumor Immune Atlas for Precision Oncology
zenodo.org
bin, csv
Updated Mar 31, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Paula Nieto; Paula Nieto (2022). A Single-Cell Tumor Immune Atlas for Precision Oncology [Dataset]. http://doi.org/10.5281/zenodo.4263972
Explore at:
bin, csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4263972
Dataset updated
Mar 31, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Paula Nieto; Paula Nieto
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Preprint version of the Single-Cell Tumor Immune Atlas

This upload contains:

TICAtlas.rds: an rds file containing a Seurat object with the whole Atlas (317111 cells, RNA and integrated assays, PCA and UMAP reductions)

TICAtlas.h5ad: an h5ad file with the whole Atlas (317111 cells, RNA assay, PCA and UMAP)

TICAtlas_RNA.rds: an rds file containing a Seurat object of the whole Atlas but only the RNA assay (317111 cells, UMAP embedding)

TICAtlas_downsampled_1000.rds: an rds file containing a downsampled version of the Seurat object of the whole Atlas (24834 cells, RNA and integrated assay, PCA and UMAP reductions)

TICAtlas_downsampled_1000.h5ad: an rds file containing a downsampled version of the Seurat object of the whole Atlas (24834 cells, RNA assay, PCA and UMAP reductions)

TICAtlas_metadata.csv: a comma-separated text file with the metadata for each of the cells

For the h5ad files, the .X slot contains the normalized data, while the .X.raw slot contains the raw counts as they were in the original datasets.

All the files contain the following patient/sample metadata variables:

patient: assigned patient identifiers

gender: the patient's gender (male/female/unknown)

source: dataset of origin

subtype: cancer type (abbreviations as indicated in the preprint)

cluster_kmeans_k6: patients clusters, NA if filtered out

cell_type: annotated cell type for each of the cells

If you have any issues with the metadata you can use the TICAtlas_metadata.csv file.

For more information, read our preprint and check our GitHub.

h5ad files can be read with Python using Scanpy, rds files can be read in R using Seurat. For format conversion between AnnData and Seurat we recommend SeuratDisk. For other single-cell data formats you can use sceasy.
H
scRNA-seq_huang2019
dataverse.harvard.edu
search.dataone.org
bin
Updated Aug 21, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Harvard Dataverse (2019). scRNA-seq_huang2019 [Dataset]. http://doi.org/10.7910/DVN/QB5CC8
Explore at:
bin(479620588), bin(461399293), bin(2249559954), bin(315337929), bin(444284989), bin(1567775705)Available download formats
Unique identifier
https://doi.org/10.7910/DVN/QB5CC8
Dataset updated
Aug 21, 2019
Dataset provided by
Harvard Dataverse
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Serialized R data files (.rds) associated with the inDrop single-cell RNA-seq analysis in Huang et al., 2019. Each file has a single Seurat object containing a subset of clusters from the full processed dataset, which were separated into different objects due to file size limitations. Raw data (UMIFM counts) are included in the corresponding slot in each Seurat object. Seurat objects can be re-merged into a single object containing the full dataset using the MergeSeurat function.

Facebook

Twitter

Click to copy link

Link copied

Cite

Yunshun Chen; Gordon Smyth (2023). Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues [Dataset]. http://doi.org/10.6084/m9.figshare.17058077.v1

Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues

Explore at:

application/gzipAvailable download formats

Unique identifier

https://doi.org/10.6084/m9.figshare.17058077.v1

Dataset updated

May 31, 2023

Dataset provided by

Figsharehttp://figshare.com/

Authors

Yunshun Chen; Gordon Smyth

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This dataset contains all the Seurat objects that were used for generating all the figures in Pal et al. 2021 (https://doi.org/10.15252/embj.2020107333). All the Seurat objects were created under R v3.6.1 using the Seurat package v3.1.1. The detailed information of each object is listed in a table in Chen et al. 2021.

Clear search

Close search

Google apps

Main menu

Data, R code and output Seurat Objects for single cell RNA-seq analysis of...

Scripts for Analysis

Processed, annotated, seurat object

Seurat

Seurat objects associated with the tonsil cell atlas

Integrated Breast Cancer Atlas Seurat Object

Single-cell Atlas Reveals Diagnostic Features Predicting Progressive Drug...

Dan R Laks Code of Seurat analysis integration of 8PDX scRNA-Seq...

Data from: A single-cell atlas characterizes dysregulation of the bone...

pbmc single cell RNA-seq matrix

Processed naive T cell single-cell RNA-seq, Seurat object

R script for snRNAseq analysis

Single-cell transcriptomics uncovers zonation of function in the mesenchyme...

cellCounts

ProjecTILs murine reference atlas of tumor-infiltrating T cells, version 1

scRNA-seq for article: Giardia intestinalis-induced Type 2 mucosal immunity...

Dawnn benchmarking dataset: Simulated branching trajectories processing and...

Processed Seurat objects for GeneTrajectory inference (Gene Trajectory...

Data from: A Single-Cell Tumor Immune Atlas for Precision Oncology

scRNA-seq_huang2019

Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues