100+ datasets found
  1. E

    Breast Cancer Single-Cell RNA-Seq Dataset

    • ega-archive.org
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Breast Cancer Single-Cell RNA-Seq Dataset [Dataset]. https://ega-archive.org/datasets/EGAD00001007495
    Explore at:
    License

    https://ega-archive.org/dacs/EGAC00001001974https://ega-archive.org/dacs/EGAC00001001974

    Description

    Single-cell RNA-Sequencing of 26 primary breast cancers from Wu et al. (2021) study. Data was generated using the Chromium controller (10X Genomics) and sequenced on the NextSeq 500 platform.

  2. s

    Single Cell Smart-Seq 3 RNA-Seq and Bulk Exome Seq from Breast Cancer...

    • figshare.scilifelab.se
    • researchdata.se
    • +1more
    Updated Jan 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seong-Hwan Jun; Hosein Toosi; Jeff Mold; Camilla Engblom; Xinsong Chen; Ciara O’Flanagan; Michael Hagemann-Jensen; Rickard Sandberg; Johan Hartman; Samuel Aparicio; Andrew Roth; Jens Lagergren (2025). Single Cell Smart-Seq 3 RNA-Seq and Bulk Exome Seq from Breast Cancer Patients [Dataset]. http://doi.org/10.17044/scilifelab.15082398.v1
    Explore at:
    Dataset updated
    Jan 15, 2025
    Dataset provided by
    KTH Royal Institute of Technology
    Authors
    Seong-Hwan Jun; Hosein Toosi; Jeff Mold; Camilla Engblom; Xinsong Chen; Ciara O’Flanagan; Michael Hagemann-Jensen; Rickard Sandberg; Johan Hartman; Samuel Aparicio; Andrew Roth; Jens Lagergren
    License

    https://www.scilifelab.se/data/restricted-access/https://www.scilifelab.se/data/restricted-access/

    Description

    Data Set DescriptionSingle cell RNA sequencing (Samrt-Seq3) and Whole exome sequencing from multiple regions of individual tumors from Breast Cancer patients and also single cell RNA seq for two ovarian cancer cell lines.The dataset contains raw sequencing data for various high-throughput molecular tests performed on two sample types: tumor samples from two breast cancer patients and cell lines derived from High-grade serous carcinoma Patients. The breast cancer data comes from two patients: patient 1 (BCSA1) has two tumor regions A-B and patient 2 (BCSA2) has five regions(A-E). For a normal sample and each region from each patient Whole Exome Sequencing was performed using Twist Biosciences Human Exome Kit by the SNP&SEQ Technology platform, SciLifeLab, National Genomics Infrastructure Uppsala, Sweden. Also for each patient, EPCAM+ CD45- sorted cells from all the regions where sorted to a 384 well plate, and Smart-Seq3 libraries were prepared at Karolinska Institutet and sequenced at National Genomics Infrastructure Uppsala, Sweden.The HGSOC cell-line data comes from OV2295R2 and TOV2295R cell lines described in Laks et al Cell 2019 Nov 14; 179(5): 1207–1221.e22 doi: 10.1016/j.cell.2019.10.026 . The cell line Smart-Seq3 libraries were prepared from two 384 well plates at Karolinska Institutet and sequenced at National Genomics Infrastructure Uppsala, Sweden.Terms for accessThis dataset is to be used for research on intratumor heterogeneity and subclonal evolution of tumors. To apply for conditional access to the dataset in this publication, please contact datacentre@scilifelab.se.

  3. Data, R code and output Seurat Objects for single cell RNA-seq analysis of...

    • figshare.com
    application/gzip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yunshun Chen; Gordon Smyth (2023). Data, R code and output Seurat Objects for single cell RNA-seq analysis of human breast tissues [Dataset]. http://doi.org/10.6084/m9.figshare.17058077.v1
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Yunshun Chen; Gordon Smyth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains all the Seurat objects that were used for generating all the figures in Pal et al. 2021 (https://doi.org/10.15252/embj.2020107333). All the Seurat objects were created under R v3.6.1 using the Seurat package v3.1.1. The detailed information of each object is listed in a table in Chen et al. 2021.

  4. Data from: scRNA-seq Datasets

    • figshare.com
    txt
    Updated Apr 9, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhengtao Xiao (2019). scRNA-seq Datasets [Dataset]. http://doi.org/10.6084/m9.figshare.7174922.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Apr 9, 2019
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Zhengtao Xiao
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    "*.csv" files contain the single cell gene expression values (log2(tpm+1)) for all genes in each cell from melanoma and squamous cell carcinoma of head and neck (HNSCC) tumors. The cell type and origin of tumor for each cell is also included in "*.csv" files.The "MalignantCellSubtypes.xlsx" defines the tumor subtype."CCLE_RNAseq_rsem_genes_tpm_20180929.zip" is downloaded from CCLE database.

  5. E

    Breast Cancer TNBC Single-Cell RNA-Seq Dataset

    • ega-archive.org
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Breast Cancer TNBC Single-Cell RNA-Seq Dataset [Dataset]. https://ega-archive.org/datasets/EGAD00001006981
    Explore at:
    License

    https://ega-archive.org/dacs/EGAC00001001974https://ega-archive.org/dacs/EGAC00001001974

    Description

    Single-cell RNA-Sequencing of five TNBC primary breast cancers from Wu et al. (2020) EMBO J study. Data was generated using the Chromium controller (10X Genomics) and sequenced on the NextSeq 500 platform.

  6. List of tumor microenvironment scRNA-seq datasets included in TMExplorer.

    • plos.figshare.com
    • figshare.com
    xls
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erik Christensen; Alaine Naidas; David Chen; Mia Husic; Parisa Shooshtari (2023). List of tumor microenvironment scRNA-seq datasets included in TMExplorer. [Dataset]. http://doi.org/10.1371/journal.pone.0272302.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 16, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Erik Christensen; Alaine Naidas; David Chen; Mia Husic; Parisa Shooshtari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    List of tumor microenvironment scRNA-seq datasets included in TMExplorer.

  7. q

    Single Cell Insights Into Cancer Transcriptomes: A Five-Part Single-Cell...

    • qubeshub.org
    Updated Nov 16, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Leigh Samsa*; Melissa Eslinger; Adam Kleinschmit; Amanda Solem; Carlos Goller* (2021). Single Cell Insights Into Cancer Transcriptomes: A Five-Part Single-Cell RNAseq Case Study Lesson [Dataset]. http://doi.org/10.24918/cs.2021.26
    Explore at:
    Dataset updated
    Nov 16, 2021
    Dataset provided by
    QUBES
    Authors
    Leigh Samsa*; Melissa Eslinger; Adam Kleinschmit; Amanda Solem; Carlos Goller*
    Description

    There is a growing need for integration of “Big Data” into undergraduate biology curricula. Transcriptomics is one venue to examine biology from an informatics perspective. RNA sequencing has largely replaced the use of microarrays for whole genome gene expression studies. Recently, single cell RNA sequencing (scRNAseq) has unmasked population heterogeneity, offering unprecedented views into the inner workings of individual cells. scRNAseq is transforming our understanding of development, cellular identity, cell function, and disease. As a ‘Big Data,’ scRNAseq can be intimidating for students to conceptualize and analyze, yet it plays an increasingly important role in modern biology. To address these challenges, we created an engaging case study that guides students through an exploration of scRNAseq technologies. Students work in groups to explore external resources, manipulate authentic data and experience how single cell RNA transcriptomics can be used for personalized cancer treatment. This five-part case study is intended for upper-level life science majors and graduate students in genetics, bioinformatics, molecular biology, cell biology, biochemistry, biology, and medical genomics courses. The case modules can be completed sequentially, or individual parts can be separately adapted. The first module can also be used as a stand-alone exercise in an introductory biology course. Students need an intermediate mastery of Microsoft Excel but do not need programming skills. Assessment includes both students’ self-assessment of their learning as answers to previous questions are used to progress through the case study and instructor assessment of final answers. This case provides a practical exercise in the use of high-throughput data analysis to explore the molecular basis of cancer at the level of single cells.

  8. S

    scRNA-seq data of lung cancer

    • scidb.cn
    Updated Jul 21, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Weimin Li (2022). scRNA-seq data of lung cancer [Dataset]. http://doi.org/10.57760/sciencedb.02028
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 21, 2022
    Dataset provided by
    Science Data Bank
    Authors
    Weimin Li
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    we collected 40 tumor and adjacent normal tissue samples from 19 pathologically diagnosed NSCLC patients (10 LUAD and 9 LUSC) during surgical resections, and rapidly digested the tissues to obtain single-cell suspensions and constructed the cDNA libraries of these samples within 24 hours using the protocol of 10X gennomic. These libraries were sequenced on the Illumina NovaSeq 6000 platform. Finally we obtained the raw gene expression matrices were generated using CellRanger (version 3.0.1). Information was processed in R (version 3.6.0) using the Seurat R package (version 2.3.4).

  9. m

    Data from: A multiplex single-cell RNA-Seq pharmacotranscriptomics pipeline...

    • data.mendeley.com
    Updated Oct 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alice Dini (2024). A multiplex single-cell RNA-Seq pharmacotranscriptomics pipeline for drug discovery [Dataset]. http://doi.org/10.17632/j9j4mdm9yr.1
    Explore at:
    Dataset updated
    Oct 22, 2024
    Authors
    Alice Dini
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We developed a single-cell transcriptomics pipeline for high-throughput pharmacotranscriptomic screening. We explored the transcriptional landscape of three HGSOC models (JHOS2, a representative cell line; PDC2 and PDC3, two patient-derived samples) after treating their cells for 24 hours with 45 drugs representing 13 distinct classes of mechanism of action. Our work establishes a new precision oncology framework for the study of molecular mechanisms activated by a broad array of drug responses in cancer. . ├── 3D UMAPs/ → Interactive 3D UMAPs of cells treated with the 45 drugs used for multiplexed scRNA-seq. Related to Figure 4. Coordinates: x = UMAP 1; y = UMAP 2; z = UMAP 3. Legend: green = PDC1; blue = PDC2; red = JHOS2. │ ├── DMSO_3D_UMAP_Dini.et.al.html → 3D UMAP of untreated cells. │ └── drug_3D_UMAP_Dini.et.al.html → 3D UMAP of cells treated with (drug). ├── QC_plots/ → Diagnostic plots. Related to Figures 2–4. │ ├── model_QC_violin_plot_2023.pdf → Violin plots of the QC metrics used to filter the data. │ ├── model_col_HTO or model_row_HTO before and after filt → Heatmaps of the row or column HTO expression in each cell. │ └── model_counts_histogram_2023.pdf → Histogram of the distribution of the total counts per cell after filtering for high-quality cells. ├── scRNAseq/ → scRNA-seq data. Related to Figures 2–4. │ ├── AllData_subsampled_DGE_edgeR.csv.gz → Differential gene expression analyses results between treated and untreated cells via pseudobulk of aggregate subsamples, for each of the three models. Related to Figure 3. │ └── All_vs_all_RNAclusters_DEG_signif.txt → Differential gene expression analysis results (p.adj < 0.05) of FindAllMarkers for the Leiden/RNA clusters. ├── PDCs.transcript.counts.tsv → Bulk RNA-seq count data for PDCs 1–3 processed by Kallisto. Related to Figure S6. └── PDCs.transcript.TPM.tsv → Bulk RNA-seq TPM data for PDCs 1–3 processed by Kallisto. Related to Figure S6.

  10. Colorectal cancer spatial transcriptomics and single-cell RNA-sequencing...

    • zenodo.org
    bin, csv
    Updated Jan 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nghia Millard; Jonathan Chen; Mukta Palshikar; Karin Pelka; Maxwell Spurrell; Colles Price; Jiang He; Nir Hacohen; Soumya Raychaudhuri; Ilya Korsunsky; Nghia Millard; Jonathan Chen; Mukta Palshikar; Karin Pelka; Maxwell Spurrell; Colles Price; Jiang He; Nir Hacohen; Soumya Raychaudhuri; Ilya Korsunsky (2025). Colorectal cancer spatial transcriptomics and single-cell RNA-sequencing dataset [Dataset]. http://doi.org/10.5281/zenodo.14602110
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    Jan 5, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Nghia Millard; Jonathan Chen; Mukta Palshikar; Karin Pelka; Maxwell Spurrell; Colles Price; Jiang He; Nir Hacohen; Soumya Raychaudhuri; Ilya Korsunsky; Nghia Millard; Jonathan Chen; Mukta Palshikar; Karin Pelka; Maxwell Spurrell; Colles Price; Jiang He; Nir Hacohen; Soumya Raychaudhuri; Ilya Korsunsky
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Metadata and counts matrix (barcode and genes files also provided) for the colorectal cancer (CRC) spatial transcriptomics and scRNA-seq dataset utilized in the Crescrendo manuscript published by Millard et al. (2025). Batch column indicates whether cell is from scRNA-seq data or which spatial transcriptomics slice. The sample_id column indicates the sample the cell is from. The center_x and center_y columns indicate the center of the cell in space (scRNA-seq cells have 0 in these columns). The orig_publication_type indicates fine-grained cell type labels from the original publication of the CRC scRNA-seq dataset, while the cresc_publication_type column indicates the more coarse-grained cell type labels from the Crescendo publication.

  11. E

    Single cell RNAseq of PBMC from bladder cancer patients

    • ega-archive.org
    Updated Nov 21, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Single cell RNAseq of PBMC from bladder cancer patients [Dataset]. https://ega-archive.org/datasets/EGAD00001005481
    Explore at:
    Dataset updated
    Nov 21, 2018
    License

    https://ega-archive.org/dacs/EGAC00001001380https://ega-archive.org/dacs/EGAC00001001380

    Description

    This dataset contains single cell RNA sequencing data of PBMC samples from 10 bladder cancer patients. cDNAs and single cell RNA libraries were prepared following manufacturer’s user guide (10x Genomics). Each library was sequenced in HiSeq4000 (Illumina) to achieve ~300 million reads following manufacturer’s sequencing specification.

  12. f

    Annotated single-cell RNA-seq dataset used in "A Retrospective View on...

    • figshare.com
    application/x-gzip
    Updated Nov 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seokhyun Yoon (2023). Annotated single-cell RNA-seq dataset used in "A Retrospective View on Triple Negative Breast Cancer Microenvironment: Novel Markers, Interactions, and Mechanisms of Tumor-Associated Components using public Single-cell RNA Seq Datasets" [Dataset]. http://doi.org/10.6084/m9.figshare.24678549.v1
    Explore at:
    application/x-gzipAvailable download formats
    Dataset updated
    Nov 30, 2023
    Dataset provided by
    figshare
    Authors
    Seokhyun Yoon
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    AbstractPurpose: Triple-negative breast cancer presents a significant clinical challenge due to its aggressive nature and limited treatment options. This subtype is notorious for a poorer prognosis compared to other breast cancer forms, primarily due to the lack of identifiable treatment targets.Methods: In our study, we delve deep into the molecular landscape of TNBC using public single-cell RNA sequencing datasets. Our integrative analysis aims to identify unique markers specific to TNBC, unravel the intricate gene mechanisms they are involved in, and explore new avenues for potential therapeutic interventions.Results: Employing three comprehensive datasets, our study offers a novel perspective on the tumor microenvironment of TNBC. Specifically, we found 12 marker genes, including DSC2 and CDKN2A, uniquely expressed in TNBC cells, marking an advancement in understanding this cancer subtype. A comparative analysis of these markers across various components of the tumor microenvironment, including both cancerous and normal cells, highlights a distinctive feature. A key discovery of our study is the interaction between DSC2 and DSG2 genes within TNBC cells, suggesting a novel pathway of intercellular communication exclusive to this cancer type.Conclusion: This finding not only corroborates previous hypotheses but also lays the foundation for a new structural understanding of triple-negative breast cancer, as revealed through our single-cell analysis workflow.

  13. Z

    Data from: Beyondcell: targeting cancer therapeutic heterogeneity in...

    • data.niaid.nih.gov
    Updated Jan 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gonzalo Gómez-López (2021). Beyondcell: targeting cancer therapeutic heterogeneity in single-cell RNA-seq [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4438619
    Explore at:
    Dataset updated
    Jan 15, 2021
    Dataset provided by
    María José Jiménez-Santos
    Tomás Di Domenico
    Gonzalo Gómez-López
    Fátima Al-Shahrour
    Luis G. Jimeno
    Santiago García-Martín
    Coral Fustero-Torre
    Carlos Carretero-Puche
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Beyondcell is a methodology for the identification of drug vulnerabilities in single cell RNA-seq data. To this end, Beyondcell focuses on the analysis of drug-related commonalities between cells by classifying them into distinct therapeutic clusters. We have validated the tool in a population of MCF7-AA cells exposed to 500nM of bortezomib and collected at different time points: t0 (before treatment), t12, t48 and t96 (72h treatment followed by drug wash and 24h of recovery) obtained from Ben-David U, et al., Nature, 2018. Here, you can find the integrated Seurat object obtained from this analysis. This object is meant to help users follow Beyondcell's analysis workflow.

  14. Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset

    • zenodo.org
    • data.niaid.nih.gov
    bin, txt
    Updated Nov 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jonathan Hsu; Allart Stoop; Jonathan Hsu; Allart Stoop (2023). Repository for Single Cell RNA Sequencing Analysis of The EMT6 Dataset [Dataset]. http://doi.org/10.5281/zenodo.10011622
    Explore at:
    bin, txtAvailable download formats
    Dataset updated
    Nov 20, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Jonathan Hsu; Allart Stoop; Jonathan Hsu; Allart Stoop
    Description

    Table of Contents

    1. Main Description
    2. File Descriptions
    3. Linked Files
    4. Installation and Instructions

    1. Main Description

    ---------------------------

    This is the Zenodo repository for the manuscript titled "A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity.". The code included in the file titled `marengo_code_for_paper_jan_2023.R` was used to generate the figures from the single-cell RNA sequencing data.

    The following libraries are required for script execution:

    • Seurat
    • scReportoire
    • ggplot2
    • stringr
    • dplyr
    • ggridges
    • ggrepel
    • ComplexHeatmap

    File Descriptions

    ---------------------------

    • The code can be downloaded and opened in RStudios.
    • The "marengo_code_for_paper_jan_2023.R" contains all the code needed to reproduce the figues in the paper
    • The "Marengo_newID_March242023.rds" file is available at the following address: https://zenodo.org/badge/DOI/10.5281/zenodo.7566113.svg (Zenodo DOI: 10.5281/zenodo.7566113).
    • The "all_res_deg_for_heat_updated_march2023.txt" file contains the unfiltered results from DGE anlaysis, also used to create the heatmap with DGE and volcano plots.
    • The "genes_for_heatmap_fig5F.xlsx" contains the genes included in the heatmap in figure 5F.

    Linked Files

    ---------------------

    This repository contains code for the analysis of single cell RNA-seq dataset. The dataset contains raw FASTQ files, as well as, the aligned files that were deposited in GEO. The "Rdata" or "Rds" file was deposited in Zenodo. Provided below are descriptions of the linked datasets:

    Gene Expression Omnibus (GEO) ID: GSE223311(https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE223311)

    • Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment.
    • Description: This submission contains the "matrix.mtx", "barcodes.tsv", and "genes.tsv" files for each replicate and condition, corresponding to the aligned files for single cell sequencing data.
    • Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

    Sequence read archive (SRA) repository ID: SRX19088718 and SRX19088719

    • Title: Gene expression profile at single cell level of CD4+ and CD8+ tumor infiltrating lymphocytes (TIL) originating from the EMT6 tumor model from mSTAR1302 treatment.
    • Description: This submission contains the **raw sequencing** or `.fastq.gz` files, which are tab delimited text files.
    • Submission type: Private. In order to gain access to the repository, you must use a reviewer token (https://www.ncbi.nlm.nih.gov/geo/info/reviewer.html).

    Zenodo DOI: 10.5281/zenodo.7566113(https://zenodo.org/record/7566113#.ZCcmvC2cbrJ)

    • Title: A TCR β chain-directed antibody-fusion molecule that activates and expands subsets of T cells and promotes antitumor activity.
    • Description: This submission contains the "Rdata" or ".Rds" file, which is an R object file. This is a necessary file to use the code.
    • Submission type: Restricted Acess. In order to gain access to the repository, you must contact the author.

    Installation and Instructions

    --------------------------------------

    The code included in this submission requires several essential packages, as listed above. Please follow these instructions for installation:

    > Ensure you have R version 4.1.2 or higher for compatibility.

    > Although it is not essential, you can use R-Studios (Version 2022.12.0+353 (2022.12.0+353)) for accessing and executing the code.

    1. Download the *"Rdata" or ".Rds" file from Zenodo (https://zenodo.org/record/7566113#.ZCcmvC2cbrJ) (Zenodo DOI: 10.5281/zenodo.7566113).

    2. Open R-Studios (https://www.rstudio.com/tags/rstudio-ide/) or a similar integrated development environment (IDE) for R.

    3. Set your working directory to where the following files are located:

    • marengo_code_for_paper_jan_2023.R
    • Install_Packages.R
    • Marengo_newID_March242023.rds
    • genes_for_heatmap_fig5F.xlsx
    • all_res_deg_for_heat_updated_march2023.txt

    You can use the following code to set the working directory in R:

    > setwd(directory)

    4. Open the file titled "Install_Packages.R" and execute it in R IDE. This script will attempt to install all the necessary pacakges, and its dependencies in order to set up an environment where the code in "marengo_code_for_paper_jan_2023.R" can be executed.

    5. Once the "Install_Packages.R" script has been successfully executed, re-start R-Studios or your IDE of choice.

    6. Open the file "marengo_code_for_paper_jan_2023.R" file in R-studios or your IDE of choice.

    7. Execute commands in the file titled "marengo_code_for_paper_jan_2023.R" in R-Studios or your IDE of choice to generate the plots.

  15. MIX-seq data

    • figshare.com
    txt
    Updated May 30, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cancer Data Science (2023). MIX-seq data [Dataset]. http://doi.org/10.6084/m9.figshare.10298696.v3
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Cancer Data Science
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data accompanying the manuscript describing MIX-Seq, a method for transcriptional profiling of mixtures of cancer cell lines treated with small molecule and genetic perturbations (McFarland and Paolella et al., Nat Commun, 2020). Data consists of single-cell RNA-sequencing (UMI count matrices), and associated drug sensitivity and genomic features of the cancer cell lines.See README file for more information on dataset contents.

  16. f

    Single-cell RNA sequencing data from 20 tumors

    • plus.figshare.com
    zip
    Updated Jul 28, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yilong Li; Michelle Nahas; Dennis Stephens; Kate Froburg; Emma Hintz; Devin Champagne; Amaneet Lochab; Markus Brown; Jasper Braun; Maria Antonia Fortuno; Maria-del-Mar Ocon; Andrea Pasquier; Ines Luque Vazquez; Hita Moudgalya; Sophie Kivlehan; Iliana Gjeci; Stephanie L. Korle; Arantza Campo; Maria Rodriguez; Christopher W. Seder; Patrick H. Lizotte; Raphael Bueno; Jeffrey A. Borgia; Luis Miguel Seijo; Luis M. Montuenga; Roman Yelensky (2025). Single-cell RNA sequencing data from 20 tumors [Dataset]. http://doi.org/10.25452/figshare.plus.28067336.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 28, 2025
    Dataset provided by
    Figshare+
    Authors
    Yilong Li; Michelle Nahas; Dennis Stephens; Kate Froburg; Emma Hintz; Devin Champagne; Amaneet Lochab; Markus Brown; Jasper Braun; Maria Antonia Fortuno; Maria-del-Mar Ocon; Andrea Pasquier; Ines Luque Vazquez; Hita Moudgalya; Sophie Kivlehan; Iliana Gjeci; Stephanie L. Korle; Arantza Campo; Maria Rodriguez; Christopher W. Seder; Patrick H. Lizotte; Raphael Bueno; Jeffrey A. Borgia; Luis Miguel Seijo; Luis M. Montuenga; Roman Yelensky
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Liquid biopsy is a promising non-invasive technology that is capable of diagnosing cancer. However, current ctDNA-based approaches detect only a minority of early-stage disease. We set out to improve the sensitivity of liquid biopsy by harnessing tumor recognition by T cells through the sequencing of the circulating T-cell receptor repertoire. We studied a cohort of 463 patients with lung cancer (86% stage I) and 587 subjects without cancer using gDNA extracted from blood buffy coats. We performed TCR β chain sequencing to yield a median of 113,571 TCR clonotypes per sample and built a TCR sequence similarity graph to cluster clonotypes into TCR repertoire functional units (RFUs). The TCR frequencies of RFUs were tested for association with cancer status and RFUs with a statistically significant association were combined into a cancer score using a support vector machine model. The model was evaluated by 10-fold cross-validation and compared with a ctDNA panel of 237 mutation hotspots in 154 lung cancer driver genes and 17 cancer related protein biomarkers in 85 subjects. We identified 327 cancer- associated TCR RFUs with a false discovery rate (FDR) ≤ 0.1, including 157 enriched in cancer samples and 170 enriched in controls. Levels of 247/327 (76%) RFUs were correlated with the presence of an HLA allele at FDR ≤ 0.1 and tumor-infiltrating lymphocyte TCRs from multiple RFUs bound HLA presented tumor antigen peptides, suggesting antigen recognition as a driver of the cancer-RFU associations found. The RFU cancer score detected nearly 50% of stage I lung cancers at a specificity of 80% and boosted the sensitivity by up to 20 percentage points when added to ctDNA and circulating proteins in a multi- analyte cancer screening test. Overall, we show that circulating TCR repertoire functional unit analysis can complement established analytes to improve liquid biopsy sensitivity for early-stage cancer.This dataset contains the CellRanger output for 20 cancer patients. Please refer to https://www.10xgenomics.com/support/software/cell-ranger/latest for documentation.For details on how the data was generated, please see Li Y. et al. 2025: Circulating T-cell Receptor Repertoire for Cancer Early Detection.

  17. E

    Single cell RNAseq and TCRseq data from tumor and blood samples from 4...

    • ega-archive.org
    Updated Apr 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Single cell RNAseq and TCRseq data from tumor and blood samples from 4 patients with muscle invasive bladder cancer [Dataset]. https://ega-archive.org/datasets/EGAD50000001381
    Explore at:
    Dataset updated
    Apr 4, 2025
    License

    https://ega-archive.org/dacs/EGAC00001001642https://ega-archive.org/dacs/EGAC00001001642

    Description

    For this dataset we performed single cell RNAseq paired with single cell TCR-seq on tumor and blood samples from 4 patients. This dataset contains 4 tumor samples as well as 4 blood samples. Each sample is made up of 2 sets of paired fastq files. The first pair contains reads corresponding to RNA transcripts (_Transcripts in file name), while the second pair contain reads corresponding to TCRs (_VDJ in file name). Sequenced on the Illumina NovaSeq6000 platform in a paired-end run using an SP flow cell (v1.5, 300 cycles).

  18. o

    Single Cell RNA sequence data from a human ovarian cancer sample

    • explore.openaire.eu
    Updated Dec 7, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Single Cell RNA sequence data from a human ovarian cancer sample [Dataset]. https://explore.openaire.eu/search/dataset?datasetId=_OmicsDI::98f9e1906d8b81f541778c948c440345
    Explore at:
    Dataset updated
    Dec 7, 2018
    Description

    Purpose: Investigate cellular heterogeneity in a fresh human ovarian cancer tissue sample Methods: Enzymatic digestion of fresh tissue sample collected from the operating room to produce single cell suspension. Cells were labelled with fluorescent antibodies to CD3, CD14, CD19, CD20, CD56 and FACS sorted to remove immune cells. The negative population was used for sequencing. Single cells were processed using the Fluidigm C1 Chip to generate barcoded cDNA for each cell. Amplifed cDNA was sequenced using an Illumina HiSeq 2500 machine. Results: Single cell RNA sequence data was obtained for 92 cells and a "bulk" sample of 1000 cells. 26 cells were removed from analysis due to quality control standards. The remaining 66 cells and the bulk sample were analyzed. Conclusion: Single cell RNA sequence analysis reveals heterogeneity in gene expression in cells harvested from a high grade ovarian serous cancer Overall design: A single cell suspension generated from a fresh high grade serous ovarian cancer sample was run through two Fluidigm C1 chips to isolate single cells and produce barcoded cDNA. Sequencing was performed in a single lane of an Illumina HiSeq 2500 machine. 92 single cells were sequenced and 1 bulk sample was sequenced, for a total of 93 samples.

  19. Human breast cancer PDX models bulk and single cell RNA sequencing

    • zenodo.org
    • explore.openaire.eu
    bin, csv
    Updated Aug 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Long V. Nguyen; Long V. Nguyen; Yaniv Eyal-Lubling; Daniel Guerrero-Romero; Raquel Manzano Garcia; Oscar M. Rueda; Oscar M. Rueda; Carlos Caldas; Carlos Caldas; Yaniv Eyal-Lubling; Daniel Guerrero-Romero; Raquel Manzano Garcia (2024). Human breast cancer PDX models bulk and single cell RNA sequencing [Dataset]. http://doi.org/10.5281/zenodo.10978990
    Explore at:
    bin, csvAvailable download formats
    Dataset updated
    Aug 7, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Long V. Nguyen; Long V. Nguyen; Yaniv Eyal-Lubling; Daniel Guerrero-Romero; Raquel Manzano Garcia; Oscar M. Rueda; Oscar M. Rueda; Carlos Caldas; Carlos Caldas; Yaniv Eyal-Lubling; Daniel Guerrero-Romero; Raquel Manzano Garcia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset includes information relevant to the following manuscript from the labs of Prof. Carlos Caldas (University of Cambridge), and Dr. Long V. Nguyen (Princess Margaret Cancer Centre, University Health Network):

    Nguyen LV et al. Dynamics and plasticity of human breast cancer single cell-derived clones. Under consideration for publication.

    Bulk RNA sequencing raw count matrices are provided (RawCounts.csv) along with the normalized count matrices (LogCPMNormCounts.csv).

    Single cell RNA sequencing count matrix processed from R package metacell is provided (mat.pdx_LN_v2_filt.Rda), along with the mc and mc2d files with information on metacell partitions (mc.pdx_LN_v2_filt.Rda and mc2d.pdx_LN_v2_filt.Rda).

    Code and information on data analysis is provided for reviewers in our unpublished manuscript and on Github (https://github.com/cclab-brca/clone-dynamics).

  20. M

    clonealign: 10X genomics chromium single-cell RNA-sequencing

    • datacatalog.mskcc.org
    • ega-archive.org
    Updated Jun 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Genome-phenome Archive (2024). clonealign: 10X genomics chromium single-cell RNA-sequencing [Dataset]. https://datacatalog.mskcc.org/dataset/11278
    Explore at:
    Dataset updated
    Jun 10, 2024
    Dataset provided by
    MSK Library
    European Genome-phenome Archive
    Description

    Description from EGA:

    "10X genomics chromium single-cell RNA-sequencing of (i) patient derived triple negative breast cancer xenograft (ii) primary tumour and ascites ovarian cancer cell lines at tumour recurrence."

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Breast Cancer Single-Cell RNA-Seq Dataset [Dataset]. https://ega-archive.org/datasets/EGAD00001007495

Breast Cancer Single-Cell RNA-Seq Dataset

Explore at:
86 scholarly articles cite this dataset (View in Google Scholar)
License

https://ega-archive.org/dacs/EGAC00001001974https://ega-archive.org/dacs/EGAC00001001974

Description

Single-cell RNA-Sequencing of 26 primary breast cancers from Wu et al. (2021) study. Data was generated using the Chromium controller (10X Genomics) and sequenced on the NextSeq 500 platform.

Search
Clear search
Close search
Google apps
Main menu