13 datasets found
  1. Multiple Single Cell RNA Expressions ARCHS4

    • kaggle.com
    zip
    Updated Jun 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander (2021). Multiple Single Cell RNA Expressions ARCHS4 [Dataset]. https://www.kaggle.com/alexandervc/multiple-single-cell-rna-expressions-archs4
    Explore at:
    zip(23088130184 bytes)Available download formats
    Dataset updated
    Jun 26, 2021
    Authors
    Alexander
    Description

    Context

    Dataset is downloaded from https://amp.pharm.mssm.edu/archs4/download.html The methods are described in Nature Communications paper: https://www.nature.com/articles/s41467-018-03751-6

    The ARCHS4 data provides user-friendly access to multiple gene expression data from the GEO database. (https://www.ncbi.nlm.nih.gov/geo/ ). While in GEO database most of data is stored in raw formats, ARCHS4 provides prepared count matrix expression data. While GEO contains data stored separately for each research paper, ARCHS4 collects all the information in one single matrix. One may consult the main site for further information.

    Main data files are in H5 (HD5, Hierarchical Data Format ) file format https://en.wikipedia.org/wiki/Hierarchical_Data_Format It contains expression data, as well as annotation data and futher meta-information. There are several other auxilliary files like TSNE 3d projection (in CSV format) and correlation matrices for genes for human and mouse in feather format.

    Content

    The main file (for human): human_matrix.h5 - contains data matrix - which is 238522 samples times 35238 genes, as well as, various meta information: gene names, samples information (tissue, etc), references to GEO database id where all the details can be found.

    There is also similar data for mouse, csv files with TSNE images, correlation matrices for genes.

    Acknowledgements

    The ARCHS4 project is by :

    'Alexander Lachmann', 'alexander.lachmann@mssm.edu', update: '2020-02-06'

  2. a

    archs4-human-gene-v2.5-protein-coding-expression

    • dl.aifasthub.com
    • huggingface.co
    Updated Feb 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Altay (2025). archs4-human-gene-v2.5-protein-coding-expression [Dataset]. https://dl.aifasthub.com/datasets/gabrielaltay/archs4-human-gene-v2.5-protein-coding-expression
    Explore at:
    Dataset updated
    Feb 10, 2025
    Authors
    Gabriel Altay
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    gabrielaltay/archs4-human-gene-v2.5-protein-coding-expression dataset hosted on Hugging Face and contributed by the HF Datasets community

  3. Z

    orthosData

    • data.niaid.nih.gov
    • zenodo.org
    Updated May 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Panagiotis Papasaikas (2023). orthosData [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_7554914
    Explore at:
    Dataset updated
    May 9, 2023
    Dataset provided by
    Charlotte Soneson
    Panagiotis Papasaikas
    Michael Stadler
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    orthosData is the companion database to the orthos software for mechanistic studies using differential gene expression experiments.

    It currently encompasses data for over 100,000 differential gene expression mouse and human experiments distilled and compiled from the ARCHS4 database* as well as associated pre-trained variational models.

    Together with orthos it was developed to provide a better understanding of the effects of experimental treatments on gene expression and to help map treatments to mechanisms of action.

  4. f

    Table2_Identify Tcea3 as a novel anti-cardiomyocyte hypertrophy gene...

    • frontiersin.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yingying Guo; Xian-feng Cen; Dan Li; Hong-liang Qiu; Ya-jie Chen; Meng Zhang; Si-hui Huang; Hao Xia; Man Xu (2023). Table2_Identify Tcea3 as a novel anti-cardiomyocyte hypertrophy gene involved in fatty acid oxidation and oxidative stress.xls [Dataset]. http://doi.org/10.3389/fcvm.2023.1137429.s002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 19, 2023
    Dataset provided by
    Frontiers
    Authors
    Yingying Guo; Xian-feng Cen; Dan Li; Hong-liang Qiu; Ya-jie Chen; Meng Zhang; Si-hui Huang; Hao Xia; Man Xu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundChronic pressure overload triggers pathological cardiac hypertrophy that eventually leads to heart failure. Effective biomarkers and therapeutic targets for heart failure remain to be defined. The aim of this study is to identify key genes associated with pathological cardiac hypertrophy by combining bioinformatics analyses with molecular biology experiments.MethodsComprehensive bioinformatics tools were used to screen genes related to pressure overload-induced cardiac hypertrophy. We identified differentially expressed genes (DEGs) by overlapping three Gene Expression Omnibus (GEO) datasets (GSE5500, GSE1621, and GSE36074). Correlation analysis and BioGPS online tool were used to detect the genes of interest. A mouse model of cardiac remodeling induced by transverse aortic constriction (TAC) was established to verify the expression of the interest gene during cardiac remodeling by RT-PCR and western blot. By using RNA interference technology, the effect of transcription elongation factor A3 (Tcea3) silencing on PE-induced hypertrophy of neonatal rat ventricular myocytes (NRVMs) was detected. Next, gene set enrichment analysis (GSEA) and the online tool ARCHS4 were used to predict the possible signaling pathways, and the fatty acid oxidation relevant pathways were enriched and then verified in NRVMs. Furthermore, the changes of long-chain fatty acid respiration in NRVMs were detected using the Seahorse XFe24 Analyzer. Finally, MitoSOX staining was used to detect the effect of Tcea3 on mitochondrial oxidative stress, and the contents of NADP(H) and GSH/GSSG were detected by relevant kits.ResultsA total of 95 DEGs were identified and Tcea3 was negatively correlated with Nppa, Nppb and Myh7. The expression level of Tcea3 was downregulated during cardiac remodeling both in vivo and in vitro. Knockdown of Tcea3 aggravated cardiomyocyte hypertrophy induced by PE in NRVMs. GSEA and online tool ARCHS4 predict Tcea3 involved in fatty acid oxidation (FAO). Subsequently, RT-PCR results showed that knockdown of Tcea3 up-regulated Ces1d and Pla2g5 mRNA expression levels. In PE induced cardiomyocyte hypertrophy, Tcea3 silencing results in decreased fatty acid utilization, decreased ATP synthesis and increased mitochondrial oxidative stress.ConclusionOur study identifies Tcea3 as a novel anti-cardiac remodeling target by regulating FAO and governing mitochondrial oxidative stress.

  5. Paired differential gene expression and splicing analyses results of 199...

    • zenodo.org
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Søren Helweg Dam; Søren Helweg Dam; Lars Rønn Olsen; Lars Rønn Olsen; Kristoffer Vitting-Seerup; Kristoffer Vitting-Seerup (2024). Paired differential gene expression and splicing analyses results of 199 baseline vs. case comparisons across 100 datasets (Limma) [Dataset]. http://doi.org/10.5281/zenodo.8162214
    Explore at:
    Dataset updated
    Feb 15, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Søren Helweg Dam; Søren Helweg Dam; Lars Rønn Olsen; Lars Rønn Olsen; Kristoffer Vitting-Seerup; Kristoffer Vitting-Seerup
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OBS! This is the limma results of the analysis. See https://doi.org/10.5281/zenodo.8162229 for the DESeq2/DEXSeq results.

    This dataset contains results from paired differential expression and differential splicing analyses as well as gene-set over-representation analysis results for 199 baseline vs. case comparisons across 100 randomly curated datasets with accompanying metadata (article).
    All results were computed using the R package pairedGSEA, which utilized Limma (Ritchie et al., 2015) and fgsea (Korotkevich et al., 2019).

    Each .RDS file contains a list with four objects: A 'metadata' object with the metadata of the respective raw data, a 'genes' object with gene-level differential splicing and expression results, a 'gene_set' object with over-representation results, and 'experiment' with the experiment title.

    The filenames follow this pattern: "[dataset ID]_[GEO accession number]_[Manually assigned comparison title].RDS".

    All datasets were obtained from a local copy of the ARCHS4 v11 database of transcript counts (Lachmann et al., 2018).

  6. Additional file 2 of Meta-analysis of integrated ChIP-seq and transcriptome...

    • figshare.com
    xlsx
    Updated Aug 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zeynab Piryaei; Zahra Salehi; Esmaeil Ebrahimie; Mansour Ebrahimi; Kaveh Kavousi (2024). Additional file 2 of Meta-analysis of integrated ChIP-seq and transcriptome data revealed genomic regions affected by estrogen receptor alpha in breast cancer [Dataset]. http://doi.org/10.6084/m9.figshare.26617588.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Aug 14, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Zeynab Piryaei; Zahra Salehi; Esmaeil Ebrahimie; Mansour Ebrahimi; Kaveh Kavousi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 2: Table S1. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 10nM E2 for 45 minutes in GSE94023 study. Table S2. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 10nM E2 for 45 minutes in GSE99626 study. Table S3. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 10nM E2 for 45 minutes in GSE67295 study. Table S4. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 10nM E2 for 45 minutes in GSE115607 study. Table S5. Differentially bound sites (DBSs) obtained from T47D cell line treated with 10nM E2 for 45 minutes in GSE80367 study. Table S6. Differentially bound sites (DBSs) obtained from T47D cell line treated with 100nM E2 for 45 minutes in GSE23893 study. Table S7. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 100nM E2 for 45 minutes in GSE23893 study. Table S8. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 100nM E2 for 45 minutes in GSE54855 study. Table S9. Differentially bound sites (DBSs) obtained from MCF7 cell line treated with 100nM E2 for 45 minutes in GSE59530 study. Table S10. Default binding affinity matrix of 6 samples by the 63,612 sites that overlap in at least two of the samples using DiffBind in (GSE94023, GSE99626, GSE67295, & GSE115607) MCF7 cell line treated with 10nM E2 for 45 minutes. Table S11. Default binding affinity matrix of 6 samples by the 23,517 sites that overlap in at least two of the samples using DiffBind in (GSE23893, GSE54855, & GSE59530) MCF7 cell line treated with 100nM E2 for 45 minutes. Table S12. Meta-differentially bound sites (meta-DBSs) obtained from a meta-analysis on (GSE94023, GSE99626, GSE67295, & GSE115607) MCF7 cell line treated with 10nM E2 for 45 minutes. Table S13. Meta-differentially bound sites (meta-DBSs) obtained from a meta-analysis on (GSE23893, GSE54855, & GSE59530) MCF7 cell line treated with 100nM E2 for 45 minutes. Table S14. literature_ChIP-seq. Table S15. Enrichr. Table S16. ARCHS4—Coexpression. Table S17. ENCODE--ChIP-seq. Table S18. ReMap--ChIP-seq. Table S19. GTEx—Coexpression. Table S20. Integrated_topRank. Table S21. Integrated_meanRank. Table S22. Gene Ontology (GO) for 7,308 meta-DBSs related to 617 common genes among MCF7 & T47D cell lines using Cistrome-GO. Table S23. KEGG pathways analysis for 7,308 meta-DBSs related to 617 common genes among MCF7 & T47D cell lines using Cistrome-GO. Table S24. Differentially expressed genes (DEGs) identified from GRO-seq data in the MCF7 cell line treated with 100nM E2 for 40 minutes in the GSE27463 study.

  7. GeneRAIN Dataset: Training Data, Gene Embeddings, and Model Checkpoints

    • zenodo.org
    application/gzip, bin +1
    Updated Jul 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zheng Su; Zheng Su (2025). GeneRAIN Dataset: Training Data, Gene Embeddings, and Model Checkpoints [Dataset]. http://doi.org/10.5281/zenodo.10408775
    Explore at:
    txt, bin, application/gzipAvailable download formats
    Dataset updated
    Jul 31, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Zheng Su; Zheng Su
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains various files essential for understanding and employing the GeneRAIN models, as described in the accompanying manuscript. GeneRAIN models use bulk RNA-seq data and a 'Binning-By-Gene' normalization method. These models aim to improve upon existing methods in understanding biological information and include a vector representation of genes called GeneRAIN-vec. After thorough testing, these models have shown their effectiveness in predicting a wide range of biological characteristics, including for long non-coding RNAs. This shows their usefulness and potential in bioinformatics and computational biology. The provided dataset includes:

    1. Gene Embedding Files: These files offer 200-dimensional and 32-dimensional vector representations of genes.
    2. Checkpoint Files: Checkpoints of various GeneRAIN models.
    3. JSON Mapping Files: For gene to index mapping and tokenization processes. Note that some genes with low mean expression values might not be present in the model input dataset.
    4. ARCHS Human Bulk RNA-seq Data: Access the 'human_gene_v2.2.h5' file and corresponding metadata from the official ARCHS4 website.
    5. Normalized ARCHS Dataset: Processed via the 'Binning-By-Gene' method, this dataset is divided into five sample-based segments, ready for model training.
    6. Mean Expression and Flag Files: Contains mean expression values of genes and boolean flags to help filter duplicate gene symbols.
    7. Normalization and Binning Files: Utilize these with the 'normalize_expr_mat.ipynb' notebook to determine binning boundaries in new expression data.
    8. Example Input/Output: Provided for 'anal_dataset.ipynb' to demonstrate model application.
    9. Prediction Results: The 'genes_clf_pred_results.parquet' file contains the predicted results of coding and lncRNA genes.
  8. m

    HuBMAP ASCT+B Augmented with RNA-seq Coexpression

    • maayanlab.cloud
    gz
    Updated Jan 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ma'ayan Laboratory of Computational Systems Biology (2025). HuBMAP ASCT+B Augmented with RNA-seq Coexpression [Dataset]. https://maayanlab.cloud/Harmonizome/dataset/HuBMAP+ASCT%24plus%24B+Augmented+with+RNA-seq+Coexpression
    Explore at:
    gzAvailable download formats
    Dataset updated
    Jan 29, 2025
    Dataset provided by
    Ma'ayan Laboratory of Computational Systems Biology
    Harmonizome
    Authors
    Ma'ayan Laboratory of Computational Systems Biology
    Description

    Anatomical structure and cell type biomarker annotations from the HuBMAP ASCT+B tables, augmented with RNA-seq coexpression data from ARCHS4

  9. f

    Data from: Additional file 2 of Postmortem gene expression profiles in the...

    • springernature.figshare.com
    xlsx
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hyun Jung Kim; Hyeijung Yoo; Ji Yeon Kim; Soo Hyun Yang; Hyun Woo Lee; Heon-Jeong Lee; Gi Hoon Son; Hyun Kim (2024). Additional file 2 of Postmortem gene expression profiles in the habenulae of suicides: implication of endothelial dysfunction in the neurovascular system [Dataset]. http://doi.org/10.6084/m9.figshare.19881547.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 29, 2024
    Dataset provided by
    figshare
    Authors
    Hyun Jung Kim; Hyeijung Yoo; Ji Yeon Kim; Soo Hyun Yang; Hyun Woo Lee; Heon-Jeong Lee; Gi Hoon Son; Hyun Kim
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 2: Table S1. Subject information; Table S2. Full list of the main 251 DEGs; Table S3. Cell-type enrichment analysis using single-cell level gene expression profiles obtained from murine habenula; Table S4. Top-10 GO-BP terms and related genes; Table S5. Top-10 ARCHS4 co-expressed TFs and putative target DEGs; Table S6. List of EC markers that did not exhibit significant changes in mRNA expression levels.

  10. Association of copy number alterations with the immune transcriptomic...

    • zenodo.org
    Updated Jun 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    S Loipfinger; S Loipfinger; A Bhattacharya; A Bhattacharya; RSN Fehrmann; RSN Fehrmann (2025). Association of copy number alterations with the immune transcriptomic landscape in cancer [Dataset]. http://doi.org/10.5281/zenodo.13983463
    Explore at:
    Dataset updated
    Jun 1, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    S Loipfinger; S Loipfinger; A Bhattacharya; A Bhattacharya; RSN Fehrmann; RSN Fehrmann
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains supplementary materials for the research paper "Association of copy number alterations with the immune transcriptomic landscape in cancer". The materials are organized in the following folders:

    • 00_code: code to reproduce the analysis
    • 01_ica_datasets: transcriptional components, sample mixing matrix activities, and gene set enrichment analysis results of the GPL570, ARCHS4, and TCGA datasets
    • 02_ica_cna_tc: identified CNA-TCs and their captured CNA regions, genomic plots of CNA-TCs
    • 03_ica_immune_tc: identified immune-TCs, list of immune gene sets
    • 04_ica_tc_dataset_overlap: reproducibility results of CNA-TCs and immune-TCs across datasets
    • 05_immune_gene_occurences: frequency of genes with high gene weight in immune-TCs, list potential novel immune involved ORFs
    • 06_projection_immune_tc_datasets: GPL570 and TCGA cancer samples corrected mixing matrix activitiy for CNA-TCs and immune-TCs of the other dataset
    • 07_inferred_cna_profiles: TACNA profiles and CNA burden per cancer sample
    • 08_cna_burden_immune_tc_association: cancer sample associations of CNA burden, individual CNAs, and single gene CNAs with immune-TCs
    • 09_projection_single_cell: cell type activity of immune-TCs in a single-cell tumor immune atlas for precision oncology
    • 10_projection_spatial_transcriptomics: activity of CNA-TC and immune-TC for each spot in spatial transcriptomic datasets from 10xGenomics
  11. Additional file 3: of Codon usage optimization in pluripotent embryonic stem...

    • springernature.figshare.com
    xlsx
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Susanne Bornelรถv; Tommaso Selmi; Sophia Flad; Sabine Dietmann; Michaela Frye (2023). Additional file 3: of Codon usage optimization in pluripotent embryonic stem cells [Dataset]. http://doi.org/10.6084/m9.figshare.8245400.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Susanne Bornelรถv; Tommaso Selmi; Sophia Flad; Sabine Dietmann; Michaela Frye
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Table S2. Gene enrichment analyses (EnrichR) identifies highly expressed genes in tissues derived from all three germ layers using ARCHS4 enrichment tool. (XLSX 11 kb)

  12. Paired differential gene expression and splicing analyses results of 199...

    • zenodo.org
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Søren Helweg Dam; Søren Helweg Dam; Lars Rønn Olsen; Lars Rønn Olsen; Kristoffer Vitting-Seerup; Kristoffer Vitting-Seerup (2024). Paired differential gene expression and splicing analyses results of 199 baseline vs. case comparisons across 100 datasets [Dataset]. http://doi.org/10.5281/zenodo.8162229
    Explore at:
    Dataset updated
    Feb 15, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Søren Helweg Dam; Søren Helweg Dam; Lars Rønn Olsen; Lars Rønn Olsen; Kristoffer Vitting-Seerup; Kristoffer Vitting-Seerup
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains results from paired differential expression and differential splicing analyses as well as gene-set over-representation analysis results for 199 baseline vs. case comparisons across 100 randomly curated datasets with accompanying metadata (article).
    All results were computed using the R package pairedGSEA, which utilized DESeq2 (Love et al., 2014), DEXSeq (Anders et al., 2012), and fgsea (Korotkevich et al., 2019).
    See limma results here: https://doi.org/10.5281/zenodo.8162214

    Each .RDS file contains a list with four objects: A 'metadata' object with the metadata of the respective raw data, a 'genes' object with gene-level differential splicing and expression results, a 'gene_set' object with over-representation results, and 'experiment' with the experiment title.

    The filenames follow this pattern: "[dataset ID]_[GEO accession number]_[Manually assigned comparison title].RDS".

    All datasets were obtained from a local copy of the ARCHS4 v11 database of transcript counts (Lachmann et al., 2018).

  13. f

    Table_2_Diagnostic, Prognostic, and Immunological Roles of HELLS in...

    • frontiersin.figshare.com
    docx
    Updated Jun 17, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiao Liang; Linji Li; Yuchao Fan (2023). Table_2_Diagnostic, Prognostic, and Immunological Roles of HELLS in Pan-Cancer: A Bioinformatics Analysis.docx [Dataset]. http://doi.org/10.3389/fimmu.2022.870726.s007
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 17, 2023
    Dataset provided by
    Frontiers
    Authors
    Xiao Liang; Linji Li; Yuchao Fan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundInappropriate repair of DNA damage drives carcinogenesis. Lymphoid-specific helicase (HELLS) is an important component of the chromatin remodeling complex that helps repair DNA through various mechanisms such as DNA methylation, histone posttranslational modification, and nucleosome remodeling. Its role in human cancer initiation and progression has garnered recent attention. Our study aims to provide a more systematic and comprehensive understanding of the role of HELLS in the development and progression of multiple malignancies through analysis of HELLS in cancers.MethodsWe explored the role of HELLS in cancers using The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) database. Multiple web platforms and software were used for data analysis, including R, Cytoscape, HPA, Archs4, TISIDB, cBioPortal, STRING, GSCALite, and CancerSEA.ResultsHigh HELLS expression was found in a variety of cancers and differentially expressed across molecular and immune subtypes. HELLS was involved in many cancer pathways. Its expression positively correlated with Th2 and Tcm cells in most cancers. It also correlated with genetic markers of immunomodulators in various cancers.ConclusionsOur study elucidates the role HELLS plays in promotion, inhibition, and treatment of different cancers. HELLS is a potential cancer diagnostic and prognostic biomarker with immune, targeted, or cytotoxic therapeutic value. This work is a prerequisite to clinical validation and treatment of HELLS in cancers.

  14. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Alexander (2021). Multiple Single Cell RNA Expressions ARCHS4 [Dataset]. https://www.kaggle.com/alexandervc/multiple-single-cell-rna-expressions-archs4
Organization logo

Multiple Single Cell RNA Expressions ARCHS4

Explore at:
zip(23088130184 bytes)Available download formats
Dataset updated
Jun 26, 2021
Authors
Alexander
Description

Context

Dataset is downloaded from https://amp.pharm.mssm.edu/archs4/download.html The methods are described in Nature Communications paper: https://www.nature.com/articles/s41467-018-03751-6

The ARCHS4 data provides user-friendly access to multiple gene expression data from the GEO database. (https://www.ncbi.nlm.nih.gov/geo/ ). While in GEO database most of data is stored in raw formats, ARCHS4 provides prepared count matrix expression data. While GEO contains data stored separately for each research paper, ARCHS4 collects all the information in one single matrix. One may consult the main site for further information.

Main data files are in H5 (HD5, Hierarchical Data Format ) file format https://en.wikipedia.org/wiki/Hierarchical_Data_Format It contains expression data, as well as annotation data and futher meta-information. There are several other auxilliary files like TSNE 3d projection (in CSV format) and correlation matrices for genes for human and mouse in feather format.

Content

The main file (for human): human_matrix.h5 - contains data matrix - which is 238522 samples times 35238 genes, as well as, various meta information: gene names, samples information (tissue, etc), references to GEO database id where all the details can be found.

There is also similar data for mouse, csv files with TSNE images, correlation matrices for genes.

Acknowledgements

The ARCHS4 project is by :

'Alexander Lachmann', 'alexander.lachmann@mssm.edu', update: '2020-02-06'

Search
Clear search
Close search
Google apps
Main menu