100+ datasets found
  1. d

    Data from: Gene Expression Omnibus (GEO)

    • catalog.data.gov
    • data.virginia.gov
    • +2more
    Updated Jul 26, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (NIH) (2023). Gene Expression Omnibus (GEO) [Dataset]. https://catalog.data.gov/dataset/gene-expression-omnibus-geo
    Explore at:
    Dataset updated
    Jul 26, 2023
    Dataset provided by
    National Institutes of Health (NIH)
    Description

    Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided to help users query and download experiments and curated gene expression profiles.

  2. f

    Gene expression data from Gene Expression Omnibus (GEO) database.

    • datasetcatalog.nlm.nih.gov
    Updated Mar 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dai, Minchen; Xie, Ningning; Fu, Leyi; Zhang, Songying; Jiang, Zhou; Wang, Fangfang; Zhou, Jue; Qu, Fan (2023). Gene expression data from Gene Expression Omnibus (GEO) database. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001040576
    Explore at:
    Dataset updated
    Mar 1, 2023
    Authors
    Dai, Minchen; Xie, Ningning; Fu, Leyi; Zhang, Songying; Jiang, Zhou; Wang, Fangfang; Zhou, Jue; Qu, Fan
    Description

    Gene expression data from Gene Expression Omnibus (GEO) database.

  3. Gene Expression Omnibus (GEO) Dataset: GSE68086

    • kaggle.com
    zip
    Updated Sep 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samira Alipour (2024). Gene Expression Omnibus (GEO) Dataset: GSE68086 [Dataset]. https://www.kaggle.com/datasets/samiraalipour/gene-expression-omnibus-geo-dataset-gse68086/code
    Explore at:
    zip(7850064 bytes)Available download formats
    Dataset updated
    Sep 16, 2024
    Authors
    Samira Alipour
    Description

    Gene Expression Omnibus (GEO) Dataset: GSE68086

    This dataset, available on the Gene Expression Omnibus (GEO) platform, provides valuable insights into cancer diagnostics through the analysis of tumor-educated platelets (TEPs). It highlights the potential of liquid biopsies for non-invasive cancer detection across multiple cancer types.

    Dataset Overview:

    • Title: RNA-seq of tumor-educated platelets enables blood-based pan-cancer, multiclass, and molecular pathway cancer diagnostics.
    • Organism: Homo sapiens
    • Experiment Type: Expression profiling by high-throughput sequencing
    • Sample Size: 283 blood platelet samples
      • 228 tumor-educated platelet (TEP) samples from patients with six different malignant tumors.
      • 55 samples from healthy individuals.

    Cancer Types Included: - Non-small cell lung cancer - Colorectal cancer - Pancreatic cancer - Glioblastoma - Breast cancer - Hepatobiliary carcinomas

    Methodology:

    • Sample Collection: Blood platelets were isolated from whole blood using EDTA anti-coagulant.
    • RNA Extraction: Total RNA was extracted from platelet pellets using the mirVana RNA isolation kit.
    • Sequencing: cDNA synthesis and amplification were performed using the SMARTer Ultra Low RNA Kit, followed by Covaris shearing and Illumina HiSeq 2500 sequencing.
    • Quality Control: Performed using Bioanalyzer 2100 with RNA 6000 Picochip, DNA 7500, and DNA High Sensitivity chips.

    Data Processing:

    • Quality control using Trimmomatic
    • Mapping to the hg19 reference genome using STAR (version 2.3.0)
    • Intron-spanning reads selected using Picard-tools (version 1.115)
    • Read summarization using HTseq (version 0.6.1)

    Data Structure:

    • Samples: 285 columns (including controls)
    • Features: 57,736 Ensembl gene IDs (rows)
    • Data Type: Intron-spanning read counts

    Files Included:

    1. GSE68086_TEP_data_matrix.txt.gz (3.6 MB): Original gzipped text file containing intron-spanning RNA-seq read counts.
    2. GSE68086_TEP_data_matrix.csv: Converted CSV file of the original data.
    3. GSE68086_series_matrix.txt: Series matrix file containing detailed sample information.
    4. GSE68086_series_matrix.csv: Converted CSV version of the series matrix file.

    Potential Applications:

    • Non-invasive cancer diagnostics: Exploring liquid biopsies for cancer detection.
    • Identification of cancer-specific biomarkers.
    • Study of cancer-induced changes in platelet RNA profiles.
    • Comparative analysis across different cancer types.

    Machine Learning Models for:

    • Binary classification: Healthy vs. cancer patients.
    • Multiclass classification: Distinguishing between different cancer types.
    • Molecular pathway analysis for identifying cancer-specific pathways.

    Importance:

    This dataset offers significant potential for advancing cancer diagnostics by leveraging tumor-educated platelets as biomarkers for early detection and classification of various cancer types. It represents a promising approach to non-invasive, blood-based cancer screening using gene expression profiles.

    Data Access and Analysis:

    • GEO Accession: GSE68086
    • Online Analysis: Available through GEO2R
    • R Package: Data can be accessed and analyzed using the GEOquery package.

    Citation: Best MG, Sol N, Kooi I, Tannous J, et al. RNA-Seq of Tumor-Educated Platelets Enables Blood-Based Pan-Cancer, Multiclass, and Molecular Pathway Cancer Diagnostics. Cancer Cell, 2015 Nov 9;28(5):666-676. PMID: 26525104

  4. Field-wide assessment of differential HT-seq from NCBI GEO database

    • zenodo.org
    • data.niaid.nih.gov
    application/gzip
    Updated Jan 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp (2023). Field-wide assessment of differential HT-seq from NCBI GEO database [Dataset]. http://doi.org/10.5281/zenodo.7529832
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Jan 13, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We analysed the field of expression profiling by high throughput sequencing, or HT-seq, in terms of replicability and reproducibility, using data from the NCBI GEO (Gene Expression Omnibus) repository.

    - This release includes GEO series published up to Dec-31, 2020;

    geo-htseq.tar.gz archive contains following files:

    - output/parsed_suppfiles.csv, p-value histograms, histogram classes, estimated number of true null hypotheses (pi0).

    - output/document_summaries.csv, document summaries of NCBI GEO series.

    - output/suppfilenames.txt, list of all supplementary file names of NCBI GEO submissions.

    - output/suppfilenames_filtered.txt, list of supplementary file names used for downloading files from NCBI GEO.

    - output/publications.csv, publication info of NCBI GEO series.

    - output/scopus_citedbycount.csv, Scopus citation info of NCBI GEO series

    - output/spots.csv, NCBI SRA sequencing run metadata.

    - output/cancer.csv, cancer related experiment accessions.

    - output/transcription_factor.csv, TF related experiment accessions.

    - output/single-cell.csv, single cell experiment accessions.

    - blacklist.txt, list of supplementary files that were either too large to import or were causing computing environment crash during import.

    Workflow to produce this dataset is available on Github at rstats-tartu/geo-htseq.

    geo-htseq-updates.tar.gz archive contains files:

    - results/detools_from_pmc.csv, differential expression analysis programs inferred from published articles

    - results/n_data.csv, manually curated sample size info for NCBI GEO HT-seq series

    - results/simres_df_parsed.csv, pi0 values estimated from differential expression results obtained from simulated RNA-seq data

    - results/data/parsed_suppfiles_rerun.csv, pi0 values estimated using smoother method from anti-conservative p-value sets

  5. l

    Human DNA methylation data set GSM2819625 stored in NCBI (GEO)

    • seek.lisym.org
    Updated Jan 26, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mario Brosch (2022). Human DNA methylation data set GSM2819625 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/530
    Explore at:
    Dataset updated
    Jan 26, 2022
    Authors
    Mario Brosch
    License

    https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/

    Description

    Human DNA methylation data stored in NCBI (GEO) Dataset GSM281962; liver tissue sample 7041_CV_RRBS https://seek.lisym.org/samples/135

  6. l

    Human RNA-Seq data set GSM2819712 stored in NCBI (GEO)

    • seek.lisym.org
    Updated Aug 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mario Brosch (2022). Human RNA-Seq data set GSM2819712 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/685
    Explore at:
    Dataset updated
    Aug 29, 2022
    Authors
    Mario Brosch
    License

    https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/

    Description

    Human RNA-Seq data set GSM2819712 stored in NCBI (GEO)

  7. d

    Entrez GEO Profiles

    • dknet.org
    • scicrunch.org
    • +1more
    Updated Sep 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Entrez GEO Profiles [Dataset]. http://identifiers.org/RRID:SCR_004584
    Explore at:
    Dataset updated
    Sep 9, 2024
    Description

    The GEO Profiles database stores gene expression profiles derived from curated GEO DataSets. Each Profile is presented as a chart that displays the expression level of one gene across all Samples within a DataSet. Experimental context is provided in the bars along the bottom of the charts making it possible to see at a glance whether a gene is differentially expressed across different experimental conditions. Profiles have various types of links including internal links that connect genes that exhibit similar behaviour, and external links to relevant records in other NCBI databases. GEO Profiles can be searched using many different attributes including keywords, gene symbols, gene names, GenBank accession numbers, or Profiles flagged as being differentially expressed.

  8. GEOS-5 FP-IT 3D Time-Averaged Model-Layer Assimilated Data Geo-Colocated to...

    • catalog.data.gov
    • datasets.ai
    • +5more
    Updated Sep 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/GSFC/SED/ESD/TISL/GESDISC (2025). GEOS-5 FP-IT 3D Time-Averaged Model-Layer Assimilated Data Geo-Colocated to OMI/Aura UV2 1-Orbit L2 Swath 13x24km V4 (OMUFPMET) at GES DISC [Dataset]. https://catalog.data.gov/dataset/geos-5-fp-it-3d-time-averaged-model-layer-assimilated-data-geo-colocated-to-omi-aura-uv2-1-e97eb
    Explore at:
    Dataset updated
    Sep 19, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    The GEOS-5 FP-IT 3D Time-Averaged Model-Layer Assimilated Data Geo-Colocated to OMI/Aura UV2 1-Orbit L2 Swath 13x24km (OMUFPMET) product provides selected meteorlogical fields from the GEOS-5 Forward Processing for Instrument Teams (FP-IT) assimilated product produced by the Global Modeling and Assimilation Office (GMAO) co-located in space and time with the OMI UV-2 swath.The fields in this product include layer pressure thickness, surface pressure, vertical temperature profiles, surface potential, and mid-layer pressure along with geolocation info. The OMI team also provides a corresponding product for the OMI VIS swath, OMVFPMET. The OMI ancillary products were developed to provide supplementary information for use with the OMI collection 4 L1B data sets. The original GEOS-5 FP-IT data are reported on a 0.625 deg longitude by 0.5 deg latitude grid, whereas the OMI UV-2 spatial resolution is 13km x 24km at nadir.The OMUFPMET files are in netCDF4 format which is compatible with most netCDF and HDF5 readers and tools. Each file is approximately 45mb in size. The lead for this product is Zachary Fasnacht of SSAI. Joanna Joiner is the responsible NASA official.

  9. Primary Ancillary Data Geo-Colocated to OMI/Aura UV2 1-Orbit L2 Swath...

    • data.nasa.gov
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). Primary Ancillary Data Geo-Colocated to OMI/Aura UV2 1-Orbit L2 Swath 13x24km V4 (OMUANC) at GES DISC - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/primary-ancillary-data-geo-colocated-to-omi-aura-uv2-1-orbit-l2-swath-13x24km-v4-omuanc-at-2100c
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    The Primary Ancillary Data Geo-Colocated to OMI/Aura UV2 1-Orbit L2 Swath 13x24km (OMUANC) provides selected parameters from GEOS-5 Forward Processing for Instrument Teams (FP-IT) assimilated product produced by the Global Modeling and Assimilation Office (GMAO) co-located in space and time with the OMI UV-2 swath.The fields in this product include snow cover, sea ice cover, land cover, terrain height, row anomaly flag, and pixel area. The OMI team also provides a corresponding product for the OMI VIS swath, OMVANC. This product has been generated for convenient use by the OMI/Aura team in their L2 algorithms, and for research where those L2 products are used. The original GEOS-5 FP-IT data are reported on a 0.625 deg longitude by 0.5 deg latitude grid, whereas the OMI UV-2 spatial resolution is 13km x 24km at nadir.The OMUANC files are in netCDF4 format which is compatible with most netCDF and HDF5 readers and tools. Each file is approximately 45mb in size. The lead for this product is Zachary Fasnacht of SSAI. Joanna Joiner is the responsible NASA official.

  10. Geo data

    • kaggle.com
    zip
    Updated Aug 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Victor Jacob Asuquo (2025). Geo data [Dataset]. https://www.kaggle.com/datasets/victorjacobasuquo/geo-data
    Explore at:
    zip(81153337 bytes)Available download formats
    Dataset updated
    Aug 11, 2025
    Authors
    Victor Jacob Asuquo
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Victor Jacob Asuquo

    Released under Apache 2.0

    Contents

  11. Open postcode geo - Dataset - data.gov.uk

    • ckan.publishing.service.gov.uk
    Updated Oct 17, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ckan.publishing.service.gov.uk (2016). Open postcode geo - Dataset - data.gov.uk [Dataset]. https://ckan.publishing.service.gov.uk/dataset/open-postcode-geo2
    Explore at:
    Dataset updated
    Oct 17, 2016
    Dataset provided by
    CKANhttps://ckan.org/
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Open Postcode Geo is a postcode dataset and API optimised for geocoding applications. You can use Open Postcode Geo to geocode a dataset, geocode user input, and therefore build a proximity search. Data is derived from the ONS (Office for National Statistics) postcode database and is free to use, subject to including attributions to ONS, OS (Ordinance Survey) and Royal Mail. Information is also provided on a range of topics, including education, health, crime, business, etc. Postcodes can be entered at area, district, sector, and unit level - see Postcode map for the geographical relationship between these.

  12. f

    The properties of the GEO datasets.

    • datasetcatalog.nlm.nih.gov
    Updated Dec 16, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cheng, Hao; Niu, Shuaijun; Han, Xiao; Ren, Jianxue; Wang, Huiqing; Duan, Yimeng (2024). The properties of the GEO datasets. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001413861
    Explore at:
    Dataset updated
    Dec 16, 2024
    Authors
    Cheng, Hao; Niu, Shuaijun; Han, Xiao; Ren, Jianxue; Wang, Huiqing; Duan, Yimeng
    Description

    Ovarian cancer is a malignant tumor with different clinicopathological and molecular characteristics. Due to its nonspecific early symptoms, the majority of patients are diagnosed with local or extensive metastasis, severely affecting treatment and prognosis. The occurrence of ovarian cancer is influenced by multiple complex mechanisms including genomics, transcriptomics, and proteomics. Integrating multiple types of omics data aids in predicting the survival rate of ovarian cancer patients. However, existing methods only fuse multi-omics data at the feature level, neglecting the shared and complementary neighborhood information among samples of multi-omics data, and failing to consider the potential interactions between different omics data at the molecular level. In this paper, we propose a prognostic model for ovarian cancer prediction named Dual Fusion Channels and Stacked Graph Convolutional Neural Network (DFASGCNS). The DFASGCNS utilizes dual fusion channels to learn feature representations of different omics data and the associations between samples. Stacked graph convolutional network is used to comprehensively learn the deep and intricate correlation networks present in multi-omics data, enhancing the model’s ability to represent multi-omics data. An attention mechanism is introduced to allocate different weights to important features of different omics data, optimizing the feature representation of multi-omics data. Experimental results demonstrate that compared to existing methods, the DFASGCNS model exhibits significant advantages in ovarian cancer prognosis prediction and survival analysis. Kaplan-Meier curve analysis results indicate significant differences in the survival subgroups predicted by the DFASGCNS model, contributing to a deeper understanding of the pathogenesis of ovarian cancer and providing more reliable auxiliary diagnostic information for the prognosis assessment of ovarian cancer patients.

  13. High-throughput transcriptomics platform for screening...

    • catalog.data.gov
    • datasets.ai
    Updated Nov 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2021). High-throughput transcriptomics platform for screening hepatotoxicants-NCBI/GEO GSE152128 [Dataset]. https://catalog.data.gov/dataset/high-throughput-transcriptomics-platform-for-screening-hepatotoxicants-ncbi-geo-gse152128
    Explore at:
    Dataset updated
    Nov 21, 2021
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    We introduce a new high-throughput transcriptomics (HTTr) platform comprised of a collagen sandwich primary rat hepatocyte culture and the TempO-Seq assay for screening and prioritizing potential hepatotoxicants. We selected 14 chemicals based on their risk of drug-induced liver injury (DILI) and tested them in hepatocytes at two treatment concentrations. HTTr data was generated using the TempO-Seq whole transcriptome and S1500+ assays. The HTTr platform exhibited high reproducibility between technical replicates (r>0.9) but biological replication was greater for TempO-Seq S1500+ (r>0.85) than for the whole transcriptome (r>0.7). Reproducibility between biological replicates was dependent on the strength of transcriptional effects induced by a chemical treatment. Despite targeting a smaller number of genes, the S1500+ assay clustered chemical treatments and produced gene set enrichment analysis (GSEA) scores comparable to those of the whole transcriptome. Connectivity mapping showed a high-level of reproducibility between TempO-Seq data and Affymetrix GeneChip data from the Open TG-GATES project with high concordance between the S1500+ gene set and whole transcriptome. Taken together, our results provide guidance on selecting the number of technical and biological replicates and support the use of TempO-Seq S1500+ assay for a high-throughput platform for screening hepatotoxicants. FASTQ files and read counts data have been deposited in the National Center for Biotechnology Information Gene Expression Omnibus (GEO) (GSE152128). This dataset is associated with the following publication: Lee, F., I. Shah, Y.T. Soong, J. Xing, I.C. Ng, F. Tasnim, and H. Yu. Reproducibility and Robustness of High-Throughput S1500+ Transcriptomics on Primary Rat Hepatocytes for Chemical-Induced Hepatotoxicity Assessment. Current Research in Toxicology. Elsevier B.V., Amsterdam, NETHERLANDS, 2: 282-295, (2021).

  14. o

    2012 Methodology Study Public with Geo - Datasets - Open Data Pakistan

    • opendata.com.pk
    Updated Sep 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). 2012 Methodology Study Public with Geo - Datasets - Open Data Pakistan [Dataset]. https://opendata.com.pk/dataset/2012-methodology-study-public-with-geo
    Explore at:
    Dataset updated
    Sep 8, 2025
    Area covered
    Pakistan
    Description

    2012 Methodology Study Public with Geo

  15. f

    Data Sheet 3_Investigating the metabolic reprogramming mechanisms in...

    • figshare.com
    csv
    Updated Aug 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shan He; Yi Wei Chen; Jian Ye; Yu Wang; Qin Kai Chen; Si Yi Liu (2025). Data Sheet 3_Investigating the metabolic reprogramming mechanisms in diabetic nephropathy: a comprehensive analysis using bioinformatics and machine learning.csv [Dataset]. http://doi.org/10.3389/fcell.2025.1630708.s005
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset provided by
    Frontiers
    Authors
    Shan He; Yi Wei Chen; Jian Ye; Yu Wang; Qin Kai Chen; Si Yi Liu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    BackgroundDiabetic nephropathy (DN) is a common complication of diabetes, characterized by damage to renal tubules and glomeruli, leading to progressive renal dysfunction. The aim of our study is to explore the key role of metabolic reprogramming (MR) in the pathogenesis of DN.MethodsIn our study, three transcriptome datasets (GSE30528, GSE30529, and GSE96804) were sourced from the Gene Expression Omnibus (GEO) database. These datasets were integrated for batch effect correction and subsequently subjected to differential expression analysis to identify differentially expressed genes (DEGs) between DN and control samples. The identified DEGs were cross-referenced with genes associated with MR to derive MR associated differentially expressed genes (MRRDEGs). These MRRDEGs underwent Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. To identify key genes and develop diagnostic models, four machine learning algorithms were employed in conjunction with weighted gene co-expression network analysis (WGCNA) and the protein interaction tool CytoHubba. Gene set enrichment analysis (GSEA) and CIBERSORT analysis were conducted on the key genes to assess immune cell infiltration in DN. Additionally, a competitive endogenous RNA (ceRNA) network was constructed using the key genes. Finally, the expression levels of core genes in human samples were validated through quantitative real-time PCR (qRT-PCR).ResultsWe identified 256 MRRDEGs, highlighting metabolic and inflammatory pathways in DN. KEGG analysis linked these genes to the MAPK signaling pathway, suggesting its key role in DN. Six key genes were pinpointed using WGCNA, PPI, and machine learning, with their diagnostic value confirmed by ROC analysis. CIBERSORT revealed a strong link between these genes and immune cell infiltration, indicating the immune response’s role in DN. GSEA showed these genes’ involvement in inflammatory and metabolic processes. A ceRNA network was predicted to clarify gene regulation. qRT-PCR confirmed the expression patterns of CXCR2, NAMPT, and CUEDC2, aligning with bioinformatics results.ConclusionThrough bioinformatics analysis, a total of six potential MRRDEGs were identified, among which CUEDC2, NAMPT, CXCR2 could serve as potential biomarkers.

  16. f

    DataSheet1_Identification of potential therapeutic targets for systemic...

    • datasetcatalog.nlm.nih.gov
    Updated Oct 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhetaer, Adalaiti; Aobulitalifu, Alimijiang; Apaer, Aishanjiang; Sulitan, Maierhaba; Ajimu, Nuermaimaiti; Cheng, Lei; Wen, Fujie; Shi, Yanyan (2024). DataSheet1_Identification of potential therapeutic targets for systemic lupus erythematosus based on GEO database analysis and Mendelian randomization analysis.csv [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001322086
    Explore at:
    Dataset updated
    Oct 16, 2024
    Authors
    Muhetaer, Adalaiti; Aobulitalifu, Alimijiang; Apaer, Aishanjiang; Sulitan, Maierhaba; Ajimu, Nuermaimaiti; Cheng, Lei; Wen, Fujie; Shi, Yanyan
    Description

    BackgroundSystemic lupus erythematosus (SLE) is a complex autoimmune disease. Current treatments mainly rely on immunosuppressants, which lack specificity and pose challenges during treatment. This study aims to deeply explore the molecular pathogenic mechanism of SLE through gene expression databases (GEO) and bioinformatics analysis methods, combined with Mendelian randomization analysis, to provide key clues for new therapeutic targets.MethodsIn this study, the SLE-related gene chip dataset GSE65391 was selected from the GEO database, and the data were preprocessed and statistically analyzed using R language and bioinformatics tools. Differential expression analysis, weighted gene co-expression network analysis (WGCNA), GO, and KEGG enrichment analysis were used to screen differentially expressed genes (DEGs) for functional annotation and pathway localization. Furthermore, Mendelian randomization analysis was conducted to identify core genes closely related to SLE risk, and immune cell infiltration analysis and compound molecular docking studies were performed on the core gene ISG15.ResultsThe study successfully screened 3,456 DEGs and identified core gene modules highly related to SLE through WGCNA analysis, including key genes closely related to the pathogenesis of SLE, such as STAT1, DDX58, ISG15, IRF7, and IFIH1. In particular, this study found a significant positive correlation between the ISG15 gene and SLE, suggesting that it may be a potential risk factor for SLE. Additionally, through molecular docking technology, it was discovered that the ISG15 gene can effectively bind to two compounds, genistein, and flavopiridol, which have anti-inflammatory and immunosuppressive effects, respectively. This provides new potential drug targets for SLE treatment.DiscussionAs an immunomodulatory cytokine, ISG15 plays a crucial role in the pathogenesis of SLE. This study found that variations in the ISG15 gene may increase the risk of SLE and exacerbate inflammatory responses and tissue damage through multiple mechanisms. Furthermore, molecular docking revealed that genistein and flavopiridol can effectively bind to ISG15, offering a new approach for SLE treatment. These two compounds, with their anti-inflammatory and immunosuppressive properties, have the potential to slow the progression of SLE by influencing the expression and function of ISG15.ConclusionThrough comprehensive bioinformatics analysis and Mendelian randomization analysis, this study deeply explored the molecular pathogenic mechanism of SLE and successfully identified ISG15 as a potential therapeutic target for SLE. Simultaneously, molecular docking technology revealed that two compounds, genistein and flavopiridol, have potential therapeutic effects with ISG15, providing new potential drugs for SLE treatment. These discoveries not only enhance our understanding of the pathogenesis of SLE but also provide important clues for developing new treatment strategies.

  17. C9ORF72 GGGGCC expanded repeats of-ALS-GSE68607

    • kaggle.com
    zip
    Updated Jan 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    bpotter_head (2023). C9ORF72 GGGGCC expanded repeats of-ALS-GSE68607 [Dataset]. https://www.kaggle.com/datasets/bhavithabairapureddy/c9orf72-ggggcc-expanded-repeats-ofalsgse68607
    Explore at:
    zip(7487 bytes)Available download formats
    Dataset updated
    Jan 11, 2023
    Authors
    bpotter_head
    Description

    C9ORF72 GGGGCC expanded repeats produce splicing dysregulation which correlates with disease severity in amyotrophic lateral sclerosis .This data set is of 45 samples of which 15 are control and 30 are diseased. The genes were chosen based on the P.value. There are both up regulated and down regulated genes.

  18. k

    UNEP GEO Data Portal Particulate Matter Concentration

    • datasource.kapsarc.org
    • data.kapsarc.org
    • +1more
    Updated Dec 20, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). UNEP GEO Data Portal Particulate Matter Concentration [Dataset]. https://datasource.kapsarc.org/explore/dataset/unep-geo-data-portal-particulate-matter-concentration/
    Explore at:
    Dataset updated
    Dec 20, 2016
    Description

    The GEO Data Portal is the authoritative source for data sets used by UNEP and its partners in the Global Environment Outlook (GEO) report and other integrated environment assessments. The GEO Data Portal gives access to a broad socio-economic data sets from authoritative sources at global, regional, sub-regional and national levels. The contents of the Data Portal cover environmental themes such as climate, forests and freshwater and many others, as well as socioeconomic categories, including education, health, economy, population and environmental policies.

  19. s

    StemBase

    • scicrunch.org
    • neuinfo.org
    • +1more
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    StemBase [Dataset]. http://identifiers.org/RRID:SCR_006252)
    Explore at:
    Description

    A publicly accessible database containing data on Affymetrix DNA microarray experiments, and Serial Analysis of Gene Expression, mostly on human and mouse stem cell samples and their derivatives to facilitate the discovery of gene functions relevant to stem cell control and differentiation. It has grown in both size and scope into a system with analysis tools that examine either the whole database at once, or slices of data, based on tissue type, cell type or gene of interest. There is currently more than 210 stem cell samples in 60 different experiments, with more being added regularly. The samples were originated by researchers of the Stem Cell Network and processed at the Core Facility of Stemcore Laboratories under the management of Ms. Pearl Campbell in the frame of the Stem Cell Genomics Project. Periodically, new expression data is submitted to the Gene Expression Omnibus (GEO) repository at the National Center for Biotechnological Information, in order to allow researchers to compare the data deposited in StemBase to a large amount of gene expression data sets. StemBase is different from GEO in both focus and scope. StemBase is concerned exclusively with stem cell related data. we are focused in Stem Cell research. We have made a significant effort to ensure the quality and consistency of the data included. This allows us to offer more specialized analysis tools related to Stem Cell data. GEO is intended as a large scale public archive. Deposition in a public repository such as GEO is required by most important scientific journals and it is advantageous for a further diffusion of the data since GEO is more broadly used than StemBase.

  20. W

    UNEP GEO Data Portal - WMS

    • cloud.csiss.gmu.edu
    wms
    Updated Mar 21, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GEOSS CSR (2019). UNEP GEO Data Portal - WMS [Dataset]. https://cloud.csiss.gmu.edu/uddi/zh_CN/dataset/unep-geo-data-portal-wms
    Explore at:
    wmsAvailable download formats
    Dataset updated
    Mar 21, 2019
    Dataset provided by
    GEOSS CSR
    Description

    The GEO Data Portal is the authoritative source for data sets used by UNEP and its partners in the Global Environment Outlook (GEO) report and other integrated environment assessments. Its online database holds more than 500 different variables, as national, subregional, regional and global statistics or as geospatial data sets (maps), covering themes like Freshwater, Population, Forests, Emissions, Climate, Disasters, Health and GDP. Display them on-the-fly as maps, graphs, data tables or download the data in different formats. By using the GetCapabilities request, one can see the whole list of available data layers.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
National Institutes of Health (NIH) (2023). Gene Expression Omnibus (GEO) [Dataset]. https://catalog.data.gov/dataset/gene-expression-omnibus-geo

Data from: Gene Expression Omnibus (GEO)

Related Article
Explore at:
Dataset updated
Jul 26, 2023
Dataset provided by
National Institutes of Health (NIH)
Description

Gene Expression Omnibus is a public functional genomics data repository supporting MIAME-compliant submissions of array- and sequence-based data. Tools are provided to help users query and download experiments and curated gene expression profiles.

Search
Clear search
Close search
Google apps
Main menu