42 datasets found

l
Human RNA-Seq data set GSM2819694 stored in NCBI (GEO)
seek.lisym.org
Updated Feb 3, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). Human RNA-Seq data set GSM2819694 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/680
Explore at:
Dataset updated
Feb 3, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
Human RNA-Seq data set GSM2819694 stored in NCBI (GEO)
l
Human RNA-Seq data set GSM2819698 stored in NCBI (GEO)
seek.lisym.org
Updated Aug 29, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). Human RNA-Seq data set GSM2819698 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/576
Explore at:
Dataset updated
Aug 29, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
Human RNA-Seq data set GSM2819698 stored in NCBI (GEO)

liver tissue sample : 6922_IZ_RNA
l
Human RNA-Seq data set GSM2819696 stored in NCBI (GEO)
seek.lisym.org
Updated Jan 26, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). Human RNA-Seq data set GSM2819696 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/574
Explore at:
Dataset updated
Jan 26, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
Human RNA-Seq data set GSM2819696 stored in NCBI (GEO)

liver tissue sample : 6610_IZ_RNA
l
Human DNA methylation data set GSM2819623 stored in NCBI (GEO)
seek.lisym.org
Updated Jan 25, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). Human DNA methylation data set GSM2819623 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/495
Explore at:
Dataset updated
Jan 25, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
Human DNA methylation data stored in NCBI (GEO) Dataset GSM2819623;

liver tissue sample** 6967_ CV_ RRBS** in SEEK https://seek.lisym.org/samples/133
l
Human DNA methylation data set GSM2819637 stored in NCBI (GEO)
seek.lisym.org
Updated Jan 28, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). Human DNA methylation data set GSM2819637 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/606
Explore at:
Dataset updated
Jan 28, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
Human DNA methylation data set GSM2819637 stored in NCBI (GEO)
Field-wide assessment of differential HT-seq from NCBI GEO database
zenodo.org
data.niaid.nih.gov
application/gzip
Updated Jan 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp (2023). Field-wide assessment of differential HT-seq from NCBI GEO database [Dataset]. http://doi.org/10.5281/zenodo.7529832
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7529832
Dataset updated
Jan 13, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We analysed the field of expression profiling by high throughput sequencing, or HT-seq, in terms of replicability and reproducibility, using data from the NCBI GEO (Gene Expression Omnibus) repository.

- This release includes GEO series published up to Dec-31, 2020;

geo-htseq.tar.gz archive contains following files:

- output/parsed_suppfiles.csv, p-value histograms, histogram classes, estimated number of true null hypotheses (pi0).

- output/document_summaries.csv, document summaries of NCBI GEO series.

- output/suppfilenames.txt, list of all supplementary file names of NCBI GEO submissions.

- output/suppfilenames_filtered.txt, list of supplementary file names used for downloading files from NCBI GEO.

- output/publications.csv, publication info of NCBI GEO series.

- output/scopus_citedbycount.csv, Scopus citation info of NCBI GEO series

- output/spots.csv, NCBI SRA sequencing run metadata.

- output/cancer.csv, cancer related experiment accessions.

- output/transcription_factor.csv, TF related experiment accessions.

- output/single-cell.csv, single cell experiment accessions.

- blacklist.txt, list of supplementary files that were either too large to import or were causing computing environment crash during import.

Workflow to produce this dataset is available on Github at rstats-tartu/geo-htseq.

geo-htseq-updates.tar.gz archive contains files:

- results/detools_from_pmc.csv, differential expression analysis programs inferred from published articles

- results/n_data.csv, manually curated sample size info for NCBI GEO HT-seq series

- results/simres_df_parsed.csv, pi0 values estimated from differential expression results obtained from simulated RNA-seq data

- results/data/parsed_suppfiles_rerun.csv, pi0 values estimated using smoother method from anti-conservative p-value sets
Field-wide assessment of differential HT-seq from NCBI GEO database
zenodo.org
application/gzip
Updated Jan 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp (2023). Field-wide assessment of differential HT-seq from NCBI GEO database [Dataset]. http://doi.org/10.5281/zenodo.5139281
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5139281
Dataset updated
Jan 13, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Taavi Päll; Taavi Päll; Hannes Luidalepp; Tanel Tenson; Tanel Tenson; Ülo Maiväli; Ülo Maiväli; Hannes Luidalepp
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
We analyzed the field of expression profiling by high throughput sequencing, or HT-seq, in terms of replicability and reproducibility, using data from the NCBI GEO (Gene Expression Omnibus) repository.

Archived dataset contains following files:

- output/parsed_suppfiles.csv, p-value histograms, histogram classes, estimated number of true null hypotheses (pi0).

- output/document_summaries.csv, document summaries of NCBI GEO series

- output/publications.csv, publication info of NCBI GEO series

- output/scopus_citedbycount.csv, Scopus citation info of NCBI GEO series

- output/single-cell.csv, single cell experiments

- spots.csv, NCBI SRA sequencing run metadata

- suppfilenames.txt, list of all supplementary file names of NCBI GEO submissions. One filename per row.

- suppfilenames_filtered.txt, list of supplementary file names used for downloading files from NCBI GEO. One filename per row.
High-throughput transcriptomics platform for screening...
catalog.data.gov
datasets.ai
Updated Nov 21, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2021). High-throughput transcriptomics platform for screening hepatotoxicants-NCBI/GEO GSE152128 [Dataset]. https://catalog.data.gov/dataset/high-throughput-transcriptomics-platform-for-screening-hepatotoxicants-ncbi-geo-gse152128
Explore at:
Dataset updated
Nov 21, 2021
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
We introduce a new high-throughput transcriptomics (HTTr) platform comprised of a collagen sandwich primary rat hepatocyte culture and the TempO-Seq assay for screening and prioritizing potential hepatotoxicants. We selected 14 chemicals based on their risk of drug-induced liver injury (DILI) and tested them in hepatocytes at two treatment concentrations. HTTr data was generated using the TempO-Seq whole transcriptome and S1500+ assays. The HTTr platform exhibited high reproducibility between technical replicates (r>0.9) but biological replication was greater for TempO-Seq S1500+ (r>0.85) than for the whole transcriptome (r>0.7). Reproducibility between biological replicates was dependent on the strength of transcriptional effects induced by a chemical treatment. Despite targeting a smaller number of genes, the S1500+ assay clustered chemical treatments and produced gene set enrichment analysis (GSEA) scores comparable to those of the whole transcriptome. Connectivity mapping showed a high-level of reproducibility between TempO-Seq data and Affymetrix GeneChip data from the Open TG-GATES project with high concordance between the S1500+ gene set and whole transcriptome. Taken together, our results provide guidance on selecting the number of technical and biological replicates and support the use of TempO-Seq S1500+ assay for a high-throughput platform for screening hepatotoxicants. FASTQ files and read counts data have been deposited in the National Center for Biotechnology Information Gene Expression Omnibus (GEO) (GSE152128). This dataset is associated with the following publication: Lee, F., I. Shah, Y.T. Soong, J. Xing, I.C. Ng, F. Tasnim, and H. Yu. Reproducibility and Robustness of High-Throughput S1500+ Transcriptomics on Primary Rat Hepatocytes for Chemical-Induced Hepatotoxicity Assessment. Current Research in Toxicology. Elsevier B.V., Amsterdam, NETHERLANDS, 2: 282-295, (2021).
l
HumanDNA methylation data set GSM2819620 stored in NCBI (GEO)
seek.lisym.org
Updated Jan 25, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mario Brosch (2022). HumanDNA methylation data set GSM2819620 stored in NCBI (GEO) [Dataset]. https://seek.lisym.org/data_files/491
Explore at:
Dataset updated
Jan 25, 2022
Authors
Mario Brosch
License
https://choosealicense.com/no-permission/https://choosealicense.com/no-permission/
Description
HumanDNA methylation data set stored in NCBI (GEO) Data set GSM2819620;

GSM2819620_CV_6610_207 7344_PP_RNA; Homo sapiens; RNA-Seq SEEK Sample: 6610_CV_RRBS

[https://seek.lisym.org/samples/130 ]

This link contains 3 files:

GSM2819620_CV_6610_207.MCSv3.20161129.hs37.cpg.filtered.CG.bed.gz 109.4 Mb (ftp)(http) BED GSM2819620_CV_6610_207.MCSv3.20161129.hs37.cpg.filtered.CG.bw 85.6 Mb (ftp)(http) BW GSM2819620_CV_6610_207.MCSv3.20161129.hs37.cpg.filtered.CG.ct_coverage.bw 86.8 Mb (ftp)(http) BW
NCBI GEO Submission of human whole blood transcriptomes in response to a...
agdatacommons.nal.usda.gov
bin
Updated Mar 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
USDA ARS WHNRC (2025). NCBI GEO Submission of human whole blood transcriptomes in response to a high-fat meal [Dataset]. https://agdatacommons.nal.usda.gov/articles/dataset/NCBI_GEO_Submission_of_human_whole_blood_transcriptomes_in_response_to_a_high-fat_meal/25084385
Explore at:
binAvailable download formats
Dataset updated
Mar 11, 2025
Dataset provided by
National Center for Biotechnology Informationhttp://www.ncbi.nlm.nih.gov/
Authors
USDA ARS WHNRC
License
https://rightsstatements.org/vocab/UND/1.0/https://rightsstatements.org/vocab/UND/1.0/
Description
Modern humans spend most of their time having eaten recently. The purpose of the current project is to understand how the blood, which contains immune cells, responds in the hours after eating a meal that is moderately high in fat. We used a sequencing method to observe the expression of all the genes in blood cells in five participants who were each fed a high fat meal on three separate days. The results are reported in the manuscript, “Temporal changes in postprandial blood transcriptomes reveal subject-specific pattern of expression of innate immunity genes after a high-fat meal." Overall design: We used a sequencing method to observe the expression of all the genes in blood cells in five participants who were each fed a high fat meal on three separate days, resulting in 45 whole blood transcriptomes. For each sample, 3 mL of venous whole blood was drawn into a Tempus Blood RNA tube, shaken vigorously, and then frozen at -80°C until use. Total RNA was purified with the Tempus Spin RNA Isolation Kit with minor modifications to the manufacturer’s protocol. To remove residual genomic DNA, RNA samples were treated on-column with RNase-Free DNase per manufacturer’s instructions. RNA quantity, quality, and integrity were assessed with NanoDrop 1000 and 2100 Bioanalyzer. All isolated RNA had A260/A280 ratios greater than 2 and RNA integrity numbers higher than 7.3. RNA-Seq libraries were constructed at the DNA Technologies and Expression Core at the University of California, Davis, using the Ovation Human Blood RNA-Seq Library System (NuGEN Technologies). Sequencing was performed in a 2x100bp format with 45 samples multiplexed on 3 lanes on an Illumina HiSeq 4000. Analysis of the data is reported in the manuscript, “Temporal changes in postprandial blood transcriptomes reveal subject-specific pattern of expression of innate immunity genes after a high-fat meal.”
Automated Retrieval GEO GE Data MicroarrayRNASeq
kaggle.com
zip
Updated Nov 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dr. Nagendra (2025). Automated Retrieval GEO GE Data MicroarrayRNASeq [Dataset]. https://www.kaggle.com/datasets/mannekuntanagendra/automated-retrieval-geo-ge-data-microarrayrnaseq
Explore at:
zip(2393 bytes)Available download formats
Dataset updated
Nov 29, 2025
Authors
Dr. Nagendra
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
This dataset provides a comprehensive pipeline for automated retrieval of gene expression data.

Supports both Microarray and RNA-Seq datasets from the NCBI GEO database.

Designed for researchers and bioinformaticians to streamline GEO data analysis.

Includes R scripts to download, process, and export GEO datasets efficiently.

Handles metadata extraction, sample annotation, and expression matrix generation.

Facilitates downstream analyses such as differential gene expression and visualization.

Compatible with various GEO platforms, reducing manual data curation efforts.

Enables reproducible research by standardizing data retrieval and processing.

Useful for comparative studies, functional genomics, and biomarker discovery.

Reduces the technical barrier for users unfamiliar with GEO data structures.
Flu vaccinated blood samples
kaggle.com
zip
Updated Jan 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Janis (2020). Flu vaccinated blood samples [Dataset]. https://www.kaggle.com/janiscorona/flu-vaccinated-blood-samples
Explore at:
zip(7111137 bytes)Available download formats
Dataset updated
Jan 9, 2020
Authors
Janis
Description
Context

No matter how much you wash your hands, you are still susceptible to flu airborne viruses or cold viruses in close proximity to others who have a cold or flu. The flu vaccine is a treatment many folks get in hopes of not getting sick that cold/flu season. The flu vaccine is somewhat of a math cheat sheet for your body preparing for a math course final without having to know all of the formulas off hand, but only the ones that are on the exam. If you have a crooked teacher/TA that decided not to allow the cheat sheet to be a good representation of what the content of the final exam is, then you could assume that is how your body will be with a flu vaccine that doesn't have the strand(s) of flu your body is likely to encounter that flu season. I found this data set munging the GEO database sets of NCBI while searching for 'flu vaccines' and wanted some microarray gene expression data sets that I could also compare those values to other blood micro array samples from separate studies on females using EGCG for obesity, and males who do/don't have heart disease. This data can be blended with the other data sets here or in my github repositories at janjanjan2018.

Content

Blood gene expressions of microarray samples.

Acknowledgements

NCBI and the GEO grant funded data repositories of gene expression data.

Inspiration

Sick people.
GDS4399
kaggle.com
zip
Updated Oct 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bassam165 (2025). GDS4399 [Dataset]. https://www.kaggle.com/datasets/bassam165/gds4399
Explore at:
zip(11496559 bytes)Available download formats
Dataset updated
Oct 26, 2025
Authors
Bassam165
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset contains microarray-based gene expression profiles of granulosa cells collected from women diagnosed with Polycystic Ovary Syndrome (PCOS) and from healthy controls. It originates from the NCBI GEO DataSet GDS4399, which was generated to study the molecular mechanisms underlying PCOS pathogenesis and its relationship to insulin resistance, steroidogenesis, and oocyte maturation.

The data were collected using the Affymetrix Human Genome U133 Plus 2.0 Array (GPL570 platform). Each sample corresponds to an RNA expression profile of granulosa cells isolated from ovarian aspirates of PCOS and non-PCOS women undergoing in-vitro fertilization (IVF).

Key Details

NCBI GEO Accession: GDS4399

Source: Gene Expression Omnibus (GEO), NCBI. GEO Accession: GDS4399 Title: Polycystic ovary syndrome: granulosa cells Platform: Affymetrix Human Genome U133 Plus 2.0 Array (GPL570) Authors: Wood JR, et al. (Original study contributors) National Center for Biotechnology Information, U.S. National Library of Medicine. Available at: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GDS4399

Recommended citation style (IEEE): [1] J. R. Wood et al., “Polycystic ovary syndrome: granulosa cells,” Gene Expression Omnibus (GEO), GDS4399, NCBI, Bethesda, MD, USA. [Online]. Available: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GDS4399

License: This dataset is part of the public NCBI GEO database and is distributed under the Public Domain / CC0 License for research and educational use. Please cite the original GEO entry when reusing this dataset.
f
Data from: Proteomics-Based Approach Reveals the Involvement of SERPINB9 in...
acs.figshare.com
datasetcatalog.nlm.nih.gov
xlsx
Updated Jun 5, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yao Chen; Lina Quan; Chuiming Jia; Yiwei Guo; Xinya Wang; Yu Zhang; Yan Jin; Aichun Liu (2023). Proteomics-Based Approach Reveals the Involvement of SERPINB9 in Recurrent and Relapsed Multiple Myeloma [Dataset]. http://doi.org/10.1021/acs.jproteome.1c00007.s009
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.1021/acs.jproteome.1c00007.s009
Dataset updated
Jun 5, 2023
Dataset provided by
ACS Publications
Authors
Yao Chen; Lina Quan; Chuiming Jia; Yiwei Guo; Xinya Wang; Yu Zhang; Yan Jin; Aichun Liu
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Multiple myeloma (MM) is a common hematological malignancy with poorly understood recurrence and relapse mechanisms. Notably, bortezomib resistance leading to relapse makes MM treatment significantly challenging. To clarify the drug resistance mechanism, we employed a quantitative proteomics approach to identify differentially expressed protein candidates implicated in bortezomib-resistant recurrent and relapsed MM (RRMM). Bone marrow aspirates from five patients newly diagnosed with MM (NDMM) were compared with those from five patients diagnosed with bortezomib-resistant RRMM using tandem mass tag-mass spectrometry (TMT-MS). Subcellular localization and functional classification of the differentially expressed proteins were determined by gene ontology, Kyoto Encyclopedia of Genes and Genomes pathway, and hierarchical clustering analyses. The top candidates identified were validated with parallel reaction monitoring (PRM) analysis using tissue samples from 11 NDMM and 8 RRMM patients, followed by comparison with the NCBI Gene Expression Omnibus (GEO) dataset of 10 MM patients and 10 healthy controls (accession no.: GSE80608). Thirty-four differentially expressed proteins in RRMM, including proteinase inhibitor 9 (SERPINB9), were identified by TMT-MS. Subsequent functional enrichment analyses of the identified protein candidates indicated their involvement in regulating cellular metabolism, apoptosis, programmed cell death, lymphocyte-mediated immunity, and defense response pathways in RRMM. The top protein candidate SERPINB9 was confirmed by PRM analysis and western blotting as well as by comparison with an NCBI GEO dataset. We elucidated the proteome landscape of bortezomib-resistant RRMM and identified SERPINB9 as a promising novel therapeutic target. Our results provide a resource for future studies on the mechanism of RRMM.
MOESM2 of Improved cell composition deconvolution method of bulk gene...
springernature.figshare.com
xlsx
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yen-Jung Chiu; Yi-Hsuan Hsieh; Yen-Hua Huang (2023). MOESM2 of Improved cell composition deconvolution method of bulk gene expression profiles to quantify subsets of immune cells [Dataset]. http://doi.org/10.6084/m9.figshare.11415000.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.11415000.v1
Dataset updated
May 31, 2023
Dataset provided by
figshare
Figsharehttp://figshare.com/
Authors
Yen-Jung Chiu; Yi-Hsuan Hsieh; Yen-Hua Huang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Additional file 2: Table S1. The microarray samples downloaded from NCBI GEO as the raw data to be used as the reference gene expression profiles, each being annotated with its source NCBI GEO sample accession numbers (GSM). Table S2. The reference gene expression signature (RefGES) matrix created in this study. Table S3. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the simulated bulk tissues with 30% expression levels from breast tissues and 70% from immune cells. Table S4. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the simulated bulk tissues with 50% expression levels from breast tissues and 50% from immune cells. Table S5. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the simulated bulk tissues with 70% expression levels from breast tissues and 30% from immune cells. Table S6. The mapping of the cell types of NCBI GEO GSE65133 to those of LM22 (CIBERSORT) and the RefGES used in this study. Table S7. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the 20 human PBMC samples of NCBI GEO GSE65133. Table S8. The mapping of the cell types of NCBI GEO GSE106898 to those of LM22 (CIBERSORT) and the RefGES used in this study. Table S9. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the 12 human PBMC samples of NCBI GEO GSE106898. Table S10. The mapping of the cell types of NCBI GEO GSE107990 to those of LM22 (CIBERSORT) and the RefGES used in this study. Table S11. The cumulative percentages of observations for the difference between predictions and real values in the benchmark using the 164 human PBMC samples of NCBI GEO GSE107990.
NCBI accession numbers and related metadata from a study of transcriptomic...
search.datacite.org
bco-dmo.org
+1more
Updated Jul 31, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kristen Whalen; Elizabeth Harvey (2020). NCBI accession numbers and related metadata from a study of transcriptomic response of Emiliania huxleyi to 2-heptyl-4-quinolone (HHQ) [Dataset]. http://doi.org/10.26008/1912/bco-dmo.773272.1
Explore at:
Unique identifier
https://doi.org/10.26008/1912/bco-dmo.773272.1
Dataset updated
Jul 31, 2020
Dataset provided by
DataCite
Biological and Chemical Oceanography Data Management Office (BCO-DMO)
Authors
Kristen Whalen; Elizabeth Harvey
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset funded by
NSF Division of Ocean Sciences
Description
NCBI accession numbers and related metadata from a study of transcriptomic response of Emiliania huxleyi to 2-heptyl-4-quinolone (HHQ). Sequences from this study are available at the NCBI GEO under accession series GSE131846 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?&acc=GSE131846
Taxol Drug Resistance cell lines in Breast Cancer
kaggle.com
zip
Updated Apr 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ali Abedi Madiseh (2023). Taxol Drug Resistance cell lines in Breast Cancer [Dataset]. https://www.kaggle.com/datasets/aliabedimadiseh/taxol-drug-resistance-cell-lines-in-breast-cancer/discussion
Explore at:
zip(247688 bytes)Available download formats
Dataset updated
Apr 12, 2023
Authors
Ali Abedi Madiseh
License
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
Description
This dataset collected from NCBI - GEO datasets: - GSE144113 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE144113) - GSE76200 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE76200) - GSE12791 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE12791)

These datasets include four paclitaxel-resistant cell lines which includes BAS, HS578T, MCF7 and MDA-MB-231.

Gene expression analysis was performed using R in each of the datasets, which was between control cells and drug-resistant cells. And using different Bioinformatics databases, they were converted into gene symbols. Genes with a p-value of less than 0.05 were also removed.
n
Extended data tables to Haering and Habermann, F1000Res, RNfuzzyApp: an R...
data.niaid.nih.gov
search.dataone.org
+1more
zip
Updated Jul 9, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bianca Habermann; Margaux Haering (2021). Extended data tables to Haering and Habermann, F1000Res, RNfuzzyApp: an R shiny RNA-seq data analysis app for visualisation, differential expression analysis, time-series clustering and enrichment analysis [Dataset]. http://doi.org/10.5061/dryad.8pk0p2nnd
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5061/dryad.8pk0p2nnd
Dataset updated
Jul 9, 2021
Dataset provided by
Institut de Biologie du Développement Marseille
Authors
Bianca Habermann; Margaux Haering
License
https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html
Description
Background

RNA-seq is a widely adopted affordable method for large scale gene expression profiling. However, user-friendly and versatile tools for wet-lab biologists to analyse RNA-seq data beyond standard analyses such as differential expression, are rare. Especially, the analysis of time-series data is difficult for wet-lab biologists lacking advanced computational training. Furthermore, most meta-analysis tools are tailored for model organisms and not easily adaptable to other species.

Results

With RNfuzzyApp, we provide a user-friendly, web-based R-shiny app for differential expression analysis, as well as time-series analysis of RNA-seq data. RNfuzzyApp offers several methods for normalization and differential expression analysis of RNA-seq data, providing easy-to-use toolboxes, interactive plots and downloadable results. For time-series analysis, RNfuzzyApp presents the first web-based, automated pipeline for soft clustering with the Mfuzz R package, including methods to aid in cluster number selection, Mfuzz loop computations, cluster overlap analysis, as well as cluster enrichments.

Conclusion

RNfuzzyApp is an intuitive, easy to use and interactive R shiny app for RNA-seq differential expression and time-series analysis, offering a rich selection of interactive plots, providing a quick overview of raw data and generating rapid analysis results. Furthermore, its orthology assignment, enrichment analysis, as well as ID conversion functions are accessible to non-model organisms.

Methods Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt: mean values calculated from raw reads of replicates, downloaded from gene expression omnibus (dataset GSE143430 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE143430).

Haering_etal_extendedDatatable_1a_Tabulamurissenis_3vs12m_DEA.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_1b_Tabulamurissenis_3vs27m_DEA.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_1c_Tabulamurissenis_12vs27m_DEA.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_1d_Tabulamurissenis_3vs12m_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_1e_Tabulamurissenis_3vs27m_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_1f_Tabulamurissenis_12vs27m_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_2a_Tabulamurissenis_cluster1_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_2b_Tabulamurissenis_cluster2_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_2c_Tabulamurissenis_cluster3_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_2d_Tabulamurissenis_cluster4_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_2e_Tabulamurissenis_cluster5_gpofiler.txt: Tabula muris senis limb muscle data (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE132040) from 3, 12 and 27month males, processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3a_DmLeg_cluster1_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3b_DmLeg_cluster2_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3c_DmLeg_cluster3_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3d_DmLeg_cluster4_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3e_DmLeg_cluster5_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3f_DmLeg_cluster6_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3g_DmLeg_cluster7_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3h_DmLeg_cluster8_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3i_DmLeg_cluster9_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3j_DmLeg_cluster10_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3k_DmLeg_cluster11_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)

Haering_etal_extendedDatatable_3l_DmLeg_cluster12_gpofiler.txt: Haering_etal_extendedData_DmdevLeg_GSE143430_mean.txt processed with RNfuzzyApp (https://gitlab.com/habermann_lab/rna-seq-analysis-app)
Identification of CTLA2A, DEFB29, WFDC15B, SERPINA1F and MUP19 as Novel...
plos.figshare.com
datasetcatalog.nlm.nih.gov
docx
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jibin Zhang; Jinsoo Ahn; Yeunsu Suh; Seongsoo Hwang; Michael E. Davis; Kichoon Lee (2023). Identification of CTLA2A, DEFB29, WFDC15B, SERPINA1F and MUP19 as Novel Tissue-Specific Secretory Factors in Mouse [Dataset]. http://doi.org/10.1371/journal.pone.0124962
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0124962
Dataset updated
Jun 2, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Jibin Zhang; Jinsoo Ahn; Yeunsu Suh; Seongsoo Hwang; Michael E. Davis; Kichoon Lee
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Secretory factors in animals play an important role in communication between different cells, tissues and organs. Especially, the secretory factors with specific expression in one tissue may reflect important functions and unique status of that tissue in an organism. In this study, we identified potential tissue-specific secretory factors in the fat, muscle, heart, lung, kidney and liver in the mouse by analyzing microarray data from NCBI’s Gene Expression Omnibus (GEO) public repository and searching and predicting their subcellular location in GeneCards and WoLF PSORT, and then confirmed tissue-specific expression of the genes using semi-quantitative PCR reactions. With this approach, we confirmed 11 lung, 7 liver, 2 heart, 1 heart and muscle, 7 kidney and 2 adipose and liver-specific secretory factors. Among these genes, 1 lung-specific gene - CTLA2A (cytotoxic T lymphocyte-associated protein 2 alpha), 3 kidney-specific genes - SERPINA1F (serpin peptidase inhibitor, Clade A, member 1F), WFDC15B (WAP four-disulfide core domain 15B) and DEFB29 (defensin beta 29) and 1 liver-specific gene - MUP19 (major urinary protein 19) have not been reported as secretory factors. These genes were tagged with hemagglutinin at the 3’end and then transiently transfected to HEK293 cells. Through protein detection in cell lysate and media using Western blotting, we verified secretion of the 5 genes and predicted the potential pathways in which they may participate in the specific tissue through data analysis of GEO profiles. In addition, alternative splicing was detected in transcripts of CTLA2A and SERPINA1F and the corresponding proteins were found not to be secreted in cell culture media. Identification of novel secretory factors through the current study provides a new platform to explore novel secretory factors and a general direction for further study of these genes in the future.
Bootstrap-corrected model discrimination performance in terms of C-index,...
plos.figshare.com
xls
Updated Oct 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Autumn O’Donnell; Michael Cronin; Shirin Moghaddam; Eric Wolsztynski (2024). Bootstrap-corrected model discrimination performance in terms of C-index, and associated 95% bootstrap confidence intervals (CI). [Dataset]. http://doi.org/10.1371/journal.pone.0311162.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0311162.t001
Dataset updated
Oct 2, 2024
Dataset provided by
PLOShttp://plos.org/
Authors
Autumn O’Donnell; Michael Cronin; Shirin Moghaddam; Eric Wolsztynski
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The best-performing pipeline for each modelling strategy is highlighted in bold.