Facebook
TwitterNASCArrays is the Nottingham Arabidopsis Stock Centre''s microarray database. Currently most of the data is for Arabidopsis thaliana experiments run by the NASC Affymetrix Facility. There are also experiments from other species, and experiments run by other centres too. NASCArrays is an Affymetrix microarray database. It contains free Affymetrix microarray data, and also features a series of tools allowing you to query that data in powerful ways. Most of the data currently comes from NASC''s Affymetrix Service. It also includes data from other sources, notably the AtGenExpress project. They currently distribute over 30,000 tubes of seed a year. There are currently the following data mining tools available. All of these tools allow you to type in a gene(s) of interest, and identify experiments or slides that you might be interested in: -Spot History: This tool allows you to see the pattern of gene expression over all slides in the database. Easily identify slides (and therefore experimental treatments) where genes are highly, lowly, or unusually expressed -Two gene scatter plot: This tool allows you to see the pattern of gene expression over all slides for two genes as a scatter plot. If you are interested in two genes, you can find out if they act in tandem, and highlight slides (and therefore experimental conditions) where these two genes behave in an unusual manner. -Gene Swinger: If you have a gene of interest, this tool will show you which experiment the gene expression varied most -Bulk Gene Download: This tool allows you to download the expression of a list of genes over all experiments. You can get all genes over all experiments (the entire database!) from the Super Bulk Gene Download Sponsors: This is a BBSRC funded consortium to provide services to the Arabidopsis community.
Facebook
TwitterMicroarray data management and analysis system for NCI / Center for Cancer Research scientists / collaborators. Data is secured and backed up on a regular basis, and investigators can authorize levels of access privileges to their projects, allowing data privacy while still enabling data sharing with collaborators.
Facebook
TwitterAn experiment in web-database access to large multi-dimensional data sets using a standardized experimental platform to determine if the larger scientific community can be given simple, intuitive, and user-friendly web-based access to large microarray data sets. All data in PEPR is also available via NCBI GEO. The structure and goals of PEPR differ from other mRNA expression profiling databases in a number of important ways. * The experimental platform in PEPR is standardized, and is an Affymetrix - only database. All microarrays available in the PEPR web database should ascribe to quality control and standard operating procedures. A recent publication has described the QC/SOP criteria utilized in PEPR profiles ( The Tumor Analysis Best Practices Working Group 2004 ). * PEPR permits gene-based queries of large Affymetrix array data sets without any specialized software. For example, a number of large time series projects are available within PEPR, containing 40-60 microarrays, yet these can be simply queried via a dynamic web interface with no prior knowledge of microarray data analysis. * Projects in PEPR originate from scientists world-wide, but all data has been generated by the Research Center for Genetic Medicine, Children''''s National Medical Center, Washington DC. Future developments of PEPR will allow remote entry of Affymetrix data ascribing to the same QC/SOP protocols. They have previously described an initial implementation of PEPR, and a dynamic web-queried time series graphical interface ( Chen et al. 2004 ). A publication showing the utility of PEPR for pharmacodynamic data has recently been published ( Almon et al. 2003 ).
Facebook
TwitterBackground Many microarray experiments search for genes with differential expression between a common “reference” group and multiple test groups, like in the case of time-course designs or of various treatments versus a control condition. In such cases, currently employed statistical approaches based on t-test or close derivatives have limited efficacy, mostly because estimation of noise is done on only two groups at time. Alternative approaches based on ANOVA correctly capture noise from all the groups, but then do not confront single test groups with the reference. We therefore conceived a statistical test for pairwise comparisons between the reference group and each test group that uses within-group variance calculated from all the groups. Results We implemented an R-Bioconductor package named Mulcom, with a statistical test derived from the Dunnett’s test, designed to compare multiple experimental groups against a common reference. In addition to the basic Dunnett’s t value, the package includes an optional minimal fold-change threshold, m. Thanks to automated, permutation-based estimation of False Discovery Rate (FDR), the package also permits fast optimization of the test, to obtain the maximum number of significant genes at a given FDR value. When applied on a time-course experiment profiled in parallel on two microarray platforms, and compared with currently used tests, Mulcom displayed higher concordance of significant genes in the two array platforms, and higher enrichment in functional annotation to categories related to the biology of the experiment. Conclusions The Mulcom package provides a fast and powerful tool for the identification of differentially expressed genes when several experimental conditions are compared with a common reference. We found that Mulcom leads to lists of differentially expressed genes that are particularly consistent across microarray platforms and enriched in significant classes of genes. In our opinion, the main reasons for these good performances are three: (i) within-group variability is estimated from all experimental groups even if only two of them are compared each time; (ii) the optional fold-change threshold m avoids false positives due to aberrantly low within-group variability; (iii) automated test optimization allows maximizing sensitivity without compromising specificity. Ten MDA-MB-435 samples, biological duplicates of each condition (untreated, integrin Beta4 treatment, hepatocyte growth factor treatment for 1 hr, 6 hrs, or 24 hrs).
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The growth in popularity of RNA expression microarrays has been accompanied by concerns about the reliability of the data especially when comparing between different platforms. Here, we present an evaluation of the reproducibility of microarray results using two platforms, Affymetrix GeneChips and Illumina BeadArrays. The study design is based on a dilution series of two human tissues (blood and placenta), tested in duplicate on each platform. The results of a comparison between the platforms indicate very high agreement, particularly for genes which are predicted to be differentially expressed between the two tissues. Agreement was strongly correlated with the level of expression of a gene. Concordance was also improved when probes on the two platforms could be identified as being likely to target the same set of transcripts of a given gene. These results shed light on the causes or failures of agreement across microarray platforms. The set of probes we found to be most highly reproducible can be used by others to help increase confidence in analyses of other data sets using these platforms.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The prognosis of colorectal cancer (CRC) stage II and III patients remains a challenge due to the difficulties of finding robust biomarkers suitable for testing clinical samples. The majority of published gene signatures of CRC have been generated on fresh frozen colorectal tissues. Because collection of frozen tissue is not practical for routine surgical pathology practice, a clinical test that improves prognostic capabilities beyond standard pathological staging of colon cancer will need to be designed for formalin-fixed paraffin-embedded (FFPE) tissues. The NanoString nCounter® platform is a gene expression analysis tool developed for use with FFPE-derived samples. We designed a custom nCounter® codeset based on elements from multiple published fresh frozen tissue microarray-based prognostic gene signatures for colon cancer, and we used this platform to systematically compare gene expression data from FFPE with matched microarray array data from frozen tissues. Our results show moderate correlation of gene expression between two platforms and discovery of a small subset of genes as candidate biomarkers for colon cancer prognosis that are detectable and quantifiable in FFPE tissue sections.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Sources of Nanostring codeset of 414-gene list.
Facebook
TwitterDatabase characterizing and comparing pluripotent human stem cells. The growth and culture conditions of all 21 human embryonic stem cell lines approved under the August 2001 Presidential Executive Order have been analyzed. Available to the scientific community are the results of our rigorous characterization of these cell lines at a more advanced level.
Facebook
TwitterA publicly accessible database containing data on Affymetrix DNA microarray experiments, and Serial Analysis of Gene Expression, mostly on human and mouse stem cell samples and their derivatives to facilitate the discovery of gene functions relevant to stem cell control and differentiation. It has grown in both size and scope into a system with analysis tools that examine either the whole database at once, or slices of data, based on tissue type, cell type or gene of interest. There is currently more than 210 stem cell samples in 60 different experiments, with more being added regularly. The samples were originated by researchers of the Stem Cell Network and processed at the Core Facility of Stemcore Laboratories under the management of Ms. Pearl Campbell in the frame of the Stem Cell Genomics Project. Periodically, new expression data is submitted to the Gene Expression Omnibus (GEO) repository at the National Center for Biotechnological Information, in order to allow researchers to compare the data deposited in StemBase to a large amount of gene expression data sets. StemBase is different from GEO in both focus and scope. StemBase is concerned exclusively with stem cell related data. we are focused in Stem Cell research. We have made a significant effort to ensure the quality and consistency of the data included. This allows us to offer more specialized analysis tools related to Stem Cell data. GEO is intended as a large scale public archive. Deposition in a public repository such as GEO is required by most important scientific journals and it is advantageous for a further diffusion of the data since GEO is more broadly used than StemBase.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
aMean Control and Mean SL data are averages of three samples each and are expressed in quantile (log2 (RPKM+2)) units.Transcriptional and Translational Regulator mRNAs increased in expression by SL.
Facebook
Twitterhttps://ega-archive.org/dacs/EGAC00001001661https://ega-archive.org/dacs/EGAC00001001661
We retrospectively collected 150 non-metastatic, pretreatment, formalin-fixed, paraffin-embedded (FFPE) nasopharyngeal carcinoma (NPC) samples as validation cohort 1. Also, we prospectively collected 32 FFPE samples from NPC patients enrolled in a trial evaluating anti-PD-1 antibody as validation cohort 2. Total RNA was extracted and hybridised to an Affymetrix HTA 2.0 microarray. In this study, we investigated the immune status of the tumour microenvironment (TME) based on gene expression profiles to classify NPC into biologically distinct immune subtypes, and clarify their associations with prognosis and immunotherapy response.
Facebook
TwitterTHIS RESOURCE IS NO LONGER IN SERVICE, Documented on March 24, 2014. A resource for gene expression studies, storing highly curated MIAME-compliant studies (i.e. experiments) employing a variety of technologies such as filter arrays, 2-channel microarrays, Affymetrix chips, SAGE, MPSS and RT-PCR. Data were available for querying and downloading based on the MGED ontology, publications or genes. Both public and private studies (the latter viewable only by users having appropriate logins and permissions) were available from this website. Specific details on protocols, biomaterials, study designs, etc., are collected through a user-friendly suite of web annotation forms. Software has been developed to generate MAGE-ML documents to enable easy export of studies stored in RAD to any other database accepting data in this format. RAD is part of a more general Genomics Unified Schema (http://gusdb.org), which includes a richly annotated gene index (http://allgenes.org), thus providing a platform that integrates genomic and transcriptomic data from multiple organisms. NOTE: Due to changes in technology and funding, the RAD website is no longer available. RAD as a schema is still very much active and incorporated in the GUS (Genomics Unified Schema) database system used by CBIL (EuPathDB, Beta Cell Genomics) and others. The schema for RAD can be viewed along with the other GUS namespaces through our Schema Browser.
Facebook
TwitterThe Stanley Online Genomics Database uses samples from the Stanley Medical Research Institute (SMRI) Brain Bank. These samples were processed and run on gene expression arrays by a variety of researchers in collaboration with the SMRI. These researchers have performed analyses on their respective studies using a range of analytic approaches. All of the genomic data have been aggregated in this online database, and a consistent set of analyses have been applied to each study. Additionally, a comprehensive set of cross-study analyses have been performed. A thorough collection of gene expression summaries are provided, inclusive of patient demographics, disease subclasses, regulated biological pathways, and functional classifications. Raw data is also available to download. The database is derived from two sets of brain samples, the Stanley Array collection and the Stanley Consortium collection. The Stanley Array collection contains 105 patients, and the Stanley Consortium collection contains 60 patients. Multiple genomic studies have been conducted using these brain samples. From these studies, twelve were selected for inclusion in the database on the basis of number of patients studied, genomic platform used, and data quality. The Consortium collection studies have fewer patients but more diversity in brain regions and array platforms, while the Array collection studies are more homogenous. There are tradeoffs, the Consortium results will be more variable, but findings may be more broadly representative. The collections contain brain samples from subjects in four main groups: Bipolar Schizophrenia, Depression, and Controls Brain regions used in the studies include: Broadman Area 6, Broadman Area 8/9, Broadman Area 10, Broadman Area 46, Cerebellum The 12 studies encompass a range of microarray platforms: Affymetrix HG-U95Av2, Affymetrix HG-U133A, Affymetrix HG-U133 2.0+, Codelink Human 20K, Agilent Human I, Custom cDNA Publications based on any of the clinical or genomic data should credit the Stanley Medical Research Institute, as well as any individual SMRI collaborators whose data is being used. Publications which make use of analytic results/methods in the database should additionally cite Dr. Michael Elashoff. Registration is required to access the data.
Facebook
TwitterA causal role of mutations in genes encoding for multiple general transcription factors in neurodevelopmental disorders including autism suggested that alterations at the global level of gene expression regulation might also relate to disease risk in sporadic cases of autism. This premise can be tested by evaluating for global changes in the overall distribution of gene expression levels. For instance, in mice, we recently showed that variability in hippocampal-dependent behaviors was associated with variability in the pattern of the overall distribution of gene expression levels, as assessed by variance in the distribution of gene expression levels in the hippocampus. We hypothesized that a similar change in the variance in gene expression levels might be found in children with autism. Gene expression microarrays covering greater than 47,000 unique RNA transcripts were done on purified RNA from peripheral blood lymphocytes of children with autism (n=82) and controls (n=64). The variance in the distribution of gene expression levels from each microarray was compared between groups of children. Also tested was whether a risk factor for autism, increased paternal age, was associated with variance in the overall distribution of gene expression levels. A decrease in the variance in the distribution of gene expression levels in peripheral blood lymphocytes (PBL) was associated with the diagnosis of autism and a risk factor for autism, increased paternal age. Traditional approaches to microarray analysis of gene expression suggested a possible mechanism for decreased variance in gene expression. Gene expression pathways involved in transcriptional regulation were down-regulated in the blood of children with autism and children of older fathers. Thus, results from global and gene specific approaches to studying microarray data were complimentary and supported the hypothesis that alterations at the global level of gene expression regulation are related to autism and increased paternal age. Regulation of transcription, thus, represents a possible point of convergence for multiple etiologies of autism and other neurodevelopmental disorders. The study was designed to compare gene expression profiles in peripheral blood lymphocytes of children with autism (n=82) and controls(n=64). Expression profiling: Expression profiling was performed at Translational Genomics (TGen), a member of the NIMH Neuroscience Microarray Consortium. Total RNA was extracted from peripheral blood lymphocytes (PBL) within 30 minutes of the blood draw using the Qiagen Qiaquick kit (Germantown, MD). Isolated total RNA was double round amplified, cleaned, and biotin-labeled using Affymetrix’s GeneChip Two-Cycle Target Labeling kit (Santa Clara, CA) with a T7 promoter and Ambion’s MEGAscript T7 High Yield Transcription kit (Austin, TX) as per manufacturer’s protocol. Amplified and labeled cRNA was quantified on a spectrophotometer and run on a 1% TAE gel to check for an evenly distributed range of transcript sizes. Twenty micrograms of cRNA was fragmented to approximately 35-200bp by alkaline treatment (200 mM Tris-acetate, pH 8.2, 500 mM KOAc, 150 mM MgOAc) and run on a 1% TAE gell to verify fragmentation. Separate hybridization cocktails were made using 15 micrograms of fragmented cRNA from each sample as per Affymetrix’s protocol. Two hundred microliters (containing 10 micrograms of fragmented cRNA) of each cocktail was separately hybridized to an Affymetrix Human Genome U133 Plus 2.0 Array for 16h at 45 degree Celsius in the Hybridization Oven 640. The Affymetrix Human Genome Arrays measure the expression of over 47,000 transcripts and variants, including 38,500 characterized human genes. Arrays were washed on Affymetrix’s GeneChip Fluidics Station 450 using a primary streptavidin phycoerythrin (SAPE) stain, subsequent biotinylated antibody stain, and secondary SAPE stain. Arrays were scanned on Affymetrix’s GeneChip Scanner 3000 7G with AutoLoader. Scanned images obtained by the Affymetrix GeneChip Operating Software (GCOS) v1.2 were used to extract raw signal intensity values per probe set on the array. A scaling factor of 150 was used to normalize array signal intensity in MAS 5.0. Arrays were scanned over 1 day on 2 different machines. Arrays scanned on the same machine and in the same day were considered to be from the same scan batch. Rescanning of a limited number of samples indicated that there were no significant differences between machines, nonetheless, for all comparisons groups were balanced for the scan batch. Gene expression levels were not adjusted for possible batch effects as algorithms that attempt to adjust for batch effects also alter the gene expression distribution. When samples could not be prepped simultaneously they were balanced for group membership (autism vs. control). To statistically control for possible confounds related to scan batches in our analysis of gene expression variance, batch number was entered into an analysis of covariance. For traditional analysis of gene expression, experimental groups were balanced with respect to batch membership. Microarray data analysis: Affymetrix .cel files were imported into Affymetrix Expression Console version 1.1. Data was pre-processed and summarized by Microarray Analysis Suite (MAS) 5.0 and Robust Multiarray Analysis (RMA). For the analysis of gene expression distributions, MAS 5.0 was used because the algorithm does not alter the gene expression distribution, whereas, RMA utilizes quantile normalization of probes prior to summarization and, therefore, has the potential to remove group level differences in gene expression distributions. Because of the numerous advantages in its handling of noise in gene expression and background subtraction, RMA was used for traditional gene expression analyses looking for specific gene expression differences between groups. Because, we found group level differences in the distribution of gene expression levels between groups, for traditional gene expression analyses summarized gene expression levels were also quantile normalized after the summarization step. Quantile normalization adjusts all data sets such that they have identical distribution patterns. Probesets were then filtered for those that were called present in at least 50 out of the 146 subjects (n = 25,146 probesets). A p-value of .05 was used as a threshold for significance. A fold-change of 1.1 was used as a cut off for magnitude of change. All microarrays met manufacturers recommended quality control criteria. Present calls ranged from 37.4% to 49%, mean 43.7%, SD 2.7%. Actin 3’to5’ ratios ranged from .726 to 5.15, mean1.37, SD 0.5. There were no significant group level differences in quality control measures.
Facebook
TwitterDrugMatrix is a comprehensive rat toxicogenomics database and analysis tool developed to facilitate the integration of toxicogenomics into hazard assessment. Using the whole genome and a diverse set of compounds allows a comprehensive view of most pharmacological and toxicological questions and is applicable to other situations such as disease and development. Complete Drug Matrix dataset for rat heart. Approximately 600 different compounds were profiled in up to 7 different rat tissues by obtaining tissue samples from test compound-treated and vehicle control-treated rats in biological triplicates for gene expression analysis after 0.25, 1, 3, and 5 days of exposure with daily dosing. In a few studies (1.8%), 7 days of exposure was substituted for 5 days of exposure. Samples were hybridized to whole genome RG230_2.0 GeneChip arrays (Affymetrix, CA).
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Affymetrix microarray processing produced 16 (4 WT cell, 4 101LL cell, 4 WT tissue, 4 101LL tissue) probe cell intensity data (CEL) files. Primary hippocampal cell cultures are routinely used as an experimentally accessible model platform for the hippocampus and brain tissue in general. Containing multiple cell types including neurons, astrocytes and microglia in a state that can be readily analysed optically, biochemically and electrophysiologically, such cultures have been used in many in vitro studies. To what extent the in vivo environment is recapitulated in primary cultures in an on-going question. Here we compare the transcriptomic profiles of primary hippocampal cell cultures and intact hippocampal tissue. In addition, by comparing profiles from wild type and the PrP 101LL transgenic model of prion disease, we also demonstrate that gene conservation is predominantly conserved across genetically altered lines.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
aMean Control and Mean SL data are averages of three samples each and are expressed in quantile (log2 (RPKM+2)) units.Further statistical analysis of the RNA-Seq data shown in S7 Table is presented in S9 Table where 552 genes are selected as having the most reliable changes in expression as a result of SL treatment. Fig 2 shows the functional types and subcellular locations of these 552 genes in the SON.Other mRNAs in SON that are decreased in expression by SL.
Facebook
TwitterMicroarray raw data used in the analysis of the impact of 6 month conjuagted equine estrogen versus estradiol-treatment on biomarkers and enriched gene sets in healthy mammary tissue of non-human primates (Macaca fascicularis). After extraction of RNA was converted into cDNA and then biotinylated cRNA. Biotinylated cRNA was fragmented and hybridized to an Affymetrix GeneChip Rhesus Macaque Genome Arrays (Affymetrix, Santa Clara, CA), which were in turn stained, washed and scanned. GeneChip scan files were processed with Expression Console to produce probe set analysis results using the MAS5 algorithm (Affymetrix, Santa Clara, CA). RNA quality control and microarray assays were performed by EA, Quantiles Company (Morrisville, NC). Breast biopsies of the CEE group and their respective controls were treated similarly in 2012 during the trial of Ethun et al.
Facebook
TwitterFormalin fixed paraffin-embedded (FFPE) tumor specimens are the conventionally archived material in clinical practice, representing an invaluable tissue source for biomarkers development, validation and routine implementation. For many prospective clinical trials, this material has been collected allowing for a prospective-retrospective study design which represents a successful strategy to define clinical utility for candidate markers. Gene expression data can be obtained even from FFPE specimens with the broadly used Affymetrix HG-U133 Plus 2.0 microarray platform. Nevertheless, important major discrepancies remain in expression data obtained from FFPE compared to fresh-frozen samples, prompting the need for appropriate data processing which could help to obtain more consistent results in downstream analyses. In a publicly available dataset of matched frozen and FFPE expression data, the performances of different normalization methods and specifically designed Chip Description Files (CDFs) were compared. The use of an alternative CDFs together with fRMA normalization significantly improved frozen-FFPE sample correlations, frozen-FFPE probeset correlations and agreement of differential analysis between different tumor subtypes. The relevance of our optimized data processing was assessed and validated using two independent datasets. In this study we demonstrated that an appropriate data processing can significantly improve the reliability of gene expression data derived from FFPE tissues using the standard Affymetrix platform. Tools for the implementation of our data processing algorithm are made publicly available at http://www.biocut.unito.it/cdf-ffpe/.
Facebook
TwitterA microarray platform that uses the Illumina and Affymetrix GeneChip microarray technology for genome and transcriptome analyses and a web-based database that consists exclusively of high quality Affymetrix data from immunological experiments hosted by a public, non-profit consortium of three scientific public institutions aimed to develop, integrate and disseminate Functional Genomics.
Facebook
TwitterNASCArrays is the Nottingham Arabidopsis Stock Centre''s microarray database. Currently most of the data is for Arabidopsis thaliana experiments run by the NASC Affymetrix Facility. There are also experiments from other species, and experiments run by other centres too. NASCArrays is an Affymetrix microarray database. It contains free Affymetrix microarray data, and also features a series of tools allowing you to query that data in powerful ways. Most of the data currently comes from NASC''s Affymetrix Service. It also includes data from other sources, notably the AtGenExpress project. They currently distribute over 30,000 tubes of seed a year. There are currently the following data mining tools available. All of these tools allow you to type in a gene(s) of interest, and identify experiments or slides that you might be interested in: -Spot History: This tool allows you to see the pattern of gene expression over all slides in the database. Easily identify slides (and therefore experimental treatments) where genes are highly, lowly, or unusually expressed -Two gene scatter plot: This tool allows you to see the pattern of gene expression over all slides for two genes as a scatter plot. If you are interested in two genes, you can find out if they act in tandem, and highlight slides (and therefore experimental conditions) where these two genes behave in an unusual manner. -Gene Swinger: If you have a gene of interest, this tool will show you which experiment the gene expression varied most -Bulk Gene Download: This tool allows you to download the expression of a list of genes over all experiments. You can get all genes over all experiments (the entire database!) from the Super Bulk Gene Download Sponsors: This is a BBSRC funded consortium to provide services to the Arabidopsis community.