100+ datasets found

Z
Training material for ChIP-seq analysis
data.niaid.nih.gov
data-staging.niaid.nih.gov
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100
Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Johns Hopkins University
Authors
Freeberg, Mallory; Heydarian, Mohammad
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.
Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data
plos.figshare.com
docx
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Timothy Bailey; Pawel Krajewski; Istvan Ladunga; Celine Lefebvre; Qunhua Li; Tao Liu; Pedro Madrigal; Cenny Taslim; Jie Zhang (2023). Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data [Dataset]. http://doi.org/10.1371/journal.pcbi.1003326
Explore at:
docxAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1003326
Dataset updated
Jun 2, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Timothy Bailey; Pawel Krajewski; Istvan Ladunga; Celine Lefebvre; Qunhua Li; Tao Liu; Pedro Madrigal; Cenny Taslim; Jie Zhang
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Mapping the chromosomal locations of transcription factors, nucleosomes, histone modifications, chromatin remodeling enzymes, chaperones, and polymerases is one of the key tasks of modern biology, as evidenced by the Encyclopedia of DNA Elements (ENCODE) Project. To this end, chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the standard methodology. Mapping such protein-DNA interactions in vivo using ChIP-seq presents multiple challenges not only in sample preparation and sequencing but also for computational analysis. Here, we present step-by-step guidelines for the computational analysis of ChIP-seq data. We address all the major steps in the analysis of ChIP-seq data: sequencing depth selection, quality checking, mapping, data normalization, assessment of reproducibility, peak calling, differential binding analysis, controlling the false discovery rate, peak annotation, visualization, and motif analysis. At each step in our guidelines we discuss some of the software tools most frequently used. We also highlight the challenges and problems associated with each step in ChIP-seq data analysis. We present a concise workflow for the analysis of ChIP-seq data in Figure 1 that complements and expands on the recommendations of the ENCODE and modENCODE projects. Each step in the workflow is described in detail in the following sections.
Analysis of ChIP-seq data.
plos.figshare.com
xls
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kimberly R. Blahnik; Lei Dou; Lorigail Echipare; Sushma Iyengar; Henriette O'Geen; Erica Sanchez; Yongjun Zhao; Marco A. Marra; Martin Hirst; Joseph F. Costello; Ian Korf; Peggy J. Farnham (2023). Analysis of ChIP-seq data. [Dataset]. http://doi.org/10.1371/journal.pone.0017121.t001
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0017121.t001
Dataset updated
May 30, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Kimberly R. Blahnik; Lei Dou; Lorigail Echipare; Sushma Iyengar; Henriette O'Geen; Erica Sanchez; Yongjun Zhao; Marco A. Marra; Martin Hirst; Joseph F. Costello; Ian Korf; Peggy J. Farnham
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Shown are the number of peaks called and the total number of bp covered by each peak set for H3K4me3, H3K36me3, and H3K9me3 using the original Sole-search program or the program which has been modified to identify broad regions covered by modified histones. Also shown in the increase in genome coverage (fold difference) that results when using the modified peak calling program. Both the original and the modified program can be accessed at http://chipseq.genomecenter.ucdavis.edu/cgi-bin/chipseq.cgi.
d
Data from: ChIP-Atlas
dknet.org
Updated Jan 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). ChIP-Atlas [Dataset]. http://identifiers.org/RRID:SCR_015511
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_015511
Dataset updated
Jan 29, 2022
Description
Database for visualizing and making use of public ChIP-seq data. ChIP-Atlas covers almost all public ChIP-seq experiments and data submitted to the SRA (Sequence Read Archives) in NCBI, DDBJ, or ENA.
o
Data from: Identification of factors associated with duplicate rate in...
omicsdi.org
Updated Jul 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2023). Identification of factors associated with duplicate rate in ChIP-seq data. [Dataset]. https://www.omicsdi.org/dataset/biostudies/S-EPMC6447195
Explore at:
Dataset updated
Jul 19, 2023
Variables measured
Unknown
Description
Chromatin immunoprecipitation and sequencing (ChIP-seq) has been widely used to map DNA-binding proteins, histone proteins and their modifications. ChIP-seq data contains redundant reads termed duplicates, referring to those mapping to the same genomic location and strand. There are two main sources of duplicates: polymerase chain reaction (PCR) duplicates and natural duplicates. Unlike natural duplicates that represent true signals from sequencing of independent DNA templates, PCR duplicates are artifacts originating from sequencing of identical copies amplified from the same DNA template. In analysis, duplicates are removed from peak calling and signal quantification. Nevertheless, a significant portion of the duplicates is believed to represent true signals. Obviously, removing all duplicates will underestimate the signal level in peaks and impact the identification of signal changes across samples. Therefore, an in-depth evaluation of the impact from duplicate removal is needed. Using eight public ChIP-seq datasets from three narrow-peak and two broad-peak marks, we tried to understand the distribution of duplicates in the genome, the extent by which duplicate removal impacts peak calling and signal estimation, and the factors associated with duplicate level in peaks. The three PCR-free histone H3 lysine 4 trimethylation (H3K4me3) ChIP-seq data had about 40% duplicates and 97% of them were within peaks. For the other datasets generated with PCR amplification of ChIP DNA, as expected, the narrow-peak marks have a much higher proportion of duplicates than the broad-peak marks. We found that duplicates are enriched in peaks and largely represent true signals, more conspicuous in those with high confidence. Furthermore, duplicate level in peaks is strongly correlated with the target enrichment level estimated using nonredundant reads, which provides the basis to properly allocate duplicates between noise and signal. Our analysis supports the feasibility of retaining the portion of signal duplicates into downstream analysis, thus alleviating the limitation of complete deduplication.
m
ChEA Transcription Factor Targets
maayanlab.cloud
gz
Updated Apr 6, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ma'ayan Laboratory of Computational Systems Biology (2015). ChEA Transcription Factor Targets [Dataset]. https://maayanlab.cloud/Harmonizome/dataset/CHEA+Transcription+Factor+Targets
Explore at:
gzAvailable download formats
Dataset updated
Apr 6, 2015
Dataset provided by
Harmonizome
Ma'ayan Laboratory of Computational Systems Biology
Authors
Ma'ayan Laboratory of Computational Systems Biology
Description
Target genes of transcription factors from published ChIP-chip, ChIP-seq, and other transcription factor binding site profiling studies
Z
Datasets for predicting TF binding using Virtual ChIP-seq
data.niaid.nih.gov
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mehran Karimzadeh; Michael M. Hoffman (2020). Datasets for predicting TF binding using Virtual ChIP-seq [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_823296
Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Department of Medical Biophysics, University of Toronto
Authors
Mehran Karimzadeh; Michael M. Hoffman
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This repository contains datasets necessary for using the Virtual ChIP-seq software.

Virtual ChIP-seq requires the following datasets to predict transcription factor binding:

chipExpDir_AtoH_V1.0.0.tar.gz: Reference matrices of correlation between TF binding and gene expression for TFs starting with letters A-H.

chipExpDir_ItoZ_V1.0.0.tar.gz: Reference matrices of correlation between TF binding and gene expression for TFs starting with letters I-Z.

refTables_V1.1.0.tar.gz: PhastCons genomic conservation, FIMO PWM scores for JASPAR motifs, and ChIP-seq data of ENCODE and Cistrome database.

hg38_chrsize.tsv: Length of chromosomes in hg38

trainedModels_V1.0.0.tar.gz: Virtual ChIP-seq scikit-learn trained models saved in joblib format

.tar.gz: Pre-calculated matrices suitable for training with other algorithms or re-training with Virtual ChIP-seq.

Some predictive features of TF binding are the same in each cell type and are stored together for simplicity in refTables_V1.0.0.tar.gz. You can use datasets from other cell types (named here as .tar.gz) for the purpose of re-training the model. The .tar.gz files contain pre-calculated predictive features of transcription factor binding in 4 chromosomes (5, 10, 15, 20).

These features include:

PhastCons genomic conservation

FIMO score for sequence motifs of TF in the JASPAR database

Chromatin accessibility

TF binding in ENCODE + Cistrome DB datasets

Virtual ChIP-seq expression score
Human list of RNA-seq data counts and average of ChIP-seq MACS2 value (HIF1A...
figshare.com
txt
Updated Oct 9, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hidemasa Bono (2019). Human list of RNA-seq data counts and average of ChIP-seq MACS2 value (HIF1A and EPAS1) from meta-analysis of the public NGS database [Dataset]. http://doi.org/10.6084/m9.figshare.9958181.v2
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.9958181.v2
Dataset updated
Oct 9, 2019
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Hidemasa Bono
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The number of experiments in which gene was up/down regulated in RNA-seq data and the average of ChIP-seq MACS2 values of HIF1A and EPAS1(HIF2A) in ChIP-Atlas database.Both were calculated from public NGS database (SRA).For up/donw regulated gene selection, 2 fold threshold was adopted.
Training data for ChIP-seq data analysis (Galaxy Training Material):...
zenodo.org
data.niaid.nih.gov
bin
Updated Jan 24, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning; Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning (2020). Training data for ChIP-seq data analysis (Galaxy Training Material): Identification of the binding sites of the Estrogen receptor [Dataset]. http://doi.org/10.5281/zenodo.892432
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.892432
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning; Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data provided here are part of a Galaxy Training Network tutorial that analyzes ChIP-seq data from a study published by Ross-Inness et al., 2012 (DOI:10.1038/nature10730) to identify the binding sites of the Estrogen receptor, a transcription factor known to be associated with different types of breast cancer.
Data from:...
osdr.nasa.gov
Updated Aug 29, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Clara Bourbousse; Neeraja Vegesna; Julie A Law (2024). The-SOG1-transcriptional-activator-and-the-MyB3R-family-of-repressors-control-a-complex-gene-network-in-response-to-DNA-damage-in-Arabidopsis-SOG1-ChIP-seq [Dataset]. https://osdr.nasa.gov/bio/repo/data/studies/OSD-496
Explore at:
Dataset updated
Aug 29, 2024
Dataset provided by
NASAhttp://nasa.gov/
Authors
Clara Bourbousse; Neeraja Vegesna; Julie A Law
License
Attribution 1.0 (CC BY 1.0)https://creativecommons.org/licenses/by/1.0/
License information was derived automatically
Description
To combat DNA damage, organisms mount a DNA damage response (DDR) that results in cell cycle regulation, DNA repair and, in severe cases, cell death. Underscoring the importance of gene regulation in this response, studies in Arabidopsis have demonstrated that all of the aforementioned processes rely on SUPPRESSOR OF GAMMA RESPONSE 1 (SOG1), a NAC family transcription factor (TF) that has been functionally equated to the mammalian tumor suppressor, p53. However, the expression networks connecting SOG1 to these processes remain largely unknown and, although the DDR spans from minutes to hours, most transcriptomic data correspond to single time-point snapshots. Here, we generated transcriptional models of the DDR from GAMMA (γ)-irradiated wild-type and sog1 seedlings during a 24-hour time course using DREM, the Dynamic Regulatory Events Miner, revealing 11 coexpressed gene groups with distinct biological functions and cis-regulatory features. Within these networks, additional chromatin immunoprecipitation and transcriptomic experiments revealed that SOG1 is the major activator, directly targeting the most strongly up-regulated genes, including TFs, repair factors, and early cell cycle regulators, while three MYB3R TFs are the major repressors, specifically targeting the most strongly down-regulated genes, which mainly correspond to G2/M cell cycle-regulated genes. Together these models reveal the temporal dynamics of the transcriptional events triggered by γ-irradiation and connects these events to TFs and biological processes over a time scale commensurate with key processes coordinated in response to DNA damage, greatly expanding our understanding of the DDR.
f
Summary of MACS analysis of the ChIP-seq data.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Mar 20, 2014
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Arthur, William T.; Biesinger, Jacob; Cleary, Michele; Xie, Xiaohui; Salmans, Michael L.; Roberts, Brian S.; Dai, Xing; Watanabe, Kazuhide; Andersen, Bogi (2014). Summary of MACS analysis of the ChIP-seq data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001221748
Explore at:
Dataset updated
Mar 20, 2014
Authors
Arthur, William T.; Biesinger, Jacob; Cleary, Michele; Xie, Xiaohui; Salmans, Michael L.; Roberts, Brian S.; Dai, Xing; Watanabe, Kazuhide; Andersen, Bogi
Description
Summary of MACS analysis of the ChIP-seq data.
MAGIC: A tool for predicting transcription factors and cofactors driving...
plos.figshare.com
tiff
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Avtar Roopra (2023). MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data [Dataset]. http://doi.org/10.1371/journal.pcbi.1007800
Explore at:
tiffAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1007800
Dataset updated
Jun 1, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Avtar Roopra
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments.
H
Analyzed ChIP-seq Data
dataverse.harvard.edu
dataone.org
Updated Sep 3, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Parisha Shah (2025). Analyzed ChIP-seq Data [Dataset]. http://doi.org/10.7910/DVN/KRPJI1
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Unique identifier
https://doi.org/10.7910/DVN/KRPJI1
Dataset updated
Sep 3, 2025
Dataset provided by
Harvard Dataverse
Authors
Parisha Shah
License
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
See "Read Me" document and "Data Dictionary" file for detailed information. ChIP-seq: processed and ready for visualization a public genome browser (.bigwig).
n
hmChIP
neuinfo.org
rrid.site
+1more
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
hmChIP [Dataset]. http://identifiers.org/RRID:SCR_005407
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_005407
Description
A database of genome-wide chromatin immunoprecipitation (ChIP) data in human and mouse. Currently, the database contains >2000 samples from >500 ChIP-seq and ChIP-chip experiments, representing a total of >170 proteins and >10,000,000 protein-DNA interactions (March 2014). A web server provides an interface for database query. Protein-DNA binding intensities can be retrieved from individual samples for user-provided genomic regions. The retrieved intensities can be used to cluster samples and genomic regions to facilitate exploration of combinatorial patterns, cell type dependencies, and cross-sample variability of protein-DNA interactions.
E
BLUEPRINT release August 2016, ChIP-Seq for CD4-positive, alpha-beta T cell,...
ega-archive.org
Updated Aug 23, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). BLUEPRINT release August 2016, ChIP-Seq for CD4-positive, alpha-beta T cell, on genome GRCh38 [Dataset]. https://ega-archive.org/datasets/EGAD00001002444
Explore at:
Dataset updated
Aug 23, 2016
License
https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135
Description
ChIP-Seq data for 9 CD4-positive, alpha-beta T cell sample(s). 68 run(s), 63 experiment(s), 63 analysis(s) on human genome GRCh38. Part of BLUEPRINT release August 2016. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/releases/20160816/homo_sapiens/README_chipseq_analysis_ebi_20160816
Cdx2 ChIP-seq and Microarray Data
figshare.com
xls
Updated Jul 15, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Guoji Guo (2017). Cdx2 ChIP-seq and Microarray Data [Dataset]. http://doi.org/10.6084/m9.figshare.5203621.v2
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5203621.v2
Dataset updated
Jul 15, 2017
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Guoji Guo
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This data set contains raw sequencing data for Cdx2 ChIP-seq in mouse TS cells as well as raw microarray data for Cdx2 overexpression in mouse ES cells.The ChIP-seq data is generated by Illumina Genome Analyzer.The microarray data is generated by Illumina MouseRef-8_v1_1 Array.
N
ENCODE Transcription Factor Binding Sites by ChIP-seq from...
data.niaid.nih.gov
Updated Nov 10, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Snyder M; Gerstein M; Weissman S; Farnham P; Struhl K (2021). ENCODE Transcription Factor Binding Sites by ChIP-seq from Stanford/Yale/USC/Harvard [Dataset]. https://data.niaid.nih.gov/resources?id=gse31477
Explore at:
Dataset updated
Nov 10, 2021
Dataset provided by
ENCODE DCC
Authors
Snyder M; Gerstein M; Weissman S; Farnham P; Struhl K
Description
This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Philip Cayting mailto:pcayting@stanford.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu).This track shows probable binding sites of the specified transcription factors (TFs) in the given cell types as determined by chromatin immunoprecipitation followed by high throughput sequencing (ChIP-Seq). Included for each cell type is the input signal, which represents the control condition where no antibody targeting was performed. For each experiment (cell type vs. antibody) this track shows a graph of enrichment for TF binding (Signal), along with sites that have the greatest evidence of transcription factor binding (Peaks).For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf Cells were grown according to the approved ENCODE cell culture protocols. Further preparations were similar to those previously published (Euskirchen et al., 2007) with the exceptions that the cells were unstimulated and sodium orthovanadate was omitted from the buffers. For details on the chromatin immunoprecipitation protocol used, see Euskirchen et al. (2007) and Rozowsky et al. (2009).DNA recovered from the precipitated chromatin was sequenced on the Illumina (Solexa) sequencing platform and mapped to the genome using the Eland alignment program. ChIP-seq data was scored based on sequence reads (length ~30 bps) that align uniquely to the human genome. From the mapped tags a signal map of ChIP DNA fragments (average fragment length ~ 200 bp) was constructed where the signal height is the number of overlapping fragments at each nucleotide position in the genome.For each 1 Mb segment of each chromosome a peak height threshold was determined by requiring a false discovery rate <= 0.05 when comparing the number of peaks above threshold as compared the number obtained from multiple simulations of a random null background with the same number of mapped reads (also accounting for the fraction of mapable bases for sequence tags in that 1 Mb segment). The number of mapped tags in a putative binding region is compared to the normalized (normalized by correlating tag counts in genomic 10 kb windows) number of mapped tags in the same region from an input DNA control. Using a binomial test, only regions that have a p-value <= 0.05 are considered to be significantly enriched compared to the input DNA control.
n
CistromeFinder
neuinfo.org
rrid.site
+2more
Updated Mar 19, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2013). CistromeFinder [Dataset]. http://identifiers.org/RRID:SCR_005405
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_005405
Dataset updated
Mar 19, 2013
Description
Data portal that can help query, evaluate and visualize publicly available Chromatin immunoprecipitation and DNase I hypersensitivity assays with high-throughput sequencing data in human and mouse. The database currently contains 6378 samples over 4391 datasets, 313 factors and 102 cell lines or cell populations (May 2013). Each dataset has gone through a consistent analysis and quality control pipeline; therefore, users could evaluate the overall quality of each dataset before examining binding sites near their genes of interest. CistromeFinder is integrated with UCSC genome browser for visualization, Primer3Plus for ChIP-qPCR primer design and CistromeMap for submitting newly available datasets. It also allows users to leave comments to facilitate data evaluation and update.
E
BLUEPRINT release August 2016, ChIP-Seq for mature eosinophil, on genome...
ega-archive.org
Updated Jan 28, 2015
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2015). BLUEPRINT release August 2016, ChIP-Seq for mature eosinophil, on genome GRCh38 [Dataset]. https://ega-archive.org/datasets/EGAD00001002477
Explore at:
Dataset updated
Jan 28, 2015
License
https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135
Description
ChIP-Seq data for 2 mature eosinophil sample(s). 14 run(s), 14 experiment(s), 14 analysis(s) on human genome GRCh38. Part of BLUEPRINT release August 2016. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/releases/20160816/homo_sapiens/README_chipseq_analysis_ebi_20160816
o
Data from: iRNA-seq: Computational method for genome wide assessment of...
omicsdi.org
xml
Updated Jun 17, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Susanne Mandrup,Bjørk D Larsen,Anne Loft,Jesper G Madsen,Ronni Nielsen,Søren F Schmidt (2015). iRNA-seq: Computational method for genome wide assessment of acute transcriptional regulation from total RNA-seq data [Dataset]. https://www.omicsdi.org/dataset/arrayexpress-repository/E-GEOD-60462
Explore at:
xmlAvailable download formats
Dataset updated
Jun 17, 2015
Authors
Susanne Mandrup,Bjørk D Larsen,Anne Loft,Jesper G Madsen,Ronni Nielsen,Søren F Schmidt
Variables measured
Transcriptomics,Genomics,Multiomics
Description
RNA-seq is a sensitive and accurate technique to compare steady state levels of RNA between different cellular states. However, as it does not provide an account of transcriptional activity per se, other technologies are needed to more precisely determine acute transcriptional responses. Here, we have developed an easy, sensitive and accurate novel method, iRNA-seq, for genome-wide assessment of transcriptional activity based on analysis of intron coverage from total RNA-seq data. To test our method, we have performed total RNA-seq and RNA polymerase II (RNAPII) ChIP-seq profiling of the acute transcriptional response of human adipocytes to TNFα treatment and analyzed these using iRNA-seq in addition to different publically availbale dataset. Comparison of the results derived from iRNA-seq analyses with results derived using current methods for genome-wide determination of transcriptional activity, i.e. Global Run-On (GRO)-seq and RNA polymerase II (RNAPII) ChIP-seq, demonstrate that iRNA-seq provides very similar results in terms of number of regulated genes and their fold change. However, unlike the current methods that are all very labor-intensive and demanding in terms of sample material and technologies, iRNA-seq is cheap and easy and requires very little sample material. In conclusion, iRNA-seq offers an attractive novel alternative to current methods for determination of changes in transcriptional activity at a genome-wide level. Genome-wide assesment of the acute transcriptional response to TNFa in human SGBS adiposytes using total RNA-seq data end RNAPII ChIP-seq

Facebook

Twitter

Click to copy link

Link copied

Cite

Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100

Training material for ChIP-seq analysis

Explore at:

Dataset updated

Jan 24, 2020

Dataset provided by

Johns Hopkins University

Authors

Freeberg, Mallory; Heydarian, Mohammad

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.

Clear search

Close search

Google apps

Main menu

Training material for ChIP-seq analysis

Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data

Analysis of ChIP-seq data.

Data from: ChIP-Atlas

Data from: Identification of factors associated with duplicate rate in...

ChEA Transcription Factor Targets

Datasets for predicting TF binding using Virtual ChIP-seq

Human list of RNA-seq data counts and average of ChIP-seq MACS2 value (HIF1A...

Training data for ChIP-seq data analysis (Galaxy Training Material):...

Data from:...

Summary of MACS analysis of the ChIP-seq data.

MAGIC: A tool for predicting transcription factors and cofactors driving...

Analyzed ChIP-seq Data

hmChIP

BLUEPRINT release August 2016, ChIP-Seq for CD4-positive, alpha-beta T cell,...

Cdx2 ChIP-seq and Microarray Data

ENCODE Transcription Factor Binding Sites by ChIP-seq from...

CistromeFinder

BLUEPRINT release August 2016, ChIP-Seq for mature eosinophil, on genome...

Data from: iRNA-seq: Computational method for genome wide assessment of...

Training material for ChIP-seq analysis