100+ datasets found
  1. Z

    Training material for ChIP-seq analysis

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Johns Hopkins University
    Authors
    Freeberg, Mallory; Heydarian, Mohammad
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.

  2. Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data

    • plos.figshare.com
    docx
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Timothy Bailey; Pawel Krajewski; Istvan Ladunga; Celine Lefebvre; Qunhua Li; Tao Liu; Pedro Madrigal; Cenny Taslim; Jie Zhang (2023). Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data [Dataset]. http://doi.org/10.1371/journal.pcbi.1003326
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Timothy Bailey; Pawel Krajewski; Istvan Ladunga; Celine Lefebvre; Qunhua Li; Tao Liu; Pedro Madrigal; Cenny Taslim; Jie Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mapping the chromosomal locations of transcription factors, nucleosomes, histone modifications, chromatin remodeling enzymes, chaperones, and polymerases is one of the key tasks of modern biology, as evidenced by the Encyclopedia of DNA Elements (ENCODE) Project. To this end, chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is the standard methodology. Mapping such protein-DNA interactions in vivo using ChIP-seq presents multiple challenges not only in sample preparation and sequencing but also for computational analysis. Here, we present step-by-step guidelines for the computational analysis of ChIP-seq data. We address all the major steps in the analysis of ChIP-seq data: sequencing depth selection, quality checking, mapping, data normalization, assessment of reproducibility, peak calling, differential binding analysis, controlling the false discovery rate, peak annotation, visualization, and motif analysis. At each step in our guidelines we discuss some of the software tools most frequently used. We also highlight the challenges and problems associated with each step in ChIP-seq data analysis. We present a concise workflow for the analysis of ChIP-seq data in Figure 1 that complements and expands on the recommendations of the ENCODE and modENCODE projects. Each step in the workflow is described in detail in the following sections.

  3. Analysis of ChIP-seq data.

    • plos.figshare.com
    xls
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kimberly R. Blahnik; Lei Dou; Lorigail Echipare; Sushma Iyengar; Henriette O'Geen; Erica Sanchez; Yongjun Zhao; Marco A. Marra; Martin Hirst; Joseph F. Costello; Ian Korf; Peggy J. Farnham (2023). Analysis of ChIP-seq data. [Dataset]. http://doi.org/10.1371/journal.pone.0017121.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Kimberly R. Blahnik; Lei Dou; Lorigail Echipare; Sushma Iyengar; Henriette O'Geen; Erica Sanchez; Yongjun Zhao; Marco A. Marra; Martin Hirst; Joseph F. Costello; Ian Korf; Peggy J. Farnham
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Shown are the number of peaks called and the total number of bp covered by each peak set for H3K4me3, H3K36me3, and H3K9me3 using the original Sole-search program or the program which has been modified to identify broad regions covered by modified histones. Also shown in the increase in genome coverage (fold difference) that results when using the modified peak calling program. Both the original and the modified program can be accessed at http://chipseq.genomecenter.ucdavis.edu/cgi-bin/chipseq.cgi.

  4. Training data for ChIP-seq data analysis (Galaxy Training Material):...

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning; Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning (2020). Training data for ChIP-seq data analysis (Galaxy Training Material): Identification of the binding sites of the Estrogen receptor [Dataset]. http://doi.org/10.5281/zenodo.892432
    Explore at:
    binAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning; Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data provided here are part of a Galaxy Training Network tutorial that analyzes ChIP-seq data from a study published by Ross-Inness et al., 2012 (DOI:10.1038/nature10730) to identify the binding sites of the Estrogen receptor, a transcription factor known to be associated with different types of breast cancer.

  5. Discovering Transcription Factor Binding Sites in Highly Repetitive Regions...

    • plos.figshare.com
    pdf
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dongjun Chung; Pei Fen Kuan; Bo Li; Rajendran Sanalkumar; Kun Liang; Emery H. Bresnick; Colin Dewey; Sündüz Keleş (2023). Discovering Transcription Factor Binding Sites in Highly Repetitive Regions of Genomes with Multi-Read Analysis of ChIP-Seq Data [Dataset]. http://doi.org/10.1371/journal.pcbi.1002111
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Dongjun Chung; Pei Fen Kuan; Bo Li; Rajendran Sanalkumar; Kun Liang; Emery H. Bresnick; Colin Dewey; Sündüz Keleş
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is rapidly replacing chromatin immunoprecipitation combined with genome-wide tiling array analysis (ChIP-chip) as the preferred approach for mapping transcription-factor binding sites and chromatin modifications. The state of the art for analyzing ChIP-seq data relies on using only reads that map uniquely to a relevant reference genome (uni-reads). This can lead to the omission of up to 30% of alignable reads. We describe a general approach for utilizing reads that map to multiple locations on the reference genome (multi-reads). Our approach is based on allocating multi-reads as fractional counts using a weighted alignment scheme. Using human STAT1 and mouse GATA1 ChIP-seq datasets, we illustrate that incorporation of multi-reads significantly increases sequencing depths, leads to detection of novel peaks that are not otherwise identifiable with uni-reads, and improves detection of peaks in mappable regions. We investigate various genome-wide characteristics of peaks detected only by utilization of multi-reads via computational experiments. Overall, peaks from multi-read analysis have similar characteristics to peaks that are identified by uni-reads except that the majority of them reside in segmental duplications. We further validate a number of GATA1 multi-read only peaks by independent quantitative real-time ChIP analysis and identify novel target genes of GATA1. These computational and experimental results establish that multi-reads can be of critical importance for studying transcription factor binding in highly repetitive regions of genomes with ChIP-seq experiments.

  6. f

    Summary of MACS analysis of the ChIP-seq data.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Mar 20, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arthur, William T.; Biesinger, Jacob; Cleary, Michele; Xie, Xiaohui; Salmans, Michael L.; Roberts, Brian S.; Dai, Xing; Watanabe, Kazuhide; Andersen, Bogi (2014). Summary of MACS analysis of the ChIP-seq data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001221748
    Explore at:
    Dataset updated
    Mar 20, 2014
    Authors
    Arthur, William T.; Biesinger, Jacob; Cleary, Michele; Xie, Xiaohui; Salmans, Michael L.; Roberts, Brian S.; Dai, Xing; Watanabe, Kazuhide; Andersen, Bogi
    Description

    Summary of MACS analysis of the ChIP-seq data.

  7. f

    DataSheet1_CSSQ: a ChIP-seq signal quantifier pipeline.pdf

    • datasetcatalog.nlm.nih.gov
    • frontiersin.figshare.com
    Updated May 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hu, Michael Y.; Mei, Yajun; Fan, Yuhong; Kumar, Ashwath (2023). DataSheet1_CSSQ: a ChIP-seq signal quantifier pipeline.pdf [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001106359
    Explore at:
    Dataset updated
    May 25, 2023
    Authors
    Hu, Michael Y.; Mei, Yajun; Fan, Yuhong; Kumar, Ashwath
    Description

    Chromatin immunoprecipitation followed by sequencing (ChIP-seq) has revolutionized the studies of epigenomes and the massive increase in ChIP-seq datasets calls for robust and user-friendly computational tools for quantitative ChIP-seq. Quantitative ChIP-seq comparisons have been challenging due to noisiness and variations inherent to ChIP-seq and epigenomes. By employing innovative statistical approaches specially catered to ChIP-seq data distribution and sophisticated simulations along with extensive benchmarking studies, we developed and validated CSSQ as a nimble statistical analysis pipeline capable of differential binding analysis across ChIP-seq datasets with high confidence and sensitivity and low false discovery rate with any defined regions. CSSQ models ChIP-seq data as a finite mixture of Gaussians faithfully that reflects ChIP-seq data distribution. By a combination of Anscombe transformation, k-means clustering, estimated maximum normalization, CSSQ minimizes noise and bias from experimental variations. Further, CSSQ utilizes a non-parametric approach and incorporates comparisons under the null hypothesis by unaudited column permutation to perform robust statistical tests to account for fewer replicates of ChIP-seq datasets. In sum, we present CSSQ as a powerful statistical computational pipeline tailored for ChIP-seq data quantitation and a timely addition to the tool kits of differential binding analysis to decipher epigenomes.

  8. o

    Data from: Identification of factors associated with duplicate rate in...

    • omicsdi.org
    Updated Jul 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Identification of factors associated with duplicate rate in ChIP-seq data. [Dataset]. https://www.omicsdi.org/dataset/biostudies/S-EPMC6447195
    Explore at:
    Dataset updated
    Jul 19, 2023
    Variables measured
    Unknown
    Description

    Chromatin immunoprecipitation and sequencing (ChIP-seq) has been widely used to map DNA-binding proteins, histone proteins and their modifications. ChIP-seq data contains redundant reads termed duplicates, referring to those mapping to the same genomic location and strand. There are two main sources of duplicates: polymerase chain reaction (PCR) duplicates and natural duplicates. Unlike natural duplicates that represent true signals from sequencing of independent DNA templates, PCR duplicates are artifacts originating from sequencing of identical copies amplified from the same DNA template. In analysis, duplicates are removed from peak calling and signal quantification. Nevertheless, a significant portion of the duplicates is believed to represent true signals. Obviously, removing all duplicates will underestimate the signal level in peaks and impact the identification of signal changes across samples. Therefore, an in-depth evaluation of the impact from duplicate removal is needed. Using eight public ChIP-seq datasets from three narrow-peak and two broad-peak marks, we tried to understand the distribution of duplicates in the genome, the extent by which duplicate removal impacts peak calling and signal estimation, and the factors associated with duplicate level in peaks. The three PCR-free histone H3 lysine 4 trimethylation (H3K4me3) ChIP-seq data had about 40% duplicates and 97% of them were within peaks. For the other datasets generated with PCR amplification of ChIP DNA, as expected, the narrow-peak marks have a much higher proportion of duplicates than the broad-peak marks. We found that duplicates are enriched in peaks and largely represent true signals, more conspicuous in those with high confidence. Furthermore, duplicate level in peaks is strongly correlated with the target enrichment level estimated using nonredundant reads, which provides the basis to properly allocate duplicates between noise and signal. Our analysis supports the feasibility of retaining the portion of signal duplicates into downstream analysis, thus alleviating the limitation of complete deduplication.

  9. f

    An Integrated Pipeline for the Genome-Wide Analysis of Transcription Factor...

    • figshare.com
    ai
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eloi Mercier; Arnaud Droit; Leping Li; Gordon Robertson; Xuekui Zhang; Raphael Gottardo (2023). An Integrated Pipeline for the Genome-Wide Analysis of Transcription Factor Binding Sites from ChIP-Seq [Dataset]. http://doi.org/10.1371/journal.pone.0016432
    Explore at:
    aiAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Eloi Mercier; Arnaud Droit; Leping Li; Gordon Robertson; Xuekui Zhang; Raphael Gottardo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ChIP-Seq has become the standard method for genome-wide profiling DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

  10. G

    ChIP-Seq Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Aug 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). ChIP-Seq Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/chip-seq-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Aug 29, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    ChIP-Seq Market Outlook



    According to our latest research, the global ChIP-Seq market size reached USD 1.42 billion in 2024, driven by the rapid adoption of next-generation sequencing technologies and the increasing demand for advanced epigenetic research tools. The market is expected to grow at a robust CAGR of 14.9% from 2025 to 2033, with the forecasted market size projected to reach USD 4.33 billion by 2033. This remarkable growth is primarily attributed to the expanding applications of ChIP-Seq in drug discovery, personalized medicine, and cancer research, as well as continuous technological advancements in sequencing platforms and bioinformatics analysis.




    The primary growth factor for the ChIP-Seq market is the surging interest in epigenetics and gene regulation research, which has become a cornerstone of modern molecular biology and precision medicine. Researchers and clinicians are increasingly leveraging chromatin immunoprecipitation sequencing (ChIP-Seq) to unravel complex gene regulatory mechanisms, identify disease-associated biomarkers, and develop targeted therapies. The availability of high-quality antibodies, improvements in library preparation protocols, and the reduction in sequencing costs have further democratized access to ChIP-Seq technologies, enabling a broader range of institutions and laboratories to participate in cutting-edge genomics research. Furthermore, the integration of ChIP-Seq data with other omics datasets, such as transcriptomics and proteomics, is unlocking new frontiers in systems biology and disease modeling, fueling sustained market growth.




    Another significant driver for the ChIP-Seq market is the increasing investment by pharmaceutical and biotechnology companies in drug discovery and development processes. ChIP-Seq has emerged as a critical tool for identifying druggable targets, elucidating mechanisms of action, and understanding off-target effects at the chromatin level. The growing emphasis on personalized and precision medicine, particularly in oncology and rare diseases, has spurred demand for comprehensive epigenomic profiling solutions. This trend is further supported by government initiatives and funding programs aimed at accelerating genomics research, fostering collaborations between academia and industry, and establishing large-scale biobanks that utilize ChIP-Seq for functional annotation of the genome.




    Technological advancements have played a pivotal role in shaping the trajectory of the ChIP-Seq market. The introduction of automated sample preparation systems, high-throughput sequencing platforms, and sophisticated bioinformatics software has significantly improved the reproducibility, scalability, and cost-effectiveness of ChIP-Seq workflows. Cloud-based data analysis solutions and machine learning algorithms are enabling researchers to handle and interpret massive datasets with greater accuracy and efficiency. These innovations are not only enhancing the quality of ChIP-Seq data but also expanding its utility across diverse applications, including developmental biology, neuroscience, immunology, and environmental genomics. As a result, the market is witnessing a surge in demand for integrated ChIP-Seq solutions that combine instrumentation, consumables, software, and services into seamless, end-to-end offerings.




    From a regional perspective, North America continues to dominate the ChIP-Seq market due to its advanced research infrastructure, strong presence of leading biotechnology firms, and substantial government funding for genomics initiatives. However, the Asia Pacific region is rapidly emerging as a key growth engine, fueled by increasing investments in life sciences research, expanding biopharmaceutical industries, and rising awareness of precision medicine. Europe also maintains a significant market share, supported by collaborative research networks and a robust regulatory framework for genomic technologies. Meanwhile, Latin America and the Middle East & Africa are gradually catching up, driven by improvements in healthcare infrastructure and growing participation in international genomics consortia. This dynamic regional landscape underscores the global nature of the ChIP-Seq market and its critical role in advancing biomedical research worldwide.



  11. f

    Table_4_Testing Proximity of Genomic Regions to Transcription Start Sites...

    • frontiersin.figshare.com
    xlsx
    Updated May 31, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher Lee; Kai Wang; Tingting Qin; Maureen A. Sartor (2023). Table_4_Testing Proximity of Genomic Regions to Transcription Start Sites and Enhancers Complements Gene Set Enrichment Testing.xlsx [Dataset]. http://doi.org/10.3389/fgene.2020.00199.s005
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Frontiers
    Authors
    Christopher Lee; Kai Wang; Tingting Qin; Maureen A. Sartor
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Large sets of genomic regions are generated by the initial analysis of various genome-wide sequencing data, such as ChIP-seq and ATAC-seq experiments. Gene set enrichment (GSE) methods are commonly employed to determine the pathways associated with them. Given the pathways and other gene sets (e.g., GO terms) of significance, it is of great interest to know the extent to which each is driven by binding near transcription start sites (TSS) or near enhancers. Currently, no tool performs such an analysis. Here, we present a method that addresses this question to complement GSE methods for genomic regions. Specifically, the new method tests whether the genomic regions in a gene set are significantly closer to a TSS (or to an enhancer) than expected by chance given the total list of genomic regions, using a non-parametric test. Combining the results from a GSE test with our novel method provides additional information regarding the mode of regulation of each pathway, and additional evidence that the pathway is truly enriched. We illustrate our new method with a large set of ENCODE ChIP-seq data, using the chipenrich Bioconductor package. The results show that our method is a powerful complementary approach to help researchers interpret large sets of genomic regions.

  12. Z

    LFY ChIP-SEQ analysis Galaxy Training Material

    • data-staging.niaid.nih.gov
    • data.niaid.nih.gov
    • +1more
    Updated Apr 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Steven James Burgess (2023). LFY ChIP-SEQ analysis Galaxy Training Material [Dataset]. https://data-staging.niaid.nih.gov/resources?id=zenodo_7846178
    Explore at:
    Dataset updated
    Apr 25, 2023
    Dataset provided by
    University of Illinois at Urbana Champaign
    Authors
    Steven James Burgess
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets for Galaxy Training on ChIP-SEQ analysis. Raw files can be downloaded from SRA project SRP051214

  13. e

    ChIP-seq analysis of RpoS in Vibrio cholerae

    • ebi.ac.uk
    Updated Sep 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David Grainger (2025). ChIP-seq analysis of RpoS in Vibrio cholerae [Dataset]. https://www.ebi.ac.uk/biostudies/ArrayExpress/studies/E-MTAB-14434
    Explore at:
    Dataset updated
    Sep 8, 2025
    Authors
    David Grainger
    Description

    The experiment contains ChIP-seq data for an rpoS- version of Vibrio cholerae strain A1552, or a derivative encoding rpoS-3xFLAG. In both cases, smooth colony variants were used. The strains were both grown at 37 degrees, in LB medium, to an OD600 of 2.0, and crosslinked with 1 % (v/v) formaldehyde. After sonication, to break open cells and fragment DNA, immunoprecipitations were done using anti-FLAG antibodies. Libraries were prepared using DNA remaining after immunoprecipitation.

  14. Example dataset for DASiRe

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Dec 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Marisol Salgado; Marisol Salgado; Chit Tong Lio; Chit Tong Lio; Amit Fenn; Amit Fenn (2021). Example dataset for DASiRe [Dataset]. http://doi.org/10.5281/zenodo.5792672
    Explore at:
    zipAvailable download formats
    Dataset updated
    Dec 20, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Marisol Salgado; Marisol Salgado; Chit Tong Lio; Chit Tong Lio; Amit Fenn; Amit Fenn
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Direct Alternative Splicing Regulator predictor (DASiRe) is a web application that allows non-expert users to perform different types of splicing analysis from RNA-seq experiments and also incorporates ChIP-seq data of a DNA-binding protein of interest to evaluate whether its presence is associated with the splicing changes detected in the RNA-seq dataset.

    DASiRe is an accessible web-based platform that performs the analysis of raw RNA-seq and ChIP-seq data to study the relationship between DNA-binding proteins and alternative splicing regulation. It provides a fully integrated pipeline that takes raw reads from RNA-seq and performs extensive splicing analysis by incorporating the three current methodological approaches to study alternative splicing: isoform switching, exon and event-level. Once the initial splicing analysis is finished, DASiRe performs ChIP-seq peak enrichment in the spliced genes detected by each one of the three approaches.

  15. d

    DoubleChEC program to identify transcription factor binding sites from...

    • dataone.org
    • datadryad.org
    • +1more
    Updated Jul 25, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jason Brickner (2025). DoubleChEC program to identify transcription factor binding sites from mapped ChEC-seq data [Dataset]. http://doi.org/10.5061/dryad.c866t1gd5
    Explore at:
    Dataset updated
    Jul 25, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Jason Brickner
    Time period covered
    Jan 1, 2023
    Description

    ChIP-seq (chromatin immunoprecipitation followed by sequencing) is commonly used to identify genome-wide protein-DNA interactions. However, ChIP-seq often gives a low yield, which is not ideal for quantitative outcomes. An alternative method to ChIP-seq is ChEC-seq (Chromatin endogenous cleavage with high-throughput sequencing). In this method, the endogenous TF (transcription factor) of interest is fused with MNase (micrococcal nuclease) that non-specifically cleaves DNA near binding sites. Compared to the original ChEC-seq method, the modified version requires far less amplification. Since MACS3 failed to identify peaks in data generated from the modified ChEC-seq method, a new peak finder has been developed specifically for it. There are three functions in the peak_finder/. callpeaks() is used to identify peaks from BAM files. goanalysis() is used to make GO (Gene Ontology) term plots from peaks. bedtomeme() is a wrapper function to perform MEME analysis in R after MEME Suite is inst..., ****EXCERPTED FROM BIORXIV PREPRINT; SEE PREPRINT OR PUBLISHED PAPER FOR REFERENCES AND DETAILS**** Yeast strains All yeast strains were derived from BY4741. A C-terminal micrococcal nuclease fusion was introduced to the protein of interest through transformation and homologous recombination of PCR-amplified DNA. Primers were designed with 50-bp of homology to the 3’ end of the coding sequence of interest. The 3xFLAG-MNase with a KanR marker was amplified from pGZ108 (Zentner et al., 2015) and transformed into BY4741 as previously described. Successful transformation was confirmed by immunoblotting and PCR, followed by sequencing. Lyophilized DNA oligonucleotides were resuspended in molecular-grade water to a concentration of 100 µM. For ligation, the following pair of oligonucleotides were annealed to produce the Y-adapter: Tn5ME-A (5’-TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG-3’) and Y-Adapt-i5 R (5’-CTGTCTCTTATACACATCTTCATAGTAATCATC-3’). For Tn5 Tagmentation, the following i7 oligonucle..., , # DoubleChEC TF binding site finder

    Introduction

    ChIP-seq (chromatin immunoprecipitation followed by sequencing) is commonly used to identify genome-wide protein-DNA interactions. However, ChIP-seq often gives a low yield, which is not ideal for quantitative outcomes. An alternative method to ChIP-seq is ChEC-seq (Chromatin endogenous cleavage with high-throughput sequencing). In this method, an endogenous TF (transcription factor) fused to MNase (micrococcal nuclease) cleaves DNA near binding sites. This package is designed to identify high-confidence binding sites from cleavage patterns from ChEC-seq2, a variant form of ChEC-seq.

    There are three functions in the peak_finder/. callpeaks() is used to identify peaks from single-end mapped reads input as BAM files. goanalysis() is used to make GO (Gene Ontology) term plots from peaks. bedtomeme() is a wrapper function to perform MEME analysis in R **after [MEME Suite](https://meme-...

  16. Supporting data for "Software pipelines for RNA-Seq, ChIP-Seq and Germline...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Sep 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Konstantinos Kyritsis; Konstantinos Kyritsis; Nikolaos Pechlivanis; Nikolaos Pechlivanis; Fotis Psomopoulos; Fotis Psomopoulos (2023). Supporting data for "Software pipelines for RNA-Seq, ChIP-Seq and Germline Variant calling analyses in Common Workflow Language (CWL)" [Dataset]. http://doi.org/10.5281/zenodo.8116556
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 27, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Konstantinos Kyritsis; Konstantinos Kyritsis; Nikolaos Pechlivanis; Nikolaos Pechlivanis; Fotis Psomopoulos; Fotis Psomopoulos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Datasets produced during the validation of CWL-based pipelines, designed for the analysis of data from RNA-Seq, ChIP-Seq and germline variant calling experiments. Specifically, the workflows were tested using publicly available High-throughput (HTS) data from published studies on Chronic Lymphocytic Leukemia (CLL) (accession numbers: E-MTAB-6962, GSE115772) and Genome in a Bottle (GIAB) project samples (accession numbers: SRR6794144, SRR22476789, SRR22476790, SRR22476791).

    The supporting data include:

    • Differential transcript and gene expression results produced during the analysis with the CWL-based RNA-Seq pipeline
    • Bigwig and narrowPeak files, differential binding results, table of consensus peaks and read counts of EZH2 and H3K27me3, produced during the analysis with the CWL-based ChIP-Seq pipeline
    • VCF files containing the detected and filtered variants, along with the respective hap.py () results regarding comparisons against the GIAB golden standard truth sets for both CWL-based germline variant calling pipelines
  17. f

    Experimental validation of the binding events predicted by dPeak analysis of...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    • +1more
    Updated Oct 17, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Grass, Jeffrey; Kiley, Patricia; Keleş, Sündüz; Chung, Dongjun; Landick, Robert; Myers, Kevin; Park, Dan (2013). Experimental validation of the binding events predicted by dPeak analysis of PET ChIP-Seq data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001618272
    Explore at:
    Dataset updated
    Oct 17, 2013
    Authors
    Grass, Jeffrey; Kiley, Patricia; Keleş, Sündüz; Chung, Dongjun; Landick, Robert; Myers, Kevin; Park, Dan
    Description

    (a)The genes with promoters harboring the predicted binding events.(b)The true positions were determined by primer extension experiments (Figure 5A).(c)The conditions under which binding events are validated.(d)We report results based on the RegulonDB annotations for ybgI and ptsG genes as the primer extension products for these genes were too large to accurately map with the sequencing ladder.

  18. f

    Data from: ChIP-Seq analysis identifies p27(Kip1)-target genes involved in...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Nov 20, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Biçer, Atilla; Orlando, Serena; Besson, Arnaud; Pujol, Maria Jesús; Aligué, Rosa; Islam, Abul B. M. M. K.; Bachs, Oriol; Gallastegui, Edurne (2017). ChIP-Seq analysis identifies p27(Kip1)-target genes involved in cell adhesion and cell signalling in mouse embryonic fibroblasts [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001750705
    Explore at:
    Dataset updated
    Nov 20, 2017
    Authors
    Biçer, Atilla; Orlando, Serena; Besson, Arnaud; Pujol, Maria Jesús; Aligué, Rosa; Islam, Abul B. M. M. K.; Bachs, Oriol; Gallastegui, Edurne
    Description

    The protein p27Kip1 (p27), a member of the Cip-Kip family of cyclin-dependent kinase inhibitors, is involved in tumorigenesis and a correlation between reduced levels of this protein in human tumours and a worse prognosis has been established. Recent reports revealed that p27 also behaves as a transcriptional regulator. Thus, it has been postulated that the development of tumours with low amounts of p27 could be propitiated by deregulation of transcriptional programs under the control of p27. However, these programs still remain mostly unknown. The aim of this study has been to define the transcriptional programs regulated by p27 by first identifying the p27-binding sites (p27-BSs) on the whole chromatin of quiescent mouse embryonic fibroblasts. The chromatin regions associated to p27 have been annotated to the most proximal genes and it has been considered that the expression of these genes could by regulated by p27. The identification of the chromatin p27-BSs has been performed by Chromatin Immunoprecipitation Sequencing (ChIP-seq). Results revealed that p27 associated with 1839 sites that were annotated to 1417 different genes being 852 of them protein coding genes. Interestingly, most of the p27-BSs were in distal intergenic regions and introns whereas, in contrast, its association with promoter regions was very low. Gene ontology analysis of the protein coding genes revealed a number of relevant transcriptional programs regulated by p27 as cell adhesion, intracellular signalling and neuron differentiation among others. We validated the interaction of p27 with different chromatin regions by ChIP followed by qPCR and demonstrated that the expressions of several genes belonging to these programs are actually regulated by p27. Finally, cell adhesion assays revealed that the adhesion of p27-/- cells to the plates was much higher that controls, revealing a role of p27 in the regulation of a transcriptional program involved in cell adhesion.

  19. m

    Identification of direct transcriptional targets of CgCrzA under CFW...

    • data.mendeley.com
    • figshare.com
    Updated Aug 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lin Huang (2024). Identification of direct transcriptional targets of CgCrzA under CFW conditions through Genome-wide ChIP-seq analysis [Dataset]. http://doi.org/10.17632/9nkg3p95yn.1
    Explore at:
    Dataset updated
    Aug 5, 2024
    Authors
    Lin Huang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis was performed to evaluate whether CgCrzA plays a role in regulating CWI-related genes. Compared to the control, the ChIP-seq samples exhibited enrichment of CgCrzA-bound DNA fragments under CFW conditions

  20. o

    Data from: iRNA-seq: Computational method for genome wide assessment of...

    • omicsdi.org
    xml
    Updated Jun 17, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Susanne Mandrup,Bjørk D Larsen,Anne Loft,Jesper G Madsen,Ronni Nielsen,Søren F Schmidt (2015). iRNA-seq: Computational method for genome wide assessment of acute transcriptional regulation from total RNA-seq data [Dataset]. https://www.omicsdi.org/dataset/arrayexpress-repository/E-GEOD-60462
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Jun 17, 2015
    Authors
    Susanne Mandrup,Bjørk D Larsen,Anne Loft,Jesper G Madsen,Ronni Nielsen,Søren F Schmidt
    Variables measured
    Transcriptomics,Genomics,Multiomics
    Description

    RNA-seq is a sensitive and accurate technique to compare steady state levels of RNA between different cellular states. However, as it does not provide an account of transcriptional activity per se, other technologies are needed to more precisely determine acute transcriptional responses. Here, we have developed an easy, sensitive and accurate novel method, iRNA-seq, for genome-wide assessment of transcriptional activity based on analysis of intron coverage from total RNA-seq data. To test our method, we have performed total RNA-seq and RNA polymerase II (RNAPII) ChIP-seq profiling of the acute transcriptional response of human adipocytes to TNFα treatment and analyzed these using iRNA-seq in addition to different publically availbale dataset. Comparison of the results derived from iRNA-seq analyses with results derived using current methods for genome-wide determination of transcriptional activity, i.e. Global Run-On (GRO)-seq and RNA polymerase II (RNAPII) ChIP-seq, demonstrate that iRNA-seq provides very similar results in terms of number of regulated genes and their fold change. However, unlike the current methods that are all very labor-intensive and demanding in terms of sample material and technologies, iRNA-seq is cheap and easy and requires very little sample material. In conclusion, iRNA-seq offers an attractive novel alternative to current methods for determination of changes in transcriptional activity at a genome-wide level. Genome-wide assesment of the acute transcriptional response to TNFa in human SGBS adiposytes using total RNA-seq data end RNAPII ChIP-seq

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100

Training material for ChIP-seq analysis

Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Johns Hopkins University
Authors
Freeberg, Mallory; Heydarian, Mohammad
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.

Search
Clear search
Close search
Google apps
Main menu