100+ datasets found
  1. Z

    Training material for ChIP-seq analysis

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Johns Hopkins University
    Authors
    Freeberg, Mallory; Heydarian, Mohammad
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.

  2. N

    ENCODE Transcription Factor Binding Sites by ChIP-seq from...

    • data.niaid.nih.gov
    • ebi.ac.uk
    Updated Nov 10, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Snyder M; Gerstein M; Weissman S; Farnham P; Struhl K (2021). ENCODE Transcription Factor Binding Sites by ChIP-seq from Stanford/Yale/USC/Harvard [Dataset]. https://data.niaid.nih.gov/resources?id=gse31477
    Explore at:
    Dataset updated
    Nov 10, 2021
    Dataset provided by
    ENCODE DCC
    Authors
    Snyder M; Gerstein M; Weissman S; Farnham P; Struhl K
    Description

    This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Philip Cayting mailto:pcayting@stanford.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu).This track shows probable binding sites of the specified transcription factors (TFs) in the given cell types as determined by chromatin immunoprecipitation followed by high throughput sequencing (ChIP-Seq). Included for each cell type is the input signal, which represents the control condition where no antibody targeting was performed. For each experiment (cell type vs. antibody) this track shows a graph of enrichment for TF binding (Signal), along with sites that have the greatest evidence of transcription factor binding (Peaks).For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf Cells were grown according to the approved ENCODE cell culture protocols. Further preparations were similar to those previously published (Euskirchen et al., 2007) with the exceptions that the cells were unstimulated and sodium orthovanadate was omitted from the buffers. For details on the chromatin immunoprecipitation protocol used, see Euskirchen et al. (2007) and Rozowsky et al. (2009).DNA recovered from the precipitated chromatin was sequenced on the Illumina (Solexa) sequencing platform and mapped to the genome using the Eland alignment program. ChIP-seq data was scored based on sequence reads (length ~30 bps) that align uniquely to the human genome. From the mapped tags a signal map of ChIP DNA fragments (average fragment length ~ 200 bp) was constructed where the signal height is the number of overlapping fragments at each nucleotide position in the genome.For each 1 Mb segment of each chromosome a peak height threshold was determined by requiring a false discovery rate <= 0.05 when comparing the number of peaks above threshold as compared the number obtained from multiple simulations of a random null background with the same number of mapped reads (also accounting for the fraction of mapable bases for sequence tags in that 1 Mb segment). The number of mapped tags in a putative binding region is compared to the normalized (normalized by correlating tag counts in genomic 10 kb windows) number of mapped tags in the same region from an input DNA control. Using a binomial test, only regions that have a p-value <= 0.05 are considered to be significantly enriched compared to the input DNA control.

  3. Z

    Training data for ChIP-seq data analysis (Galaxy Training Material):...

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning (2020). Training data for ChIP-seq data analysis (Galaxy Training Material): Identification of the binding sites of the Estrogen receptor [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_892431
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    University of Freiburg
    Authors
    Bérénice Batut; Friederike Dündar; Anika Erxleben; Björn Grüning
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data provided here are part of a Galaxy Training Network tutorial that analyzes ChIP-seq data from a study published by Ross-Inness et al., 2012 (DOI:10.1038/nature10730) to identify the binding sites of the Estrogen receptor, a transcription factor known to be associated with different types of breast cancer.

  4. Summary of ChIP-seq data.

    • figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seth Frietze; Henriette O'Geen; Kimberly R. Blahnik; Victor X. Jin; Peggy J. Farnham (2023). Summary of ChIP-seq data. [Dataset]. http://doi.org/10.1371/journal.pone.0015082.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Seth Frietze; Henriette O'Geen; Kimberly R. Blahnik; Victor X. Jin; Peggy J. Farnham
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Numbers are from the analysis of two merged replicate datasets.

  5. r

    Data from: ChIP-Atlas

    • rrid.site
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). ChIP-Atlas [Dataset]. http://identifiers.org/RRID:SCR_015511
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    Database for visualizing and making use of public ChIP-seq data. ChIP-Atlas covers almost all public ChIP-seq experiments and data submitted to the SRA (Sequence Read Archives) in NCBI, DDBJ, or ENA.

  6. Table S9: Human and Mouse HN-scores for Each Gene and ChIP-seq Scores...

    • figshare.com
    txt
    Updated Aug 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sora Yonezawa (2023). Table S9: Human and Mouse HN-scores for Each Gene and ChIP-seq Scores Obtained from the ChIP-Atlas database. [Dataset]. http://doi.org/10.6084/m9.figshare.22580542.v5
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 17, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Sora Yonezawa
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Table of HN-scores and ChIP-seq scores (MACS2 score) for each gene. The genes listed in this data are the only human and mouse gene symbols that can be converted.The ChIP-seq score is retrieved from the ChIP-Atlas database (http://chip-atlas.org) (accessed on February 2023). Using the "Target Genes" feature, data were obtained for HSF1, HSF2, and PPARGC1A.

  7. e

    Transcription Factor Binding Sites by ChIP-seq from ENCODE/HAIB

    • ebi.ac.uk
    Updated Sep 28, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ENCODE DCC; Richard Myers; Flo Pauli (2011). Transcription Factor Binding Sites by ChIP-seq from ENCODE/HAIB [Dataset]. https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-GEOD-32465
    Explore at:
    Dataset updated
    Sep 28, 2011
    Authors
    ENCODE DCC; Richard Myers; Flo Pauli
    Description

    This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (Florencia Pauli mailto:fpauli@hudsonalpha.org). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu). The ChIP-Seq method was used to assay chromatin fragments bound by specific or general transcription factors as described below. DNA isolated by ChIP-Seq was size-selected (~225 bp) and sequenced. Short reads of 25-36 bp were mapped to the human reference genome, and enriched regions of high read density relative to a total input chromatin control reads were identified. The sequence reads with quality scores (fastq files) and alignment coordinates (BAM files) from these experiments are available for download. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf Cells were grown according to the approved ENCODE cell culture protocols (http://hgwdev.cse.ucsc.edu/ENCODE/protocols/cell). Cross-linked chromatin was immunoprecipitated with an antibody. The Protein:DNA crosslinks were then reversed and the DNA fragments were recovered and sequenced. Please see protocol notes below and go to http://hudsonalpha.org/myers-lab/protocols for the most current version of the protocol. Biological replicates from each experiment were completed. Libraries were sequenced with an Illumina Genome Analyzer I or an Illumina Genome Analyzer IIx according to the manufacturer's recommendations. Sequence data produced by the Illumina data pipeline software were quality filtered and then mapped to NCBI Build37 (hg19) using the integrated Eland software; 32 nt of the sequence reads were used for alignment; up to two mismatches were tolerated; reads that mapped to multiple sites in the genome were discarded. To identify likely binding sites, peak calling was applied to the aligned sequence data sets using Model-based Analysis of Chip-Seq MACS (Zhang Y, et al., 2008) (http://liulab.dfci.harvard.edu/MACS/00README.html). MACS models the shift size of ChIP-Seq tags empirically, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to capture local biases in the genome, allowing for more robust predictions (Zhang Y, et al., 2008). Protocol Notes: Several changes and improvements were made to the original ChIP-Seq protocol (Jonshon et al.,2008). The major differences between protocols are the number of cells and magnetic beads used for IP, the method of sonication used to fragment DNA, and the number of cycles of PCR used to amplify the sequencing library. The most current protocol used by the Myers lab can be found at http://hudsonalpha.org/myers-lab/protocols. The protocol field for each file denotes the version of the protocol used as being PCR1x, PCR2x or a version number (for examples, v041610.1). The sequencing libraries labeled as PCR2x were made with two rounds of amplification (25 and 15 cycles) and those labeled as PCR1x were made with one 15-cycle round of amplification. These experiments were completed prior to January 2010 and were originally aligned to NCBI Build36 (hg18). They have been re-aligned to NCBI Build37 (hg19) with the Bowtie software (Langmead, et al., 2009) for this data release (http://bowtie-bio.sourceforge.net/index.shtml). The libraries labeled with a protocol version number were competed after January 2010 and were only aligned to NCBI Build37 (hg19). Please refer to the Myers Lab website (http://hudsonalpha.org/myers-lab/protocols) for details on each protocol version. Verification: The MACS (http://liulab.dfci.harvard.edu/MACS/00README.html) peak caller was used to call significant peaks on the individual replicates of a ChIP-Seq experiment. Afterwards, the irreproducible discovery rate (IDR) method, developed by Li et al. (submitted), was used to quantify the consistency between pairs of ranked peaks lists from replicates. The IDR methods uses a model that assumes that the ranked lists of peaks in a pair of replicates consist of two groups - a reproducible group and an irreproducible group. In general, the signals in the reproducible group are more consistent (i.e. with a larger rank correlation coefficient) and are ranked higher than the irreproducible group. The proportion of peaks that belong to the irreproducible component and the correlation of the reproducible component are estimated adaptively from the data. The model also provides an IDR score for each peak, which reflects the posterior probability of the peak belonging to the irreproducible group. The aligned reads were pooled from all replicates and the MACS peak caller was used to call significant peaks on the pooled data. Only datasets containing at least 100 peaks passing the IDR threshold are considered valid and submitted for release.

  8. Human list of RNA-seq data counts and average of ChIP-seq MACS2 value (HIF1A...

    • figshare.com
    txt
    Updated Oct 9, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hidemasa Bono (2019). Human list of RNA-seq data counts and average of ChIP-seq MACS2 value (HIF1A and EPAS1) from meta-analysis of the public NGS database [Dataset]. http://doi.org/10.6084/m9.figshare.9958181.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    Oct 9, 2019
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Hidemasa Bono
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The number of experiments in which gene was up/down regulated in RNA-seq data and the average of ChIP-seq MACS2 values of HIF1A and EPAS1(HIF2A) in ChIP-Atlas database.Both were calculated from public NGS database (SRA).For up/donw regulated gene selection, 2 fold threshold was adopted.

  9. MAGIC: A tool for predicting transcription factors and cofactors driving...

    • plos.figshare.com
    tiff
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Avtar Roopra (2023). MAGIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data [Dataset]. http://doi.org/10.1371/journal.pcbi.1007800
    Explore at:
    tiffAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Avtar Roopra
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Transcriptomic profiling is an immensely powerful hypothesis generating tool. However, accurately predicting the transcription factors (TFs) and cofactors that drive transcriptomic differences between samples is challenging. A number of algorithms draw on ChIP-seq tracks to define TFs and cofactors behind gene changes. These approaches assign TFs and cofactors to genes via a binary designation of ‘target’, or ‘non-target’ followed by Fisher Exact Tests to assess enrichment of TFs and cofactors. ENCODE archives 2314 ChIP-seq tracks of 684 TFs and cofactors assayed across a 117 human cell lines under a multitude of growth and maintenance conditions. The algorithm presented herein, Mining Algorithm for GenetIc Controllers (MAGIC), uses ENCODE ChIP-seq data to look for statistical enrichment of TFs and cofactors in gene bodies and flanking regions in gene lists without an a priori binary classification of genes as targets or non-targets. When compared to other TF mining resources, MAGIC displayed favourable performance in predicting TFs and cofactors that drive gene changes in 4 settings: 1) A cell line expressing or lacking single TF, 2) Breast tumors divided along PAM50 designations 3) Whole brain samples from WT mice or mice lacking a single TF in a particular neuronal subtype 4) Single cell RNAseq analysis of neurons divided by Immediate Early Gene expression levels. In summary, MAGIC is a standalone application that produces meaningful predictions of TFs and cofactors in transcriptomic experiments.

  10. E

    BLUEPRINT release August 2016, ChIP-Seq for mature neutrophil, on genome...

    • ega-archive.org
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BLUEPRINT release August 2016, ChIP-Seq for mature neutrophil, on genome GRCh38 [Dataset]. https://ega-archive.org/datasets/EGAD00001002449
    Explore at:
    License

    https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135

    Description

    ChIP-Seq data for 10 mature neutrophil sample(s). 105 run(s), 86 experiment(s), 86 analysis(s) on human genome GRCh38. Part of BLUEPRINT release August 2016. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/releases/20160816/homo_sapiens/README_chipseq_analysis_ebi_20160816

  11. H

    Analyzed ChIP-seq Data

    • dataverse.harvard.edu
    • dataone.org
    Updated Sep 3, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Parisha Shah (2025). Analyzed ChIP-seq Data [Dataset]. http://doi.org/10.7910/DVN/KRPJI1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 3, 2025
    Dataset provided by
    Harvard Dataverse
    Authors
    Parisha Shah
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    See "Read Me" document and "Data Dictionary" file for detailed information. ChIP-seq: processed and ready for visualization a public genome browser (.bigwig).

  12. E

    BLUEPRINT release August 2016, ChIP-Seq for CD4-positive, alpha-beta T cell,...

    • ega-archive.org
    Updated Aug 23, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). BLUEPRINT release August 2016, ChIP-Seq for CD4-positive, alpha-beta T cell, on genome GRCh38 [Dataset]. https://ega-archive.org/datasets/EGAD00001002444
    Explore at:
    Dataset updated
    Aug 23, 2016
    License

    https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135

    Description

    ChIP-Seq data for 9 CD4-positive, alpha-beta T cell sample(s). 68 run(s), 63 experiment(s), 63 analysis(s) on human genome GRCh38. Part of BLUEPRINT release August 2016. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/releases/20160816/homo_sapiens/README_chipseq_analysis_ebi_20160816

  13. ReMap 2022: a database of Human, Mouse, Drosophila and Arabidopsis...

    • zenodo.org
    • data.niaid.nih.gov
    • +1more
    application/gzip, bin
    Updated Jan 19, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Benoit Ballester; Benoit Ballester (2024). ReMap 2022: a database of Human, Mouse, Drosophila and Arabidopsis regulatory regions from an integrative analysis of DNA-binding sequencing experiments [Dataset]. http://doi.org/10.5281/zenodo.10527088
    Explore at:
    application/gzip, binAvailable download formats
    Dataset updated
    Jan 19, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Benoit Ballester; Benoit Ballester
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    ReMap is a large scale integrative analysis of DNA-binding experiments for Homo sapiens, Mus musculus, Drosophila melanogaster and Arabidopsis thaliana transcriptional regulators. The catalogues are the results of the manual curation of ChIP-seq, ChIP-exo, DAP-seq from public sources (GEO, ENCODE, ENA).

    ReMap (https://remap.univ-amu.fr) aims to provide manually curated, high-quality catalogs of regulatory regions resulting from a large-scale integrative anlysis of DNA-binding experiments in Human, Mouse, Fly and Arabidopsis thaliana for hundreds of transcription factors and regulators. In this 2022 update, we have uniformly processed >11 000 DNA-binding sequencing datasets from public sources across four species. The updated Human regulatory atlas includes 8103 datasets covering a total of 1210 transcriptional regulators (TRs) with a catalog of 182 million (M) peaks, while the updated Arabidopsis atlas reaches 4.8M peaks, 423 TRs across 694 datasets. Also, this ReMap release is enriched by two new regulatory catalogs for Mus musculus and Drosophila melanogaster. First, the Mouse regulatory catalog consists of 123M peaks across 648 TRs as a result of the integration and validation of 5503 ChIP-seq datasets. Second, the Drosophila melanogaster catalog contains 16.6M peaks across 550 TRs from the integration of 1205 datasets. The four regulatory catalogs are browsable through track hubs at UCSC, Ensembl and NCBI genome browsers. Finally, ReMap 2022 comes with a new Cis Regulatory Module identification method, improved quality controls, faster search results, and better user experience with an interactive tour and video tutorials on browsing and filtering ReMap catalogs.

    We thank our users for past and future feedback to make ReMap useful for the community. The ReMap team welcomes your feedback on the catalogs, use of the website and use of the downloadable files. Please contact benoit.ballester@inserm.fr for development requests.

    Reference:

    ReMap 2022: a database of Human, Mouse, Drosophila and Arabidopsis regulatory regions from an integrative analysis of DNA-binding sequencing experiments
    Fayrouz Hammal, Pierre de Langen, Aurélie Bergon, Fabrice Lopez, Benoit Ballester
    Nucleic Acids Research, Volume 50, Issue D1, 7 January 2022, Pages D316–D325,
    https://doi.org/10.1093/nar/gkab996

  14. o

    hbctraining/Intro-to-ChIPseq-flipped: ChIP-seq Data Analysis Lessons from...

    • explore.openaire.eu
    Updated Jan 6, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Meeta Mistry; Shannan Ho Sui; Jihe Liu; Mary Piper; William Gammerdinger; Radhika Khetani (2022). hbctraining/Intro-to-ChIPseq-flipped: ChIP-seq Data Analysis Lessons from HCBC (first release) [Dataset]. http://doi.org/10.5281/zenodo.5825871
    Explore at:
    Dataset updated
    Jan 6, 2022
    Authors
    Meeta Mistry; Shannan Ho Sui; Jihe Liu; Mary Piper; William Gammerdinger; Radhika Khetani
    Description

    First release of the ChIP-seq data analysis training materials from the Harvard Chan Bioinformatics Core (https://bioinformatics.sph.harvard.edu/training). The raw data was obtained from the SRA (https://www.ncbi.nlm.nih.gov/sra?term=SRP134733). Learning Objectives Understand the necessity for, and use of, the command line interface (bash) and HPC for analyzing high-throughput sequencing data. Understand best practices for designing a ChIP-seq experiment and analysis the resulting data. Description This workshop focuses on teaching computational skills to enable the effective use of an high-performance computing environment to implement a ChIP-seq data analysis workflow. In addition to running the workflow from FASTQ files to peak calls, the workshop covers best practice guidelines for ChIP-seq experimental design and data organization/management and data visualization for quality control. Introduction to ChIP-seq QC using FASTQC Alignment and filtering of reads (Bowtie2, Samtools) Peak calling (MACS2) Peak overlap (bedtools) Peak visualization (using deepTools)

  15. E

    BLUEPRINT release August 2016, ChIP-Seq for memory B cell, on genome GRCh38

    • ega-archive.org
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BLUEPRINT release August 2016, ChIP-Seq for memory B cell, on genome GRCh38 [Dataset]. https://ega-archive.org/datasets/EGAD00001002510
    Explore at:
    License

    https://ega-archive.org/dacs/EGAC00001000135https://ega-archive.org/dacs/EGAC00001000135

    Description

    ChIP-Seq data for 1 memory B cell sample(s). 1 run(s), 1 experiment(s), 1 analysis(s) on human genome GRCh38. Part of BLUEPRINT release August 2016. Analysis documentation available at http://ftp.ebi.ac.uk/pub/databases/blueprint/releases/20160816/homo_sapiens/README_chipseq_analysis_ebi_20160816

  16. Z

    Datasets for predicting TF binding using Virtual ChIP-seq

    • data.niaid.nih.gov
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mehran Karimzadeh; Michael M. Hoffman (2020). Datasets for predicting TF binding using Virtual ChIP-seq [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_823296
    Explore at:
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Department of Medical Biophysics, University of Toronto
    Authors
    Mehran Karimzadeh; Michael M. Hoffman
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This repository contains datasets necessary for using the Virtual ChIP-seq software.

    Virtual ChIP-seq requires the following datasets to predict transcription factor binding:

    chipExpDir_AtoH_V1.0.0.tar.gz: Reference matrices of correlation between TF binding and gene expression for TFs starting with letters A-H.

    chipExpDir_ItoZ_V1.0.0.tar.gz: Reference matrices of correlation between TF binding and gene expression for TFs starting with letters I-Z.

    refTables_V1.1.0.tar.gz: PhastCons genomic conservation, FIMO PWM scores for JASPAR motifs, and ChIP-seq data of ENCODE and Cistrome database.

    hg38_chrsize.tsv: Length of chromosomes in hg38

    trainedModels_V1.0.0.tar.gz: Virtual ChIP-seq scikit-learn trained models saved in joblib format

    .tar.gz: Pre-calculated matrices suitable for training with other algorithms or re-training with Virtual ChIP-seq.

    Some predictive features of TF binding are the same in each cell type and are stored together for simplicity in refTables_V1.0.0.tar.gz. You can use datasets from other cell types (named here as .tar.gz) for the purpose of re-training the model. The .tar.gz files contain pre-calculated predictive features of transcription factor binding in 4 chromosomes (5, 10, 15, 20).

    These features include:

    PhastCons genomic conservation

    FIMO score for sequence motifs of TF in the JASPAR database

    Chromatin accessibility

    TF binding in ENCODE + Cistrome DB datasets

    Virtual ChIP-seq expression score

  17. ENCODE TF ChIP-seq data analysis

    • figshare.com
    xlsx
    Updated Feb 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Namshik Han (2018). ENCODE TF ChIP-seq data analysis [Dataset]. http://doi.org/10.6084/m9.figshare.5851707.v2
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 2, 2018
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Namshik Han
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We downloaded 2,216 ChIP-seq experiment data from the ENCODE Project. The list of the data is in Supplementary Table S8. The data were lifted over from hg19 to hg38. We found overlapping peaks on four different categories: (1) 500bp upstream the promoter region of pcRNA-associated coding genes, (2) 500bp upstream promoter region of pcRNAs, (3) pcRNA genomic loci, and (4) pcRNA genomic loci but not overlapping with promoter region. To understand the correlation of TF binding patterns in the four categories, we made a binary matrix per category that consists of rows of TFs and columns of pcRNA/coding genes. Hence, the matrix contains connections between TF and pcRNA/associate coding genes. The matrix of category 2 is clustered by Euclidian Distance. To check the extent to which promoter sharing or proximity determines TFBS correlation, we also separated the clustered heat-map in the pcRNA bidirectional transcript (BIDIR) subgroup to the other subgroups (Non-BIDIR). To directly compare the TF binding patterns between each category, the other three matrices were sorted by the same order of the clustered matrix. We used the MatLab function corr2 to calculate r-value between category (1) and (2). We performed Monte Carlo simulation to calculate the p-value and test the significance of the r-value.

  18. Data from: A base-resolution panorama of the in vivo impact of cytosine...

    • zenodo.org
    application/gzip
    Updated Aug 25, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aldo Hernandez-Corchado; Aldo Hernandez-Corchado; Hamed S. Najafabadi; Hamed S. Najafabadi (2022). A base-resolution panorama of the in vivo impact of cytosine methylation on transcription factor binding [Dataset]. http://doi.org/10.5281/zenodo.5573261
    Explore at:
    application/gzipAvailable download formats
    Dataset updated
    Aug 25, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Aldo Hernandez-Corchado; Aldo Hernandez-Corchado; Hamed S. Najafabadi; Hamed S. Najafabadi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    TF binding models built by JAMS (https://github.com/csglab/JAMS), ChIP-seq peak files (from ENCODE, Najafabadi et al. 2015, Schmitges et al. 2016, and Imbeault et al. 2017; called by MACS 1.4v), and ChIP-seq pulldown and control tags from said peaks.

  19. n

    CistromeFinder

    • neuinfo.org
    • dknet.org
    • +2more
    Updated Mar 19, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2013). CistromeFinder [Dataset]. http://identifiers.org/RRID:SCR_005405
    Explore at:
    Dataset updated
    Mar 19, 2013
    Description

    Data portal that can help query, evaluate and visualize publicly available Chromatin immunoprecipitation and DNase I hypersensitivity assays with high-throughput sequencing data in human and mouse. The database currently contains 6378 samples over 4391 datasets, 313 factors and 102 cell lines or cell populations (May 2013). Each dataset has gone through a consistent analysis and quality control pipeline; therefore, users could evaluate the overall quality of each dataset before examining binding sites near their genes of interest. CistromeFinder is integrated with UCSC genome browser for visualization, Primer3Plus for ChIP-qPCR primer design and CistromeMap for submitting newly available datasets. It also allows users to leave comments to facilitate data evaluation and update.

  20. o

    Su(H)_0-8_embryonic_ChIP-seq

    • omicsdi.org
    xml
    Updated Aug 11, 2013
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lijia Ma,Matthew Slattery,Kevin White,DCC modENCODE (2013). Su(H)_0-8_embryonic_ChIP-seq [Dataset]. https://www.omicsdi.org/dataset/arrayexpress-repository/E-GEOD-49776
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Aug 11, 2013
    Authors
    Lijia Ma,Matthew Slattery,Kevin White,DCC modENCODE
    Variables measured
    Genomics
    Description

    modENCODE_submission_5017 This submission comes from a modENCODE project of Kevin White. For full list of modENCODE projects, see http://www.genome.gov/26524648 Project Goal: The White Lab is aiming to map the association of all the Transcription Factors (TF) on the genome of Drosophila melanogaster. One technique that we use for this purpose is chromatin immunoprecipitation coupled with deep sequencing (ChIP-seq) utilizing an Illumina next generation sequencing platform. The data generated by ChIP-seq experiments consist basically of a plot of signal intensity across the genome. The highest signals correspond to positions in the genome occupied by the tested TF. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf EXPERIMENT TYPE: CHIP-seq. BIOLOGICAL SOURCE: Strain: Y cn bw sp; Developmental Stage: Embryo 0-8; Genotype: y[1] oc[R3.2]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2] bw[1] sp[1]; LysC[1] lab[R4.2] MstProx[1] GstD5[1] Rh6[1]; Sex: Unknown; EXPERIMENTAL FACTORS: Developmental Stage Embryo 0-8; Strain Y cn bw sp; Antibody Su(H) (target is fly genes:Su(H))

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Freeberg, Mallory; Heydarian, Mohammad (2020). Training material for ChIP-seq analysis [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_197100

Training material for ChIP-seq analysis

Explore at:
Dataset updated
Jan 24, 2020
Dataset provided by
Johns Hopkins University
Authors
Freeberg, Mallory; Heydarian, Mohammad
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The data provided here are part of a Galaxy tutorial that analyzes ChIP-seq data from a study published by Wu et al., 2014 (DOI:10.1101/gr.164830.113). The goal of this study was to investigate "the dynamics of occupancy and the role in gene regulation of the transcription factor Tal1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation." To this end, ChIP-seq experiments were performed in multiple mouse cell types including a G1E cell line and megakaryocytes, the two cell types represented here. The dataset contains biological replicate Tal1 ChIP-seq and input control experiments (*.fastqsanger files). Because of the long processing time for the large original files, we have downsampled the original raw data files to include only reads that align to chromosome 19 and a subset of interesting genomic loci (ChIPseq_regions_of_interest_v4.bed) pulled from the Wu et al. publication. Also included is a gene annotation file (RefSeq_gene_annotations_mm10.bed) with gene names added for viewing in a genome browser.

Search
Clear search
Close search
Google apps
Main menu