100+ datasets found

E
ATAC-seq dataset
ega-archive.org
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ATAC-seq dataset [Dataset]. https://ega-archive.org/datasets/EGAD00001011135
Explore at:
License
https://ega-archive.org/dacs/EGAC00001002224https://ega-archive.org/dacs/EGAC00001002224
Description
This dataset contains ATAC-seq data performed in MM.1S cell line in ETOH (control) or Dexamethasone condition (Treatment)
f
ATACseq_QC_plot
figshare.com
search.datacite.org
pdf
Updated Feb 24, 2019
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chuanyu Liu; mingyue wang; xiaoyu wei; longqi liu (2019). ATACseq_QC_plot [Dataset]. http://doi.org/10.6084/m9.figshare.7763438.v1
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7763438.v1
Dataset updated
Feb 24, 2019
Dataset provided by
figshare
Authors
Chuanyu Liu; mingyue wang; xiaoyu wei; longqi liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) is a fundamental epigenomics approach and has been widely used in profiling the chromatin accessibility dynamics in multiple species. A comprehensive reference of ATAC-seq datasets for mammalian tissues is important for the understanding of regulatory specificity and developmental abnormality caused by genetic or environmental alterations. Here, we report a mouse ATAC-seq atlas by producing a total of 66 ATAC-seq profiles from 20 primary tissues of both male and female mice. The ATAC-seq read enrichment, fragment size distribution, and reproducibility between replicates demonstrated the high quality of the full dataset.
E
Chromatin accessibility (ATAC-seq) and transcriptome (RNA-seq) data from...
ega-archive.org
Updated Sep 11, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2017). Chromatin accessibility (ATAC-seq) and transcriptome (RNA-seq) data from immune cells for healthy young and healthy old subjects [Dataset]. https://ega-archive.org/datasets/EGAD00001003602
Explore at:
Dataset updated
Sep 11, 2017
License
https://ega-archive.org/dacs/EGAC00001000721https://ega-archive.org/dacs/EGAC00001000721
Description
Dataset consisting of:

(1) N=234 genome-wide chromatin accessibility (ATAC-seq) profiles for distinct N=21 healthy old and N=28 healthy young subjects. ATAC-seq biological samples provided for the following tissues: PBMC (N=24), CD14+ monocytes (N=18), CD8+ memory T cells (N=7), CD8+ naive T cells (N=7), CD4+ memory T cells (N=7), CD4+ naive T cells (N=7), and naive B cells (N=7).

(2) N=39 genome-wide transcription (RNA-seq) data for distinct N=15 healthy old and N=24 healthy young subjects' PBMCs.
f
DataSheet_1_Mapping open chromatin by ATAC-seq in bread wheat.docx
frontiersin.figshare.com
bin
Updated Jun 21, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Wang; Chuanye Chen; Chao He; Dijun Chen; Wenhao Yan (2023). DataSheet_1_Mapping open chromatin by ATAC-seq in bread wheat.docx [Dataset]. http://doi.org/10.3389/fpls.2022.1074873.s001
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.3389/fpls.2022.1074873.s001
Dataset updated
Jun 21, 2023
Dataset provided by
Frontiers
Authors
Xin Wang; Chuanye Chen; Chao He; Dijun Chen; Wenhao Yan
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Gene transcription is largely regulated by cis-regulatory elements. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) is an emerging technology that can accurately map cis-regulatory elements in animals and plants. However, the presence of cell walls and chloroplasts in plants hinders the extraction of high-quality nuclei, thereby affects the quality of ATAC-seq data. Meanwhile, it is tricky to perform ATAC-seq with different tissue types, especially for those with limited size and amount. Moreover, with rapid growth of ATAC-seq datasets from plants, powerful and easy-to-use data analysis pipelines for ATAC-seq, especially for wheat is lacking. Here, we provided an all-in-one solution for mapping open chromatin in wheat including both experimental and data analysis procedure. We efficiently obtained nuclei with less cell debris from various wheat tissues. High-quality ATAC-seq data from young spike and ovary, which are hard to harvest were generated. We determined that the saturation sequencing depth of wheat ATAC-seq is about 16 Gb. Particularly, we developed a powerful and easy-to-use online pipeline to analyze the wheat ATAC-seq data and this pipeline can be easily extended to other plant species. The method developed here will facilitate plant regulatory genome study not only for wheat but also for other plant species.
o
Chromatin immunoprecipitation and ATAC-Seq data from meduallar thymic...
omicsdi.org
xml
Updated Dec 12, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jakub Abramson (2016). Chromatin immunoprecipitation and ATAC-Seq data from meduallar thymic epithelial cells [ATAC-Seq] [Dataset]. https://www.omicsdi.org/dataset/geo/GSE90048
Explore at:
xmlAvailable download formats
Dataset updated
Dec 12, 2016
Authors
Jakub Abramson
Variables measured
Other
Description
Aire is a transcriptional regulator that induces promiscuous expression of thousands of tissue-restricted antigen (TRA) genes in medullary thymic epithelial cells (mTECs). While the target genes of Aire are well characterized, the transcriptional programs regulating its own expression remain elusive. We used Affymetrix microarrays to analyze the gene expression patterns of Aire expressing cells (mature mTECs and Thymic B cells) and compared them to control counterparts, namely immature mTECs, cortical Thymic epithelial cells and splenic B cells of tissue-restricted antigen (TRA) genes in medullary thymic epithelial cells (mTECs). While the target genes of Aire are well characterized, the transcriptional programs regulating its own expression remain elusive. We’ve used Assay for transposase-accessible chromatin using sequencing (ATAC-Seq) on the different thymic epithelial cell populations to assess chromatin accessibility around the Aire locus in these cells. Moreover, we’ve used the indexing-first chromatin immunoprecipitation (iChIP) technique to assess the occupancy of the Irf8 transcription factor in the Aire locus Overall design: Mature EpCAM+MHC-II high mTECs, Immature EpCAM+MHC-II low mTECs, and EpCAM+Ly51+ cTECs were flow-sorted from thymi isolated from thymi of C57BL/6 6weeks old mice. These cells were then subjected to ATAC-Seq. typically 10 thousand cells were used per replicate.
r
Single-cell ATAC sequencing data from: Single-cell genomics details the...
researchdata.se
datasetcatalog.nlm.nih.gov
+3more
Updated Jan 29, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Hanna Thorsson; Rasmus Henningsson; Noelia Puente-Moncada; Ludvig Sjöström; Helena Ågerstam; Pablo Peña-Martínez; Carl Sandén; Marianne Rissler; Anders Castor; Hanne Vibeke Marquart; Signe Modvig; Kajsa Paulsson; Cornelis Jan Pronk; Kjeld Schmiegelow; Axel Hyrenius Wittsten; Christina Orsmark-Pietras; Henrik Lilljebjörn; Thoas Fioretos (2025). Single-cell ATAC sequencing data from: Single-cell genomics details the maturation block in BCP-ALL and identifies therapeutic vulnerabilities in DUX4-r cases [Dataset]. http://doi.org/10.17044/SCILIFELAB.23592303
Explore at:
Unique identifier
https://doi.org/10.17044/SCILIFELAB.23592303
Dataset updated
Jan 29, 2025
Dataset provided by
Lund University
Authors
Hanna Thorsson; Rasmus Henningsson; Noelia Puente-Moncada; Ludvig Sjöström; Helena Ågerstam; Pablo Peña-Martínez; Carl Sandén; Marianne Rissler; Anders Castor; Hanne Vibeke Marquart; Signe Modvig; Kajsa Paulsson; Cornelis Jan Pronk; Kjeld Schmiegelow; Axel Hyrenius Wittsten; Christina Orsmark-Pietras; Henrik Lilljebjörn; Thoas Fioretos
License
Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
License information was derived automatically
Description
This dataset contains single-cell ATAC sequencing data from nineteen cases of childhood BCP-ALL and four samples of mononuclear cells from normal bone marrow from healthy donors. The dataset is available as raw sequencing reads (fastq; restricted access) or as an annotated ATAC dataset (h5ad). The libraries were prepared according to the manufacturer’s instructions (10x Genomics CG000169: Nuclei Isolation for Single Cell ATAC Sequencing; 10x Genomics CG000209: Chromium Single Cell ATAC Reagent Kits v1.1).) and sequenced on a Novaseq 6000.
f
Table_2_ATACgraph: Profiling Genome-Wide Chromatin Accessibility From...
frontiersin.figshare.com
datasetcatalog.nlm.nih.gov
pdf
Updated May 31, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rita Jui-Hsien Lu; Yen-Ting Liu; Chih Wei Huang; Ming-Ren Yen; Chung-Yen Lin; Pao-Yang Chen (2023). Table_2_ATACgraph: Profiling Genome-Wide Chromatin Accessibility From ATAC-seq.PDF [Dataset]. http://doi.org/10.3389/fgene.2020.618478.s003
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2020.618478.s003
Dataset updated
May 31, 2023
Dataset provided by
Frontiers
Authors
Rita Jui-Hsien Lu; Yen-Ting Liu; Chih Wei Huang; Ming-Ren Yen; Chung-Yen Lin; Pao-Yang Chen
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Assay for transposase-accessible chromatin using sequencing data (ATAC-seq) is an efficient and precise method for revealing chromatin accessibility across the genome. Most of the current ATAC-seq tools follow chromatin immunoprecipitation sequencing (ChIP-seq) strategies that do not consider ATAC-seq-specific properties. To incorporate specific ATAC-seq quality control and the underlying biology of chromatin accessibility, we developed a bioinformatics software named ATACgraph for analyzing and visualizing ATAC-seq data. ATACgraph profiles accessible chromatin regions and provides ATAC-seq-specific information including definitions of nucleosome-free regions (NFRs) and nucleosome-occupied regions. ATACgraph also allows identification of differentially accessible regions between two ATAC-seq datasets. ATACgraph incorporates the docker image with the Galaxy platform to provide an intuitive user experience via the graphical interface. Without tedious installation processes on a local machine or cloud, users can analyze data through activated websites using pre-designed workflows or customized pipelines composed of ATACgraph modules. Overall, ATACgraph is an effective tool designed for ATAC-seq for biologists with minimal bioinformatics knowledge to analyze chromatin accessibility. ATACgraph can be run on any ATAC-seq data with no limit to specific genomes. As validation, we demonstrated ATACgraph on human genome to showcase its functions for ATAC-seq interpretation. This software is publicly accessible and can be downloaded at https://github.com/RitataLU/ATACgraph.
RNA-seq and ATAC-seq Matrix
figshare.com
zip
Updated Dec 13, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yang Liu; mingyue wang; longqi liu (2019). RNA-seq and ATAC-seq Matrix [Dataset]. http://doi.org/10.6084/m9.figshare.10490228.v2
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.10490228.v2
Dataset updated
Dec 13, 2019
Dataset provided by
Figsharehttp://figshare.com/
Authors
Yang Liu; mingyue wang; longqi liu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The emergence of eusociality is one of the major events in evolution. Although several previous studies have investigated the mechanism underlying caste differentiation and social behavior of eusocial insects including ants and honeybees, the molecular circuits governing the sociality of these insects remain obscure. In this study, we profiled the brain transcriptome and chromatin accessibility of all categories of adult castes: queens, males, gynes and workers in Monomorium pharaonis which is a typical caste-dependent eusocial insect. We created a comprehensive dataset including 16 RNA-seq and 16 ATAC-seq profiles from 4 biological replicates. We also demonstrated strong reproducibility of the datasets and identified specific genes and open chromatin regions in the genome that may be associated with caste differentiation. Overall, our data will be a valuable resource for further study of the mechanisms underlying eusocial insect behavior, particularly the role of brain in the control of eusociality.
maxATAC Data
zenodo.org
data.niaid.nih.gov
application/gzip
Updated Jun 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tareian Cazares; Tareian Cazares; Faiz Rizvi; Faiz Rizvi; Balaji Iyer; Xiaoting Chen; Xiaoting Chen; Michael Kotliar; Michael Kotliar; Joseph A. Wayman; Anthony Bejjani; Anthony Bejjani; Omer Donmez; Omer Donmez; Benjamin Wronowski; Benjamin Wronowski; Sreeja Parameswaran; Leah Kottyan; Leah Kottyan; Artem Barski; Artem Barski; Matthew Weirauch; Matthew Weirauch; V. B. Surya Prasath; V. B. Surya Prasath; Emily Miraldi; Emily Miraldi; Balaji Iyer; Joseph A. Wayman; Sreeja Parameswaran (2022). maxATAC Data [Dataset]. http://doi.org/10.5281/zenodo.6761768
Explore at:
application/gzipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.6761768
Dataset updated
Jun 29, 2022
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Tareian Cazares; Tareian Cazares; Faiz Rizvi; Faiz Rizvi; Balaji Iyer; Xiaoting Chen; Xiaoting Chen; Michael Kotliar; Michael Kotliar; Joseph A. Wayman; Anthony Bejjani; Anthony Bejjani; Omer Donmez; Omer Donmez; Benjamin Wronowski; Benjamin Wronowski; Sreeja Parameswaran; Leah Kottyan; Leah Kottyan; Artem Barski; Artem Barski; Matthew Weirauch; Matthew Weirauch; V. B. Surya Prasath; V. B. Surya Prasath; Emily Miraldi; Emily Miraldi; Balaji Iyer; Joseph A. Wayman; Sreeja Parameswaran
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract

Transcription factors read the genome, fundamentally connecting DNA sequence to gene expression across diverse cell types. Determining how, where, and when TFs bind chromatin will advance our understanding of gene regulatory networks and cellular behavior. The 2017 ENCODE-DREAM in vivo Transcription-Factor Binding Site (TFBS) Prediction Challenge highlighted the value of chromatin accessibility data to TFBS prediction, establishing state-of-the-art methods for TFBS prediction from DNase-seq. However, the more recent Assay-for-Transposase-Accessible-Chromatin (ATAC)-seq has surpassed DNase-seq as the most widely-used chromatin accessibility profiling method. Furthermore, ATAC-seq is the only such technique available at single-cell resolution from standard commercial platforms. While ATAC-seq datasets grow exponentially, suboptimal motif scanning is unfortunately the most common method for TFBS prediction from ATAC-seq. To enable community access to state-of-the-art TFBS prediction from ATAC-seq, we (1) curated an extensive benchmark dataset (127 TFs) for ATAC-seq model training and (2) built “maxATAC”, a suite of user-friendly, deep neural network models for genome-wide TFBS prediction from ATAC-seq in any cell type. With models available for 127 human TFs, maxATAC is the first collection of high-performance TFBS prediction models for ATAC-seq.

Repository Overview

This repository contains all of the processed training data used by maxATAC for model training and benchmarking. All directories have the extension .tar.gz .

In this repository you will find the directories:

ATAC_Peaks: ATAC-seq peak files called with MACS2. These files are generated for the hg38 reference genome. The files are have the extension .bed.gz. ATAC_Signal_File: ATAC-seq signal file. This file has been read-depth normalized and min-max normalized between 0,1 using the 99th percentile max value. These files are presented as bigwig files with a .bw extension. ChIP_Binding_File: ChIP-seq signal tracks. These files are the binary signal tracks in bigwig format that are found in the ChIP_Peaks directory. ChIP_Peaks: ChIP-seq peaks files. This directory contains the ENCODE IDR peak sets and peak sets created in the maxATAC publication. These files have the extension .bed.gz. Full_Models: Current set of 127 maxATAC TF models. This directory includes the information for thresholding and the .h5 model files. hg38: This directory includes the hg38 reference genome information that was used in this publication. Prediction_and_Benchmarking: This directory contains all of the predictions for chr1 used for benchmarking in a round-robin training approach. Tn5_CutSites: This directory contains the Tn5 cut sites that have been shifted +4 on the (+) strand and -5 on the (-) strand. The cut sites were then slopped 20 bp using bedtools slop. These files are presented as bed files that have been bzipped. Each file represents an individual biological replicate. scATAC: This directory includes data used for scATAC-seq based predictions.

For additional details please see the maxATAC GitHub Repository and bioRxiv pre-print.
Classification of ATAC-seq data of ATL samples.
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 2, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Azusa Tanaka; Yasuhiro Ishitsuka; Hiroki Ohta; Akihiro Fujimoto; Jun-ichirou Yasunaga; Masao Matsuoka (2023). Classification of ATAC-seq data of ATL samples. [Dataset]. http://doi.org/10.1371/journal.pcbi.1008422.t005
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pcbi.1008422.t005
Dataset updated
Jun 2, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Azusa Tanaka; Yasuhiro Ishitsuka; Hiroki Ohta; Akihiro Fujimoto; Jun-ichirou Yasunaga; Masao Matsuoka
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Clinical subtypes of ATL samples and “closest cell type” computed by our algorithm.
E
CRC cell line ATAC-seq
ega-archive.org
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CRC cell line ATAC-seq [Dataset]. https://ega-archive.org/datasets/EGAD50000000296
Explore at:
License
https://ega-archive.org/dacs/EGAC50000000023https://ega-archive.org/dacs/EGAC50000000023
Description
ATAC-Seq data for C32, CACO2, CL11, HT29, SW403, SW480, SW948 MSS CRC cell lines, and HCEC-1CT normal colon cell line
Training material for the mapping and quantification of single-cell ATAC-seq...
zenodo.org
data.niaid.nih.gov
application/gzip, bin
Updated Apr 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pavankumar Videm; Pavankumar Videm (2023). Training material for the mapping and quantification of single-cell ATAC-seq 10X Datasets [Dataset]. http://doi.org/10.5281/zenodo.7855968
Explore at:
application/gzip, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.7855968
Dataset updated
Apr 23, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Pavankumar Videm; Pavankumar Videm
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The data provided here is part of the Galaxy Training Network tutorial that analyses 10x genomics single-cell ATAC-seq data from the 10x platform. The original data is from 1k Peripheral Blood Mononuclear Cells (PBMCs) from a Healthy Donor.

Due to time constraints during training, the datasets were subsampled to reads that map to chromosome 21 only.

The 10x Genomics Datasets follow the Creative Commons Attribution license.

There is an additional count matrix in Anndata format created from full datasets.
Data from: Multimodal integration of single cell ATAC-seq data enables...
zenodo.org
application/gzip, bin +2
Updated Jun 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kewei Xiong; Kewei Xiong (2025). Multimodal integration of single cell ATAC-seq data enables highly accurate delineation of clinically relevant tumor cell subpopulations [Dataset]. http://doi.org/10.5281/zenodo.15641500
Explore at:
bin, csv, application/gzip, txtAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.15641500
Dataset updated
Jun 11, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Kewei Xiong; Kewei Xiong
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jun 11, 2025
Description
Data used for tutorial.

fragments.tsv.gz(.tbi), singlecell.csv, filtered_peak_bc_matrix.h5: scATAC-seq pre-processing and cell annotation

peak.mat.rds: corrected chromatin accessibility profile

cancer.cnv.csv: copy number profile of cancer cells

.assignment.txt, .clones.txt: cluster assignment, and genotype of each cluster by the CBM method (https://github.com/zhyu-lab/cbm)

snv.mat.rds, denoised.mat.rds: raw and denoised SNV matrix
s
References and test datasets for the Cactus pipeline
figshare.scilifelab.se
researchdata.se
txt
Updated Jan 15, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jerome Salignon; Lluis Milan Arino; Maxime Garcia; Christian Riedel (2025). References and test datasets for the Cactus pipeline [Dataset]. http://doi.org/10.17044/scilifelab.20171347.v4
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.17044/scilifelab.20171347.v4
Dataset updated
Jan 15, 2025
Dataset provided by
Karolinska Institutet
Authors
Jerome Salignon; Lluis Milan Arino; Maxime Garcia; Christian Riedel
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Overview This item contains references and test datasets for the Cactus pipeline. Cactus (Chromatin ACcessibility and Transcriptomics Unification Software) is an mRNA-Seq and ATAC-Seq analysis pipeline that aims to provide advanced molecular insights on the conditions under study.

Test datasets The test datasets contain all data needed to run Cactus in each of the 4 supported organisms. This include ATAC-Seq and mRNA-Seq data (.fastq.gz), parameter files (.yml) and design files (*.tsv). They were were created for each species by downloading publicly available datasets with fetchngs (Ewels et al., 2020) and subsampling reads to the minimum required to have enough DAS (Differential Analysis Subsets) for enrichment analysis. Datasets downloaded: - Worm and Humans: GSE98758 - Fly: GSE149339 - Mouse: GSE193393

References One of the goals of Cactus is to make the analysis as simple and fast as possible for the user while providing detailed insights on molecular mechanisms. This is achieved by parsing all needed references for the 4 ENCODE (Dunham et al., 2012; Stamatoyannopoulos et al., 2012; Luo et al., 2020) and modENCODE (THE MODENCODE CONSORTIUM et al., 2010; Gerstein et al., 2010) organisms (human, M. musculus, D. melanogaster and C. elegans). This parsing step was done with a Nextflow pipeline with most tools encapsulated within containers for improved efficiency and reproducibility and to allow the creation of customized references. Genomic sequences and annotations were downloaded from Ensembl (Cunningham et al., 2022). The ENCODE API (Luo et al., 2020) was used to download the CHIP-Seq profiles of 2,714 Transcription Factors (TFs) (Landt et al., 2012; Boyle et al., 2014) and chromatin states in the form of 899 ChromHMM profiles (Boix et al., 2021; van der Velde et al., 2021) and 6 HiHMM profiles (Ho et al., 2014). Slim annotations (cell, organ, development, and system) were parsed and used to create groups of CHIP-Seq profiles that share the same annotations, allowing users to analyze only CHIP-Seq profiles relevant to their study. 2,779 TF motifs were obtained from the Cis-BP database (Lambert et al., 2019). GO terms and KEGG pathways were obtained via the R packages AnnotationHub (Morgan and Shepherd, 2021) and clusterProfiler (Yu et al., 2012; Wu et al., 2021), respectively.

Documentation More information on how to use Cactus and how references and test datasets were created is available on the documentation website: https://github.com/jsalignon/cactus.

ATAC-seq count data from primary pediatric AML samples

zenodo.org

tsv

Updated Apr 28, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Norio Shiba; Norio Shiba; Hayashi Yasuhide; Thomas Eder; Thomas Eder; Hayashi Yasuhide (2025). ATAC-seq count data from primary pediatric AML samples [Dataset]. http://doi.org/10.5281/zenodo.14943880

Explore at:

tsvAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.14943880

Dataset updated

Apr 28, 2025

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Norio Shiba; Norio Shiba; Hayashi Yasuhide; Thomas Eder; Thomas Eder; Hayashi Yasuhide

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

ATAC-seq Data from Primary Pediatric AML Samples

Data Source

Primary pediatric AML patient ATAC-seq data were obtained from Yokohama City University (YCU) and published in:

Yamato G, Kawai T, Shiba N, Ikeda J, Hara Y, Ohki K, Tsujimoto S. I., Kaburagi T, Yoshida K, Shiraishi Y, Miyano S, Kiyokawa N, Tomizawa D, Shimada A, Sotomatsu M, Arakawa H, Adachi S, Taga T, Horibe K, Ogawa S, Hata K, Hayashi Y. Genome-wide DNA methylation analysis in pediatric acute myeloid leukemia. Blood Adv. 2022 Jun 14;6(11):3207-3219. doi: 10.1182/bloodadvances.2021005381. PMID: 35008106; PMCID: PMC9198913.

Bioinformatic Processing

A modified version of the ATAC-seq Data Processing Pipeline (Reichl, S. et al. Ultimate ATAC-seq Data Processing & Quantification Pipeline. (2024)) was applied to the raw BAM files, accessible at: https://github.com/epigen/atacseq_pipeline.

The pipeline utilized fastp (Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018)) for adapter removal and Bowtie2 (Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012)) for read alignment to the GRCh38 (hg38) human reference genome.

Duplicate marking was performed with samblaster (Faust, G. G. & Hall, I. M. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 30, 2503–2505 (2014)). The aligned BAM files were sorted, indexed, and filtered for ENCODE blacklisted regions using samtools (Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009)).

Counts over exons were obtained using featureCounts (Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014)).

Data Structure

The table contains the following columns:

Column Name	Description
NCBI_id	RefSeq (NCBI Reference Sequence) accession number for a specific mRNA transcript
Gene_symbol	Official gene symbol
ENTREZ_id	Entrez Gene ID
YCU_NUP98-NSD1+PRDM16high-AM	Read counts per gene for this sample
YCU_NUP98-NSD1+PRDM16high-HR	Read counts per gene for this sample
YCU_RUNX1-RUNX1T1-SR	Read counts per gene for this sample
YCU_t11-19MLL_KA	Read counts per gene for this sample
YCU_t11-19MLL-NR	Read counts per gene for this sample

e
ATAC-seq data in mouse and zebrafish limbs
ebi.ac.uk
Updated Jan 5, 2015
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Juan Tena; Andrew Gehrke; Igor Schneider; Carlos Gómez-Marín; Tetsuya Nakamura; Mayuri Chandran; Elisa de la Calle-Mustienes; Ingo Braasch; John Postlethwait; José Gómez-Skarmeta; Neil Shubin (2015). ATAC-seq data in mouse and zebrafish limbs [Dataset]. https://www.ebi.ac.uk/biostudies/arrayexpress/studies/E-GEOD-61065
Explore at:
Dataset updated
Jan 5, 2015
Authors
Juan Tena; Andrew Gehrke; Igor Schneider; Carlos Gómez-Marín; Tetsuya Nakamura; Mayuri Chandran; Elisa de la Calle-Mustienes; Ingo Braasch; John Postlethwait; José Gómez-Skarmeta; Neil Shubin
Description
Assay for Transposable Accessible Chromatin (ATAC) reveals a genome wide view of areas of open chromatin at very high resolution, which are often associated with regulatory activity. The ATAC-seq technology uses a Tn5 transposase loaded with nex-generation sequencing primers in order to simultaneously fragment areas of open chromatin and ligate adapters.
Pan-Cancer chromatin analysis of the human vtRNA genes - Supplementary...
zenodo.org
csv
Updated May 24, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rafael Sebastián Fort; Rafael Sebastián Fort; María Ana Duhagon; María Ana Duhagon (2021). Pan-Cancer chromatin analysis of the human vtRNA genes - Supplementary Tables [Dataset]. http://doi.org/10.5281/zenodo.4501399
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4501399
Dataset updated
May 24, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Rafael Sebastián Fort; Rafael Sebastián Fort; María Ana Duhagon; María Ana Duhagon
License
Attribution 1.0 (CC BY 1.0)https://creativecommons.org/licenses/by/1.0/
License information was derived automatically
Description
Tables S1-S5. ATAC-seq and DNA methylation data for vtRNA promoters in primary tumors and normal adjacent tissue samples. CSV spreadsheets: Table_S1_ATAC-seq_data_500bp: All ATAC-seq data of vtRNAs promoter (500bp) data for primary tumor samples; Table_S2_DNA_methylation_500bp: All DNA methylation data and ATAC-seq data of vtRNAs promoter (500bp) data for total primary tumor and normal adjacent samples; Table_S3_DNA_methylation_NORMAL: All DNA methylation data of vtRNAs promoter (500bp) data for normal adjacent samples; Table_S4_DNA_methylation_TUMOR: All DNA methylation data of vtRNAs promoter (500bp) data for primary tumor samples; Table_S5_Normal_&_Tumor_matched: All DNA methylation data of vtRNAs promoter (500bp) data for primary tumor and normal adjacent samples.

Table S6. VtRNAs Transcription Factors Binding and KEEG enriched terms. CSV spreadsheets: Table_S6_Binding_Factors: Transcription factors identified in the cell line K562 as ChIP-seq Peaks by ENCODE 3 project and KEEG_terms: enriched KEGG pathway terms (FDR < 0.05).

Tables S7-S8. DNA methylation, ATAC-seq data and associated survival data for primary tumors. CSV spreadsheets: Table_S7_DNA-methylation_Survival_data: All DNA methylation data of vtRNAs promoter (500bp) and survival data for primary tumor samples; Table_S8_ATAC-seq_Survival_data: ATAC-seq data of vtRNAs promoter (500bp) and survival data for primary tumor samples.

Tables S9-S10. Correlation of ATAC-seq values between vtRNA and all genome promoters in primary tumor samples. CSV spreadsheets: Table_S9_ATAC-seq_gene_promoter_spearman_correlation: Spearman correlation values of all promoter genes and vtRNAs in primary tumors samples; Table_S10_vtRNAs_pathway_enrichment_and_cluster_chromosome_localization_analysis: vtRNAs vtRNA1-1, vtRNA1-2, vtRNA1-3 and vtRNA2-1 pathway enrichment and cluster chromosome localization data.

Tables S11-S14. ATAC-seq and DNA methylation data for vtRNA promoters in primary tumors and the associated Immune Subtypes data. CSV spreadsheets: Table_S11_Immune_Subtypes_DNA_methylation_data: All DNA methylation data of vtRNAs promoter (500bp) and Immune Subtypes data for primary tumor samples; Table_S12_Spearman_corr_vtRNAs_Immune_Subtypes_DNA_methylation_data: Spearman correlation values of DNA methylation data of vtRNAs promoter (500bp) and Immune Subtypes data for primary tumor samples; Table_S13_Immune_Subtypes_ATAC-seq_data: All ATAC-seq data of vtRNAs promoter (500bp) and Immune Subtypes data for primary tumor samples; Table_S14_Spearman_corr_vtRNAs_Immune_Subtypes_ATAC-seq_data: Spearman correlation values of ATAC-seq data of vtRNAs promoter (500bp) and Immune Subtypes data for primary tumor samples.
s
Spatially-resolved chromatin accessibility and transcriptomic profiling of...
figshare.scilifelab.se
researchdata.se
Updated Jan 15, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Margherita Zamboni; Enric Llorens Bobadilla; Xinsong Chen; Johan Hartman (2025). Spatially-resolved chromatin accessibility and transcriptomic profiling of human breast cancer [Dataset]. http://doi.org/10.17044/scilifelab.21378279.v1
Explore at:
Unique identifier
https://doi.org/10.17044/scilifelab.21378279.v1
Dataset updated
Jan 15, 2025
Dataset provided by
Karolinska Institutet
Authors
Margherita Zamboni; Enric Llorens Bobadilla; Xinsong Chen; Johan Hartman
License
https://www.scilifelab.se/data/restricted-access/https://www.scilifelab.se/data/restricted-access/
Description
Human breast cancer OMICs data generated for the publication "Solid phase capture and profiling of open chromatin by spatial ATAC"

Abstract from the publication: Current methods for epigenomic profiling are limited in the ability to obtain genome wide information with spatial resolution. Here we introduce spatial ATAC, a method that integrates transposase-accessible chromatin profiling in tissue sections with barcoded solid-phase capture to perform spatially resolved epigenomics. We show that spatial ATAC enables the discovery of the regulatory programs underlying spatial gene expression during mouse organogenesis, lineage differentiation and in human pathology.

Dataset description The dataset includes spatially-resolved chromatin accessibility profiling performed on three fresh-frozen tissue sections of HER2+ breast cancer. We provide raw data in the form of fastq files, along with processed feature barcode matrices, metadata, and photomicrographs of the tissue slices. Additionally the dataset contains spatially-resolved gene expression profiling of tissue sections from the same specimen. For this too, we provide raw and processed data, along with the metadata information.

Spatial transcriptomics data were generated using 10X Genomics' Visium platform, while spatial ATAC data were created using a method introduced in our publication, which relies on an analogous workflow. Samples were sequenced on Illumina Nextseq 550 or 2000 and raw data were processed with CellRanger Gene Expression or ATAC-seq pipelines.

To apply for conditional access to the dataset, please contact datacentre@scilifelab.se.
f
Table_4_GUAVA: A Graphical User Interface for the Analysis and Visualization...
frontiersin.figshare.com
datasetcatalog.nlm.nih.gov
xlsx
Updated Jun 4, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mayur Divate; Edwin Cheung (2023). Table_4_GUAVA: A Graphical User Interface for the Analysis and Visualization of ATAC-seq Data.xlsx [Dataset]. http://doi.org/10.3389/fgene.2018.00250.s005
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.3389/fgene.2018.00250.s005
Dataset updated
Jun 4, 2023
Dataset provided by
Frontiers
Authors
Mayur Divate; Edwin Cheung
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Assay for Transposase Accessible Chromatin with high-throughput sequencing (ATAC-seq) is a powerful genomic technology that is used for the global mapping and analysis of open chromatin regions. However, for users to process and analyze such data they either have to use a number of complicated bioinformatic tools or attempt to use the currently available ATAC-seq analysis software, which are not very user friendly and lack visualization of the ATAC-seq results. Because of these issues, biologists with minimal bioinformatics background who wish to process and analyze their own ATAC-seq data by themselves will find these tasks difficult and ultimately will need to seek help from bioinformatics experts. Moreover, none of the available tools provide complete solution for ATAC-seq data analysis. Therefore, to enable non-programming researchers to analyze ATAC-seq data on their own, we developed a tool called Graphical User interface for the Analysis and Visualization of ATAC-seq data (GUAVA). GUAVA is a standalone software that provides users with a seamless solution from beginning to end including adapter trimming, read mapping, the identification and differential analysis of ATAC-seq peaks, functional annotation, and the visualization of ATAC-seq results. We believe GUAVA will be a highly useful and time-saving tool for analyzing ATAC-seq data for biologists with minimal or no bioinformatics background. Since GUAVA can also operate through command-line, it can easily be integrated into existing pipelines, thus providing flexibility to users with computational experience.
m
The prebuilt database for NIRA (Network inference and cell type...
figshare.manchester.ac.uk
zip
Updated Apr 5, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xin Ma (2023). The prebuilt database for NIRA (Network inference and cell type Identification by integrating scRNA-seq with ATAC-seq) [Dataset]. http://doi.org/10.48420/21313626.v1
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.48420/21313626.v1
Dataset updated
Apr 5, 2023
Dataset provided by
University of Manchester
Authors
Xin Ma
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
As chromatin accessibility provides rich information on transcription factor binding process, for a given TF-based raw regulon, firstly we test whether the TF motif is enriched in this regulon. To perform this efficiently, we have built our own database in BED format, which contains all available TF motifs and their occurrences across the potential binding regions (TSS$\pm10$ kb) of all HUMAN genes.

Facebook

Twitter

Click to copy link

Link copied

Cite

ATAC-seq dataset [Dataset]. https://ega-archive.org/datasets/EGAD00001011135

ATAC-seq dataset

Explore at:

License

https://ega-archive.org/dacs/EGAC00001002224https://ega-archive.org/dacs/EGAC00001002224

Description

This dataset contains ATAC-seq data performed in MM.1S cell line in ETOH (control) or Dexamethasone condition (Treatment)

Clear search

Close search

Google apps

Main menu

ATAC-seq dataset

ATACseq_QC_plot

Chromatin accessibility (ATAC-seq) and transcriptome (RNA-seq) data from...

DataSheet_1_Mapping open chromatin by ATAC-seq in bread wheat.docx

Chromatin immunoprecipitation and ATAC-Seq data from meduallar thymic...

Single-cell ATAC sequencing data from: Single-cell genomics details the...

Table_2_ATACgraph: Profiling Genome-Wide Chromatin Accessibility From...

RNA-seq and ATAC-seq Matrix

maxATAC Data

Classification of ATAC-seq data of ATL samples.

CRC cell line ATAC-seq

Training material for the mapping and quantification of single-cell ATAC-seq...

Data from: Multimodal integration of single cell ATAC-seq data enables...

References and test datasets for the Cactus pipeline

ATAC-seq count data from primary pediatric AML samples

ATAC-seq Data from Primary Pediatric AML Samples

Data Source

Bioinformatic Processing

Data Structure

ATAC-seq data in mouse and zebrafish limbs

Pan-Cancer chromatin analysis of the human vtRNA genes - Supplementary...

Spatially-resolved chromatin accessibility and transcriptomic profiling of...

Table_4_GUAVA: A Graphical User Interface for the Analysis and Visualization...

The prebuilt database for NIRA (Network inference and cell type...

ATAC-seq dataset