Skip to main navigation Skip to search Skip to main content

Analyzing histone ChIP-seq data with a bin-based probability of being signal

  • Vivian Hecht
  • , Kevin Dong
  • , Sreshtaa Rajesh
  • , Polina Shpilker
  • , Siddarth Wekhande
  • , Noam Shoresh

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Histone ChIP-seq is one of the primary methods for charting the cellular epigenomic landscape, the components of which play a critical regulatory role in gene expression. Analyzing the activity of regulatory elements across datasets and cell types can be challenging due to shifting peak positions and normalization artifacts resulting from, for example, differing read depths, ChIP efficiencies, and target sizes. Moreover, broad regions of enrichment seen in repressive histone marks often evade detection by commonly used peak callers. Here, we present a simple and versatile method for identifying enriched regions in ChIP-seq data that relies on estimating a gamma distribution fit to non-overlapping 5kB genomic bins to establish a global background. We use this distribution to assign a probability of being signal (PBS) between zero and one to each 5 kB bin. This approach, while lower in resolution than typical peak-calling methods, provides a straightforward way to identify enriched regions and compare enrichments among multiple datasets, by transforming the data to values that are universally normalized and can be readily visualized and integrated with downstream analysis methods. We demonstrate applications of PBS for both broad and narrow histone marks, and provide several illustrations of biological insights which can be gleaned by integrating PBS scores with downstream data types.

Original languageEnglish
Article numbere1011568
JournalPLoS Computational Biology
Volume19
Issue number10 October
DOIs
StatePublished - Oct 2023

Funding

The authors were funded by the National Human Genome Research Institute (https://www. genome.gov/; UM1HG011986 and UM1HG009390), and the Gene Regulation Observatory at the Broad Institute (https://www. broadinstitute.org/gene-regulation-observatorygro). N.S. received all awards. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Fingerprint

Dive into the research topics of 'Analyzing histone ChIP-seq data with a bin-based probability of being signal'. Together they form a unique fingerprint.

Cite this