GenoGAM: Genome-wide generalized additive models for ChIP-Seq analysis

Georg Stricker, Alexander Engelhardt, Daniel Schulz, Matthias Schmid, Achim Tresch, Julien Gagneur

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Motivation: Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) is a widely used approach to study protein-DNA interactions. Often, the quantities of interest are the differential occupancies relative to controls, between genetic backgrounds, treatments, or combinations thereof. Current methods for differential occupancy of ChIP-Seq data rely however on binning or sliding window techniques, for which the choice of the window and bin sizes are subjective. Results: Here, we present GenoGAM (Genome-wide Generalized Additive Model), which brings the well-established and flexible generalized additive models framework to genomic applications using a data parallelism strategy. We model ChIP-Seq read count frequencies as products of smooth functions along chromosomes. Smoothing parameters are objectively estimated from the data by crossvalidation, eliminating ad hoc binning and windowing needed by current approaches. GenoGAM provides base-level and region-level significance testing for full factorial designs. Application to a ChIP-Seq dataset in yeast showed increased sensitivity over existing differential occupancy methods while controlling for type I error rate. By analyzing a set of DNA methylation data and illustrating an extension to a peak caller, we further demonstrate the potential of GenoGAM as a generic statistical modeling tool for genome-wide assays.

Original languageEnglish
Pages (from-to)2258-2265
Number of pages8
JournalBioinformatics
Volume33
Issue number15
DOIs
StatePublished - 1 Aug 2017

Fingerprint

Dive into the research topics of 'GenoGAM: Genome-wide generalized additive models for ChIP-Seq analysis'. Together they form a unique fingerprint.

Cite this