Base-resolution models of transcription-factor binding reveal soft motif syntax

Žiga Avsec, Melanie Weilert, Avanti Shrikumar, Sabrina Krueger, Amr Alexandari, Khyati Dalal, Robin Fropf, Charles McAnany, Julien Gagneur, Anshul Kundaje, Julia Zeitlinger

Research output: Contribution to journalArticlepeer-review

275 Scopus citations

Abstract

The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)–nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.

Original languageEnglish
Pages (from-to)354-366
Number of pages13
JournalNature Genetics
Volume53
Issue number3
DOIs
StatePublished - Mar 2021

Fingerprint

Dive into the research topics of 'Base-resolution models of transcription-factor binding reveal soft motif syntax'. Together they form a unique fingerprint.

Cite this