RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets

Archive ouverte

Thomas-Chollier, Morgane | Herrmann, Carl | Defrance, Matthieu | Sand, Olivier | Thieffry, Denis | Helden, Jacques, Van

Edité par CCSD ; Oxford University Press -

International audience. ChIP-seq is increasingly used to characterize transcription factor binding and chromatin marks at a genomic scale. Various tools are now available to extract binding motifs from peak data sets. However, most approaches are only available as command-line programs, or via a website but with size restrictions. We present peak-motifs , a computational pipeline that discovers motifs in peak sequences, compares them with databases, exports putative binding sites for visualization in the UCSC genome browser and generates an extensive report suited for both naive and expert users. It relies on time- and memory-efficient algorithms enabling the treatment of several thousand peaks within minutes. Regarding time efficiency, peak-motifs outperforms all comparable tools by several orders of magnitude. We demonstrate its accuracy by analyzing data sets ranging from 4000 to 1 28 000 peaks for 12 embryonic stem cell-specific transcription factors. In all cases, the program finds the expected motifs and returns additional motifs potentially bound by cofactors. We further apply peak-motifs to discover tissue-specific motifs in peak collections for the p300 transcriptional co-activator. To our knowledge, peak-motifs is the only tool that performs a complete motif analysis and offers a user-friendly web interface without any restriction on sequence size or number of peaks.

Suggestions

Du même auteur

A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs

Archive ouverte | Thomas-Chollier, Morgane | CCSD

International audience. This protocol explains how to use the online integrated pipeline 'peak-motifs' (http://rsat.ulb.ac.be/rsat/) to predict motifs and binding sites in full-size peak sets obtained by chromatin i...

RSAT 2015: Regulatory Sequence Analysis Tools

Archive ouverte | Medina-Rivera, Alejandra | CCSD

International audience. RSAT (Regulatory Sequence Analysis Tools) is a modular software suite for the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, appropria...

RSAT 2011: regulatory sequence analysis tools

Archive ouverte | Thomas-Chollier, Morgane | CCSD

International audience. The regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/ ) is a software suite that integrates a wide collection of modular tools for the detection of cis -regulatory element...

Chargement des enrichissements...