Imogene: identification of motifs and cis-regulatory modules underlying gene co-regulation

Open archive

Rouault, Hervé | Santolini, Marc | Schweisguth, François | Hakim, Vincent

Published by CCSD; Oxford University Press

International audience. Cis-regulatory modules (CRMs) and motifs play a central role in tissue- and condition-specific gene expression. Here we present Imogene, an ensemble of statistical tools that we have developed to facilitate their identification and implemented in publicly available software. Starting from a small training set of mammalian or fly CRMs that drive similar gene expression profiles, Imogene determines de novo the cis-regulatory motifs that underlie this co-expression. It can then predict, on a genome-wide scale, other CRMs with a regulatory potential similar to that of the training set. Imogene bypasses the need for large datasets for statistical analyses by making central use of the information provided by the sequenced genomes of multiple species, based on the developed statistical tools and explicit models for transcription factor binding site evolution. We test Imogene on characterized tissue-specific mouse developmental CRMs. Its ability to identify CRMs with the same specificity based on its de novo created motifs is comparable to that of previously evaluated 'motif-blind' methods. We further show, both in flies and in mammals, that Imogene's de novo generated motifs are sufficient to discriminate CRMs related to different developmental programs. Notably, relying purely on sequence data, Imogene performs as well in this discrimination task as a previously reported learning algorithm based on Chromatin Immunoprecipitation (ChIP) data for multiple transcription factors at multiple developmental stages.

Suggestions

By the same author

Genome-wide analyses of Shavenbaby target genes reveals distinct features of enhancer organization

Open archive | Menoret, Delphine | CCSD

International audience. Background: Developmental programs are implemented by regulatory interactions between Transcription Factors (TFs) and their target genes, which remain poorly understood. While recent studies ...

Self-organized Notch dynamics generate stereotyped sensory organ patterns in Drosophila

Open archive | Corson, Francis | CCSD

International audience. INTRODUCTION: Spatial patterning in developing multicellular organisms relies on positional cues and cell-cell interactions. Stereotyped sensory organ arrangements in Drosophila are commonly ...

MyoD reprogramming requires Six1 and Six4 homeoproteins: genome-wide cis-regulatory module analysis

Open archive | Santolini, Marc | CCSD

International audience. Myogenic regulatory factors of the MyoD family have the ability to reprogram differentiated cells toward a myogenic fate. In this study, we demonstrate that Six1 or Six4 are required for the ...
