De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis

Archive ouverte

Boeva, Valentina | Surdez, Didier | Guillon, Noëlle | Tirode, Franck | Fejes, Anthony, P | Delattre, Olivier | Barillot, Emmanuel

Edité par CCSD ; Oxford University Press -

International audience. Dramatic progress in the development of next-generation sequencing technologies has enabled accurate genome-wide characterization of the binding sites of DNA-associated proteins. This technique , baptized as ChIP-Seq, uses a combination of chromatin immunoprecipitation and massively parallel DNA sequencing. Other published tools that predict binding sites from ChIP-Seq data use only positional information of mapped reads. In contrast, our algorithm MICSA (Motif Identification for ChIP-Seq Analysis) combines this source of positional information with information on motif occurrences to better predict binding sites of transcription factors (TFs). We proved the greater accuracy of MICSA with respect to several other tools by running them on datasets for the TFs NRSF, GABP, STAT1 and CTCF. We also applied MICSA on a dataset for the oncogenic TF EWS-FLI1. We discovered >2000 binding sites and two functionally different binding motifs. We observed that EWS-FLI1 can activate gene transcription when (i) its binding site is located in close proximity to the gene transcription start site (up to $150 kb), and (ii) it contains a microsat-ellite sequence. Furthermore, we observed that sites without microsatellites can also induce regulation of gene expression-positively as often as negatively-and at much larger distances (up to $1 Mb).

Suggestions

Du même auteur

The Oncogenic EWS-FLI1 Protein Binds In Vivo GGAA Microsatellite Sequences with Potential Transcriptional Activation Function

Archive ouverte | Guillon, Noëlle | CCSD

International audience. The fusion between EWS and ETS family members is a key oncogenic event in Ewing tumors and important EWS-FLI1 target genes have been identified. However, until now, the search for EWS-FLI1 ta...

Transcriptional Programs Define Intratumoral Heterogeneity of Ewing Sarcoma at Single-Cell Resolution

Archive ouverte | Aynaud, Marie-Ming | CCSD

International audience. EWSR1-FLI1, the chimeric oncogene specific for Ewing sarcoma (EwS), induces a cascade of signaling events leading to cell transformation. However, it remains elusive how genetically homogeneo...

Heterogeneity of neuroblastoma cell identity defined by transcriptional circuitries

Archive ouverte | Boeva, Valentina | CCSD

International audience. Neuroblastoma is a tumor of the peripheral sympathetic nervous system(1), derived from multipotent neural crest cells (NCCs). To define core regulatory circuitries (CRCs) controlling the gene...

Chargement des enrichissements...