Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome

Archive ouverte

Berard, Caroline | Magniette, Marie-Laure | Brunaud, Veronique | Aubourg, Sebastien | Robin, Stephane

Edité par CCSD ; De Gruyter -

Tiling arrays make possible a large-scale exploration of the genome thanks to probes which cover the whole genome with very high density, up to 2,000,000 probes. Biological questions usually addressed are either the expression difference between two conditions or the detection of transcribed regions. In this work, we propose to consider both questions simultaneously as an unsupervised classification problem by modeling the joint distribution of the two conditions. In contrast to previous methods, we account for all available information on the probes as well as biological knowledge such as annotation and spatial dependence between probes. Since probes are not biologically relevant units, we propose a classification rule for non-connected regions covered by several probes. Applications to transcriptomic and ChIP-chip data of Arabidopsis thaliana obtained with a NimbleGen tiling array highlight the importance of a precise modeling and of the region classification. The "TAHMMAnnot" package is implemented in R and C and is freely available from CRAN.

Consulter en ligne

Suggestions

Du même auteur

The RNA helicases AtMTR4 and HEN2 target specific subsets of nuclear transcripts for degradation by the nuclear exosome in Arabidopsis thaliana

Archive ouverte | Lange, Heike | CCSD

The RNA exosome is the major 3'-5' RNA degradation machine of eukaryotic cells and participates in processing, surveillance and turnover of both nuclear and cytoplasmic RNA. In both yeast and human, all nuclear functions of the ex...

GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response

Archive ouverte | Zaag, Rim | CCSD

publié Epub 2014 Nov 11. CATdb (http://urgv.evry.inra.fr/CATdb) is a database providing a public access to a large collection of transcriptomic data, mainly for Arabidopsis but also for other plants. This resource h...

CATdb: a public access to Arabidopsis transcriptome data from the URGV-CATMA platform

Archive ouverte | Gagnot, Séverine | CCSD

CATdb is a free resource available at http://urgv.evry.inra.fr/CATdb that provides public access to a large collection of transcriptome data for Arabidopsis thaliana produced by a single Complete Arabidopsis Transcriptome Micro Ar...

Chargement des enrichissements...