Discovering novelty in sequential patterns: application for analysis of microarray data on Alzheimer disease

Archive ouverte

Bringay, Sandra | Roche, Mathieu | Teisseire, Maguelonne | Poncelet, Pascal | Abdel Rassoul, Ronza | Verdier, Jean-Michel | Devau, Gina

Edité par CCSD ; Stud Health Technol Inform -

[Departement_IRSTEA]Territoires [TR1_IRSTEA]SYNERGIE. International audience. Analyzing microarrays data is still a great challenge since existing methods produce huge amounts of useless results. We propose a new method called NoDisco for discovering novelties in gene sequences obtained by applying data-mining techniques to microarray data. Method: We identify popular genes, which are often cited in the literature, and innovative genes, which are linked to the popular genes in the sequences but are not mentioned in the literature. We also identify popular and innovative sequences containing these genes. Biologists can thus select interesting sequences from the two sets and obtain the k-best documents. Results: We show the efficiency of this method by applying it on real data used to decipher the mechanisms underlying Alzheimer disease. Conclusion: The first selection of sequences based on popularity and innovation help experts focus on relevant sequences while the top-k documents help them understand the sequences.

Suggestions

Du même auteur

Identification of Gene Expression Changes in the Brain of Microcebus Murinus During Aging by Using Data Mining

Archive ouverte | Devau, Gina | CCSD

National audience

Distinct transcriptome expression of the temporal cortex of the primate Microcebus murinus during brain aging versus Alzheimer's disease-like pathology.

Archive ouverte | Abdel Rassoul, Ronza | CCSD

International audience. Aging is the primary risk factor of neurodegenerative disorders such as Alzheimer's disease (AD). However, the molecular events occurring during brain aging are extremely complex and still la...

Mining microarray data to predict the histological grade of a breast cancer

Archive ouverte | Fabrègue, Mickaël | CCSD

[Departement_IRSTEA]Territoires [TR1_IRSTEA]SYNERGIE. BACKGROUND: The aim of this study was to develop an original method to extract sets of relevant molecular biomarkers (gene sequences) that can be used for class ...

Chargement des enrichissements...