Hierarchical classification of microorganisms based on high-dimensional phenotypic data

Open archive

Tafintseva, Valeria | Vigneau, Evelyne | Shapaval, Volha | Cariou, Véronique | Qannari, El Mostafa | Kohler, Achim

Published by CCSD; Wiley

The classification of microorganisms by high-dimensional phenotyping methods such as FTIR spectroscopy is often complicated by the complexity of microbial phylogenetic taxonomy. A hierarchical structure developed for such data can often facilitate the classification analysis. The hierarchical tree structure can either be imposed on a given set of phenotypic data by integrating the phylogenetic taxonomic structure, or be set up by revealing the inherent clusters in the phenotypic data. In this study, we compare different approaches to the hierarchical classification of microorganisms based on high-dimensional phenotypic data. A set of 19 different species of molds (filamentous fungi) obtained from the mycological strain collection of the Norwegian Veterinary Institute (Oslo, Norway) is used for the study. Hierarchical cluster analysis is performed to set up the classification trees. Classification algorithms such as artificial neural networks (ANN), partial least squares discriminant analysis (PLS-DA), and random forest (RF) are used and compared. The two methods ANN and RF outperformed all the other approaches, even though they did not use a predefined hierarchical structure. To our knowledge, the RF approach is used here for the first time to classify microorganisms by FTIR spectroscopy.
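As a rough illustration of the data-driven option described in the abstract (setting up a classification tree by hierarchical cluster analysis of the phenotypic data), the following minimal sketch clusters class-mean spectra agglomeratively. It is not the authors' implementation: the data are synthetic, the class labels (`species_A` to `species_D`) are hypothetical, and the choice of Euclidean distance with average linkage is an assumption, since the abstract does not state which distance or linkage was used.

```python
import math
import random


def mean_spectrum(spectra):
    """Average equal-length spectra channel by channel."""
    return [sum(channel) / len(spectra) for channel in zip(*spectra)]


def euclidean(a, b):
    """Euclidean distance between two spectra."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def build_tree(class_means):
    """Agglomerative clustering (average linkage) of class-mean spectra.

    Leaves are class labels; internal nodes are (left, right) tuples.
    """
    # Each cluster is stored as (subtree, labels contained in that subtree).
    clusters = [(label, [label]) for label in class_means]
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Average linkage: mean distance over all member pairs.
                dists = [
                    euclidean(class_means[a], class_means[b])
                    for a in clusters[i][1]
                    for b in clusters[j][1]
                ]
                d = sum(dists) / len(dists)
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        merged = (
            (clusters[i][0], clusters[j][0]),
            clusters[i][1] + clusters[j][1],
        )
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]


def leaves(tree):
    """List the class labels found under a node of the tree."""
    if isinstance(tree, str):
        return [tree]
    return leaves(tree[0]) + leaves(tree[1])


# Synthetic stand-in for spectra: four hypothetical classes, four channels.
# A/B and C/D are built to be similar, so they should share a branch.
rng = random.Random(0)
profiles = {
    "species_A": [1.0, 0.0, 0.0, 0.0],
    "species_B": [1.0, 0.2, 0.0, 0.0],
    "species_C": [0.0, 0.0, 1.0, 0.0],
    "species_D": [0.0, 0.0, 1.0, 0.2],
}
replicates = {
    label: [[v + rng.gauss(0.0, 0.01) for v in profile] for _ in range(5)]
    for label, profile in profiles.items()
}
class_means = {label: mean_spectrum(s) for label, s in replicates.items()}

tree = build_tree(class_means)
print(tree)
```

In a hierarchical classification scheme, each internal node of such a tree could hold its own local classifier that routes a sample to the left or right branch. The flat ANN and RF models mentioned in the abstract skip this step and predict the species directly.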

Suggestions

By the same author

ComDim: From multiblock data analysis to path modeling

Open archive | Cariou, Véronique | CCSD

ComDim (Common Dimensions) analysis was initially introduced within the context of sensometrics to analyze conventional and free choice sensory profiling data, and more generally multiblock d...

A new approach for the analysis of data and the clustering of subjects in a CATA experiment

Open archive | Llobell, Fabien | CCSD

A new approach for the analysis of the data and the clustering of the subjects in a Check All That Apply (CATA) experiment is outlined. It encompasses indices to assess the agreements among t...

Unsupervised multiblock data analysis: A unified approach and extensions

Open archive | Tchandao Mangamana, Essomanda | CCSD

