Comparative study on supervised learning methods for identifying phytoplankton species

Archive ouverte

Phan, Thi-Thu-Hong | Poisson Caillault, Emilie | Bigand, André

Edité par CCSD ; IEEE -

International audience. — Phytoplankton plays an important role in marine ecosystem. It is defined as a biological factor to assess marine quality. The identification of phytoplankton species has a high potential for monitoring environmental, climate changes and for evaluating water quality. However, phytoplankton species identification is not an easy task owing to their variability and ambiguity due to thousands of micro and pico-plankton species. Therefore, the aim of this paper is to build a framework for identifying phytoplankton species and to perform a comparison on different features types and classifiers. We propose a new features type extracted from raw signals of phytoplankton species. We then analyze the performance of various classifiers on the proposed features type as well as two other features types for finding the robust one. Through experiments, it is found that Random Forest using the proposed features gives the best classification results with average accuracy up to 98.24%.

Suggestions

Du même auteur

eDTWBI: Effective Imputation Method for Univariate Time Series

Archive ouverte | Phan, Thi-Thu-Hong | CCSD

International audience. Missing data frequently occur in many applied domains and pose serious problems such as loss of efficiency and unreliable results for various approaches. Many real applications require comple...

Comparative Study on Univariate Forecasting Methods for Meteorological Time Series

Archive ouverte | Phan, Thi-Thu-Hong | CCSD

International audience. Time series forecasting has an important role in many real applications in meteorology and environment to understand phenomena as climate change and to adapt monitoring strategy. This paper a...

Dynamic time warping-based imputation for univariate time series data

Archive ouverte | Phan, Thi-Thu-Hong | CCSD

International audience. Time series with missing values occur in almost any domain of applied sciences. Ignoring missing values can lead to a loss of efficiency and unreliable results, especially for large missing s...

Chargement des enrichissements...