D2.2 Quality annotation protocols for phenotypic platform data

Archive ouverte

Hilgert, Nadine | Sanchez, Isabelle | Millet, Emilie J. | van Eeuwijk, Fred

Edité par CCSD -

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 731013. This publication reflects only the view of the author, and the European Commission cannot be held responsible for any use which may be made of the information contained therein.

EPPN 2020.The present deliverable specifically addresses quality annotation protocols for phenotyping platform data. We first explain what cleaning phenotypic data is and why it is important to do it and keep track of how it was done. We then provide platform users with clearly described and defined rules for outliers identification and annotation in an automatic and traceable way. An outlier is usually defined as an observation that appears to be inconsistent with the remainder of the dataset. After visiting a number of facilities and discussing with platform users, we have defined three types of outliers to annotate in the phenotypic data: (1) time points within a time course, (2) whole time courses of one or more variables and (3) a whole plant, defined here as a biological replicate deviating from the overall distribution of plants on a multi-criteria basis. This classification of outliers was proven relevant by the consortium partners. In this document, we propose procedures to identify them. For the first two types of outliers, statistical methods already exist and have been adapted and applied to datasets from differentplatform/species. The «plant outlier» type is new and a method has recently been published (Alvarez Prado et al., 2019). The common idea here is to provide annotated data to the user who, in the end, will decide whether or not to keep the annotated points, time course or plant for further analyses.

Suggestions

Du même auteur

statgenHTP: High Throughput Phenotyping (HTP) Data Analysis

Archive ouverte | Millet, Emilie J. | CCSD

Phenotypic analysis of data coming from high throughput phenotyping (HTP) platforms, including different types of outlier detection, spatial analysis, and parameter estimation. The package is being developed within the EPPN2020 pr...

Modelling strategies for assessing and increasing the effectiveness of new phenotyping techniques in plant breeding

Archive ouverte | van Eeuwijk, Fred | CCSD

International audience. New types of phenotyping tools generate large amounts of data on many aspects of plant physiology and morphology with high spatial and temporal resolution. These new phenotyping data are pote...

PHIS, a plant science ontology-driven Phenotyping Hybrid Information System

Archive ouverte | Neveu, Pascal | CCSD

Poster illustrant la publication Neveu et al. 2018 https://doi.org/10.1111/nph.15385. International audience. Plant phenomics datasets are unprecedented resources for identifying and testing novel mechanisms and mod...

Chargement des enrichissements...