Assessing the effect of sample bias correction in species distribution models

Archive ouverte

Dubos, Nicolas | Préau, Clémentine | Lenormand, Maxime | Papuga, Guillaume | Monsarrat, Sophie | Denelle, Pierre | Le Louarn, Marine | Heremans, Stien | May, Roel | Roche, Philip | Luque, Sandra

Edité par CCSD ; Elsevier -

International audience. Open-source biodiversity databases contain a large number of species occurrence records but are often spatially biased; which affects the reliability of species distribution models based on these records. Sample bias correction techniques require data filtering which comes at the cost of record numbers, or require considerable additional sampling effort. Since independent data is rarely available, assessment of the correction technique often relies solely on performance metrics computed using subsets of the availablebiaseddata, which may prove misleading. 2. Here, we assess the extent to which an acknowledged sample bias correction technique is likely to improve models' ability to predict species distributions in the absence of independent data. We assessed variation in model predictions induced by the aforementioned correction and model stochasticity; the variability between model replicates related to a random component (pseudo-absences sets and cross-validation subsets). We present, then, an index of the effect of correction relative to model stochasticity; the Relative Overlap Index (ROI). We investigated whether the ROI better represented the effect of correction than classic performance metrics (Boyce index, cAUC, AUC and TSS) and absolute overlap metrics (Schoener's D, Pearson's and Spearman's correlation coefficients) when considering data related to 64 vertebrate species and 21 virtual species with a generated sample bias. 3. When based on absolute overlaps and cross-validation performance metrics, we found that correction produced no significant effects. When considering its effect relative to model stochasticity, the effect of correction was strong for most species at one of the three sites. The use of virtual species enabled us to verify that the correction technique improved both distribution predictions and the biological relevance of the selected variables at the specific site, when these were not correlated with sample bias patterns. 4. In the absence of additional independent data, the assessment of sample bias correction based on subsample data may be misleading. We propose to investigate both the biological relevance of environmental variables selected, and, the effect of sample bias correction based on its effect relative to model stochasticity.

Suggestions

Du même auteur

Dispersal-based species pools as sources of connectivity area mismatches

Archive ouverte | Préau, Clémentine | CCSD

International audience. Context Prioritising is likely to differ depending on the species considered for connectivity assessments, leading to a lack of consensual decisions for territorial planning. Objectives The o...

Habitat connectivity in agricultural landscapes improving multi-functionality of constructed wetlands as nature-based solutions

Archive ouverte | Préau, Clémentine | CCSD

International audience. The prevention of biodiversity loss in agricultural landscapes to protect ecosystem stability and functions is of major importance in itself and for the maintenance of associated ecosystem se...

Protection gaps in Amazon floodplains will increase with climate change: Insight from the world's largest scaled freshwater fish

Archive ouverte | Dubos, Nicolas | CCSD

International audience. The Amazon floodplains represent important surfaces of highly valuable ecosystems, yet they remain neglected from protected areas. Although the efficiency of the protected area network of the...

Chargement des enrichissements...