TY - JOUR
T1 - Evaluating the performance of habitat models for predicting the environmental flow requirements of benthic macroinvertebrates
AU - Theodoropoulos, Christos
AU - Vourka, Aikaterini
AU - Skoulikidis, Nikolaos
AU - Rutschmann, Peter
AU - Stamou, Anastasios
N1 - Publisher Copyright:
© 2018 International Association for Hydro-Environment Engineering and Research.
PY - 2018
Y1 - 2018
N2 - Although various methods are currently available for modelling the habitat preferences of aquatic biota, studies comparing the performance of data-driven habitat models are limited. In this study, we assembled a benthic-macroinvertebrate microhabitat-preference dataset and used it to evaluate the predictive accuracy of regression-based univariate Habitat Suitability Curves (HSC), Boosted Regression Trees (BRT), Random Forests (RF), fuzzy-logic-based models using the weighted average (FLWA), maximum membership (FLMM), mean of maximum (FLM) and centroid (FLC) defuzzification algorithms and fuzzy rule-based Bayesian inference (FRB). The results show that the BRT model was the most accurate, closely followed by RF, FRB, FLM and FLMM while the FLC and FLWA algorithms had the lowest performance. However, due to the imbalanced nature of the dataset and in contrast to the fuzzy rule-based algorithms, the HSC, BRT and RF models failed to accurately predict the habitat suitability in low-scored microhabitats. We conclude that, given balanced datasets, BRT and RF can be effectively used in habitat suitability modelling. For imbalanced datasets, a properly adjusted RF model can be applied but when the input dataset is large enough to provide sufficient data-driven IF–THEN rules to train an FRB, FLMM or FLM algorithm, these models will produce the most accurate predictions.
AB - Although various methods are currently available for modelling the habitat preferences of aquatic biota, studies comparing the performance of data-driven habitat models are limited. In this study, we assembled a benthic-macroinvertebrate microhabitat-preference dataset and used it to evaluate the predictive accuracy of regression-based univariate Habitat Suitability Curves (HSC), Boosted Regression Trees (BRT), Random Forests (RF), fuzzy-logic-based models using the weighted average (FLWA), maximum membership (FLMM), mean of maximum (FLM) and centroid (FLC) defuzzification algorithms and fuzzy rule-based Bayesian inference (FRB). The results show that the BRT model was the most accurate, closely followed by RF, FRB, FLM and FLMM while the FLC and FLWA algorithms had the lowest performance. However, due to the imbalanced nature of the dataset and in contrast to the fuzzy rule-based algorithms, the HSC, BRT and RF models failed to accurately predict the habitat suitability in low-scored microhabitats. We conclude that, given balanced datasets, BRT and RF can be effectively used in habitat suitability modelling. For imbalanced datasets, a properly adjusted RF model can be applied but when the input dataset is large enough to provide sufficient data-driven IF–THEN rules to train an FRB, FLMM or FLM algorithm, these models will produce the most accurate predictions.
KW - Boosted Regression Trees
KW - Habitat models
KW - Random Forests
KW - environmental flows
KW - fuzzy Bayesian
KW - fuzzy logic
UR - http://www.scopus.com/inward/record.url?scp=85049580158&partnerID=8YFLogxK
U2 - 10.1080/24705357.2018.1440360
DO - 10.1080/24705357.2018.1440360
M3 - Article
AN - SCOPUS:85049580158
SN - 2470-5357
VL - 3
SP - 30
EP - 44
JO - Journal of Ecohydraulics
JF - Journal of Ecohydraulics
IS - 1
ER -