River reach-level machine learning estimation of nutrient concentrations in Great Britain
- 1UK Centre for Ecology and Hydrology (UKCEH), United Kingdom
- 2The University of Manchester, United Kingdom
- 3Met Office, United Kingdom
Nitrogen (N) and phosphorus (P) are essential nutrients necessary for plant growth and to support life in aquatic ecosystems. However, excessive N and P can lead to algal blooms that deplete oxygen and lead to fish death and the release of toxins that are harmful to humans. Predictions of N and P in rivers are typically calculated at station or grid (>1km) scale; therefore, it is difficult to visualize the evolution of water quality as water travels downstream. Using a high-resolution reach-scale river network and associating each reach with land cover fractions and catchment descriptors, we trained random forest models against aggregated data (2010-2020) from the Environmental Agency Open Water Quality Data Archive for 2343 stations to predict long-term nitrate and orthophosphate concentrations at each river reach in Great Britain (GB). We separate the model-training and predictions for different seasons to investigate the potential difference in feature importance. Our model predicted concentrations with an average testing coefficient of determination (R 2 ) of 0.71 for nitrate and 0.58 for orthophosphate using five-fold cross-validation. Our model shows slightly better performance for higher Strahler stream order, highlighting the challenges for making predictions in small streams. Our results reveal that arable and horticultural land use is the strongest and most reliable predictor for nitrate, while floodplain extents and standard percentage runoff are stronger predictors for orthophosphate. Nationally, higher orthophosphate concentrations are observed in urbanized areas. This study demonstrates the joint use of a river network model and machine learning can readily help provide a river-network understanding on the spatial distribution of water quality levels.
Keywords: River network, machine learning, Nutrients, Water Quality, random forest
Received: 21 Jun 2023;
Accepted: 17 Aug 2023.
Copyright: © 2023 Tso, Magee, Huxley, Eastman and Fry. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Mx. Chak-Hau M. Tso, UK Centre for Ecology and Hydrology (UKCEH), Wallingford, OX10 8BB, Oxfordshire, United Kingdom