Estimation of nitrogen and phosphorus concentrations from water quality surrogates using machine learning in the Tri An Reservoir, Vietnam

Environ Monit Assess. 2020 Nov 26;192(12):789. doi: 10.1007/s10661-020-08731-2.

Abstract

Surface water eutrophication due to excessive nutrients has become a major environmental problem around the world in the past few decades. Among these nutrients, nitrogen and phosphorus are two of the most important harmful cyanobacterial bloom (HCB) drivers. A reliable prediction of these parameters, therefore, is necessary for the management of rivers, lakes, and reservoirs. The aim of this study is to test the suitability of the powerful machine learning (ML) algorithm, random forest (RF), to provide information on water quality parameters for the Tri An Reservoir (TAR). Three species of nitrogen and phosphorus, including nitrite (N-NO2-), nitrate (N-NO3-), and phosphate (P-PO43-), were empirically estimated using the field observation dataset (2009-2014) of six surrogates of total suspended solids (TSS), total dissolved solids (TDS), turbidity, electrical conductivity (EC), chemical oxygen demand (COD), and biochemical oxygen demand (BOD5). Field data measurement showed that water quality in the TAR was eutrophic with an up-trend of N-NO3- and P-PO43- during the study period. The RF regression model was reliable for N-NO2-, N-NO3-, and P-PO43- prediction with a high R2 of 0.812-0.844 for the training phase (2009-2012) and 0.888-0.903 for the validation phase (2013-2014). The results of land use and land cover change (LUCC) revealed that deforestation and shifting agriculture in the upper region of the basin were the major factors increasing nutrient loading in the TAR. Among the meteorological parameters, rainfall pattern was found to be one of the most influential factors in eutrophication, followed by average sunshine hour. Our results are expected to provide an advanced assessment tool for predicting nutrient loading and for giving an early warning of HCB in the TAR.

Keywords: Harmful cyanobacterial blooms; Random forest; Tri An eutrophic reservoir; Water quality.

MeSH terms

  • Environmental Monitoring
  • Eutrophication
  • Machine Learning
  • Nitrogen / analysis
  • Phosphorus* / analysis
  • Vietnam
  • Water Quality*

Substances

  • Phosphorus
  • Nitrogen