Freshwater lakes worldwide suffer from eutrophication caused by excessive nutrient loads, particularly nitrogen (N) and phosphorus (P) from wastewater and runoff, affecting aquatic life and public health. Using a large (1800 km2) subtropical lake as an example (Lake Okeechobee, Florida, USA), this study aims to (1) predict key water quality parameters using machine learning (ML) algorithms based on easily measurable variables, (2) identify spatial patterns of these parameters, and (3) determine environmental drivers influencing turbidity levels. The study employs four ML algorithms-Extreme Gradient Boosting (XGB), Light Gradient-Boosting Machine (LGBM), Support Vector Regression (SVR), and Random Forests (RFs)-to predict total phosphorus (TP), total nitrogen (TN), nitrate + nitrite (NOx-N), and turbidity, via station-specific and lake-wide modeling approaches. The station-specific models uncover spatial patterns, while the lake-wide models support operational decision-making. Results indicated that lake stage (water level), water temperature, and, most notably, turbidity were the main nutrient predictors, with XGB demonstrating superior prediction performance. Spatial analysis using K-means clustering identified three distinct lake regions based on nutrient levels and turbidity. Due to its importance, SHapley Additive exPlanations (SHAP) were employed to identify and quantify environmental factors affecting turbidity. Inflows and lake stage were found as primary drivers of turbidity near lake inlets, while wind speed and air temperature affected turbidity in the middle of the lake. This research advances the understanding of lake water quality dynamics, emphasizing the importance of frequent monitoring of turbidity and its environmental drivers for enhanced management and future mitigation efforts.
Keywords: Explainable machine learning; Lake Okeechobee; Nitrogen; Phosphorus; Turbidity.
Copyright © 2024 Elsevier Ltd. All rights reserved.