Spatiotemporal variations of air pollutants and ozone prediction using machine learning algorithms in the Beijing-Tianjin-Hebei region from 2014 to 2021

Environ Pollut. 2022 Aug 1:306:119420. doi: 10.1016/j.envpol.2022.119420. Epub 2022 May 5.

Abstract

China was seriously affected by air pollution in the past decade, especially for particulate matter (PM) and emerging ozone pollution recently. In this study, we systematically examined the spatiotemporal variations of six air pollutants and conducted ozone prediction using machine learning (ML) algorithms in the Beijing-Tianjin-Hebei (BTH) region. The annual-average concentrations of CO, PM10, PM2.5 and SO2 decreased at a rate of 141, 11.0, 6.6 and 5.6 μg/m3/year, while a pattern of initial increase and later decrease was observed for NO2 and O3_8 h. The concentration of SO2, CO and NO2 was higher in Tangshan and Xingtai, while northern BTH region has lower levels of CO, NO2 and PM. Spatial variations of ozone were relatively small in the BTH region. Monthly variations of PM10 displayed an increase in March probably due to wind-blown dusts from Northwest China. A seasonal and diurnal pattern with summer and afternoon peaks was found for ozone, which was contrast with other pollutants. Further ML algorithms such as Random Forest (RF) model and Decision tree (DT) regression showed good ozone prediction performance (daily: R2 = 0.83 and 0.73, RMSE = 30.0 and 37.3 μg/m3, respectively; monthly: R2 = 0.93 and 0.88, RMSE = 12.1 and 15.8 μg/m3, respectively) based on 10-fold cross-validation. Both RF model and DT regression relied more on the spatial trend as higher temporal prediction performance was achieved. Solar radiation- and temperature-related variables presented high importance at daily level, whereas sea level pressure dominated at monthly level. The spatiotemporal heterogeneity in variable importance was further confirmed using case studies based on RF model. In addition, variable importance was possibly influenced by the emission reductions due to COVID-19 pandemic. Despite its possible weakness to capture ozone extremes, RF model was beneficial and suggested for predicting spatiotemporal variations of ozone in future studies.

MeSH terms

  • Air Pollutants* / analysis
  • Air Pollution* / analysis
  • Beijing
  • COVID-19*
  • China
  • Environmental Monitoring
  • Humans
  • Machine Learning
  • Nitrogen Dioxide / analysis
  • Ozone* / analysis
  • Pandemics
  • Particulate Matter / analysis

Substances

  • Air Pollutants
  • Particulate Matter
  • Ozone
  • Nitrogen Dioxide