Space-time trends of PM2.5 constituents in the conterminous United States estimated by a machine learning approach, 2005-2015

Environ Int. 2018 Dec;121(Pt 2):1137-1147. doi: 10.1016/j.envint.2018.10.029. Epub 2018 Nov 7.

Abstract

Particulate matter with aerodynamic diameter less than 2.5 μm (PM2.5) is a complex mixture of chemical constituents emitted from various emission sources or through secondary reactions/processes; however, PM2.5 is regulated mostly based on its total mass concentration. Studies to identify the impacts on climate change, visibility degradation and public health of different PM2.5 constituents are hindered by limited ground measurements of PM2.5 constituents. In this study, national models were developed based on random forest algorithm, one of machine learning methods that is of high predictive capacity and able to provide interpretable results, to predict concentrations of PM2.5 sulfate, nitrate, organic carbon (OC) and elemental carbon (EC) across the conterminous United States from 2005 to 2015 at the daily level. The random forest models achieved high out-of-bag (OOB) R2 values at the daily level, and the mean OOB R2 values were 0.86, 0.82, 0.71 and 0.75 for sulfate, nitrate, OC and EC, respectively, over 2005-2015. The long-term temporal trends of PM2.5 sulfate, nitrate, OC and EC predictions agreed well with their corresponding ground measurements. The annual mean of predicted PM2.5 sulfate and EC levels across the conterminous United States decreased substantially from 2005 to 2015; while concentrations of predicted PM2.5 nitrate and OC decreased and fluctuated during the study period. The annual prediction maps captured the characterized spatial patterns of the PM2.5 constituents. The distributions of annual mean concentrations of sulfate and nitrate were generally regional in the extent that sulfate decreased from east to west smoothly with enhancement in California and nitrate had higher concentration in Midwest, Metro New York area, and California. OC and EC had regional high concentrations in the Southeast and Northwest as well as localized high levels around urban centers. The spatial patterns of PM2.5 constituents were consistent with the distributions of their emission sources and secondary processes and transportation. Hence, the national models developed in this study could provide supplementary evaluations of spatio-temporal distributions of PM2.5 constituents with full time-space coverages in the conterminous United States, which could be beneficial to assess the impacts of PM2.5 constituents on radiation budgets and visibility degradation, and support exposure assessment for regional to national health studies at county or city levels to understand the acute and chronic toxicity and health impacts of PM2.5 constituents, and consequently provide scientific evidence for making targeted and effective regulations of PM2.5 pollution.

Keywords: Elemental carbon; Nitrate; Organic carbon; PM(2.5); Random forest; Sulfate.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Air Pollutants / analysis*
  • California
  • Carbon / analysis
  • Cities
  • Environmental Monitoring / methods*
  • Humans
  • Machine Learning*
  • New York
  • Nitrogen Oxides / analysis
  • Particulate Matter / analysis*
  • Transportation
  • United States

Substances

  • Air Pollutants
  • Nitrogen Oxides
  • Particulate Matter
  • Carbon