Assessing NO2 Concentration and Model Uncertainty with High Spatiotemporal Resolution across the Contiguous United States Using Ensemble Model Averaging

Author/Creator ORCID

Date

2019-12-18

Department

Program

Citation of Original Publication

Di, Qian; Amini, Heresh; Shi, Liuhua; Kloog, Itai; Silvern, Rachel; Kelly, James; Sabath, M. Benjamin; Choirat, Christine; Koutrakis, Petros; Lyapustin, Alexei; Wang, Yujie; Mickley, Loretta J.; Schwartz, Joel; Assessing NO2 Concentration and Model Uncertainty with High Spatiotemporal Resolution across the Contiguous United States Using Ensemble Model Averaging; Environmental Science and Technology 54, 3, 1372-1384 (2019); https://pubs.acs.org/doi/abs/10.1021/acs.est.9b03358

Rights

This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
Public Domain Mark 1.0
This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.

Subjects

Abstract

NO₂ is a combustion byproduct that has been associated with multiple adverse health outcomes. To assess NO₂ levels with high accuracy, we propose the use of an ensemble model to integrate multiple machine learning algorithms, including neural network, random forest, and gradient boosting, with a variety of predictor variables, including chemical transport models. This NO₂ model covers the entire contiguous U.S. with daily predictions on 1-km-level grid cells from 2000 to 2016. The ensemble produced a cross-validated R² of 0.788 overall, a spatial R² of 0.844, and a temporal R² of 0.729. The relationship between daily monitored and predicted NO₂ is almost linear. We also estimated the associated monthly uncertainty level for the predictions and address-specific NO₂ levels. This NO₂ estimation has a very high spatiotemporal resolution and allows the examination of the health effects of NO₂ in unmonitored areas. We found the highest NO₂ levels along highways and in cities. We also observed that nationwide NO₂ levels declined in early years and stagnated after 2007, in contrast to the trend at monitoring sites in urban areas, where the decline continued. Our research indicates that the integration of different predictor variables and fitting algorithms can achieve an improved air pollution modeling framework.