Search for a command to run...
Surface ozone (O<sub>3</sub>), one of the harmful air pollutants, generated significantly negative effects on human health and plants. Existing O<sub>3</sub> datasets with coarse spatiotemporal resolution and limited coverage, and the uncertainties of O<sub>3</sub> influential factors seriously restrain related epidemiology and air pollution studies. To tackle above issues, we proposed a novel scheme to estimate daily O<sub>3</sub> concentrations on a fine grid scale (1 km × 1 km) from 2018 to 2020 across China based on machine learning methods using hourly observed ground-level pollutant concentrations data, meteorological data, satellite data, and auxiliary data including digital elevation model (DEM), land use data (LUD), normalized difference vegetation index (NDVI), population (POP), and nighttime light images (NTL), and to identify the difference of influential factors of O<sub>3</sub> on diverse urbanization and topography conditions. Some findings were achieved. The correlation coefficients (R<sup>2</sup>) between O<sub>3</sub> concentrations and surface net solar radiation (SNSR), boundary layer height (BLH), 2 m temperature (T2M), 10 m v-component (MVW), and NDVI were 0.80, 0.40, 0.35, 0.30, and 0.20, respectively. The random forest (RF) demonstrated the highest validation R<sup>2</sup> (0.86) and lowest validation RMSE (13.74 μg/m<sup>3</sup>) in estimating O<sub>3</sub> concentrations, followed by support vector machine (SVM) (R<sup>2</sup> = 0.75, RMSE = 18.39 μg/m<sup>3</sup>), backpropagation neural network (BP) (R<sup>2</sup> = 0.74, RMSE = 19.26 μg/m<sup>3</sup>), and multiple linear regression (MLR) (R<sup>2</sup> = 0.52, RMSE = 25.99 μg/m<sup>3</sup>). Our China High-Resolution O<sub>3</sub> Dataset (CHROD) exhibited an acceptable accuracy at different spatial-temporal scales. Additionally, O<sub>3</sub> concentrations showed decreasing trend and represented obviously spatiotemporal heterogeneity across China from 2018 to 2020. Overall, O<sub>3</sub> was mainly affected by human activities in higher urbanization regions, while O<sub>3</sub> was mainly controlled by meteorological factors, vegetation coverage, and elevation in lower urbanization regions. The scheme of this study is useful and valuable in understanding the mechanism of O<sub>3</sub> formation and improving the quality of the O<sub>3</sub> dataset.
Published in: Environment International
Volume 170, pp. 107606-107606