Search for a command to run...
<strong class="journal-contentHeaderColor">Abstract.</strong> The online version of the Regional Air Quality Deterministic Prediction System (RAQDPS) is a chemical weather forecast system that has been employed operationally by Environment and Climate Change Canada (ECCC) since 2009. It is run twice daily to produce 72 hour forecasts of hourly 10 km abundance fields of three key predictands, NO<sub>2</sub>, O<sub>3</sub>, and PM<sub>2.5</sub> total mass, as well as other gas-phase chemical species, PM<sub>2.5</sub> chemical components, and dry and wet deposition for Canada, the contiguous U.S., and northern Mexico. Version 023 of the RAQDPS (RAQDPS023) went into service at ECCC in December 2021 and was replaced by the RAQDPS025 in June 2024. A companion paper by Moran et al. (2025) describes the RAQDPS023 in detail. In this paper we present the results of a five-year performance evaluation of prospective and retrospective annual air quality (AQ) simulations made with the RAQDPS023. The annual simulations considered were the first year of RAQDPS023 forecasts in 2021/22 and four years of retrospective annual simulations for the 2013‒2016 period that used historical, year-specific emissions. Forecasts made by the RAQDPS-FW023, a duplicate operational system to the RAQDPS023 except for the addition of near-real-time (NRT) biomass burning (BB) emissions, were also evaluated for the 2021/22 period. A NRT measurement data set consisting of hourly NO<sub>2</sub>, O<sub>3</sub>, and PM<sub>2.5</sub> surface measurements for Canada and the U.S. was used for the 2021/22 evaluation whereas a much more extensive set of air-chemistry and precipitation-chemistry measurements was used for the 2013‒2016 evaluations. Some evaluation results were also compared with results for the 2010‒2019 period for forecasts made by earlier operational versions of the RAQDPS and with evaluation results for several peer AQ forecast models. In addition to looking at a number of highly aggregated “headline” scores, many stratified analyses were also performed, including evaluations by network, season, month, hour of day, region, and land-use type. Consideration of simulations for multiple years with the same model and year-specific input emissions helped to identify systematic model errors by reducing the influence of year-to-year variations in meteorology, and a comprehensive evaluation for many species for 2013‒2016 supported by stratified analyses provided diagnostic insights that allowed the scientific basis for the RAQDPS023 forecasts to be assessed (i.e., “right answers for the right reasons?”). Although one confounding factor for this study was the sizable reduction in the emissions of some pollutants in North America that occurred from 2013 to 2021, it was found that the trends in AQ observations over this period agreed with the year-specific description of emissions used for the five annual simulations from a rank-ordered perspective. While RAQDPS023 evaluation scores for hourly NO<sub>2</sub> and O<sub>3</sub> volume mixing ratio forecasts were found to be competitive with peer models and often met suggested performance benchmarks for the five simulation years, another key finding was that the RAQDPS023 forecasts consistently underpredicted hourly PM<sub>2.5</sub> total mass concentrations for all months in 2021/22 and for the majority of months in 2013‒2016. The largest underpredictions occurred in summer and at rural stations whereas overpredictions often occurred in the cold season at urban stations. The model also missed the observed bimodality in monthly PM<sub>2.5 </sub>concentrations and exaggerated the observed diurnal variations in hourly PM<sub>2.5</sub> concentrations. Additional evaluations with daily PM<sub>2.5</sub> chemical composition measurements and daily gravimetric PM<sub>2.5</sub> total mass measurements were also examined to better understand the hourly PM<sub>2.5</sub> underpredictions. Consistent overpredictions of elemental carbon and sea salt concentrations and underpredictions of sulfate concentration were identified, but scores for predictions of daily gravimetric PM<sub>2.5</sub> total mass were better than those for hourly PM<sub>2.5</sub> total mass, directing attention to differences in measurement methods. SO<sub>2</sub> and HNO<sub>3</sub> levels were also found to be overpredicted in general while NH<sub>3</sub> levels were underpredicted: these three gas-phase species are all PM<sub>2.5</sub> precursors, which raises concerns about some process representations such as those for sulfur oxidation and gas-phase dry deposition. As well, springtime O<sub>3</sub> levels were underpredicted while isoprene levels were consistently overpredicted. The impact of BB emissions on predictions of NO<sub>2</sub>, O<sub>3</sub>, and PM<sub>2.5</sub> was also characterized in detail by comparing evaluation results for the 2021/22 RAQDPS023 and RAQDPS-FW023 forecasts. Negligible impact was found for monthly NO<sub>2</sub> forecasts when BB emissions were included, but monthly O<sub>3</sub> forecast scores were modestly improved and monthly PM<sub>2.5 </sub>forecast scores were markedly improved from July to September 2021, as well as summer and annual scores. Taken together, the results of this comprehensive multi-year evaluation point to a number of RAQDPS023 system components where improvements are desirable. These results also provide a strong benchmark against which to compare the performance of future versions of the RAQDPS.