%0 Journal Article %T A convolutional neural networks method for tropospheric ozone vertical distribution retrieval from Multi-AXis Differential Optical Absorption Spectroscopy measurements. %A Wang Z %A Tian X %A Xie P %A Xu J %A Zheng J %A Pan Y %A Zhang T %A Fan G %J Sci Total Environ %V 951 %N 0 %D 2024 Jul 26 %M 39067587 %F 10.753 %R 10.1016/j.scitotenv.2024.175049 %X The vertical distribution of tropospheric ozone (O3) is crucial for understanding atmospheric physicochemical processes. A Convolutional Neural Networks (CNN) method for the retrieval of tropospheric O3 vertical distribution from ground-based Multi-AXis Differential Optical Absorption Spectroscopy (MAX-DOAS) measurements to tackle the issue of stratospheric O3 absorption interference faced by MAX-DOAS in obtaining tropospheric O3 profiles. Firstly, a hybrid model, named PCA-F_Regression-SVR, is developed to screen features sensitive to O3 inversion based on the MAX-DOAS spectra and EAC4 reanalysis O3 profiles, which incorporates Principal Component Analysis (PCA), F_Regression function, and Support Vector Regression (SVR) algorithm. Thus, these screened features for ancillary inversion include the profiles of temperature, specific humidity, fraction of cloud coverage, eastward and northward wind, the profiles of SO2, NO2, and HCHO, as well as season and time features to serve as sensitive factors. Secondly, the preprocessed MAX-DOAS spectra dataset and the sensitive factor dataset are utilized as input, while the O3 profiles of the EAC4 reanalysis dataset incorporating the surface O3 concentrations are employed as output for constructing the CNN model. The Mean Absolute Percentage Error (MAPE) decreases from 26 % to approximately 19 %. Finally, the CNN model is applied for inversion and comparison of tropospheric O3 profiles using independent input data. The CNN model effectively reproduces the O3 profiles of the EAC4 dataset, showing a Gaussian-like spatial distribution with peaks primarily around 950 hPa (550 m). Since the reanalysis data used for model training has been smoothed, the CNN model is insensitive to extreme values. This behavior can be attributed to the MAPE loss function, which evaluates Absolute Percentage Errors (APEs) of O₃ concentration at all altitudes, resulting in varying retrieval accuracy across different altitudes while maintaining overall MAPE control. Temporally, the CNN model tends to overestimate surface O3 in summer by around 20 μg/m3, primarily due to the influence of the temperature feature in the sensitivity factor dataset. In conclusion, leveraging MAX-DOAS spectra enables the retrieval of tropospheric O3 vertical distribution through the established CNN model.