Application of imputation methods for missing values of PM10 and O3 data: Interpolation, moving average and K-nearest neighbor methods
Journal: Environmental Health Engineering and Management Journal (Vol.8, No. 3)Publication Date: 2021-08-30
Authors : Parisa Saeipourdizaj Parvin Sarbakhsh Akbar Gholampour;
Page : 215-226
Keywords : Air pollution; Algorithms; Environmental pollutants; Spatio-temporal analysis; Humans;
Abstract
Background: PIn air quality studies, it is very often to have missing data due to reasons such as machine failure or human error. The approach used in dealing with such missing data can affect the results of the analysis. The main aim of this study was to review the types of missing mechanism, imputation methods, application of some of them in imputation of missing of PM10 and O3 in Tabriz, and compare their efficiency. Methods: Methods of mean, EM algorithm, regression, classification and regression tree, predictive mean matching (PMM), interpolation, moving average, and K-nearest neighbor (KNN) were used. PMM was investigated by considering the spatial and temporal dependencies in the model. Missing data were randomly simulated with 10, 20, and 30% missing values. The efficiency of methods was compared using coefficient of determination (R2), mean absolute error (MAE) and root mean square error (RMSE). Results: Based on the results for all indicators, interpolation, moving average, and KNN had the best performance, respectively. PMM did not perform well with and without spatio-temporal information. Conclusion: Given that the nature of pollution data always depends on next and previous information, methods that their computational nature is based on before and after information indicated better performance than others, so in the case of pollutant data, it is recommended to use these methods.
Other Latest Articles
- Assessment of toxicity and kinetic effects of erythromycin on activated sludge consortium by fast respirometry method
- Feasibility study of the application of treated wastewater for the irrigation of forest species in a Saharan area
- Association of urinary triclosan and methyl-triclosan levels with predictive indicators of cardiovascular disease and obesity in children and adolescents in 2020 (case study: Kerman, Iran)
- Climate change and its effects on farm workers
- Health sector’s flood response plan: A comprehensive review
Last modified: 2021-09-28 17:53:22