Smart City Gnosys

Smart city article details

Title Environmental Data Analytics For Smart Cities: A Machine Learning And Statistical Approach
ID_Doc 24215
Authors AlSalehy A.S.; Bailey M.
Year 2025
Published Smart Cities, 8, 3
DOI http://dx.doi.org/10.3390/smartcities8030090
Abstract Highlights: What are the main findings? CO pollution in Jubail shows strong diurnal and moderate weekly patterns, with local sources dominating spatial variation. Ensemble machine learning models, especially Extreme Gradient Boosting (XGBoost) and Categorical Boosting (CatBoost), achieved highly accurate CO forecasts (R2 > 0.95). What is the implication of the main finding? Predictive analytics enable proactive air quality management in smart cities, improving public health outcomes. Identifying pollution hotspots and weather interactions supports targeted interventions and smarter urban planning. Effectively managing carbon monoxide (CO) pollution in complex industrial cities like Jubail remains challenging due to the diversity of emission sources and local environmental dynamics. This study analyzes spatiotemporal CO patterns and builds accurate predictive models using five years (2018–2022) of data from ten monitoring stations, combined with meteorological variables. Exploratory analysis revealed distinct diurnal and moderate weekly CO cycles, with prevailing northwesterly winds shaping dispersion. Spatial correlation of CO was low (average 0.14), suggesting strong local sources, unlike temperature (0.92) and wind (0.5–0.6), which showed higher spatial coherence. Seasonal Trend decomposition (STL) confirmed stronger seasonality in meteorological factors than in CO levels. Low wind speeds were associated with elevated CO concentrations. Key predictive features, such as 3-h rolling mean and median values of CO, dominated feature importance. Spatiotemporal analysis highlighted persistent hotspots in industrial areas and unexpectedly high levels in some residential zones. A range of models was tested, with ensemble methods (Extreme Gradient Boosting (XGBoost) and Categorical Boosting (CatBoost)) achieving the best performance ((Formula presented.)) and XGBoost producing the lowest Root Mean Squared Error (RMSE) of 0.0371 ppm. This work enhances understanding of CO dynamics in complex urban–industrial areas, providing accurate predictive models ((Formula presented.)) and highlighting the importance of local sources and temporal patterns for improving air quality forecasts. © 2025 by the authors.
Author Keywords air quality monitoring; forecasting models; gas and weather data; machine learning for environmental data; multi-location data; predictive analytics; smart cities; spatiotemporal analysis; time-series analysis


Similar Articles


Id Similarity Authors Title Published
53645 View0.883Essamlali I.; Nhaila H.; El Khaili M.Supervised Machine Learning Approaches For Predicting Key Pollutants And For The Sustainable Enhancement Of Urban Air Quality: A Systematic ReviewSustainability (Switzerland), 16, 3 (2024)
36027 View0.877Zareba M.; Cogiel S.; Danek T.; Weglinska E.Machine Learning Techniques For Spatio-Temporal Air Pollution Prediction To Drive Sustainable Urban Development In The Era Of Energy And Data TransformationEnergies , 17, 11 (2024)
42864 View0.873Varde A.S.; Pandey A.; Du X.Prediction Tool On Fine Particle Pollutants And Air Quality For Environmental EngineeringSN Computer Science, 3, 3 (2022)
10552 View0.873Neo E.X.; Hasikin K.; Lai K.W.; Mokhtar M.I.; Azizan M.M.; Hizaddin H.F.; Razak S.A.; YantoArtificial Intelligence-Assisted Air Quality Monitoring For Smart City ManagementPeerJ Computer Science, 9 (2023)
25420 View0.872Bilotta S.; Ipsaro Palesi L.A.; Nesi P.Exploiting Open Data For Co2 Estimation Via Artificial Intelligence And Explainable AiExpert Systems with Applications, 291 (2025)
32137 View0.869Mekouar Y.; Saleh I.; Karim M.Integration Of Data Acquisition And Modeling For Ecological Simulation Of Co2 Emissions In Urban Traffic: The Case Of The City Of Paris2024 International Conference on Computer and Applications, ICCA 2024 (2024)
60587 View0.866Alshanbari A.H.; Altalhi S.H.; Alsharif S.S.; Alharthi G.M.; Althubiti R.W.; Meshref H.Using Machine Learning Techniques To Moderate Co2 Emission In Ksa Future Smart Cities2025 2nd International Conference on Advanced Innovations in Smart Cities, ICAISC 2025 (2025)
23737 View0.865Shahbazi Z.; Shahbazi Z.; Nowaczyk S.Enhancing Air Quality Forecasting Using Machine Learning TechniquesIEEE Access, 12 (2024)
42245 View0.862Faydi M.; Zrelli A.; Ezzedine T.Pm2.5 Prediction Using Deep Learning Models17th International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2023 - Proceedings (2023)
29779 View0.862Mohammed Q.H.; Namburu A.Hybrid Model And Framework For Predicting Air Pollutants In Smart CitiesJournal of Engineering and Sustainable Development, 28, 3 (2024)