Predicting Air Quality Index in Başakşehir, Istanbul with Hybrid AI Models: Unveiling Key Drivers Through Catboost-Based SHAP and Feature Importance Analysis

dc.contributor.author Akiner, Muhammed Ernur
dc.contributor.author Katipoglu, Okan Mert
dc.contributor.author Cintas, Emre
dc.date.accessioned 2026-03-26T14:57:43Z
dc.date.available 2026-03-26T14:57:43Z
dc.date.issued 2025
dc.description Akiner, Muhammed Ernur/0000-0002-5192-2473; Çintaş, Emre/0000-0002-4954-5816; en_US
dc.description.abstract Urban air quality influences public health, ecosystem sustainability and economic productivity. This study focuses on predicting the Air Quality Index (AQI) in Ba & scedil;ak & scedil;ehir, Istanbul. The study proposes a hybrid artificial intelligence (AI) model that amalgamates Categorical Boosting (CatBoost), Shapley Additive Explanations (SHAP) and the feature importance analysis. The dataset encompasses various meteorological parameters, including Tempmax, Tempmin, Temp, Dew, Humidity, Precip, Windspeed, Sea level pressure, Cloud cover, Solar radiation, Solar energy and UV index, in addition to air quality parameters such as PM10, SO2, CO, NO2, NOX, NO and O3. These variables serve as inputs for models like ANN, BAT-ANN,- BBO-ANN, GWO-ANN, HCA-ANN, CatBoost and CNN; the intent is to enhance the accuracy of the AQI prediction. When the combined set of variables were employed as inputs, the most precise results emerged from the CNN model, which yielded an RMSE of 1.43, an AIC of 949.21 and an NSE and R2 of 0.99. The CatBoost model exhibited exceptional performance among the various input combinations, providing the most accurate results for these configurations. Non-parametric statistical Friedman and Nemenyi post-hoc tests were used for multi-model comparison, and it was concluded that there were significant performance differences between the models used according to the p-value values, both in general and based on pairs. While prior studies have explored hybrid AI models for AQI prediction, this study uniquely integrates CatBoost and SHAP for enhanced explainability and model performance evaluation. SHAP analysis provided transparent insights into variable contributions; however, PM10 emerged as the dominant predictor, achieving the highest mutual information score of 6.88. These findings underscore the importance of integrating pollutant and meteorological data. The proposed methodology aligns with global sustainability goals, including SDG 11 (Sustainable Cities and Communities) and SDG 13 (Climate Action). en_US
dc.identifier.doi 10.1007/s00704-025-05658-x
dc.identifier.issn 0177-798X
dc.identifier.issn 1434-4483
dc.identifier.scopus 2-s2.0-105010647993
dc.identifier.uri https://doi.org/10.1007/s00704-025-05658-x
dc.identifier.uri https://hdl.handle.net/20.500.14901/3028
dc.language.iso en en_US
dc.publisher Springer Wien en_US
dc.relation.ispartof Theoretical and Applied Climatology en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.title Predicting Air Quality Index in Başakşehir, Istanbul with Hybrid AI Models: Unveiling Key Drivers Through Catboost-Based SHAP and Feature Importance Analysis en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Akiner, Muhammed Ernur/0000-0002-5192-2473
gdc.author.id Çintaş, Emre/0000-0002-4954-5816
gdc.author.scopusid 35602629200
gdc.author.scopusid 57203751801
gdc.author.scopusid 57221615114
gdc.author.wosid Akiner, Muhammed Ernur/Jbj-5949-2023
gdc.author.wosid Çintaş, Emre/Aac-4713-2021
gdc.author.wosid Katipoğlu, Okan/Aaq-2658-2020
gdc.description.department Erzurum Technical University en_US
gdc.description.departmenttemp [Akiner, Muhammed Ernur] Akdeniz Univ, Vocat Sch Tech Sci, Dept Environm Protect Technol, Antalya, Turkiye; [Katipoglu, Okan Mert] Erzincan Binali Yildirim Univ, Fac Engn & Architecture, Dept Civil Engn, Erzincan, Turkiye; [Cintas, Emre] Ataturk Univ, Dept Comp Engn, Erzurum, Turkiye; [Cintas, Emre] Erzurum Tech Univ, Dept Comp Engn, Erzurum, Turkiye en_US
gdc.description.issue 8 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.volume 156 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q3
gdc.identifier.wos WOS:001541668300001
gdc.virtual.author Çintaş, Emre
relation.isAuthorOfPublication 6a414faa-66b1-46da-8af5-9bb907a757cf
relation.isAuthorOfPublication.latestForDiscovery 6a414faa-66b1-46da-8af5-9bb907a757cf

Files