Bilgilendirme: Kurulum ve veri kapsamındaki çalışmalar devam etmektedir. Göstereceğiniz anlayış için teşekkür ederiz.
 

Autoencoder-Based Anomaly Detection and Analysis in Log Data Generated in Cloud Systems Using Natural Language Processing

dc.contributor.author Ayvaz, I.
dc.contributor.author Karabey Aksakalli, I.K.
dc.contributor.author Baygin, M.
dc.date.accessioned 2026-03-26T15:03:12Z
dc.date.available 2026-03-26T15:03:12Z
dc.date.issued 2025
dc.description.abstract In this study, an Autoencoder-based model was developed to detect anomalies in log data obtained from cloud systems. The dataset used consists of log records from the Blue Gene/L (BGL) supercomputer. In the preprocessing phase, log messages were vectorized using the TF-IDF method, and structural features such as content length, word count, and the presence of component/type information were extracted to create an enriched feature matrix. The model attempted to reconstruct each log entry and calculated the reconstruction errors. Records were then classified as normal or anomalous based on a threshold corresponding to the 95th percentile of these errors. The developed model achieved a high performance with an accuracy rate of 99.61%, as well as strong results in precision, recall, and F1-score metrics. Additional evaluations using ROCAUC and Precision-Recall curves further confirmed the model's robustness. The results demonstrate that the Autoencoder architecture can effectively detect anomalies in large and complex log datasets. Within the scope of the study, the proposed model was also evaluated comparatively against recent approaches such as DeepLog, LogRobust, MLP, and LogEvent2Vec. The proposed model outperformed all other methods across all performance metrics. These findings highlight the Autoencoder-based method as a strong alternative in terms of both computational efficiency and anomaly detection capability. © 2025 IEEE. en_US
dc.identifier.doi 10.1109/ISAS66241.2025.11101866
dc.identifier.isbn 9798331514822
dc.identifier.scopus 2-s2.0-105014921774
dc.identifier.uri https://doi.org/10.1109/ISAS66241.2025.11101866
dc.identifier.uri https://hdl.handle.net/20.500.14901/3746
dc.language.iso tr en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof -- 9th International Symposium on Innovative Approaches in Smart Technologies, ISAS 2025 -- 2025-06-27 through 2025-06-28 -- Gaziantep -- 211342 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Anomaly Detection en_US
dc.subject Autoencoder en_US
dc.subject Cloud Logs en_US
dc.subject Natural Language Processing en_US
dc.subject Tf-Idf en_US
dc.subject Unstructured Data Analysis en_US
dc.title Autoencoder-Based Anomaly Detection and Analysis in Log Data Generated in Cloud Systems Using Natural Language Processing en_US
dc.title.alternative Bulut Sistemlerinde Uretilen Log Verilerinde Dogal Dil Isleme Kullanilarak Otomatik Kodlayici Tabanli Anomali Tespiti ve Analizi
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 58899617900
gdc.author.scopusid 56780440800
gdc.author.scopusid 55293658600
gdc.description.department Erzurum Technical University en_US
gdc.description.departmenttemp [Ayvaz] Ilayda, Bilgisayar Mühendisliǧi Bölümü, Erzurum Technical University, Erzurum, Erzurum, Turkey; [Karabey Aksakalli] Isil, Bilgisayar Mühendisliǧi Bölümü, Erzurum Technical University, Erzurum, Erzurum, Turkey; [Baygin] Mehmet, Bilgisayar Mühendisliǧi Bölümü, Erzurum Technical University, Erzurum, Erzurum, Turkey en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
gdc.virtual.author Bayğın, Mehmet
gdc.virtual.author Karabey Aksakallı, İşıl
relation.isAuthorOfPublication 131a2dd2-0bc0-4048-a02f-13336fbc84f6
relation.isAuthorOfPublication f5e94616-9c08-4c88-bbf7-a49e759664a1
relation.isAuthorOfPublication.latestForDiscovery 131a2dd2-0bc0-4048-a02f-13336fbc84f6

Files