Smart City Gnosys

Smart city article details

Title The Effect Of Generating Synthetic Data In Smart City Network Systems
ID_Doc 55376
Authors Čech P.; Ponce D.; Mikulecký P.; Žváčková A.; Mls K.; Otčenášková T.; Tučník P.
Year 2025
Published SN Computer Science, 6, 2
DOI http://dx.doi.org/10.1007/s42979-025-03673-3
Abstract This study examines the effect of synthetic data generation for balancing class distributions on the performance of classification algorithms in smart city network systems. Contrary to the assumption that data balancing improves classification performance, the analysis reveals a more complex impact. Using three publicly available network traffic benchmark datasets and four different balancing techniques, the study evaluates the performance of five classifiers on 65 classification tasks. The findings indicate that, for smaller datasets, classifiers that achieved the highest accuracy on unbalanced data did not benefit from synthetic data generation for minority classes. Although neural network-based classifiers showed improved performance with balanced data, these improvements came at the cost of lower overall classification scores. For larger datasets, balancing through random oversampling of minority classes and undersampling of majority classes helped improve classification. However, these improvements were limited to precision, with no significant gains in recall. The study offers valuable insights into using synthetic data for intrusion detection, emphasizing the challenges of intricate dependencies in network traffic data for generative models. The results align with previous research showing mixed effects of data balancing on classifier performance, contributing to a broader understanding of the limited efficacy of synthetic data in real-world network contexts. This experimental study highlights the need for a systematic benchmarking framework for synthetic data research, ensuring consistency in data balancing and classification processes. This work contributes to the ongoing discourse on the intersection of machine learning and cybersecurity, emphasizing the critical role of data in developing resilient intrusion detection systems. © The Author(s) 2025.
Author Keywords Attack classification; Generative adversarial networks; Imbalanced datasets; Intrusion detection


Similar Articles


Id Similarity Authors Title Published
27811 View0.909Čech P.; Ponce D.; Mikulecký P.; Mls K.; Žváčková A.; Tučník P.; Otčenášková T.Generating Synthetic Data To Improve Intrusion Detection In Smart City Network SystemsLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 14482 LNCS (2024)
55661 View0.878Baptist Andrews L.J.; Midhun Chakkaravarthy D.; Raj R.A.; Selvam J.; Sarathkumar D.; Akbar S.S.The Identification Of Conflicting Determinations Of Anomalies In Computer Network Behaviour: Cyber Security In Smart CitiesInternational Conference on Recent Advances in Science and Engineering Technology, ICRASET 2023 (2023)
27812 View0.87Alabdulwahab S.; Kim Y.-T.; Seo A.; Son Y.Generating Synthetic Dataset For Ml-Based Ids Using Ctgan And Feature Selection To Protect Smart Iot EnvironmentsApplied Sciences (Switzerland), 13, 19 (2023)
14811 View0.857Palli A.S.; Jaafar J.; Hashmani M.A.; Gomes H.M.; Alsughayyir A.; Gilal A.R.Combined Effect Of Concept Drift And Class Imbalance On Model Performance During Stream ClassificationComputers, Materials and Continua, 75, 1 (2023)
17029 View0.853Protic D.; Gaur L.; Stankovic M.; Rahman M.A.Cybersecurity In Smart Cities: Detection Of Opposing Decisions On Anomalies In The Computer Network BehaviorElectronics (Switzerland), 11, 22 (2022)
13293 View0.852Khan J.; Elfakharany R.; Saleem H.; Pathan M.; Shahzad E.; Dhou S.; Aloul F.Can Machine Learning Enhance Intrusion Detection To Safeguard Smart City Networks From Multi-Step Cyberattacks?Smart Cities, 8, 1 (2025)
814 View0.852Basheer L.; Ranjana P.A Comparative Study Of Various Intrusion Detections In Smart Cities Using Machine Learning2022 International Conference on IoT and Blockchain Technology, ICIBT 2022 (2022)
23834 View0.851Al-Atawi A.A.Enhancing Internet Of Smart City Security: Utilizing Logistic Boosted Algorithms For Anomaly Detection And Cyberattack PreventionSN Computer Science, 5, 5 (2024)