Strategic Data Augmentation for IoT Intrusion Detection Using Ensemble Machine Learning

Authors

  • Rafid Hamid Islamic University of Lebanon

DOI:

https://doi.org/10.54153/sjpas.2026.v8i2.1500

Keywords:

Network Intrusion Detection System (NIDS), Strategic Data Augmentation, XGBoost, Random Forest, UNSW-NB15, Zero Trust Security, Class Imbalance, IoT Security

Abstract

The Internet of Things (IoT) and modern web-based infrastructure are increasing at a very fast rate, resulting in a rise in complex and compact cyber-attack patterns that tend to evade traditional signature-based protection methods. A key trade-off in the detection capability and rate of false alarms arises from the prevailing methods' common difficulty in addressing the “Class Imbalance” problem that exists in realistic network traffics, although Machine Learning has shown promise as a potential paradigm for Network Intrusion Detection Systems (NIDS). The “Zero False Negative” challenge in high-security applications remains unaddressed in prevailing state-of-the-art methods that tend to plateau at very high levels of accuracy (around 99.6 percent). This paper proposes a novel High-Redundancy Network Intrusion Detection Framework that relies on a Strategic Data Augmentation approach that can address the “Zero False Negative” challenge.

The proposed framework tests three distinct classifiers, namely Random Forest, Extreme Gradient Boosting (XGBoost), and a constrained Decision Tree (for baseline complexity testing). Based on the experimental results, the Random Forest and XGBoost models were able to fill the performance gap created in the previous study by achieving 100.00% Accuracy, 1.00 Precision, and 0.00% False Alarm Rate (FAR) due to the proposed augmentation method. However, the baseline Decision Tree could only achieve 89.09% accuracy, proving that the dataset still retains a large amount of structural complexity that demands robust ensemble learning. Although both ensemble methods made perfect detections, XGBoost is more efficient in computation and converged 3.3 times faster (21.48 seconds) than Random Forest in an efficiency comparison study. Hence, in real-time Zero Trust Network Intrusion Detection in high velocity networks, the proposed study recommends the Augmented XGBoost model to be the most optimal choice.

References

1. Vitorino, J., Andrade, R., Praca, I., Sousa, O., & Maia, E. (2021). ‘A comparative analysis of

machine learning techniques for IoT intrusion detection’, Foundations and Practice of

Security, pp. 191–207. DOI: 10.1007/978-3-031-08147-7_13.

2. Samantaray, M., Barik, R. C., & Biswal, A. K. (2024). ‘A comparative assessment of machine

learning algorithms in the IoT-based network intrusion detection systems’, Decision

Analytics Journal, 11, 100478. DOI: 10.1016/j.dajour.2024.100478.

3. More, S., Idrissi, M., Mahmoud, H., & Asyhari, A. T. (2024). ‘Enhanced intrusion detection

systems performance with UNSW-NB15 data analysis’, Algorithms, 17(2), 64. DOI:

10.3390/a17020064.

4. Talukder, M. A., Islam, M. M., Uddin, M. A., Hasan, K. F., Sharmin, S., Alyami, S. A., & Moni, M.

A. (2024). ‘Machine learning-based network intrusion detection for big and imbalanced

data using oversampling, stacking feature embedding and feature extraction’, Journal of Big

Data, 11(1), 33. DOI: 10.1186/s40537-024-00886-w.

5. Yulianton, H., Sutanto, F. A., & Santi, R. C. N. (2025). ‘Optimized network intrusion detection

using XGBoost with hyperparameter tuning: An empirical study on UNSW-NB15 dataset’,

Journal of Software Engineering and Simulation, 11(8), 1–7. DOI: 10.35629/3795-

11080107

6. Binbusayyis, A., Alaskar, H., Vaiyapuri, T., & Dinesh, M. J. (2022). ‘An investigation and

comparison of machine learning approaches for intrusion detection in IoMT network’, The

Journal of Supercomputing, 78, 17403–17422. DOI: 10.1007/s11227-022-04568-3.

7. Laqtib, S., El Yassini, K., & Hasnaoui, M. L. (2020). ‘A technical review and comparative

analysis of machine learning techniques for intrusion detection systems in MANET’,

International Journal of Electrical and Computer Engineering, 10(3), 2701–2709. DOI:

10.11591/ijece.v10i3.pp2701-2709.

8. Udurume, M., Shakhov, V., & Koo, I. (2024). ‘Comparative analysis of deep convolutional

neural network-bidirectional long short-term memory and machine learning methods in

intrusion detection systems’, Applied Sciences, 14(16), 6967. DOI: 10.3390/app14166967.

9. Note, J., & Ali, M. (2022). ‘Comparative analysis of intrusion detection system using machine

learning and deep learning algorithms’, Annals of Emerging Technologies in Computing,

6(3), 19–36.

10. Mahadevappa, P., Muzammal, S. M., & Murugesan, R. K. (2021). ‘A comparative analysis of

machine learning algorithms for intrusion detection in edge-enabled IoT networks’, arXiv

preprint arXiv:2111.01383.

11. Inuwa, M. M., & Das, R. (2024). ‘A comparative analysis of various machine learning

methods for anomaly detection in cyber attacks on IoT networks’, Internet of Things, 26,

101162. DOI: 10.1016/j.iot.2024.101162

12. Benamor, Z., Seghir, Z. A., Djezzar, M., & Hemam, M. (2023). ‘A comparative study of

machine learning algorithms for intrusion detection in IoT networks’, Revue d Intelligence

Artificielle, 37(3), 567–576.

13. Saran, N., & Kesswani, N. (2023). ‘A comparative study of supervised machine learning

classifiers for intrusion detection in Internet of Things’, Procedia Computer Science, 218,

2049–2057.

14. Magan-Carrion, R., Urda, D., Diaz-Cano, I., & Dorronsoro, B. (2020). ‘Towards a reliable

comparison and evaluation of network intrusion detection systems based on machine

learning approaches’, Applied Sciences, 10(5), 1775. DOI: 10.3390/app10051775.357

15. Azam, Z., Islam, M. M., & Huda, M. N. (2023). ‘Comparative analysis of intrusion detection

systems and machine learning-based model analysis through decision tree’, IEEE Access,

11, 80348–80391.

16. Airlangga, G. (2024). ‘Comparative analysis of machine learning models for intrusion

detection in Internet of Things networks using the RT-IoT2022 dataset’, MALCOM:

Indonesian Journal of Machine Learning and Computer Science, 4(2), 656–662.

Published

2026-06-30

How to Cite

Strategic Data Augmentation for IoT Intrusion Detection Using Ensemble Machine Learning. (2026). Samarra Journal of Pure and Applied Science, 8(2), 335-348. https://doi.org/10.54153/sjpas.2026.v8i2.1500

Similar Articles

11-20 of 142

You may also start an advanced similarity search for this article.