Skip to main content
Articles

Ensemble Based Intrusion Detection in Heterogeneous Networks: A Machine Learning Framework with Zero Trust Integration

Abstract

Contemporary enterprise networks face an escalating proliferation of sophisticated cyber threats that evade traditional signature-based detection systems. This paper proposes the Adaptive Ensemble Threat Detection (AETD) framework   a novel machine learning architecture integrating Random Forest, Bidirectional Long Short-Term Memory (BiLSTM), Autoencoder based anomaly detection, and XGBoost within a dynamically weighted ensemble voting mechanism. AETD addresses three critical limitations of existing intrusion detection systems: (1) inadequate generalization to zero day and novel attack variants; (2) class imbalance in training data leading to high false positive rates; and (3) the absence of real time adaptive weight updating in response to evolving threat patterns. The framework is further contextualized within a Zero Trust Architecture (ZTA) paradigm, enabling identity aware, continuous verification-based access control that complements network layer detection. Extensive experimental evaluation on the CICIDS2018 benchmark dataset (16.2 million network flow records, 14 attack classes) demonstrates that AETD achieves 99.43% classification accuracy and 0.57% false positive rate, outperforming all individual constituent models and classical baselines by statistically significant margins (p < 0.001). Ablation studies confirm the contribution of each ensemble component. The proposed framework is positioned within a comprehensive Security Operations Centre (SOC) integration model encompassing SIEM correlation, SOAR automated response, and MITRE ATT&CK aligned detection coverage. Results demonstrate a 38% improvement in mean time to detection (MTTD) compared to rule only SIEM deployments.

References

[1] ENISA. (2022). ENISA Threat Landscape 2022. European Union Agency for Cybersecurity. https://www.enisa.europa.eu/publications/enisa threat landscape 2022
[2] Sommer, R., & Paxson, V. (2010). Outside the closed world: On using machine learning for network intrusion detection. IEEE Symposium on Security and Privacy, 305–316. https://doi.org/10.1109/SP.2010.25
[3] Lippmann, R., et al. (2000). Evaluating intrusion detection systems: The 1998 DARPA offline intrusion detection evaluation. DARPA Information Survivability Conference and Exposition. https://doi.org/10.1109/DISCEX.2000.821506
[4] Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. https://doi.org/10.1613/jair.953
[5] Rose, S., Borchert, O., Mitchell, S., & Connelly, S. (2020). Zero Trust Architecture. NIST Special Publication 800 207. National Institute of Standards and Technology. https://doi.org/10.6028/NIST.SP.800 207
[6] Buczak, A. L., & Guven, E. (2016). A survey of data mining and machine learning methods for cyber security intrusion detection. IEEE Communications Surveys & Tutorials, 18(2), 1153–1176. https://doi.org/10.1109/COMST.2015.2494502
[7] Khraisat, A., Gondal, I., Vamplew, P., & Kamruzzaman, J. (2019). Survey of intrusion detection systems: techniques, datasets and challenges. Cybersecurity, 2(1), 1–22. https://doi.org/10.1186/s42400 019 0038 7
[8] Kim, J., Kim, J., Thu, H. L. T., & Kim, H. (2016). Long short term memory recurrent neural network classifier for intrusion detection. Proc. 2016 International Conference on Platform Technology and Service, 1–5. https://doi.org/10.1109/PlatCon.2016.7456805
[9] Javaid, A., Niyaz, Q., Sun, W., & Alam, M. (2016). A deep learning approach for network intrusion detection system. Proc. 9th EAI International Conference on Bio inspired Information and Communications Technologies, 21–26.
[10] Wang, W., et al. (2020). HAST IDS: Learning hierarchical spatial temporal features using deep neural networks to improve intrusion detection. IEEE Access, 6, 1792–1806. https://doi.org/10.1109/ACCESS.2017.2780250
[11] Mirsky, Y., Doitshman, T., Elovici, Y., & Shabtai, A. (2018). Kitsune: An ensemble of autoencoders for online network intrusion detection. Proc. NDSS 2018. https://doi.org/10.14722/ndss.2018.23204
[12] An, J., & Cho, S. (2015). Variational autoencoder based anomaly detection using reconstruction probability. Special Lecture on IE, Seoul National University, 2(1), 1–18.
[13] Sagi, O., & Rokach, L. (2018). Ensemble learning: A survey. WIREs Data Mining and Knowledge Discovery, 8(4), e1249. https://doi.org/10.1002/widm.1249
[14] Chawla, N. V. (2009). Data mining for imbalanced datasets: An overview. In O. Maimon & L. Rokach (Eds.), Data Mining and Knowledge Discovery Handbook, 875–886. Springer.
[15] Bifet, A., Holmes, G., Kirkby, R., & Pfahringer, B. (2010). MOA: Massive online analysis. Journal of Machine Learning Research, 11, 1601–1604.
[16] Syed, N. F., Shah, S. W., Shaghaghi, A., Anwar, A., Baig, Z., & Doss, R. (2022). Zero trust architecture (ZTA): A comprehensive survey. IEEE Access, 10, 57143–57179. https://doi.org/10.1109/ACCESS.2022.3174679
[17] Chen, B., Qiao, S., Zhao, J., Liu, D., Shi, X., & Lyu, M. (2019). A security awareness and protection system for 5G smart healthcare based on zero trust architecture. IEEE Internet of Things Journal, 8(13), 10248–10263.
[18] Insider Threat Task Force. (2020). Common Sense Guide to Mitigating Insider Threats, 6th Edition. Carnegie Mellon University SEI. https://resources.sei.cmu.edu
[19] Sharafaldin, I., Lashkari, A. H., & Ghorbani, A. A. (2018). Toward generating a new intrusion detection dataset and intrusion traffic characterization. Proc. ICISSP 2018, 108–116. https://doi.org/10.5220/0006639801080116
[20] Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014). A survey on concept drift adaptation. ACM Computing Surveys, 46(4), 1–37. https://doi.org/10.1145/2523813
[21] Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
[22] Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30. https://dl.acm.org/doi/10.5555/3295222.3295230
[23] Fernández, A., García, S., Galar, M., Prati, R. C., Krawczyk, B., & Herrera, F. (2018). Learning from Imbalanced Data Sets. Springer. https://doi.org/10.1007/978 3 319 98074 4
[24] Canadian Institute for Cybersecurity. (2018). CICIDS2018 Dataset. University of New Brunswick. https://www.unb.ca/cic/datasets/ids 2018.html
[25] Mirkovic, J., & Reiher, P. (2004). A taxonomy of DDoS attack and DDoS defense mechanisms. ACM SIGCOMM Computer Communication Review, 34(2), 39–53.
[26] Zhang, H., et al. (2021). Network intrusion detection based on targeted online sequential extreme learning machine. PLOS ONE, 16(2), e0246741. https://doi.org/10.1371/journal.pone.0246741
[27] Li, Z., Rios, A. L. G., Xu, G., & Trajkovic, L. (2022). Network intrusion detection using graph neural network. Proc. 2022 IEEE International Symposium on Circuits and Systems. https://doi.org/10.1109/ISCAS48785.2022.9937559
[28] Huang, S., et al. (2019). A federated learning approach to network intrusion detection. IEEE Transactions on Network and Service Management, 20(1), 402–415.
[29] Veeramachaneni, K., Arnaldo, I., Korrapati, V., Bassias, C., & Li, K. (2016). AI2: Training a big data machine to defend. Proc. 2016 IEEE 2nd International Conference on Big Data Security, 49–54.
[30] NIST. (2022). Post Quantum Cryptography: FIPS 203 (Kyber), FIPS 204 (Dilithium), FIPS 205 (SPHINCS+). National Institute of Standards and Technology. https://csrc.nist.gov/projects/post quantum cryptography
[31] IBM Security. (2019). Cost of a Data Breach Report 2019. IBM Corporation. https://www.ibm.com/security/data breach
[32] MITRE Corporation. (2022). MITRE ATT&CK Framework v15. https://attack.mitre.org
[33] Biggio, B., & Roli, F. (2018). Wild patterns: Ten years after the rise of adversarial machine learning. Pattern Recognition, 84, 317–331. https://doi.org/10.1016/j.patcog.2018.07.023