Adversarial AI attack detection: a novel approach using explainable AI and deception mechanisms
Keywords:
Adversarial AI detection, adversarial training, deception mechanisms, explainable AI
Abstract
Detecting adversarial AI attacks has become a critical issue as AI systems become integral across industries, from healthcare to finance and transportation. Adversarial attacks exploit weaknesses in machine learning and deep learning models, and they have the potential to cause serious disruptions and severe threats to the integrity of AI operations. In this light, the discussion focuses on developing robust mechanisms for detecting adversarial inputs in real time, so that AI systems remain resilient against such sophisticated threats. Although existing adversarial AI defenses (input sanitization, anomaly detection, and adversarial training) provide important foundations, most approaches struggle to generalize across attack types or to operate in real time. This work introduces novelty by extending detection capabilities with explainable AI (XAI) and deception mechanisms. Adversarial activity is detected through adversarial training combined with honeypots and digital twins, while XAI keeps the detection process transparent. As honeypots and digital twins decoy attackers, observing attacker behavior further strengthens the detection methods. The results so far show substantial improvements in the detection of adversarial attacks in high-risk AI applications, demonstrate the efficacy of honeypots in capturing malicious behavior, and confirm the value of XAI for the interpretability and reliability of the detection process. These techniques enhance the robustness of AI systems against adversarial threats. The presented research contributes practical tools for cybersecurity professionals and AI practitioners defending against such attacks, offering new insights into AI for cybersecurity.
The novelty of the paper lies in the integration of adversarial training, XAI, and deception techniques, which offers a combined, interpretable, and effective method for detecting adversarial AI attacks across industry sectors.
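To make concrete the kind of input the proposed detector must recognize, the sketch below generates an adversarial example with the Fast Gradient Sign Method (FGSM), a standard attack used in adversarial training. The logistic-regression model, the parameter values, and the `fgsm_perturb` helper are illustrative assumptions for this sketch, not the paper's implementation.

```python
import numpy as np

def fgsm_perturb(x, w, b, y, eps):
    """FGSM perturbation against a logistic classifier p = sigmoid(w.x + b).

    For binary cross-entropy loss, the gradient with respect to the
    input x is (p - y) * w, so the adversarial example is
        x_adv = x + eps * sign((p - y) * w).
    (Hypothetical helper for illustration only.)
    """
    p = 1.0 / (1.0 + np.exp(-(np.dot(w, x) + b)))  # model's predicted probability
    grad = (p - y) * w                             # dLoss/dx for BCE loss
    return x + eps * np.sign(grad)

# Toy model and a point it correctly classifies as class 1
w = np.array([2.0, -1.0])
b = 0.0
x = np.array([1.0, 0.5])                 # score w.x + b = 1.5 > 0 -> class 1

x_adv = fgsm_perturb(x, w, b, y=1.0, eps=1.0)
# The small signed perturbation flips the score:
# w.x_adv + b = -1.5 < 0, so the model now predicts class 0.
```

A detector built on adversarial training would include such perturbed points (with their true labels) in the training set, so the model learns to classify or flag them rather than be fooled.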
License
Copyright (c) 2025 Maria NICULAE, Georgios KALPAKTSOGLOU, Anastasia TSIOTA, Giorgio BERNARDINETTI, Zacharenia LEKKA, Nikolaos Sachpelidis BROZOS, Panagiotis Radoglou GRAMMATIKIS, Ignacio LACALLE, Dionysios XENAKIS, Christos XENAKIS, Athanasia SABAZIOTI, Aristeidis FARAO, Mari-Anais SACHIAN, Vlad STANESCU, George SUCIU, Stylianos KARAGIANNIS

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.