Aliasing black box adversarial attack with joint self-attention distribution and confidence probability

Jun Liu; Haoyu Jin; Guangxia Xu; Mingwei Lin; Tao Wu; Majid Nour; Fayadh Alenezi; Adi Alhudhaif; Kemal Polat

doi:10.1016/j.eswa.2022.119110

Back

Aliasing black box adversarial attack with joint self-attention distribution and confidence probability

Journal article

Peer reviewed

Aliasing black box adversarial attack with joint self-attention distribution and confidence probability

Jun Liu, Haoyu Jin, Guangxia Xu, Mingwei Lin, Tao Wu, Majid Nour, Fayadh Alenezi, Adi Alhudhaif and Kemal Polat

Expert systems with applications, Vol.214, p.119110

15/03/2023

DOI: https://doi.org/10.1016/j.eswa.2022.119110

Abstract

Adversarial attack

Self-attention distribution

Text classification

•A novel score-based attack is proposed to deceive the DNN models.•Using substitute and target model to evaluate the word importance.•Synonym is used for generating adversarial samples.•Adversarial training can help to enhance the robustness of DNN models. Deep neural networks (DNNs) are vulnerable to adversarial attacks, in which a small perturbation to samples can cause misclassification. However, how to select important words for textual attack models is a big challenge. Therefore, in this paper, an innovative score-based attack model is proposed to solve the important words selection problem for textual attack models. To this end, the generation of semantically adversarial examples in this model is adopted to mislead a text classification model. Then, this model integrates the self-attention mechanism and confidence probabilities for the selection of the important words. Moreover, an alternative model similar to the transfer attack is introduced to reflect the correlation degree of words inside the texts. Finally, adversarial training experimental results demonstrate the superiority of the proposed model.

Metrics

1 Record Views

Details

Title: Aliasing black box adversarial attack with joint self-attention distribution and confidence probability
Creators - without role: Jun Liu - Chongqing University of Posts and Telecommunications
Haoyu Jin - Chongqing University of Posts and Telecommunications
Guangxia Xu - Chongqing University of Posts and Telecommunications
Mingwei Lin - Fujian Normal University
Tao Wu - Chongqing University of Posts and Telecommunications
Majid Nour - King Abdulaziz University
Fayadh Alenezi - Department of Electrical Engineering, College of Engineering, Jouf University, Saudi Arabia
Adi Alhudhaif - Prince Sattam Bin Abdulaziz University
Kemal Polat - Bolu Abant İzzet Baysal University
Publication Details: Expert systems with applications, Vol.214, p.119110
Publisher: Elsevier Ltd
Identifiers: 9912962508331
Academic Unit: Al Jouf University; Prince Sattam Bin Abdulaziz University; King Abdulaziz University
Language: English
Resource Type: Journal article