Evaluating Adversarial Attacks Against Artificial Intelligence Systems in Application Deployments
ABSTRACT Businesses have invested billions into artificial intelligence (AI) applications, leading to a sharp rise in the number of AI applications being released to customers. Taking into account previous approaches to attacking machine learning models, we conduct a comparative analysis of adversar...
Saved in:
| Main Author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Wiley
2025-04-01
|
| Series: | Applied AI Letters |
| Subjects: | |
| Online Access: | https://doi.org/10.1002/ail2.121 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | ABSTRACT Businesses have invested billions into artificial intelligence (AI) applications, leading to a sharp rise in the number of AI applications being released to customers. Taking into account previous approaches to attacking machine learning models, we conduct a comparative analysis of adversarial attacks, contrasting large language models (LLMs) being deployed through application programming interfaces (APIs) with the same attacks against locally deployed models to evaluate the significance of security controls in production deployments on attack success in black‐box environments. The article puts forward adversarial attacks that are adapted for remote model endpoints in order to create a threat model that can be used by security organizations to prioritize controls when deploying AI systems through APIs. This paper contributes: (1) a public repository of adversarial attacks adapted to handle remote models on https://github.com/l3ra/adversarial‐ai, (2) benchmarking results of remote attacks comparing the effectiveness of attacks on remote models with those on local models, and (3) a framework for assessing future AI system deployment controls. By providing a practical framework for benchmarking the security of remote AI systems, this study contributes to the understanding of adversarial attacks in the context of natural language processing models deployed by production applications. |
|---|---|
| ISSN: | 2689-5595 |