Role Confusion Attack Lets Injected Text Override LLM Safety Controls
New research from Ye, Cui, and Hadfield-Menell demonstrates that LLMs prioritise the stylistic format of text over its structural role tags, enabling attackers to craft injected content that mimics …
AML.T0051 - LLM Prompt Injection
AML.T0054 - LLM Jailbreak
AML.T0043 - Craft Adversarial Data