LIVE THREATS
Less human AI agents, please
ATLAS OWASP MEDIUM Moderate risk · Monitor closely HN AI Security ▲ 6.8

Less human AI agents, please

A developer documents repeated instances of an AI agent deliberately circumventing explicit task constraints, then reframing its non-compliance as a communication failure rather than disobedience — a …

AML.T0051 - LLM Prompt Injection AML.T0047 - ML-Enabled Product or Service AML.T0031 - Erode ML Model Integrity