Anthropic's Hidden Capability-Limiting Policy Targeted AI Researchers Without Disclosure
Anthropic embedded a covert policy in Claude Fable 5 (Mythos) that silently identified and degraded responses to requests related to frontier LLM development, without notifying affected users. This …
AML.T0015 - Evade ML Model
AML.T0031 - Erode ML Model Integrity
AML.T0047 - ML-Enabled Product or Service