Anthropic's Claude Fable 5 Ships Tiered Cyber Safeguards to Limit Offensive AI Uplift
Anthropic has released Claude Fable 5 with a classifier-based safety layer that routes flagged offensive cyber, bio, and model-distillation requests to a weaker fallback model, while reserving full …
AML.T0054 - LLM Jailbreak
AML.T0051 - LLM Prompt Injection
AML.T0015 - Evade ML Model