LIVE THREATS
MEDIUM Microsoft Scout Autonomous Agent Expands Attack Surface Across Microsoft 365 // HIGH High-Autonomy AI Agents With Broad Permissions Pose Enterprise Security Crisis // HIGH Indirect Prompt Injection via Notifications Hijacks Google Gemini on Android // HIGH Only 11 of 100 AI Agents Pass Security and Capability Benchmarks // HIGH Prompt Injection Flaw in Gemini Voice Assistant Enables Notification-Based Attacks // HIGH 2,000 AI-Built Apps Expose Corporate Data via Misconfigured Vibe-Coding Platforms // MEDIUM Anthropic Documents Sandbox Escape Risks and Credential Exfiltration Vectors in Claude … // HIGH ChatGPhish Exploit Turns ChatGPT Summarisation Into a Live Phishing Surface // HIGH LLMShare Campaign Weaponises ChatGPT Sharing Feature to Distribute Malware // MEDIUM Process-Level CAPTCHA Analysis Exposes Behavioural Fingerprints of AI Agents //
ATLAS OWASP HIGH Significant risk · Prioritise patching RELEVANCE ▲ 8.2

Bad Memories Still Haunt AI Agents

TL;DR HIGH
  • What happened: Cisco found and disclosed a memory-handling vulnerability in Anthropic's AI systems, now patched.
  • Who's at risk: Enterprises and developers deploying Anthropic-powered agentic AI systems with persistent memory are most exposed, as malicious actors could manipulate stored context to hijack agent behaviour.
  • Act now: Audit all AI agent memory files and persistent context stores for unexpected or injected content · Apply Anthropic's latest patches immediately and monitor vendor advisories for further memory-related fixes · Implement strict input/output validation and sandboxing around AI agent memory read/write operations
Bad Memories Still Haunt AI Agents

Overview

Cisco researchers identified and responsibly disclosed a significant vulnerability in how Anthropic manages memory within its AI systems. Anthropic has since issued a fix, but the disclosure has reignited industry-wide concern about the structural risks posed by persistent memory in agentic AI architectures. As AI agents increasingly rely on long-term memory to maintain context across sessions, the attack surface for memory manipulation grows correspondingly — and a single vendor patch does not eliminate the underlying class of threat.

Technical Analysis

The vulnerability centres on how AI agents read, store, and act upon memory files — structured or semi-structured data that persists between user sessions and informs future model behaviour. When memory handling is insecure, several attack vectors become viable:

  • Memory Poisoning via Prompt Injection: An adversary can craft malicious input that, when processed and stored as a memory entry, causes the agent to behave in unintended ways in subsequent sessions. This is a persistent form of prompt injection — the payload survives beyond a single conversation.
  • Cross-Session Data Leakage: Poorly sanitised memory files may inadvertently retain sensitive user data, which could be extracted by a subsequent attacker-controlled prompt or through direct access to the memory store.
  • Instruction Override: Memory entries could be crafted to override system-level instructions, effectively hijacking agent goals or personas without requiring direct access to the system prompt.

The specific technical details of Cisco’s finding have not been fully disclosed at the time of publication, consistent with responsible disclosure norms. However, the general pattern is consistent with known agentic AI attack research.

Framework Mapping

FrameworkMappingRationale
MITRE ATLASAML.T0051 – LLM Prompt InjectionMalicious content injected via memory files to influence future agent actions
MITRE ATLASAML.T0057 – LLM Data LeakageSensitive data potentially retained and exposed through memory stores
MITRE ATLASAML.T0043 – Craft Adversarial DataMemory entries crafted to manipulate downstream model behaviour
OWASPLLM01 – Prompt InjectionPersistent injection through memory is a variant of this primary LLM risk
OWASPLLM08 – Excessive AgencyAgents acting on poisoned memory with insufficient human oversight
OWASPLLM06 – Sensitive Information DisclosureMemory stores retaining PII or confidential context across sessions

Impact Assessment

The immediate impact is limited by Anthropic’s patch, but the broader implications are significant. Any organisation deploying Claude-based agents with memory features enabled should treat pre-patch session memory as potentially compromised. More broadly, this disclosure validates researcher warnings that agentic AI systems — particularly those with autonomous tool use and persistent state — represent a qualitatively different and more severe threat surface than stateless LLM deployments. The risk is not confined to Anthropic; similar memory architectures exist across competing platforms.

Mitigation & Recommendations

  1. Patch immediately: Apply all available Anthropic security updates and verify memory-related components are at current versions.
  2. Audit existing memory stores: Review stored memory files for anomalous or injected content before resuming production agent operations.
  3. Enforce memory hygiene: Treat memory input/output as untrusted data — validate, sanitise, and scope memory read/write permissions to the minimum necessary.
  4. Enable human-in-the-loop controls: For high-stakes agent tasks, require human approval before agents act on recalled memory in sensitive contexts.
  5. Monitor cross-session anomalies: Implement behavioural monitoring to detect unexpected shifts in agent output that may indicate memory tampering.

References

◉ AI THREAT BRIEFING

Stay ahead of the threat.

Twice-weekly digest of critical AI security developments — every story mapped to MITRE ATLAS and OWASP LLM Top 10. Free.

No spam. Unsubscribe anytime.