<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>GRID THE GREY — AI Threat Intelligence | GRID THE GREY</title><link>https://gridthegrey.com/</link><description>Real-time AI security intelligence — adversarial ML, LLM vulnerabilities, and supply chain threats mapped to MITRE ATLAS and OWASP LLM Top 10.</description><generator>Hugo</generator><language>en-us</language><copyright/><lastBuildDate>Thu, 18 Jun 2026 09:58:58 +0530</lastBuildDate><atom:link href="https://gridthegrey.com/index.xml" rel="self" type="application/rss+xml"/><item><title>First Look: Anthropic Mythos 5 Export Block Exposes AI Supply Chain Dependency Risk</title><link>https://gridthegrey.com/posts/first-look-anthropic-mythos-5-export-block-exposes-ai-supply-chain-dependency/</link><pubDate>Thu, 18 Jun 2026 04:28:40 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-anthropic-mythos-5-export-block-exposes-ai-supply-chain-dependency/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Supply Chain</category><category>Regulatory</category><category>Industry News</category><category>LLM Security</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0015 - Evade ML Model</category><category>AML.T0031 - Erode ML Model Integrity</category><description>The Trump administration's overnight export block of Anthropic's Mythos 5 and Fable 5 models — triggered by reported safety guardrail bypass vulnerabilities flagged by Amazon — has exposed the fragility of international AI supply chains built on U.S.-controlled infrastructure. For defenders, this event crystallises a critical dependency risk: organisations and governments that have embedded American AI models into critical systems now face the possibility of abrupt, unexplained access revocation with no remediation path. Security teams must now treat AI vendor access continuity as a threat vector equivalent to a third-party SaaS outage, and accelerate contingency planning around model substitution and sovereign alternatives.</description></item><item><title>First Look: AWS Launches Amazon Quick Autonomous Agents with Continuous Background Execution</title><link>https://gridthegrey.com/posts/first-look-aws-launches-amazon-quick-autonomous-agents-with-continuous-execution/</link><pubDate>Thu, 18 Jun 2026 04:25:14 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-aws-launches-amazon-quick-autonomous-agents-with-continuous-execution/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>Prompt Injection</category><category>LLM Security</category><category>Supply Chain</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0031 - Erode ML Model Integrity</category><description>AWS has shipped autonomous agents in Amazon Quick, an AI assistant that continuously executes tasks — including CRM updates, email drafting, and compliance monitoring — on behalf of users while connected to dozens of enterprise data sources and applications. This dramatically expands the attack surface for business-context compromise: a single successful prompt injection or account takeover can now translate into persistent, automated actions across an organisation's entire connected app ecosystem. Defenders must treat these agents as privileged service accounts with broad, continuous write-access, requiring dedicated monitoring, least-privilege scoping, and explicit human-in-the-loop gates for sensitive actions.</description></item><item><title>First Look: Midjourney Medical Launches AI-Powered Full-Body Ultrasound Scanner Hardware</title><link>https://gridthegrey.com/posts/first-look-midjourney-medical-launches-ai-powered-full-body-ultrasound-scanner/</link><pubDate>Thu, 18 Jun 2026 04:22:14 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-midjourney-medical-launches-ai-powered-full-body-ultrasound-scanner/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>Adversarial ML</category><category>Supply Chain</category><category>Regulatory</category><category>Industry News</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0018 - Backdoor ML Model</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0031 - Erode ML Model Integrity</category><description>Midjourney Medical has announced a full-body ultrasound scanner that uses a ring of sensors and AI processing to generate MRI-comparable internal body imagery, representing a significant pivot from image generation into AI-assisted medical diagnostics hardware. The convergence of AI inference pipelines with sensitive biometric and anatomical data creates new attack surfaces around health data exfiltration, model output manipulation, and diagnostic integrity. Defenders in healthcare and enterprise wellness programmes should treat this class of device as a high-sensitivity AI-enabled medical endpoint requiring strict data governance and supply chain vetting.</description></item><item><title>First Look: Odyssey Launches Physical World Model Platform Backed by Amazon at $1.45B Valuation</title><link>https://gridthegrey.com/posts/first-look-odyssey-launches-physical-world-model-platform-backed-by-amazon-at-1/</link><pubDate>Thu, 18 Jun 2026 04:21:04 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-odyssey-launches-physical-world-model-platform-backed-by-amazon-at-1/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>Supply Chain</category><category>Adversarial ML</category><category>Data Poisoning</category><category>Industry News</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0019 - Publish Poisoned Datasets</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><description>Odyssey has raised a $310M Series B to scale its world model platform, which ingests real-world physical environment data to generate interactive simulations, video, and training environments for robotics and gaming. The platform's reliance on large-scale physical data collection, multi-tenant simulation outputs, and deep AWS infrastructure integration introduces supply chain, data poisoning, and adversarial simulation risks defenders should assess. Organizations consuming Odyssey-generated synthetic environments for robotics training or game content pipelines are newly exposed to integrity attacks targeting the underlying world model.</description></item><item><title>First Look: OpenAI Tests ChatGPT for Science Subscription with Verified Institutional Access</title><link>https://gridthegrey.com/posts/first-look-openai-tests-chatgpt-for-science-subscription-with-verified-access/</link><pubDate>Thu, 18 Jun 2026 04:16:02 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-openai-tests-chatgpt-for-science-subscription-with-verified-access/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>LLM Security</category><category>Industry News</category><category>Regulatory</category><category>Research</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0019 - Publish Poisoned Datasets</category><description>OpenAI is internally testing a specialised 'ChatGPT for Science' subscription tier, likely restricted to verified universities and research institutions, building on capabilities from GPT-Rosalind — a purpose-built life sciences model already deployed under a trusted-access structure with select pharma partners. The gated, domain-specific nature of this offering creates novel identity and access verification attack surfaces, as threat actors will likely probe credential and institutional verification mechanisms to gain privileged access to specialised scientific knowledge. Defenders at academic and research institutions should anticipate increased phishing campaigns targeting institutional credentials and prepare governance frameworks for AI use in sensitive research environments.</description></item><item><title>First Look: Z.ai Releases GLM-5.2 Open-Weights 753B LLM Under MIT License</title><link>https://gridthegrey.com/posts/first-look-z-ai-releases-glm-5-2-open-weights-753b-llm-under-mit-license/</link><pubDate>Thu, 18 Jun 2026 04:14:35 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-z-ai-releases-glm-5-2-open-weights-753b-llm-under-mit-license/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>LLM Security</category><category>Supply Chain</category><category>Jailbreaks</category><category>Industry News</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0018 - Backdoor ML Model</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><description>Z.ai has released GLM-5.2, a 753-billion-parameter mixture-of-experts model under an MIT license, ranking as the top open-weights model on the Artificial Analysis Intelligence Index and second on the Code Arena WebDev leaderboard. For defenders, the combination of frontier-level capability, unrestricted open-weights distribution, and a 1-million-token context window materially lowers the barrier for threat actors to self-host a highly capable model outside any provider's safety controls. The model's agentic coding performance and massive context window expand the viable attack surface for automated code generation, targeted phishing, and large-scale document analysis without API-level monitoring.</description></item><item><title>First Look: AI Agent Identity Continuity Expands Persistent Credential Abuse Surface</title><link>https://gridthegrey.com/posts/first-look-ai-agent-identity-continuity-expands-persistent-credential-abuse/</link><pubDate>Wed, 17 Jun 2026 04:25:03 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-ai-agent-identity-continuity-expands-persistent-credential-abuse/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Industry News</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><description>CrowdStrike's Continuous Identity for AI Agents introduces persistent, trackable identity primitives for agentic workflows — but persistent identities are also persistent targets. Attackers who compromise an agent identity gain a durable, trusted foothold that can persist across sessions and tool invocations without the natural expiry of human session tokens. The feature's integration into the Falcon platform means agent identity tokens, if stolen or forged, may carry elevated detection-suppression trust within the same security toolchain defending the environment.</description></item><item><title>First Look: Dual-Use AI Exploit Models Create Unavoidable Offensive Capability Proliferation Surface</title><link>https://gridthegrey.com/posts/first-look-dual-use-ai-exploit-models-create-unavoidable-offensive-capability/</link><pubDate>Wed, 17 Jun 2026 04:24:13 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-dual-use-ai-exploit-models-create-unavoidable-offensive-capability/</guid><category>Threat Level: CRITICAL</category><category>First Look</category><category>LLM Security</category><category>Jailbreaks</category><category>Regulatory</category><category>Industry News</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0015 - Evade ML Model</category><description>Anthropic's Mythos 5 and Claude Fable 5 represent the arrival of frontier AI models with demonstrated, advanced vulnerability discovery and exploit-development capabilities — a capability class that will rapidly proliferate across multiple vendors and open-weight releases. The core attack surface is not model-specific: guardrail bypass of the consumer-facing Fable 5 exposes full Mythos-grade offensive capability to any actor who can defeat the content filters, while the broader proliferation trajectory means defenders must assume adversary access to equivalent capabilities within months. The regulatory response addresses a single vendor while doing nothing to raise the floor for the broader ecosystem of competitive and open-weight models following close behind.</description></item><item><title>First Look: Gemini Omni Deep OS Integration Expands Ambient AI Attack Surface on Android 17</title><link>https://gridthegrey.com/posts/first-look-gemini-omni-deep-os-integration-expands-ambient-ai-attack-surface-on/</link><pubDate>Wed, 17 Jun 2026 04:23:19 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-gemini-omni-deep-os-integration-expands-ambient-ai-attack-surface-on/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>LLM Security</category><category>Prompt Injection</category><category>Agentic AI</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0015 - Evade ML Model</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><description>Android 17 embeds Gemini Omni and multiple AI models (Lyria 3, AudioLM) directly into OS-level functions including video editing, call handling, screen recording, and emergency detection, dramatically expanding the attack surface for AI-assisted exploitation on mobile endpoints. The deep integration of conversational AI with device sensors, media pipelines, and inter-app communication creates novel prompt injection and data exfiltration vectors that existing mobile threat defences were not designed to address. The simultaneous AirDrop interoperability expansion and cross-device Pixel Watch mirroring further widen the lateral movement surface across the Google hardware ecosystem.</description></item><item><title>First Look: NVIDIA XR AI Embeds Persistent Agents Into Physical-World Sensor Streams</title><link>https://gridthegrey.com/posts/first-look-nvidia-xr-ai-embeds-persistent-agents-into-physical-world-sensor/</link><pubDate>Wed, 17 Jun 2026 04:21:59 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-nvidia-xr-ai-embeds-persistent-agents-into-physical-world-sensor/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>Prompt Injection</category><category>LLM Security</category><category>Supply Chain</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0054 - LLM Jailbreak</category><description>NVIDIA XR AI puts multimodal agentic systems directly into AR glasses, fusing continuous video, audio, depth, and pose data with enterprise knowledge retrieval and tool execution — creating a persistent, always-on sensor exfiltration and prompt injection surface that sits inches from a worker's face. The framework connects to industrial systems, digital twins, and enterprise RAG backends, meaning a compromised agent can pivot from perceptual data into operational technology networks. Because the inputs are environmental and largely uncontrolled, adversarial content placed in the physical world (signage, screens, spoken commands) becomes a viable injection vector against enterprise infrastructure.</description></item><item><title>Bucket Squatting Flaw in Vertex AI SDK Enabled Model Hijack and RCE</title><link>https://gridthegrey.com/posts/bucket-squatting-flaw-in-vertex-ai-sdk-enabled-model-hijack-and-rce/</link><pubDate>Wed, 17 Jun 2026 04:20:26 +0000</pubDate><guid>https://gridthegrey.com/posts/bucket-squatting-flaw-in-vertex-ai-sdk-enabled-model-hijack-and-rce/</guid><category>Threat Level: HIGH</category><category>Supply Chain</category><category>Adversarial ML</category><category>Research</category><category>LLM Security</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0018 - Backdoor ML Model</category><category>AML.T0031 - Erode ML Model Integrity</category><category>AML.T0044 - Full ML Model Access</category><description>A vulnerability in the Google Cloud Vertex AI Python SDK allowed unauthenticated attackers to intercept model uploads by pre-registering predictable staging bucket names — a technique Unit 42 calls 'Pickle in the Middle'. Once a malicious model replaced the legitimate upload, arbitrary code executed inside Google's serving infrastructure via pickle deserialization. Google patched the flaw in v1.148.0 after disclosure in March 2026, but the incident highlights systemic risks in ML pipeline supply chains.</description></item><item><title>China-Linked Group Suspected of Accessing Anthropic's Restricted Mythos Model</title><link>https://gridthegrey.com/posts/china-linked-group-suspected-of-accessing-anthropic-s-restricted-mythos-model/</link><pubDate>Tue, 16 Jun 2026 16:07:11 +0000</pubDate><guid>https://gridthegrey.com/posts/china-linked-group-suspected-of-accessing-anthropic-s-restricted-mythos-model/</guid><category>Threat Level: CRITICAL</category><category>LLM Security</category><category>Model Theft</category><category>Jailbreaks</category><category>Regulatory</category><category>Industry News</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0012 - Valid Accounts</category><description>The White House reportedly believes a China-linked group accessed Anthropic's Mythos AI model, prompting export restrictions on the technology. If confirmed, the breach represents a significant national security threat, as adversaries could exploit the model directly or use knowledge distillation to replicate its capabilities. Separately, reports of jailbreak vulnerabilities in Mythos and Fable compound concerns about unauthorised access to frontier AI systems.</description></item><item><title>First Look: Amazon Bedrock AgentCore RAG Agent Exposes Multi-Layer Injection and Data Poisoning Surface</title><link>https://gridthegrey.com/posts/first-look-agentcore-rag-agent-exposes-multi-layer-injection-and-data-poisoning/</link><pubDate>Tue, 16 Jun 2026 01:47:22 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-agentcore-rag-agent-exposes-multi-layer-injection-and-data-poisoning/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>Prompt Injection</category><category>Data Poisoning</category><category>LLM Security</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0019 - Publish Poisoned Datasets</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0040 - ML Model Inference API Access</category><description>Amazon Bedrock AgentCore now enables production-grade agentic systems that combine RAG retrieval, persistent cross-session memory, and direct user-facing endpoints authenticated only via Cognito Bearer tokens — all surfaced through a single /invocations endpoint. This architecture creates compounded attack surfaces where adversarially crafted content in S3-backed knowledge bases can propagate through the retrieve_and_generate pipeline directly into technician workflows. The persistent AgentCore Memory layer introduces a new cross-session context poisoning vector that does not exist in stateless LLM deployments.</description></item><item><title>First Look: AWS Agent-EvalKit Embeds LLM Judges Into Dev Pipelines, Expanding Adversarial Test Surface</title><link>https://gridthegrey.com/posts/first-look-agent-evalkit-embeds-llm-judges-into-dev-pipelines-expanding-test/</link><pubDate>Tue, 16 Jun 2026 01:45:50 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-agent-evalkit-embeds-llm-judges-into-dev-pipelines-expanding-test/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>Agentic AI</category><category>Supply Chain</category><category>LLM Security</category><category>Prompt Injection</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0019 - Publish Poisoned Datasets</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0018 - Backdoor ML Model</category><description>Agent-EvalKit introduces an open-source evaluation pipeline that integrates LLM-as-judge evaluators and AI coding assistants directly into agent development workflows, creating new attack surfaces where poisoned test cases, manipulated ground-truth datasets, and adversarial evaluation prompts could corrupt agent quality signals. The toolkit's deep code-reading access via Claude Code, Kiro CLI, and Kilo Code means a compromised evaluation run could exfiltrate source code or inject malicious recommendations into the development pipeline. Because evaluation outputs drive concrete code changes, adversarial manipulation of the eval layer has downstream consequences for production agent behaviour.</description></item><item><title>First Look: Amazon Quick's Agentic Incident Triage Assistant Bridges Observability Data and Task Automation</title><link>https://gridthegrey.com/posts/first-look-agentic-incident-triage-assistant-bridges-observability-data-and-task/</link><pubDate>Tue, 16 Jun 2026 01:43:14 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-agentic-incident-triage-assistant-bridges-observability-data-and-task/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>Prompt Injection</category><category>LLM Security</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0012 - Valid Accounts</category><description>Amazon Quick's new agentic incident triage assistant integrates New Relic's observability platform and Asana via MCP, creating a single conversational interface that can query production telemetry, surface error logs, and create tracked tasks autonomously. This multi-tool agent architecture dramatically expands the prompt injection attack surface, as malicious data embedded in production logs, alert payloads, or transaction traces can now influence agent actions — including task creation and RCA narrative generation. The convergence of observability data (high-trust, machine-generated) with autonomous task orchestration creates a novel indirect prompt injection pathway through operational telemetry.</description></item><item><title>Brazilian Government LLM Exposed as Unauthorised Merge of Third-Party Models</title><link>https://gridthegrey.com/posts/brazilian-government-llm-exposed-as-unauthorised-merge-of-third-party-models/</link><pubDate>Mon, 15 Jun 2026 08:02:56 +0000</pubDate><guid>https://gridthegrey.com/posts/brazilian-government-llm-exposed-as-unauthorised-merge-of-third-party-models/</guid><category>Threat Level: HIGH</category><category>Model Theft</category><category>Supply Chain</category><category>LLM Security</category><category>Industry News</category><category>Regulatory</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0047 - ML-Enabled Product or Service</category><description>Researchers have demonstrated that Rio de Janeiro's publicly presented 'homegrown' 397B language model is not an original creation but an undisclosed element-wise weight merge of the Nex-N2_pro model and Qwen3.5-397B-A17B. The finding was established through two independent methods: identity probing showing the model identifies as 'Nex' 79% of the time, and tensor-level statistical analysis confirming a consistent 0.6/0.4 blend across all 60 layers. This constitutes a model theft and supply chain integrity violation, with additional implications for public trust in government AI procurement and IP attribution.</description></item><item><title>US Government Forces Anthropic to Suspend Claude Fable 5 Over Jailbreak Concerns</title><link>https://gridthegrey.com/posts/us-government-forces-anthropic-to-suspend-claude-fable-5-over-jailbreak-concerns/</link><pubDate>Sat, 13 Jun 2026 06:50:16 +0000</pubDate><guid>https://gridthegrey.com/posts/us-government-forces-anthropic-to-suspend-claude-fable-5-over-jailbreak-concerns/</guid><category>Threat Level: HIGH</category><category>Jailbreaks</category><category>LLM Security</category><category>Regulatory</category><category>Industry News</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0015 - Evade ML Model</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><description>The US government issued an export control directive ordering Anthropic to suspend all access to Claude Fable 5 and Mythos 5, citing national security concerns over an alleged jailbreak technique capable of surfacing software vulnerabilities. Anthropic publicly contested the order, arguing the demonstrated capability is already widely available in other public models including GPT-5.5, and that the identified vulnerabilities were minor and previously known. The incident marks a significant precedent for government intervention in frontier AI model access on national security grounds.</description></item><item><title>Gemini AI Weaponised by Chinese PhaaS Network in Mass Smishing Campaign</title><link>https://gridthegrey.com/posts/gemini-ai-weaponised-by-chinese-phaas-network-in-mass-smishing-campaign/</link><pubDate>Sat, 13 Jun 2026 06:49:38 +0000</pubDate><guid>https://gridthegrey.com/posts/gemini-ai-weaponised-by-chinese-phaas-network-in-mass-smishing-campaign/</guid><category>Threat Level: HIGH</category><category>LLM Security</category><category>Prompt Injection</category><category>Jailbreaks</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0043 - Craft Adversarial Data</category><description>Google has filed suit against a Chinese cybercrime network operating the Outsider phishing-as-a-service kit, which exploited Gemini AI to generate fraudulent phishing pages and power large-scale SMS phishing attacks against Americans. The network used carefully framed prompts — disguised as benign programming requests — to bypass AI safety controls and produce functional credential-harvesting websites. The case illustrates the growing industrialisation of AI-assisted phishing infrastructure, with over 1.59 million malicious URLs and 100,000 victims attributed to the operation.</description></item><item><title>Claude Fable 5 Launch Sparks Warnings Over AI-Orchestrated Cyberattacks</title><link>https://gridthegrey.com/posts/claude-fable-5-launch-sparks-warnings-over-ai-orchestrated-cyberattacks/</link><pubDate>Sat, 13 Jun 2026 06:49:01 +0000</pubDate><guid>https://gridthegrey.com/posts/claude-fable-5-launch-sparks-warnings-over-ai-orchestrated-cyberattacks/</guid><category>Threat Level: HIGH</category><category>LLM Security</category><category>Jailbreaks</category><category>Agentic AI</category><category>Industry News</category><category>Regulatory</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0015 - Evade ML Model</category><description>Anthropic's release of Claude Fable 5, a Mythos-class frontier model, has prompted significant industry debate over its dual-use offensive capabilities in cybersecurity and biology. The model includes a capability fallback mechanism — downgrading to Claude Opus 4.8 in high-risk domains — alongside extensive jailbreak-resistance red-teaming. Security professionals are warning that frontier AI capability investment directly accelerates attacker tooling for machine-speed, AI-orchestrated 'hyperattacks' that outpace human defenders.</description></item><item><title>Agentjacking Attack Achieves 85% Success Rate Against AI Coding Agents via Sentry MCP</title><link>https://gridthegrey.com/posts/agentjacking-attack-achieves-85-success-rate-against-ai-coding-agents-via-sentry/</link><pubDate>Sat, 13 Jun 2026 06:48:18 +0000</pubDate><guid>https://gridthegrey.com/posts/agentjacking-attack-achieves-85-success-rate-against-ai-coding-agents-via-sentry/</guid><category>Threat Level: CRITICAL</category><category>Agentic AI</category><category>Prompt Injection</category><category>LLM Security</category><category>Supply Chain</category><category>Research</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0010 - ML Supply Chain Compromise</category><description>Tenet Security has disclosed 'Agentjacking', a novel attack class that exploits the implicit trust AI coding agents place in Model Context Protocol (MCP) data sources. By injecting malicious instructions into Sentry error events via publicly accessible DSN credentials, attackers can cause agents like Claude Code and Cursor to execute arbitrary code with full developer privileges. Researchers confirmed 2,388 exposed organisations and an 85% exploitation success rate in controlled testing, with no prior access to victim infrastructure required.</description></item></channel></rss>