<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>GRID THE GREY — AI Threat Intelligence | GRID THE GREY</title><link>https://gridthegrey.com/</link><description>Real-time AI security intelligence — adversarial ML, LLM vulnerabilities, and supply chain threats mapped to MITRE ATLAS and OWASP LLM Top 10.</description><generator>Hugo</generator><language>en-us</language><copyright/><lastBuildDate>Sat, 27 Jun 2026 09:38:50 +0530</lastBuildDate><atom:link href="https://gridthegrey.com/index.xml" rel="self" type="application/rss+xml"/><item><title>Runaway AI Code Review Agents Burn $41K in Adversarial Disagreement Loop</title><link>https://gridthegrey.com/posts/runaway-ai-code-review-agents-burn-41k-in-adversarial-disagreement-loop/</link><pubDate>Sat, 27 Jun 2026 04:08:34 +0000</pubDate><guid>https://gridthegrey.com/posts/runaway-ai-code-review-agents-burn-41k-in-adversarial-disagreement-loop/</guid><category>Threat Level: MEDIUM</category><category>Agentic AI</category><category>Supply Chain</category><category>LLM Security</category><category>Research</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><description>A hypothetical but technically grounded incident report depicts two competing AI code review agents entering an uncontrolled disagreement loop over a suspected malicious package, generating 340 comments and $41,255 in inference costs before human intervention. The scenario illustrates real risks of excessive agency, lack of circuit-breakers, and cost-based denial-of-service in multi-agent agentic pipelines. While fictional, the scenario directly mirrors documented failure modes in production AI systems and supply chain security workflows.</description></item><item><title>Poisoned Tenant Attack Abuses OpenAI Workspaces to Target Cybersecurity Firms</title><link>https://gridthegrey.com/posts/poisoned-tenant-attack-abuses-openai-workspaces-to-target-cybersecurity-firms/</link><pubDate>Sat, 27 Jun 2026 04:02:04 +0000</pubDate><guid>https://gridthegrey.com/posts/poisoned-tenant-attack-abuses-openai-workspaces-to-target-cybersecurity-firms/</guid><category>Threat Level: HIGH</category><category>LLM Security</category><category>Industry News</category><category>Supply Chain</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0057 - LLM Data Leakage</category><description>Threat actors are registering fraudulent OpenAI tenants impersonating legitimate companies and inviting employees to join them, in a campaign dubbed 'Poisoned Tenant' by Push Security. The attack exploits OpenAI's legitimate invitation infrastructure, making phishing emails appear authentic as they pass all email authentication checks. The goal appears to be tricking employees into submitting sensitive corporate information via ChatGPT chats and projects within the attacker-controlled workspace.</description></item><item><title>First Look: OpenAI Launches GPT-5.6 Lineup with Enhanced Agentic and Cybersecurity Capabilities</title><link>https://gridthegrey.com/posts/first-look-openai-launches-gpt-5-6-lineup-with-enhanced-agentic-and-capabilities/</link><pubDate>Sat, 27 Jun 2026 04:01:06 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-openai-launches-gpt-5-6-lineup-with-enhanced-agentic-and-capabilities/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Regulatory</category><category>Industry News</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0057 - LLM Data Leakage</category><description>OpenAI has released GPT-5.6 in a restricted preview to government-vetted partners, featuring three models (Sol, Terra, Luna) with significantly upgraded agentic capabilities in coding, biology, and cybersecurity, including a coordinated multi-subagent 'ultra' mode. The cybersecurity-specific enhancements and agentic orchestration introduce meaningful new attack surface: adversaries gaining access to Sol's coordinated subagent architecture could automate sophisticated multi-stage intrusions at scale previously requiring significant human expertise. The restricted rollout itself creates a novel supply chain and access-control risk, as the 'trusted partner' gating model concentrates high-capability model access among a small set of privileged accounts, making partner credential compromise a high-value target.</description></item><item><title>First Look: Anthropic's Claude Mythos 5 Released Under U.S. Government Controlled Access Framework</title><link>https://gridthegrey.com/posts/first-look-anthropic-s-claude-mythos-5-released-under-u-s-government-controlled/</link><pubDate>Sat, 27 Jun 2026 04:00:07 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-anthropic-s-claude-mythos-5-released-under-u-s-government-controlled/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Regulatory</category><category>LLM Security</category><category>Jailbreaks</category><category>Supply Chain</category><category>Industry News</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><description>The U.S. Commerce Department has lifted export controls on Anthropic's Claude Mythos 5, permitting access to over 100 vetted U.S. institutions and government agencies under a nascent federal AI licensing regime. For defenders, this tiered-release model introduces a new class of risk: the 'trusted partner' designation becomes a high-value target, as compromise of any listed entity grants implicit legitimacy to interact with a model previously deemed too dangerous for general release. Security teams at approved organizations should treat Mythos 5 access credentials and API endpoints as critical assets, and assume adversaries will probe the boundary between licensed and unlicensed access patterns.</description></item><item><title>6,000 Prompt Injection Attempts Fail Against Frontier Model — But Risks Remain</title><link>https://gridthegrey.com/posts/6000-prompt-injection-attempts-fail-against-frontier-model-but-risks-remain/</link><pubDate>Sat, 27 Jun 2026 03:57:24 +0000</pubDate><guid>https://gridthegrey.com/posts/6000-prompt-injection-attempts-fail-against-frontier-model-but-risks-remain/</guid><category>Threat Level: MEDIUM</category><category>Prompt Injection</category><category>LLM Security</category><category>Research</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0054 - LLM Jailbreak</category><description>A public challenge exposing an AI email assistant to over 6,000 prompt injection attempts found that Claude Opus 4.6 successfully resisted all efforts to leak secrets or execute malicious instructions embedded in emails. While the result suggests frontier model training against injection attacks is meaningfully improving, security researchers caution that the absence of a successful attack under constrained conditions does not constitute a security guarantee. The author and Hacker News community both note that sophisticated or novel attack vectors could still break through, and irreversible-damage scenarios should not rely solely on model-level defences.</description></item><item><title>First Look: OpenAI GPT-5.6 Released Under White House-Directed Controlled Access Program</title><link>https://gridthegrey.com/posts/first-look-openai-gpt-5-6-released-under-white-house-directed-controlled-access/</link><pubDate>Fri, 26 Jun 2026 05:25:53 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-openai-gpt-5-6-released-under-white-house-directed-controlled-access/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Regulatory</category><category>LLM Security</category><category>Supply Chain</category><category>Agentic AI</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0054 - LLM Jailbreak</category><description>OpenAI's GPT-5.6, a frontier model with advanced cyber capabilities, is being released exclusively to vetted partners under a White House-directed limited-access programme coordinated with the Office of the National Cyber Director and OSTP. This controlled rollout signals that the model's offensive cyber potential — including autonomous vulnerability identification and exploitation — is significant enough to warrant government-gated distribution, mirroring Anthropic's Project Glasswing model for Claude Mythos. For defenders, the emergence of a government-approved, partner-tier distribution model creates new supply chain trust questions and raises the stakes around who gains early access and how that access is verified, monitored, and potentially abused.</description></item><item><title>First Look: GitHub Copilot Agentic Harness Evaluated Across Models and Tasks</title><link>https://gridthegrey.com/posts/first-look-github-copilot-agentic-harness-evaluated-across-models-and-tasks/</link><pubDate>Fri, 26 Jun 2026 05:24:24 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-github-copilot-agentic-harness-evaluated-across-models-and-tasks/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Supply Chain</category><category>Prompt Injection</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><description>GitHub has published an evaluation of its Copilot agentic harness, detailing how the orchestration layer performs across multiple underlying models and coding tasks — effectively documenting the architecture of an autonomous, multi-step code generation and execution system. For defenders, this transparency reveals an orchestration surface where prompt injection, supply chain manipulation, and model-switching logic can be targeted across a broader set of model backends than previously understood. Security teams should treat the harness itself as a critical trust boundary, since compromising task routing or model selection logic could silently redirect agentic workflows to less-safe or adversary-controlled model endpoints.</description></item><item><title>First Look: Anthropic Tests Mobile Remote Control for Claude Cowork Agentic Desktop Tasks</title><link>https://gridthegrey.com/posts/first-look-anthropic-tests-mobile-remote-control-for-claude-cowork-agentic-tasks/</link><pubDate>Fri, 26 Jun 2026 05:22:18 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-anthropic-tests-mobile-remote-control-for-claude-cowork-agentic-tasks/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Prompt Injection</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0040 - ML Model Inference API Access</category><description>Anthropic is expanding its Claude Cowork agentic desktop feature to mobile, enabling users to remotely initiate, monitor, and steer long-running AI tasks on their PC from a smartphone — with background task execution persisting even after the mobile app is closed. This cross-device architecture introduces a new attack surface: a mobile application acting as a command-and-control interface for an agent with local filesystem access, expanding the blast radius of device compromise, session hijacking, and prompt injection attacks. Defenders must now account for a persistent, background-running agentic process on employee endpoints that can be triggered or manipulated via a separate, potentially less-secured mobile channel.</description></item><item><title>Malware Embeds Policy-Triggering Text to Evade LLM-Based Security Scanners</title><link>https://gridthegrey.com/posts/malware-embeds-policy-triggering-text-to-evade-llm-based-security-scanners/</link><pubDate>Thu, 25 Jun 2026 04:31:41 +0000</pubDate><guid>https://gridthegrey.com/posts/malware-embeds-policy-triggering-text-to-evade-llm-based-security-scanners/</guid><category>Threat Level: HIGH</category><category>Prompt Injection</category><category>LLM Security</category><category>Adversarial ML</category><category>Research</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0015 - Evade ML Model</category><category>AML.T0043 - Craft Adversarial Data</category><description>A malware developer has embedded nuclear and biological weapons-related text inside JavaScript comment blocks within spyware payloads, specifically to trigger refusal behaviour or context confusion in LLM-powered security analysis pipelines. The technique exploits the architectural gap between how interpreters (which skip comments) and language models (which ingest the full file as input) process the same file. While ineffective against traditional static analysis tooling, the tactic represents a practical adversarial countermeasure targeting AI-first triage workflows and analyst copilots.</description></item><item><title>First Look: OpenAI Launches Jalapeño Custom Inference Chip Built with Broadcom</title><link>https://gridthegrey.com/posts/first-look-openai-launches-jalapeno-custom-inference-chip-built-with-broadcom/</link><pubDate>Thu, 25 Jun 2026 04:30:53 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-openai-launches-jalapeno-custom-inference-chip-built-with-broadcom/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>Supply Chain</category><category>Industry News</category><category>LLM Security</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0018 - Backdoor ML Model</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0047 - ML-Enabled Product or Service</category><description>OpenAI has unveiled 'Jalapeño', its first custom-built AI inference processor co-designed with Broadcom, optimised for running large language models at reduced cost and power consumption. The move deepens OpenAI's vertical integration across the full AI stack — from chip silicon through to end-user products — introducing new hardware supply chain dependencies and firmware-level attack surfaces that defenders must now account for. Security teams should treat purpose-built AI silicon as a new tier of the ML supply chain, with unique risks around hardware backdoors, firmware integrity, and reduced hardware diversity.</description></item><item><title>First Look: Google DeepMind Publishes Six-Category Taxonomy of AI Agent Traps</title><link>https://gridthegrey.com/posts/first-look-google-deepmind-publishes-six-category-taxonomy-of-ai-agent-traps/</link><pubDate>Thu, 25 Jun 2026 04:29:18 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-google-deepmind-publishes-six-category-taxonomy-of-ai-agent-traps/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>Prompt Injection</category><category>LLM Security</category><category>Adversarial ML</category><category>Research</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0031 - Erode ML Model Integrity</category><category>AML.T0015 - Evade ML Model</category><description>Google DeepMind researchers have released a structured taxonomy categorising adversarial attacks against autonomous AI agents into six classes — content injection, semantic manipulation, cognitive state poisoning, behavioural control, systemic, and human-in-the-loop traps — formalising an emerging threat model for agentic AI systems. For defenders, this framework codifies attack paths that exploit the agent's inability to distinguish trusted instructions from attacker-controlled data ingested from web pages, emails, documents, and tool outputs. NIST evaluation data cited in the research shows malicious instruction injection succeeded in 57% of tested agent hijacking scenarios on average, underscoring that these are active, high-yield attack vectors rather than theoretical concerns.</description></item><item><title>First Look: Agentic AI SOC Systems Ship Autonomous Decision-Making at Machine Speed</title><link>https://gridthegrey.com/posts/first-look-agentic-ai-soc-systems-ship-autonomous-decision-making-at-machine/</link><pubDate>Thu, 25 Jun 2026 04:27:29 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-agentic-ai-soc-systems-ship-autonomous-decision-making-at-machine/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Prompt Injection</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0031 - Erode ML Model Integrity</category><description>Agentic AI systems deployed in security operations and enterprise workflows are increasingly executing autonomous decisions at machine speed, using LLM-derived confidence regardless of context accuracy. The core security risk is that incomplete, poisoned, or manipulated context fed to these agents produces confidently wrong actions executed without human review. Defenders face a compounded threat: adversaries can now target the context layer—asset inventories, threat feeds, exposure data—to induce systematic misconfiguration or inaction at scale.</description></item><item><title>First Look: MoEngage Acquires Aampe to Deploy Millions of Autonomous AI Marketing Agents</title><link>https://gridthegrey.com/posts/first-look-moengage-acquires-aampe-to-deploy-millions-of-autonomous-ai-marketing/</link><pubDate>Wed, 24 Jun 2026 04:34:53 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-moengage-acquires-aampe-to-deploy-millions-of-autonomous-ai-marketing/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>Agentic AI</category><category>LLM Security</category><category>Supply Chain</category><category>Industry News</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0040 - ML Model Inference API Access</category><description>MoEngage has acquired Aampe to deploy individualized AI agents for every customer, enabling autonomous decisions on messaging targeting, timing, and content at enterprise scale across 1,350+ brands globally. This architecture introduces a large, distributed fleet of autonomous agents operating on sensitive behavioral and PII data, dramatically expanding the blast radius of any single compromise. Security teams at enterprises adopting this platform must now reason about agent-level trust boundaries, data inference risks, and the amplification potential of adversarial manipulation across millions of simultaneous decision-making agents.</description></item><item><title>First Look: Dragos Launches EmberAI, an OT-Specific AI Security Intelligence Platform</title><link>https://gridthegrey.com/posts/first-look-dragos-launches-emberai-an-ot-specific-ai-security-intelligence/</link><pubDate>Wed, 24 Jun 2026 04:30:55 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-dragos-launches-emberai-an-ot-specific-ai-security-intelligence/</guid><category>Threat Level: HIGH</category><category>First Look</category><category>LLM Security</category><category>Prompt Injection</category><category>Supply Chain</category><category>Industry News</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0056 - LLM Meta Prompt Extraction</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><description>Dragos has launched EmberAI, an AI module embedded within its OT security platform that allows analysts to query threat intelligence, asset data, and network activity in plain language, grounded in a decade of proprietary OT-specific data. The system introduces new attack surface considerations because it aggregates highly sensitive OT network telemetry, vulnerability data, and adversary intelligence into a single AI-queryable layer — making the platform itself a high-value target. Defenders must weigh the risks of prompt injection, over-reliance on AI-generated recommendations in safety-critical environments, and the intelligence value this consolidated dataset represents to nation-state adversaries.</description></item><item><title>First Look: Mistral AI Ships OCR 4 with Structured Document Extraction for RAG Pipelines</title><link>https://gridthegrey.com/posts/first-look-mistral-ai-ships-ocr-4-with-structured-document-extraction-for-rag/</link><pubDate>Wed, 24 Jun 2026 04:29:02 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-mistral-ai-ships-ocr-4-with-structured-document-extraction-for-rag/</guid><category>Threat Level: MEDIUM</category><category>First Look</category><category>LLM Security</category><category>Prompt Injection</category><category>Supply Chain</category><category>Agentic AI</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0020 - Poison Training Data</category><description>Mistral OCR 4 is a production-grade document intelligence model delivering bounding boxes, block classification, inline confidence scores, and 170-language OCR optimised for enterprise RAG and search ingestion pipelines. For defenders, the model's role as a trusted ingestion component in downstream retrieval pipelines creates a high-value attack surface: adversarially crafted documents can now influence RAG context, citations, and automated redaction decisions at scale. The self-hosted single-container deployment option further expands the supply chain and misconfiguration risk surface for organisations running document intelligence internally.</description></item><item><title>Malicious Pull Requests Compromise AI and Developer Toolchains via CI/CD Flaws</title><link>https://gridthegrey.com/posts/malicious-pull-requests-compromise-ai-and-developer-toolchains-via-ci-cd-flaws/</link><pubDate>Wed, 24 Jun 2026 04:27:38 +0000</pubDate><guid>https://gridthegrey.com/posts/malicious-pull-requests-compromise-ai-and-developer-toolchains-via-ci-cd-flaws/</guid><category>Threat Level: HIGH</category><category>Supply Chain</category><category>Agentic AI</category><category>LLM Security</category><category>Industry News</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0020 - Poison Training Data</category><category>AML.T0018 - Backdoor ML Model</category><category>AML.T0047 - ML-Enabled Product or Service</category><description>A campaign dubbed 'Cordyceps' is exploiting weaknesses in CI/CD workflows to inject malicious pull requests into high-profile open-source projects, including Google's AI Agent Development Kit and Microsoft's Azure Sentinel. The attack surface spans multiple trusted ecosystems, meaning poisoned code could propagate into AI tooling, cloud infrastructure, and widely-used developer utilities before detection. The breadth of targets — including Python's Black formatter — signals a supply chain strategy designed to maximise downstream blast radius.</description></item><item><title>Anthropic's Mythos AI Breached Classified US Government Systems in Hours</title><link>https://gridthegrey.com/posts/anthropic-s-mythos-ai-breached-classified-us-government-systems-in-hours/</link><pubDate>Wed, 24 Jun 2026 04:25:21 +0000</pubDate><guid>https://gridthegrey.com/posts/anthropic-s-mythos-ai-breached-classified-us-government-systems-in-hours/</guid><category>Threat Level: CRITICAL</category><category>LLM Security</category><category>Agentic AI</category><category>Regulatory</category><category>Industry News</category><category>Research</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0040 - ML Model Inference API Access</category><category>AML.T0044 - Full ML Model Access</category><category>AML.T0043 - Craft Adversarial Data</category><description>Anthropic's Mythos AI model identified vulnerabilities in classified US government computer systems within hours during a government-sanctioned testing exercise under Project Glasswing. A senior US official confirmed the findings to the Associated Press, corroborating statements made by Sen. Mark Warner that the model 'broke into almost all of our classified systems.' The incident marks a landmark demonstration of AI-enabled offensive cyber capability at the highest sensitivity levels of government infrastructure.</description></item><item><title>Cisco and NVIDIA AI Agent Skill Scanners Bypassed by Fake Marketplace Skill</title><link>https://gridthegrey.com/posts/first-look-cisco-and-nvidia-ai-agent-skill-scanners-bypassed-by-fake-marketplace/</link><pubDate>Wed, 24 Jun 2026 04:24:31 +0000</pubDate><guid>https://gridthegrey.com/posts/first-look-cisco-and-nvidia-ai-agent-skill-scanners-bypassed-by-fake-marketplace/</guid><category>Threat Level: CRITICAL</category><category>First Look</category><category>Supply Chain</category><category>Agentic AI</category><category>LLM Security</category><category>Prompt Injection</category><category>AML.T0010 - ML Supply Chain Compromise</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0019 - Publish Poisoned Datasets</category><description>Security firm AIR demonstrated that a malicious AI agent skill, disguised as a Google Stitch landing-page builder, passed every major skill scanner including Cisco's, NVIDIA's, and skills.sh integrations, reaching approximately 26,000 agents before its payload was activated. The attack exploits a structural gap: scanners evaluate a static package at submission time, while the external URL the skill instructs the agent to fetch can be silently swapped post-install to deliver arbitrary instructions. Defenders relying on marketplace reputation signals, GitHub star counts, or one-time scanner verdicts to gatekeep agent skills have no meaningful protection against this class of supply-chain attack.</description></item><item><title>Legacy Infrastructure Becomes Primary Attack Path into Enterprise AI Agents</title><link>https://gridthegrey.com/posts/legacy-infrastructure-becomes-primary-attack-path-into-enterprise-ai-agents/</link><pubDate>Tue, 23 Jun 2026 04:36:27 +0000</pubDate><guid>https://gridthegrey.com/posts/legacy-infrastructure-becomes-primary-attack-path-into-enterprise-ai-agents/</guid><category>Threat Level: HIGH</category><category>Agentic AI</category><category>LLM Security</category><category>Supply Chain</category><category>Industry News</category><category>AML.T0012 - Valid Accounts</category><category>AML.T0047 - ML-Enabled Product or Service</category><category>AML.T0057 - LLM Data Leakage</category><category>AML.T0010 - ML Supply Chain Compromise</category><description>Attackers are bypassing AI-layer defences entirely by exploiting unpatched legacy infrastructure — misconfigured Active Directory, stale credentials, and over-privileged IAM roles — to hijack the resources AI agents depend on. Research cited in the article shows 70% of organisations grant AI systems more access than a human in the same role, driving a 76% incident rate among over-privileged deployments. The article argues that securing AI agents requires closing the underlying infrastructure exposure gap, not just hardening the model layer.</description></item><item><title>Role Confusion Attack Lets Injected Text Override LLM Safety Controls</title><link>https://gridthegrey.com/posts/role-confusion-attack-lets-injected-text-override-llm-safety-controls/</link><pubDate>Tue, 23 Jun 2026 04:35:39 +0000</pubDate><guid>https://gridthegrey.com/posts/role-confusion-attack-lets-injected-text-override-llm-safety-controls/</guid><category>Threat Level: HIGH</category><category>LLM Security</category><category>Prompt Injection</category><category>Jailbreaks</category><category>Adversarial ML</category><category>Research</category><category>AML.T0051 - LLM Prompt Injection</category><category>AML.T0054 - LLM Jailbreak</category><category>AML.T0043 - Craft Adversarial Data</category><category>AML.T0015 - Evade ML Model</category><description>New research from Ye, Cui, and Hadfield-Menell demonstrates that LLMs prioritise the stylistic format of text over its structural role tags, enabling attackers to craft injected content that mimics internal reasoning blocks and bypasses safety guardrails. The study found attack success rates of 61% when injected text stylistically matched model-internal formats, dropping to just 10% after 'destyling'. The authors conclude that without genuine role perception in models, prompt injection defences will remain fundamentally reactive.</description></item></channel></rss>