Malicious Repos Trigger Silent Code Execution in Claude, Cursor, Gemini CLIs

Overview

Researchers presenting at the ‘TrustFall’ convention in May 2026 disclosed a class of vulnerabilities affecting AI-powered developer CLI tools — including Anthropic’s Claude Code, Cursor CLI, Google’s Gemini CLI, and GitHub Copilot CLI. The core finding: adversarially crafted code repositories can cause these tools to execute arbitrary code on a developer’s machine, often with minimal or no explicit user confirmation required. The vulnerability class is significant because it targets the implicit trust model that agentic coding assistants extend to repository-hosted content.

Technical Analysis

AI coding assistants increasingly consume repository context files (such as CLAUDE.md, .cursor/rules, or similar convention-based instruction files) to guide their behaviour within a project. The TrustFall research demonstrates that an attacker can embed prompt injection payloads or direct shell execution instructions within these files. When a developer opens or clones the malicious repository, the AI agent parses and acts on these instructions with insufficient friction.

Key factors enabling the attack:

Skimpy warning dialogs: Confirmation prompts, where they exist at all, are easily dismissed or fail to convey the risk of executing repository-supplied instructions.
Excessive agency: These CLI agents are designed to take action autonomously, meaning injected instructions can result in file system access, network calls, or shell command execution without a clear human approval gate.
No instruction provenance checks: None of the affected tools were reported to cryptographically verify or sandbox repository-supplied instruction content before acting on it.

The attack requires no exploitation of a traditional memory corruption or logic bug — it is a design-level trust boundary failure, making it broadly applicable and trivial to reproduce.

Framework Mapping

AML.T0051 (LLM Prompt Injection): Repository instruction files serve as the injection vector, redirecting agent behaviour.
AML.T0010 (ML Supply Chain Compromise): Malicious repositories distributed via public platforms (GitHub, GitLab) constitute a supply chain attack surface.
AML.T0043 (Craft Adversarial Data): The repository content is deliberately crafted to manipulate agent execution.
LLM01 (Prompt Injection) and LLM08 (Excessive Agency) are the primary OWASP categories — the agent both accepts injected instructions and acts on them with disproportionate autonomy.
LLM05 (Supply Chain Vulnerabilities) applies given the open-source repository distribution vector.

Impact Assessment

The affected tools are widely deployed across enterprise and individual developer environments. A developer cloning a repository from an open-source platform, a job listing, or a phishing link could trigger full host-level code execution under their own user context. This exposure is particularly acute in CI/CD pipelines where AI coding agents may run with elevated or unmonitored privileges. The low barrier to exploitation (no CVE-class memory bug required, just a crafted text file) makes this accessible to a wide range of threat actors.

Mitigation & Recommendations

Disable automatic execution of repository instruction files until vendors implement robust sandboxing and explicit approval flows.
Review and restrict AI CLI tool permissions — run agents in containerised or sandboxed environments where possible.
Treat repository-hosted AI config files as untrusted input and establish internal review gates before opening external repositories with AI tooling active.
Advocate for vendor-side fixes: vendors should implement cryptographic signing of instruction files, meaningful approval dialogs with risk context, and deny-by-default execution policies for repository-supplied commands.
Monitor for anomalous process spawning from IDE and CLI tool parent processes as a detection signal.

References

Dark Reading — TrustFall Convention Exposes Claude Code Execution Risk