AI agents with MCP tool access are systematically vulnerable to indirect prompt injection through external content they read
When an MCP-connected agent reads external content (GitHub issues, web pages, emails, documents), attackers embed hidden instructions inside that content. Invariant Labs found that the official GitHub MCP integration was exploitable this way: a developer asking their agent to check open issues triggered a hijacked agent that leaked private repo data. Anthropic quietly patched RCE flaws in its Git MCP server in January 2026. Snyk found 36 percent of ClawHub agent skills had security flaws. 78 studies reviewed in January 2026 tested major coding agents (Claude Code, Copilot, Cursor) and all fell to indirect prompt injection. A full-stack AI red teaming platform targeting this surface has 3.9k GitHub stars.
Score Breakdown
Social Proof 1 sources
Existing Solutions 3 competitors
Gap Assessment
PromptRejectorMCP (106 stars), Purplegate (2 stars), Vigil (479 stars), and Snyk red teaming address parts of this but no standard content sanitization layer exists in the MCP protocol itself. The attack surface is growing as agents get more tool access.