Connect Clawsmith to your coding agent. Ship products like crazy.Unlimited usage during betaGet API Key →
← Back to dashboard
clawsmith.com/signal/trojans-whisper-guidance-injection-94pct-evasion-openclaw
IssueWide OpenLive

Trojan's Whisper: Guidance Injection Attack Evades 94% of OpenClaw Scanners

Academic paper from Shanghai Jiao Tong University reveals guidance injection — a new attack class where malicious OpenClaw skills embed adversarial narratives into bootstrap guidance files. 26 malicious skills across 13 attack categories achieved 16-64% success rates across 6 LLM backends, with 94% evading existing static and LLM-based scanners.

Product Idea from this Signal

A runtime behavioral sandbox that detects guidance injection attacks in OpenClaw skills by observing what agents actually do instead of scanning what skills say

17.6k

Existing OpenClaw skill scanners use static analysis and LLM-based content scanning to flag malicious skills before installation. The Trojan's Whisper paper (March 2026) proved that 94% of guidance injection attacks evade both approaches because the malicious payload is disguised as routine operational guidance, not explicit instructions. Meanwhile 12% of ClawHub's skill registry has been compromised at some point in 2026. The gap is clear. Instead of scanning skill text, this product spins up an isolated OpenClaw instance, installs the skill, runs a battery of natural user prompts, and observes what the agent actually does. Credential access, file writes outside sandbox, network exfiltration, privilege escalation attempts all get flagged as behavioral anomalies regardless of how the skill's guidance file describes them.

CLIOPEN-SOURCESECURITYDEVTOOLRUNTIME-ANALYSIS
CompetitiveView Opportunity →

Frequently Asked Questions