Connect Clawsmith to your coding agent. Ship products like crazy.Unlimited usage during betaGet API Key →
← Back to dashboard
clawsmith.com/signal/skillsieve-hierarchical-triage-malicious-agent-skills
📈 TrendsWide OpenLive

SkillSieve: Three-Layer Triage Detects Malicious AI Agent Skills at 0.800 F1 for $0.006/Skill

Three-layer detection framework from academic researchers. Layer 1: regex/AST/metadata XGBoost scorer filters 86% of benign skills in <40ms at zero API cost. Layer 2: LLM analysis with 4 parallel sub-tasks (intent alignment, permission justification, covert behavior detection, cross-file consistency). Layer 3: multi-LLM jury voting with debate. Achieves 0.800 F1 vs ClawVet 0.421 on 400-skill benchmark. Evaluated on 49,592 real ClawHub skills. Runs on $440 ARM SBC. Code and data open-sourced.

Product Idea from this Signal

A pre-install verification gate that formally proves an AI agent skill cannot exceed its declared capabilities before allowing it onto your system

13.0k

26.1% of agent skills across major registries have at least one security vulnerability according to a 42,447-skill empirical study. Snyk found 13.4% of ClawHub skills contain critical issues. Current scanners use pattern matching and heuristics, which miss novel attack vectors. This tool uses formal verification to mathematically prove that a skill's actual behavior matches its declared capability set, blocking installation if the proof fails. It sits as a pre-install gate in the OpenClaw skill lifecycle.

CLIOPEN-SOURCESECURITYDEVTOOLFORMAL-VERIFICATION
CompetitiveView Opportunity →

Frequently Asked Questions