A plugin that prunes OpenClaw's 40K-token workspace injection down to only the files relevant to each message, cutting baseline API costs by 60-80%
OpenClaw injects the entire workspace context (SOUL.md, agent configs, skill descriptions, tool declarations) into every message, consuming 40,000+ tokens before the user types a word. This fixed overhead makes smaller models unusable (they hit context limits immediately) and multiplies API costs 3-5x for conversational sessions. The plugin intercepts the context-assembly step and applies relevance filtering, including only the workspace files and tool declarations that match the current message's intent, dropping the baseline from 40K to under 10K tokens.
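The filtering step described above can be sketched as a keyword-overlap scorer with a token budget. This is a minimal illustration, not OpenClaw's actual plugin API: the `WorkspaceFile` type, `relevanceFilter` function, and the 4-characters-per-token estimate are all assumptions for the sake of the example.

```typescript
// Hypothetical types and names; OpenClaw's real context-assembly hook may differ.
interface WorkspaceFile {
  path: string;
  content: string;
}

// Rough token estimate: ~4 characters per token (a common heuristic, not exact).
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Score each file by how many of the message's keywords appear in it,
// drop zero-score files, then greedily keep top-scoring files until
// the token budget is exhausted.
function relevanceFilter(
  files: WorkspaceFile[],
  message: string,
  budget: number
): WorkspaceFile[] {
  const keywords = message
    .toLowerCase()
    .split(/\W+/)
    .filter((w) => w.length > 3);

  const scored = files
    .map((f) => ({
      file: f,
      score: keywords.filter((k) => f.content.toLowerCase().includes(k)).length,
    }))
    .filter((s) => s.score > 0)
    .sort((a, b) => b.score - a.score);

  const kept: WorkspaceFile[] = [];
  let used = 0;
  for (const { file } of scored) {
    const cost = estimateTokens(file.content);
    if (used + cost > budget) continue;
    kept.push(file);
    used += cost;
  }
  return kept;
}
```

A real implementation would likely use the model's own tokenizer and a smarter relevance signal (embeddings, file-path matching), but the shape is the same: score, rank, and cut at a budget.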
Gap Assessment
Four tools exist (lossless-claw, OpenClaw-Token-Optimizer, ClawRouter, and the built-in maxHistoryMessages), but all of them operate on conversation history, routing, or scheduling rather than on context assembly itself. None of them prunes workspace files or tool declarations from the prompt, so the 40K-token baseline injection persists even with these tools active.
Competitive Landscape
| Product | Does | Missing |
|---|---|---|
| lossless-claw | Compresses conversation history without losing critical information via a Context Engine plugin | Only compresses history, does not touch baseline workspace injection or tool declaration bloat. The 40K baseline persists even with lossless-claw active. |
| OpenClaw-Token-Optimizer | General token optimization skill combining model routing, heartbeat tuning, and usage tracking | Works at the routing and scheduling level, not at the context assembly level. Cannot prune workspace files or tool declarations from the prompt. |
| ClawRouter | Smart LLM router that reduces costs by routing simple tasks to cheaper models | Routing alone doesn't fix baseline bloat. A 40K-token baseline still goes to the cheap model, just at a lower per-token rate. The waste is the same tokens, just cheaper. |
| maxHistoryMessages (built-in) | Limits conversation history to N messages to reduce context growth over time | Only limits history messages, not the system prompt or workspace injection. Baseline stays at 40K+ regardless of history settings. |
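The 60-80% savings claim follows from simple arithmetic on per-message input tokens. A back-of-envelope sketch, using the figures cited above (40K baseline vs. under 10K pruned); the 20-message session length and 500-token-per-turn history growth are illustrative assumptions, not measured values:

```typescript
// Each turn resends the fixed baseline plus the accumulated history.
// baseline: tokens injected into every message (workspace context).
// perMsg: illustrative assumption for history added per turn.
function sessionInputTokens(
  baseline: number,
  messages: number,
  perMsg: number
): number {
  let total = 0;
  for (let i = 0; i < messages; i++) {
    total += baseline + i * perMsg;
  }
  return total;
}

const before = sessionInputTokens(40_000, 20, 500); // 40K baseline
const after = sessionInputTokens(10_000, 20, 500); // pruned to 10K
const savings = 1 - after / before; // ≈ 0.67
```

With these assumptions the session drops from 895K to 295K input tokens, roughly 67% savings, squarely inside the claimed 60-80% range; shorter sessions (less history relative to baseline) land nearer the top of that range.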