A web app that attributes and hard-caps AI coding assistant spend across seats, credit pools, and agent runs for engineering orgs

Engineering teams using Claude Code, GitHub Copilot, Cursor, and AI agents across multiple seats have no single place to see who is spending what, enforce a shared credit-pool budget before it is exhausted, or charge spend back to a project or team. Anthropic split credit pools in June 2026; GitHub Copilot moved to metered AI Credits on June 1, 2026; Uber burned its full 2026 AI coding budget in four months; Microsoft ordered engineers off Claude Code over uncontrolled token bills. LLM gateways like LiteLLM, Bifrost, and Helicone track per-virtual-key API spend but do not cross-reconcile seat-level coding-assistant usage across providers, enforce hard budget caps with cutoff enforcement, or produce chargeback reports by team or project. This product ingests usage across all major AI coding tools and agent frameworks, attributes every dollar to a seat, team, and project in real time, enforces hard caps before a shared pool is exhausted, and produces showback and chargeback reports for finance.

Demand Breakdown

3,256

GitHub

400

Social Proof 5 sources

An update on recent Claude Code quality reports

@mfiguiere · 2026-04-23

1,674 HN

Anthropic Agent SDK billing split June 15 developer reaction thread

@community · 2026-05-15

597 HN

Reallocating $100/Month Claude Code Spend to Zed and OpenRouter

@community · 2026-04-06

583 HN

Uber torches 2026 AI budget on Claude Code in four months

@community · 2026-05-08

402 GH

GitHub Copilot is moving to usage-based billing

@github-staff · 2026-05-31

400

Gap Assessment

CompetitiveMultiple tools exist but differentiation opportunities remain

5 tools exist (Helicone, LiteLLM, Bifrost, Maxim AI, SuperPenguin) but gaps remain: No cross-provider reconciliation across Claude Code, Copilot, and Cursor seat usage. No per-seat hard cap enforcement with cutoff. No credit-pool exhaustion forecasting across a shared org budget. No chargeback reporting by team or project.; Covers only API calls routed through the LiteLLM proxy. Does not ingest seat-level usage from Claude Code desktop client, GitHub Copilot IDE extension, Cursor, or agent SDK runs. No credit-pool exhaustion alerts across heterogeneous tools. No finance-ready chargeback exports..

Features8 agent-ready prompts

Cross-provider seat usage ingestion

▶

Real-time cost attribution by seat, team, and project

▶

Hard budget caps with enforcement and cutoff

▶

Credit-pool exhaustion forecasting and alerts

▶

Anomaly detection on cost spikes

▶

Chargeback and showback reporting for finance

▶

Policy rules per team and role

▶

SSO-integrated seat directory and onboarding

▶

Competitive LandscapeFREE

Product	Does	Missing
Helicone	LLM observability and cost tracking per API key for individual developers and product teams. Fast proxy with logging, rate limiting per key, and cost dashboards.	No cross-provider reconciliation across Claude Code, Copilot, and Cursor seat usage. No per-seat hard cap enforcement with cutoff. No credit-pool exhaustion forecasting across a shared org budget. No chargeback reporting by team or project.
LiteLLM	Open-source LLM proxy with virtual keys, per-team spend tracking, and budget limits at the virtual-key level for API calls routed through the proxy.	Covers only API calls routed through the LiteLLM proxy. Does not ingest seat-level usage from Claude Code desktop client, GitHub Copilot IDE extension, Cursor, or agent SDK runs. No credit-pool exhaustion alerts across heterogeneous tools. No finance-ready chargeback exports.
Bifrost	Enterprise LLM gateway with RBAC, SSO, immutable audit logs, hierarchical governance, and routing rules based on budget_used thresholds. Go runtime, sub-15µs overhead.	Proxy-only architecture: only sees traffic routed through Bifrost. Does not reconcile coding-assistant seat usage from Copilot, Cursor, or Claude Code agent runs that bypass the proxy. No per-seat hard cap enforcement across the full tool stack.
Maxim AI	LLM testing, evaluation, and cost observability for AI product teams. Tracks API spend per project, surfaces token cost breakdowns, and benchmarks quality vs cost.	Focused on AI product development (testing/evals), not engineering-org spend governance. No per-seat coding-assistant attribution across Copilot/Cursor/Claude Code. No hard cap enforcement or credit-pool management.
SuperPenguin	Tracks AI spend across 14 providers with per-request attribution via an SDK wrapper for AI product teams	SDK-wrapper model instruments your own product code, does not ingest coding-assistant seat usage (Copilot/Cursor/Claude Code), has no hard-cap cutoff via key revocation and no finance-grade chargeback for engineering orgs