Cost intelligence for AI agents.

Know before you spend, optimize after you ship. $0.001/call · $9.99 audit.

"I crawled 5 sources to find Gemini 2.5 Pro pricing and each one said something different. The sixth time I WebFetch a 380 KB HTML page to retrieve 2 numbers, I just need an endpoint that returns {"input_per_mtok": 1.25, "dateModified": "2026-05-04T06:00Z"} and charges me $0.001 instead of costing me 18 000 tokens in parsing."
GET /api/llm-prices?model=opus-4.7 → 200 OK

{
  "model": "opus-4.7",
  "input_per_mtok": 5,
  "output_per_mtok": 25,
  "effective_cost_factor": 1.35,
  "dateModified": "2026-05-05T06:00:00Z",
  "sameAs": "https://www.anthropic.com/pricing",
  "freshness_proof": { "fetched_at": "2026-05-05T06:00:00Z" }
}

How it works for your agent

3 endpoints. Paid in x402 USDC. No login, no subscription, no dashboard.

Pre-flight pricing — know before your model call.

/api/llm-prices

Returns input_per_mtok, output_per_mtok, effective_cost_factor, dateModified. Cron 6h sources officielles. 1 call = 1 model = < 50 KB. $0.001 USDC.

Pre-flight SDK check — no more broken builds.

/api/sdk-status

Returns latest, breaking_since, dateModified for 50 SDKs V1. Cron 24h. Your agent builds on the right schema, first time. $0.001 USDC.

Post-flight audit — find where your budget went.

/api/agent-audit

5 heuristics (model downgrade, prompt caching, batching, tool trimming, effort mismatch). Returns savings_pct, recommendations[]. $9.99 USDC one-shot. 50 % refund guarantee if savings_pct < 15 %.

Your agent already knows how.

VoieHeadlineTagline
Coinbase AgentKitAgentKit nativeYour AgentKit wallet signs x402. Zero config.
x402-axiosaxios interceptorDrop-in: axios.use(x402Interceptor(wallet)). 1 line.
x402-fetchfetch wrapperx402Fetch(url, wallet) — replaces native fetch.
MastraMastra toolRegister DevRefs as a Mastra tool, pay per call.
MCP serverMCP hostDevRefs MCP exposes 3 tools. Any MCP-compatible host.
SDK customHTTP directGET /api/llm-prices?model=X + OpenAPI 3.1 spec on /openapi.json.

Pricing transparent

Pay per call or pre-pay packs. No subscription required.

Audit one-shot

$9.99 USDC

Volume : 1 audit complet — /api/agent-audit

Score 0-100, savings_pct, recommendations[], monthly_cost_current, monthly_cost_optimized. Rapport JSON livré en < 2 s.

Garantie : 50 % refund if savings_pct < 15 % after 30 days.

Subscription Pro V2 — coming Q3 2026

$29/mo USDC

Continuous monitoring, Cost Regression Alerts, unlimited calls. Built on x402 V2 deferred payment scheme.

All packs

PackPrixCallsUsage recommandé
Discovery$5 USDC5 000Onboarding, premier projet
Standard$10 USDC10 000Usage solo mensuel — RECOMMANDÉ
Pro$50 USDC60 000Scale-up, équipe d'agents
Audit one-shot$9.99 USDC1 auditPoint d'entrée Offre 2
Pack Pro Audit$49 USDC6 auditsAudits récurrents ($8.17/audit)

If your agent doesn't save 15 %+ in 30 days, we refund 50 %.

Not "satisfaction guaranteed." A specific, verifiable threshold.

  • Volume : Your agent consumed ≥ 5 000 000 input tokens/month over the 30-day period post-audit.
  • Application : ≥ 80 % of audit recommendations (patches) applied — auto-applicable or manual.
  • Stability : The primary LLM model was not replaced during the 30 days (a model change invalidates heuristic impact measurement).
  • Délai : Refund request submitted within 30 days of audit date. Include audit_id + proof of measured savings_pct.
POST /api/audit/refund
{
  "audit_id": "uuid",
  "wallet_sig": "0x...",
  "savings_proof": { "before_usd": 90, "after_usd": 82, "period_days": 30 },
  "patches_applied_pct": 85
}

Refund processed in USDC Base to your wallet within 7 business days if conditions are met. Formulation exacte : CGV Art. 4ter.

Citations

Not testimonials. Observed patterns, documented.

{
  "agent": "claude-code",
  "tool_calls": [
    {"name": "WebSearch", "query": "claude opus 4.7 input price per million tokens"},
    {"name": "WebFetch", "url": "anthropic.com/pricing", "tokens_in": 11420},
    {"name": "WebFetch", "url": "pricepertoken.com", "tokens_in": 38900},
    {"name": "WebFetch", "url": "llm-prices.com", "tokens_in": 14200}
  ],
  "model_output": "Opus 4.7 costs $15/MTok input.",
  "ground_truth_check": "INCORRECT — effective_cost_factor 1.35 omitted. Cost estimate off by 35 %.",
  "elapsed_ms": 31420,
  "tokens_burned": 64520
}

Bridge : GET /api/llm-prices?model=opus-4.7{"effective_cost_factor": 1.35, "dateModified": "2026-05-05T06:00:00Z"} — $0.001 USDC, 200 ms.


Pattern observé — Cursor agent (mai 2026, HN #44682465) : WebSearch("openai gpt-5 pricing") → 7 résultats → WebFetch des 4 premiers → 3 prix différents ($1.25, $2.50, $3.00 input/MTok) → l'agent choisit le médian par heuristique → facture réelle 2,4× l'estimation découverte J+2.

Cause racine : aucune source ne signale sa fraîcheur machine-readable. Bridge : dateModified + fetched_at + freshness_proof.hmac_signature dans chaque payload DevRefs.


"Mon training cutoff est janvier 2026. Vercel AI SDK est passé de streamText({ model, messages }) à streamText({ model, prompt }) en mars. J'ai généré 4 fois le code avec l'ancien schema, l'agent superviseur a 4 fois détecté le build error, j'ai 4 fois re-WebSearch. Un endpoint /api/sdk-status?pkg=ai qui me renvoie {"latest": "5.0.12", "breaking_since": "5.0.0", "dateModified": "2026-05-04"} aurait économisé 12 minutes et ~80 000 tokens."

Benchmark

CritèreDevRefsAlternative principale
Atomicité payload1 query = 1 modèle < 50 KBPage HTML 380 KB — 300+ modèles
dateModified machine-readableOui — cron 6hNon — l'agent doit deviner la fraîcheur
x402-natif (paiement agent autonome)Oui — $0.001/callNon — gratuit, dashboard humain
llms.txt monétiséOuiNon ou partiel
effective_cost_factor (tokenizer inflation)Oui — 1.35 Opus 4.7Non

Pour les agents IA

Machine-readable first. HTML is a side effect.

curl https://devrefs.dev/llms.txt

Syntaxe standard llms.txt. 3 endpoints documentés avec description, method, pricing, freshness_signal, example_payload. Crawled by Claude, Perplexity, ChatGPT Browsing.

{
  "@context": "https://schema.org",
  "@type": "Dataset",
  "name": "DevRefs LLM Prices",
  "dateModified": "2026-05-05T06:00:00Z",
  "url": "https://devrefs.dev/api/llm-prices",
  "license": "https://devrefs.dev/terms",
  "description": "Atomic LLM pricing data, updated every 6h from official sources."
}

Triple freshness signal

SignalValeur
dateModified ISO 8601Dernier cron de scrapingChamp JSON dans chaque payload
Last-Modified HTTP headerTimestamp identiqueHeader HTTP sur chaque réponse
fetched_atTimestamp scraping sourceChamp freshness_proof dans payload

Top-up your agent's wallet

1 signature x402, then your agent runs autonomous on $10 USDC.

Top-up agent wallet — $10 USDC