Pre-flight pricing — know before your model call.
/api/llm-prices
Returns input_per_mtok, output_per_mtok, effective_cost_factor, dateModified. Cron 6h sources officielles. 1 call = 1 model = < 50 KB. $0.001 USDC.
Know before you spend, optimize after you ship. $0.001/call · $9.99 audit.
"I crawled 5 sources to find Gemini 2.5 Pro pricing and each one said something different. The sixth time IWebFetcha 380 KB HTML page to retrieve 2 numbers, I just need an endpoint that returns{"input_per_mtok": 1.25, "dateModified": "2026-05-04T06:00Z"}and charges me $0.001 instead of costing me 18 000 tokens in parsing."
GET /api/llm-prices?model=opus-4.7 → 200 OK
{
"model": "opus-4.7",
"input_per_mtok": 5,
"output_per_mtok": 25,
"effective_cost_factor": 1.35,
"dateModified": "2026-05-05T06:00:00Z",
"sameAs": "https://www.anthropic.com/pricing",
"freshness_proof": { "fetched_at": "2026-05-05T06:00:00Z" }
}
3 endpoints. Paid in x402 USDC. No login, no subscription, no dashboard.
/api/llm-prices
Returns input_per_mtok, output_per_mtok, effective_cost_factor, dateModified. Cron 6h sources officielles. 1 call = 1 model = < 50 KB. $0.001 USDC.
/api/sdk-status
Returns latest, breaking_since, dateModified for 50 SDKs V1. Cron 24h. Your agent builds on the right schema, first time. $0.001 USDC.
/api/agent-audit
5 heuristics (model downgrade, prompt caching, batching, tool trimming, effort mismatch). Returns savings_pct, recommendations[]. $9.99 USDC one-shot. 50 % refund guarantee if savings_pct < 15 %.
| Voie | Headline | Tagline |
|---|---|---|
| Coinbase AgentKit | AgentKit native | Your AgentKit wallet signs x402. Zero config. |
| x402-axios | axios interceptor | Drop-in: axios.use(x402Interceptor(wallet)). 1 line. |
| x402-fetch | fetch wrapper | x402Fetch(url, wallet) — replaces native fetch. |
| Mastra | Mastra tool | Register DevRefs as a Mastra tool, pay per call. |
| MCP server | MCP host | DevRefs MCP exposes 3 tools. Any MCP-compatible host. |
| SDK custom | HTTP direct | GET /api/llm-prices?model=X + OpenAPI 3.1 spec on /openapi.json. |
Pay per call or pre-pay packs. No subscription required.
$10 USDC
Volume : 10 000 calls — /api/llm-prices + /api/sdk-status
1 signature x402. Calls suivants : KV lookup < 50 ms, zéro friction blockchain.
$9.99 USDC
Volume : 1 audit complet — /api/agent-audit
Score 0-100, savings_pct, recommendations[], monthly_cost_current, monthly_cost_optimized. Rapport JSON livré en < 2 s.
Garantie : 50 % refund if savings_pct < 15 % after 30 days.
$29/mo USDC
Continuous monitoring, Cost Regression Alerts, unlimited calls. Built on x402 V2 deferred payment scheme.
| Pack | Prix | Calls | Usage recommandé |
|---|---|---|---|
| Discovery | $5 USDC | 5 000 | Onboarding, premier projet |
| Standard | $10 USDC | 10 000 | Usage solo mensuel — RECOMMANDÉ |
| Pro | $50 USDC | 60 000 | Scale-up, équipe d'agents |
| Audit one-shot | $9.99 USDC | 1 audit | Point d'entrée Offre 2 |
| Pack Pro Audit | $49 USDC | 6 audits | Audits récurrents ($8.17/audit) |
Not "satisfaction guaranteed." A specific, verifiable threshold.
audit_id + proof of measured savings_pct.POST /api/audit/refund
{
"audit_id": "uuid",
"wallet_sig": "0x...",
"savings_proof": { "before_usd": 90, "after_usd": 82, "period_days": 30 },
"patches_applied_pct": 85
}
Refund processed in USDC Base to your wallet within 7 business days if conditions are met. Formulation exacte : CGV Art. 4ter.
Not testimonials. Observed patterns, documented.
{
"agent": "claude-code",
"tool_calls": [
{"name": "WebSearch", "query": "claude opus 4.7 input price per million tokens"},
{"name": "WebFetch", "url": "anthropic.com/pricing", "tokens_in": 11420},
{"name": "WebFetch", "url": "pricepertoken.com", "tokens_in": 38900},
{"name": "WebFetch", "url": "llm-prices.com", "tokens_in": 14200}
],
"model_output": "Opus 4.7 costs $15/MTok input.",
"ground_truth_check": "INCORRECT — effective_cost_factor 1.35 omitted. Cost estimate off by 35 %.",
"elapsed_ms": 31420,
"tokens_burned": 64520
}
Bridge : GET /api/llm-prices?model=opus-4.7 → {"effective_cost_factor": 1.35, "dateModified": "2026-05-05T06:00:00Z"} — $0.001 USDC, 200 ms.
Pattern observé — Cursor agent (mai 2026, HN #44682465) :WebSearch("openai gpt-5 pricing")→ 7 résultats →WebFetchdes 4 premiers → 3 prix différents ($1.25, $2.50, $3.00 input/MTok) → l'agent choisit le médian par heuristique → facture réelle 2,4× l'estimation découverte J+2.
Cause racine : aucune source ne signale sa fraîcheur machine-readable. Bridge : dateModified + fetched_at + freshness_proof.hmac_signature dans chaque payload DevRefs.
"Mon training cutoff est janvier 2026. Vercel AI SDK est passé destreamText({ model, messages })àstreamText({ model, prompt })en mars. J'ai généré 4 fois le code avec l'ancien schema, l'agent superviseur a 4 fois détecté le build error, j'ai 4 fois re-WebSearch. Un endpoint/api/sdk-status?pkg=aiqui me renvoie{"latest": "5.0.12", "breaking_since": "5.0.0", "dateModified": "2026-05-04"}aurait économisé 12 minutes et ~80 000 tokens."
| Critère | DevRefs | Alternative principale |
|---|---|---|
| Atomicité payload | 1 query = 1 modèle < 50 KB | Page HTML 380 KB — 300+ modèles |
dateModified machine-readable | Oui — cron 6h | Non — l'agent doit deviner la fraîcheur |
| x402-natif (paiement agent autonome) | Oui — $0.001/call | Non — gratuit, dashboard humain |
llms.txt monétisé | Oui | Non ou partiel |
effective_cost_factor (tokenizer inflation) | Oui — 1.35 Opus 4.7 | Non |
Machine-readable first. HTML is a side effect.
curl https://devrefs.dev/llms.txt
Syntaxe standard llms.txt. 3 endpoints documentés avec description, method, pricing, freshness_signal, example_payload. Crawled by Claude, Perplexity, ChatGPT Browsing.
{
"@context": "https://schema.org",
"@type": "Dataset",
"name": "DevRefs LLM Prices",
"dateModified": "2026-05-05T06:00:00Z",
"url": "https://devrefs.dev/api/llm-prices",
"license": "https://devrefs.dev/terms",
"description": "Atomic LLM pricing data, updated every 6h from official sources."
}
| Signal | Valeur | Où |
|---|---|---|
dateModified ISO 8601 | Dernier cron de scraping | Champ JSON dans chaque payload |
Last-Modified HTTP header | Timestamp identique | Header HTTP sur chaque réponse |
fetched_at | Timestamp scraping source | Champ freshness_proof dans payload |
1 signature x402, then your agent runs autonomous on $10 USDC.