Overview
Token optimization across models, requests, and savings
— for the selected billing period.
API Status
Z
Metrics below are scoped to the selected billing period.
Rollout preparation status
Zur-lix is in controlled rollout preparation, not a public or global launch.
Savings figures
Informational estimates only: corpus-measured (typically ~16%, up to ~46–62% on favorable prompts), not guaranteed, and not a billing-grade figure unless promoted after go-live.
Billing
Billing is not active. No invoice and no charge is authorized. Billing readiness is operator-controlled.
API
Core API readiness is improving; streaming is not customer-contract-pinned yet.
Access
New external customers are not yet enabled; final activation is controlled by an explicit go-live decision.
Savings on your own traffic will vary and are not guaranteed. No billing action is authorized by this view.
Total Requests
—
Tokens Processed
—
Tokens Saved
—
Savings Rate
—
of input tokens recovered
Avg Latency
—
Est. Cost Savings
—
Total Customer Savings
—
Compression + Cache + Routing
Routing estimated — • actual: not yet available
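As a rough illustration of the Savings Rate tile ("of input tokens recovered"), the sketch below assumes the rate is tokens saved divided by original input tokens. The function name and the zero-traffic behavior are assumptions for illustration, not the dashboard's confirmed calculation.

```python
def savings_rate(original_input_tokens: int, tokens_saved: int) -> float:
    """Fraction of input tokens recovered.

    Assumption: a period with no input tokens reports 0.0 instead of
    raising a division error.
    """
    if original_input_tokens <= 0:
        return 0.0
    return tokens_saved / original_input_tokens
```

The result can be rendered as a percentage with `f"{savings_rate(original, saved):.2%}"`.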
Daily Token Savings
Requests by Model
OpenAI
Anthropic
Other
Model Breakdown
Route Breakdown
Direct Savings Sources
Decomposition of headline tokens_saved. Not added on top of headline savings.
Awaiting savings data…
Routing Decisions
Per-tenant routing-decision summary. Routing is fail-open; see safety note below.
Total Decisions
—
Swaps
—
No-Swap
—
Swap Rate
—
Total Saved
—
Avg / Swap
—
Top model pairs (swaps)
No swaps in this window.
Recent swaps
No recent swaps.
Routing summary is temporarily unavailable for this window. Last successful values are shown when available; otherwise zero placeholders are shown.
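The Routing Decisions tiles can be related to one another with a small sketch. It assumes that No-Swap is Total Decisions minus Swaps, that Swap Rate is Swaps divided by Total Decisions, and that Avg / Swap is Total Saved divided by Swaps. The class name, field names, and the unit of `total_saved` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RoutingSummary:
    total_decisions: int
    swaps: int
    total_saved: float  # unit (tokens or currency) is an assumption

    @property
    def no_swap(self) -> int:
        # Assumed: every decision is either a swap or a no-swap.
        return self.total_decisions - self.swaps

    @property
    def swap_rate(self) -> float:
        # Zero placeholder when the window has no decisions.
        return self.swaps / self.total_decisions if self.total_decisions else 0.0

    @property
    def avg_per_swap(self) -> float:
        # Zero placeholder when the window has no swaps.
        return self.total_saved / self.swaps if self.swaps else 0.0
```

Returning zero for an empty window mirrors the zero placeholders described above, so an unavailable summary does not raise a division error.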
Account Status
Read-only account/subscription/discount/billing-period summary. Runs no billing, no Stripe calls, no provider calls, and authorizes nothing.
Use the current live billing period when verifying new demo traffic. Future periods may correctly show no usage yet.
Choose a billing period (YYYY-MM) and click Load account status. This card reflects the selected period only. It does not create usage, generate billing, push savings, or call Stripe.
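A minimal sketch of how a YYYY-MM billing period might be validated, and how a future period (which may correctly show no usage yet) could be detected. The function names are hypothetical and are not taken from the Zur-lix codebase.

```python
import re
from datetime import date

# Four-digit year, hyphen, two-digit month 01-12.
_PERIOD_RE = re.compile(r"(\d{4})-(0[1-9]|1[0-2])")


def parse_billing_period(period: str) -> tuple[int, int]:
    """Return (year, month) for a YYYY-MM string, or raise ValueError."""
    match = _PERIOD_RE.fullmatch(period)
    if match is None:
        raise ValueError(f"expected YYYY-MM, got {period!r}")
    return int(match.group(1)), int(match.group(2))


def is_future_period(period: str, today: date) -> bool:
    """True when the period starts after the month containing `today`."""
    return parse_billing_period(period) > (today.year, today.month)
```

Passing `today` explicitly keeps the check deterministic; a caller would typically supply `date.today()`.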
Observed Savings
Informational estimate from Zur-lix optimization layers. Not billing-grade. Not invoiced. No customer charge.
Stripe / billing push: off. Raw prompt capture remains off. Realized savings vary by workload, prompt shape, model, and usage pattern. Authorizes nothing.
Click Load observed savings to view the latest informational estimate from the Zur-lix optimization layers. These numbers are informational only, not billing-grade, and do not authorize any customer charge or Stripe push. While waiting for measured optimization activity, this card shows "no current measured savings" rather than fake totals.
Billing Operator Tools
CLI remains source-of-truth; dashboard is visibility only.
Operator admin token
Used only in memory for read-only billing operator cards. Not stored. The token is never written to localStorage, sessionStorage, cookies, or the URL, and is never logged or rendered into result text.
Admin token not set
Savings Event Generation Preview pre-generation preview
Read-only. Does not call Stripe. Does not mutate DB. Preview only. Does not insert BillingSavingsEvent rows. Not an execute or generation surface.
CLI (python -m app.operator.savings_generation_preview_cli) remains source-of-truth; dashboard is visibility only.
Requires the shared Operator admin token above.
idle
Click Run generation preview to fetch live state.
Savings Billing Reconciliation completed periods
Read-only. Does not call Stripe. Does not mutate DB.
CLI (python -m app.operator.savings_reconciliation_readout_cli) remains the source-of-truth; this card is visibility only.
Requires the shared Operator admin token above.
Expected Stripe event id (optional)
idle
Click Run reconciliation readout to fetch live state.
Next Cycle Readiness future / pending periods
Read-only. Does not call Stripe. Does not mutate DB. Not an execute surface: readiness reports what is pending; the C3.7 CLI (python -m app.operator.savings_next_cycle_readiness_cli) remains the source-of-truth for command-line operation.
Requires the shared Operator admin token above.
idle
Click Run next-cycle readiness scan to fetch live state.
Recent Requests
| Time | Route | Model | Original | Compressed | Saved | Savings | Latency | Status |
|---|---|---|---|---|---|---|---|---|
| Loading... | | | | | | | | |
Provider Cache Savings Anthropic prompt caching · this billing period —
Cache read tokens
0
Cache creation tokens
0
Active input tokens
0
Cache hit ratio
0.00%
Cache savings
0.00%
Provider cache reads reduce active input tokens on repeated stable prompts.
Results depend on prompt reuse and provider cache eligibility.
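One plausible reading of the Cache hit ratio tile is sketched below. It assumes the ratio is cache read tokens divided by all input-side tokens (cache reads, cache creation, and active input). That definition and the function name are assumptions, since the card does not state its formula.

```python
def cache_hit_ratio(
    cache_read_tokens: int,
    cache_creation_tokens: int,
    active_input_tokens: int,
) -> float:
    """Share of input-side tokens that were served from the provider cache."""
    total = cache_read_tokens + cache_creation_tokens + active_input_tokens
    if total == 0:
        # No traffic yet: report zero rather than dividing by zero.
        return 0.0
    return cache_read_tokens / total
```

Formatting the result with `f"{ratio:.2%}"` gives the two-decimal percentage style used on the card.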
Cost Accuracy provider-aware estimate · this billing period —
Estimated cost saved
$0.00
vs no-caching baseline
0.00%
— breakdown —
Active input cost
$0.00
Cache write cost
$0.00
Cache read cost
$0.00
Output cost
$0.00
Estimated total this period
$0.00
Without caching, would have been
$0.00
Costs are estimated against the current published rate card. Cache writes are billed at ~125% of input rate; cache reads at ~10%. Unpriced models are excluded from totals to avoid overclaiming.
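The cost breakdown above can be approximated with the following sketch. It applies the stated multipliers (cache writes at about 1.25x the input rate, cache reads at about 0.10x) and assumes the no-caching baseline bills every input-side token at the plain input rate. The names, the per-million-token rate parameters, and the baseline definition are assumptions for illustration; real rates come from the provider's rate card.

```python
from dataclasses import dataclass

CACHE_WRITE_MULTIPLIER = 1.25  # ~125% of input rate, as stated on the card
CACHE_READ_MULTIPLIER = 0.10   # ~10% of input rate, as stated on the card


@dataclass
class Usage:
    active_input_tokens: int
    cache_creation_tokens: int
    cache_read_tokens: int
    output_tokens: int


def estimate_costs(usage: Usage, input_rate_per_mtok: float,
                   output_rate_per_mtok: float) -> dict:
    """Estimate period cost with caching and against a no-caching baseline."""
    in_rate = input_rate_per_mtok / 1_000_000
    out_rate = output_rate_per_mtok / 1_000_000

    active = usage.active_input_tokens * in_rate
    write = usage.cache_creation_tokens * in_rate * CACHE_WRITE_MULTIPLIER
    read = usage.cache_read_tokens * in_rate * CACHE_READ_MULTIPLIER
    output = usage.output_tokens * out_rate
    total = active + write + read + output

    # Assumed baseline: all input-side tokens billed at the plain input rate.
    all_input = (usage.active_input_tokens + usage.cache_creation_tokens
                 + usage.cache_read_tokens)
    baseline = all_input * in_rate + output
    saved = baseline - total

    return {
        "active_input_cost": active,
        "cache_write_cost": write,
        "cache_read_cost": read,
        "output_cost": output,
        "estimated_total": total,
        "without_caching": baseline,
        "estimated_saved": saved,
        "saved_fraction": saved / baseline if baseline else 0.0,
    }
```

Under these assumptions the estimated saving can be negative when a period contains cache writes but few cache reads, because writes cost more than ordinary input.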