← All Resources

CalculatorApril 13, 2026

AI Token Billing Calculator + Stripe Mapping Sheet

A practical spec and copyable-sheet blueprint to turn provider token costs into per‑1K client prices by model and token_type, with CSV exports for Stripe Pricing Plans and Meter dimensions, plus example meter_event JSON and Make/n8n posting configs. Built for solo AI service operators running usage billing without employees.

From EpisodeBill AI Usage Without Losing Margin: Stripe Meters + Token Billing

Contents↓ Download PDF

Input: [Currency]Input: [Global Markup Multiplier]Input: [Per‑Model Markup Overrides]Input: [Provider Base Costs per 1K Tokens]Input: [Overhead per 1K Tokens]Input: [Monthly Free Credits]Input: [Tiering Strategy]Input: [Round to $ Increment]Input: [Stripe Event + Dimension Keys]Input: [Client Usage Forecast]Output: [Per‑1K Client Price Table + Margin]Output: [CSV — Pricing Plan Rate Card]Output: [CSV — Meter Dimension Values]Output: [Example billing.meter_event JSON]Output: [No‑Code Posting — n8n + Make Specs]Output: [Invoice Preview + Margin Checks]Reference: [Vercel AI Gateway Path]Go‑Live Checklist (copy into your build)

Price AI usage without eating margin. Enter your per‑1K provider token costs, choose markup and credits, and the sheet outputs client rates by model and token_type (input/output/cache), plus ready‑to‑copy CSV tabs for Stripe Pricing Plan rate cards and Meter dimension values. It also includes example billing.meter_event JSON and Make/n8n posting specs for self‑metering or gateway setups. Note: Stripe Token Billing is in private preview as of April 2026—this sheet works with standard Meters using model and token_type dimensions, and maps cleanly if/when Token Billing is enabled.

Input: [Currency]

Why: Currency for all computed prices.
Default: USD.
Valid: Any ISO 4217 (e.g., USD, EUR, GBP).

↑ Back to top

Input: [Global Markup Multiplier]

Why: Global multiplier applied to provider costs to protect margin.
Default: 2.00 (100% markup).
Valid: 0.00–10.00.
Formula usage: effective_unit_price = base_cost * global_markup (before per‑model overrides and overhead).

↑ Back to top

Input: [Per‑Model Markup Overrides]

Why: Some models deserve different markups.
Default: blank (inherits global).
Valid: 0.00–10.00 per row.
Table columns: [model_id] [token_type] [override_markup].
Preloaded token_types: input, output, cache (optional).

↑ Back to top

Input: [Provider Base Costs per 1K Tokens]

Why: Your hard costs per 1K tokens by model and token_type.
Defaults: blank — fill from your provider’s pricing page.
Valid range: 0.0000–100.0000 (currency per 1K tokens).
Table columns:
- [provider] (e.g., OpenAI, Anthropic, Google, Mistral)
- [model_id] (exact identifier your app/gateway emits)
- [input_cost_per_1k]
- [output_cost_per_1k]
- [cache_cost_per_1k] (optional; set 0 if unused)
Note: If a provider prices per token differently (e.g., input vs output), split across the appropriate columns.

↑ Back to top

Input: [Overhead per 1K Tokens]

Why: Capture non‑token platform costs (e.g., gateway per‑request fees) as a per‑1K token equivalent.
Options:
1. Direct per‑1K entry: [overhead_per_1k] (default 0.00).
2. Derived from request cost: [gateway_cost_per_request] / [avg_tokens_per_request] * 1000.
Valid ranges:
- overhead_per_1k: 0.00–10.00
- gateway_cost_per_request: 0.000–5.000
- avg_tokens_per_request: 50–200,000

↑ Back to top

Input: [Monthly Free Credits]

Why: Offer monthly included tokens that expire at period end.
Default: 0.
Valid: 0–10,000,000 (in 1K‑token units for convenience).
Two fields:
- [monthly_free_1k_tokens_total] (applies across all models/types)
- Optional per‑meter credit: [monthly_free_1k_tokens_by_meter] table.
Stripe mapping: Configure as a Pricing Plan service action that grants units each period.

↑ Back to top

Input: [Tiering Strategy]

Why: Decide how customers advance through rates.
Default: volume.
Valid: flat | volume | graduated.
Tier table columns (when graduated or volume):
- [tier_start_1k] (inclusive)
- [tier_end_1k] (inclusive; use blank for ∞)
- [discount_pct] (0–100; applied after markup and overhead)

↑ Back to top

Input: [Round to $ Increment]

Why: Friendly price endings and fewer rounding artifacts.
Default: $0.01.
Valid: $0.01, $0.05, $0.10.
Formula: rounded = ROUND(client_unit_price / increment) * increment.

↑ Back to top

Input: [Stripe Event + Dimension Keys]

Why: Ensure your usage events land in the right customer and meter.
Defaults:
- [event_name]: ai_tokens
- [dimension_key_model]: model
- [dimension_key_token_type]: token_type
- [customer_id_key]: stripe_customer_id
Valid: non‑empty strings.
Notes: Keep keys consistent with your Meter’s "Dimensions" and payload mapping in Stripe Dashboard.

↑ Back to top

Input: [Client Usage Forecast]

Optional forecast to sanity‑check margin before shipping rates.
Fields (per client):
- [expected_input_1k_tokens]
- [expected_output_1k_tokens]
- [expected_cache_1k_tokens] (if used)
Defaults: 0.
Valid: 0–100,000,000 per month.

↑ Back to top

Output: [Per‑1K Client Price Table + Margin]

For each [model_id, token_type] row:

base = base_cost_per_1k(model_id, token_type)
markup = override_markup_or_global(model_id, token_type)
price_before_overhead = base * markup
unit_overhead = overhead_per_1k (direct) OR (gateway_cost_per_request / avg_tokens_per_request) * 1000
pre_tier_unit = price_before_overhead + unit_overhead
If tiering = flat: client_unit = pre_tier_unit
- If volume/graduated: apply [discount_pct] for the tier where usage lands
client_unit_rounded = round_to_increment(client_unit)
gross_margin_per_1k = client_unit_rounded - (base + unit_overhead)
gross_margin_pct = gross_margin_per_1k / client_unit_rounded

Result table columns:

[provider] [model_id] [token_type] [client_price_per_1k] [gross_margin_pct]

↑ Back to top

Output: [CSV — Pricing Plan Rate Card]

CSV header:

model_id,token_type,currency,unit_amount_decimal,price_type,tier_start_1k,tier_end_1k,rate_card_name

Row rules:

unit_amount_decimal = client_price_per_1k
price_type = flat|volume|graduated
tier_start_1k/tier_end_1k required for volume/graduated; blank for flat
rate_card_name optional (e.g., "H0 Token Rates v1")

Example rows:

gpt-4o-mini,input,USD,0.3200,volume,0,1000,H0 Token Rates v1
gpt-4o-mini,output,USD,0.4800,volume,0,1000,H0 Token Rates v1
claude-3-haiku,input,USD,0.1500,volume,0,1000,H0 Token Rates v1

Import: Create a Pricing Plan in Stripe, add a rate card, and paste rows or use the API to create prices that reference your Meter.

↑ Back to top

Output: [CSV — Meter Dimension Values]

CSV header:

dimension_key,allowed_value

Populate with every distinct value your gateway/app will emit so UI filters and alerts work.

Example rows:

model,gpt-4o-mini
model,claude-3-haiku
token_type,input
token_type,output

Dashboard mapping: Meter → Advanced → Dimensions → add model and token_type; paste these values where supported or maintain the list for audits.

↑ Back to top

Output: [Example billing.meter_event JSON]

Send a meter event per usage chunk (gateway or your app). Use a restricted key with Billing meter events: Write.

HTTP request (v2 synchronous):

POST https://api.stripe.com/v2/billing/meter_events
Headers:
  Authorization: Bearer &lt;restricted_key&gt;
  Idempotency-Key: &lt;uuid-per-client-interval-or-message&gt;
Body (application/json):
{
  &quot;event_name&quot;: &quot;ai_tokens&quot;,
  &quot;payload&quot;: {
    &quot;stripe_customer_id&quot;: &quot;cus_123&quot;,
    &quot;value&quot;: 523,
    &quot;timestamp&quot;: &quot;2026-04-13T17:05:00Z&quot;,
    &quot;model&quot;: &quot;gpt-4o-mini&quot;,
    &quot;token_type&quot;: &quot;input&quot;
  }
}

Notes:

value = tokens (not 1K units). Stripe aggregates to your Meter.
Send a second event for token_type":"output" when applicable.
Backfill: include historical timestamp within your billing window.

↑ Back to top

Output: [No‑Code Posting — n8n + Make Specs]

n8n Stripe node → "Create Meter Event"

Authentication: Restricted API Key (scoped to Meter Events Write)
Fields:
- event_name: ai_tokens
- payload.stripe_customer_id: from CRM lookup
- payload.value: aggregated tokens (integer)
- payload.timestamp: ISO8601
- payload.model: model id
- payload.token_type: input|output|cache
Idempotency: set Idempotency-Key header via node options to a stable hash of [customer_id, model, token_type, yyyy-mm-dd-hh].
Schedule: cron hourly or daily per client.

Make (HTTP module) — POST to /v2/billing/meter_events

Method: POST
Headers: Authorization Bearer <restricted_key>; Idempotency-Key <hash>; Content-Type application/json
Body: same as JSON example
Error handling: retry 429/5xx with exponential backoff.

↑ Back to top

Output: [Invoice Preview + Margin Checks]

Show customers that included units apply and what’s next on invoice.

Per client calculator:

net_billable_1k = max(0, (input_1k * price_in + output_1k * price_out + cache_1k * price_cache) - free_credits_1k * blended_unit_price)
projected_invoice = platform_fee_min + net_billable_1k
gross_margin_estimate = projected_invoice - (input_1k * base_in + output_1k * base_out + cache_1k * base_cache) - (total_1k * overhead_per_1k)

Tips:

Use the Pricing Plan "Upcoming invoice" preview in Dashboard to validate numbers against your sheet before go‑live.
Enable Smart Retries in Billing → Revenue recovery to harden cash collection (no code).

↑ Back to top

Reference: [Vercel AI Gateway Path]

Use when you want automatic two‑event metering (input/output) per AI call.

Headers to set on each request through the gateway:

stripe-customer-id: <cus_123>
authorization: Bearer <restricted_key_with_meter_events_write>

Behavior:

Gateway emits two billing.meter_event payloads to Stripe with model and token_type populated.
Your Meter must define Dimensions model and token_type to match.

Sheet mapping:

Keep model_id values identical to what the gateway returns (e.g., gpt-4o-mini).
Prices from the Rate Card CSV will align 1:1 with model + token_type.

↑ Back to top

Go‑Live Checklist (copy into your build)

Meter: aggregation = sum; Dimensions: model, token_type.
Pricing Plan: create rate card, attach Meter, choose price type and tiers.
Service action: grant [monthly_free_1k_tokens_total] units per period (optional).
Alerts: configure 50/80/100% usage webhooks; route to Slack.
Keys: use a restricted API key limited to Meter Events Write.
Preview: validate a test customer’s upcoming invoice before inviting real clients.
Preview status: Token Billing is private preview (April 2026). If not enabled, this sheet still works via standard Meters + Pricing Plans.

↑ Back to top