✨ New Plugin Alert ✨ SleekRank is now available with €50 launch discount

AI Chatbot With Token Usage Reporting for WordPress

SleekAI captures input and output token counts on every turn and groups them in the admin dashboard. Filter by bot, model, originating page, or user role to find token-heavy patterns before they hit your monthly bill. Pair with cost tracking for the full picture.

♾️ Lifetime License available

Provider dashboards stop where WordPress context begins

OpenAI's usage dashboard tells you the project consumed 14 million tokens last month. It doesn't tell you which chatbot, which page, which user, or which conversation pattern drove that volume. Anthropic's console is the same. Google's console is the same. The provider has zero visibility into your WordPress setup, so the breakdown stops at API key granularity, which is almost never the granularity you need to make decisions.

SleekAI captures token counts at the same moment the provider returns them and writes them into the conversation log alongside WordPress-specific metadata: which chatbot fired, which template page the visitor was on, which post type, which user role, which display condition matched. The admin dashboard slices token usage by any of these dimensions. Suddenly the question 'which page is eating my tokens' has a one-click answer instead of being a multi-hour spreadsheet investigation.

Generic chatbot tools either don't expose token counts at all, or expose them in a single aggregate that's worse than the provider dashboard because it lacks even the raw split between input and output tokens. SleekAI logs both, calculates the ratio, and shows that long-context bots with short replies tip heavily toward input cost while reasoning-heavy bots tip toward output cost. Two different optimization paths, only visible with the right granularity.

Workflow

How token reporting captures usage

1. Capture from API response

Every provider response includes input and output token counts in its usage object. SleekAI reads them during normal response parsing and stores them as part of the conversation row. The numbers match the provider's own usage dashboard exactly because they come from the same source.

2. Tag with WordPress context

Each row also stores the chatbot ID, originating page URL, post type, user role, and any display-condition tags. This metadata is what makes the dashboard slicing useful: tokens grouped by bot, by page, by post type, instead of one undifferentiated total.

3. Display in the dashboard

The admin dashboard shows token usage charts grouped by your dimension of choice. Input vs output split is visible per bot. Trend lines compare current period to previous. Forecasts project month-end based on week-to-date pace. Drilldowns from chart to specific conversation are one click.

4. Export or alert as needed

CSV export brings raw token data into spreadsheets or BI tools. Webhook alerts fire when usage thresholds are crossed (per-bot daily, monthly, or cumulative). The reporting is layered so casual admins see charts and serious analysts get the raw rows.
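
The capture, tagging, and grouping steps above can be sketched in a few lines of Python. This is an illustration only, not SleekAI's actual code: the `TurnRow` fields and the helper names `read_usage`, `log_turn`, and `group_tokens` are hypothetical. The usage field names are the ones OpenAI's Chat Completions API (`prompt_tokens`, `completion_tokens`) and Anthropic's Messages API (`input_tokens`, `output_tokens`) return.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class TurnRow:
    # Hypothetical conversation-log row: token counts plus WordPress context.
    bot_id: str
    page_url: str
    user_role: str
    input_tokens: int
    output_tokens: int


def read_usage(response: dict) -> tuple:
    """Read (input, output) token counts from a decoded provider response."""
    usage = response.get("usage", {})
    # Anthropic uses input_tokens/output_tokens; OpenAI Chat Completions
    # uses prompt_tokens/completion_tokens.
    input_tokens = usage.get("input_tokens", usage.get("prompt_tokens", 0))
    output_tokens = usage.get("output_tokens", usage.get("completion_tokens", 0))
    return input_tokens, output_tokens


def log_turn(response: dict, bot_id: str, page_url: str, user_role: str) -> TurnRow:
    """Combine the provider's counts with the context of the request."""
    input_tokens, output_tokens = read_usage(response)
    return TurnRow(bot_id, page_url, user_role, input_tokens, output_tokens)


def group_tokens(rows, dimension: str) -> dict:
    """Sum (input, output) tokens per value of one dimension, e.g. 'bot_id'."""
    totals = defaultdict(lambda: [0, 0])
    for row in rows:
        key = getattr(row, dimension)
        totals[key][0] += row.input_tokens
        totals[key][1] += row.output_tokens
    return {key: tuple(pair) for key, pair in totals.items()}


rows = [
    log_turn({"usage": {"prompt_tokens": 1800, "completion_tokens": 120}},
             "support", "/pricing", "subscriber"),
    log_turn({"usage": {"input_tokens": 2100, "output_tokens": 150}},
             "support", "/docs", "guest"),
    log_turn({"usage": {"input_tokens": 400, "output_tokens": 600}},
             "sales", "/pricing", "guest"),
]
by_bot = group_tokens(rows, "bot_id")
by_page = group_tokens(rows, "page_url")
```

Because every row carries its own context, the same list answers "which bot" and "which page" without a second pass over the provider's data.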

Comparison

Generic chatbot vs SleekAI for token reporting

Generic chatbot

  • No token visibility at all, only message counts
  • Aggregates tokens at API key level, not per bot
  • Cannot break down by page or user context
  • Hides input-vs-output split behind a single total
  • Has no historical comparison or trend reporting

SleekAI chatbot

  • Input and output tokens logged separately per turn
  • Group by bot, model, page, post type, or user role
  • Trend reports across days, weeks, and months
  • CSV export for spreadsheet or BI tool analysis
  • Spot prompt-caching candidates from token patterns

Features

What SleekAI gives you for Chatbots With Token Usage Reporting

Input vs output split

Every turn logs input tokens and output tokens separately. A bot heavy on input (long system prompts, lots of variables) has different optimization paths than one heavy on output (long reasoning, verbose replies). Seeing the split makes the right intervention obvious instead of guessing.
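
As a rough illustration of how the split can drive a decision, the sketch below labels a bot by its input share. The function names and the 0.8 cutoff are assumptions chosen for the example, not values taken from SleekAI.

```python
def input_share(input_tokens: int, output_tokens: int) -> float:
    """Fraction of all tokens that were input tokens (0.0 when there is no data)."""
    total = input_tokens + output_tokens
    return input_tokens / total if total else 0.0


def classify(input_tokens: int, output_tokens: int, cutoff: float = 0.8) -> str:
    """Label a bot by where its tokens go. The cutoff is an assumed value."""
    if input_tokens + output_tokens == 0:
        return "no data"
    share = input_share(input_tokens, output_tokens)
    if share >= cutoff:
        return "input-heavy"   # look at prompt length, variables, caching
    if share <= 1 - cutoff:
        return "output-heavy"  # look at reply length and reasoning settings
    return "balanced"
```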

WordPress-aware grouping

Group token usage by chatbot ID, originating page, post type, user role, or membership level. The provider dashboards can never offer this granularity because they don't know which page the API call came from. SleekAI's log brings WordPress context into token analysis.

Trend and forecast

Compare this week to last week, this month to last. Spot a chatbot whose token usage has crept up 40% since a prompt change. Forecast end-of-month volume based on week-to-date pace. Webhooks fire when usage spikes past a configured threshold so anomalies get caught early.
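
The arithmetic behind a trend comparison and a pace-based forecast is simple. The sketch below assumes a plain linear extrapolation from the average daily pace so far; the source does not say which forecasting method the dashboard uses, and the function names are hypothetical.

```python
def percent_change(previous_tokens: int, current_tokens: int) -> float:
    """Change from the previous period to the current one, in percent."""
    return (current_tokens - previous_tokens) * 100 / previous_tokens


def forecast_month_end(tokens_so_far: int, days_elapsed: int, days_in_month: int) -> float:
    """Project the month-end total by extending the average daily pace so far."""
    return tokens_so_far * days_in_month / days_elapsed


# A bot that went from 1.0M to 1.4M tokens week over week is up 40%.
weekly_change = percent_change(1_000_000, 1_400_000)

# 3.5M tokens in the first 7 days of a 30-day month projects to 15M.
projected = forecast_month_end(3_500_000, 7, 30)
```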

Use cases

What good token reporting unlocks

Prompt caching candidates

If roughly 90% of a bot's input tokens are identical on every turn (a long static system instruction), it's a prime candidate for Anthropic prompt caching, which cuts the cost of the cached portion by roughly 90% after the first call. SleekAI surfaces this pattern automatically when a bot's input ratio is above a threshold.

Model right-sizing

A bot using GPT-4o that consistently produces short factual replies may not need the full model. Token reports combined with quality sampling can drive a switch to GPT-4o mini, cutting token cost by roughly 95% with negligible quality loss for the use case.

Growth headroom planning

Knowing the per-conversation token cost makes scaling decisions defensible. If your chatbot averages 5k tokens per chat and you expect traffic to grow 10x, you can model the full token cost before the launch instead of finding out from the invoice.
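
A back-of-the-envelope version of that model, using the 5k tokens per chat from the example. The chat volume and the blended price per million tokens are placeholder assumptions; substitute your own figures.

```python
def monthly_token_cost(chats_per_month: int, tokens_per_chat: int,
                       price_per_million_tokens: float) -> float:
    """Estimated monthly spend for a given traffic level."""
    return chats_per_month * tokens_per_chat * price_per_million_tokens / 1_000_000


TOKENS_PER_CHAT = 5_000   # from the example above
CHATS_PER_MONTH = 2_000   # assumption: current traffic
BLENDED_PRICE = 1.00      # assumption: price per 1M tokens, input and output mixed

current_cost = monthly_token_cost(CHATS_PER_MONTH, TOKENS_PER_CHAT, BLENDED_PRICE)
cost_at_10x = monthly_token_cost(CHATS_PER_MONTH * 10, TOKENS_PER_CHAT, BLENDED_PRICE)
```

With these placeholder numbers, 10 million tokens a month becomes 100 million, and the spend scales by the same factor.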

The bigger picture

Why token reporting is the prerequisite for optimization

You cannot optimize what you cannot measure. Most teams running WordPress chatbots have zero visibility into which bot, which page, or which prompt pattern consumes the bulk of their token budget. They tweak the system instruction blindly and hope cost goes down next month.

Sometimes it does, often it doesn't, and either way no one understands why. With granular token reporting, the optimization process becomes a tight loop. Change the system prompt, watch the token average per turn shift in the dashboard within hours, decide whether the change paid off, iterate.

The prompt-caching opportunity is the clearest example. Anthropic and a few other providers bill cached, static portions of the input at a steep discount. If 80% of your input is a long fixed policy section, caching it cuts the cost of that section by roughly 90% on every later call that hits the cache, which works out to roughly 70% off the total input bill.
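
To see how the discount on the cached portion translates into the total input bill, here is a small worked calculation. It assumes cache reads are billed at 10% of the normal input price (the roughly 90% discount mentioned above) and ignores the one-time surcharge for writing the cache; the function names are hypothetical.

```python
CACHE_READ_MULTIPLIER = 0.10  # assumption: cached input billed at 10% of base price


def input_cost_factor(cached_fraction: float,
                      cache_read_multiplier: float = CACHE_READ_MULTIPLIER) -> float:
    """Input cost on a cache hit, relative to the same call without caching."""
    uncached_fraction = 1.0 - cached_fraction
    return uncached_fraction + cached_fraction * cache_read_multiplier


def input_savings_percent(cached_fraction: float) -> float:
    """Percentage saved on total input cost for a given cacheable share."""
    return (1.0 - input_cost_factor(cached_fraction)) * 100


# 80% of the input is a fixed policy section:
# 0.20 at full price + 0.80 at one tenth of the price = 0.28 of the original cost.
savings_80 = input_savings_percent(0.80)
```

The discount applies only to the cached share, so an 80% static prompt saves about 72% of input cost on cache hits; the full 90% is reached only if the entire input is cacheable.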

Without token reporting, this opportunity is invisible. With it, the dashboard surfaces a 'high cache-ratio candidate' tag on bots that fit the pattern. The same logic applies to model right-sizing, conversation length tuning, variable injection scope, and dozens of other smaller decisions.

Granular data turns chatbot operation from gut feel into a discipline.

Questions

Common questions about SleekAI for Chatbots With Token Usage Reporting

Every provider's API response includes input and output token counts. SleekAI reads them directly from the response payload, so the numbers match what the provider's own dashboard reports. There's no estimation or local counting. Accuracy is end-to-end identical to the source of truth.

 

Yes. OpenRouter returns token counts plus the underlying model that handled the request, so SleekAI logs both. Reports can group by the actual model (e.g. claude-3-5-sonnet via OpenRouter) instead of just bucketing everything as 'OpenRouter', which would mask provider-level patterns.

 

Yes. The conversation log lists each turn with its input tokens, output tokens, total, and the model that handled it. Drill into a specific conversation to see how token usage built up across the exchange. Useful for diagnosing why one chat used 50k tokens while a similar one used 5k.

 

No. The token counts are already in the response payload SleekAI is parsing. Writing them to the conversation row is part of the same database insert that logs everything else. There's no extra request and no extra round-trip to the database. Latency is identical to a bot without reporting.

 

Yes. CSV export from the admin dashboard includes input tokens, output tokens, model, bot, page, user, and timestamp per turn. WP-CLI has a matching command for scripted exports. Common usage: pull a monthly CSV, load into Google Sheets or BigQuery, generate custom reports the dashboard doesn't cover.
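
Once exported, the rows can be processed with any CSV tooling. The sketch below uses Python's standard `csv` module to total tokens per model. The column names in the sample are assumptions based on the fields listed above; check the header row of a real export before relying on them.

```python
import csv
import io
from collections import Counter

# Hypothetical export: the column names are assumed, not taken from SleekAI.
sample_csv = "\n".join([
    "timestamp,bot,model,page,user,input_tokens,output_tokens",
    "2025-01-03T10:00:00Z,support,gpt-4o,/pricing,guest,1800,120",
    "2025-01-03T10:05:00Z,support,gpt-4o,/docs,guest,2100,150",
    "2025-01-03T10:07:00Z,sales,gpt-4o-mini,/pricing,subscriber,400,600",
])


def tokens_per_model(csv_text: str) -> dict:
    """Total tokens (input plus output) per model across all exported turns."""
    totals = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        totals[row["model"]] += int(row["input_tokens"]) + int(row["output_tokens"])
    return dict(totals)


per_model = tokens_per_model(sample_csv)
```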

 

Anthropic returns cached input tokens separately from regular input tokens in the response. SleekAI logs them as a distinct column. Reports show cache hit ratios per bot, which is essential for measuring whether prompt caching is paying off. High cache ratios mean the optimization is working.
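
For reference, a cache hit ratio can be derived from the usage object Anthropic returns, where `cache_read_input_tokens` and `cache_creation_input_tokens` are reported alongside the uncached `input_tokens`. The helper below is a hypothetical sketch of that calculation, not SleekAI's own formula.

```python
def cache_hit_ratio(usage: dict) -> float:
    """Share of all input tokens that were served from the prompt cache."""
    cache_read = usage.get("cache_read_input_tokens") or 0
    cache_write = usage.get("cache_creation_input_tokens") or 0
    uncached = usage.get("input_tokens") or 0
    total_input = cache_read + cache_write + uncached
    return cache_read / total_input if total_input else 0.0


# First call writes the cache (no hits yet); a later call reads from it.
first_call = {"input_tokens": 200, "cache_creation_input_tokens": 1800,
              "cache_read_input_tokens": 0}
later_call = {"input_tokens": 200, "cache_creation_input_tokens": 0,
              "cache_read_input_tokens": 1800}
```

Here the first call has a ratio of 0 and the later call a ratio of 0.9, which is the pattern to expect when caching is working.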

 

Yes. Configure a daily or monthly token threshold per bot. When usage crosses it, SleekAI fires a webhook or sends an admin email. Useful for catching runaway prompts (a bot configuration accident, a prompt injection that triggers verbose replies) before they generate days of unintended spend.
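
The threshold check itself amounts to comparing each bot's running total against its configured limit. The sketch below shows that comparison and builds the kind of payload an alert might carry; the payload shape and the function name are assumptions, and the actual webhook format may differ.

```python
def threshold_alerts(daily_totals: dict, thresholds: dict) -> list:
    """Return one alert payload per bot whose usage exceeds its limit."""
    alerts = []
    for bot in sorted(daily_totals):
        limit = thresholds.get(bot)
        used = daily_totals[bot]
        # Bots without a configured threshold never alert.
        if limit is not None and used > limit:
            alerts.append({"bot": bot, "used": used, "limit": limit})
    return alerts


daily_totals = {"support": 120_000, "sales": 30_000, "docs": 80_000}
thresholds = {"support": 100_000, "sales": 50_000}
alerts = threshold_alerts(daily_totals, thresholds)
```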

 

No. All token data stays in your WordPress database. SleekAI does not collect telemetry on token usage. The plugin's license check pings home for validity but transmits no usage data. If you need to export for analysis, you do it explicitly via CSV or WP-CLI.

 

Pricing

More than 1,000
happy customers

Explore our flexible licensing options tailored to your needs. Upgrade your license anytime to access more features, or opt for a lifetime license for ongoing value, including lifetime updates and lifetime support. Our hassle-free upgrade process ensures that our platform can grow with you, starting from whichever plan you choose.

Starter

€79

EUR

per year

  • 3 websites
  • 1 year of updates
  • 1 year of support

Pro

€149

EUR

per year

  • Unlimited websites
  • 1 year of updates
  • 1 year of support

Lifetime ♾️

Most popular

€249

EUR

once

  • Unlimited websites
  • Lifetime updates
  • Lifetime support

...or get the Bundle Deal
and save €250 🎁

The Bundle (unlimited sites)

Pay once, own it forever

Elevate your WordPress site with our exclusive plugin bundle that includes all of our premium plugins in one package. Enjoy lifetime updates and lifetime support. Save significantly compared to buying plugins individually.

What’s included

  • SleekAI

  • SleekByte

  • SleekMotion

  • SleekPixel

  • SleekRank

  • SleekView