Open-source LiteLLM plugin

Traffic shaping for LLM spend.

PaceLLM brings traffic shaping to your LLM budget. It throttles low-priority features to cheaper models, protects what matters, and smooths spend across the month.

How it works

The problem

Day 25. Everything goes dark.

Every LLM budget tool is one shared credit card. When it hits zero, your $50/hr chatbot dies alongside your $2/hr summarizer.

Without PaceLLM

Day 1$2,000 budget. All features on GPT-4o.

Day 12Engineer starts hammering the internal summarizer.

Day 18Spend rate up 40%. Alert fires. Nobody acts.

Day 25Budget gone. Chatbot down. CEO texts at 11pm.

With PaceLLM

Day 1$2,000 budget. PaceLLM observes traffic.

Day 12Summarizer spike. Auto-downgraded to Haiku.

Day 22Sunday. Idle chatbot budget flows to batch jobs.

Day 30Month ends. All features ran. $82 surplus.

What it does

Three things LiteLLM budgets can't do.

⏱

Pace over time

Learns your traffic. Weekdays get more budget than weekends. You never hit zero on day 25.

🛡

Protect what matters

Set priorities. When budget tightens, the summarizer downgrades. The chatbot stays on GPT-4o.

🔄

Move idle budget

Chatbot quiet on Sunday? That budget flows to other features. Monday it snaps back.

Dashboard

See where every dollar goes.

Status

On Track

42% spent · Day 13 of 30

Spent

$842

of $2,000 this month

Reallocations

$18.40 moved today

Forecast

$1,918

projected month-end

Daily Spend: Planned vs Actual

Planned On track Over 10%

Use Cases

Feature	Pri	Model	Spend	Status
customer_support	9	gpt-4o	$421	Normal
product_search	6	gpt-4o-mini	$268	Normal
agent_pipeline	8	gpt-4o	$98	Strict
internal_summary	3	haiku	$55	Downgraded

Pricing

Plugin is free. Dashboard is the product.

Plugin

$0 forever

Full pacing engine. Open-source. MIT.

All pacing & reallocation
Unlimited use cases
Local dashboard & CLI
Webhook alerts

Dashboard Pro

$49 /month

The intelligence layer LiteLLM doesn't have.

Everything in Plugin, plus:
Remote Config — change priorities from UI, no redeploy
Budget forecast — "you'll exhaust on Day 22"
What-if simulator — "what if traffic doubles?"
Savings report — "$340 saved this week, 0 quality impact"
Anomaly detection — spike alerts before they drain budget
COGS vs SG&A cost tagging for finance
90-day history + CSV export
Team access (up to 5 seats)