AICostSave

LLM Cost Estimator

Estimate LLM costs across models by comparing total workflow tokens and refinement passes, not just headline per-token rates.

The problem

If two models finish the job with different numbers of calls, the cheaper one isn’t always the one with the lower per-token rate.

Framework

  • Count billed tokens per call
  • Count calls per user action
  • Compare total billed tokens across models
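The three steps above can be sketched as a single cost function. All token counts, call counts, and per-token rates below are hypothetical, chosen only to illustrate the comparison:

```python
# Sketch of the framework: billed tokens per call, calls per user action,
# then total billed cost compared across models. Numbers are illustrative.

def total_cost(tokens_per_call, calls_per_action, rate_per_1k_tokens):
    """Total billed cost for one user action: tokens x calls x rate."""
    billed_tokens = tokens_per_call * calls_per_action
    return billed_tokens * rate_per_1k_tokens / 1000

# Hypothetical models: "premium" needs fewer calls at a higher rate.
premium = total_cost(tokens_per_call=1200, calls_per_action=1, rate_per_1k_tokens=0.03)
budget = total_cost(tokens_per_call=1200, calls_per_action=3, rate_per_1k_tokens=0.015)

print(f"premium: ${premium:.4f}, budget: ${budget:.4f}")
```

With these assumed numbers, the premium model costs less per action despite its higher rate, because it bills fewer total tokens.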

Why routing matters

Routing lets you use cheaper models for simpler steps and premium models only for final quality.
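A minimal routing policy can be a lookup from workflow step to model tier. The step names and model names here are illustrative assumptions, not a real API:

```python
# Minimal routing sketch: assign cheap models to simple steps and a
# premium model to the final quality pass. Names are hypothetical.

ROUTES = {
    "extract": "small-model",  # simple, structured task -> cheap tier
    "draft": "small-model",
    "polish": "large-model",   # final quality pass -> premium tier
}

def route(step):
    """Pick a model for a workflow step; default to the cheap tier."""
    return ROUTES.get(step, "small-model")

workflow = ["extract", "draft", "polish"]
print([route(step) for step in workflow])
```

Defaulting unknown steps to the cheap tier keeps the premium model reserved for the passes that actually need it.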

Example

Model A completes the task in 1 call; Model B needs 2. Even if B has a lower headline per-token rate, A can still cost less overall.
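The example works out as follows; the rates and token counts are hypothetical:

```python
# Worked version of the example. Model B's headline rate is lower,
# but it needs two calls, so it bills twice the tokens.

tokens_per_call = 1000  # assumed equal for both models

cost_a = 1 * tokens_per_call * 0.030 / 1000  # Model A: 1 call at $0.030/1k
cost_b = 2 * tokens_per_call * 0.020 / 1000  # Model B: 2 calls at $0.020/1k

print(cost_a, cost_b)  # with these numbers, A is cheaper per action
```

Model B's rate is a third lower, but doubling the calls doubles its billed tokens, so Model A wins on total cost.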

Next step

Optimize your workflow before choosing a model.