Benchmark Database

How Much Does Each Task Actually Cost?

Token counts and per-run costs for common agent tasks. Filter by model to see how pricing changes, and compare monthly estimates at different run frequencies to plan your budget.

Tasks: 10 | Models: 6 | Frequencies: 1x to 50x/day

Tip: Switch the model dropdown to see the same task priced across different models. The "Recommended" option uses the cheapest model that handles each task well. That is usually what you want for routing configs.
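One way to carry the per-task model choices into a routing config is a plain lookup table. The sketch below assumes the model listed for each task on this page is the "Recommended" selection. The task keys, the `RECOMMENDED_MODEL` name, the `pick_model` helper, and the fallback default are all hypothetical, and the model labels are this page's display names rather than API model identifiers.

```python
# Hypothetical routing table: task key -> model listed for that task on this page.
# Labels are display names from the page, not API model IDs.
RECOMMENDED_MODEL = {
    "email_triage": "Sonnet 4",
    "pr_review": "Opus 4",
    "slack_channel_summary": "Sonnet 4",
    "code_generation": "Opus 4",
    "heartbeat_check_in": "Haiku 4",
    "support_ticket_routing": "Haiku 4",
    "meeting_notes_extraction": "Sonnet 4",
    "data_pipeline_run": "Sonnet 4",
    "research_summarization": "Sonnet 4",
    "security_alert_triage": "Haiku 4",
}


def pick_model(task: str, default: str = "Sonnet 4") -> str:
    """Return the model listed for a task, or a fallback for unknown tasks.

    The fallback value is an assumption for illustration only.
    """
    return RECOMMENDED_MODEL.get(task, default)


print(pick_model("pr_review"))
print(pick_model("heartbeat_check_in"))
```

Keeping the mapping as data rather than branching logic makes it easy to update when the recommended model for a task changes.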

| Task | Description | Input tokens | Output tokens | Model | Per run |
| --- | --- | --- | --- | --- | --- |
| Email triage | Read subject + body, classify priority, route to folder | 3.0k | 800 | Sonnet 4 | $0.021 |
| PR review | Read diff, check for bugs and style, post inline comments | 8.0k | 2.0k | Opus 4 | $0.270 |
| Slack channel summary | Scan last 50 messages, extract action items and decisions | 5.0k | 1.2k | Sonnet 4 | $0.033 |
| Code generation | Generate function from spec, includes tests and docstring | 2.5k | 3.5k | Opus 4 | $0.300 |
| Heartbeat check-in | Agent periodic scan: check inbox, calendar, pending tasks | 6.0k | 200 | Haiku 4 | $0.0056 |
| Support ticket routing | Classify ticket, assign priority, suggest response template | 1.5k | 300 | Haiku 4 | $0.0024 |
| Meeting notes extraction | Process transcript, extract decisions, assign action items | 12.0k | 2.0k | Sonnet 4 | $0.066 |
| Data pipeline run | Parse structured data, transform, generate summary report | 10.0k | 3.0k | Sonnet 4 | $0.075 |
| Research summarization | Read 3-5 articles, synthesize key findings, draft brief | 15.0k | 3.0k | Sonnet 4 | $0.090 |
| Security alert triage | Parse alert payload, check severity, decide escalation | 2.0k | 400 | Haiku 4 | $0.0032 |
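The per-run figures follow from token counts multiplied by per-token rates. The sketch below shows that arithmetic. The page does not list the rates themselves, so the values in `RATES_PER_MTOK` are assumptions, chosen because they reproduce the per-run costs listed here. The `per_run_cost` helper is likewise hypothetical.

```python
# Assumed USD rates per million tokens as (input, output).
# Not stated on this page: these are the values that reproduce its per-run costs.
RATES_PER_MTOK = {
    "Haiku 4": (0.80, 4.00),
    "Sonnet 4": (3.00, 15.00),
    "Opus 4": (15.00, 75.00),
}


def per_run_cost(model: str, tokens_in: int, tokens_out: int) -> float:
    """Cost in USD of one run: input and output tokens are billed at separate rates."""
    rate_in, rate_out = RATES_PER_MTOK[model]
    return (tokens_in * rate_in + tokens_out * rate_out) / 1_000_000


# Email triage: 3.0k in, 800 out on Sonnet 4
print(f"${per_run_cost('Sonnet 4', 3_000, 800):.3f}")
# PR review: 8.0k in, 2.0k out on Opus 4
print(f"${per_run_cost('Opus 4', 8_000, 2_000):.3f}")
# Support ticket routing: 1.5k in, 300 out on Haiku 4
print(f"${per_run_cost('Haiku 4', 1_500, 300):.4f}")
```

Because output tokens are priced higher than input tokens under these assumed rates, output-heavy tasks such as code generation cost more per run than input-heavy tasks with a similar total token count.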

Cheapest task: Support ticket routing ($0.0024/run)

Most expensive task: Code generation ($0.300/run)

All tasks at 5x/day: $130/mo
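The summary figures can be reproduced from the per-run costs alone. The sketch below copies those costs from this page and assumes a 30-day month, as the methodology does. The `PER_RUN_USD` and `monthly_cost` names are hypothetical.

```python
# Per-run costs in USD, copied from the benchmark listing on this page.
PER_RUN_USD = {
    "Email triage": 0.021,
    "PR review": 0.270,
    "Slack channel summary": 0.033,
    "Code generation": 0.300,
    "Heartbeat check-in": 0.0056,
    "Support ticket routing": 0.0024,
    "Meeting notes extraction": 0.066,
    "Data pipeline run": 0.075,
    "Research summarization": 0.090,
    "Security alert triage": 0.0032,
}


def monthly_cost(per_run: float, runs_per_day: int, days: int = 30) -> float:
    """Monthly cost in USD for one task at a fixed daily frequency."""
    return per_run * runs_per_day * days


cheapest = min(PER_RUN_USD, key=PER_RUN_USD.get)
most_expensive = max(PER_RUN_USD, key=PER_RUN_USD.get)
total = sum(monthly_cost(cost, runs_per_day=5) for cost in PER_RUN_USD.values())

print(cheapest)
print(most_expensive)
print(f"${total:.2f}/mo")  # about $129.93, which rounds to the $130/mo figure
```

Changing `runs_per_day` to any value in the 1x to 50x range gives the corresponding monthly estimate. Context loading overhead is not part of this total.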

Methodology

Token counts are estimates based on typical task payloads. Actual usage varies with content length, context window size, and prompt engineering. Costs use published API pricing as of March 2026. Monthly estimates assume 30 days. Context loading overhead (workspace files loaded per call) is not included. Use the calculator for full cost modeling with context and heartbeat overhead.

Ready to estimate your full setup?

These benchmarks show per-task costs. The calculator adds heartbeat overhead, context loading, sub-agents, and multi-channel multipliers for a complete monthly estimate.