Prompt cache cost estimation #4
Pull request overview
This PR implements prompt cache simulation to estimate cost savings when using prompt caching across multi-step agent interactions. The implementation introduces a token cache tracking system that models how prompts would be cached and reused across conversation steps.
Key changes:
- New TokenCache class to track cached tokens across conversation steps with a growing prefix model (see the sketch after this list)
- New simulateCacheSavings function that estimates costs when prompt caching is enabled
- Updated pricing module to support cache creation and cache read costs separately
- Enhanced HTML report to display estimated cache savings
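To make the growing prefix model concrete, here is a minimal TypeScript sketch of how such a cache tracker could work. Only the TokenCache name comes from the changed files; the StepUsage shape, the field names, and the recordStep method are illustrative assumptions, not the actual implementation in lib/token-cache.ts.

```ts
// Hypothetical sketch of a growing-prefix token cache tracker.
interface StepUsage {
  promptTokens: number;     // total tokens in this step's prompt
  completionTokens: number; // tokens generated in this step
}

class TokenCache {
  // Tokens already written to the cache (the shared, growing prefix).
  private cachedPrefixTokens = 0;

  // Record one agent step and return how its prompt tokens split between
  // cache reads and cache writes under the growing-prefix assumption.
  recordStep(step: StepUsage): { cacheRead: number; cacheWrite: number } {
    const cacheRead = Math.min(step.promptTokens, this.cachedPrefixTokens);
    const cacheWrite = step.promptTokens - cacheRead;
    // The whole prompt becomes the cached prefix available to the next step.
    this.cachedPrefixTokens = step.promptTokens;
    return { cacheRead, cacheWrite };
  }
}
```

The key assumption is that each step's prompt starts with the previous step's full prompt, so previously seen tokens are read from the cache and only the new suffix is written.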
Reviewed changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| lib/token-cache.ts | New class implementing token cache tracking with cost calculation |
| lib/utils.ts | Added cache simulation logic and moved buildAgentPrompt from test-discovery |
| lib/utils.test.ts | Comprehensive test coverage for TokenCache and simulateCacheSavings |
| lib/pricing.ts | Refactored to use inline types and added cacheCreationInputTokenCost support |
| lib/pricing.test.ts | Updated tests with proper type usage |
| lib/report.ts | Added cacheSimulation field to metadata |
| lib/report-template.ts | Enhanced UI to display cache simulation results |
| index.ts | Integrated cache simulation into main flow with console output |
| results/result-2025-12-12-23-02-22-anthropic-claude-haiku-4.5.json | Example result file with cache simulation data |
| AGENTS.md | Updated test command from test:self to bun test |
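To illustrate the pricing side, the following hypothetical per-step cost comparison shows how cache creation and cache read rates could feed into the savings estimate. Only cacheCreationInputTokenCost is named in the file summary above; the other Pricing fields and the estimateStepCosts helper are assumptions for this sketch.

```ts
// Illustrative per-step cost comparison between cached and uncached pricing.
interface Pricing {
  inputTokenCost: number;              // $ per uncached input token
  cacheCreationInputTokenCost: number; // $ per token written to the cache
  cacheReadInputTokenCost: number;     // $ per token read from the cache
}

function estimateStepCosts(
  pricing: Pricing,
  split: { cacheRead: number; cacheWrite: number },
): { withCache: number; withoutCache: number } {
  // Without caching, every prompt token is billed at the regular input rate.
  const withoutCache =
    (split.cacheRead + split.cacheWrite) * pricing.inputTokenCost;
  // With caching, the reused prefix is billed at the cache-read rate and the
  // new suffix at the cache-creation rate.
  const withCache =
    split.cacheRead * pricing.cacheReadInputTokenCost +
    split.cacheWrite * pricing.cacheCreationInputTokenCost;
  return { withCache, withoutCache };
}
```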
When I started the PR I initially thought we'd let the user choose whether to enable prompt caching for a run, but there is no central toggle for it in the AI SDK, so supporting it for every provider is a bit of a pain. Since prompt cache "logic" is fairly basic (the system prompt plus previous prompts are cached), we can quite easily make a "good enough" estimation that works for any provider where the AI SDK exposes cache creation and cache read token costs.
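As a rough usage sketch building on the TokenCache and estimateStepCosts examples above, the whole estimation can be driven by replaying the recorded steps of a run. The simulateRun name and loop below are illustrative, not the actual simulateCacheSavings implementation.

```ts
// Hypothetical driver: replay a run's steps and total the cost with and
// without caching, under the growing-prefix assumption described above.
function simulateRun(pricing: Pricing, steps: StepUsage[]) {
  const cache = new TokenCache();
  let withCache = 0;
  let withoutCache = 0;
  for (const step of steps) {
    const split = cache.recordStep(step); // cache read/write split for this step
    const cost = estimateStepCosts(pricing, split);
    withCache += cost.withCache;
    withoutCache += cost.withoutCache;
  }
  return { withCache, withoutCache, estimatedSavings: withoutCache - withCache };
}
```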