🤖 feat: make Anthropic prompt cache TTL configurable #2293
Conversation
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 4b9d0ca9b9
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
@codex review Addressed the TTL propagation issue for Anthropic-routed gateway models and pushed a follow-up fix.
Codex Review: Didn't find any major issues. Can't wait for the next one!
FYI: I addressed the Codex review thread, pushed a follow-up fix, and reran failed checks.
Latest failing job: https://github.com/coder/mux/actions/runs/21827954163/job/62979120548
Force-pushed from e45bfa6 to 6fc00d9
Force-pushed from ad46c50 to 643e093
Add Anthropic cache TTL support (`5m` / `1h`) across provider options, cache strategy, stream pipeline, and fetch-level cache_control injection, with tests for TTL propagation.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$2.50`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=2.50 -->
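The core idea in this commit — an optional TTL on Anthropic's ephemeral cache marker — can be sketched as follows (the types and helper name here are illustrative, not the PR's exact code):

```typescript
// Anthropic cache markers are `{ type: "ephemeral" }`, optionally carrying
// a ttl of "5m" (the default) or "1h".
type AnthropicCacheTtl = "5m" | "1h";

interface CacheControl {
  type: "ephemeral";
  ttl?: AnthropicCacheTtl;
}

// Build a marker, adding ttl only when one is configured so the default
// request shape is unchanged when no TTL has been selected.
function buildCacheControl(cacheTtl?: AnthropicCacheTtl): CacheControl {
  return cacheTtl ? { type: "ephemeral", ttl: cacheTtl } : { type: "ephemeral" };
}
```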
Ensure stream-level system/tool cache markers honor configured Anthropic TTL even for Anthropic-routed gateway models whose providerOptions are not under the anthropic namespace.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$2.50`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=2.50 -->
Add a provider-level Anthropic setting in Providers to configure prompt cache TTL with defensive value guards and default clearing behavior. Also add a regression test ensuring persisted Anthropic cache TTL is propagated into send options from storage.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$3.19`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=3.19 -->
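The "defensive value guards" mentioned above can be as simple as a type predicate that rejects anything but the two known TTLs; the implementation below is a plausible sketch, not the PR's actual code:

```typescript
type AnthropicCacheTtl = "5m" | "1h";

// Only accept known TTL values from persisted settings; any other value
// (legacy strings, typos, null) falls back to the default behavior.
function isAnthropicCacheTtl(value: unknown): value is AnthropicCacheTtl {
  return value === "5m" || value === "1h";
}

// Clearing behavior: selecting the default stores no override at all.
function sanitizeCacheTtl(value: unknown): AnthropicCacheTtl | undefined {
  return isAnthropicCacheTtl(value) ? value : undefined;
}
```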
Move Anthropic prompt cache TTL persistence from frontend localStorage to backend providers.jsonc and make backend config authoritative for Anthropic-routed models.

- expose anthropic cacheTtl in provider config IPC schema
- surface valid cacheTtl in ProviderService getConfig with tests
- inject backend cacheTtl in ProviderModelFactory for anthropic and anthropic/* routes
- update Providers settings UI to read/write cacheTtl through provider config API

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
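After this change the TTL might be persisted in providers.jsonc roughly like this (the surrounding structure is an assumption; only the `cacheTtl` field name and file name come from this PR):

```jsonc
{
  "anthropic": {
    // Prompt cache TTL: omit for the 5m default, or set "1h".
    "cacheTtl": "1h"
  }
}
```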
Fix flaky/deterministic failure in tests/ipc/resumeStream.test.ts where collector1 could observe duplicate user messages.

The first collector started and sent a message immediately without waiting for the onChat subscription caught-up signal. Under CI timing, initial history replay can race with live append and emit the same user message twice.

- await collector1.waitForSubscription(10000) before sending
- add explanatory comment about replay/live race

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
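The replay/live race can be reproduced in miniature (this is an illustrative model, not mux's actual onChat implementation): a subscription delivers live messages immediately but replays history on a later microtask, so a message sent before the caught-up signal is observed twice.

```typescript
type Listener = (msg: string) => void;

class ChatModel {
  private history: string[] = [];
  private listeners: Listener[] = [];

  send(msg: string): void {
    this.history.push(msg);
    for (const l of this.listeners) l(msg); // live delivery
  }

  // Resolves once history replay has finished (the "caught-up" signal).
  subscribe(listener: Listener): Promise<void> {
    this.listeners.push(listener); // live delivery starts now
    return Promise.resolve().then(() => {
      for (const msg of this.history) listener(msg); // async history replay
    });
  }
}

// With waitForCaughtUp=false, "hello" lands in history before replay runs,
// so it is delivered both live and during replay (count 2). Waiting first
// drains the (empty) replay, so the message is delivered exactly once.
async function countDeliveries(waitForCaughtUp: boolean): Promise<number> {
  const chat = new ChatModel();
  const seen: string[] = [];
  const caughtUp = chat.subscribe((m) => seen.push(m));
  if (waitForCaughtUp) await caughtUp; // the fix: wait before sending
  chat.send("hello");
  await caughtUp;
  return seen.length;
}
```

In the real test, `await collector1.waitForSubscription(10000)` plays the role of awaiting the caught-up signal before the first send.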
Resolve post-rebase CI failures by removing a leftover `anthropicCacheTtlOverride` reference in `buildStreamRequestConfig`. During rebase conflict resolution, StreamManager kept HEAD's request-header-based API but retained one line from an older cache-ttl-override approach, which triggered lint/type errors.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
Force-pushed from f853444 to 27153a9
Simplify the Anthropic prompt cache TTL selector by removing the explicit "5 minutes" entry and keeping only:

- Default (5m)
- 1 hour

Also normalize persisted values in the selector so non-1h values map to default behavior.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
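The normalization described above reduces to a one-liner (the function name is hypothetical):

```typescript
// The selector persists either "1h" or nothing; any other stored value
// (e.g. a legacy explicit "5m") is treated as the default.
function normalizeCacheTtlSelection(persisted: string | null): "default" | "1h" {
  return persisted === "1h" ? "1h" : "default";
}
```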
@codex review
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: ab4a022aac
Address Codex review feedback by restoring TTL override propagation for Anthropic-routed requests that do not carry `providerOptions.anthropic.cacheControl` (notably OpenRouter Anthropic routes).

- thread optional `anthropicCacheTtlOverride` through StreamManager start/create/build request methods
- prefer explicit override over providerOptions-derived TTL when building cached system/tool cache markers
- pass effective mux Anthropic cache TTL from AIService into StreamManager.startStream

This ensures 1h TTL selections are applied consistently across stream cache breakpoints.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
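The precedence rule in the second bullet might look like this in isolation (a sketch under assumed names, not the PR's exact code):

```typescript
type AnthropicCacheTtl = "5m" | "1h";

// An explicit override threaded in from backend config wins over a TTL
// derived from providerOptions, which covers gateway routes (e.g. OpenRouter
// Anthropic) whose options do not live under the `anthropic` namespace.
function resolveEffectiveTtl(
  override: AnthropicCacheTtl | undefined,
  fromProviderOptions: AnthropicCacheTtl | undefined,
): AnthropicCacheTtl | undefined {
  return override ?? fromProviderOptions;
}
```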
@codex review Addressed the TTL propagation issue for Anthropic-routed OpenRouter requests and resolved the prior thread. Please re-review.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 77df6f7677
Address Codex review feedback by applying configured Anthropic cache TTL even when cache markers are already present in request payloads.

- add mergeAnthropicCacheControl() helper
- merge existing marker values and override ttl when cacheTtl is configured
- replace `??=` cache marker writes for tools, prompt providerOptions, and content parts

This ensures 1h TTL selection is reliably reflected in final Anthropic/OpenRouter HTTP payloads.

---

_Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$4.21`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=4.21 -->
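The difference between `??=` and merging is the crux here. A sketch of what `mergeAnthropicCacheControl()` might do (implementation details are assumptions; only the helper name comes from the commit):

```typescript
type AnthropicCacheTtl = "5m" | "1h";

interface CacheControl {
  type: "ephemeral";
  ttl?: AnthropicCacheTtl;
}

// `marker.cacheControl ??= { ... }` only writes when no marker exists, so a
// pre-existing marker keeps its old (or missing) ttl. Merging preserves the
// existing marker's fields but forces the configured ttl onto it.
function mergeAnthropicCacheControl(
  existing: CacheControl | undefined,
  cacheTtl: AnthropicCacheTtl | undefined,
): CacheControl {
  const base: CacheControl = existing ?? { type: "ephemeral" };
  return cacheTtl ? { ...base, ttl: cacheTtl } : base;
}
```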
@codex review Addressed the cache-marker TTL override feedback in providerModelFactory and resolved the thread. Please re-review.
Codex Review: Didn't find any major issues. Hooray!
Summary
Add configurable Anthropic prompt cache TTL support in Mux, allowing `5m` (default) or `1h` to flow through provider options, message/tool cache markers, and fetch-level request patching.

Background
Mux already applies Anthropic prompt caching automatically, but it always used `cache_control: { type: "ephemeral" }` with no TTL selection. Anthropic now supports explicit TTL values of `"5m"` and `"1h"`, with different write pricing. This change makes TTL selectable while preserving existing defaults.

Implementation
- Add `anthropic.cacheTtl` to `MuxProviderOptionsSchema` as `z.enum(["5m", "1h"]).nullish()`.
- `applyCacheControl(..., cacheTtl?)`, `createCachedSystemMessage(..., cacheTtl?)`, and `applyCacheControlToTools(..., cacheTtl?)` set TTL on `cacheControl` when `cacheTtl` is configured.
- `wrapFetchWithAnthropicCacheControl` injects TTL-aware raw `cache_control` on tools/messages for both direct Anthropic and mux-gateway Anthropic routes.
- TTL flows through `aiService -> prepareMessagesForProvider` and `streamManager` tool/system cache control application.
- `streamManager` gains typed guards: `isRecord`, `isAnthropicCacheTtl`, `getAnthropicCacheTtl`.
- Tests: `src/common/utils/ai/cacheStrategy.test.ts`, `src/common/utils/ai/providerOptions.test.ts`.

Validation
- `bun test src/common/utils/ai/cacheStrategy.test.ts src/common/utils/ai/providerOptions.test.ts`
- `make typecheck`
- `bun test src/node/services/streamManager.test.ts src/node/services/aiService.test.ts`
- `make static-check`

Risks
Low-to-medium risk in Anthropic request shaping paths (provider options + fetch wrapper), mitigated by unit coverage and full static checks. Default behavior remains unchanged when `cacheTtl` is unset.

Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$2.50`
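The fetch-level request patching described under Implementation can be illustrated with a miniature wrapper (hypothetical names and a simplified body shape; `wrapFetchWithAnthropicCacheControl`'s real logic handles more cases):

```typescript
type AnthropicCacheTtl = "5m" | "1h";
type FetchLike = (url: string, init?: { body?: string }) => Promise<unknown>;

// Wrap a fetch implementation so that, when a TTL is configured, every raw
// cache_control marker already present in the serialized request body gets
// the ttl stamped onto it before the request goes out.
function wrapFetchWithTtl(fetchImpl: FetchLike, ttl?: AnthropicCacheTtl): FetchLike {
  return (url, init) => {
    if (ttl && init?.body) {
      const payload = JSON.parse(init.body);
      for (const tool of payload.tools ?? []) {
        if (tool.cache_control) tool.cache_control.ttl = ttl;
      }
      for (const message of payload.messages ?? []) {
        for (const part of Array.isArray(message.content) ? message.content : []) {
          if (part.cache_control) part.cache_control.ttl = ttl;
        }
      }
      init = { ...init, body: JSON.stringify(payload) };
    }
    return fetchImpl(url, init);
  };
}
```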