fix: update OpenAI model metadata for openai/gpt-oss-120b #580

Merged
Caitlin Pinn (cpinn) merged 2 commits into main from chore/autofix-issue-537
May 20, 2026

Conversation

@github-actions
Contributor

fix: update OpenAI model metadata for openai/gpt-oss-120b

Closes #537

Source issue: #537

Summary

| Field | Value |
| --- | --- |
| Provider | openai |
| Primary model | openai/gpt-oss-120b |
| Changed models | openai/gpt-oss-120b |
| Added models | None |
| Updated models | openai/gpt-oss-120b |
| Verification sources | 1, 2 |

Verified metadata

| Model | Display name | Parent | Providers | Format | Flavor | Token limits | Pricing | Lifecycle |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| openai/gpt-oss-120b | OpenAI GPT-OSS (120B) | | groq, together, baseten | openai | chat | input=131072, output=32766 | in/out=0.15/0.6 per 1M; cache read=0.075 per 1M | reasoning=true |
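
For readers who prefer code to tables, the verified metadata above could be written as a catalog entry roughly like the sketch below. The `ModelSpec` interface and the `estimateCostUsd` helper are hypothetical and exist only for illustration; the field names are the ones used in this PR's tables, but the real schema in the repository may differ.

```typescript
// Hypothetical entry shape; field names follow this PR's tables,
// but the actual schema type in the repo may differ.
interface ModelSpec {
  format: string;
  flavor: string;
  displayName: string;
  reasoning: boolean;
  max_input_tokens: number;
  max_output_tokens: number;
  input_cost_per_mil_tokens: number;
  output_cost_per_mil_tokens: number;
  input_cache_read_cost_per_mil_tokens: number;
  available_providers: string[];
}

// Values copied from the "Verified metadata" table.
const gptOss120b: ModelSpec = {
  format: "openai",
  flavor: "chat",
  displayName: "OpenAI GPT-OSS (120B)",
  reasoning: true,
  max_input_tokens: 131072,
  max_output_tokens: 32766,
  input_cost_per_mil_tokens: 0.15,
  output_cost_per_mil_tokens: 0.6,
  input_cache_read_cost_per_mil_tokens: 0.075,
  available_providers: ["groq", "together", "baseten"],
};

// Illustrative helper (not part of the PR): prices are quoted per 1M tokens,
// so token counts are divided by 1e6 before multiplying.
function estimateCostUsd(
  spec: ModelSpec,
  inputTokens: number,
  outputTokens: number,
): number {
  return (
    (inputTokens / 1e6) * spec.input_cost_per_mil_tokens +
    (outputTokens / 1e6) * spec.output_cost_per_mil_tokens
  );
}
```

Under these prices, one million input tokens plus one million output tokens would cost about $0.75.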

Verification notes

Verification

Sources and fields verified

| Source | Fields verified |
| --- | --- |
| https://www.together.ai/pricing | input_cost_per_mil_tokens ($0.15), output_cost_per_mil_tokens ($0.60) |
| https://groq.com/pricing/ | input_cost_per_mil_tokens ($0.15), output_cost_per_mil_tokens ($0.60), input_cache_read_cost_per_mil_tokens ($0.075) |
| https://huggingface.co/openai/gpt-oss-120b | reasoning (true; the model supports chain-of-thought) |
| https://docs.baseten.co/inference/model-apis/overview | available_providers (confirms Baseten hosts the model on shared inference) |

sync_models cross-check

The sync_models catalog has a single entry azure_ai/gpt-oss-120b (no Together/Groq-prefixed entry exists).

| Field | Proposed value | sync_models value (azure_ai/gpt-oss-120b) | Match? | Notes |
| --- | --- | --- | --- | --- |
| input_cost_per_mil_tokens | 0.15 | 0.15 (1.5e-07 × 1M) | Yes | |
| output_cost_per_mil_tokens | 0.60 | 0.60 (6e-07 × 1M) | Yes | |
| input_cache_read_cost_per_mil_tokens | 0.075 | n/a (not in sync_models) | | Verified on Groq prompt caching page |
| max_input_tokens | 131072 (unchanged) | 131072 | Yes | |
| max_output_tokens | 32766 (unchanged) | 131072 | No | sync_models shows 131072, but this likely reflects the context window, not a verified output limit. Keeping the existing catalog value (32766) since no official provider doc explicitly states a higher max output. |
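
The "× 1M" conversions in the table can be reproduced with a small sketch like the one below. It assumes, as the table suggests, that sync_models stores prices per token while the catalog stores them per million tokens. `toPerMillion` is a hypothetical helper name, not a function from the repository.

```typescript
// Convert a per-token price (the unit the sync_models column appears to use)
// into a per-1M-token price (the unit used by the catalog fields).
function toPerMillion(perTokenCost: number): number {
  // Round to 6 decimal places to absorb floating-point noise from the multiply.
  return Number((perTokenCost * 1e6).toFixed(6));
}

// Per-token values quoted in the cross-check table.
const inputPerMil = toPerMillion(1.5e-7);
const outputPerMil = toPerMillion(6e-7);

console.log(inputPerMil, outputPerMil);
```

The rounding step is a design choice for comparing against hand-entered catalog values with strict equality; a tolerance-based comparison would work equally well.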

Fields not changed

  • format, flavor, displayName, reasoning, max_input_tokens, max_output_tokens, available_providers, input_cache_read_cost_per_mil_tokens — all remain at current catalog values; only the two pricing fields are updated.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

| Model | Field | Proposed update | sync_models | sync_models source models |
| --- | --- | --- | --- | --- |
| openai/gpt-oss-120b | max_input_tokens | 131072 | n/a | baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b |
| openai/gpt-oss-120b | max_output_tokens | 32766 | n/a | baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b |
| openai/gpt-oss-120b | input_cost_per_mil_tokens | 0.15 | 0.1 | baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b |
| openai/gpt-oss-120b | output_cost_per_mil_tokens | 0.6 | 0.5 | baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b |
| openai/gpt-oss-120b | input_cache_read_cost_per_mil_tokens | 0.075 | n/a | baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b |
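
A cross-check of this kind could be sketched as follows. This is an illustration only, not the automation that produced the table: `crossCheck` and `Discrepancy` are hypothetical names, and the sketch assumes a field is reported whenever sync_models has no value for it ("n/a") or has a different value.

```typescript
interface Discrepancy {
  field: string;
  proposed: number;
  syncModels: number | undefined; // undefined stands in for "n/a"
}

// Report every proposed field whose sync_models value is missing or different.
function crossCheck(
  proposed: Record<string, number>,
  syncModels: Record<string, number | undefined>,
): Discrepancy[] {
  const discrepancies: Discrepancy[] = [];
  for (const [field, value] of Object.entries(proposed)) {
    const other = syncModels[field];
    if (other === undefined || other !== value) {
      discrepancies.push({ field, proposed: value, syncModels: other });
    }
  }
  return discrepancies;
}

// Values taken from the table above.
const proposedUpdate: Record<string, number> = {
  max_input_tokens: 131072,
  max_output_tokens: 32766,
  input_cost_per_mil_tokens: 0.15,
  output_cost_per_mil_tokens: 0.6,
  input_cache_read_cost_per_mil_tokens: 0.075,
};
const syncModelsValues: Record<string, number | undefined> = {
  input_cost_per_mil_tokens: 0.1,
  output_cost_per_mil_tokens: 0.5,
};

const discrepancies = crossCheck(proposedUpdate, syncModelsValues);
```

With the table's values, all five fields are reported: three because sync_models has no value and two because the prices differ.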

@vercel

vercel Bot commented May 20, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| ai-proxy | Ready | Preview, Comment | May 20, 2026 3:50pm |

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: fcfb46b13a

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

Comment thread on packages/proxy/schema/index.ts (Outdated)

      "databricks-meta-llama-3-1-405b-instruct": ["databricks"],
      "databricks-meta-llama-3-1-8b-instruct": ["databricks"],
    - "openai/gpt-oss-120b": ["together", "groq", "baseten"],
    + "openai/gpt-oss-120b": ["groq","together","baseten"],

P2: Keep Together first for fallback key routing

Changing the provider order for openai/gpt-oss-120b alters runtime behavior when API-secret lookup fails: EdgeProxyV1 falls back to endpointTypes[0] (packages/proxy/edge/index.ts lines 275-279), so this change now forces fallback traffic to Groq instead of Together. In that failure mode, requests authenticated with a Together key will be sent to Groq and fail authentication, which is a regression from the previous ordering.
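
The failure mode described in this comment can be sketched as below. This is a hypothetical simplification, not the actual code from packages/proxy/edge/index.ts (which is not shown in this thread); `pickProvider` and `secretProvider` are made-up names, and the only behavior taken from the review is the fallback to `endpointTypes[0]`.

```typescript
type Provider = "together" | "groq" | "baseten";

// Hypothetical routing helper illustrating why list order matters.
function pickProvider(
  endpointTypes: Provider[],
  secretProvider?: Provider,
): Provider {
  // If the API-secret lookup succeeded, route to the provider it belongs to.
  if (secretProvider !== undefined) {
    return secretProvider;
  }
  // Otherwise fall back to the first listed provider.
  const fallback = endpointTypes[0];
  if (fallback === undefined) {
    throw new Error("no providers configured for this model");
  }
  return fallback;
}

const togetherFirst: Provider[] = ["together", "groq", "baseten"];
const groqFirst: Provider[] = ["groq", "together", "baseten"];

// With no secret resolved, the first list entry decides the destination.
const before = pickProvider(togetherFirst);
const after = pickProvider(groqFirst);
```

In this sketch `before` is `"together"` and `after` is `"groq"`, which mirrors the regression the reviewer describes: a request carrying a Together key would be sent to Groq whenever the lookup fails.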

@cpinn Caitlin Pinn (cpinn) merged commit 654cefc into main May 20, 2026
6 of 7 checks passed
Development

Successfully merging this pull request may close these issues.

[BOT ISSUE] fix: update stale pricing for openai/gpt-oss-120b
