Skip to content

fix: update Groq model metadata for openai/gpt-oss-120b#576

Merged
Erin McNulty (erin2722) merged 2 commits into
mainfrom
chore/autofix-issue-566
May 20, 2026
Merged

fix: update Groq model metadata for openai/gpt-oss-120b#576
Erin McNulty (erin2722) merged 2 commits into
mainfrom
chore/autofix-issue-566

Conversation

@github-actions
Copy link
Copy Markdown
Contributor

fix: update Groq model metadata for openai/gpt-oss-120b

Closes #566

Source issue: #566

Summary

Field Value
Provider groq
Primary model openai/gpt-oss-120b
Changed models openai/gpt-oss-120b
Added models None
Updated models openai/gpt-oss-120b
Verification sources 1
2

Verified metadata

Model Display name Parent Providers Format Flavor Token limits Pricing Lifecycle
openai/gpt-oss-120b OpenAI GPT-OSS (120B) groq, together, baseten openai chat input=131072, output=65536 in/out=0.15/0.6 per 1M; cache read=0.075 per 1M reasoning=true

Verification notes

Verification

Official sources consulted

  • Groq models docs (https://console.groq.com/docs/models): Verified input_cost_per_mil_tokens=0.15, output_cost_per_mil_tokens=0.60, max_input_tokens=131072, max_output_tokens=65536. Model listed as production.
  • Together AI serverless models (https://docs.together.ai/docs/serverless-models): Verified input_cost_per_mil_tokens=0.15, output_cost_per_mil_tokens=0.60. Context window listed as 128,000. Max output tokens not published.

sync_models (LiteLLM) cross-check

The sync_models catalog only has azure_ai/gpt-oss-120b (no groq/ or together_ai/ prefixed entries). Comparison:

Field Proposed Value sync_models Value (azure_ai/gpt-oss-120b) Source justifying deviation
input_cost_per_mil_tokens 0.15 0.15 (= $1.5e-07/token * 1M) Matches — no deviation
output_cost_per_mil_tokens 0.60 0.60 (= $6e-07/token * 1M) Matches — no deviation
max_output_tokens 65536 131072 Groq docs (https://console.groq.com/docs/models) explicitly list 65,536 max completion tokens. The sync_models value of 131,072 appears to be the azure_ai context window, not the actual max output. Groq's official docs are preferred.
max_input_tokens 131072 131072 Matches — no deviation

Fields not changed

  • input_cache_read_cost_per_mil_tokens (0.075): Retained from existing catalog. Neither Groq nor Together publish cache pricing for this model, and no official source contradicts the current value.
  • available_providers: Retained as ["groq", "together", "baseten"]. Baseten pricing not independently verified but the provider mapping already exists in the catalog.
  • displayName, format, flavor, reasoning: Retained unchanged from existing catalog entry — no official source suggests these should change.

Fields not applicable or not published

  • parent: Not applicable — this is a base model, not a dated snapshot or variant.
  • deprecated / deprecation_date: Not applicable — model is active/production on both Groq and Together.
  • multimodal: Not published by either provider for this model.
  • supported_regions: Not applicable — no vertex provider.
  • locations: Not applicable — no vertex provider.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model Field Proposed update sync_models sync_models source models
openai/gpt-oss-120b max_input_tokens 131072 n/a baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b max_output_tokens 65536 n/a baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b input_cost_per_mil_tokens 0.15 0.1 baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b output_cost_per_mil_tokens 0.6 0.5 baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b input_cache_read_cost_per_mil_tokens 0.075 n/a baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b

@vercel
Copy link
Copy Markdown

vercel Bot commented May 19, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
ai-proxy Ready Ready Preview, Comment May 20, 2026 2:01pm

Request Review

Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 16b813669e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

Comment thread packages/proxy/schema/index.ts Outdated
"databricks-meta-llama-3-1-405b-instruct": ["databricks"],
"databricks-meta-llama-3-1-8b-instruct": ["databricks"],
"openai/gpt-oss-120b": ["together", "groq", "baseten"],
"openai/gpt-oss-120b": ["groq","together","baseten"],
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Preserve Together as first endpoint for gpt-oss-120b

Changing the provider order here alters runtime behavior in the API-key lookup fallback path: EdgeProxy uses getModelEndpointTypes(model) and then selects endpointTypes[0] as the secret type when lookup fails (packages/proxy/edge/index.ts lines 275-279). With this reorder, fallback requests for openai/gpt-oss-120b now default to groq instead of together, which can cause authentication/routing failures for users supplying a Together key during lookup outages or 5xx conditions.

Useful? React with 👍 / 👎.

@erin2722 Erin McNulty (erin2722) merged commit 73b8490 into main May 20, 2026
5 of 6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[BOT ISSUE] Groq/Together: stale pricing for openai/gpt-oss-120b

2 participants