fix: update Groq model metadata for openai/gpt-oss-120b by github-actions[bot] · Pull Request #576 · braintrustdata/braintrust-proxy

github-actions · 2026-05-19T23:49:30Z

fix: update Groq model metadata for openai/gpt-oss-120b

Closes #566

Source issue: #566

Summary

Field	Value
Provider	groq
Primary model	openai/gpt-oss-120b
Changed models	`openai/gpt-oss-120b`
Added models	None
Updated models	`openai/gpt-oss-120b`
Verification sources	1 2

Verified metadata

Model	Display name	Parent	Providers	Format	Flavor	Token limits	Pricing	Lifecycle
openai/gpt-oss-120b	OpenAI GPT-OSS (120B)		groq, together, baseten	openai	chat	input=131072, output=65536	in/out=0.15/0.6 per 1M; cache read=0.075 per 1M	reasoning=true

Verification notes

Verification

Official sources consulted

Groq models docs (https://console.groq.com/docs/models): Verified input_cost_per_mil_tokens=0.15, output_cost_per_mil_tokens=0.60, max_input_tokens=131072, max_output_tokens=65536. Model listed as production.
Together AI serverless models (https://docs.together.ai/docs/serverless-models): Verified input_cost_per_mil_tokens=0.15, output_cost_per_mil_tokens=0.60. Context window listed as 128,000. Max output tokens not published.

sync_models (LiteLLM) cross-check

The sync_models catalog only has azure_ai/gpt-oss-120b (no groq/ or together_ai/ prefixed entries). Comparison:

Field	Proposed Value	sync_models Value (`azure_ai/gpt-oss-120b`)	Source justifying deviation
`input_cost_per_mil_tokens`	0.15	0.15 (= $1.5e-07/token * 1M)	Matches — no deviation
`output_cost_per_mil_tokens`	0.60	0.60 (= $6e-07/token * 1M)	Matches — no deviation
`max_output_tokens`	65536	131072	Groq docs (https://console.groq.com/docs/models) explicitly list 65,536 max completion tokens. The sync_models value of 131,072 appears to be the azure_ai context window, not the actual max output. Groq's official docs are preferred.
`max_input_tokens`	131072	131072	Matches — no deviation

Fields not changed

input_cache_read_cost_per_mil_tokens (0.075): Retained from existing catalog. Neither Groq nor Together publish cache pricing for this model, and no official source contradicts the current value.
available_providers: Retained as ["groq", "together", "baseten"]. Baseten pricing not independently verified but the provider mapping already exists in the catalog.
displayName, format, flavor, reasoning: Retained unchanged from existing catalog entry — no official source suggests these should change.

Fields not applicable or not published

parent: Not applicable — this is a base model, not a dated snapshot or variant.
deprecated / deprecation_date: Not applicable — model is active/production on both Groq and Together.
multimodal: Not published by either provider for this model.
supported_regions: Not applicable — no vertex provider.
locations: Not applicable — no vertex provider.

sync_models vs proposed update

sync_models cross-check found differences. Official provider verification was used for the applied values, and sync_models discrepancies are listed below for review.

Model	Field	Proposed update	sync_models	sync_models source models
openai/gpt-oss-120b	max_input_tokens	131072	n/a	baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b	max_output_tokens	65536	n/a	baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b	input_cost_per_mil_tokens	0.15	0.1	baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b	output_cost_per_mil_tokens	0.6	0.5	baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b
openai/gpt-oss-120b	input_cache_read_cost_per_mil_tokens	0.075	n/a	baseten/openai/gpt-oss-120b, groq/openai/gpt-oss-120b, together_ai/openai/gpt-oss-120b

vercel · 2026-05-19T23:49:34Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
ai-proxy	Ready	Preview, Comment	May 20, 2026 2:01pm

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 16b813669e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "Codex (@codex) review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "Codex (@codex) address that feedback".

chatgpt-codex-connector · 2026-05-19T23:50:50Z

  "databricks-meta-llama-3-1-405b-instruct": ["databricks"],
  "databricks-meta-llama-3-1-8b-instruct": ["databricks"],
-  "openai/gpt-oss-120b": ["together", "groq", "baseten"],
+  "openai/gpt-oss-120b": ["groq","together","baseten"],


Preserve Together as first endpoint for gpt-oss-120b

Changing the provider order here alters runtime behavior in the API-key lookup fallback path: EdgeProxy uses getModelEndpointTypes(model) and then selects endpointTypes[0] as the secret type when lookup fails (packages/proxy/edge/index.ts lines 275-279). With this reorder, fallback requests for openai/gpt-oss-120b now default to groq instead of together, which can cause authentication/routing failures for users supplying a Together key during lookup outages or 5xx conditions.

Useful? React with 👍 / 👎.

fix: update Groq model metadata for openai/gpt-oss-120b

16b8136

github-actions Bot added the auto-sync label May 19, 2026

github-actions Bot requested review from Alex Z (CLowbrow), aswink, Caitlin Pinn (cpinn) and Erin McNulty (erin2722) May 19, 2026 23:49

github-actions Bot requested a review from Ken Jiang (knjiang) May 19, 2026 23:49

github-actions Bot mentioned this pull request May 19, 2026

[BOT ISSUE] Groq/Together: stale pricing for openai/gpt-oss-120b #566

Closed

vercel Bot deployed to Preview May 19, 2026 23:50 View deployment

chatgpt-codex-connector Bot reviewed May 19, 2026

View reviewed changes

undo model order change

c8bb095

Erin McNulty (erin2722) approved these changes May 20, 2026

View reviewed changes

Erin McNulty (erin2722) merged commit 73b8490 into main May 20, 2026
5 of 6 checks passed

vercel Bot deployed to Preview May 20, 2026 14:01 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: update Groq model metadata for openai/gpt-oss-120b#576

fix: update Groq model metadata for openai/gpt-oss-120b#576
Erin McNulty (erin2722) merged 2 commits into
mainfrom
chore/autofix-issue-566

github-actions Bot commented May 19, 2026

Uh oh!

vercel Bot commented May 19, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

github-actions Bot commented May 19, 2026

Verification

Official sources consulted

sync_models (LiteLLM) cross-check

Fields not changed

Fields not applicable or not published

Uh oh!

vercel Bot commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vercel Bot commented May 19, 2026 •

edited

Loading