Skip to content

Commit 7915980

Browse files
committed
Lower prefill queue ms threshold
1 parent 7b8712c commit 7915980

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

web/src/server/free-session/fireworks-health.ts

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ export type FireworksHealth = 'healthy' | 'degraded' | 'unhealthy'
1818
/** Degrade once median prefill-queue latency crosses this bound. Strict by
1919
* design — a 1s queue on top of ~1s prefill already means users feel 2s+
2020
* before first token. */
21-
export const PREFILL_QUEUE_DEGRADED_MS = 1_000
21+
export const PREFILL_QUEUE_DEGRADED_MS = 600
2222

2323
/** Leading indicator of load — responds instantly to memory pressure, while
2424
* prefill-queue p50 is a lagging window statistic. Degrading here lets us

0 commit comments

Comments
 (0)