feat: add recovery middleware to handle panic gracefully by hhc7 · Pull Request #1537 · apache/answer

hhc7 · 2026-05-28T10:25:31Z

Proposed Changes

Add middleware.Recovery() that catches panics from any subsequent middleware or handler, logs the panic message with full stack trace via log.Errorf + debug.Stack(), and returns a unified 500 JSON response (reason: base.unknown) via handler.NewRespBody + TrMsg — consistent with how other errors are handled in internal/base/handler/handler.go.
Mount Recovery() as the first middleware in internal/base/server/http.go so it covers all subsequent middleware (brotli, accept-language, short-id, auth, etc.) and handlers.
Add unit tests in recovery_test.go covering both the panic path (verifies 500 + base.unknown reason) and the no-panic path (verifies normal requests pass through unaffected).

LinkinStars · 2026-05-30T06:00:52Z

I agree that adding a recovery layer makes sense. gin.New() does not include one by default, so today a panic can indeed end up as a dropped connection instead of a controlled response.

That said, I do not think this implementation is quite correct yet, because it assumes every panic can be turned into the same JSON 500 response.

There are at least two cases where that is not very elegant:

SSE / streaming endpoints
For /chat/completions, the handler starts an SSE response very early: it sets Content-Type: text/event-stream, sends status 200, flushes headers, and may already write streamed chunks before later logic runs. If a panic happens after that point,
recovery cannot reliably convert the response into a JSON 500. In practice it may just append JSON into an already-started event stream, which produces an invalid/mixed response rather than a clean error.
HTML routes
This server also handles UI / HTML routes, not only JSON APIs. With a global recovery mounted at the engine level, a panic in an HTML route would now also return the API-style JSON body. That is probably not the intended behavior for page requests,
and it is not really “consistent” for the whole server, only for JSON endpoints.

So I think the underlying concern is valid, but the current fix is a bit too broad. A more robust approach would be to either:

scope JSON recovery only to API routes, or
make recovery response-aware, for example checking whether headers/body were already written and whether the request is an API, SSE, or HTML route.

In short: the direction is reasonable, but handling all panics with the same AbortWithStatusJSON(500, ...) is not especially elegant for SSE and HTML paths.

feat: add recovery middleware to handle panic gracefully

3f1545d

LinkinStars self-requested a review May 30, 2026 06:00

LinkinStars self-assigned this May 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add recovery middleware to handle panic gracefully#1537

feat: add recovery middleware to handle panic gracefully#1537
hhc7 wants to merge 1 commit into
apache:mainfrom
hhc7:feat/add-recovery-middleware

hhc7 commented May 28, 2026

Uh oh!

LinkinStars commented May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

hhc7 commented May 28, 2026

Proposed Changes

Uh oh!

LinkinStars commented May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants