
fix(flask): wrap wsgi_app call in try/except to prevent active_requests gauge leak #4433

Open
alliasgher wants to merge 8 commits into open-telemetry:main from alliasgher:fix-flask-active-requests-gauge-leak

Conversation

@alliasgher (Contributor) commented Apr 14, 2026

Description

Wrap the wsgi_app call in try/finally so that http.server.active_requests is decremented when the wrapped WSGI app raises. Without this, every exception leaves the gauge permanently above the real number of in-flight requests, causing steady drift upward over the process lifetime.

Fixes #4403
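The failure mode can be illustrated with a minimal, self-contained sketch. This is not the instrumentation code itself; FakeCounter is an illustrative stand-in for the OpenTelemetry UpDownCounter backing http.server.active_requests:

```python
# Minimal sketch of the try/finally fix (illustrative names only; the
# real instrumentation uses an OpenTelemetry UpDownCounter).
class FakeCounter:
    def __init__(self):
        self.value = 0

    def add(self, amount):
        self.value += amount


active_requests = FakeCounter()


def wrapped_app(app, environ):
    active_requests.add(1)
    try:
        # Even if app() raises, the finally block still runs, so the
        # gauge cannot drift upward on errors.
        return app(environ)
    finally:
        active_requests.add(-1)


def failing_app(environ):
    raise ValueError("boom")


try:
    wrapped_app(failing_app, {})
except ValueError:
    pass

print(active_requests.value)  # → 0
```

Without the try/finally, the decrement after the app call would be skipped whenever the app raises, leaving the counter one higher per exception.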

Type of change

  • Bug fix (non-breaking change which fixes an issue)

How Has This Been Tested?

Added test_active_requests_decremented_on_error in test_programmatic.py which drives a handler that raises and asserts the gauge ends at 0.

Checklist

  • Changelog entry added under Unreleased / Fixed

@liyaka commented Apr 14, 2026

Thanks for the quick fix! We hit this exact bug in production — the leaked gauge was preventing our Kubernetes HPA from scaling down.

One suggestion: consider using try/finally instead of try/except to keep the decrement in a single code path. With the current approach, active_requests_counter.add(-1, ...) exists in two places (the except block and line 458), which is easy to get out of sync during future refactors.

The WSGI instrumentation already uses this pattern:

# wsgi/__init__.py
try:
    ...
    return _end_span_after_iterating(iterable, span, token)
except Exception as ex:
    raise
finally:
    self.active_requests_counter.add(-1, active_requests_count_attrs)

For Flask it would look like:

try:
    result = wsgi_app(wrapped_app_environ, _start_response)
    # ...duration recording unchanged...
    return result
finally:
    active_requests_counter.add(-1, active_requests_count_attrs)

This also removes the original decrement at line 458, so there's exactly one decrement path.

Also worth adding a test to prevent regression — I have one in #4437 (test_active_requests_counter_decremented_on_error) that could be adapted here.

@alliasgher (Contributor, Author) commented Apr 14, 2026

Good catch — updated to use try/finally so the decrement is in a single code path regardless of whether wsgi_app succeeds or raises @liyaka

@alliasgher force-pushed the fix-flask-active-requests-gauge-leak branch from bb98692 to 5c2b07a on April 14, 2026 at 20:20
@alliasgher (Contributor, Author)

Added the regression test test_active_requests_counter_decremented_on_error. It hits /hello/500 (which raises ValueError internally), collects http.server.active_requests, and asserts the value is back to 0 after both requests complete.
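A simplified version of that regression test can be sketched as follows. Names here are illustrative stand-ins: the real test runs against the Flask instrumentation and reads http.server.active_requests from an in-memory metric reader, while this sketch only models the wrap/decrement logic:

```python
# Simplified sketch of the regression test: two requests, one of which
# raises, and the active-requests gauge must end at 0.
class FakeCounter:
    def __init__(self):
        self.value = 0

    def add(self, amount):
        self.value += amount


def make_wrapped_app(counter, app):
    def wrapped(environ):
        counter.add(1)
        try:
            return app(environ)
        finally:
            # Single decrement path, reached on success and on error.
            counter.add(-1)
    return wrapped


def handler(environ):
    if environ.get("PATH_INFO") == "/hello/500":
        raise ValueError("simulated handler error")
    return "ok"


gauge = FakeCounter()
app = make_wrapped_app(gauge, handler)

app({"PATH_INFO": "/hello/200"})   # successful request
try:
    app({"PATH_INFO": "/hello/500"})  # handler raises
except ValueError:
    pass

print(gauge.value)  # → 0
```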

…ts gauge leak

If wsgi_app() raises an uncaught exception, the active_requests_counter
decrement at the end of _wrapped_app was never reached, causing the gauge
to permanently read high. Kubernetes HPA and similar systems would see
phantom load.

Add a bare try/except that decrements the counter and re-raises on
exception, matching the pattern already used in the WSGI instrumentation.

Fixes open-telemetry#4431

Signed-off-by: alliasgher <alliasgher123@gmail.com>
Signed-off-by: Ali <alliasgher123@gmail.com>
@alliasgher force-pushed the fix-flask-active-requests-gauge-leak branch from 5c2b07a to efb677f on April 15, 2026 at 20:21
@alliasgher requested a review from a team as a code owner on April 15, 2026 at 20:21
@MikeGoldsmith (Member) left a comment


Thanks for this fix @alliasgher - looks good to me.

@github-project-automation bot moved this to Approved PRs in Python PR digest on Apr 15, 2026
@liyaka commented Apr 16, 2026

Any estimate on when this fix will be released?
Thank you!

Signed-off-by: Ali <alliasgher123@gmail.com>
@alliasgher force-pushed the fix-flask-active-requests-gauge-leak branch from e56a8af to b578c8d on April 16, 2026 at 21:14
The check-links workflow fails on any PR that touches CHANGELOG.md
because the full file is scanned and five historical entries contain
broken URLs:

- open-telemetry#1670 and open-telemetry#227 entries have a stray `]` inside the URL.
- The open-telemetry#1033 entry is missing the `/` between the org and repo in the URL.
- The `aws.ecs.*` spec link points to the old path in
  opentelemetry-specification; the content has since moved to the
  semantic-conventions repo.
- The 1.12.0rc2-0.32b0 release tag does not exist on
  opentelemetry-python; drop the link, keep the heading text.

Signed-off-by: Ali <alliasgher123@gmail.com>
@alliasgher (Contributor, Author)

Pushed bb17ad0 to fix broken links in CHANGELOG.md that were failing check-links CI.
