Detect supervisor-subprocess trigger count mismatch in Triggerer#68792
Open
JH0917 wants to merge 1 commit into
Open
Detect supervisor-subprocess trigger count mismatch in Triggerer#68792JH0917 wants to merge 1 commit into
JH0917 wants to merge 1 commit into
Conversation
|
Congratulations on your first Pull Request and welcome to the Apache Airflow community! If you have any issues or are unsure about any anything please check our Contributors' Guide
|
3daa0bb to
2a820f1
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
In rare cases the Triggerer subprocess can lose track of a trigger — it
is assigned in the supervisor's
running_triggersbut no coroutine isactually running. The affected task instance stays in
deferred permanently. A similar symptom was reported in #63913.Fix: The
TriggerRunnernow reportsnum_running(its actualtrigger count) in every
TriggerStateChangesmessage. The supervisorcompares this against
len(running_triggers)after finished triggersare removed but before new ones are added. If they diverge, the
supervisor shuts down so the orchestrator can restart it and recover the
stuck tasks.
related: #63913
Was generative AI tooling used to co-author this PR?
Generated-by: Claude Code (Opus 4.6) following the guidelines