feat: Separate Runtime Statistics Collection from UI Updates by kunwp1 · Pull Request #4205 · apache/texera

kunwp1 · 2026-02-10T20:56:59Z

What changes were proposed in this PR?

This PR introduces a new configuration parameter runtime-statistics-persistence-interval to independently control the frequency of runtime statistics persistence, separate from the UI update frequency (status-update-interval). Previously, both UI updates and runtime statistics persistence were controlled by a single parameter status-update-interval. This means frequent UI updates (e.g., 500ms) caused excessive statistics writes to storage. This change allows independent control:

status-update-interval: Controls how often the frontend UI refreshes (default: 500ms)
runtime-statistics-persistence-interval: Controls how often statistics are persisted to storage (default: 2000ms)

Do two timers mean more frequent worker queries?

No. The controller tracks the timestamp of the last completed full-graph worker query and uses min(status-update-interval, runtime-statistics-persistence-interval) as a freshness threshold. When a timer fires, if the elapsed time since the last query is within this threshold, the controller forwards stats from cache without querying workers — so the faster timer drives all real worker queries and the slower timer always reuses the result. If a query is already in-flight when the second timer fires, the controller serves stats from the previous completed query's cache. Cache reuse applies to timer-triggered queries only; event-triggered queries (e.g., from worker completion events) always proceed to real worker RPCs.

Changes

Added runtime-statistics-persistence-interval parameter (default: 2000ms) in application.conf
Protobuf: Added StatisticsUpdateTarget enum (UI_ONLY, PERSISTENCE_ONLY, BOTH_UI_AND_PERSISTENCE) to QueryStatisticsRequest
Added RuntimeStatisticsPersist event for statistics-only updates; ExecutionStatsUpdate now handles UI-only updates
Added separate timer for runtime statistics persistence that runs independently from the UI update timer
Query Handling
- Timer-triggered queries specify target: UI-only or persistence-only
- Event-triggered queries (port/worker completion, pause, resume) send both UI and persistence updates to preserve original behavior
- QueryWorkerStatisticsHandler routes to the appropriate event based on StatisticsUpdateTarget
Worker query deduplication in QueryWorkerStatisticsHandler: when the second timer fires, the controller checks whether worker stats were already fetched recently (within min(status-update-interval, runtime-statistics-persistence-interval)). If so, it forwards the cached stats to the appropriate sink (UI or persistence) without issuing any worker RPCs. If a query is already in-flight, cached stats from the previous completed query are forwarded.

Any related issues, documentation, discussions?

Closes #4204

How was this PR tested?

Tested with the following workflow and dataset, change the runtime-statistics-persistence-interval parameter to see if the runtime stats size reduces if we increase the parameter value.
Iris Dataset Analysis.json
Iris.csv

Was this PR authored or co-authored using generative AI tooling?

Generated-by: Claude-4.6

Xiao-zhen-Liu

LGTM, left minor comments and some questions. Tested and can verify the size changes of the persisted runtime stats by adjusting this new parameter.

common/config/src/main/resources/application.conf

...xera/amber/engine/architecture/controller/promisehandlers/QueryWorkerStatisticsHandler.scala

...in/scala/org/apache/texera/amber/engine/architecture/controller/ControllerTimerService.scala

...xera/amber/engine/architecture/controller/promisehandlers/QueryWorkerStatisticsHandler.scala

kunwp1 · 2026-02-20T04:58:53Z

@Xiao-zhen-Liu Can you do a one more pass of the review?

@chenlica and I discussed the design and decided to keep the two control messages independent (one for UI updates, one for persistence) to avoid coupling their intervals together.

To address the concern about increased worker query frequency, I added an optimization in QueryWorkerStatisticsHandler: when a timer fires, the controller checks whether a full-graph worker query was already completed recently (within min(status-update-interval, runtime-statistics-persistence-interval)). If so, it reuses the cached stats from WorkerExecution and forwards them to the appropriate sink (UI or persistence) without issuing any worker RPCs. This means the number of queryStatistics RPCs sent to workers does not increase compared to before. The faster timer drives all real worker queries and the slower timer always reuses the result.

Xiao-zhen-Liu

LGTM, thanks for the new changes. Only have a minor clarification about the new behavior.

Xiao-zhen-Liu · 2026-02-20T19:26:13Z

...xera/amber/engine/architecture/controller/promisehandlers/QueryWorkerStatisticsHandler.scala

    if (globalQueryStatsOngoing && msg.filterByWorkers.isEmpty) {
+      // A query is already in-flight: serve the last completed query's cached data,
+      // or drop silently if no prior query has finished yet.
+      if (lastWorkerQueryTimestampNs > 0) forwardStats(msg.updateTarget)


I'm trying to understand the behavior of this change. Does this mean only the concurrent requests before the first globalQuery finishes will be dropped, and after the first globalQuery of a workflow finishes, all subsequent concurrent requests of an ongoing globalQuery will be served from cache? (Previously, any concurrent request will be dropped.)

Xiao-zhen-Liu · 2026-02-20T19:36:40Z

...xera/amber/engine/architecture/controller/promisehandlers/QueryWorkerStatisticsHandler.scala

+      forwardStats(msg.updateTarget)
+      // Record the completion timestamp before releasing the lock so that any timer
+      // firing in between sees a valid cache entry rather than triggering a redundant query.
      if (globalQueryStatsOngoing) {


Can the completion of filtered requests also trigger this lock-release?

kunwp1 added 2 commits February 10, 2026 12:45

Add new param

4dab632

Update application.conf

7c27789

kunwp1 requested a review from Xiao-zhen-Liu February 10, 2026 20:56

kunwp1 self-assigned this Feb 10, 2026

kunwp1 and others added 2 commits February 10, 2026 12:57

Merge branch 'main' into chris-introduce-new-interval-param

3226966

Remove unnecessary changes

c35f055

github-actions bot added engine common labels Feb 10, 2026

chenlica requested a review from aglinxinyuan February 11, 2026 14:59

Merge branch 'main' into chris-introduce-new-interval-param

723392f

Xiao-zhen-Liu approved these changes Feb 11, 2026

View reviewed changes

chenlica and others added 4 commits February 14, 2026 21:09

Merge branch 'main' into chris-introduce-new-interval-param

d035da3

Address comments

8e2c431

Revert unnecessary changes

be757bb

Fix

47fc254

Merge branch 'main' into chris-introduce-new-interval-param

9e2cf74

Xiao-zhen-Liu approved these changes Feb 20, 2026

View reviewed changes

Xiao-zhen-Liu reviewed Feb 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: Separate Runtime Statistics Collection from UI Updates#4205

feat: Separate Runtime Statistics Collection from UI Updates#4205
kunwp1 wants to merge 10 commits intoapache:mainfrom
kunwp1:chris-introduce-new-interval-param

kunwp1 commented Feb 10, 2026 •

edited

Loading

Uh oh!

Xiao-zhen-Liu left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kunwp1 commented Feb 20, 2026

Uh oh!

Xiao-zhen-Liu left a comment

Uh oh!

Xiao-zhen-Liu Feb 20, 2026

Uh oh!

Xiao-zhen-Liu Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

kunwp1 commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this PR?

Do two timers mean more frequent worker queries?

Changes

Any related issues, documentation, discussions?

How was this PR tested?

Was this PR authored or co-authored using generative AI tooling?

Uh oh!

Xiao-zhen-Liu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kunwp1 commented Feb 20, 2026

Uh oh!

Xiao-zhen-Liu left a comment

Choose a reason for hiding this comment

Uh oh!

Xiao-zhen-Liu Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Xiao-zhen-Liu Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

kunwp1 commented Feb 10, 2026 •

edited

Loading