[WIP] MetricsV2 + OTel+Prometheus sample and Grafana dashboard #3154

Draft
csviri wants to merge 49 commits into operator-framework:next from csviri:otel-metrics-grafana

Conversation

csviri (Collaborator) commented Feb 4, 2026

The goal of this PR is to provide an OTel + Prometheus + Grafana setup, so that we:

  1. Verify the integration with OTel.
  2. Provide a default Grafana dashboard for metrics.
  3. Change the webpage sample to use OTel, and deploy the app and dashboard to a cluster with OTel + Prometheus + Grafana. This should be easy for users to do, so they can check and validate the dashboard.
  4. Add new metrics implementations.

Notes on the new metrics implementation:

  • Only static gauges (they live as long as the operator does), which makes them much easier to manage.
  • No manual counter removal (this should not be done).
  • Names and labels are sanitized according to best practices.
  • Added some counter metrics for last-attempt retries.
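
The note about sanitized names and labels can be illustrated with a minimal sketch. It assumes the Prometheus convention that metric names use only ASCII letters, digits, and underscores and do not start with a digit. The `MetricNameSanitizer` class and its `sanitize` method are hypothetical names for illustration and are not taken from this PR.

```java
import java.util.Locale;

// Hypothetical helper, not part of the PR: normalizes an arbitrary string
// into a name that is safe to use as a Prometheus metric or label name.
final class MetricNameSanitizer {

  private MetricNameSanitizer() {}

  static String sanitize(String raw) {
    // Lower-case, then replace every run of characters outside [a-z0-9_]
    // with a single underscore.
    String cleaned = raw.toLowerCase(Locale.ROOT).replaceAll("[^a-z0-9_]+", "_");
    // Collapse underscores that ended up next to each other.
    cleaned = cleaned.replaceAll("_{2,}", "_");
    // Names must not be empty and must not start with a digit.
    if (cleaned.isEmpty() || Character.isDigit(cleaned.charAt(0))) {
      cleaned = "_" + cleaned;
    }
    return cleaned;
  }
}
```

For example, `MetricNameSanitizer.sanitize("operator.sdk.reconciliations-executions")` yields `operator_sdk_reconciliations_executions`.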

csviri and others added 17 commits February 4, 2026 15:30
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Steve Hawkins <shawkins@redhat.com>
…work#3027)

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
…amework#3076)

simplified the javadoc

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Chris Laprun <metacosm@gmail.com>
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
…3093)

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
…ork#3106)

- Adds a utility that provides methods to update resources using comparable resource versions
- Integrates this utility into the core of the framework (thus simplifying `ReconciliationDispatcher`)
  - Note that this changes the behavior of `UpdateControl.patchStatus` (and others), since it won't trigger a reconciliation for the event caused by that update.
- Renames the former `ReconcilerUtils` to `ReconcilerUtilsInternal`; this is a breaking change, but that utility was never advertised for non-internal usage
- Also includes fixes for `ControllerEventSource` event filtering
- Improves the `TemporaryResourceCache` event filtering algorithm

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Steve Hawkins <shawkins@redhat.com>
Co-authored-by: Steve Hawkins <shawkins@redhat.com>
)

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
…rator-framework#3137)

This should not be user-facing, at least not in any obvious scenario.

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
…om Context (operator-framework#3142)

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
Signed-off-by: Chris Laprun <metacosm@gmail.com>
Co-authored-by: Chris Laprun <metacosm@gmail.com>
…erator-framework#3141)

Signed-off-by: Chris Laprun <metacosm@gmail.com>
Co-authored-by: Attila Mészáros <a_meszaros@apple.com>
openshift-ci bot added the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Feb 4, 2026
csviri changed the title from "OTel+Prometheus sample and Graphana dashboard" to "[WIP] OTel+Prometheus sample and Graphana dashboard" Feb 4, 2026
metacosm and others added 3 commits February 5, 2026 16:20
…ator-framework#3156)

* refactor: avoid creating intermediate collections when unneeded

Also use constant filters and collectors

Signed-off-by: Chris Laprun <metacosm@gmail.com>

* fix: incorrect test

Signed-off-by: Chris Laprun <metacosm@gmail.com>

---------

Signed-off-by: Chris Laprun <metacosm@gmail.com>
…erator-framework#3161)

* improve: prepare for removal of exitOnStopLeading from public API

Signed-off-by: Chris Laprun <metacosm@gmail.com>

* fix: missed deprecated annotation

[skip ci]

Signed-off-by: Chris Laprun <metacosm@gmail.com>

---------

Signed-off-by: Chris Laprun <metacosm@gmail.com>
csviri changed the title from "[WIP] OTel+Prometheus sample and Graphana dashboard" to "[WIP] OTel+Prometheus sample and Grfana dashboard" Feb 8, 2026
csviri changed the title from "[WIP] OTel+Prometheus sample and Grfana dashboard" to "[WIP] OTel+Prometheus sample and Grafana dashboard" Feb 8, 2026
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
csviri (Collaborator, Author) commented Feb 9, 2026

JVM metrics:

[two images]

csviri (Collaborator, Author) commented Feb 9, 2026

JOSDK metrics:

[image]

Added TODOs to improve these. For example, the controller name should not be a suffix of the metric name but rather a tag:

operator_sdk_reconciliations_executions_webpagestandalonedependentsreconciler
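
A minimal sketch of what that TODO could amount to: the suffixed name is split into a base metric name plus a label, rendered here in Prometheus series notation. The label key `controller` and the `ControllerLabelSketch` helper are assumptions for illustration, not the PR's actual implementation.

```java
// Hypothetical helper, not part of the PR: turns a metric name that carries
// the controller name as a suffix into "base{controller=\"...\"}" form.
final class ControllerLabelSketch {

  private ControllerLabelSketch() {}

  static String asLabeledSeries(String suffixedName, String controller) {
    String suffix = "_" + controller;
    if (!suffixedName.endsWith(suffix)) {
      throw new IllegalArgumentException(
          "metric name does not end with controller suffix: " + suffixedName);
    }
    // Strip the suffix to recover the controller-independent metric name.
    String base = suffixedName.substring(0, suffixedName.length() - suffix.length());
    // "controller" is an assumed label key.
    return base + "{controller=\"" + controller + "\"}";
  }
}
```

With the name above and the controller `webpagestandalonedependentsreconciler`, this returns `operator_sdk_reconciliations_executions{controller="webpagestandalonedependentsreconciler"}`, so one metric name can cover every controller and dashboards can filter or group by the label.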

Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
@csviri csviri linked an issue Feb 10, 2026 that may be closed by this pull request
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
csviri changed the title from "[WIP] OTel+Prometheus sample and Grafana dashboard" to "[WIP] MetricsV2 + OTel+Prometheus sample and Grafana dashboard" Feb 11, 2026
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>

Labels

do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress.


3 participants