@mukrishn
Member

@mukrishn mukrishn commented Oct 22, 2025

The metrics-service added during installation is configured with an incorrect port reference, so Prometheus fails to scrape metrics from the Kueue endpoint. This PR updates it to the correct port name, taken from this spec.
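
A ServiceMonitor endpoint refers to a Service port by name, so a name that does not appear among the Service's ports yields no scrape target. The sketch below illustrates that lookup with simplified, hypothetical types (`ServicePort`, `Endpoint`, `ResolvePort`); they are not the Prometheus Operator API, and the port names and the number 8443 are assumptions chosen only for illustration.

```go
package main

import "fmt"

// ServicePort is a simplified, hypothetical stand-in for one entry in a
// Kubernetes Service's port list.
type ServicePort struct {
	Name string
	Port int32
}

// Endpoint is a simplified, hypothetical stand-in for a ServiceMonitor
// endpoint, which refers to a Service port by name.
type Endpoint struct {
	Interval string
	Path     string
	Port     string
}

// ResolvePort returns the numeric port whose name equals the endpoint's
// Port field, or false when no Service port carries that name.
func ResolvePort(ep Endpoint, ports []ServicePort) (int32, bool) {
	for _, p := range ports {
		if p.Name == ep.Port {
			return p.Port, true
		}
	}
	return 0, false
}

func main() {
	// Assumed Service definition: one port named "https" (illustrative only).
	ports := []ServicePort{{Name: "https", Port: 8443}}

	for _, name := range []string{"metrics", "https"} {
		ep := Endpoint{Interval: "30s", Path: "/metrics", Port: name}
		if port, ok := ResolvePort(ep, ports); ok {
			fmt.Printf("endpoint port %q resolves to %d\n", name, port)
		} else {
			fmt.Printf("endpoint port %q matches no Service port\n", name)
		}
	}
}
```

Whichever side is renamed, the point is that the endpoint's port name and the Service's port name have to agree.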

Also, the service selector label `control-plane: controller-manager` matches more than one Service in the namespace, which causes multiple invalid entries in the Prometheus target config. This PR adds a new label and updates the selector to use it.
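
To see why a shared label produces extra targets, here is a minimal sketch of equality-based label matching. The `Service` type, the `webhook-service` name, and the `example.com/metrics` label are hypothetical; the label actually added by this PR is not shown in the conversation.

```go
package main

import "fmt"

// Service is a simplified, hypothetical stand-in for a Kubernetes Service.
type Service struct {
	Name   string
	Labels map[string]string
}

// Matches reports whether every key/value pair in selector is present in
// labels (equality-based matchLabels semantics).
func Matches(selector, labels map[string]string) bool {
	for k, v := range selector {
		if got, ok := labels[k]; !ok || got != v {
			return false
		}
	}
	return true
}

// Select returns the names of all services whose labels satisfy selector.
func Select(selector map[string]string, services []Service) []string {
	var names []string
	for _, s := range services {
		if Matches(selector, s.Labels) {
			names = append(names, s.Name)
		}
	}
	return names
}

func main() {
	// Two Services share the control-plane label; only one carries the
	// extra (hypothetical) label meant for metrics scraping.
	services := []Service{
		{Name: "metrics-service", Labels: map[string]string{
			"control-plane":       "controller-manager",
			"example.com/metrics": "true",
		}},
		{Name: "webhook-service", Labels: map[string]string{
			"control-plane": "controller-manager",
		}},
	}

	broad := map[string]string{"control-plane": "controller-manager"}
	narrow := map[string]string{
		"control-plane":       "controller-manager",
		"example.com/metrics": "true",
	}

	fmt.Println("broad selector matches:", Select(broad, services))
	fmt.Println("narrow selector matches:", Select(narrow, services))
}
```

A selector that adds a label unique to the metrics Service narrows the match to that one Service, which is the effect the description above is after.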

Fixes: RHOAIENG-36974

The current label `control-plane: controller-manager` selects more than one Service in the namespace, causing multiple invalid entries in the Prometheus target config
@openshift-ci openshift-ci bot requested review from cpmeadors and mrunalp October 22, 2025 22:47
@openshift-ci

openshift-ci bot commented Oct 22, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: mukrishn
Once this PR has been reviewed and has the lgtm label, please assign sohankunkerkar for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@kannon92
Contributor

Thank you for the fix!

/hold

I'm going to verify that I can reproduce this and that this actually fixes it.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 23, 2025
  Interval: "30s",
  Path: "/metrics",
- Port: "metrics", // Name of the port you want to monitor
+ Port: "https", // Name of the port you want to monitor
Contributor

This may break our network policies actually.

@tkashem made this change as part of ec72771

Member Author

I see. That PR had actually updated the port name in both places; it was later partially reverted in this change, and I guess that introduced this problem.
I will update the fix.

Update this name as suggested in the review comment; it is required for the network policy rules.
@openshift-ci

openshift-ci bot commented Oct 23, 2025

@mukrishn: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/test-e2e-4-18 | 3fc9b59 | link | true | /test test-e2e-4-18 |
| ci/prow/test-e2e-4-19 | 3fc9b59 | link | true | /test test-e2e-4-19 |
| ci/prow/test-e2e-4-20 | 3fc9b59 | link | true | /test test-e2e-4-20 |

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@kannon92
Contributor

This does look like it fixes the metrics issue.

@rphillips
Contributor

rphillips commented Oct 24, 2025

We need to fix this upstream and downstream. Opened 2 PRs:

kubernetes-sigs/kueue#7371
#856

The kueue-operator PR won't work until the upstream PR merges and gets backported to 0.14.

@rphillips
Contributor

Thank you for the contribution. Fixed via #856

/close

@openshift-merge-robot openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Nov 4, 2025
@openshift-merge-robot

PR needs rebase.

@openshift-ci openshift-ci bot closed this Nov 4, 2025
@openshift-ci

openshift-ci bot commented Nov 4, 2025

@rphillips: Closed this PR.

In response to this:

Thank you for the contribution. Fixed via #856

/close
