-
-
Notifications
You must be signed in to change notification settings - Fork 12
Closed
Labels
Description
Service(s)
Azure
Summary
The AKS cluster publick8s
suffers from SNAT port exhaustion since around 1 month (example below for the last 24 hours):

It causes the following problems:
- Pagerduty is spamming us with false positive alerts reporting unusual response time from repo.jenkins-ci.org and reports.jenkins.io services while other probes (on other clusters) are reporting valid response times.
- The rest of the public network (including ci.jenkins.io and its agents, even on the peered network on the sponsored subscription,) is slowed down:
- ci.jenkins.io jobs on Windows agents are much slower than 21 days ago #3904 (even if the network slowness is not the only reason)
- Mirrorbits got pods restart on publick8s #3799 (mirrorbit pods are failing when trying to scan mirrors)
Reproduction steps
No response