Production issue after updating the EKS cluster to 1.21

Hi Community Friends

After we updated our Jenkins EKS cluster to 1.21 we are seeing an issue where the slave is not able to connect with the Master. Below are the error details - . Due to this, our builds are getting failed.


Terminated Kubernetes instance for agent ingress-nginx/default-7f1fd
Jun 28, 2023 8:18:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Disconnected computer default-7f1fd
Jun 28, 2023 8:18:52 AM INFO org.jenkinsci.plugins.cloudstats.CloudStatistics getIdFor
No support for cloud-stats-plugin by class org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave
Jun 28, 2023 8:19:07 AM INFO hudson.slaves.NodeProvisioner update
default-01qjt provisioning successfully completed. We have now 3 computer(s)
Jun 28, 2023 8:19:07 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Created Pod: kubernetes ingress-nginx/default-01qjt
Jun 28, 2023 8:21:47 AM WARNING org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Error in provisioning; agent=KubernetesSlave name: default-trf49, template=PodTemplate{id='f733cac9-6e44-4c40-9aff-eb2e01e82ee2', name='default', namespace='ingress-nginx', label='jenkins-slave', serviceAccount='jenkins', nodeUsageMode=NORMAL, podRetention='Never', volumes=[HostPathVolume [mountPath=/var/run/docker.sock, hostPath=/var/run/docker.sock]], containers=[ContainerTemplate{name='jnlp', image='https://artifactory.guidewire.com/hub-docker-remote/jenkins/inbound-agent:latest', alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', resourceRequestCpu='1024m', resourceRequestMemory='1024Mi', resourceRequestEphemeralStorage='', resourceLimitCpu='1024', resourceLimitMemory='1024Mi', resourceLimitEphemeralStorage='', envVars=[ContainerEnvVar [getValue()=http://jenkins.ingress-nginx.svc.cluster.local:8080, getKey()=JENKINS_URL]], livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]}
io.fabric8.kubernetes.client.KubernetesClientTimeoutException: Timed out waiting for [1000000] milliseconds for [Pod] with name:[default-trf49] in namespace [ingress-nginx].
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilCondition(BaseOperation.java:896)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:878)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:93)
	at org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher.launch(KubernetesLauncher.java:169)
	at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:298)
	at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46)
	at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:80)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
	at java.base/java.lang.Thread.run(Thread.java:829)

Jun 28, 2023 8:21:47 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-trf49
Jun 28, 2023 8:21:47 AM SEVERE org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Computer for agent is null: default-trf49
Jun 28, 2023 8:21:47 AM INFO hudson.slaves.AbstractCloudSlave terminate
FATAL: Computer for agent is null: default-trf49
Jun 28, 2023 8:21:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-25wk2
Jun 28, 2023 8:21:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave deleteSlavePod
Terminated Kubernetes instance for agent ingress-nginx/default-25wk2
Jun 28, 2023 8:21:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Disconnected computer default-25wk2
Jun 28, 2023 8:21:52 AM INFO org.jenkinsci.plugins.cloudstats.CloudStatistics getIdFor
No support for cloud-stats-plugin by class org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave
Jun 28, 2023 8:22:07 AM INFO hudson.slaves.NodeProvisioner update
default-pn9dk provisioning successfully completed. We have now 3 computer(s)
Jun 28, 2023 8:22:07 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Created Pod: kubernetes ingress-nginx/default-pn9dk
Jun 28, 2023 8:24:47 AM WARNING org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Error in provisioning; agent=KubernetesSlave name: default-3bm0q, template=PodTemplate{id='f733cac9-6e44-4c40-9aff-eb2e01e82ee2', name='default', namespace='ingress-nginx', label='jenkins-slave', serviceAccount='jenkins', nodeUsageMode=NORMAL, podRetention='Never', volumes=[HostPathVolume [mountPath=/var/run/docker.sock, hostPath=/var/run/docker.sock]], containers=[ContainerTemplate{name='jnlp', image='https://artifactory.guidewire.com/hub-docker-remote/jenkins/inbound-agent:latest', alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', resourceRequestCpu='1024m', resourceRequestMemory='1024Mi', resourceRequestEphemeralStorage='', resourceLimitCpu='1024', resourceLimitMemory='1024Mi', resourceLimitEphemeralStorage='', envVars=[ContainerEnvVar [getValue()=http://jenkins.ingress-nginx.svc.cluster.local:8080, getKey()=JENKINS_URL]], livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]}
io.fabric8.kubernetes.client.KubernetesClientTimeoutException: Timed out waiting for [1000000] milliseconds for [Pod] with name:[default-3bm0q] in namespace [ingress-nginx].
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilCondition(BaseOperation.java:896)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:878)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:93)
	at org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher.launch(KubernetesLauncher.java:169)
	at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:298)
	at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46)
	at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:80)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
	at java.base/java.lang.Thread.run(Thread.java:829)

Jun 28, 2023 8:24:47 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-3bm0q
Jun 28, 2023 8:24:47 AM SEVERE org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Computer for agent is null: default-3bm0q
Jun 28, 2023 8:24:47 AM INFO hudson.slaves.AbstractCloudSlave terminate
FATAL: Computer for agent is null: default-3bm0q
Jun 28, 2023 8:25:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-01qjt
Jun 28, 2023 8:25:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave deleteSlavePod
Terminated Kubernetes instance for agent ingress-nginx/default-01qjt
Jun 28, 2023 8:25:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Disconnected computer default-01qjt
Jun 28, 2023 8:25:52 AM INFO org.jenkinsci.plugins.cloudstats.CloudStatistics getIdFor
No support for cloud-stats-plugin by class org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave
Jun 28, 2023 8:26:07 AM INFO hudson.slaves.NodeProvisioner update
default-92lg6 provisioning successfully completed. We have now 3 computer(s)
Jun 28, 2023 8:26:07 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Created Pod: kubernetes ingress-nginx/default-92lg6
Jun 28, 2023 8:28:47 AM WARNING org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Error in provisioning; agent=KubernetesSlave name: default-7f1fd, template=PodTemplate{id='f733cac9-6e44-4c40-9aff-eb2e01e82ee2', name='default', namespace='ingress-nginx', label='jenkins-slave', serviceAccount='jenkins', nodeUsageMode=NORMAL, podRetention='Never', volumes=[HostPathVolume [mountPath=/var/run/docker.sock, hostPath=/var/run/docker.sock]], containers=[ContainerTemplate{name='jnlp', image='https://artifactory.guidewire.com/hub-docker-remote/jenkins/inbound-agent:latest', alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', resourceRequestCpu='1024m', resourceRequestMemory='1024Mi', resourceRequestEphemeralStorage='', resourceLimitCpu='1024', resourceLimitMemory='1024Mi', resourceLimitEphemeralStorage='', envVars=[ContainerEnvVar [getValue()=http://jenkins.ingress-nginx.svc.cluster.local:8080, getKey()=JENKINS_URL]], livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]}
io.fabric8.kubernetes.client.KubernetesClientTimeoutException: Timed out waiting for [1000000] milliseconds for [Pod] with name:[default-7f1fd] in namespace [ingress-nginx].
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilCondition(BaseOperation.java:896)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:878)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:93)
	at org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher.launch(KubernetesLauncher.java:169)
	at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:298)
	at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46)
	at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:80)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
	at java.base/java.lang.Thread.run(Thread.java:829)

Jun 28, 2023 8:28:47 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-7f1fd
Jun 28, 2023 8:28:47 AM SEVERE org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Computer for agent is null: default-7f1fd
Jun 28, 2023 8:28:47 AM INFO hudson.slaves.AbstractCloudSlave terminate
FATAL: Computer for agent is null: default-7f1fd
Jun 28, 2023 8:28:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Terminating Kubernetes instance for agent default-pn9dk
Jun 28, 2023 8:28:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave deleteSlavePod
Terminated Kubernetes instance for agent ingress-nginx/default-pn9dk
Jun 28, 2023 8:28:52 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave _terminate
Disconnected computer default-pn9dk
Jun 28, 2023 8:28:52 AM INFO org.jenkinsci.plugins.cloudstats.CloudStatistics getIdFor
No support for cloud-stats-plugin by class org.csanchez.jenkins.plugins.kubernetes.KubernetesSlave
Jun 28, 2023 8:29:07 AM INFO hudson.slaves.NodeProvisioner update
default-sqp4q provisioning successfully completed. We have now 3 computer(s)
Jun 28, 2023 8:29:07 AM INFO org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Created Pod: kubernetes ingress-nginx/default-sqp4q
Jun 28, 2023 8:31:47 AM WARNING org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher launch
Error in provisioning; agent=KubernetesSlave name: default-25wk2, template=PodTemplate{id='f733cac9-6e44-4c40-9aff-eb2e01e82ee2', name='default', namespace='ingress-nginx', label='jenkins-slave', serviceAccount='jenkins', nodeUsageMode=NORMAL, podRetention='Never', volumes=[HostPathVolume [mountPath=/var/run/docker.sock, hostPath=/var/run/docker.sock]], containers=[ContainerTemplate{name='jnlp', image='https://artifactory.guidewire.com/hub-docker-remote/jenkins/inbound-agent:latest', alwaysPullImage=true, workingDir='/home/jenkins/agent', command='', args='', resourceRequestCpu='1024m', resourceRequestMemory='1024Mi', resourceRequestEphemeralStorage='', resourceLimitCpu='1024', resourceLimitMemory='1024Mi', resourceLimitEphemeralStorage='', envVars=[ContainerEnvVar [getValue()=http://jenkins.ingress-nginx.svc.cluster.local:8080, getKey()=JENKINS_URL]], livenessProbe=ContainerLivenessProbe{execArgs='', timeoutSeconds=0, initialDelaySeconds=0, failureThreshold=0, periodSeconds=0, successThreshold=0}}]}
io.fabric8.kubernetes.client.KubernetesClientTimeoutException: Timed out waiting for [1000000] milliseconds for [Pod] with name:[default-25wk2] in namespace [ingress-nginx].
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilCondition(BaseOperation.java:896)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:878)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.waitUntilReady(BaseOperation.java:93)
	at org.csanchez.jenkins.plugins.kubernetes.KubernetesLauncher.launch(KubernetesLauncher.java:169)
	at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:298)
	at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46)
	at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:80)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
	at java.base/java.lang.Thread.run(Thread.java:829)

I would greatly appreciate any assistance provided.
Many Thanks

Hi @GreatHornbill and welcome to this community. :wave:

As a reminder, the term “slave” to refer to an agent has been deprecated since 2016. Please refer to On Jenkins Terminology Updates for more details. We request you update your post.