-
Notifications
You must be signed in to change notification settings - Fork 228
Closed
Labels
Description
spec.runPolicy.activeDeadlineSeconds was set to 300s, mpijob was running over 22 minites and not terminated.
By reading the code, I found that activeDeadlineSeconds was not processed in mpi-operator, so is this a bug?
# mpijob:
apiVersion: kubeflow.org/v1
kind: MPIJob
metadata:
name: mpijob-v1-3
namespace: default
spec:
...
runPolicy:
activeDeadlineSeconds: 300
backoffLimit: 2
# mpijob status
status:
conditions:
- lastTransitionTime: "2020-07-06T05:15:20Z"
lastUpdateTime: "2020-07-06T05:15:20Z"
message: MPIJob default/mpijob-v1-3 is created.
reason: MPIJobCreated
status: "True"
type: Created
- lastTransitionTime: "2020-07-06T05:59:47Z"
lastUpdateTime: "2020-07-06T05:59:47Z"
message: MPIJob default/mpijob-v1-3 is running.
reason: MPIJobRunning
status: "True"
type: Running
replicaStatuses:
Launcher:
active: 1
Worker:
active: 1
# running pod
NAME READY STATUS RESTARTS AGE
mpijob-v1-3-launcher 1/1 Running 0 22m
mpijob-v1-3-worker-0 1/1 Running 0 22m