Skip to content

[BUG]: dynamographdeployment operator didn't handle multinode spec update operation #4749

@Monokaix

Description

@Monokaix

Describe the Bug

Current behavior:
When set multinode count=2 and enable lws, the dynamographdeployment operator will create a lws with 2 workers, and when didn't enable multinode, the operator will just create a deployment.

But if we deploy a backed with aggregated mode without setting multinode filed, and then update the dgd cr with setting multinode count=2, the previous created deployment will not be deleted, and the lws and deployment co-exist.

Similarly, if we deploy a backend with setting multinode=2 and then remove this field, the lws will not be removed either.

I'd like to know if update the multinode spec is a not allowed behavior, if true we should add the validation here and if not we should handle this correctly.

Steps to Reproduce

  1. deploy a vllm backend with agg mode using this example
  2. edit the dgd cr and set multinode=2
  3. use kubectl and find that both deployment and lws exist.

Expected Behavior

The deployment should be deleted when set multinode field

Actual Behavior

Both deployment and lws exist and the deployment is not deleted.

Environment

This issue is environment-independent.

Additional Context

No response

Screenshots

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions