Breaking change: perf: Enable scheduling overlap by default #4174

kaiyux · 2025-05-09T05:02:40Z

This PR:

Renamed the enable_overlap_scheduler argument in PyTorchConfig to disable_overlap_scheduler.
Enabling scheduling overlap by default.

kaiyux · 2025-05-09T05:02:55Z

/bot run --disable-fail-fast

Copilot

Pull Request Overview

This PR reverses the overlap scheduler flag by replacing the old “enable_overlap_scheduler” parameter with “disable_overlap_scheduler” (defaulted to False) across the codebase to enable scheduling overlap by default.

Inverted the flag logic in all relevant modules (worker, executor, pyexecutor, model_engine, decoder, etc.).
Updated CLI examples, documentation, and configuration files to reflect the new “disable_overlap_scheduler” naming and semantics.

Reviewed Changes

Copilot reviewed 47 out of 47 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
tensorrt_llm/scaffolding/worker.py	Parameter renamed and updated in worker initialization.
tensorrt_llm/executor/worker.py	Flag condition inverted for determining scheduling overlap.
tensorrt_llm/commands/*	Removed legacy flag usage in serve and eval commands.
tensorrt_llm/_torch/pyexecutor/*	Multiple files updated to invert flag logic consistently.
examples/* and docs/*	CLI/example scripts and documentation now reference the new flag.

tensorrt-cicd · 2025-05-09T05:08:30Z

PR_Github #4651 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-09T06:33:17Z

PR_Github #4651 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3354 completed with status: 'FAILURE'

kaiyux · 2025-05-09T09:03:42Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-05-09T09:09:10Z

PR_Github #4692 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-09T17:29:21Z

PR_Github #4692 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3382 completed with status: 'FAILURE'

kaiyux · 2025-05-11T07:20:24Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-05-11T07:26:04Z

PR_Github #4767 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-11T11:25:32Z

PR_Github #4767 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3443 completed with status: 'FAILURE'

QiJune · 2025-05-12T00:08:56Z

/bot run --disable-fail-fast

tensorrt-cicd · 2025-05-12T00:14:39Z

PR_Github #4790 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-12T07:09:03Z

PR_Github #4790 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3464 completed with status: 'FAILURE'

kaiyux · 2025-05-14T01:32:52Z

/bot run --disable-fail-fast --add-multi-gpu-test

tensorrt-cicd · 2025-05-14T01:41:39Z

PR_Github #5075 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-14T06:54:33Z

PR_Github #5075 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3694 completed with status: 'FAILURE'

kaiyux · 2025-05-14T06:58:54Z

/bot run --stage-list "DGX_H100-4_GPUs-PyTorch-Others-1"

tensorrt-cicd · 2025-05-14T07:22:06Z

PR_Github #5127 [ run ] triggered by Bot

tensorrt-cicd · 2025-05-14T10:09:12Z

PR_Github #5127 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3733 (Partly Tested) completed with status: 'FAILURE'

kaiyux · 2025-05-14T12:10:40Z

/bot run --disable-fail-fast --add-multi-gpu-test

tensorrt-cicd · 2025-05-14T12:16:28Z

PR_Github #5171 [ run ] triggered by Bot

...gregated/test_configs/disagg_config_ctxtp2_gentp2_deepseek_v3_lite_attention_dp_overlap.yaml

tensorrt-cicd · 2025-05-14T20:50:09Z

PR_Github #5171 [ run ] completed with state FAILURE
/LLM/main/L0_MergeRequest_PR pipeline #3772 completed with status: 'FAILURE'

Signed-off-by: Kaiyu Xie <[email protected]> Fix Signed-off-by: Kaiyu Xie <[email protected]> Fix test_ptp_quickstart_advanced_eagle3 Signed-off-by: Kaiyu Xie <[email protected]> Fix Signed-off-by: Kaiyu Xie <[email protected]> Fix Signed-off-by: Kaiyu Xie <[email protected]> Fix Signed-off-by: Kaiyu Xie <[email protected]>

Signed-off-by: Kaiyu Xie <[email protected]>

kaiyux · 2025-05-15T01:06:53Z

/bot run --disable-fail-fast --add-multi-gpu-test

tensorrt-cicd · 2025-05-15T01:18:06Z

PR_Github #5227 [ run ] triggered by Bot

kaiyux · 2025-05-15T06:10:17Z

Pre-merge pipeline has passed, merging this in case there are going to be more conflicts.

kaiyux · 2025-05-15T06:11:02Z

/bot skip --comment "pre-merge pipeline has passed"

tensorrt-cicd · 2025-05-15T06:18:13Z

PR_Github #5278 [ skip ] triggered by Bot

tensorrt-cicd · 2025-05-15T06:18:15Z

PR_Github #5227 [ run ] completed with state ABORTED

tensorrt-cicd · 2025-05-15T06:27:34Z

PR_Github #5278 [ skip ] completed with state SUCCESS
Skipping testing for commit a0ac332

kaiyux requested review from Funatiq, QiJune, Copilot, hlu1 and nv-yilinf May 9, 2025 05:05

Copilot AI reviewed May 9, 2025

View reviewed changes

kaiyux requested a review from amukkara May 9, 2025 06:13

kaiyux force-pushed the user/kaiyu/def_schedule_overlap branch from 8e4b83f to dfd073f Compare May 9, 2025 09:03

Funatiq approved these changes May 9, 2025

View reviewed changes

hlu1 approved these changes May 9, 2025

View reviewed changes

amukkara approved these changes May 9, 2025

View reviewed changes

kaiyux force-pushed the user/kaiyu/def_schedule_overlap branch from dfd073f to 3c8cf62 Compare May 11, 2025 07:20

kaiyux requested review from HuiGao-NV, dcampora, dongxuy04, lucaslie and suyoggupta as code owners May 11, 2025 07:20

QiJune approved these changes May 12, 2025

View reviewed changes

kaiyux enabled auto-merge (squash) May 14, 2025 01:33

kaiyux force-pushed the user/kaiyu/def_schedule_overlap branch from fdeeb26 to cc02fae Compare May 14, 2025 12:10

lucaslie added the AutoDeploy <NV> AutoDeploy Backend label May 14, 2025

github-project-automation bot added this to AutoDeploy Board May 14, 2025

github-project-automation bot moved this to Backlog in AutoDeploy Board May 14, 2025

lucaslie removed the AutoDeploy <NV> AutoDeploy Backend label May 14, 2025

lucaslie removed this from AutoDeploy Board May 14, 2025

nv-yilinf reviewed May 14, 2025

View reviewed changes

...gregated/test_configs/disagg_config_ctxtp2_gentp2_deepseek_v3_lite_attention_dp_overlap.yaml Outdated Show resolved Hide resolved

kaiyux added 3 commits May 15, 2025 00:53

Fix

949c323

Signed-off-by: Kaiyu Xie <[email protected]>

Fix

a0ac332

Signed-off-by: Kaiyu Xie <[email protected]>

kaiyux force-pushed the user/kaiyu/def_schedule_overlap branch from cc02fae to a0ac332 Compare May 15, 2025 01:06

kaiyux merged commit b4e5df0 into NVIDIA:main May 15, 2025
3 checks passed

rmccorm4 mentioned this pull request May 30, 2025

fix: Update breaking change to enable_overlap_scheduler field from TRTLLM commit b4e5df0e ai-dynamo/dynamo#1310

Merged

Breaking change: perf: Enable scheduling overlap by default #4174

Breaking change: perf: Enable scheduling overlap by default #4174

Uh oh!

Conversation

kaiyux commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kaiyux commented May 9, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

tensorrt-cicd commented May 9, 2025

Uh oh!

tensorrt-cicd commented May 9, 2025

Uh oh!

kaiyux commented May 9, 2025

Uh oh!

tensorrt-cicd commented May 9, 2025

Uh oh!

tensorrt-cicd commented May 9, 2025

Uh oh!

kaiyux commented May 11, 2025

Uh oh!

tensorrt-cicd commented May 11, 2025

Uh oh!

tensorrt-cicd commented May 11, 2025

Uh oh!

QiJune commented May 12, 2025

Uh oh!

tensorrt-cicd commented May 12, 2025

Uh oh!

tensorrt-cicd commented May 12, 2025

Uh oh!

kaiyux commented May 14, 2025

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

kaiyux commented May 14, 2025

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

kaiyux commented May 14, 2025

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

Uh oh!

tensorrt-cicd commented May 14, 2025

Uh oh!

kaiyux commented May 15, 2025

Uh oh!

tensorrt-cicd commented May 15, 2025

Uh oh!

kaiyux commented May 15, 2025

Uh oh!

kaiyux commented May 15, 2025

Uh oh!

tensorrt-cicd commented May 15, 2025

Uh oh!

tensorrt-cicd commented May 15, 2025

Uh oh!

tensorrt-cicd commented May 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

kaiyux commented May 9, 2025 •

edited

Loading