docs:update 0.19 doc. #4120

nv-guomingz · 2025-05-07T10:47:06Z

PR title

Please write the PR title by following template:

[JIRA ticket link/nvbug link/github issue link][fix/feat/doc/infra/...] <summary of this PR>

For example, assume I have a PR hope to support a new feature about cache manager of Jira TRTLLM-1000 ticket, it would be like

[TRTLLM-1000][feat] Support a new feature about cache manager

Description

Please explain the issue and the solution in short.

Test Coverage

GitHub Bot Help

/bot [-h] ['run', 'kill', 'skip', 'reuse-pipeline'] ...

Provide a user friendly way for developers to interact with a Jenkins server.

Run /bot [-h|--help] to print this help message.

See details below for each supported subcommand.

run [--disable-fail-fast --skip-test --stage-list "A10-1, xxx" --gpu-type "A30, H100_PCIe" --add-multi-gpu-test --only-multi-gpu-test --disable-multi-gpu-test --post-merge --extra-stage "H100_PCIe-[Post-Merge]-1, xxx"]

Launch build/test pipelines. All previously running jobs will be killed.

--disable-fail-fast (OPTIONAL) : Disable fail fast on build/tests/infra failures.

--skip-test (OPTIONAL) : Skip all test stages, but still run build stages, package stages and sanity check stages. Note: Does NOT update GitHub check status.

--stage-list "A10-1, xxx" (OPTIONAL) : Only run the specified test stages. Examples: "A10-1, xxx". Note: Does NOT update GitHub check status.

--gpu-type "A30, H100_PCIe" (OPTIONAL) : Only run the test stages on the specified GPU types. Examples: "A30, H100_PCIe". Note: Does NOT update GitHub check status.

--only-multi-gpu-test (OPTIONAL) : Only run the multi-GPU tests. Note: Does NOT update GitHub check status.

--disable-multi-gpu-test (OPTIONAL) : Disable the multi-GPU tests. Note: Does NOT update GitHub check status.

--add-multi-gpu-test (OPTIONAL) : Force run the multi-GPU tests. Will also run L0 pre-merge pipeline.

--post-merge (OPTIONAL) : Run the L0 post-merge pipeline instead of the ordinary L0 pre-merge pipeline.

--extra-stage "H100_PCIe-[Post-Merge]-1, xxx" (OPTIONAL) : Run the ordinary L0 pre-merge pipeline and specified test stages. Examples: --extra-stage "H100_PCIe-[Post-Merge]-1, xxx".

kill

kill

Kill all running builds associated with pull request.

skip

skip --comment COMMENT

Skip testing for latest commit on pull request. --comment "Reason for skipping build/test" is required. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

reuse-pipeline

reuse-pipeline

Reuse a previous pipeline to validate current commit. This action will also kill all currently running builds associated with the pull request. IMPORTANT NOTE: This is dangerous since lack of user care and validation can cause top of tree to break.

Signed-off-by: nv-guomingz <[email protected]>

nv-guomingz · 2025-05-07T10:49:15Z

/bot skip --comment "docs change"

tensorrt-cicd · 2025-05-07T10:54:46Z

PR_Github #4369 [ skip ] triggered by Bot

tensorrt-cicd · 2025-05-07T11:00:06Z

PR_Github #4369 [ skip ] completed with state SUCCESS
Skipping testing for commit a7575cc

Signed-off-by: nv-guomingz <[email protected]>

* fix: Fix/fused moe 0.19 (#3799) * fix bug of stream init Signed-off-by: bhsueh <[email protected]> * fix bug Signed-off-by: bhsueh <[email protected]> --------- Signed-off-by: bhsueh <[email protected]> * fix: Add pre-download of checkpoint before benchmark. (#3772) * Add pre-download of checkpoint before benchmark. Signed-off-by: Frank Di Natale <[email protected]> * Add missing remote code flag. Signed-off-by: Frank Di Natale <[email protected]> * Move from_pretrained to throughput benchmark. Signed-off-by: Frank Di Natale <[email protected]> * Move download and use snapshot_download. Signed-off-by: Frank Di Natale <[email protected]> * Removed trusted flag. Signed-off-by: Frank Di Natale <[email protected]> * Fix benchmark command in iteration log test. Signed-off-by: Frank Di Natale <[email protected]> --------- Signed-off-by: Frank Di Natale <[email protected]> * [https://nvbugspro.nvidia.com/bug/5241495][fix] CUDA Graph padding with overlap scheduler (#3839) * fix Signed-off-by: Enwei Zhu <[email protected]> * fuse Signed-off-by: Enwei Zhu <[email protected]> * fix Signed-off-by: Enwei Zhu <[email protected]> * fix Signed-off-by: Enwei Zhu <[email protected]> --------- Signed-off-by: Enwei Zhu <[email protected]> * TRTLLM-4875 feat: Add version switcher to doc (#3871) Signed-off-by: Kaiyu Xie <[email protected]> * waive a test (#3897) Signed-off-by: Superjomn <[email protected]> * docs:fix https://nvbugs/5244616 by removing new invalid links. (#3939) Signed-off-by: nv-guomingz <[email protected]> Co-authored-by: nv-guomingz <[email protected]> * fix: remote mpi session abort (#3884) * fix remote mpi session Signed-off-by: Superjomn <[email protected]> * fix Signed-off-by: Superjomn <[email protected]> --------- Signed-off-by: Superjomn <[email protected]> * skip fp8 gemm for pre-hopper (#3931) Signed-off-by: Ivy Zhang <[email protected]> * [https://nvbugspro.nvidia.com/bug/5247148][fix] Attention DP with overlap scheduler (#3975) * fix Signed-off-by: Enwei Zhu <[email protected]> * update multigpu list Signed-off-by: Enwei Zhu <[email protected]> * fix namings Signed-off-by: Enwei Zhu <[email protected]> --------- Signed-off-by: Enwei Zhu <[email protected]> * Doc: Fix H200 DeepSeek R1 perf doc (#4006) * fix doc Signed-off-by: jiahanc <[email protected]> * update perf number Signed-off-by: jiahanc <[email protected]> --------- Signed-off-by: jiahanc <[email protected]> * Fix the perf regression caused by insufficient cache warmup. (#4042) Force tuning up to 8192 sequence length for NVFP4 linear op. Also, make this runtime-selectable with UB enabled. Signed-off-by: Yukun He <[email protected]> * doc: Update 0.19.0 release notes (#3976) Signed-off-by: Kaiyu Xie <[email protected]> * Optimize the AutoTuner cache access code to reduce host code overhead. (#4060) The NVFP4 Linear op is very sensitive to the host overhead. This PR introduces customizable `find_nearest_profile` and `get_cache_key_specifc`, which allow users to override the default method for generating the cache key. Signed-off-by: Yukun He <[email protected]> * Update switcher (#4098) Signed-off-by: Kaiyu Xie <[email protected]> * doc: update release notes (#4108) Signed-off-by: Kaiyu Xie <[email protected]> * docs:update 0.19 doc. (#4120) Signed-off-by: nv-guomingz <[email protected]> * docs:add torch flow supported model list. (#4129) Signed-off-by: nv-guomingz <[email protected]> * doc: Release V0.19 Perf Overview Update (#4166) Signed-off-by: zpatel <[email protected]> * Fix readme of autodeploy. Signed-off-by: Daniel Campora <[email protected]> * Update tensorrt_llm/_torch/pyexecutor/llm_request.py Co-authored-by: Enwei Zhu <[email protected]> Signed-off-by: Daniel Cámpora <[email protected]> * Revert mgmn worker node. Signed-off-by: Daniel Campora <[email protected]> * Change to disable_overlap_scheduler. Signed-off-by: Daniel Campora <[email protected]> --------- Signed-off-by: bhsueh <[email protected]> Signed-off-by: Frank Di Natale <[email protected]> Signed-off-by: Enwei Zhu <[email protected]> Signed-off-by: Kaiyu Xie <[email protected]> Signed-off-by: Superjomn <[email protected]> Signed-off-by: nv-guomingz <[email protected]> Signed-off-by: Ivy Zhang <[email protected]> Signed-off-by: jiahanc <[email protected]> Signed-off-by: Yukun He <[email protected]> Signed-off-by: nv-guomingz <[email protected]> Signed-off-by: zpatel <[email protected]> Signed-off-by: Daniel Campora <[email protected]> Signed-off-by: Daniel Cámpora <[email protected]> Co-authored-by: bhsueh_NV <[email protected]> Co-authored-by: Frank <[email protected]> Co-authored-by: Enwei Zhu <[email protected]> Co-authored-by: Kaiyu Xie <[email protected]> Co-authored-by: Yan Chunwei <[email protected]> Co-authored-by: nv-guomingz <[email protected]> Co-authored-by: nv-guomingz <[email protected]> Co-authored-by: Ivy Zhang <[email protected]> Co-authored-by: jiahanc <[email protected]> Co-authored-by: Yukun He <[email protected]> Co-authored-by: Zac Patel <[email protected]>

nv-guomingz requested a review from a team as a code owner May 7, 2025 10:47

docs:update 0.19 doc.

a7575cc

Signed-off-by: nv-guomingz <[email protected]>

nv-guomingz force-pushed the user/guomingz/doc_fix_0.19 branch from 868b546 to a7575cc Compare May 7, 2025 10:49

nv-guomingz enabled auto-merge (squash) May 7, 2025 10:49

chzblych approved these changes May 7, 2025

View reviewed changes

nv-guomingz merged commit 00955ce into NVIDIA:release/0.19 May 7, 2025
3 checks passed

nv-guomingz deleted the user/guomingz/doc_fix_0.19 branch May 7, 2025 14:29

dcampora pushed a commit to dcampora/tensorrt_llm that referenced this pull request May 16, 2025

docs:update 0.19 doc. (NVIDIA#4120)

4afc1ce

Signed-off-by: nv-guomingz <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs:update 0.19 doc. #4120

docs:update 0.19 doc. #4120

Uh oh!

nv-guomingz commented May 7, 2025

Uh oh!

nv-guomingz commented May 7, 2025

Uh oh!

tensorrt-cicd commented May 7, 2025

Uh oh!

tensorrt-cicd commented May 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

docs:update 0.19 doc. #4120

docs:update 0.19 doc. #4120

Uh oh!

Conversation

nv-guomingz commented May 7, 2025

PR title

Description

Test Coverage

GitHub Bot Help

kill

skip

reuse-pipeline

Uh oh!

nv-guomingz commented May 7, 2025

Uh oh!

tensorrt-cicd commented May 7, 2025

Uh oh!

tensorrt-cicd commented May 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants