Skip to content

Conversation

Ubospica
Copy link
Contributor

@Ubospica Ubospica commented May 5, 2025

This PR supports the structural tag in guided decoding.

It also adds support for the previously unsupported response_format parameter in the OpenAI API protocol, including both JSON format and structural tag format.

Signed-off-by: Ubospica [email protected]

Signed-off-by: Ubospica <[email protected]>
@syuoni syuoni requested review from LinPoly, kaiyux and syuoni May 6, 2025 07:09
@syuoni syuoni added Community want to contribute PRs initiated from Community Community Engagement help/insights needed from community labels May 6, 2025
Copy link
Collaborator

@LinPoly LinPoly left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, since this PR also enables guided decoding in trtllm-serve, can you pls add test for both json and structural_tag in chat test?

@LinPoly LinPoly changed the title [feat] Support the Structural Tag in guided decoding feat: Support the Structural Tag in guided decoding May 6, 2025
Ubospica and others added 6 commits May 8, 2025 03:52
Signed-off-by: Ubospica <[email protected]>
Signed-off-by: Ubospica <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
@syuoni
Copy link
Collaborator

syuoni commented May 9, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4620 [ run ] triggered by Bot

Copy link
Contributor Author

@Ubospica Ubospica left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The tests look great. Thanks @syuoni!

Signed-off-by: Enwei Zhu <[email protected]>
@syuoni syuoni requested a review from a team as a code owner May 9, 2025 04:14
@syuoni
Copy link
Collaborator

syuoni commented May 9, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4639 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4620 [ run ] completed with state ABORTED
/LLM/main/L0_MergeRequest_PR pipeline #3331 completed with status: 'FAILURE'

@syuoni syuoni requested a review from juney-nvidia May 9, 2025 05:42
Copy link
Collaborator

@juney-nvidia juney-nvidia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.
Thanks for the contribution!

@syuoni
Copy link
Collaborator

syuoni commented May 9, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4696 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4639 [ run ] completed with state ABORTED

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4696 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3386 completed with status: 'FAILURE'

@syuoni
Copy link
Collaborator

syuoni commented May 10, 2025

/bot run --disable-fail-fast

@syuoni
Copy link
Collaborator

syuoni commented May 10, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4747 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4747 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3426 completed with status: 'FAILURE'

Signed-off-by: Enwei Zhu <[email protected]>
@syuoni syuoni force-pushed the main-dev/2025-05-01-upgrade-xgrammar branch from 1ad65ce to c1e91d2 Compare May 12, 2025 00:39
@syuoni
Copy link
Collaborator

syuoni commented May 12, 2025

/bot run

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4794 [ run ] triggered by Bot

@tensorrt-cicd
Copy link
Collaborator

PR_Github #4794 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #3468 completed with status: 'SUCCESS'

@syuoni
Copy link
Collaborator

syuoni commented May 12, 2025

Thanks @Ubospica for the contribution!

@syuoni syuoni merged commit c90ebad into NVIDIA:main May 12, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Community Engagement help/insights needed from community Community want to contribute PRs initiated from Community

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants