-
Notifications
You must be signed in to change notification settings - Fork 1.8k
feat: Support the Structural Tag in guided decoding #4066
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Support the Structural Tag in guided decoding #4066
Conversation
Signed-off-by: Ubospica <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, since this PR also enables guided decoding in trtllm-serve
, can you pls add test for both json and structural_tag in chat test?
Signed-off-by: Ubospica <[email protected]>
Signed-off-by: Ubospica <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
Signed-off-by: Enwei Zhu <[email protected]>
/bot run |
PR_Github #4620 [ run ] triggered by Bot |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The tests look great. Thanks @syuoni!
Signed-off-by: Enwei Zhu <[email protected]>
/bot run |
PR_Github #4639 [ run ] triggered by Bot |
PR_Github #4620 [ run ] completed with state |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
Thanks for the contribution!
/bot run |
PR_Github #4696 [ run ] triggered by Bot |
PR_Github #4639 [ run ] completed with state |
PR_Github #4696 [ run ] completed with state |
/bot run --disable-fail-fast |
/bot run |
PR_Github #4747 [ run ] triggered by Bot |
PR_Github #4747 [ run ] completed with state |
Signed-off-by: Enwei Zhu <[email protected]>
1ad65ce
to
c1e91d2
Compare
/bot run |
PR_Github #4794 [ run ] triggered by Bot |
PR_Github #4794 [ run ] completed with state |
Thanks @Ubospica for the contribution! |
This PR supports the structural tag in guided decoding.
It also adds support for the previously unsupported response_format parameter in the OpenAI API protocol, including both JSON format and structural tag format.
Signed-off-by: Ubospica [email protected]