-
Notifications
You must be signed in to change notification settings - Fork 1.8k
refactor: Allow models to override apply_qk_norm. #4078
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
/bot run --disable-fail-fast |
PR_Github #4152 [ run ] triggered by Bot |
/bot run --disable-fail-fast |
PR_Github #4172 [ run ] triggered by Bot |
PR_Github #4152 [ run ] completed with state |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good on qwen3 side.
PR_Github #4172 [ run ] completed with state |
/bot run --disable-fail-fast |
PR_Github #4301 [ run ] triggered by Bot |
/bot run --disable-fail-fast |
PR_Github #4303 [ run ] triggered by Bot |
PR_Github #4301 [ run ] completed with state |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/bot run --disable-fail-fast |
PR_Github #4330 [ run ] triggered by Bot |
PR_Github #4303 [ run ] completed with state |
PR_Github #4330 [ run ] completed with state |
We should add more attention unit tests to test these features. Relying on e2e accuracy tests is too heavy and it takes time to debug the accuracy issues. |
/bot run --disable-fail-fast |
1 similar comment
/bot run --disable-fail-fast |
PR_Github #4462 [ run ] triggered by Bot |
PR_Github #4462 [ run ] completed with state |
/bot run --disable-fail-fast |
PR_Github #4508 [ run ] triggered by Bot |
PR_Github #4508 [ run ] completed with state |
/bot run --disable-fail-fast |
PR_Github #4595 [ run ] triggered by Bot |
PR_Github #4595 [ run ] completed with state |
Signed-off-by: Yuxian Qiu <[email protected]>
Signed-off-by: Yuxian Qiu <[email protected]>
/bot run --disable-fail-fast |
PR_Github #4811 [ run ] triggered by Bot |
Looks good on qwen3. |
PR_Github #4811 [ run ] completed with state |
No description provided.