Skip to content

Conversation

@Wennie396
Copy link
Contributor

@Wennie396 Wennie396 commented Oct 12, 2024

PR Category

Auto Parallel

PR Types

Bug fixes

Description

Pcard-76459
静半需要和动手精度对齐,grad clip中param累加的顺序需要保证跟动手相同,因此需要给优化器设置优化的param_list,但是经过fuse_qkv_ffn pass之后Program中qkv和ffn的matmul算子被融合,param也更换名称,而engine._parameter_name_list从没有fuse的动态图获取param_name_list,导致最终优化器优化的param里面没有fuse过后的param

@paddle-bot
Copy link

paddle-bot bot commented Oct 12, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copy link
Contributor

@From00 From00 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@From00 From00 merged commit 2af6ab7 into PaddlePaddle:develop Oct 14, 2024
26 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants