Skip to content

Conversation

@zhupengyang
Copy link
Contributor

PR types

Performance optimization

PR changes

Others

Describe

  • beam_size=1 情况下,优化部分 write_to_array, read_from_array, gather 算子

@paddle-bot
Copy link

paddle-bot bot commented Apr 20, 2023

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

MemcpySyncH2D 中会调用 dev_ctx->Wait(); 所以这里的wait是不需要的

@zhupengyang zhupengyang force-pushed the xpu_remove_small_ops branch from 362c3aa to fbe1fa0 Compare April 20, 2023 09:26
Copy link
Contributor

@hong19860320 hong19860320 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zhupengyang zhupengyang merged commit e8e9d6c into PaddlePaddle:develop Apr 21, 2023
@zhupengyang zhupengyang deleted the xpu_remove_small_ops branch April 21, 2023 03:55
lijialin03 pushed a commit to lijialin03/Paddle that referenced this pull request Apr 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants