@gitliuyf
Contributor

PR Category

Performance Optimization

PR Types

New features

Description

Adds a new fused operator that combines layer_norm and relu. In our measurements on the XPU platform, the fused operator improves performance.
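For reviewers who want the intended semantics spelled out, here is a minimal NumPy sketch of what an unfused layer_norm followed by relu computes. It is only a reference for the math, not the XPU kernel from this PR; the function name `layer_norm_relu_reference` and the parameters `epsilon` and `begin_norm_axis` are assumptions chosen for illustration.

```python
import numpy as np


def layer_norm_relu_reference(x, scale, bias, epsilon=1e-5, begin_norm_axis=1):
    """Unfused reference: layer_norm over the trailing axes, then relu.

    Hypothetical helper for illustration only; not part of Paddle.
    """
    # Flatten to 2-D: rows are independent samples, columns are normalized together.
    rows = int(np.prod(x.shape[:begin_norm_axis]))
    flat = x.reshape(rows, -1)

    # Per-row mean and (biased) variance, as in standard layer normalization.
    mean = flat.mean(axis=1, keepdims=True)
    var = flat.var(axis=1, keepdims=True)
    normed = (flat - mean) / np.sqrt(var + epsilon)

    # Affine transform, then relu clamps negatives to zero.
    out = normed * scale.reshape(1, -1) + bias.reshape(1, -1)
    return np.maximum(out, 0.0).reshape(x.shape)
```

A fused kernel would produce the same values in a single pass, avoiding the intermediate tensor between the two operators.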

@paddle-bot paddle-bot bot added the contributor External developers label Sep 25, 2024
@CLAassistant

CLAassistant commented Sep 26, 2024

CLA assistant check
All committers have signed the CLA.

@paddle-ci-bot

paddle-ci-bot bot commented Oct 5, 2024

Sorry to inform you that the CIs for 7695d5e passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.

@gitliuyf gitliuyf force-pushed the develop branch 6 times, most recently from 956209b to 6eb5f41 on October 10, 2024 09:02
@newway
Contributor

newway commented Oct 11, 2024

lgtm

@gitliuyf gitliuyf force-pushed the develop branch 7 times, most recently from a1fff89 to 42c442f on October 15, 2024 06:34

if __name__ == "__main__":
    np.random.seed(200)
    unittest.main()
Contributor

from paddle.framework import core
if core.get_xpu_device_version(0) == core.XPUVersion.XPU2:
    unittest.main()

Contributor Author

done


from paddle.framework import core


Contributor

@unittest.skipIf(
    not core.get_xpu_device_version(0) == core.XPUVersion.XPU2,
    "XpuLayerNormReluFuse only support XPU2",
)

Contributor Author

done
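To illustrate the `unittest.skipIf` suggestion in this thread, here is a small self-contained sketch of a class-level skip. It does not import Paddle: `get_device_version` is a hypothetical stand-in for `core.get_xpu_device_version(0)`, and the string values stand in for `core.XPUVersion` members, so treat it as an assumption-laden illustration rather than the test added in this PR.

```python
import unittest


def get_device_version():
    # Hypothetical stand-in for core.get_xpu_device_version(0);
    # a real test would query Paddle instead of returning a constant.
    return "XPU3"


@unittest.skipIf(
    get_device_version() != "XPU2",
    "fused pass is only exercised on XPU2 in this sketch",
)
class TestFusePassSketch(unittest.TestCase):
    def test_placeholder(self):
        # Would hold the real pass checks; never runs when the class is skipped.
        self.assertTrue(True)


def run_suite():
    # Run the test case programmatically and return the collected result.
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestFusePassSketch)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

Compared with guarding `unittest.main()` behind an `if`, the decorator reports the tests as skipped along with the reason, instead of silently running nothing on unsupported devices.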

@gitliuyf gitliuyf force-pushed the develop branch 3 times, most recently from 36380fe to dff6f2d on October 16, 2024 10:10
@hong19860320 hong19860320 changed the title from "[xpu]: add layernorm_relu pass and kernel ;test=develop" to "[XPU] Add layernorm_relu pass and kernel" Oct 17, 2024
@hong19860320 hong19860320 merged commit b9532d4 into PaddlePaddle:develop Oct 17, 2024
27 checks passed
gitliuyf added a commit to gitliuyf/Paddle that referenced this pull request Oct 17, 2024
gitliuyf added a commit to gitliuyf/Paddle that referenced this pull request Oct 18, 2024
@houj04 houj04 added the XPU label Oct 31, 2024

Labels

contributor External developers XPU


8 participants