Skip to content

Conversation

@penPenf28
Copy link
Contributor

@penPenf28 penPenf28 commented Jul 16, 2024

PR Category

Inference

PR Types

New features

Description

Pcard-71501

新增Fused MoE融合优化,目前支持float16 bfloat16,后续增加wint8 wint4

@paddle-bot
Copy link

paddle-bot bot commented Jul 16, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Jul 16, 2024
@penPenf28 penPenf28 changed the title [Feature] Fused MoE Support. [Feature] Fused MoE Support Jul 22, 2024
@penPenf28 penPenf28 changed the title [Feature] Fused MoE Support Fused MoE Support Jul 23, 2024
Copy link
Contributor

@vivienfanghuagood vivienfanghuagood left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bfloat16需要加上编译控制吧,sm70的设备会编译报错?

@penPenf28
Copy link
Contributor Author

bfloat16需要加上编译控制吧,sm70的设备会编译报错?

暂时先移除了bf16 wint8 wint4,后续组网完成后进行支持

yuanlehome
yuanlehome previously approved these changes Jul 30, 2024
Copy link
Contributor

@XiaoguangHu01 XiaoguangHu01 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yuanlehome yuanlehome merged commit ff5ba2f into PaddlePaddle:develop Aug 1, 2024
Lans1ot pushed a commit to Lans1ot/Paddle that referenced this pull request Aug 5, 2024
* [Feature] Fused MoE Support.

* [Bugfix] Fix compile error.

* [Bugfix] fix test error and compile error.

* [Bugfix] fix compile error.

* [Bugfix] remove bf16

* [Bugfix] fix compile error

* [Bugfix] deal with float16

* [Bugfix] deal with float16

* [Bugfix] skip test if not gpu

* [Refactor] add doc explain.

* [Refactor] modify  doc explain.

* [Feature] add wint support

* [Refactor] Refine doc.

* [Refactor] fix CI problem

* [Bugfix] fix error

* [Feature] add bf16

* [Refactor] fix bf16 compile flag and docs

* [Refactor] remove redundant code

* [Refactor] refine code

* [Refactor] doc refine

* [Refactor] doc refine2

* [Bugfix] refine doc and code
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

contributor External developers

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants