@risemeup1 risemeup1 commented Aug 21, 2024

PR Category

Execute Infrastructure

PR Types

Improvements

Description

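Optimize the compilation of custom operators by switching from serial to parallel compilation. A thread pool is created that uses all available CPU cores to compile all source files in parallel; previously the files were compiled one at a time. Switching to parallel compilation surfaced many build failures. The root cause: in the single-threaded flow, each compile call decides from the file name whether to use nvcc, hipcc, or gcc, and afterwards switches the compiler back to gcc. This is harmless with a single thread, but with multiple threads a .cc file can end up being compiled with nvcc, which causes errors.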
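
One way to avoid the race described above is to pick the compiler per file inside each task, rather than changing a shared compiler setting and restoring it afterwards. The following is a minimal sketch under that assumption, not the code in this PR: `COMPILER_BY_EXT`, `pick_compiler`, `build_command`, and `plan_parallel_build` are hypothetical names, the extension mapping is illustrative (a ROCm build would map to hipcc instead), and the commands are only constructed, never executed.

```python
from __future__ import annotations

import os
from concurrent.futures import ThreadPoolExecutor

# Hypothetical extension-to-compiler table; not Paddle's actual mapping.
COMPILER_BY_EXT = {".cu": "nvcc"}
DEFAULT_COMPILER = "gcc"


def pick_compiler(src: str) -> str:
    """Choose a compiler from the file name alone, without shared state."""
    ext = os.path.splitext(src)[1]
    return COMPILER_BY_EXT.get(ext, DEFAULT_COMPILER)


def build_command(src: str) -> list[str]:
    """Return the command line for one source file (it is not run here)."""
    obj = os.path.splitext(src)[0] + ".o"
    return [pick_compiler(src), "-c", src, "-o", obj]


def plan_parallel_build(sources: list[str]) -> list[list[str]]:
    """Build one command per source in a thread pool sized to the CPU count."""
    workers = os.cpu_count() or 1
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves input order, so results line up with `sources`.
        return list(pool.map(build_command, sources))


if __name__ == "__main__":
    for cmd in plan_parallel_build(["relu.cc", "relu_kernel.cu", "util.cc"]):
        print(" ".join(cmd))
```

Because each task derives its compiler from its own argument, no thread can observe a compiler choice made for a different file, so a .cc file is never handed to nvcc regardless of scheduling order.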

To do
Resolve the incremental compilation issue

pcard-67164

paddle-bot bot commented Aug 21, 2024

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@risemeup1 force-pushed the optimize_custom_op_compile branch from 0ff9e79 to 58019aa on August 21, 2024 19:13
@risemeup1 merged commit 962c8f6 into develop on Aug 22, 2024
@SigureMo deleted the optimize_custom_op_compile branch on September 12, 2024 18:18

3 participants