[Inference] save_optimized_model_pass support tensorrt #55893

yuanlehome · 2023-08-01T14:45:26Z

PR types

New features

PR changes

Others

Description

This PR does the following:

save_optimized_model_pass now supports TensorRT. CPU/GPU/XPU support was already added and verified in [Inference] Save optimized model by pass #53696 and [Inference] save_optimized_model_pass support gpu #55551.
Package all non-runtime parameters of TensorRTEngine uniformly into ConstructionParams.
Support loading the TRT engine at runtime (when the tensorrt_engine op runs).
Remove some unused functions and variables in TRT-related code.
Refine some unit tests related to TRT.
Add unit tests for save_optimized_model_pass (GPU/TRT).
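
To illustrate the idea of packaging non-runtime parameters into one struct, here is a minimal sketch. It is not the code from this PR: the field names, default values, and the `SketchEngine` class are hypothetical, chosen only to show the pattern of passing one parameter object to the constructor instead of a long argument list.

```cpp
#include <cstdint>
#include <string>
#include <utility>

// Hypothetical stand-in for ConstructionParams. The real struct in Paddle
// may have different fields; these names are assumptions for illustration.
struct ConstructionParams {
  int max_batch_size = 1;                      // assumed field
  std::int64_t max_workspace_size = 1 << 30;   // assumed field, in bytes
  bool use_static = false;                     // assumed field
  std::string engine_key;                      // assumed field
};

// Hypothetical engine class: it takes every build-time setting through a
// single parameter object and keeps a copy for later inspection.
class SketchEngine {
 public:
  explicit SketchEngine(ConstructionParams params)
      : params_(std::move(params)) {}

  const ConstructionParams& params() const { return params_; }

 private:
  ConstructionParams params_;
};
```

With this shape, adding a new build-time option means adding one field to the struct, and call sites that do not care about it stay unchanged.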

How to use:

//
// Step 1: first run
//
// other code
config.SetModel(/*model dir*/);
// other code
config.EnableUseGpu(...); // for gpu
config.EnableTensorRtEngine(..., use_static=true, ...); // for tensorrt
// other code
config.SwitchIrOptim(true);
config.SetOptimCacheDir("./optim_cache");
config.EnableSaveOptimModel(true);
// other code
auto predictor = CreatePredictor(config);

// The optimized model will be saved to ./optim_cache/_optimized.*

//
// Step 2: second run
//
// other code
config.SetModel(/*optimized model dir*/);
// other code
config.EnableUseGpu(...); // for gpu
config.EnableTensorRtEngine(..., use_static=true, ...); // for tensorrt
// other code
config.SwitchIrOptim(false);
// other code
auto predictor = CreatePredictor(config);
// other code
predictor->Run();
// other code

Note:

EnableTensorRtEngine(..., use_cuda_graph=true) is not supported.

Others

Pcard-71500

paddle-bot · 2023-08-01T14:45:33Z

Your PR has been submitted. Thanks for your contribution to the open-source project!
Please wait for the CI results first. See the Paddle CI Manual for details.

XieYunshen

LGTM for set_tests_properties(test_save_optimized_model_pass PROPERTIES TIMEOUT 300)

Aurelius84

LGTM for ENFORCE MSG

zhangjun

LGTM

XiaoguangHu01

LGTM

…55893) * fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward * save_optimized_model_pass support tensorrt * update * update * fix compile * update * fix ut timeout

yuanlehome added 7 commits July 13, 2023 08:15

fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward

e41ee6d

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop

9ecefe9

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop

d092636

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop

b74a6df

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop

9ffdf8e

Merge branch 'develop' of https://github.com/yuanlehome/Paddle into develop

2ebb74e

save_optimized_model_pass support tensorrt

813ddc4

yuanlehome force-pushed the save_optimized_model_support_trt branch from 0aea06a to b7cc00c Compare August 2, 2023 06:46

PaddlePaddle locked and limited conversation to collaborators Aug 2, 2023

PaddlePaddle unlocked this conversation Aug 2, 2023

update

ee94157

yuanlehome force-pushed the save_optimized_model_support_trt branch from fc4104d to ee94157 Compare August 2, 2023 09:45

yuanlehome added 2 commits August 2, 2023 14:09

update

87d6828

fix compile

f70a5ca

yuanlehome force-pushed the save_optimized_model_support_trt branch from 282aa9b to 74830e2 Compare August 4, 2023 08:48

update

7fd1336

yuanlehome force-pushed the save_optimized_model_support_trt branch from 3a97204 to 7fd1336 Compare August 4, 2023 17:12

fix ut timeout

8ec9593

XieYunshen approved these changes Aug 7, 2023

Aurelius84 approved these changes Aug 7, 2023

zhangjun approved these changes Aug 7, 2023

XiaoguangHu01 approved these changes Aug 7, 2023

yuanlehome merged commit 6b10c0e into PaddlePaddle:develop Aug 7, 2023

vivienfanghuagood mentioned this pull request Jan 30, 2024

When running inference with C++ Paddle Inference, is a file similar to .engine produced, so that the next run does not have to repeat the .pdmodel model conversion? PaddlePaddle/Paddle-Inference-Demo#408

Closed
