Disable delayload for cuda dlls #3147

ke1337 · 2020-03-05T20:06:10Z

Description: This change fixes #3129.

Motivation and Context

When onnxruntime runs as a DLL on Windows, CUDA performs internal cleanup when the process exits. Any call into CUDA after that point causes a crash. With delayload enabled for the CUDA DLLs, thread_local destructors run after the CUDA cleanup, and their calls into CUDA crash.
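The ordering problem described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: `FakeRuntime`, `PerThreadCache`, and `simulate_exit` are hypothetical names invented for illustration, nothing here calls CUDA or uses the Windows delay-load mechanism, and the real teardown order is decided by the loader rather than by explicit scopes as it is here.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for the CUDA runtime: once shut down, calls are unsafe.
struct FakeRuntime {
    bool alive = true;
    void shutdown() { alive = false; }
    // Returns true if the call reached a live runtime, false otherwise.
    bool free_resource() const { return alive; }
};

// Hypothetical stand-in for a thread_local object whose destructor
// releases resources by calling into the runtime.
struct PerThreadCache {
    FakeRuntime* rt;
    std::vector<std::string>* log;

    PerThreadCache(FakeRuntime* r, std::vector<std::string>* l) : rt(r), log(l) {
        assert(rt != nullptr && log != nullptr);
    }

    ~PerThreadCache() {
        // In the real scenario this would be a crash, not a log entry.
        log->push_back(rt->free_resource() ? "cleanup ok"
                                           : "call after runtime shutdown");
    }
};

// Simulates process exit with the two possible orderings.
std::vector<std::string> simulate_exit(bool runtime_torn_down_first) {
    FakeRuntime rt;
    std::vector<std::string> log;
    if (runtime_torn_down_first) {
        // Ordering the PR describes with delayload: runtime cleanup happens
        // first, then the destructor runs.
        PerThreadCache cache(&rt, &log);
        rt.shutdown();
    } else {
        // Ordering the PR aims for: the destructor runs while the runtime
        // is still alive, then the runtime cleans up.
        {
            PerThreadCache cache(&rt, &log);
        }
        rt.shutdown();
    }
    return log;
}
```

Calling `simulate_exit(true)` records the unsafe call, while `simulate_exit(false)` records a clean release; the PR's change is intended to put the real process in the second situation.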

yufenglee · 2020-03-05T22:23:25Z

Do we also need to remove the delay load in C#?

onnxruntime/csharp/src/Microsoft.ML.OnnxRuntime/SessionOptions.cs

Line 40 in 4188b11

    
           private static string[] cudaDelayLoadedLibs = { "cublas64_10.dll", "cudnn64_7.dll", "curand64_10.dll" };

* Publish release symbols (#3152)
  * Publish release symbols
  * Publish symbols if IsReleaseBuild
* Disable delayload for cuda dlls (#3147)
  This change fixes #3129. When running onnxruntime as dll on Windows, CUDA does some internal cleanups when process exits. After this, any call to CUDA would cause crash. Delayload makes thread_local destructor to happen after CUDA cleanup, thus the crash.
* Update Gelu Fusion to support new graph pattern from PyTorch 1.4 (#3148)
  * update GeluFusion to support pattern from PyTorch 1.4;
  * Fix a bug that missing the check of an edge between mul2 and root.
  * update script to fuse gelu from PyTorch 1.4
  * Add test for python optimizer

Co-authored-by: Tiago Koji Castro Shibata <[email protected]>
Co-authored-by: KeDengMS <[email protected]>
Co-authored-by: Tianlei Wu <[email protected]>

ke1337 requested a review from a team as a code owner March 5, 2020 20:06

ke1337 closed this Mar 5, 2020

ke1337 force-pushed the kedeng/bug3129 branch from e29c653 to 2c446a7 Compare March 5, 2020 20:15

ke1337 reopened this Mar 5, 2020

snnn approved these changes Mar 5, 2020

ke1337 merged commit ade4fa1 into master Mar 5, 2020

ke1337 deleted the kedeng/bug3129 branch March 5, 2020 22:40

yufenglee mentioned this pull request Mar 6, 2020

Cherry pick 3 fixes to rel-1.2.0 #3158

Merged
