Skip to content

ElemwiseGradCompute is non-deterministic #10122

@panyx0718

Description

@panyx0718

ElemwiseGradCompute

With the same inputs, dy can be different at different times.

x: 3398586.250000
y: 0.000000
out: 3398586.250000
E0422 12:33:49.270020 35231 elementwise_op_function.h:524] 2, 32, 220, 220 32
dout: 138643.281250
dx: 138643.281250
dy: 246.410400

x: 3398586.250000
y: 0.000000
out: 3398586.250000
E0422 12:33:52.696183 35231 elementwise_op_function.h:524] 2, 32, 220, 220 32
dout: 138643.281250
dx: 138643.281250
dy: 264.202179

reproduce
https://github.com/panyx0718/Paddle/tree/qingqing01-parallel_do_and_exe_compare2
CUDA_VISIBLE_DEVICES=3 ctest -R test_parallel_executor_grad

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions