Skip to content

[C++][Python] Segfault during pyarrow.dataset.write_dataset with dataset source read with pre_buffer=True #38438

@austin3dickey

Description

@austin3dickey

Describe the bug, including details regarding any error messages, version, and platform.

Please see voltrondata-labs/arrow-benchmarks-ci#166 for further detail.

Since #37854, the arrow-benchmarks-ci runners have been attempting to run the dataset-serialize benchmark on an x86_64 Ubuntu runner using Python 3.8. Each time, somewhere between 0 and 3 cases succeed before we see Fatal Python error: Segmentation fault.

Component(s)

Benchmarking, Python

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions