[C++][Python] Segfault during `pyarrow.dataset.write_dataset` with dataset source read with pre_buffer=True

### Describe the bug, including details regarding any error messages, version, and platform.

Please see https://github.com/voltrondata-labs/arrow-benchmarks-ci/issues/166 for further detail.

Since https://github.com/apache/arrow/pull/37854, the arrow-benchmarks-ci runners have been attempting to run the `dataset-serialize` [benchmark](https://github.com/voltrondata-labs/benchmarks/blob/93a57ba1019f65a4cea124d17772b0e8c60ac5ce/benchmarks/dataset_serialize_benchmark.py) on an x86_64 Ubuntu runner using Python 3.8. Each time, somewhere between 0 and 3 cases succeed before we see `Fatal Python error: Segmentation fault`.

### Component(s)

Benchmarking, Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[C++][Python] Segfault during `pyarrow.dataset.write_dataset` with dataset source read with pre_buffer=True #38438

Describe the bug, including details regarding any error messages, version, and platform.

Component(s)

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[C++][Python] Segfault during pyarrow.dataset.write_dataset with dataset source read with pre_buffer=True #38438

Description

Describe the bug, including details regarding any error messages, version, and platform.

Component(s)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

[C++][Python] Segfault during `pyarrow.dataset.write_dataset` with dataset source read with pre_buffer=True #38438