Skip to content

[R] Enable write_csv_arrow to take Dataset or arrow_dplyr_query as input #30557

@asfimport

Description

@asfimport

Currently, this code fails:

dataset <- open_dataset("some/folder/with/parquet/files")
write_csv_arrow(dataset, sink = "dataset.csv")

with this error message:

Error: x must be an object of class 'data.frame', 'RecordBatch', or 'Table', not 'FileSystemDataset'.

In ARROW-14741, support was added for reading from a RecordBatchReader, so we should be able to now extend write_csv_arrow() to allow this behaviour.

 

Note: We would need to make sure whatever write_csv(record_batch_reader) function can take a filesystem= argument

Reporter: Nicola Crane / @thisisnic
Assignee: Nicola Crane / @thisisnic

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-15040. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions