Skip to content

Allow Inserts to Partitioned Listing Table #7744

@devinjdangelo

Description

@devinjdangelo

Is your feature request related to a problem or challenge?

It is currently unsupported to run an insert into query for a listing table which is partitioned by a column.

Describe the solution you'd like

  1. ListingTable must be able to inject into FileSinkExec a distribution requirement that the incoming RecordBatchStreams are partitioned by the correct columns
  2. Each FileSink must be able to identify which stream belongs to which partition and create the appropriate ObjectStore writer

For 2, unless there is a slick solution FileSink could simply peak at each stream before initializing a writer.

Describe alternatives you've considered

No response

Additional context

Progress on inserts to sorted tables may be relevant https://github.com/apache/arrow-datafusion/pull/7743/files

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions