[C++] Create "engine" module for the query engine #30735

@asfimport

Description

Circular dependencies are popping up in the query engine because it lives in the compute module, which is very low level. For example, it would be nice if the default registry included the scan node and the dataset write node, but those depend on the datasets module, which itself depends on compute. We will also want to add spillover support at some point, and that will rely on parquet/dataset operations.

We should create a dedicated engine module which includes the query plans, the nodes, etc. This module would not contain the kernels or other low level compute primitives. This way we could have something like...

engine -> datasets (for scanning) -> parquet -> compute (for calculating statistics)

The base ExecPlan itself could go in either compute or engine, whichever causes the least friction.
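
To make the layering argument concrete, here is a minimal sketch that models the modules as a dependency graph and checks it for cycles. The module names come from the chain above; `DepGraph`, `HasCycle`, `ProposedLayering`, and `NodesInCompute` are hypothetical names used only for this illustration and are not part of Arrow or its build system.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

// Module name -> modules it depends on. Hypothetical model for illustration;
// nothing here is read from Arrow's actual build files.
using DepGraph = std::map<std::string, std::vector<std::string>>;

// Depth-first search; returns true if a cycle is reachable from `node`.
inline bool VisitModule(const DepGraph& graph, const std::string& node,
                        std::set<std::string>& in_progress,
                        std::set<std::string>& done) {
  if (done.count(node) > 0) return false;             // already fully explored
  if (!in_progress.insert(node).second) return true;  // back edge: cycle
  auto it = graph.find(node);
  if (it != graph.end()) {
    for (const std::string& dep : it->second) {
      if (VisitModule(graph, dep, in_progress, done)) return true;
    }
  }
  in_progress.erase(node);
  done.insert(node);
  return false;
}

// Returns true if any module transitively depends on itself.
inline bool HasCycle(const DepGraph& graph) {
  std::set<std::string> in_progress;
  std::set<std::string> done;
  for (const auto& entry : graph) {
    if (VisitModule(graph, entry.first, in_progress, done)) return true;
  }
  return false;
}

// The layering proposed in this issue:
// engine -> datasets -> parquet -> compute.
inline DepGraph ProposedLayering() {
  DepGraph g;
  g["engine"] = {"datasets"};
  g["datasets"] = {"parquet"};
  g["parquet"] = {"compute"};
  g["compute"] = std::vector<std::string>{};
  return g;
}

// The situation the issue wants to avoid: the plan nodes stay in compute,
// so registering the scan node by default makes compute depend on datasets.
inline DepGraph NodesInCompute() {
  DepGraph g;
  g["datasets"] = {"parquet"};
  g["parquet"] = {"compute"};
  g["compute"] = {"datasets"};
  return g;
}
```

Under these assumptions, `HasCycle(ProposedLayering())` is false, while `HasCycle(NodesInCompute())` is true because compute would reach itself through datasets and parquet.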

Reporter: Weston Pace / @westonpace
Assignee: Jeroen van Straten / @jvanstraten

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-15238. Please see the migration documentation for further details.
