Generating schemas from arbitrary map[string]interface{} (parquet, avro) #1353

@loicalleyne

Description

When reading from a source (e.g. avro-ocf, parquet) whose reader outputs a map[string]interface{}, it would be great if it weren't necessary to redefine the output schema by hand. Instead, the input schema could be converted to the equivalent schema for the output writer.

Could this be done by iterating over the map and using type assertions to assemble the schema for different writers? Configurable filters/regexes could then map combinations of field name and primitive type to logical types (e.g. field-name: (event[[:graph:]]*) type:INT64 logical-type:TIMESTAMP unit:MILLIS).

Use case brainstorm:
data stream sinks to object storage
transferring OLTP DB data to OLAP DB using federated tables (e.g. BigQuery external tables)
converting from row-based to column-based format

Labels: enhancement, outputs (any tasks or issues relating specifically to outputs)
