Skip to content

metadata info apart from maxElements in dataframe schema #550

@saikumare-a

Description

@saikumare-a

Background

Hi, Thank you for the making this wonderful package.

Currently cobrix provides the maxLength(String), MinElements(Array) and MaxElementsArray) metadata info in dataframe schema

Feature

for ASCII Files, adding the below info will help in debugging purpose for pyspark users. i see that we could use scala/java to get the corresponding info from cobol converter/parser. but we only know python/pyspark.

  1. adding the length,decimal precision and decimal scale for all primitive column types (decimal, integer,long,float etc..)
  2. start position, end position
  3. redefines, assumed scale, occurs_depends_on etc..
  4. other metadata that is already available in cobol converter

Example [Optional]

A simple example if applicable.

Proposed Solution [Optional]

Solution Ideas

  1. adding the existing(converter/parser) metadata information to dataframe schema metadata

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions