Skip to content

Conversation

austinzh
Copy link
Contributor

@austinzh austinzh commented Oct 22, 2024

This Transformer is added to better support spark transformers that has inputCols and outputCols.
The rules are:
In Spark side, all parameters save/load as its, we use the information in the NodeShape to determine we should set inputCol/outputCol or inputCols/outputCols.

In MLeap side, when load mode, we will transformer
"input" -> "input0"
"output" -> "output0" so that transformer can build correct input/output schema.

This approach implemented in two place.

  1. MultiInOutSparkOp responsible to provide correct port->parameter mapping for load and store
  2. MultiInOutMleapOp responsible to convert single input/output to multiple input/output.

@austinzh austinzh force-pushed the u/austinzh/stringindexer_multiple_colums branch from 905fd03 to 65d672c Compare October 22, 2024 01:42
@austinzh austinzh force-pushed the u/austinzh/stringindexer_multiple_colums branch from 609eef9 to 7744013 Compare October 22, 2024 20:08
@austinzh austinzh force-pushed the u/austinzh/stringindexer_multiple_colums branch from 855e7a5 to 33d061d Compare October 22, 2024 20:18
Copy link
Contributor

@ltrottier-yelp ltrottier-yelp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mostly generic comments and questions. Otherwise, LGTM

OneHotEncoder already implement a logic to due with
multiple input columns and output columns in mleap
It's better to reuse what is ready done
Copy link
Contributor

@ltrottier-yelp ltrottier-yelp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@austinzh austinzh merged commit 11c4f54 into combust:master Nov 12, 2024
1 check passed
@austinzh austinzh deleted the u/austinzh/stringindexer_multiple_colums branch November 12, 2024 15:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants