-
Notifications
You must be signed in to change notification settings - Fork 57
PR2 (nullability bug): adding new OH SparkCatalog which enables preserving non-nullable schemas #288
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
PR2 (nullability bug): adding new OH SparkCatalog which enables preserving non-nullable schemas #288
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
package com.linkedin.openhouse.spark; | ||
|
||
public class SparkCatalog extends org.apache.iceberg.spark.SparkCatalog { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'd strongly prefer we not introduce this layer (ie.SparkCatalog) in OSS codebase (if its a must, a better place would be li-wrapper)
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I also don't see this as a perfect option and appreciate the points here.
|
||
public boolean useNullableQuerySchema() { | ||
// Preserve DataFrame nullability when writing to OH tables | ||
return false; | ||
} | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -37,7 +37,7 @@ public static void configureCatalogs( | |
builder | ||
.config( | ||
String.format("spark.sql.catalog.%s", catalogName), | ||
"org.apache.iceberg.spark.SparkCatalog") | ||
"com.linkedin.openhouse.spark.SparkCatalog") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We'd need to change Docker code too, and all other references to this connector. |
||
.config( | ||
String.format("spark.sql.catalog.%s.catalog-impl", catalogName), | ||
"com.linkedin.openhouse.spark.OpenHouseCatalog") | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if both are nullable how is the
SparkCatalog
helping?