wesm commented Aug 25, 2019

This patch lets the user select the compression level used when column data in Parquet is compressed, trading compression ratio against speed.

Reopening this PR to supersede #5071, since the GitHub UI closed that PR when I attempted to push commits to the contributor's master branch.

martinradev and others added 2 commits August 22, 2019 15:35
In some situations the user might benefit from using a compression
level other than the default one in Arrow. This patch adds a method
to the ColumnProperties builder to allow the user to select a
compression level.
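
As a rough illustration of the builder method described in the commit message, the sketch below shows how a properties builder might accept a compression level, both as a default for every column and as a per-column override. Every name here (`WriterSettingsBuilder`, `ColumnSettings`, `kDefaultLevelSentinel`, `BuilderExample`) is hypothetical and chosen for the example; this is not the Parquet C++ API and it performs no compression.

```cpp
#include <cassert>
#include <limits>
#include <string>
#include <unordered_map>

// Hypothetical sentinel meaning "no level chosen; let the codec decide".
constexpr int kDefaultLevelSentinel = std::numeric_limits<int>::min();

// Hypothetical per-column settings produced by the builder.
struct ColumnSettings {
  std::string codec = "UNCOMPRESSED";
  int compression_level = kDefaultLevelSentinel;
};

// Hypothetical builder; only the shape of the interface is the point.
class WriterSettingsBuilder {
 public:
  // Codec applied to every column.
  WriterSettingsBuilder& compression(const std::string& codec) {
    default_.codec = codec;
    return *this;
  }

  // Level applied to every column that has no per-column override.
  WriterSettingsBuilder& compression_level(int level) {
    default_.compression_level = level;
    return *this;
  }

  // Level applied to a single named column.
  WriterSettingsBuilder& compression_level(const std::string& column, int level) {
    levels_[column] = level;
    return *this;
  }

  // Resolve the effective settings for one column.
  ColumnSettings settings_for(const std::string& column) const {
    ColumnSettings out = default_;
    auto it = levels_.find(column);
    if (it != levels_.end()) out.compression_level = it->second;
    return out;
  }

 private:
  ColumnSettings default_;
  std::unordered_map<std::string, int> levels_;
};

// Usage: a default level of 3, with a higher level for one column.
inline void BuilderExample() {
  WriterSettingsBuilder b;
  b.compression("ZSTD").compression_level(3).compression_level("payload", 9);
  assert(b.settings_for("id").compression_level == 3);
  assert(b.settings_for("payload").compression_level == 9);
}
```

Leaving the level at a sentinel when the user sets nothing means the builder never has to know each codec's default.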

wesm commented Aug 25, 2019

I also moved the handling of the default compression hint into the codec ctors, and moved the default compression levels into the header files, if that doesn't offend anyone.
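
One way to picture handling the default hint in a codec constructor is sketched below, assuming a sentinel value stands for "use the codec's default". The names `ExampleCodec`, `kUseCodecDefault`, and `kExampleDefaultLevel`, and the value 6, are illustrative assumptions rather than identifiers or values taken from the Arrow sources.

```cpp
#include <cassert>
#include <limits>

// Hypothetical sentinel passed by callers that have no level preference.
constexpr int kUseCodecDefault = std::numeric_limits<int>::min();

// Hypothetical default level, declared next to the codec as a header might do.
// The value 6 is illustrative only.
constexpr int kExampleDefaultLevel = 6;

class ExampleCodec {
 public:
  // The constructor resolves the sentinel, so callers never need to know
  // which concrete level this codec treats as its default.
  explicit ExampleCodec(int level = kUseCodecDefault)
      : level_(level == kUseCodecDefault ? kExampleDefaultLevel : level) {}

  // Effective level after the sentinel has been resolved.
  int level() const { return level_; }

 private:
  int level_;
};
```

With this arrangement, code that constructs a codec can forward whatever level it was given, sentinel included, and each codec keeps its own default in one place.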

@martinradev
Contributor

As you wish, thanks for the effort.

@codecov-io

Codecov Report

Merging #5192 into master will increase coverage by 1.59%.
The diff coverage is 91.5%.


@@            Coverage Diff             @@
##           master    #5192      +/-   ##
==========================================
+ Coverage   87.65%   89.25%   +1.59%     
==========================================
  Files        1025      742     -283     
  Lines      146568   105790   -40778     
  Branches     1437        0    -1437     
==========================================
- Hits       128477    94420   -34057     
+ Misses      17729    11370    -6359     
+ Partials      362        0     -362
Impacted Files Coverage Δ
cpp/src/parquet/column_writer.h 95.65% <ø> (ø) ⬆️
cpp/src/parquet/types.h 100% <ø> (ø) ⬆️
cpp/src/arrow/util/compression.h 100% <ø> (ø) ⬆️
cpp/src/arrow/util/compression_zstd.h 50% <ø> (ø) ⬆️
cpp/src/arrow/util/compression_snappy.cc 73.91% <ø> (ø) ⬆️
cpp/src/arrow/util/compression_bz2.h 50% <ø> (ø) ⬆️
cpp/src/arrow/util/compression_brotli.h 50% <ø> (ø) ⬆️
cpp/src/parquet/types_test.cc 100% <ø> (ø) ⬆️
cpp/src/arrow/util/compression_bz2.cc 57.35% <100%> (+1.62%) ⬆️
cpp/src/parquet/file_deserialize_test.cc 99.4% <100%> (ø) ⬆️
... and 299 more


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e0fa3d1...8f12461.


wesm commented Aug 26, 2019

+1. Thanks @martinradev!
