ARROW-1364: [C++] IPC support machinery for record batch roundtrips to GPU device memory #1000

wesm · 2017-08-28T15:08:58Z

This additionally does a few things:

Changes libarrow_gpu to use the CUDA driver API instead of the runtime API
Adds code for exporting buffers using CUDA IPC on Linux, but this is not yet tested

wesm · 2017-08-28T15:13:28Z

cc @m1mc @asuhan @seibert @sklam

This patch enables writing record batches to GPU device memory with code like:

std::shared_ptr<CudaBuffer> device_serialized;
ASSERT_OK(arrow::gpu::SerializeRecordBatch(*batch, context_.get(),
                                           &device_serialized));

You can also do a zero-copy read from device memory:

Status ReadRecordBatch(const std::shared_ptr<Schema>& schema,
                       const std::shared_ptr<CudaBuffer>& buffer,
                       MemoryPool* pool, std::shared_ptr<RecordBatch>* out);

The returned record batch in this case will have device pointers in its buffers.
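
To make the zero-copy point concrete, here is a minimal sketch in plain C++ that uses host memory as a stand-in for device memory. `RawBuffer`, `SliceView`, and `MakeSlice` are hypothetical names for illustration only; they are not part of Arrow or libarrow_gpu. The idea it shows is that such a read only computes addresses inside the serialized buffer and moves no bytes, which is why the buffers of the resulting record batch would hold device pointers when the parent is a CudaBuffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for one contiguous serialized buffer. Host memory is
// used here; in the PR the backing memory would live on the GPU.
struct RawBuffer {
  std::vector<uint8_t> bytes;
  const uint8_t* data() const { return bytes.data(); }
  std::size_t size() const { return bytes.size(); }
};

// Hypothetical non-owning view into a parent buffer. It keeps the parent
// alive and records an address range, but never copies the bytes.
struct SliceView {
  std::shared_ptr<RawBuffer> parent;
  const uint8_t* data = nullptr;
  std::size_t size = 0;
};

// Build a view over [offset, offset + length) of the parent buffer.
// Returns false if the requested range does not fit inside the parent.
bool MakeSlice(const std::shared_ptr<RawBuffer>& parent, std::size_t offset,
               std::size_t length, SliceView* out) {
  if (offset > parent->size() || length > parent->size() - offset) {
    return false;
  }
  out->parent = parent;
  out->data = parent->data() + offset;  // address arithmetic only, no copy
  out->size = length;
  return true;
}
```

With a real device buffer, the address stored in the view would not be valid to dereference from host code; it would have to be passed to a kernel or copied back to the host first.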

We'll need to do a tiny bit of work so that libarrow_gpu can "attach" to an existing CUcontext created by others (see ARROW-1423), but I wanted to get this reviewed and merged first.

Additionally, we have the new functions for schema serialization:

Status SerializeSchema(const Schema& schema, MemoryPool* pool,
                       std::shared_ptr<Buffer>* out);

Status ReadSchema(io::InputStream* stream, std::shared_ptr<Schema>* out);
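
As a rough illustration of the round trip these two functions provide, here is a toy sketch in plain C++. Everything in it is an assumption made for illustration: `ToyField`, `ToySchema`, `SerializeToySchema`, and `ReadToySchema` are hypothetical, the length-prefixed byte layout is invented and is not Arrow's wire format (Arrow encodes schema metadata with Flatbuffers), and a `bool` return stands in for `Status`. It only mirrors the shape of the API: serialize into an output buffer, then read back through an out-parameter.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified schema: an ordered list of (name, type) pairs.
struct ToyField {
  std::string name;
  std::string type;
};
using ToySchema = std::vector<ToyField>;

// Append a 32-bit value in little-endian byte order.
void AppendU32(uint32_t v, std::vector<uint8_t>* out) {
  for (int i = 0; i < 4; ++i) {
    out->push_back(static_cast<uint8_t>((v >> (8 * i)) & 0xFF));
  }
}

// Read a little-endian 32-bit value at *pos; advance *pos on success.
bool ReadU32(const std::vector<uint8_t>& in, std::size_t* pos, uint32_t* v) {
  if (*pos > in.size() || in.size() - *pos < 4) return false;
  *v = 0;
  for (int i = 0; i < 4; ++i) {
    *v |= static_cast<uint32_t>(in[*pos + i]) << (8 * i);
  }
  *pos += 4;
  return true;
}

// Append a string as a 32-bit length followed by its bytes.
void AppendString(const std::string& s, std::vector<uint8_t>* out) {
  AppendU32(static_cast<uint32_t>(s.size()), out);
  out->insert(out->end(), s.begin(), s.end());
}

// Read one length-prefixed string at *pos; advance *pos on success.
bool ReadString(const std::vector<uint8_t>& in, std::size_t* pos,
                std::string* out) {
  uint32_t n = 0;
  if (!ReadU32(in, pos, &n)) return false;
  if (in.size() - *pos < n) return false;  // truncated payload
  out->assign(in.begin() + *pos, in.begin() + *pos + n);
  *pos += n;
  return true;
}

// Write the field count, then each field's name and type.
void SerializeToySchema(const ToySchema& schema, std::vector<uint8_t>* out) {
  out->clear();
  AppendU32(static_cast<uint32_t>(schema.size()), out);
  for (const ToyField& f : schema) {
    AppendString(f.name, out);
    AppendString(f.type, out);
  }
}

// Parse the bytes back into a schema. *out is only modified on success.
bool ReadToySchema(const std::vector<uint8_t>& in, ToySchema* out) {
  std::size_t pos = 0;
  uint32_t count = 0;
  if (!ReadU32(in, &pos, &count)) return false;
  ToySchema result;
  for (uint32_t i = 0; i < count; ++i) {
    ToyField f;
    if (!ReadString(in, &pos, &f.name) || !ReadString(in, &pos, &f.type)) {
      return false;
    }
    result.push_back(std::move(f));
  }
  *out = std::move(result);
  return true;
}
```

Reporting failure through the return value and handing the result back through an out-parameter follows the same convention as the `Status`-returning signatures above, so a truncated input is rejected without touching the caller's schema.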

pcmoritz · 2017-08-28T17:13:47Z

cpp/src/arrow/gpu/.gitignore

+# specific language governing permissions and limitations
+# under the License.
+
+cuda_version.h


pcmoritz · 2017-08-28T17:16:30Z

Thanks, this is very exciting!

xhochy

+1, LGTM

wesm · 2017-08-28T18:46:34Z

Thanks. I will resolve the outstanding issues around shared CUDA contexts in the course of making a PR into MapD to simplify their Arrow serialization code.

asuhan · 2017-08-28T20:38:43Z

Thanks, I look forward to using this in our product.

wesm added 10 commits August 28, 2017 10:37

Start cuda_ipc file

03d0baf

Change-Id: Ife0b48c87b27983352a498f1d3adbcc5d952265b

Start cuda context class

3a37fdf

Change-Id: I97ba1480dfc13b1a95b2bb05e68a1de2afe9c7cc

Progress

2840c60

Change-Id: I487458dd66e7f544402a3b1fc91793d492ff114e

More progress

5d686fe

Change-Id: I3ad52acd665869f0755566f3d7331dcf6567d654

Get things compiling / linking using driver API

f3c724e

Change-Id: I24d9d9510c8164dea36d83028b3c4bdbddbe2d85

Test suite passing again

508febb

Change-Id: I844b83e2e88c10d60f7c3d6bdd818c6642905a81

Add classes and methods for simplifying use of CUDA IPC machinery. No…

84e4525

… tests yet Change-Id: Ib46b646219e83c35828cf19a5e8d3bc8cc096f25

Draft SerializeRecordBatch for CUDA

591aceb

Change-Id: I8dd313ac4e1cc0c01fdbe760bcae325a55ec8818

More Arrow IPC scaffolding

16d628f

Change-Id: I9eaf54a1a058a18f17251816ec22e5e4e3a260da

Complete basic IPC message and record batch reads on GPU device memory

a8812af

Change-Id: I438d03b64f713e24299cc107b90f36a69da59f25

pcmoritz reviewed Aug 28, 2017

xhochy approved these changes Aug 28, 2017

Add newline at end of file

e436755

Change-Id: I4b47942c3621d67cfeacb0b14502b89d8ba318cc

asfgit closed this in 0728148 Aug 28, 2017

wesm deleted the ARROW-1364 branch August 28, 2017 18:47
