Expose graph and dataset accessors for CAGRA to C/Python #1086

benfred · 2025-07-04T22:51:58Z

Add the ability to get the graph and dataset from a CAGRA index to the C and Python APIs, along with the ability to reconstruct a CAGRA index from a graph and dataset.

The eventual goal is to be more flexible about serialization formats. Rather than supporting every format inside cuVS, we expose the raw data needed to recreate a CAGRA index and let consumers of cuVS decide how they want to serialize it.
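
As an illustration of the consumer-side serialization this is meant to enable, here is a minimal sketch that is not part of this PR. It assumes the graph (uint32 neighbor ids) and the dataset (float32 vectors) have already been copied to host memory as flat row-major arrays. The file layout, the `MAGIC` tag, and the `save_index` / `load_index` helpers are all hypothetical; in a real workflow the arrays would come from the accessors added here and be passed back to the index-reconstruction call.

```python
import io
import struct
from array import array

MAGIC = b"CGRA"  # hypothetical 4-byte tag for this made-up format


def save_index(stream, graph, dataset, n_rows, degree, dim):
    """Write a header followed by the raw graph and dataset payloads."""
    stream.write(MAGIC)
    stream.write(struct.pack("<III", n_rows, degree, dim))
    stream.write(graph.tobytes())    # payload is in native byte order
    stream.write(dataset.tobytes())


def load_index(stream):
    """Read back what save_index wrote; returns (graph, dataset, shape)."""
    if stream.read(4) != MAGIC:
        raise ValueError("not a file written by save_index")
    n_rows, degree, dim = struct.unpack("<III", stream.read(12))
    graph = array("I")
    graph.frombytes(stream.read(graph.itemsize * n_rows * degree))
    dataset = array("f")
    dataset.frombytes(stream.read(dataset.itemsize * n_rows * dim))
    return graph, dataset, (n_rows, degree, dim)


# Toy data: 3 rows, graph degree 2, vector dimension 2.
graph = array("I", [1, 2, 0, 2, 0, 1])
dataset = array("f", [0.5, 1.5, 2.5, 3.5, 4.5, 5.5])

buf = io.BytesIO()
save_index(buf, graph, dataset, n_rows=3, degree=2, dim=2)
buf.seek(0)
graph2, dataset2, shape = load_index(buf)
```

The point of the sketch is only that the format decision lives entirely in the consumer's code; cuVS would just hand over and accept the two arrays.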

mythrocks · 2025-07-07T22:15:11Z

.pre-commit-config.yaml

            files: |
              (?x)
-                  [.](cmake|cpp|cu|cuh|h|hpp|sh|pxd|py|pyx|rs)$|
+                  [.](cmake|cpp|cu|cuh|h|hpp|sh|pxd|py|pyx|rs|java)$|


mythrocks · 2025-07-07T22:15:41Z

cpp/include/cuvs/cluster/kmeans.h

 */

-enum cuvsKMeansInitMethod {
+typedef enum {


ldematte · 2025-07-10T09:07:30Z

@benfred I tried to check this out and build it locally, but it does not compile; the compiler reports template errors around matrix_vector_op.
However, I have the same problem on the current "main" branch (branch-25.08), so I suppose it's unrelated to your changes and something else broke it (I'm 100% sure it was compiling a week ago).

Scratch that, I needed to update my conda environment as the raft include files changed.

ldematte

Some minor comments while I was writing a Java wrapper for this

python/cuvs/cuvs/neighbors/cagra/cagra.pyx

cpp/include/cuvs/neighbors/cagra.h

cpp/src/neighbors/cagra_c.cpp

ldematte · 2025-07-17T07:49:21Z

TL;DR: I liked the API as it was in 2f22d53 (cuvsCagraIndexGetGraphView + cuvsCagraIndexCopyGraph), but I like the last implementation ("view" + cuvsCopyMatrix) even more.

If anything, in a follow-up PR, it would be great to have a partial copy (e.g. cuvsCopyMatrix with fromRow, toRow or something similar) so that on the Java side I don't have to replicate what is done there and/or in raft::copy_matrix<T>.

benfred · 2025-07-17T16:38:32Z

> If anything, in a follow-up PR, it would be great to have partial copy (e.g. cuvsCopyMatrix with fromRow, toRow or something similar) so in the Java side I don't have to replicate what is done there and/or in raft::copy_matrix.

How about having something like a cuvsSliceMatrix or cuvsSliceRows function to do this? (The cuvsSliceRows function wouldn't copy data; it would just adjust the shape / strides / data pointer to slice the matrix.) We could then pass the slice to the copy matrix function, like:

// get the cagra graph
DLManagedTensor graph;
cuvsCagraIndexGetGraph(index, &graph);

// get the first 1K rows from the graph
DLManagedTensor subgraph;
cuvsSliceMatrix(&graph, 0, 1024, &subgraph);

// copy the subgraph to host memory
cuvsCopyMatrix(&subgraph, &subgraphHost);

(This actually makes me think that I should rename cuvsCopyMatrix to cuvsMatrixCopy, so that we can have cuvsMatrixSlice etc., and to be consistent with other cuVS APIs.)
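
To make the "slice without copying, then copy the slice" idea concrete, here is a minimal host-side sketch in Python. It is an illustration only: `MatrixView`, `slice_rows`, `copy_matrix`, and `pages` are hypothetical names, not the cuVS API, and a flat `array` stands in for device memory. Slicing only adjusts the row count and the element offset, and only the copy step touches the data.

```python
from array import array
from dataclasses import dataclass


@dataclass(frozen=True)
class MatrixView:
    """Hypothetical stand-in for a 2-D tensor descriptor (not the cuVS API)."""
    data: memoryview   # shared, flat storage of uint32 elements
    rows: int
    cols: int
    row_stride: int    # elements between the starts of consecutive rows
    offset: int = 0    # element offset of row 0 within `data`


def slice_rows(view, start, end):
    """Return a view of rows [start, end) that shares storage with `view`."""
    if not 0 <= start <= end <= view.rows:
        raise IndexError("row range out of bounds")
    return MatrixView(view.data, end - start, view.cols, view.row_stride,
                      view.offset + start * view.row_stride)


def copy_matrix(view):
    """Materialize a view into a freshly allocated list of rows."""
    out = []
    for r in range(view.rows):
        begin = view.offset + r * view.row_stride
        out.append(list(view.data[begin:begin + view.cols]))
    return out


def pages(view, page_rows):
    """Yield copies of the matrix, page_rows rows at a time."""
    for start in range(0, view.rows, page_rows):
        end = min(start + page_rows, view.rows)
        yield copy_matrix(slice_rows(view, start, end))


storage = array("I", range(12))  # 4 rows x 3 columns
graph = MatrixView(memoryview(storage), rows=4, cols=3, row_stride=3)
subgraph = slice_rows(graph, 1, 3)  # no data is copied here
```

Paging through a large graph then reduces to slicing successive row ranges and copying each one, which is what `pages` does.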

mythrocks · 2025-07-17T17:24:29Z

> if you think there is value for C users too, and want to add a specific C API, I can use that. My 2c: it's not worth the effort: C users can/will just use cudaMemcpy/cudaMemcpy2D (but I'll let you decide on this point)...

@benfred / @cjnolet will keep me honest here.

I think the C-lang user should not use cudaMemcpy directly. The dataset is likely to be padded to an 8-byte boundary when stored in __device__ memory. A naive cudaMemcpy will copy/interpret padding bytes that aren't actual data.

Using a cuVS-specific copy() API would be preferable, to insulate the user from padding. If a future CAGRA implementation does away with the padding, the cuvsMatrixCopy() user would be insulated from the change. The cudaMemcpy() user might not.
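
The padding concern can be illustrated with a small host-side sketch. Everything here is an assumption made for illustration: the row stride of 4 floats, the -1.0 pad value, and the helper names are made up and do not describe the actual cuVS memory layout. The sketch only shows why a flat copy of rows * cols elements misreads a padded row-major buffer, while a stride-aware copy (the role cudaMemcpy2D or a library copy routine would play) does not.

```python
from array import array


def make_padded(rows, row_stride, pad_value=-1.0):
    """Lay out rows in a flat float32 buffer, padding each row to row_stride."""
    cols = len(rows[0])
    buf = array("f")
    for row in rows:
        buf.extend(row)
        buf.extend([pad_value] * (row_stride - cols))
    return buf


def naive_copy(buf, n_rows, cols):
    """Flat copy that ignores the row stride (wrong when rows are padded)."""
    flat = buf[: n_rows * cols]
    return [list(flat[r * cols:(r + 1) * cols]) for r in range(n_rows)]


def strided_copy(buf, n_rows, cols, row_stride):
    """Copy cols elements per row, skipping the padding at the end of each row."""
    return [list(buf[r * row_stride: r * row_stride + cols])
            for r in range(n_rows)]


rows = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
padded = make_padded(rows, row_stride=4)

wrong = naive_copy(padded, n_rows=2, cols=3)
right = strided_copy(padded, n_rows=2, cols=3, row_stride=4)
```

In this toy layout the flat copy pulls a pad value into the second row, while the strided copy reproduces the original rows.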

ldematte · 2025-07-18T07:23:57Z

> How about having something like a cuvsSliceMatrix or cuvsSliceRows functions to do this

That's even better!

> this actually makes me think that I should rename cuvsCopyMatrix to cuvsMatrixCopy

++

mythrocks

A couple of nitpicks, but this looks good to go.

cpp/include/cuvs/core/detail/interop.hpp

cpp/include/cuvs/neighbors/cagra.h

cpp/include/cuvs/core/detail/interop.hpp

cpp/src/core/c_api.cpp

mythrocks · 2025-07-22T03:51:36Z

/merge

lowener · 2025-07-22T12:02:24Z

python/cuvs/cuvs/common/device_tensor_view.pyx

+from cuvs.common.resources import auto_sync_resources
+
+
+cdef class DeviceTensorView:


Can't we return device_matrix_view without intermediate copies? I am not 100% sure I see the benefits of adding this class compared to device views?

benfred

The Python API uses the C API (instead of the C++ API), meaning we can't use device_matrix_view directly inside Python. Instead we're using dlpack.DLManagedTensor inside our C API.

Prior to this PR, our C API only accepted dlpack arrays as inputs (and would either use the contents in functions like cagra::build, or fill pre-allocated arrays with the outputs in functions like cagra::search). We hadn't yet exposed memory that was allocated in our C++ codebase to Python.

This PR changes that: we now return DLManagedTensor objects that reference memory owned by and allocated inside our C++ codebase. This is done without copying the data, with the flow going: device_matrix_view (C++) -> DLManagedTensor (C) -> DeviceTensorView (Python). No step copies the data; the intermediate objects just hold a pointer to the original data, plus extra information such as the shape, dtype, and strides.

This DeviceTensorView code is necessary because we didn't have anything that would take a DLManagedTensor and return something that could be easily consumed in Python. The closest object we had was the pylibraft.device_ndarray object, but that object wouldn't have worked here.
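
The chain of non-copying wrappers described above can be mimicked on the host with a short Python sketch. The three classes below are hypothetical stand-ins (for the C++ owner, the DLManagedTensor, and the Python-facing view respectively); they are not the classes added in this PR, and host memory stands in for device memory. Each layer only keeps a reference to the same storage plus metadata.

```python
from array import array


class OwnedMatrix:
    """Stand-in for the C++ object that allocates and owns the memory."""
    def __init__(self, rows, cols):
        self.rows, self.cols = rows, cols
        self.storage = array("f", [0.0] * (rows * cols))


class TensorDescriptor:
    """Stand-in for a DLManagedTensor: a data reference plus metadata."""
    def __init__(self, owner):
        self.owner = owner                     # keeps the owner alive
        self.data = memoryview(owner.storage)  # references, does not copy
        self.shape = (owner.rows, owner.cols)
        self.strides = (owner.cols, 1)         # in elements, row-major
        self.dtype = "float32"


class TensorView:
    """Stand-in for the Python-facing wrapper around a descriptor."""
    def __init__(self, desc):
        self._desc = desc

    @property
    def shape(self):
        return self._desc.shape

    def __getitem__(self, idx):
        r, c = idx
        stride_r, stride_c = self._desc.strides
        return self._desc.data[r * stride_r + c * stride_c]


owner = OwnedMatrix(2, 3)
view = TensorView(TensorDescriptor(owner))
owner.storage[4] = 7.5  # write through the owner: row 1, column 1
```

Because no layer copies, a write made through the owner is visible through the view, and the view keeps the owner reachable for as long as the view itself is alive.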

cjnolet · 2025-07-22T17:11:22Z

cpp/include/cuvs/neighbors/cagra.h

+ *
+ * @endcode
+ */
+cuvsError_t cuvsCagraIndexFromGraph(cuvsResources_t res,


Nitpick: can we please rename this to cuvsCagraIndexFromParams or cuvsCagraIndexFromArgs? I'd like to keep the API design consistent, and having to specify specific args in the name will get unwieldy quickly.

benfred

I renamed it to cuvsCagraIndexFromArgs in the last commit. (I went with FromArgs instead of FromParams, since I think FromParams could be confused with the Index Params we use to build the index.)

bdice

Approving packaging changes.

…1216) This PR leverages the functions introduced by #1086 and the data structures introduced by #1111 to access, copy, and re-create an index to/from a CAGRA graph. Supersedes #1105 Authors: - Lorenzo Dematté (https://github.com/ldematte) - MithunR (https://github.com/mythrocks) Approvers: - MithunR (https://github.com/mythrocks) URL: #1216

benfred added the improvement Improves an existing functionality label Jul 4, 2025

benfred requested review from a team as code owners July 4, 2025 22:51

benfred added the non-breaking Introduces a non-breaking change label Jul 4, 2025

benfred requested a review from AyodeAwe July 4, 2025 22:52

github-actions bot added cpp Python labels Jul 4, 2025

mythrocks reviewed Jul 7, 2025

cpp/include/cuvs/cluster/kmeans.h

👍

Merge branch 'branch-25.08' into cagra_c_graph_dataset

3f92c4f

mythrocks added this to Vector Search, ML, & Data Mining Release Board Jul 9, 2025

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board Jul 9, 2025

mythrocks moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board Jul 9, 2025

mythrocks mentioned this pull request Jul 9, 2025

[FEA] Make graph / dataset accessors available via the Java API #1098

Open

ldematte reviewed Jul 10, 2025

python/cuvs/cuvs/neighbors/cagra/cagra.pyx (outdated, resolved)

cpp/include/cuvs/neighbors/cagra.h (outdated, resolved)

benfred added 4 commits July 10, 2025 09:13

Fix docs

5a8cb2b

Merge branch 'cagra_c_graph_dataset' of https://github.com/benfred/cuvs…

14c92f1

… into cagra_c_graph_dataset

Merge branch 'branch-25.08' into cagra_c_graph_dataset

97d7323

Address code review comments

19757af

chatman mentioned this pull request Jul 10, 2025

[WIP][Java] Exposing CAGRA graph #1102

Closed

ldematte mentioned this pull request Jul 11, 2025

[WIP][Java] Java API for Cagra graph accessors #1105

Closed

cjnolet assigned benfred Jul 11, 2025

Merge branch 'branch-25.08' into cagra_c_graph_dataset

5d168e3

mythrocks moved this to In Progress in Elasticsearch + cuVS Team Jul 15, 2025

mythrocks added this to Elasticsearch + cuVS Team Jul 15, 2025

mythrocks reviewed Jul 15, 2025

cpp/src/neighbors/cagra_c.cpp (resolved)

mythrocks reviewed Jul 15, 2025

cpp/src/neighbors/cagra_c.cpp (outdated, resolved)

cuvsCopyMatrix -> cuvsMatrixCopy

29b83db

benfred added 4 commits July 17, 2025 12:53

.

e447402

add cuvsMatrixSliceRows

fe2f03f

add ability to page through returned rows via a new 'cuvsMatrixSliceRows' function

fix docs

8206323

Merge branch 'branch-25.08' into cagra_c_graph_dataset

ca991af

Merge branch 'branch-25.08' into cagra_c_graph_dataset

07acc6b

mythrocks approved these changes Jul 21, 2025

cpp/include/cuvs/core/detail/interop.hpp (outdated, resolved)

cpp/include/cuvs/core/detail/interop.hpp (resolved)

cpp/include/cuvs/neighbors/cagra.h (outdated, resolved)

mythrocks reviewed Jul 21, 2025

cpp/include/cuvs/core/detail/interop.hpp (outdated, resolved)

mythrocks reviewed Jul 21, 2025

cpp/src/core/c_api.cpp (resolved)

Apply suggestions from code review

e199a4f

Co-authored-by: MithunR <[email protected]>

lowener reviewed Jul 22, 2025

cjnolet reviewed Jul 22, 2025

FromGraph -> FromArgs

b03889b

cjnolet approved these changes Jul 22, 2025

cjnolet removed the request for review from AyodeAwe July 22, 2025 18:37

bdice approved these changes Jul 22, 2025

rapids-bot bot merged commit cf9b256 into rapidsai:branch-25.08 Jul 22, 2025
53 checks passed

github-project-automation bot moved this from In Progress to Done in Elasticsearch + cuVS Team Jul 22, 2025

github-project-automation bot moved this from In Progress to Done in Vector Search, ML, & Data Mining Release Board Jul 22, 2025

benfred deleted the cagra_c_graph_dataset branch July 23, 2025 21:55

This was referenced Aug 5, 2025

[Java] Add CAGRA index graph accessor/build from graph (host memory) #1216

Merged

[Java] CuVSMatrix for device memory #1232

Merged

mythrocks mentioned this pull request Aug 11, 2025

[FEA] Evaluate exposing CAGRA graph size as a long #1242

Open
