Conversation

@rhdong (Member) commented Feb 20, 2025

@rhdong rhdong added feature request New feature or request non-breaking Introduces a non-breaking change labels Feb 20, 2025
@rhdong rhdong requested review from achirkin and cjnolet February 20, 2025 20:14
@rhdong rhdong requested a review from a team as a code owner February 20, 2025 20:14
@github-actions github-actions bot added the cpp label Feb 20, 2025
@achirkin (Contributor) commented:

I may have missed some prior discussion, but the cagra-specific logical merge seems a little bit artificial to me.
If I understand this correctly, you essentially implement a multi-index search. Then, why don't we go one step further and make it independent of a particular index implementation? Moreover, it looks like much of this functionality is already implemented in the multi-gpu index. Maybe we can generalize that one a bit to decouple the "multi-index" aspect from the "multi-gpu" aspect? Optionally, one could also combine different index types and erase the upstream index type, like we do in the dynamic batching index.

@cjnolet (Member) commented Feb 21, 2025

> I may have missed some prior discussion, but the cagra-specific logical merge seems a little bit artificial to me.

@achirkin you and I haven't discussed this yet, but this feature is critical for certain database architectures like Lucene, which are variants of the log-structured merge pattern but build indexes immediately after segments (flat files containing vectors) are created, rather than merging the segments together before building the indexes. This causes Lucene to first merge many tiny CAGRA indexes together, but eventually it will end up merging very large indexes together. It is this latter case where we care about doing a logical merge. It's more efficient to build a single CAGRA index from many (potentially hundreds of) tiny CAGRA indexes, but once they reach a certain size it's actually more efficient to merge them logically, meaning we essentially wrap them as if they were shards and broadcast the query to all the shards during search.
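To make the "wrap as shards, broadcast the query, combine results" idea concrete, here is a minimal sketch. It is not the cuVS implementation: `shard_result`, `shard_search_fn`, and `search_composite` are hypothetical names used only to illustrate how a logically merged index could fan a single query out to its wrapped sub-indexes and keep the globally best top-k by distance.

```cpp
#include <algorithm>
#include <cstdint>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical per-shard result for a single query: neighbor ids and distances.
struct shard_result {
  std::vector<int64_t> ids;    // up to k entries
  std::vector<float>   dists;  // same length as ids
};

// A callable that searches one wrapped sub-index (shard) for the query.
using shard_search_fn = std::function<shard_result(const std::vector<float>& query, int k)>;

// Logical merge at search time: broadcast the query to every shard,
// then keep the globally best k neighbors across all shards.
shard_result search_composite(const std::vector<shard_search_fn>& shards,
                              const std::vector<float>& query, int k) {
  std::vector<std::pair<float, int64_t>> pool;  // (distance, id) pairs from all shards
  for (const auto& search_shard : shards) {
    shard_result r = search_shard(query, k);
    for (size_t i = 0; i < r.ids.size(); ++i) pool.emplace_back(r.dists[i], r.ids[i]);
  }
  // Only the k smallest distances are needed, so a partial sort suffices.
  size_t keep = std::min<size_t>(k, pool.size());
  std::partial_sort(pool.begin(), pool.begin() + keep, pool.end());

  shard_result out;
  for (size_t i = 0; i < keep; ++i) {
    out.dists.push_back(pool[i].first);
    out.ids.push_back(pool[i].second);
  }
  return out;
}
```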

I agree this is similar in theory to the multi-GPU sharding, and perhaps there is some code to be reused there. The next step for the merge() API is to implement a SMART merge strategy, whereby we can more efficiently merge two CAGRA graphs together without having to rebuild from scratch; that's ultimately the strategy we have discussed offline in more detail. The point is that Lucene can make use of this today, so it gives us time to work towards the SMART option.

> Then, why don't we go one step further and make it independent of a particular index implementation?

I do agree with this; it doesn't have to be (and ideally wouldn't be) specific to CAGRA, though that just happens to be the index we care about today because it unblocks Lucene.

@achirkin (Contributor) commented:

I totally agree and don't doubt the usefulness of the logical merge.
I'm just pointing out that I think we can achieve a more general solution (supporting any index type) with actually less code by reusing/adjusting what we already have in the multi-index/dynamic batching code.

@rhdong rhdong closed this Mar 13, 2025
@rhdong rhdong reopened this Mar 13, 2025
@rhdong (Member, Author) commented Mar 13, 2025

> I may have missed some prior discussion, but the cagra-specific logical merge seems a little bit artificial to me. If I understand this correctly, you essentially implement a multi-index search. Then, why don't we go one step further and make it independent of a particular index implementation? Moreover, it looks like much of this functionality is already implemented in the multi-gpu index. Maybe we can generalize that one a bit to decouple the "multi-index" aspect from the "multi-gpu" aspect? Optionally, one could also combine different index types and erase the upstream index type, like we do in the dynamic batching index.

Hi @achirkin, sorry for the late response to your comments here. I studied the SNMG code, which is a super useful feature. When I tried to reuse some of its code, I found that the caller has to deal with locking and NCCL, which is unnecessary for the logical merge of CAGRA. I'd like to keep the simple implementation if you agree. Many thanks! cc: @cjnolet

@rhdong (Member, Author) commented Mar 18, 2025

Hi @cjnolet @achirkin, could you please merge it if there are no more comments? Many thanks!

@cjnolet (Member) commented Apr 25, 2025

> When I tried to reuse some of its code, I found that the caller has to deal with locking and NCCL, which is unnecessary for the logical merge of CAGRA. I'd like to keep the simple implementation if you agree. Many thanks! cc: @cjnolet

@rhdong I don't think the ask here is that you use the multi-GPU APIs, but rather that we have a more consistent and general means of supporting wrapper types that can work with any index type, and avoid the need to, for example, implement a separate composite_index type for each index type within cuVS. Dynamic batching and the multi-GPU APIs are two examples where we have a single index that can be used with any other index. This is ultimately where we should strive to be.
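A rough sketch of the "single wrapper that works with any index type" idea via type erasure. This is not the cuVS API: `index_base`, `erased_index`, and `composite_index` are made-up names used purely to illustrate how one composite type, written once, could hold any mix of underlying index types.

```cpp
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical common interface that any concrete index (CAGRA, IVF-PQ, ...) could expose.
struct index_base {
  virtual ~index_base() = default;
  virtual void search(const float* queries, int n_queries, int k,
                      int64_t* neighbors, float* distances) const = 0;
};

// Type-erasing adapter: wraps any concrete index type that has a compatible const search method.
template <typename Index>
struct erased_index : index_base {
  explicit erased_index(Index idx) : idx_(std::move(idx)) {}
  void search(const float* queries, int n_queries, int k,
              int64_t* neighbors, float* distances) const override {
    idx_.search(queries, n_queries, k, neighbors, distances);  // forwarded to the wrapped type
  }
  Index idx_;
};

// A single composite type, written once, that can hold any mix of index types.
struct composite_index : index_base {
  std::vector<std::unique_ptr<index_base>> children;
  void search(const float* queries, int n_queries, int k,
              int64_t* neighbors, float* distances) const override {
    // Broadcast to every child; the top-k merge of their results is omitted here for brevity.
    for (const auto& child : children) child->search(queries, n_queries, k, neighbors, distances);
  }
};
```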

@rhdong (Member, Author) commented Apr 30, 2025

> When I tried to reuse some of its code, I found that the caller has to deal with locking and NCCL, which is unnecessary for the logical merge of CAGRA. I'd like to keep the simple implementation if you agree. Many thanks! cc: @cjnolet

> @rhdong I don't think the ask here is that you use the multi-GPU APIs, but rather that we have a more consistent and general means of supporting wrapper types that can work with any index type, and avoid the need to, for example, implement a separate composite_index type for each index type within cuVS. Dynamic batching and the multi-GPU APIs are two examples where we have a single index that can be used with any other index. This is ultimately where we should strive to be.

Hi @cjnolet @achirkin, just want to clarify before moving forward: what you're suggesting is lifting composite_index up to a higher level, e.g. by moving it to common.hpp, and using cuvs::neighbour::index as the array item type, am I right? Many thanks!

@cjnolet (Member) commented May 7, 2025

> Hi @cjnolet @achirkin, just want to clarify before moving forward: what you're suggesting is lifting composite_index up to a higher level, e.g. by moving it to common.hpp, and using cuvs::neighbour::index as the array item type, am I right? Many thanks!

Yes, though I think we could address this in a follow-up PR and merge this change in the meantime. We really need the CAGRA merge capability initially so that we can unblock our Lucene friends. @rhdong, can you create a new GitHub issue to follow up on the first-class, formal, generalized composite_index?

The other big reason why centralizing this composite index is important is that we want it to work out of the box with the other APIs that work with general indexes (such as the SNMG APIs).
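Continuing the hypothetical `index_base`/`erased_index`/`composite_index` names from the sketch above (again, not the actual cuVS types): once the composite derives from the common base, any routine written against that base accepts it with no extra code, which is the "works out of the box with other APIs" point.

```cpp
// Any routine written against the common base works for a single index
// or for a composite of many, without knowing which it received.
void run_query_batch(const index_base& idx,
                     const float* queries, int n_queries, int k,
                     int64_t* neighbors, float* distances) {
  idx.search(queries, n_queries, k, neighbors, distances);
}

// Usage idea (hypothetical types from the sketch above):
//   erased_index<some_cagra_index> single{...};  // one index, wrapped
//   composite_index merged;                      // many indexes behind one interface
//   run_query_batch(single, queries, n, k, ids, dists);
//   run_query_batch(merged, queries, n, k, ids, dists);
```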

@github-actions github-actions bot added the CMake label May 27, 2025
@rhdong rhdong requested a review from cjnolet May 27, 2025 02:57
@rhdong (Member, Author) commented May 27, 2025

Hi @cjnolet, I've provided a multi-stream implementation, but it appears to show no performance improvement. (Sorry for the accidental force push during this.) Following your suggestion, I've submitted issue #946 and will continue working on further optimizations.

Excerpt from the reviewed diff:

}

raft::resources composite_handle(handle_);
size_t n_streams = std::min(wrapped_indices.size(), size_t(8));
Review comment (Member) on the excerpt above:

Out of curiosity, why 8 here? I don't think we need to hold up the release for this, but can you at least create a GitHub issue to make this configurable (or to find a good, reasonable default which can be overridden)?

@rhdong (Member, Author) replied:

I just changed it to be equal to wrapped_indices.size().
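As a side note on this exchange, one way the stream count could be made configurable with a sensible default, as the reviewer suggests. This is only a hedged sketch: `merge_search_params` and `choose_n_streams` are hypothetical names, not part of the cuVS API.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical knob: 0 means "one stream per wrapped sub-index",
// any other value is an explicit upper bound chosen by the caller.
struct merge_search_params {
  std::size_t n_streams = 0;
};

// Never ask for more streams than there are sub-indexes, and always use at least one.
inline std::size_t choose_n_streams(std::size_t n_wrapped_indices,
                                    const merge_search_params& params) {
  std::size_t n = (params.n_streams == 0) ? n_wrapped_indices
                                          : std::min(params.n_streams, n_wrapped_indices);
  return std::max<std::size_t>(n, 1);
}

// The reviewed line could then become something like:
//   size_t n_streams = choose_n_streams(wrapped_indices.size(), params);
```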

@rhdong rhdong requested a review from cjnolet May 28, 2025 17:16
@cjnolet (Member) commented May 28, 2025

/merge

@rapids-bot rapids-bot bot merged commit e00fabe into rapidsai:branch-25.06 May 28, 2025
75 checks passed
copy-pr-bot bot pushed a commit that referenced this pull request Jun 3, 2025
mythrocks pushed a commit to mythrocks/cuvs that referenced this pull request Jun 3, 2025

Labels: CMake, cpp, feature request (New feature or request), non-breaking (Introduces a non-breaking change), Python
