Update score calculation for CAGRA-Q instance selection #938

enp1s0 · 2025-05-27T17:11:29Z

The current CAGRA-Q instance selection criterion is the same as the one for the standard CAGRA, which is not always optimal for CAGRA-Q. This PR updates the criterion for CAGRA-Q to improve the throughput when team_size=AUTO.

The size (Byte) of each vector is smaller in a dataset compressed with CAGRA-Q compared to an uncompressed one. Because of this, we may be able to improve throughput by using a smaller team_size. This PR updates the scoring method for selecting a CAGRA-Q instance to take that into account. Based on my performance tests for SIFT, GloVe, GIST, NYTimes, and OpenAI 5M, the updated scoring method avoided selecting the worst team_size values, unlike the current method.

cjnolet · 2025-05-27T19:18:54Z

/merge

The current CAGRA-Q instance selection criterion is the same as the one for the standard CAGRA, which is not always optimal for CAGRA-Q. This PR updates the criterion for CAGRA-Q to improve the throughput when `team_size=AUTO`. The size (Byte) of each vector is smaller in a dataset compressed with CAGRA-Q compared to an uncompressed one. Because of this, we may be able to improve throughput by using a smaller `team_size`. This PR updates the scoring method for selecting a CAGRA-Q instance to take that into account. Based on my performance tests for SIFT, GloVe, GIST, NYTimes, and OpenAI 5M, the updated scoring method avoided selecting the worst `team_size` values, unlike the current method. Authors: - tsuki (https://github.com/enp1s0) Approvers: - Corey J. Nolet (https://github.com/cjnolet) URL: rapidsai#938

Update score calculation for CAGRA-Q instance selection

6e39ca7

enp1s0 requested a review from a team as a code owner May 27, 2025 17:11

enp1s0 self-assigned this May 27, 2025

github-actions bot added the cpp label May 27, 2025

enp1s0 added improvement Improves an existing functionality non-breaking Introduces a non-breaking change cpp and removed cpp labels May 27, 2025

cjnolet added this to Vector Search, ML, & Data Mining Release Board May 27, 2025

cjnolet moved this to In Progress in Vector Search, ML, & Data Mining Release Board May 27, 2025

enp1s0 added 2 commits May 28, 2025 02:22

Merge branch 'branch-25.06' into cagra-q-auto-team-size

33d4555

Merge branch 'branch-25.06' into cagra-q-auto-team-size

9e262da

cjnolet approved these changes May 27, 2025

View reviewed changes

rapids-bot bot merged commit d8733c6 into rapidsai:branch-25.06 May 27, 2025
75 checks passed

github-project-automation bot moved this from In Progress to Done in Vector Search, ML, & Data Mining Release Board May 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update score calculation for CAGRA-Q instance selection #938

Update score calculation for CAGRA-Q instance selection #938

enp1s0 commented May 27, 2025

Uh oh!

cjnolet commented May 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update score calculation for CAGRA-Q instance selection #938

Update score calculation for CAGRA-Q instance selection #938

Conversation

enp1s0 commented May 27, 2025

Uh oh!

cjnolet commented May 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants