Update CUDA sdpa #2468

jagrit06 · 2025-08-06T22:07:52Z

Proposed changes

Add one and 2 pass vector sdpa impelmentations in Cuda
Add cudnn for matrix attention in supported types / hardware

Checklist

Put an x in the boxes that apply.

I have read the CONTRIBUTING document
I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
I have added tests that prove my fix is effective or that my feature works
I have updated the necessary documentation (if needed)

awni · 2025-08-06T23:03:23Z

mlx/backend/cuda/scaled_dot_product_attention.cu

+inline fe::DataType_t dtype_to_cudnn_type(Dtype dtype) {
+  switch (dtype) {
+    case int8:
+      return fe::DataType_t::INT8;
+    case int32:
+      return fe::DataType_t::INT32;
+    case uint8:
+      return fe::DataType_t::UINT8;
+    case float16:
+      return fe::DataType_t::HALF;
+    case bfloat16:
+      return fe::DataType_t::BFLOAT16;
+    case float32:
+      return fe::DataType_t::FLOAT;
+    case float64:
+      return fe::DataType_t::DOUBLE;
+    default:
+      throw std::runtime_error(fmt::format(
+          "Unsupported dtype in SDPA: {}.", dtype_to_string(dtype)));
+  }
+}


Maybe we can refactor that into a shared header cuddn.h that gets reused by conv.cpp?

awni

Looks awesome! Let's merge it after you fix the compile issue and the tests clear

jagrit06 added 7 commits August 6, 2025 09:56

Add base cudnn attention support

d8ed6c1

Add sdpa file

e74bcdc

Add more nvtx range for debug

c28249b

[WIP] 2 pass sdpav

7f8ba2a

Complete 2 pass sdpav

f81edd1

Update routing

c66b76a

Fix cudnn routing

99d8de8

jagrit06 requested a review from awni August 6, 2025 22:08

awni reviewed Aug 6, 2025

View reviewed changes

awni approved these changes Aug 6, 2025

View reviewed changes

angeloskath added 2 commits August 6, 2025 19:51

Add stricter condition to matrix sdpa

a22d0bf

Remove batch sdpa

7fa520e

angeloskath merged commit a9bdd67 into main Aug 7, 2025
6 checks passed

angeloskath deleted the sdpav-base branch August 7, 2025 04:40

BrewTestBot mentioned this pull request Aug 7, 2025

mlx 0.28.0 Homebrew/homebrew-core#232635

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update CUDA sdpa #2468

Update CUDA sdpa #2468

Uh oh!

jagrit06 commented Aug 6, 2025

Uh oh!

awni Aug 6, 2025

Uh oh!

awni left a comment

Uh oh!

Uh oh!

Uh oh!

Update CUDA sdpa #2468

Update CUDA sdpa #2468

Uh oh!

Conversation

jagrit06 commented Aug 6, 2025

Proposed changes

Checklist

Uh oh!

awni Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

awni left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!