Skip to content

Commit e0ef9d7

Browse files
LucasWilkinsonavigny
authored andcommitted
[Perf] Improve/Fix-regression for FA3 in High QPS regimes (vllm-project#19463)
Signed-off-by: Lucas Wilkinson <[email protected]> Signed-off-by: avigny <[email protected]>
1 parent 3418f7b commit e0ef9d7

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

cmake/external_projects/vllm_flash_attn.cmake

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ else()
3838
FetchContent_Declare(
3939
vllm-flash-attn
4040
GIT_REPOSITORY https://github.com/vllm-project/flash-attention.git
41-
GIT_TAG 763ad155a1c826f71ff318f41edb1e4e5e376ddb
41+
GIT_TAG 2c6bcfc0feb3d9d4a57b243fc159a68aa9933f5b
4242
GIT_PROGRESS TRUE
4343
# Don't share the vllm-flash-attn build between build types
4444
BINARY_DIR ${CMAKE_BINARY_DIR}/vllm-flash-attn

test-qwen

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
Subproject commit 34c31c0af8fc975140b8c85548fefa1eb7f523e4

0 commit comments

Comments
 (0)