Dao-AILab / flash-attention Public

Notifications You must be signed in to change notification settings
Fork 2k
Star 19.9k

Code
Issues 851
Pull requests 79
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: Dao-AILab/flash-attention

Labels 9 Milestones 0

New pull request New

79 Open 293 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

support cpu run fa triton kernel

#1938 opened Oct 15, 2025 by hellozmz • Draft

[CUTE] Enable Pack GQA for score mods

#1937 opened Oct 14, 2025 by drisspg

Loading…

Fix backward preprocess kernel in FlashAttention3

#1936 opened Oct 14, 2025 by rfbr

Loading…

windows error error C2039:

#1932 opened Oct 12, 2025 by Granddyser

Loading…

[NFC] Trivial fix to silence linter

#1928 opened Oct 8, 2025 by jduprat

Loading…

feat: add to support float8 kvcache in fa4

#1914 opened Sep 28, 2025 by yicwang

Loading…

fix forward and backward kernel

#1907 opened Sep 24, 2025 by rz2778

Loading…

Add flash_attn_varlen_qkvpacked_func to hopper (flash_attn_3)

#1902 opened Sep 22, 2025 by foreverYoungGitHub

Loading…

Feature/varlen rotray

#1899 opened Sep 19, 2025 by mhoangvslev

Loading…

Fix the torch.compile failure of flash_attn_varlen_func

#1894 opened Sep 17, 2025 by zhenwendai

Loading…

Improve setup.py

#1859 opened Sep 3, 2025 by cyyever

Loading…

feat: Implement Sink Attention

#1819 opened Aug 18, 2025 by aoxy

Loading…

fix race condition bug in cute _flash_attn_fwd in multiple gpu env

#1793 opened Aug 1, 2025 by beiw-nv

Loading…

feat: blocksparse support

#1784 opened Jul 30, 2025 by guangyunh-nv • Draft

[CI] build upon manylinux, improve compatibility

#1780 opened Jul 29, 2025 by zipzou

Loading…

Change the update method of the sub-module

#1774 opened Jul 25, 2025 by RealTapeL

Loading…

add var_len case for benchmark_mla_decode

#1770 opened Jul 22, 2025 by XiaobingSuper

Loading…

Add torch.compile support to flash attention 3

#1769 opened Jul 22, 2025 by guilhermeleobas

Loading…

Enable the deterministic mode option in the backward kernel

#1766 opened Jul 21, 2025 by GD06

Loading…

Suppress warnings in windows compilation

#1748 opened Jul 10, 2025 by XXXXRT666

Loading…

Fix illegal memory access through off-by-one error in num_splits_dynamic_ptr init

#1747 opened Jul 10, 2025 by klondenberg-bioptimus

Loading…

Theoretically make compiling from pip quicker

#1703 opened Jun 8, 2025 by whrit

Loading…

fix: fa3 backward check qkv with qkv_scale and dqkv

#1686 opened May 29, 2025 by yuyu5333

Loading…

[skip ci] libtorch agnostic FA3 north star proposal

#1685 opened May 28, 2025 by janeyx99 • Draft

Fix/deterministic dk dv

#1678 opened May 26, 2025 by yuWeiCute

Loading…

Previous 1 2 3 4 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!