-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Open
Labels
ep:WebGPUort-web webgpu providerort-web webgpu providerplatform:webissues related to ONNX Runtime web; typically submitted using templateissues related to ONNX Runtime web; typically submitted using template
Description
From my local debugging when running a GenAI model with ort-web WebGPU EP, I find the context.HasFeature(wgpu::FeatureName::Subgroups)
in following file always return false
. Thus the ApplyFlashAttention()
will never be invoked.
context.HasFeature(wgpu::FeatureName::Subgroups) && |
However, I can get this feature from dev tool:

The ort-web I built was based on commit c1cbe08
Metadata
Metadata
Assignees
Labels
ep:WebGPUort-web webgpu providerort-web webgpu providerplatform:webissues related to ONNX Runtime web; typically submitted using templateissues related to ONNX Runtime web; typically submitted using template