
Conversation

guangzlu

Motivation

Found a bug when using FP8 FLA together with torch.compile.

Technical Details

When the QKV tensors are FP8, the output tensor `out` should be allocated as BF16 rather than inheriting the FP8 dtype.
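
A minimal sketch of that rule, outside the actual kernel wrapper. `select_out_dtype` is an illustrative helper (not from this repo), and `torch.float8_e4m3fn` / `torch.bfloat16` stand in for the repo's `dtypes.fp8` / `dtypes.bf16` aliases:

```python
import torch

def select_out_dtype(q: torch.Tensor) -> torch.dtype:
    # FP8 is an input/storage format here; the attention output buffer
    # should be allocated in BF16 rather than inheriting q.dtype.
    if q.dtype in (torch.float8_e4m3fn, torch.float8_e5m2):
        return torch.bfloat16
    return q.dtype

batch_size, seqlen_q, num_heads, head_size_v = 2, 128, 8, 64
q = torch.empty(batch_size, seqlen_q, num_heads, head_size_v, dtype=torch.float8_e4m3fn)
out = torch.empty(
    (batch_size, seqlen_q, num_heads, head_size_v),
    dtype=select_out_dtype(q),
    device=q.device,
)
assert out.dtype == torch.bfloat16
```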

Copilot AI review requested due to automatic review settings · October 13, 2025 06:02
Copilot AI (Contributor) left a comment

Pull Request Overview

This PR fixes a torch.compile bug that occurs when FP8 data types are used with Flash Attention (FLA). The fix ensures that when the QKV tensors are in FP8 format, the output tensor is created with the BF16 data type instead of inheriting the FP8 type.

  • Added conditional logic to handle FP8 input tensors by creating BF16 output tensors
  • Maintains existing behavior for non-FP8 data types


Comment on lines 113 to +123

      else:
-         out = torch.empty(
-             (batch_size, seqlen_q, num_heads, head_size_v),
-             dtype=q.dtype,
-             device=q.device,
-             requires_grad=q.requires_grad,
-         )
+         if q.dtype == dtypes.fp8:
+             out = torch.empty(
+                 (batch_size, seqlen_q, num_heads, head_size_v),
+                 dtype=dtypes.bf16,
+                 device=q.device,
+                 requires_grad=q.requires_grad,
+             )
+         else:
+             out = torch.empty(
+                 (batch_size, seqlen_q, num_heads, head_size_v),
+                 dtype=q.dtype,
+                 device=q.device,
+                 requires_grad=q.requires_grad,
+             )

Copilot AI Oct 13, 2025

The nested if-else structure creates duplicated tensor creation logic. Consider restructuring to determine the output dtype first, then create the tensor once to reduce code duplication.
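
One possible shape for that refactor, sketched against the names visible in the diff above (`dtypes.fp8` / `dtypes.bf16` are assumed to be the repo's dtype aliases; this is not an actual follow-up commit):

```python
# Decide the output dtype once, then allocate the tensor a single time.
out_dtype = dtypes.bf16 if q.dtype == dtypes.fp8 else q.dtype
out = torch.empty(
    (batch_size, seqlen_q, num_heads, head_size_v),
    dtype=out_dtype,
    device=q.device,
    requires_grad=q.requires_grad,
)
```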
