@zhouyu5 zhouyu5 commented May 23, 2025

This PR adds a max_encoder_seq_len attribute to attn_metadata for encoder-decoder models, which simplifies the mllama code. Accuracy and throughput are not affected by this PR.
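
For readers without the diff at hand, here is a rough sketch of the idea, not the actual change. It assumes the metadata object already carries per-sequence encoder lengths; the class `EncDecAttnMetadata`, the field `encoder_seq_lens`, and both helper functions are hypothetical names used only for illustration. The point is that the maximum is computed once when the metadata is built, so model code can read it instead of recomputing it.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class EncDecAttnMetadata:
    """Hypothetical stand-in for attn_metadata; not the real class from the PR."""
    # Assumed field: one encoder length per sequence in the batch.
    encoder_seq_lens: Optional[List[int]] = None
    # The attribute this PR describes adding.
    max_encoder_seq_len: Optional[int] = None


def build_metadata(encoder_seq_lens: List[int]) -> EncDecAttnMetadata:
    """Compute the maximum encoder length once, at metadata build time."""
    return EncDecAttnMetadata(
        encoder_seq_lens=list(encoder_seq_lens),
        max_encoder_seq_len=max(encoder_seq_lens, default=0),
    )


def padded_encoder_len(meta: EncDecAttnMetadata) -> int:
    """Model-side code reads the precomputed value instead of deriving it again."""
    return meta.max_encoder_seq_len or 0


meta = build_metadata([3, 7, 5])
print(padded_encoder_len(meta))
```

With this shape, any per-model logic that previously derived the longest encoder sequence on its own could be replaced by a single attribute read.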

zhouyu5 commented May 23, 2025

/run-gaudi-tests

Signed-off-by: zhouyu5 <[email protected]>

zhouyu5 commented May 23, 2025

/run-gaudi-tests

@adobrzyn

/run-gaudi-tests

@jikunshang jikunshang enabled auto-merge (squash) May 26, 2025 00:53
@jikunshang jikunshang merged commit 1fb633a into habana_main May 26, 2025
49 checks passed
@jikunshang jikunshang deleted the dev/yu/enc-dec-max-seq-len branch May 26, 2025 00:53
4 participants