Skip to content

Conversation

@hwchen2017
Copy link
Contributor

  • Remove useless kernel compilation
  • Set gradient accumulation fusion based on GPU type
  • Update GPT and LLAMA scripts to remove gradient accumulation fusion

@hwchen2017 hwchen2017 requested a review from tjruwase as a code owner February 14, 2025 07:32
Signed-off-by: Hongwei Chen <[email protected]>
@loadams loadams merged commit 223665c into master Mar 27, 2025
2 checks passed
hwchen2017 added a commit that referenced this pull request Jun 8, 2025
Signed-off-by: Hongwei Chen <[email protected]>
Co-authored-by: Hongwei Chen <hongweichen@ftqtmec25000002.taxzvufipdhelhupulxcbvr15f.ux.internal.cloudapp.net>
Co-authored-by: Logan Adams <[email protected]>
@hwchen2017 hwchen2017 deleted the hongwei_domino_amd branch June 12, 2025 18:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants