Skip to content

Conversation

@ckl117
Copy link
Contributor

@ckl117 ckl117 commented Sep 6, 2024

PR types

New features

PR changes

Others

Description

FP8 gemm自动调优

@paddle-bot
Copy link

paddle-bot bot commented Sep 6, 2024

Thanks for your contribution!

@CLAassistant
Copy link

CLAassistant commented Sep 6, 2024

CLA assistant check
All committers have signed the CLA.

@ckl117 ckl117 changed the title Fp8 cutlass tune [Inference] FP8 gemm auto-tune Sep 6, 2024
os.makedirs(cutlass_dir)
clone_git_repo("v3.5.0", "https://github.com/NVIDIA/cutlass.git", cutlass_dir)

json_dir = "third_party/nlohmann_json"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

把cutlass也挪到third_party里去

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@codecov
Copy link

codecov bot commented Sep 9, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.34%. Comparing base (2f31866) to head (0453e8e).
Report is 227 commits behind head on develop.

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #9094   +/-   ##
========================================
  Coverage    53.34%   53.34%           
========================================
  Files          652      652           
  Lines       105401   105401           
========================================
  Hits         56222    56222           
  Misses       49179    49179           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@ckl117 ckl117 force-pushed the fp8_cutlass_tune branch 2 times, most recently from 00d2585 to 81971ff Compare September 9, 2024 08:26
yuanlehome
yuanlehome previously approved these changes Sep 9, 2024
@qingqing01 qingqing01 merged commit 3675ea2 into PaddlePaddle:develop Sep 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants