Merged

[NVIDIA] Support Cutlass w8a8 FP8 for Blackwell Geforce GPUs (sm120) #17280

Merge branch 'main' into kaln27/main (e30f82c)

Mergify / Summary succeeded Jul 2, 2025 in 0s

2 rules match and 16 potential rules

⚠️ The pull request has been merged by @mgoin

Rule: label-documentation (label)

  • any of:
    • files~=^[^/]+\.md$
    • files~=^docs/
    • files~=^examples/

✅ Rule: label-ci-build (label)

  • any of:
    • files=CMakeLists.txt
    • files=setup.py
    • files~=\.buildkite/
    • files~=^\.github/
    • files~=^cmake/
    • files~=^docker/Dockerfile
    • files~=^requirements.*\.txt

Rule: label-deepseek (label)

  • any of:
    • files~=^examples/.*deepseek.*\.py
    • files~=^tests/.*deepseek.*\.py
    • files~=^vllm/entrypoints/openai/tool_parsers/.*deepseek.*\.py
    • files~=^vllm/model_executor/models/.*deepseek.*\.py
    • files~=^vllm/reasoning/.*deepseek.*\.py
    • files~=^vllm/transformers_utils/.*deepseek.*\.py
    • title~=(?i)DeepSeek

Rule: label-frontend (label)

  • files~=^vllm/entrypoints/

Rule: label-llama (label)

  • any of:
    • files~=^examples/.*llama.*\.py
    • files~=^tests/.*llama.*\.py
    • files~=^vllm/entrypoints/openai/tool_parsers/llama.*\.py
    • files~=^vllm/model_executor/models/.*llama.*\.py
    • files~=^vllm/transformers_utils/configs/.*llama.*\.py
    • title~=(?i)llama

Rule: label-multi-modality (label)

  • any of:
    • files=tests/models/test_vision.py
    • files~=^tests/models/*/audio_language/
    • files~=^tests/models/*/vision_language/
    • files~=^tests/models/multimodal/
    • files~=^tests/multimodal/
    • files~=^vllm/multimodal/

Rule: label-performance (label)

  • any of:
    • files~=^\.buildkite/nightly-benchmarks/
    • files~=^benchmarks/
    • files~=^tests/benchmarks/
    • files~=^vllm/benchmarks/

Rule: label-qwen (label)

  • any of:
    • files~=^examples/.*qwen.*\.py
    • files~=^tests/.*qwen.*\.py
    • files~=^vllm/model_executor/models/.*qwen.*\.py
    • files~=^vllm/reasoning/.*qwen.*\.py
    • title~=(?i)Qwen

Rule: label-rocm (label)

  • any of:
    • files=vllm/platforms/rocm.py
    • files~=^csrc/rocm/
    • files~=^docker/Dockerfile.rocm
    • files~=^requirements/rocm.*\.txt
    • files~=^tests/kernels/.*_rocm.*\.py
    • files~=^vllm/attention/backends/rocm.*\.py
    • files~=^vllm/attention/ops/rocm.*\.py
    • files~=^vllm/model_executor/layers/fused_moe/rocm.*\.py
    • files~=^vllm/v1/attention/backends/mla/rocm.*\.py
    • title~=(?i)AMD
    • title~=(?i)ROCm

Rule: label-structured-output (label)

  • any of:
    • files=benchmarks/benchmark_serving_structured_output.py
    • files=benchmarks/run_structured_output_benchmark.sh
    • files=docs/features/structured_outputs.md
    • files=examples/offline_inference/structured_outputs.py
    • files=examples/online_serving/openai_chat_completion_structured_outputs.py
    • files=examples/online_serving/openai_chat_completion_structured_outputs_with_reasoning.py
    • files=tests/entrypoints/llm/test_guided_generate.py
    • files=tests/model_executor/test_guided_processors.py
    • files=tests/v1/entrypoints/llm/test_guided_generate.py
    • files~=^benchmarks/structured_schemas/
    • files~=^tests/v1/structured_output/
    • files~=^vllm/model_executor/guided_decoding/
    • files~=^vllm/v1/structured_output/

Rule: label-speculative-decoding (label)

  • any of:
    • files=vllm/model_executor/layers/spec_decode_base_sampler.py
    • files~=^tests/spec_decode/
    • files~=^vllm/spec_decode/

Rule: label-v1 (label)

  • any of:
    • files~=^tests/v1/
    • files~=^vllm/v1/

Rule: label-tpu (label)

  • any of:
    • files~=/tpu/
    • files~=_tpu
    • files~=pallas
    • files~=tpu.py
    • files~=tpu_

✅ Rule: label-tpu-remove (label)

  • all of:
    • -files~=/tpu/
    • -files~=_tpu
    • -files~=pallas
    • -files~=tpu.py
    • -files~=tpu_

Rule: label-tool-calling (label)

  • any of:
    • files=docs/features/tool_calling.md
    • files=examples/offline_inference/chat_with_tools.py
    • files=examples/online_serving/openai_chat_completion_client_with_tools.py
    • files=examples/online_serving/openai_chat_completion_client_with_tools_required.py
    • files=examples/online_serving/openai_chat_completion_tool_calls_with_reasoning.py
    • files=tests/entrypoints/openai/test_chat_with_tool_reasoning.py
    • files~=^examples/tool_chat_*
    • files~=^tests/entrypoints/openai/tool_parsers/
    • files~=^tests/mistral_tool_use/
    • files~=^tests/tool_use/
    • files~=^vllm/entrypoints/openai/tool_parsers/

Rule: ping author on conflicts and add 'needs-rebase' label (comment, label)

  • -closed
  • conflict

Rule: assign reviewer for tensorizer changes (assign)

  • files~=^tests/entrypoints/openai/test_tensorizer_entrypoint.py
  • files~=^tests/tensorizer_loader/
  • files~=^vllm/model_executor/model_loader/tensorizer.py
  • files~=^vllm/model_executor/model_loader/tensorizer_loader.py

Rule: remove 'needs-rebase' label when conflict is resolved (label)

  • -closed
  • -conflict
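
For reference, these rules are not produced by the bot ad hoc; they come from the repository's declarative Mergify configuration. A minimal sketch of how the matched label-ci-build rule above could be expressed (the .github/mergify.yml path and the ci/build label name are assumptions for illustration):

    # Hypothetical excerpt from .github/mergify.yml (path and label name assumed).
    # The conditions mirror the label-ci-build entry listed above.
    pull_request_rules:
      - name: label-ci-build
        conditions:
          - or:
              - files=CMakeLists.txt
              - files=setup.py
              - files~=\.buildkite/
              - files~=^\.github/
              - files~=^cmake/
              - files~=^docker/Dockerfile
              - files~=^requirements.*\.txt
        actions:
          label:
            add:
              - ci/build

Mergify evaluates each rule's conditions against the pull request (changed files, title, state) and applies the rule's actions only when they hold, which is how the matched-versus-potential distinction above is derived.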

💖  Mergify is proud to provide this service for free to open source projects.

🚀  You can help us by becoming a sponsor!


Mergify commands and options

More conditions and actions can be found in the documentation.

You can also trigger Mergify actions by commenting on this pull request:

  • @Mergifyio refresh will re-evaluate the rules
  • @Mergifyio rebase will rebase this PR on its base branch
  • @Mergifyio update will merge the base branch into this PR
  • @Mergifyio backport <destination> will backport this PR on <destination> branch
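
For instance, a rebase could be requested with a comment like the one below; the GitHub CLI invocation and the vllm-project/vllm repository slug are shown only as an assumed example, and posting the same text directly in the PR conversation has the same effect:

    # Hypothetical example: post a Mergify command as a PR comment via the GitHub CLI.
    gh pr comment 17280 --repo vllm-project/vllm --body "@Mergifyio rebase"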

Additionally, on the Mergify dashboard you can:

  • look at your merge queues
  • generate the Mergify configuration with the config editor.

Finally, you can contact us at https://mergify.com