Support torch_fp8 #13196

cyita · 2025-05-29T08:25:45Z

Description

Add torch_fp8, torch_fp8_e5m2 and torch_fp8_e4m3 as low_bit options.

from ipex_llm import optimize_model

model = optimize_model(model, low_bit="torch_fp8", torch_dtype=dtype)
model = model.to("xpu")
output_fp8 = model.linear(input)
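
For readers unfamiliar with the two explicit variants, here is a minimal sketch of how the E4M3 and E5M2 bit layouts trade range for precision. It assumes that torch_fp8_e4m3 and torch_fp8_e5m2 correspond to PyTorch's `torch.float8_e4m3fn` and `torch.float8_e5m2` dtypes; the PR does not state this mapping, and it does not say which layout plain torch_fp8 selects. The `Fp8Format` class is hypothetical and is not part of ipex_llm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fp8Format:
    """Hypothetical helper describing an 8-bit float layout (not ipex_llm API)."""
    name: str
    exp_bits: int
    man_bits: int
    ieee_like: bool  # True: the top exponent code is reserved for inf/NaN

    @property
    def bias(self) -> int:
        return 2 ** (self.exp_bits - 1) - 1

    @property
    def max_finite(self) -> float:
        top = 2 ** self.exp_bits - 1
        if self.ieee_like:
            # Largest normal value: exponent code top-1 with an all-ones mantissa.
            exp = top - 1 - self.bias
            frac = 2 - 2.0 ** -self.man_bits
        else:
            # "fn" style: only the all-ones mantissa at the top exponent is NaN,
            # so the largest finite value uses the next mantissa down.
            exp = top - self.bias
            frac = 2 - 2.0 ** (1 - self.man_bits)
        return frac * 2.0 ** exp

    @property
    def min_normal(self) -> float:
        return 2.0 ** (1 - self.bias)

    @property
    def epsilon(self) -> float:
        # Spacing between 1.0 and the next representable value.
        return 2.0 ** -self.man_bits


# Assumed layouts, modelled on torch.float8_e4m3fn and torch.float8_e5m2.
E4M3 = Fp8Format("e4m3", exp_bits=4, man_bits=3, ieee_like=False)
E5M2 = Fp8Format("e5m2", exp_bits=5, man_bits=2, ieee_like=True)

for fmt in (E4M3, E5M2):
    print(fmt.name, fmt.max_finite, fmt.min_normal, fmt.epsilon)
```

Under these assumptions E4M3 has finer spacing near 1.0 but a much smaller range, while E5M2 covers a wider range with coarser steps. On a PyTorch build that ships these dtypes, `torch.finfo(torch.float8_e4m3fn)` and `torch.finfo(torch.float8_e5m2)` should report the same limits.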

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

N/A
Unit test: Please manually trigger the PR Validation here by entering the PR number (e.g., 1234), then paste your action link here once the run has finished successfully.
Application test
Document test
...

5. New dependencies

New Python dependencies
- Dependency1
- Dependency2
- ...
New Java/Scala dependencies and their license
- Dependency1 and license1
- Dependency2 and license2
- ...

python/llm/src/ipex_llm/transformers/low_bit_linear.py

rnwang04

LGTM.
This new type may need more restrictions, such as supporting only Linux, being used only on BMG, or handling save & load. These are not urgent, though; we can consider such issues later.
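
One of the restrictions mentioned above, limiting the new types to Linux, could be sketched as a small guard like the one below. This is purely illustrative: `check_low_bit_platform` and `TORCH_FP8_TYPES` are hypothetical names, the PR does not add such a check, and the BMG and save & load restrictions are not covered here.

```python
import platform
from typing import Optional

# The three low_bit names introduced by this PR.
TORCH_FP8_TYPES = {"torch_fp8", "torch_fp8_e5m2", "torch_fp8_e4m3"}


def check_low_bit_platform(low_bit: str, system: Optional[str] = None) -> None:
    """Hypothetical guard: reject torch_fp8 variants outside Linux.

    `system` defaults to platform.system(); it is a parameter only so the
    check can be exercised without changing the host OS.
    """
    system = system or platform.system()
    if low_bit in TORCH_FP8_TYPES and system != "Linux":
        raise ValueError(
            f"low_bit={low_bit!r} is assumed to be Linux-only, got {system!r}"
        )
```

A caller would run the guard before `optimize_model`, for example `check_low_bit_platform("torch_fp8")`, and let the `ValueError` surface to the user.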

qiyuangong

LGTM

cyita · 2025-06-04T12:07:35Z

PR validation https://github.com/intel-analytics/ipex-llm-workflow/actions/runs/15440629456

support torch_fp8

972d169

cyita commented May 29, 2025

python/llm/src/ipex_llm/transformers/low_bit_linear.py (resolved)

fix style

4e4de39

cyita requested review from rnwang04 and xiangyuT May 29, 2025 08:35

update

70b00d8

rnwang04 reviewed May 29, 2025

python/llm/src/ipex_llm/transformers/low_bit_linear.py (outdated, resolved)

update

9207a90

rnwang04 reviewed May 30, 2025

python/llm/src/ipex_llm/transformers/low_bit_linear.py (outdated, resolved)

cyita added 2 commits May 30, 2025 10:06

meet comment fix prefill

ab68c2a

fix style

fdf0ff0

rnwang04 approved these changes May 30, 2025

qiyuangong approved these changes May 30, 2025

cyita merged commit e032156 into intel:main Jun 4, 2025
1 check passed
