Implement AVX512_FP16 #1605

sayantn · 2024-07-02T11:21:52Z

This PR adds the AVX512_FP16 intrinsics in Rust. These intrinsics will be behind the feature gate #[feature(stdarch_x86_avx512_f16)] (rust-lang/rust#127213).

Progress:

This also adds some missing inlining in avx512ifma and updates the x86-intel.xml file to v3.6.9

The set1_pch intrinsics were not implemented due to a lack of complex number type.
cmpph and fpclassph intrinsics use inline asm because of no i1 support yet.

rustbot · 2024-07-02T11:21:56Z

r? @Amanieu

rustbot has assigned @Amanieu.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

tgross35 · 2024-07-04T20:05:51Z

Have you run into any weird behavior with these, or do things seem to be working smoothly? (ignoring the ABI issue for system function calls, that is)

sayantn · 2024-07-04T20:08:58Z

No problems yet, just that simd_fabs doesn't accept a f16 argument, so i will just use an and operation. I am actively avoiding doing f16 operations in rust, but that's not a blocker for sure.

bors · 2024-07-06T09:02:25Z

☔ The latest upstream changes (presumably 3dd9579) made this pull request unmergeable. Please resolve the merge conflicts.

sayantn · 2024-07-17T13:22:00Z

cc @tgross35 @beetrees

crates/core_arch/src/simd.rs

crates/core_arch/src/x86/test.rs

Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`

Reciprocal, RSqrt, Sqrt, Max, Min

`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`

`cmpph`, `fpclass`, reduce, `blend`, `permutex`

Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions

rustbot assigned Amanieu Jul 2, 2024

sayantn force-pushed the fp16 branch from 2f2dac7 to d5e5ea3 Compare July 3, 2024 18:46

sayantn force-pushed the fp16 branch 4 times, most recently from 91e0971 to 403897c Compare July 12, 2024 07:11

tgross35 mentioned this pull request Jul 12, 2024

Add f16 and f128 as simd types in LLVM rust-lang/rust#127487

Merged

sayantn force-pushed the fp16 branch 3 times, most recently from c9588c5 to e907eba Compare July 15, 2024 17:17

sayantn marked this pull request as ready for review July 17, 2024 13:18

sayantn mentioned this pull request Jul 17, 2024

Tracking Issue for AVX512_FP16 intrinsics rust-lang/rust#127213

Open

2 tasks

tgross35 mentioned this pull request Jul 18, 2024

Tracking Issue for f16 and f128 float types rust-lang/rust#116909

Open

92 tasks

Amanieu reviewed Jul 25, 2024

View reviewed changes

crates/core_arch/src/simd.rs Outdated Show resolved Hide resolved

crates/core_arch/src/x86/test.rs Outdated Show resolved Hide resolved

sayantn added 11 commits July 26, 2024 08:55

AVX512FP16 Part 0: Types

ac370a7

AVX512FP16 Part 1

1b093be

Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`

AVX512_FP16 Part 2: Complex Multiplication

bf92f83

AVX512FP16 Part 3: FMA

0bec23b

AVX512FP16 Part 4: Math functions

e6a5910

Reciprocal, RSqrt, Sqrt, Max, Min

AVX512FP16 Part 5: FP-Support

4872108

`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`

AVX512FP16 Part 6: Remaining

d304918

`cmpph`, `fpclass`, reduce, `blend`, `permutex`

AVX512FP16 Part 7: Convert to f16

2ae57f0

AVX512FP16 Part 8: Convert from f16

57641cc

AVX512FP16 Part 9: Remaining avx512fp16 and avxneconvert

cf01aba

Update Intrinsics List to v3.6.9

8a5e971

Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions

sayantn force-pushed the fp16 branch from 81512ca to 8a5e971 Compare July 26, 2024 03:26

Amanieu merged commit fb90dfa into rust-lang:master Jul 26, 2024

sayantn deleted the fp16 branch July 27, 2024 11:06

tgross35 mentioned this pull request Aug 20, 2024

Add SIMD operations that use f16 and f128 rust-lang/rust#125440

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement AVX512_FP16 #1605

Implement AVX512_FP16 #1605

Uh oh!

sayantn commented Jul 2, 2024 •

edited

Loading

Uh oh!

rustbot commented Jul 2, 2024

Uh oh!

tgross35 commented Jul 4, 2024

Uh oh!

sayantn commented Jul 4, 2024

Uh oh!

bors commented Jul 6, 2024

Uh oh!

sayantn commented Jul 17, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Implement AVX512_FP16 #1605

Implement AVX512_FP16 #1605

Uh oh!

Conversation

sayantn commented Jul 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Progress:

Uh oh!

rustbot commented Jul 2, 2024

Uh oh!

tgross35 commented Jul 4, 2024

Uh oh!

sayantn commented Jul 4, 2024

Uh oh!

bors commented Jul 6, 2024

Uh oh!

sayantn commented Jul 17, 2024

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

sayantn commented Jul 2, 2024 •

edited

Loading