-
Notifications
You must be signed in to change notification settings - Fork 304
Implement AVX512_FP16 #1605
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement AVX512_FP16 #1605
Conversation
|
Have you run into any weird behavior with these, or do things seem to be working smoothly? (ignoring the ABI issue for system function calls, that is) |
|
No problems yet, just that |
|
☔ The latest upstream changes (presumably 3dd9579) made this pull request unmergeable. Please resolve the merge conflicts. |
91e0971 to
403897c
Compare
c9588c5 to
e907eba
Compare
Add-Sub-Mul-Div, Load-Store-Move, `comi`, `set`
Reciprocal, RSqrt, Sqrt, Max, Min
`getexp`, `getmant`, `roundscale`, `scalef`, `reduce`
`cmpph`, `fpclass`, reduce, `blend`, `permutex`
Add `#[inline]` to avx512ifma intrinsics Fix the test equality. Remove the stability attributes in simd types and test functions
This PR adds the AVX512_FP16 intrinsics in Rust. These intrinsics will be behind the feature gate
#[feature(stdarch_x86_avx512_f16)](rust-lang/rust#127213).Progress:
getexp,getmant,scale)This also adds some missing inlining in
avx512ifmaand updates thex86-intel.xmlfile to v3.6.9The
set1_pchintrinsics were not implemented due to a lack of complex number type.cmpphandfpclassphintrinsics use inline asm because of noi1support yet.