Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove Kahan algorithm in _mm_dp_ps #597

Merged
merged 1 commit into from
May 18, 2023

Conversation

Cuda-Chen
Copy link
Collaborator

@Cuda-Chen Cuda-Chen commented May 18, 2023

Remove Kahan algorithm in _mm_dp_ps to align conversion result with SSE.

Also, apply shortcut when immediate is 0xXF and target is ARMv8-A.

Last, add more tests for testing possible combinations of immediate
including 0xXF.

Close #595.

@Cuda-Chen Cuda-Chen marked this pull request as ready for review May 18, 2023 04:17
Copy link
Member

@jserv jserv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consolidate the reason to drop Kahan algorithm in git commit messages.
Check https://cbea.ms/git-commit/ for general rules.

Remove Kahan algorithm in _mm_dp_ps to align conversion result with SSE.

Also, apply shortcut when immediate is 0xXF and target is ARMv8-A.

Last, add more tests for testing possible combinations of immediate
including 0xXF.

Close DLTcollab#595.
@Cuda-Chen Cuda-Chen changed the title Improve _mm_dp_ps and its test cases Remove Kahan algorithm in _mm_dp_ps May 18, 2023
@Cuda-Chen Cuda-Chen requested a review from jserv May 18, 2023 05:36
@jserv jserv merged commit 39d8540 into DLTcollab:master May 18, 2023
12 checks passed
@jserv
Copy link
Member

jserv commented May 18, 2023

Thank @Cuda-Chen for contributing!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

_mm_dp_ps does not always match x86_64
2 participants