Is it safe/possible/advisable to cast floats directly to __m128 if they are 16 byte

Question

0

Editorial Team

Asked: June 9, 20262026-06-09T11:58:23+00:00 2026-06-09T11:58:23+00:00

Is it safe/possible/advisable to cast floats directly to __m128 if they are 16 byte

0

Is it safe/possible/advisable to cast floats directly to __m128 if they are 16 byte aligned?

I noticed using _mm_load_ps and _mm_store_ps to “wrap” a raw array adds a significant overhead.

What are potential pitfalls I should be aware of?

EDIT :

There is actually no overhead in using the load and store instructions, I got some numbers mixed and that is why I got better performance. Even thou I was able to do some HORRENDOUS mangling with raw memory addresses in a __m128 instance, when I ran the test it took TWICE AS LONG to complete without the _mm_load_ps instruction, probably falling back to some fail safe code path.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-09T11:58:24+00:00

Editorial Team

2026-06-09T11:58:24+00:00Added an answer on June 9, 2026 at 11:58 am

What makes you think that _mm_load_ps and _mm_store_ps “add a significant overhead” ? This is the normal way to load/store float data to/from SSE registers assuming source/destination is memory (and any other method eventually boils down to this anyway).

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Is it safe/possible/advisable to cast floats directly to __m128 if they are 16 byte

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply