What are GCC’s intrinsics for loading 4 ints into __m128 and 8 ints into __m256 (aligned/unaligned)? What about unsigned ints?
Using Intel’s SSE/AVX intrinsics, the ones you’re looking for are:
_mm_load_si128() (128-bit, pointer must be 16-byte aligned)
_mm_loadu_si128() (128-bit, unaligned)
_mm256_load_si256() (256-bit, pointer must be 32-byte aligned)
_mm256_loadu_si256() (256-bit, unaligned)
There’s no distinction between signed and unsigned. You’ll need to cast the pointer to __m128i* or __m256i*.
Note that these are Intel’s intrinsics and will work in GCC, Clang, MSVC, and ICC.
The GCC-specific intrinsics work only in, well, GCC, AFAIK.
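To make the pointer cast concrete, here is a minimal sketch in C. The helper names (copy4_aligned, copy4_unaligned, copy4_unsigned, copy8_aligned, copy8_unaligned) are made up for illustration and are not part of any library. Each one loads a vector and stores it straight back so the result can be inspected. The 256-bit helpers are guarded by __AVX__, on the assumption that you build with something like -mavx when you want them.

```c
#include <assert.h>     /* assert, used to check alignment */
#include <immintrin.h>  /* SSE2 and AVX intrinsics */
#include <stdint.h>     /* uintptr_t */

/* Hypothetical helper: aligned 128-bit load of 4 ints.
   src must be 16-byte aligned, otherwise the load may fault. */
static inline void copy4_aligned(const int *src, int *dst) {
    assert((uintptr_t)src % 16 == 0);
    __m128i v = _mm_load_si128((const __m128i *)src);
    _mm_storeu_si128((__m128i *)dst, v);
}

/* Hypothetical helper: unaligned 128-bit load of 4 ints. */
static inline void copy4_unaligned(const int *src, int *dst) {
    __m128i v = _mm_loadu_si128((const __m128i *)src);
    _mm_storeu_si128((__m128i *)dst, v);
}

/* Unsigned ints use the same intrinsic and the same cast;
   __m128i carries no signedness. */
static inline void copy4_unsigned(const unsigned *src, unsigned *dst) {
    __m128i v = _mm_loadu_si128((const __m128i *)src);
    _mm_storeu_si128((__m128i *)dst, v);
}

#ifdef __AVX__
/* Hypothetical helper: aligned 256-bit load of 8 ints.
   src must be 32-byte aligned. */
static inline void copy8_aligned(const int *src, int *dst) {
    assert((uintptr_t)src % 32 == 0);
    __m256i v = _mm256_load_si256((const __m256i *)src);
    _mm256_storeu_si256((__m256i *)dst, v);
}

/* Hypothetical helper: unaligned 256-bit load of 8 ints. */
static inline void copy8_unaligned(const int *src, int *dst) {
    __m256i v = _mm256_loadu_si256((const __m256i *)src);
    _mm256_storeu_si256((__m256i *)dst, v);
}
#endif
```

If you can’t guarantee the alignment of the source pointer, the loadu variants are the safe choice; the aligned variants are only valid when the address really is a multiple of 16 (or 32 for the 256-bit loads).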