I’m looking at an application that does not require 32bit of precision, 12-16bit will be enough.
Is thrust capable of dealing with float16’s (i.e. packing/unpacking two 16bit precision floats into a 32bit word)?
Should I use fixed-point arithmetic?
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
CUDA hardware does not include native support for
half-precision arithmetic, only conversion to and fromfloat.Since C does not have a built-in
halftype, the conversion intrinsics useunsigned short: