For my term paper, I am required to study how parallel compilers esp those used in GPUs perform task mapping and the various heuristics used to perform data mapping/alignment.
Any pointers to papers covering existing literature, new trends would be immensely helpful and appreciated.
Best,
Subramanian
NVIDIA now uses the open-source LLVM compiler for CUDA. You will find here LLVM related publications.