Integrating GPU support for OpenMP offloading directives into clang. Carlo Bertolli, Samuel F. Antao, et al. 2015. LLVM-HPC 2015.
Performance analysis of OpenMP on a GPU using a CORAL proxy application. Gheorghe-Teodor Bercea, Carlo Bertolli, et al. 2015. PMBS/SC 2015.
Generalizing run-time tiling with the loop chain abstraction. Michelle Mills Strout, Fabio Luporini, et al. 2014. IPDPS 2014.