Publication details

Automatic Fusions of CUDA-GPU Kernels for Parallel Map

Authors

FOUSEK Jan FILIPOVIČ Jiří MADZIN Matúš

Year of publication 2011
Type Article in Periodical
Magazine / Source Computer architecture news
MU Faculty or unit

Faculty of Informatics

Citation
Field Informatics
Keywords GPU;CUDA;kernels fusion;map;mapped function
Description When implementing a function mapping on the contemporary GPU, several contradictory performance factors affecting distribution of computation into GPU kernels have to be balanced. A decomposition-fusion scheme suggests to decompose the computational problem to be sol\-ved by several simple functions implemented as standalone kernels and to fuse some of these functions later into more complex kernels to improve memory locality. In this paper, a prototype of source-to-source compiler automating the fusion phase is presented and the impact of fusions generated by the compiler as well as compiler efficiency is experimentally evaluated.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.

More info