6533b856fe1ef96bd12b2ec6

RESEARCH PRODUCT

On the Use of GPU for Accelerating Communication-Aware Mapping Techniques

Guillermo ViguerasJuan M. Orduña

subject

General Computer Sciencebusiness.industryComputer scienceGraphics processing unit02 engineering and technologyParallel computingSupercomputer020202 computer hardware & architectureAcceleration0202 electrical engineering electronic engineering information engineeringTechnology integration020201 artificial intelligence & image processingLocal search (optimization)Mapping techniquesArchitecturebusiness

description

Different communication-aware mapping techniques were proposed in recent years for improving the performance of distributed systems based on both, off-chip and on-chip networks. Some of these proposals were based on heuristic search for finding pseudo-optimal assignments of tasks and processing elements. However, the technology integration improvements have allowed a significant increase in the number of network nodes, requiring the acceleration of the heuristic search. In this paper, we propose a comparative study of the local search method used in a communication-aware mapping technique, when implemented on different parallel architectures. We compare the performance provided by a version of the local search method when executed on a single Graphics Processing Unit (GPU) with the one provided by the MPI version executed on a supercomputer with the same theoretical performance of the GPU platform, in order to study a fair scenario. We have considered a GPU based on the Fermi architecture, evaluating the improvements achieved by some new architectural features of this platform. The results show that a mixed parallel implementation on a single GPU outperforms the MPI implementation of the local search method. These results validate the GPU implementation as a very cost-effective accelerator for the local search method.

https://doi.org/10.1093/comjnl/bxv037