6533b7d2fe1ef96bd125f575

RESEARCH PRODUCT

Real-time computation of parameter fitting and image reconstruction using graphical processing units

Werner LustermannAndreas SuterGünther DissertoriQiulin WangJannis FischerUldis LocansUldis LocansAndreas Adelmann

subject

FOS: Computer and information sciencesMulti-core processorSpeedup010308 nuclear & particles physicsComputer scienceComputationFOS: Physical sciencesGeneral Physics and AstronomyIterative reconstructionComputational Physics (physics.comp-ph)Supercomputer01 natural sciences030218 nuclear medicine & medical imagingComputational science03 medical and health sciencesRange (mathematics)CUDA0302 clinical medicineComputer Science - Distributed Parallel and Cluster ComputingHardware and Architecture0103 physical sciencesSingle-coreDistributed Parallel and Cluster Computing (cs.DC)Physics - Computational Physics

description

Abstract In recent years graphical processing units (GPUs) have become a powerful tool in scientific computing. Their potential to speed up highly parallel applications brings the power of high performance computing to a wider range of users. However, programming these devices and integrating their use in existing applications is still a challenging task. In this paper we examined the potential of GPUs for two different applications. The first application, created at Paul Scherrer Institut (PSI), is used for parameter fitting during data analysis of μ SR (muon spin rotation, relaxation and resonance) experiments. The second application, developed at ETH, is used for PET (Positron Emission Tomography) image reconstruction and analysis. Applications currently in use were examined to identify parts of the algorithms in need of optimization. Efficient GPU kernels were created in order to allow applications to use a GPU, to speed up the previously identified parts. Benchmarking tests were performed in order to measure the achieved speedup. During this work, we focused on single GPU systems to show that real time data analysis of these problems can be achieved without the need for large computing clusters. The results show that the currently used application for parameter fitting, which uses OpenMP to parallelize calculations over multiple CPU cores, can be accelerated around 40 times through the use of a GPU. The speedup may vary depending on the size and complexity of the problem. For PET image analysis, the obtained speedups of the GPU version were more than × 40 larger compared to a single core CPU implementation. The achieved results show that it is possible to improve the execution time by orders of magnitude.

https://doi.org/10.1016/j.cpc.2017.02.007