Evaluation of the Intel Xeon Phi Co-processor to accelerate the sensitivity map calculation for PET imaging

T. Dey; P. Rodrigue

doi:10.1088/1748-0221/10/07/C07011

Journal of Instrumentation

Evaluation of the Intel Xeon Phi Co-processor to accelerate the sensitivity map calculation for PET imaging

T. Dey¹ and P. Rodrigue¹

Published 21 July 2015 • © 2015 IOP Publishing Ltd and Sissa Medialab srl
Journal of Instrumentation, Volume 10, July 2015 Citation T. Dey and P. Rodrigue 2015 JINST 10 C07011 DOI 10.1088/1748-0221/10/07/C07011

Download Article PDF

Article metrics

36 Total downloads

Permissions

Get permission to re-use this article

Author affiliations

¹ Oncology Solutions, Philips Research High Tech Campus, 5656 AE Eindhoven, the Netherlands

Dates

Received 26 March 2015
Accepted 18 May 2015
Published 21 July 2015

Buy this article in print

Journal RSS

Sign up for new issue notifications

Abstract

We aim to evaluate the Intel Xeon Phi coprocessor for acceleration of 3D Positron Emission Tomography (PET) image reconstruction. We focus on the sensitivity map calculation as one computational intensive part of PET image reconstruction, since it is a promising candidate for acceleration with the Many Integrated Core (MIC) architecture of the Xeon Phi. The computation of the voxels in the field of view (FoV) can be done in parallel and the 10³ to 10⁴ samples needed to calculate the detection probability of each voxel can take advantage of vectorization.

We use the ray tracing kernels of the Embree project to calculate the hit points of the sample rays with the detector and in a second step the sum of the radiological path taking into account attenuation is determined. The core components are implemented using the Intel single instruction multiple data compiler (ISPC) to enable a portable implementation showing efficient vectorization either on the Xeon Phi and the Host platform. On the Xeon Phi, the calculation of the radiological path is also implemented in hardware specific intrinsic instructions (so-called `intrinsics') to allow manually-optimized vectorization. For parallelization either OpenMP and ISPC tasking (based on pthreads) are evaluated.Our implementation achieved a scalability factor of 0.90 on the Xeon Phi coprocessor (model 5110P) with 60 cores at 1 GHz. Only minor differences were found between parallelization with OpenMP and the ISPC tasking feature. The implementation using intrinsics was found to be about 12% faster than the portable ISPC version. With this version, a speedup of 1.43 was achieved on the Xeon Phi coprocessor compared to the host system (HP SL250s Gen8) equipped with two Xeon (E5-2670) CPUs, with 8 cores at 2.6 to 3.3 GHz each. Using a second Xeon Phi card the speedup could be further increased to 2.77. No significant differences were found between the results of the different Xeon Phi and the Host implementations. The examination showed that a reasonable speedup of sensitivity map calculation could be achieved on the Xeon Phi either by a portable or a hardware specific implementation.

Export citation and abstract BibTeX RIS

Previous article in issue

Next article in issue

Evaluation of the Intel Xeon Phi Co-processor to accelerate the sensitivity map calculation for PET imaging

Article metrics

Permissions

Share this article

Author affiliations

Dates

Abstract