TY - JOUR
T1 - On the usage of GPUs for efficient motion estimation in medical image sequences
AU - Thiyagalingam, Jeyarajan
AU - Goodman, Daniel
AU - Schnabel, Julia A.
AU - Trefethen, Anne
AU - Grau, Vicente
PY - 2011
Y1 - 2011
N2 - Images are ubiquitous in biomedical applications from basic research to clinical practice. With the rapid increase in resolution, dimensionality of the images and the need for real-time performance in many applications, computational requirements demand proper exploitation of multicore architectures. Towards this, GPU-specific implementations of image analysis algorithms are particularly promising. In this paper, we investigate the mapping of an enhanced motion estimation algorithm to novel GPU-specific architectures, the resulting challenges and benefits therein. Using a database of three-dimensional image sequences, we show that the mapping leads to substantial performance gains, up to a factor of 60, and can provide near-real-time experience. We also show how architectural peculiarities of these devices can be best exploited in the benefit of algorithms, most specifically for addressing the challenges related to their access patterns and different memory configurations. Finally, we evaluate the performance of the algorithm on three different GPU architectures and perform a comprehensive analysis of the results.
AB - Images are ubiquitous in biomedical applications from basic research to clinical practice. With the rapid increase in resolution, dimensionality of the images and the need for real-time performance in many applications, computational requirements demand proper exploitation of multicore architectures. Towards this, GPU-specific implementations of image analysis algorithms are particularly promising. In this paper, we investigate the mapping of an enhanced motion estimation algorithm to novel GPU-specific architectures, the resulting challenges and benefits therein. Using a database of three-dimensional image sequences, we show that the mapping leads to substantial performance gains, up to a factor of 60, and can provide near-real-time experience. We also show how architectural peculiarities of these devices can be best exploited in the benefit of algorithms, most specifically for addressing the challenges related to their access patterns and different memory configurations. Finally, we evaluate the performance of the algorithm on three different GPU architectures and perform a comprehensive analysis of the results.
UR - http://www.scopus.com/inward/record.url?scp=80053467671&partnerID=8YFLogxK
U2 - 10.1155/2011/137604
DO - 10.1155/2011/137604
M3 - Article
AN - SCOPUS:80053467671
SN - 1687-4188
VL - 2011
JO - International Journal of Biomedical Imaging
JF - International Journal of Biomedical Imaging
M1 - 137604
ER -