A feature-based approach to big data analysis of medical images

Matthew Toews, Christian Wachinger, Raul San Jose Estepar, William M. Wells

Research output: Contribution to journalConference articlepeer-review

22 Scopus citations

Abstract

This paper proposes an inference method well-suited to large sets of medical images. The method is based upon a framework where distinctive 3D scale-invariant features are indexed efficiently to identify approximate nearest-neighbor (NN) feature matches in O(log N) computational complexity in the number of images N. It thus scales well to large data sets, in contrast to methods based on pair-wise image registration or feature matching requiring O(N) complexity. Our theoretical contribution is a density estimator based on a generative model that generalizes kernel density estimation and K-nearest neighbor (KNN) methods. The estimator can be used for on-the-fly queries, without requiring explicit parametric models or an off-line training phase. The method is validated on a large multi-site data set of 95, 000, 000 features extracted from 19, 000 lung CT scans. Subject-level classification identifies all images of the same subjects across the entire data set despite deformation due to breathing state, including unintentional duplicate scans. State-of-the-art performance is achieved in predicting chronic pulmonary obstructive disorder (COPD) severity across the 5-category GOLD clinical rating, with an accuracy of 89% if both exact and one-off predictions are considered correct.

Original languageEnglish
Article numberA26
Pages (from-to)339-350
Number of pages12
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9123
DOIs
StatePublished - 2015
Externally publishedYes
Event24th International Conference on Information Processing in Medical Imaging, IPMI 2015 - Isle of Skye, United Kingdom
Duration: 28 Jun 20153 Jul 2015

Fingerprint

Dive into the research topics of 'A feature-based approach to big data analysis of medical images'. Together they form a unique fingerprint.

Cite this