TY - GEN
T1 - Statistical shape modelling
T2 - 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops
AU - Mei, Lin
AU - Figl, Michael
AU - Rueckert, Daniel
AU - Darzi, Ara
AU - Edwards, Philip
PY - 2008
Y1 - 2008
N2 - Statistical shape modelling is a technique whereby the variation of shape across the population is modelled by principal component analysis (PCA) on a set of sample shape vectors. The number of principal modes retained in the model (PCA dimension) is often determined by simple rules, for example choosing those cover a percentage of total variance. We show that this rule is highly dependent on sample size. The principal modes retained should ideally correspond to genuine anatomical variation. In this paper, we propose a mathematical framework for analysing the source of PCA model error. The optimum PCA dimension is a pay-off between discarding structural variation (under-modelling) and including noise (over-modelling). We then propose a stopping rule that identifies the noise dominated modes using a t-test of the bootstrap stability between the real data and artificial Gaussian noise. We retain those modes that are not dominated by noise. We show that our method determines the correct PCA dimension for synthetic data, where conventional rules fail. Comparison between our rule and conventional rules are also performed on a series of real datasets. We provide a foundation for analysing rules that are used to determine the number of modes to retain and also allows the study of PCA sample sufficiency.
AB - Statistical shape modelling is a technique whereby the variation of shape across the population is modelled by principal component analysis (PCA) on a set of sample shape vectors. The number of principal modes retained in the model (PCA dimension) is often determined by simple rules, for example choosing those cover a percentage of total variance. We show that this rule is highly dependent on sample size. The principal modes retained should ideally correspond to genuine anatomical variation. In this paper, we propose a mathematical framework for analysing the source of PCA model error. The optimum PCA dimension is a pay-off between discarding structural variation (under-modelling) and including noise (over-modelling). We then propose a stopping rule that identifies the noise dominated modes using a t-test of the bootstrap stability between the real data and artificial Gaussian noise. We retain those modes that are not dominated by noise. We show that our method determines the correct PCA dimension for synthetic data, where conventional rules fail. Comparison between our rule and conventional rules are also performed on a series of real datasets. We provide a foundation for analysing rules that are used to determine the number of modes to retain and also allows the study of PCA sample sufficiency.
UR - http://www.scopus.com/inward/record.url?scp=51849155903&partnerID=8YFLogxK
U2 - 10.1109/CVPRW.2008.4562996
DO - 10.1109/CVPRW.2008.4562996
M3 - Conference contribution
AN - SCOPUS:51849155903
SN - 9781424423408
T3 - 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops
BT - 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops
Y2 - 23 June 2008 through 28 June 2008
ER -