The multivariate watson distribution: Maximum-likelihood estimation and other aspects

Suvrit Sra, Dmitrii Karp

Research output: Contribution to journalArticlepeer-review

52 Scopus citations

Abstract

This paper studies fundamental aspects of modelling data using multivariate Watson distributions. Although these distributions are natural for modelling axially symmetric data (i.e., unit vectors where ±x are equivalent), for high-dimensions using them can be difficult-largely because for Watson distributions even basic tasks such as maximumlikelihood are numerically challenging. To tackle the numerical difficulties some approximations have been derived. But these are either grossly inaccurate in high-dimensions [K.V. Mardia, P. Jupp, Directional Statistics, second ed., John Wiley & Sons, 2000] or when reasonably accurate [A. Bijral, M. Breitenbach, G.Z. Grudic, Mixture of Watson distributions: a generative model for hyperspherical embeddings, in: Artificial Intelligence and Statistics, AISTATS 2007, 2007, pp. 35-42], they lack theoretical justification. We derive new approximations to the maximum-likelihood estimates; our approximations are theoretically welldefined, numerically accurate, and easy to compute. We build on our parameter estimation and discuss mixture-modelling with Watson distributions; here we uncover a hitherto unknown connection to the "diametrical clustering"algorithm of Dhillon et al. [I.S. Dhillon, E.M. Marcotte, U. Roshan, Diametrical clustering for identifying anticorrelated gene clusters, Bioinformatics 19 (13) (2003) 1612-1619].

Original languageEnglish
Pages (from-to)256-269
Number of pages14
JournalJournal of Multivariate Analysis
Volume114
Issue number1
DOIs
StatePublished - 2013
Externally publishedYes

Keywords

  • Confluent hypergeometric
  • Diametrical clustering
  • Directional statistics
  • Hypergeometric identities
  • Kummer function
  • Special function
  • Watson distribution

Fingerprint

Dive into the research topics of 'The multivariate watson distribution: Maximum-likelihood estimation and other aspects'. Together they form a unique fingerprint.

Cite this