TY - JOUR
T1 - Modeling longitudinal data using a pair-copula decomposition of serial dependence
AU - Smith, Michael
AU - Min, Aleksey
AU - Almeida, Carlos
AU - Czado, Claudia
N1 - Funding Information:
Michael Smith is Professor of Management, Melbourne Business School, University of Melbourne, 200 Leicester Street, Carlton, VIC, 3053, Australia (E-mail: [email protected]). Aleksey Min is Postdoctoral Fellow, Carlos Almeida is Postdoctoral Fellow, and Claudia Czado is Chair of Mathematical Statistics, Zentrum Mathematik, Technische Universität München, 85748 Garching, Germany. The work of Michael Smith was partially supported by Australian Research Council grant DP0985505. Claudia Czado and Carlos Almeida gratefully acknowledge the financial support from the Deutsche Forschungsgemeinschaft (Cz 86/1-3: Statistical inference for high dimensional dependence models using pair-copulas). The authors thank three referees and associate editor, all of whom made comments that improved the paper.
PY - 2010/12
Y1 - 2010/12
N2 - Copulas have proven to be very successful tools for the flexible modeling of cross-sectional dependence. In this paper we express the dependence structure of continuous-valued time series data using a sequence of bivariate copulas. This corresponds to a type of decomposition recently called a "vine" in the graphical models literature, where each copula is entitled a "pair-copula." We propose a Bayesian approach for the estimation of this dependence structure for longitudinal data. Bayesian selection ideas are used to identify any independence paircopulas, with the end result being a parsimonious representation of a time-inhomogeneous Markov process of varying order. Estimates are Bayesian model averages over the distribution of the lag structure of the Markov process. Using a simulation study we show that the selection approach is reliable and can improve the estimates of both conditional and unconditional pairwise dependencies substantially. We also show that a vine with selection outperforms a Gaussian copula with a flexible correlation matrix. The advantage of the pair-copula formulation is further demonstrated using a longitudinal model of intraday electricity load. Using Gaussian, Gumbel, and Clayton pair-copulas we identify parsimonious decompositions of intraday serial dependence, which improve the accuracy of intraday load forecasts. We also propose a new diagnostic for measuring the goodness of fit of high-dimensional multivariate copulas. Overall, the pair-copula model is very general and the Bayesian method generalizes many previous approaches for the analysis of longitudinal data. Supplemental materials for the article are also available online.
AB - Copulas have proven to be very successful tools for the flexible modeling of cross-sectional dependence. In this paper we express the dependence structure of continuous-valued time series data using a sequence of bivariate copulas. This corresponds to a type of decomposition recently called a "vine" in the graphical models literature, where each copula is entitled a "pair-copula." We propose a Bayesian approach for the estimation of this dependence structure for longitudinal data. Bayesian selection ideas are used to identify any independence paircopulas, with the end result being a parsimonious representation of a time-inhomogeneous Markov process of varying order. Estimates are Bayesian model averages over the distribution of the lag structure of the Markov process. Using a simulation study we show that the selection approach is reliable and can improve the estimates of both conditional and unconditional pairwise dependencies substantially. We also show that a vine with selection outperforms a Gaussian copula with a flexible correlation matrix. The advantage of the pair-copula formulation is further demonstrated using a longitudinal model of intraday electricity load. Using Gaussian, Gumbel, and Clayton pair-copulas we identify parsimonious decompositions of intraday serial dependence, which improve the accuracy of intraday load forecasts. We also propose a new diagnostic for measuring the goodness of fit of high-dimensional multivariate copulas. Overall, the pair-copula model is very general and the Bayesian method generalizes many previous approaches for the analysis of longitudinal data. Supplemental materials for the article are also available online.
KW - Bayesian model selection
KW - Copula diagnostic
KW - Covariance selection
KW - D-vine
KW - Goodness of fit
KW - Inhomogeneous Markov process
KW - Intraday electricity load
KW - Longitudinal copulas
UR - http://www.scopus.com/inward/record.url?scp=78651271358&partnerID=8YFLogxK
U2 - 10.1198/jasa.2010.tm09572
DO - 10.1198/jasa.2010.tm09572
M3 - Article
AN - SCOPUS:78651271358
SN - 0162-1459
VL - 105
SP - 1467
EP - 1479
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 492
ER -