Multiple reference picture video coding using polynomial motion models

Thomas Wiegand, Eckehard Steinbach, Axel Stensrud, Bernd Girod

Research output: Contribution to journalConference articlepeer-review

10 Scopus citations

Abstract

We present a new video coding scheme that uses several reference frames for improved motion-compensated prediction. The reference pictures are warped versions of the previously decoded frame applying polynomial motion compensation. In contrast to global motion compensation, where typically one motion model is transmitted, we show that in the general case more than one motion model is of benefit in terms of coding efficiency. In order to determine the multiple motion models we employ a robust clustering method based on the iterative application of the Least Median of Squares estimator. The approach is incorporated into an H.263-based video codec and embedded into a rate-constrained motion estimation and macroblock mode decision frame work. It is demonstrated that adaptive multiple reference picture coding in general improves rate-distortion performance. PSNR gains of 1.2 dB in comparison to the H.263 codec for the high global and local motion sequence Stefan and 1 dB for the sequence Mobile & Calendar, which contains no global motion, are reported. These PSNR gains correspond to bit-rate savings of 21% and 30% comparing to the H.263 codec, respectively. The average number of motion models selected by the encoder for our test sequences is between 1 and 7 depending on the actual bit-rate.

Original languageEnglish
Pages (from-to)134-145
Number of pages12
JournalProceedings of SPIE - The International Society for Optical Engineering
Volume3309
Issue number1
DOIs
StatePublished - 1998
Externally publishedYes
EventVisual Communications and Image Processing '98 - San Jose, CA, United States
Duration: 28 Jan 199830 Jan 1998

Keywords

  • Motion Estimation
  • Multiple Pictures
  • Rate-Constrained Video Coding
  • Robust Motion Clustering

Fingerprint

Dive into the research topics of 'Multiple reference picture video coding using polynomial motion models'. Together they form a unique fingerprint.

Cite this