Evolving object-based coding standards, such as MPEG-4, introduce radically
different functionalities at the expense of increased computational complexity,
and there exists a demand for greater compression to permit communication over
widely available low bandwidth channels, for example in mobile multimedia
communications. Current developments which permit arbitrary-shaped objects
to be encoded and decoded as separate video object planes (VOPs) have
largely adopted existing coding techniques and then modified and optimised
the methods for an object-based framework. Fixed size block matching (FSBM)
has remained the preferred approach to motion estimation due to its backward
compatibility with previous standards. The main aim of this project was to
develop efficient motion estimation and shape coding techniques specifically
designed for object-based video coding.
Investigations showed that a variable size block matching (VSBM) motion
compensation approach can offer significant coding efficiency improvements over
FSBM, while maintaining the simplicity of implementation and computational
complexity of FSBM. The technique was extended to produce a modified VSBM
(MVSBM) motion compensation strategy that exploits irregularly shaped
areas of uniform motion within small objects. Both VSBM and MVSBM utilise
a quad-tree for the representation of the irregular motion segmentation
structure, which is predictively coded and transmitted with the motion
compensation information. Since the arbitrary areas are a composition of
4 x 4 blocks, the whole structure can be encoded using a quad-tree, where
multiple blocks undergoing the same motion form a single area. While a
motion vector per block representation would normally be very expensive to
transmit, a motion vector redundancy coding (MVRC) scheme has proved ideally
suited to producing compact descriptions of MV structures exhibiting high
spatial redundancy. Additionally, a temporal quad-tree coding (TQC) scheme
was developed that exploits any temporal redundancies between successive
quad-tree structures, using a differential coding mechanism. In comparison
with conventional motion information coding schemes, the combined algorithms
provide bit coding reductions of up to 21% for the same PSNR.
A shape coding strategy which adapts the MPEG-4 arbitrary shape coding
techniques to a variable block size framework has been developed. It
successfully integrates the shape and quad-tree coding requirements of VSBM and
MVSBM in a unified structure while minimising temporal redundancies. The coding
efficiency of the combined MVSBM motion compensation and shape coding cost
is compared with the MPEG-4 motion vector and shape coding requirements. For
the same quality prediction, the new scheme shows a bit coding reduction
of up to 15%. The shape coding strategy was further improved, making it
appropriate for small video objects undergoing fast shape changes due to
either rapid object movement or camera focal length changes. Compared with
conventional motion and shape coding, the new technique provides notable
bit coding improvements for the same PSNR.
The combined motion compensation and shape coding algorithms have been
implemented in a hybrid video object codec which employs shape adaptive DCT
texture coding. Evaluations on a wide selection of test sequences confirm the
quoted coding efficiency gains for the same PSNR and perceptual quality of the
reconstructed images. Additionally, the techniques developed are shown to be
appropriate for the coding of multiple video objects, and are readily scalable.
More details are available through the
MCG website.