Multiple Description Coding Using Directional Discrete Cosine Transform
Multiple Description Coding Using Directional Discrete Cosine Transform
Journal of Information and Communication Convergence Engineering. 2013. Dec, 11(4): 293-297
Copyright ©2013, The Korean Institute of Information and Commucation Engineering
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • Received : June 25, 2013
  • Accepted : September 12, 2013
  • Published : December 31, 2013
Export by style
Cited by
About the Authors
Ramesh Kumar Lama
Goo-Rak Kwon

Delivery of high quality video over a wide area network with large number of users poses great challenges for the video communication system. To ensure video quality, multiple descriptions have recently attracted various attention as a way of encoding and visual information delivery over wireless network. We propose a new efficient multiple description coding (MDC) technique. Quincunx lattice sub-sampling is used for generating multiple descriptions of an image. In this paper, we propose the application of a directional discrete cosine transform (DCT) to a sub-sampled quincunx lattice to create an MDC representation. On the decoder side, the image is decoded from the received side information. If all the descriptions arrive successfully, the image is reconstructed by combining the descriptions. However, if only one side description is received, decoding is executed using an interpolation process. The experimental results show that such the directional DCT can achieve a better coding gain as well as energy packing efficiency than the conventional DCT with re-alignment.
Due to network congestion and delay sensibility, video transmission over a lossy network is always a great challenge. Multiple description coding (MDC) [1] is an attractive approach to solving this problem as shown in Fig. 1 . It can efficiently combat packet loss without any retrainsmission, thus satisfying the demand of real-time services and relieving network congestion.
MDC encodes the source message into several bit streams (descriptions) carrying different information, which can then be transmitted over multiple channels [2] . In MDC’s simplest form, two parallel channels are assumed to connect the source with the destination. If only one channel works, the descriptions can individually be decoded to sufficiently guarantee a minimum fidelity in the reconstruction at the receiver [3] . However, when both channels work, the descriptions from the channels can be combined to yield a relatively high fidelity reconstruction.
Numerous MDC techniques have been proposed in recent years, such as the multiple description scalar quantization (MDSQ) proposed in [2] . In MDSQ, two descriptions are created by two coarse quantizers, each ensuring an aceptable distortion when only one of them is received.
These two coarse quantizers can be combined to produce a finer quantizer if two descriptions are received. Further, various types of coding techniques such as subband coding and wavelet coding have also implemented MDC [4 - 7] .
In this paper, we re-visit the MDC scheme based on the pixel domain sub-sampling. In particular, we focus on the quincunx sub-sampling lattice. Instead of applying a horizontal or vertical realignment so as to form regular
Lager Image
Multiple description (MD) coding with two descriptions and threedecoders.
square blocks, we retain the quincunx lattice and apply the directional discrete cosine transform (DDCT). Both theoretical analysis and simulation test will be discussed to confirm that an improved coding efficiency can be achieved in our DDCT, as compared to the traditional DCT with horizontal or vertical re-alignment.
In Section II, we briefly introduce the traditional pixeldomain sub-lattice on MDC. The proposed directionally sampled discrete cosine transform (DS-DCT) for the quincunx sub sampling lattice is presented in Section III. Further, we explain how to handle some boundary blocks that remain after the DS-DCT. In Section IV, we describe the experimental setup and present some simulation results. Finally, some conclusions are presented in Section V.
In this section, we will discuss the sub-lattice technique used in the proposed method. Given the source image I, which is typically a subset of Z 2 , in the proposed method, signal samples are partitioned into two subsets as follows:
Lager Image
There are two different methods to partition the image into two parts. Fig. 2 (a) shows orthogonal sub-sampling, and Fig. 2 (b) illustrates quincunx sub-sampling. In the proposed method, descriptions generated by scheme two are used. One of the major advantages of this scheme is the increase in correlation between samples. Under this scheme, two descriptions are generated according to a chess-box pattern, and the Euclidian distance between two neighboring samples is constantly equal to √2. After the splitting process, each description is transformed to the transform domain.
Lager Image
Two different pixel-domain sub-sampling lattices for multipledescription coding: (a) orthogonal sub-sampling and (b) quincunx subsampling.
The DCT and the discrete wavelet transform used in image compression are implemented by separable onedimensional (1D) transforms in the rows and columns of images.
The conventional N × N 2D DCT is implemented separately by two N -point 1D transforms. Let B(i, j)N × N and CN × N be the image block and the transform matrix. Then, the corresponding block of transformed coefficients B(u,v) can be expressed as follows:
Lager Image
Lager Image
Naturally, the conventional 2D DCT seems to be the best choice for image blocks in which vertical and/or horizontal edges dominate. However, it may cause some defects when it is applied to an image block in which other directional edges dominate. The major shortcoming of the separable transform is that it cannot represent the anisotropic edges in the image sparsely. In order to obtain the better representation of edges in all directions, the given image block is transformed on the basis of the directional DCT in Fig. 3 .
Lager Image
Exemplified elementary matrix operation: (a) no directional and(b) directional. The circles denote pixels, and the squares represent halfpixels.
Lager Image
Five direction modes: (a) vertical prediction, (b) diagonal down-right,(c) diagonal down-left, (d) vertical-right, and (e) horizontal-down. Thecircles denote pixels, and the dashed lines represent direction lines.
In the proposed method, there are in total five directional modes. Among these modes, one is the vertical prediction (mode 0), and the remaining are labeled diagonal down-right (mode 1), diagonal down-left (mode 2), vertical-right (mode 3), and horizontal-down (mode 4), as shown in Fig. 4 (a), (b), (c), (d), and (e), respectively.
On the encoder side, the input image is first analyzed block-by-block to decide the transform directions. The 1D DCT transform is performed in each block of the selected direction along the vertical direction. Next, the horizontal DCT is applied in the second step.
On the decoder side, when only one side description is received, the main task of the decoder is to interpolate the missing sub-image from the received sub-image. Although the proposed method involves the use of the directional data, all pixels in the partitioned blocks share a common direction
Lager Image
Experimental results. The horizontal axis is bit per pixel and thevertical axis is peak signal-to-noise ratio (PSNR). (a) PSNR of thereceived interpolated image with side decoder1 of Lena image. (b) PSNRof the interpolated received image with side decoder1 with Barbara image.(c) PSNR of the received interpolated image with side decoder1 with Boatimage. MDC: multiple description coding.
and the lost description is estimated from the four connected neighbors by using the conventional bilinear interpolation method. Since there are numerous interpolation algorithms for preserving the original textue contents of an image, we can enhance the quality of the reconstructed image by selecting any of the appropriate interpolation schemes.
Similarly, when two descriptions are simultaneously available at the decoder, a straightforward method is to decode the two descriptions simultaneously and then merge the two sub-images. Since each side description is compressed using quantization, any decoded pixel value from one description is only an approximation of the original.
Several experiments were conducted, and their results are presented in this section in order to evaluate the performance of the proposed image MDC scheme. The implementation of the new MDC scheme is integrated into JPEG coding, with the directional transform replacing the original 2D separable rectilinear discrete cosine transform.
Naturally, the JPEG MDC scheme is used as a benchmark for performance comparisons in terms of the peak signal-to-noise ratio (PSNR), where the input image is first split into two descriptions by quincunx lattice sub-sampling and then coded individually by JPEG coding.
To make a fair comparison, we also adopt the proposed texture-oriented interpolation and data fusion algorithms for the central decoding of JPEG MDC.
Two JPEG test images (Lena and Barbara) having a resolution of 512 × 512 are used in our experiments. They are split into two descriptions, each of which is a quincunx lattice. Each description is compressed using JPEG coding and the proposed scheme.
We evaluate the PSNR performance of a side decoder when only one description is received. As proposed, the full-resolution images are reconstructed from the received side description by the texture orientation interpolation method. For the sake of comparison, we also compute the PSNR results of the widespread linear interpolation method when applied to the received quincunx image, as shown in Fig. 5 . Since the two descripttions are balanced in our experiments, it suffices to list the PSNR values for description 1 of the proposed MDC scheme. The PSNR values shown in this figure are calculated over all samples including both the decoded and the interpolated images. The rates are also calculated over all samples in terms of a fullresolution image. One can observe that at low rates (e.g., 0.125 bpp), the two interpolation methods perform roughly the same, with only a small advantage to the texture orientation method. This is due to the lack of highfrequency components in the received side description at a low rate.
In this paper, we proposed a new MDC scheme using the directional DCT transform. The input image was directly split into two descriptions in the pixel domain using quincunx lattice sub-sampling. Using DDCT, we represented the image pixels oriented in different directions perfectly. The experimental results confirmed that the proposed directional MDC scheme could outperform the JPEG MDC scheme by up to 0.9 in the cases of both side decoding and central decoding.
This research was supported by the Ministry of Science,ICT and Future Planning (MSIP), Korea, under the IT/SWCreative Research Program supervised by the National ITIndustry Promotion Agency (NIPA-2013-H0502-13-1094).Further, this research was supported by the Basic ScienceResearch Program through the National Research Foundationof Korea funded by the Ministry of Education, Scienceand Technology (No. 2010-0008974).
Vaishampayan V. A. 1993 “Design of multiple description scalarquantizers” IEEE Transactions on Information Theory 39 (3) 821 - 834    DOI : 10.1109/18.256491
Goyal V. K. 2001 “Multiple description coding: compression meets thenetwork” IEEE Signal Processing Magazine 18 (5) 74 - 93    DOI : 10.1109/79.952806
Vaishampayan V. A. , Sloane N. J. A. , Servetto S. D. 2001 “Multipledescription vector quantization with lattice codebook: design andanalysis” IEEE Transactions on Information Theory 47 (5) 1718 - 1734    DOI : 10.1109/18.930913
Franchi N. , Fumagalli M. , Lancini R. , Tubaro S. 2005 “Multipledescription video coding for scalable and robust transmission overIP” IEEE Transactions on Circuits Systems for Video Technology 15 (3) 321 - 334    DOI : 10.1109/TCSVT.2004.842606
Servetto S. D. , Ramchandran K. , Vaishampayan V. A. , Nahrstedt K. 2000 “Multiple description wavelet based image coding” IEEE Transactions on Image Processing 9 (5) 813 - 826    DOI : 10.1109/83.841528
Channappayya S. S. , Lee J. , Heath R. W. , Bovik A. C. 2005 “Framebased multiple description image coding in the wavelet domain” in Proceedings of the IEEE International Conference on Image Processing Genoa, Italy 920 - 923
Kauff P. , Schuur K. 1998 “Shape-adaptive DCT with blockbasedDC separation and ΔDC correction” IEEE Transactions on Circuits and Systems for Video Technology 8 (3) 237 - 242    DOI : 10.1109/76.678616