Conventional forward error correction (FEC) code rate decision schemes using analytical source coding distortion model and channel-induced distortion model are usually complex, and require the typical process of model parameter training which involves potentially high computational complexity and implementation cost. To avoid the complex modeling procedure, we propose a simple but accurate joint source-channel distortion model to estimate channel loss threshold set for optimal FEC code rate decision.
Ⅰ. Introduction
When transporting video over the wireline/wireless channel, each video packet can be received correctly, received with errors, or lost by the network. It is necessary for the video sender to provide adequate error resilience features. It is efficient to adaptively change the FEC code rate in response to time-varying packet loss dynamics in order to ensure consistent optimal video quality
[1]
. The optimal FEC code rate decision is a crucial procedure to determine the optimal source and channel coding rates to minimize the overall picture distortion when transporting video packets over packet loss channels. Several previous studies have focused on the optimal code rate decision for joint source channel coding (JSCC). In
[2]
, a source coding distortion model and a channel-induced distortion model were proposed as a means of determining a combination of the optimal intra-refresh rate and code rate. Similarly, Frossard et al.
[3]
proposed an end-to-end distortion model, in which channel-induced distortion was assumed to be proportional to the number of lost pixels. Their modeling equations are very complex and include numerous modeling parameters. Kwon et al.
[4]
proposed a practical method based on an observation that the residual packet loss probability in the optimal code rate stays near to a constant value. However, actually this residual packet loss probability is not constant any more at the high channel packet loss probabilities.
In this paper, we propose an accurate estimation of channel loss threshold set for optimal code rate decision.
Ⅱ. Video Transmission System and Channel Model
The fundamental structure of the video transmissionsystem that supports both source and channel codings is presented in
Fig. 1
. The system consists of a video encoder/decoder, a channel encoder/decoder, and rate allocation block. The rate allocation block optimizes the video quality for a given channel condition. The sender periodically receives information about the instantaneous channel status from the receiver through a RTCP feedback report. From this information, the block for the channel characteristic estimation calculates the current status of the channel packet loss rate,
PL
, and the available total transmission rate,
RTC
. The channel encoder in
Fig. 1
generates
n
FEC packets for every
k
video packets by the RS code, denoted as RS(
n
,
k
). The code rate of RS code is defined as
비디오 전송 시스템의 기본 구조 Fig. 1. Fundamental structure of the video transmission system
Let
RS
and
RC
be the source and channel coding bit rates. When we define the total transmission rate by
RTC
, then
RS
and
RC
can be expressed as
For an
RS
(
n
,
k
) code, lost packets up to n-k packets in a transmission group can be completely recovered
[3]
. However, if more than
n-k
packets are lost, this transmission group cannot be recovered in its entirety. The residual packet loss probability is defined as the probability that a video packet is lost but cannot be correctly recovered even by the channel decoding. The residual packet loss probability denoted as
PR
depends on
r
and
PL
, and can be represented as
[2]
.
The overall distortion (
DOV
) of the reconstructed video depends upon both the source coding distortion (
DS
) and channel induced distortion (
DC
). We can also express the optimal code rate decision with respect to minimizing the overall distortion for the given
n
as follows:
Ⅲ. Channel Loss Threshold Set
In this section, we introduce two types of loss threshold sets, the channel loss threshold set and the residual loss threshold set. We employed an H.264/AVC source coder and combined it with
RS
(5,
k
) (
k
=1,2,…,5) channel coder using R
TC
=100 Kbps. Since we can adjust
r
by changing
k
and fixing
n
as
n
0
,
r
can be redefined as follows:
In
Fig. 2
, we can see that though
PL
increases,
PSNR
(
rk
,
PL
) barely decreases in the ranges of low
PL
where almost all lost packets are recovered by the channel decoder. The optimal code rate can be classified as follows:
다양한 패킷손실 조건에서 Akiyo 시퀀스에 대한 최적의 부호율 결정을 위한 PSNR 곡선 Fig. 2. PSNR curves for determining optimum code rates in various packet loss channel condition for Akiyo sequence
We call the set of
PL
values which correspond to the threshold boundary to change the code rate
a channel loss threshold set
(
CLTS
). The CLTS is represented as
To get the CLTS quickly and accurately, we introduce another loss threshold set called residual loss threshold set(RLTS). We define the RLTS as
in which each element of RLTS,
PRK
, is the value shown in (3) for the given
rk
and
PLK
.
Ⅳ. Joint Source-channel Distortion Model
If we use MSE as a distortion measure, we could observe that the difference of source distortion between
rk
and
rk
-1
is likely equal to the channel distortion measured at
r
=
rk
and
PL
=
PLk
. We can formulate this observation as
where
DC
(
rk
,
PLk
) means the channel distortion.
The
D
(
rk
, 0) can be represented as
where (
θ
,
R
0
,
D
0
) are the parameters of the model which depend on the encoded video sequence
[2]
.
Fig. 3
shows that the distortion of the received video is linearly proportional to the residual packet loss probability. Therefore, by using (3), we can represent
DC
(
rk
,
PLk
) as
n0=20일 때 k 값에 대한 잔여패킷 손실확률에 대비한 비디오 왜곡 Fig. 3. Video distortion versus residual packet loss probability PRkfor various k where n0=20
where
b
is a proportional constant. If we substitute (5), (10), and (11) into (9), we can obtain
PRK
expressed as
where α and β are the model parameters. By using (12), we can get all elements of the RLTS defined in (8), and then the corresponding CLTS can be obtained from (3).
Ⅴ. Performance Evaluation
To evaluate the accuracy of the estimated CLTS using the joint source-channel distortion model, we performed simulations using H.264/AVC codec. The n
0
is fixed to be 20. We evaluate the PSNR performance of the code rates estimated by the proposed method, and compare it with the optimal code rate case when the packet loss ratio varies from 0.001 to 0.2.
Table 1
compares the average PSNR performance. It can be seen that the PSNR performance obtained by the proposed method is very close to the optimal PSNR performance. The PSNR gap between the two approaches is just within 0.5 dB.
Fig. 4
demonstrates the PSNR performance for various packet loss rates, in which the Soccer sequenceis used.
제안된 방법에 의해 추정된 부호율과 실제 최적의 부호율 간의 평균 PSNR 성능비교
Table 1. Comparison of the average PSNR between using real optimal code rates and using code rates estimated by the proposed method
Soccer 시퀀스에 대한 PSNR 성능 비교. Fig. 4. Comparison of PSNR performance for Soccer sequence
Ⅵ. Conclusions
In this paper, we proposed an accurate estimation of channel loss threshold set for optimal FEC code rate decision.
Azni M.
2009
“Joint source and channel coding for real time H.264 coded video transmission using mobile hand-held devices,”
IETE J. of Research
55
(4)
154 -
161
DOI : 10.4103/0377-2063.55984
Stuhlmuller K.
,
Farber N.
,
Girod B.
2000
“Analysis of video transmission over lossy channels,”
IEEE J. Sel. Areas in Commun.
18
(6)
1012 -
1032
DOI : 10.1109/49.848253
Frossard P.
,
Verscheure O.
2001
“Joint source/FEC rate selection for quality-optimal MPEG-2 video delivery,”
IEEE Trans. Image Process.
10
(12)
1815 -
1824
DOI : 10.1109/83.974566
Kwon J.
,
Kim J.
2002
“Adaptive code rate decision of joint source-channel coding for wireless video,”
Electron. Lett.
38
(25)
1752 -
1753
DOI : 10.1049/el:20021163