A H.264 Based Joint Source Channel Coding Scheme over Wireless Channels
Xuejuan Gao, Li Zhuo, Suyu Wang, Lansun Shen Signal & Information Processing Lab, Beijing University of Technology, Beijing, 100022, China snow03129@emails.bjut.edu.cn Abstract
Joint source channel coding (JSCC) is an effective method which trade off the efficiency and the quality of the video transmission. In this paper, a new ratequality (R-Q) model is first proposed to represent the coding characteristics of H.264 instead of ratedistortion (R-D) model. Then, the end-to-end video transmission distortion is analyzed and an adaptive JSCC over wireless channel based on the R-Q model of the H.264 and the error protection characteristics of the Turbo code is proposed, which can optimize the rate allocation of the available network bandwidth between source coding and channel coding according to the current status of the wireless channels, so as to improve the robustness. The experiment results indicate that, compared with the scheme using the fixed channel coding rate, under the same channel conditions, our proposed JSCC scheme can greatly improve the transmission robustness and achieve better reconstructed video quality at the receiver. channel coding method, they all optimize the source and channel rate allocation based on the source R-D model and channel transmission distortion model in essence. In this paper, instead of using R-D model to perform rate control as many existing methods do, we propose a new rate-quality (R-Q) model. If use PSNR (Peak Signal to Noise Ratio) as the measurement metric, we find that there is a conic relationship between video quality Q and log(R), QP and log(R), no matter it is an I frame or a P frame. So we first set up a novel R-Q model of H.264. Then the end-to-end video transmission distortion is analyzed and an adaptive JSCC over wireless channel based on the R-Q model and the error protection characteristics of the Turbo code is proposed, which can optimize the rate allocation of the available network bandwidth between source coding and channel coding adaptively according to the actual condition of wireless channel, so as to improve the robustness of the video transmission. The experiment results indicate that our proposed JSCC scheme can greatly improve the transmission robustness and achieve better reconstructed video quality compared with the JSCC scheme using the fixed channel coding rate.
1. Introduction
Joint source channel coding (JSCC), an effective coding method, can minimize the end-to-end transmission distortion by optimizing the bit rate allocation between source encoder and the channel encoder or between different parts of source encoder. JSCC has been studied for many years. Therefore, there are a lot of literatures [1-3]: ZhiHai He et al proposed a joint source channel rate-distortion (R-D) analysis method [4], where an analytic solution for adaptive intra mode selection and joint source-channel rate control based on the R-D model has been developed. Zhang et al developed a MPEG-4 PFGS coding method based power-optimized joint sourcechannel coding method over wireless channel [5], which reduces total power consumption based on the proposed end-to-end power-optimized architecture. Although those schemes adopted different source or
683
each GOP is I frame, and the others are all P frames. If we use PSNR as the reconstructed video quality measurement metric, and calculate the logarithm value of the rate R, we can find that, there is a linear relationship between QP and Q while log(R)-Q and log(R)-QP curves approximately render a conic relationship. The relationship between QP and Q, log(R) and Q, as well as log(R) and Q are shown in Figure 1. From Figure 1, we can see that we can use one simple conic model to describe the coding characteristics of H.264. Therefore, three mathematical models are proposed as follows to describe the coding performance of H.264: QP = p1 [log( R )] 2 + p 2 log( R ) + p3 (1) Q = p 41 [log( R )] 2 + p5 log( R ) + p 6
Q = p7 QP + p8 QP + p9
2
Where QP represents quantization step, R coding rate and Q the reconstructed video quality (i.e.PSNR value) and pi, i=1,2,,9 are model parameters. The three models are named together as R-Q model in this paper. To prove the accuracy of our proposed R-Q model, we performed experiments with different video sequences. Figure 2 shows the comparison of model data and actual tested data of Akiyo sequence (CIF format). It can be seen clearly from Figure 2 that the proposed R-Q model can exactly describe the relationship among Q, R and QP. We have performed the test over a wide range of video sequences also, and the proposed R-Q model has been found to hold.
Rate allocation module Channel condition Reconstruction picture H.264 decoder Turbo decoder
wireless channel
Figure 3. The architecture of our JSCC stream transmitted over wireless channel is decoded to achieve the reconstructed video.
684
prediction coding mode, channel bit error will cause error propagation. Therefore, error concealment methods are usually exploited to restrict error propagation. Let Pe be the channel bit error rate (BER), the expected average of DS+C(n) is defined as: (3) DS +C ( n ) = ( 1 Pe )DS ( n ) + Pe Der ( n ) where DS(n) represents the distortion between the reconstructed picture and the original picture without bit error, Der(n) represents the distortion between the reconstructed picture after error concealment and the original picture if the channel errors occur. In terms of the characteristic of Turbo code, Pe can be formulated as follows [7]:
Pe 1 = 2 r Eb e rN 0
rE b N0
~ ~ as F ( n ) , where j is the position n ,iF ( n 1, j )e of pixel i in the n-1 th frame according to the motion ( n ) is the quantization value of vector (MV) and e prediction error. Otherwise, if there are bit errors, we use the same error concealment method as I frame. Then the expected average of DS+C(n) of P frame is given by:
2 ~ ( n 1, j ) F D S + C ( n ) = D S ( n ) + ( 1 Pe ) E F ( n 1, j )
( n ,i ) F ( n 1,i ) + Pe E F
}+ P E {F ( n ,i ) F ( n 1,i ) }
2 e
(9)
(4)
Note that Eb is the power of every signal, N0 is yawp power of channel and r is bit rate of channel coding. If we assume F(n,i) to be the original value of pixel n, ithe construction value of i in the nth frame, F ~ pixel i at the encoder side and Fn,i the reconstructed pixel i at the decoder side, then DS(n) and Der(n) can be given respectively:
( n,i ) DS ( n ) = E F ( n ,i ) F
the mean square error of reference frames of motion compensation between the encoder end and the decoder end, which can be obtained in reference [4].
2 ~ Der ( n ) = E F ( n ,i ) F ( n ,i )
D
n =1
S +C
(n)
(10)
(5)
Due to the different coding mode of I frame and P frame, we need to analyze them separately. For I frame, if there are bit errors occurred, we use the MB at the same position in the n-1 th frame to replace the corrupted one. Then we will have:
2 ~ Der ( n ) = F ( n ,i ) F ( n 1,i )
2 ~ ( n ,i ) + F ( n ,i ) F = E F ( n ,i ) F ( n 1,i )
(6)
Because of the uncorrelated characteristics of source distortion and channel distortion, (6) can be changed into:
~ ( n ,i ) |2 } + E{| F ( n ,i ) F Der ( n ) = E{| F ( n ,i ) F ( n 1,i ) |2 } ~ ( n ,i ) F (7) = DS ( n ) + E{| F ( n 1,i ) |2 }
We can obtain DS+C(n) from (8) or (9) according to different coding mode. Apparently, (2) is an optimization problem with the restriction conditions, whose optimal answers are RS*and RC*. Let R be the total bit rate of channel bandwidth. If consider the relationship among the coding rate r of Turbo code, RS and R : r=RS/R , then RS = r R RC = 1 r R . Therefore, (2) is transformed to find only one optimal r* to minimize the DS+C, which can use Lagrange optimization algorithm to search.
( n ,i ) F ( n 1, i ) | 2 } D S + C ( n ) = D S ( n ) + Pe E {| F ~ ( n 1, i ) F + P E {| F ( n 1, i ) | 2 }
e
(8)
Note that DS(n) can be obtained from the R-Q model of (1), E F ( n ,i ) F ( n1, i ) 2 represents the mean
square error of the construction picture between the nth frame and the n-1 th frame at the encoder side and 2 ~ which describes the source ( n 1,i ) F E F ( n1,i )
distortion of the n-1th frame, can be obtained in reference [4]. For P frame, if there are no bit errors, the construction value at the receiver can be described
685
that compared with the fixed channel coding rate scheme, our proposed one can achieve a higher reconstructed quality of 1~2dB at the receiver side. Figure 6 shows the reconstructed pictures of different sequences (CIF format), where BER is 10-3 and available channel bandwidth is 150kbps. Figure 6 illustrates that our proposed method can also achieve a better reconstructed video quality. Tests over other video sequences yielded similar results.
5. Conclusion
In this paper, we first propose a novel R-Q model of H.264. Then, the end-to-end video transmission distortion is analyzed and an adaptive JSCC over wireless channel based on the R-Q model and the error protection characteristics Turbo code is proposed, which can optimize the rate allocation of the available bandwidth between source coding and channel coding adaptively according to the actual condition of current wireless channel. The experiment results indicate that the proposed JSCC scheme can greatly improve the transmission robustness and achieve better reconstructed video quality.
6. Acknowledgments
This work was supported by NSFC (60772069), the Beijing Novel Program (2005B08).
7. Reference
[1] Robert E. Van Dyck, David J, Transport of Wireless Video Using Separate, Concatenated and Joint SourceChannel Coding, Proceedings of the IEEE, VOL. 87, NO.10, Oct. 1999, pp.1734-1750. [2] Fang Zhijun, Xu Shenghua, et al, Joint Source-Channel Coding for MPEG-4 Streams Transmission Over 3G Networks, Wireless Communications, Networking and Mobile Computing, 2005. Proceedings, Volume 2, 23-26 Sept. 2005 Volume: 2, pp.1261-1264. [3] Feng Wang, Guangxi Zhu, Zhenming Zhang, et al, Joint Source-Channel Rate Allocation and Unequal Error Protection for Dependent Video Transmission, Wireless Communications, Networking and Mobile Computing, 2005. Proceedings, Volume 2, 23-26 Sept. 2005, pp.1275 -1280. [4] Zhihai He, Jianfei Cai , Chang Wen Chen, Joint Source Channel Rate-Distortion Analysis for Adaptive Mode Selection and Rate Control in Wireless Video Coding, IEEE Transactions on circuits and systems for video technology,VOL.12,NO.6,June. 2002, pp. 511-523. [5] Qian Zhang, Wen Wu Zhu et al, A power-optimized joint source-channel coding for scalable video streaming over wireless channel, IEEE International Symposium on Circuits & Systems, Sydney, Australia, 2001, pp. 510-513. (b) BER 10-4, total bit rate 450kbps, mobile sequence
686