1. H.264
a. H.264 Basic process
H.264 divided into two basic process : encoding and decoding. On encoding
process carries out prediction, transforming, and encoding process to produce
a compressed H.264 bitstream. An H.264 video decoder carries out the
complementary processes of decoding, inverse transform and reconstruction
to produce a decoded video sequence.
Bitstream Encoding
the valu in the type field of the IP header is UDP. When the UDP receives
the datagram from IP, it examines the UDP checksum. All 16-bit fields in
the UDP header are added together, including the checksum. If this sum
equals 1111111111111111, the datagram has no errors. If one of the bits
in the computed sum is zero, it indicates that the datagram was
inadvertenly altered during transmission and thus has some errors. If the
checksum field in the UDP header is zero, it means that the sender did not
calculate checksum field in the UDP header is zero, it means that the
sender did not calculate checksum and the field can be ignored. If the
checksum is valid or nonzero, UDP at the destination computer examines
the destination port number and if an application is bound to that port, the
datagram is transferred to an application message queue to buffer the
incoming datagrams before transferring them to the application. If the
checksum is not valid, the destination computer discards the UDP
datagram. Example :
UDP header has three fields that contains the following 16-bits values:
0110011001100101,0101010101010110, and 0000111100001111, the
checksum can be calculated as follows :
First two 16-bit are added:
0110011001100101
0101010101010110
The sum of first and second 16-bit data is :
1011101110111011
Adding the third 16 bit data to the above sum gives :
1011101110111011
0000111100001111
The sum of these values is :
1100101011001010
The
1s
complement
of
the
sum
1100101011001010
is
0011010100110101. Now the checksum computed by the senders UDP is
0011010100110101. At the destination computer, the values of all the
four 16-bit fields, source & destination ports, length and checksum are
added. If no errors were introduced in the datagram, the sum at the
receiver will be 1111111111111111. If one of the bits is a zero, error
detected.
3. a. on JPEG/MPEG-1/2 use fixed size block matching (FSBM). On MPEG 4 using
various size block matching (VSBM).
b. VSBM is improved version of FSBM by varying the size of blocks to more
accurately match moving areas. This method was proposed by chan,yu,and
Constantinides. VSBM is a scheme that starts with relatively large blocks,
which are then repeadtedly divided, this is a so-called top down approach. If
the best matching error for a block is above some threshold, the block is
divided into four smaller blocks, until the maximum number of blocks or
locally minimum errors are obtained. The application of such top-down
methods may generate block structures for an image that match real moving
objects, but it seems that an approach which more directly seeks out areas of
univorm motion might be more effective. For the same number of blocks per
frame as FSBM, VSBM method results in a smaller mean sequare error (MSE),
or better prediction. More significantly, for a similar level of MSE as FSBM, the
VSBM technique can represent the inherent motion using fewer blocks, and
thus a reduced number of motion vectors.
4. a. chroma substantion benefits : chroma subsampling is a technique that are
used to reduce bandwidth in many video systems. Since the human visual
system is not very sensitive to color, color resolution can be reduced to lower
bandwidth. Video systems do this via chroma subsampling.
b. artefact that found in an MPEG-encoded video :
Aliasing : occurs when a signal being sampled contains frequencies
that are too high to be successfully digitized at a given sampling
frequency. When samped these high frequencies fold back on top of
the lower frequencies producing distortion. In most method of video
digitizing, this will produced pronounced vertical lines in the picture.
This problem can be reduced by applying a low pass filter to the
video signal before it is digitized to remove the unwanted high
frequency components.
Quantisation Noise : this form of distortion occurs because, when
digitized the continuously variable analogue waveform must be
quantized into a fixed finite number of levels. It is the coarseness of
these levels that causes quantisation noise. A 24-bit colour picture
suffers from virtually no quantisation noise, since the number of
available colours is so high 16.7 million. Reasonable results can be
obtained from an 8-bits per pixel picture, especially if the picture is
greyscale rather than colour.
Overload : like quantisation noise, overload is related to the finite
number of levels that the signal can take. If a signal is digitized that
is too high in amplitude, then the picture will appear bleached. For
example, if the signal level of a greyscale image is too high for the
conversion process to cope with, then all levels above the
maximum will be converted to white, causing the washed out
appearance.
Video signal degradation : video in digital form degrades far less
gracefully than its analogue counterpart. While digital information
may in theory be duplicated an infinite number of times without any
degradation, once that degradation does occur, it is very
noticeable. Due to the compression techinques used, a single bit
error in the data stream could for example cause a large block of
pixels to be displayed in a completely different colour to that
intended.
Gibbs effect : this is most noticeable around artificial objects such
as plain coloured, large text and geometric shape such as square. It
shows up as a blirring or haze around the object, where the sudden
transition is made from the artificial object to the background. It is
caused by the discrete cosine transform used to digitize