Demand for access via Internet, transfer across telecommunication network Examples:
1 hour of TV-quality movie requires (640 x 480 x 30 fps) x 24 b/pixel= .. Mbps bandwidth for xmission and . Gb for storage If it is HDTV (1280 x 720 x 60 fps) x 24 = ..Mbps and .. Gb for storage A 14 x 17 in2 X- ray scanned @ 70 m requires .. storage
Desirable features
minimal loss of information and perceptual quality; highest reduction in amount of data
Types of compression
Lossy techniques
decompressed image is not identical to original it may appear visually identical
Perceptual redundancy
Refers to limitations in the human visual system Eye has greater sensitivity to distortion in
smoother regions dark regions luminance (I) component
Regions other than these can therefore be represented with less data.
Spatial redundancy
Refers to correlation among neighbouring pixels in an image
Ex. Neighbouring pixels in an object have (by and large) similar brightness and motion values
Statistical redundancy
All brightness values dont occur in an image with equal frequency
i.e., the image histogram is not always uniform
Structural redundancy
Images are 2-D projections of 3-D objects An image can be represented in terms of structural image models
a texture image coded as {Ci, Ti}
- Ci contours of regions - Ti texture in each region
Definitions
Entropy measure of the disorder in the image and
hence of information content
high degree of order
- Ex. line image
H = pi log 2 pi
i =0
Definitions contd.
Compression ratio bo and bc are bit rates for original and compressed
images Cmax = m/H
bo C= bc
1 MSE = MN
( xd [ m, n] x[m, n]) 2
Definitionscontd.
PSNR Peak signal to noise ratio in dB
255 PSNR = 20 log10 ( ) MSE
Run-length coding
A lossless method (C ~ 2 or 3) A Run is a sequence of pixels with identical values (0 or 1) in a scan direction 1-D RLC Each scan line = {gi li}; terminate with End Of Line symbol.
Ex: I = [100 100 100 32 32 200 200 50]
RLC ..contd.
2-D RLC uses connected areas of pixels with identical values Applications Fax transmission standard (due to max efficiency for
binary images)
2. Find and combine 2 nodes with lowest p(gi) to produce a parent node. Label this with the sum of p(gi) of the 2 child nodes 3. Continue until you get only 1 root node with label = probability of 1
Example of HC
3-bit image with pixel values gi [0,1,2.7] Probabilities:
p(0) p(1) p(2) p(3) p(4) p(5) p(6) p(7) 0.4 0.08 0.08 0.2 0.12 0.08 0.04 0
si s0 s1 s2 s3 s4 s5 s6 s7
s l = 2.52
i i
s is the symbol and l is the length of the Huffman codeword for s; m is the no of bits used for the original image
Compression ratio C = m/n = 3/2.52 =1.24 Upper bound for C = source entropy
H = 2.42 bits/greylevel
H = pi log 2 pi
i =0 2 m 1
AC, represents a sequence of symbols by some interval in [0.0,1.0) Most efficient for binary image coding
Predictive coding
Can be lossless or lossy Differential Pulse Code Modulation (DPCM)
Lossless version of predictive coding Used in the sequential lossless mode of jpeg Exploits spatial redundancy Predicts a pixel value based on neighbouring pixels value
DPCM - coding
1. Predict a pixel value based its neighbours. Round to the nearest integer
Prediction is usually a linear one
2. Find the error between predicted and actual pixel values 3. Encode the error using variable length coding
2-D prediction
input
encoder
Compressed output
predictor
Ex. Delta modulation which uses 1 bit quantiser and horizontal prediction
Transform coding
Image is compressed in the transform domain by modifying the image transform Steps Image is divided into n x n blocks and transform is computed for each block Transform coefficients are processed (quantisation etc) and then encoded
What level of quantisation to use? Should quantisation level be adapted to local image content?
JPEG
JPEG: Joint Photographic Experts Group Four modes: 1. Sequential lossless mode
(de)compression requires a single scan of the image DPCM + entropy encoding for the prediction error
JPEG modes
DCT-based lossy modes 2. Sequential 3. Progressive scan
- (de)compression requires multiple scans of the image
4. Hierarchical
- Compress at multiple resolutions for different display devices
Drawback: Blocking artifacts appear at high compression rates
Baseline JPEG
DCT
Quantiser
Zig-zag ordering
Entropy encoding
Coded image
JPEG 2000
Strategy: Compress once and decode in many modes Based on discrete wavelet transform
Daubechies biorthogonal spline
JPEG2000
Rate control Input image block Multicomponent transformation Wavelet transform Quantiser Encoding Coded image Region of interest
Multicomponent transformation: tiling (optional), dc level shifting and RGB conversion with or without subsampling
YUV