Anda di halaman 1dari 50

# Two-Di mensi onal Mot i on Est i mat i on

## (Par t I : Fundament al s & Basi c Tec hni ques) ( q )

Yao Wang
Polytechnic University Brooklyn NY11201 Polytechnic University, Brooklyn, NY11201
http://eeweb.poly.edu/~yao
Out l i ne
3Dmotion model 3D motion model
2-D motion model
2-D motion vs. optical flow
Optical flow equation and ambiguity in motion estimation
General methodologies in motion estimation
Motion representation
Motion estimation criterion
Optimization methods
Pixel-based motion estimation
Block-based motion estimation
EBMA algorithm
Yao Wang, 2003 2-D Motion Estimation, Part 1
EBMA algorithm
2-D Mot i on Model
Camera projection Camera projection
3D motion
Projection of 3-D motion Projection of 3 D motion
2D motion due to rigid object motion
Projective mapping
Approximation of projective mapping
Affine model
Bilinear model Bilinear model
Yao Wang, 2003 2-D Motion Estimation, Part 1 3
Pi nhol e Camer a
Y
Y
X
Z
X
3-D
point
X
Z
C
x
y
F
Camera
center
x
x
y
2-D
The image of an object is reversed from its
3-Dposition The object appears smaller
Yao Wang, 2003 2-D Motion Estimation, Part 1 4
Image
plane
image
3-D position. The object appears smaller
when it is farther away.
Pi nhol e Camer a Model :
Per spec t i ve Pr oj ec t i on Per spec t i ve Pr oj ec t i on
X
All points in this ray will
h th i Y
Y
Z
X
have the same image
X
x
x
y
x
y
Z
X
F
Z
Y
F y
Z
X
F x
Z
Y
F
y
Z
X
F
x
, , = = = =
C
(a)
Yao Wang, 2003 2-D Motion Estimation, Part 1 5
Z y x
Z Z Z F Z F
to related inversely are ,
Appr ox i mat e Model :
Or t hogr aphi c Pr oj ec t i on Or t hogr aphi c Pr oj ec t i on
Y
X
Z Y Z
X
x
y
x
Z
x
y
Z
(b) (b)
small is object inthe ationwith depthvari the as long as used be Can
,
) ( far very is object When the
Y y X x
Z
= =

Yao Wang, 2003 2-D Motion Estimation, Part 1 6
object. the of distance the to compared
small is object in the ation with depth vari the as long as used be Can
Ri gi d Obj ec t Mot i on
X
Object before
motion
Z
C
X C
T
Y
X C
X
Object after
motion
T T T : ; : ] [ ; ) ]( [ '
: center object the on wrp. translati and Rotation
T R C T C X R X u u u + + =
Yao Wang, 2003 2-D Motion Estimation, Part 1 7
z y x z y x
T T T , , : ; , , : ] [ ; ) ]( [ T R C T C X R X u u u + + =
Yao Wang, 2003 2-D Motion Estimation, Part 1 8
Fl ex i bl e Obj ec t Mot i on
Two ways to describe Two ways to describe
Decompose into multiple, but connected rigid sub-objects
Global motion plus local motion in sub-objects
Ex. Human body consists of many parts each undergo a
rigid motion
Yao Wang, 2003 2-D Motion Estimation, Part 1 9
3-D Mot i on -> 2-D Mot i on
X
Y
X
D
Z
2-D MV
3-D MV
X
Z
x
y
x
d
X
x
d
x
C
Yao Wang, 2003 2-D Motion Estimation, Part 1 10
2-D Mot i on Cor r espondi ng t o
Ri gi d Obj ec t Mot i on Ri gi d Obj ec t Mot i on
General case:
( ( ( (
General case:
T
T
T
Z
Y
X
r r r
r r r
r r r
Z
Y
X
z
y
x
(
(
(

+
(
(
(

(
(
(

=
(
(
(

'
'
'
9 8 7
6 5 4
3 2 1
F T Z F r y r x r
F T Z F r y r x r
F x
z
x
+ + +
+ + +
=

) (
) (
'
9 8 7
3 2 1
Projection e Perspectiv
F T Z F r y r x r
F T Z F r y r x r
F y
y
z
y
z
+ + +
+ + +
=
) (
) (
'
) (
9 8 7
6 5 4
9 8 7
Projective mapping:
y c x c
y b x b b
y
y c x c
y a x a a
x
2 1
2 1 0
2 1
2 1 0
1
' ,
1
'
: c) bY aX (Z planar is surface object When the
+ +
+ +
=
+ +
+ +
=
+ + =
Yao Wang, 2003 2-D Motion Estimation, Part 1 11
2 1 2 1
Sampl e Mot i on Fi el d
Oc c l usi on Ef f ec t
x
T
Changed region
Moving
object
Unchanged region
Background
to be covered
Motion is undefined in occluded regions
Frame k
M i
Uncovered
Frame k1
Yao Wang, 2003 2-D Motion Estimation, Part 1 13
Moving
object
Uncovered
Background
Typi c al Camer a Mot i ons
B
Track right
Dolly
backward
Boom up
Pan right
Tilt up
Track left
Dolly forward
Boom down
Pan left
Tilt down
Roll
Tilt down
Yao Wang, 2003 2-D Motion Estimation, Part 1 14
2-D Mot i on Cor r espondi ng t o
Camer a Mot i on Camer a Mot i on
(b) (a)
Camera zoom
Camera rotation around Z-axis (roll)
Yao Wang, 2003 2-D Motion Estimation, Part 1 15
Camera rotation around Z axis (roll)
2-D Mot i on Cor r espondi ng t o
Ri gi d Obj ec t Mot i on Ri gi d Obj ec t Mot i on
General case:
( ( ( (
General case:
T
T
T
Z
Y
X
r r r
r r r
r r r
Z
Y
X
z
y
x
(
(
(

+
(
(
(

(
(
(

=
(
(
(

'
'
'
9 8 7
6 5 4
3 2 1
F T Z F r y r x r
F T Z F r y r x r
F x
z
x
+ + +
+ + +
=

) (
) (
'
9 8 7
3 2 1
Projection e Perspectiv
F T Z F r y r x r
F T Z F r y r x r
F y
y
z
y
z
+ + +
+ + +
=
) (
) (
'
) (
9 8 7
6 5 4
9 8 7
Projective mapping:
y c x c
y b x b b
y
y c x c
y a x a a
x
2 1
2 1 0
2 1
2 1 0
1
' ,
1
'
: c) bY aX (Z planar is surface object When the
+ +
+ +
=
+ +
+ +
=
+ + =
Yao Wang, 2003 2-D Motion Estimation, Part 1 16
2 1 2 1
Pr oj ec t i ve Mappi ng
Non-chirping models Chirping models
(Original) (Affine) (Bilinear) (Projective) (Relative-
projective)
(Pseudo-
perspective)
Two features of projective mapping:
Chirping: increasing perceived spatial frequency for far away objects
Converging (Keystone): parallel lines converge in distance
Yao Wang, 2003 2-D Motion Estimation, Part 1 17
Af f i ne and Bi l i near Model
Affine (6 parameters): Affine (6 parameters):
(

+ +
+ +
=
(

y b x b b
y a x a a
y x d
y x d
y
x
2 1 0
2 1 0
) , (
) , (
Good for mapping triangles to triangles
Bilinear (8 parameters): Bilinear (8 parameters):
(

+ + +
+ + +
=
(

xy b y b x b b
xy a y a x a a
y x d
y x d
y
x
3 2 1 0
3 2 1 0
) , (
) , (
Good for mapping blocks to quadrangles
Yao Wang, 2003 2-D Motion Estimation, Part 1 18
Mot i on Fi el d Cor r espondi ng t o
Di f f er ent 2 D Mot i on Model s Di f f er ent 2-D Mot i on Model s
Translation
Affine
( ) (b) (a) (b)
Bilinear
Projective
Yao Wang, 2003 2-D Motion Estimation, Part 1 19
(c) (d)
2-D Mot i on vs. Opt i c al Fl ow p
2-DMotion: Projection of 3-Dmotion depending on 3Dobject motion and 2-D Motion: Projection of 3-D motion, depending on 3D object motion and
projection operator
Optical flow: Perceived 2-D motion based on changes in image pattern,
also depends on illumination and object surface texture
On the left, a sphere is rotating
under a constant ambient
illumination, but the observed image
does not change.
On the right, a point light source is
rotating around a stationary
sphere, causing the highlight point
th h t t t
Yao Wang, 2003 2-D Motion Estimation, Part 1 20
on the sphere to rotate.
Opt i c al Fl ow Equat i on p q
When illumination condition is unknown the best one When illumination condition is unknown, the best one
can do it to estimate optical flow.
Constant intensity assumption -> Optical flow equation y p p q
) , , ( ) , , (
: " assumption intensity constant " Under
= + + + t y x d t d y d x
t y x

) , , ( ) , , (
: expansion s Taylor' using But,
c
c
+
c
c
+
c
c
+ = + + + d
y
d
y
d
x
t y x d t d y d x
t y x t y x

0 or 0 or 0
: equation flow optical the have we two, above the Compare
=
c
+ V =
c
+
c
+
c
=
c
+
c
+
c
v v d d d
T
t

v
Yao Wang, 2003 2-D Motion Estimation, Part 1 21
0 or 0 or 0
c
+ V
c
+
c
+
c c
+
c
+
c t t
v
y
v
x
d
t
d
y
d
x
y x t y x
v
Ambi gui t i es i n Mot i on Est i mat i on g
Optical flowequation only
y
Optical flow equation only
constrains the flow vector
The flowvector in the
n
v
y

## The flow vector in the

tangent direction ( ) is
under-determined
In regions with constant
v

t
e
t

n
e
n

t
v
In regions with constant
brightness ( ), the
flow is indeterminate ->
Motion estimation is
0 = V
x
Tangent line
Motion estimation is
unreliable in regions with
flat texture, more reliable
near edges
0 =
c
c
+ V
+ =
t
v
v v
n
t t n n

e e v
Yao Wang, 2003 2-D Motion Estimation, Part 1 22
g
ct
Gener al Consi der at i ons
f or Mot i on Est i mat i on f or Mot i on Est i mat i on
Two categories of approaches: Two categories of approaches:
Feature based (more often used in object tracking, 3D
reconstruction from 2D)
Intensity based (based on constant intensity assumption)
(more often used for motion compensated prediction,
required in video coding, frame interpolation) -> Our focus
Three important questions
How to represent the motion field?
What criteria to use to estimate motion parameters? What criteria to use to estimate motion parameters?
How to search motion parameters?
Yao Wang, 2003 2-D Motion Estimation, Part 1 23
Mot i on Repr esent at i on p
Global:
Entire motion field is
represented by a few
global parameters
Pixel-based:
One MV at each pixel,
with some smoothness
constraint between
(a) (b)
( ) ( )
Region-based:
Entire frame is divided
into regions, each
region corresponding
Block-based:
Entire frame is divided
into blocks, and motion
in eachblock is
to an object or sub-
object with consistent
motion, represented by
a few parameters.
in each block is
characterized by a few
parameters.
Yao Wang, 2003 2-D Motion Estimation, Part 1 24
(c) (d)
Other representation: mesh-based (control grid) (to be discussed later)
Not at i ons
Time t
Time t t
Backward motion estimation
Anchor frame:
Target frame:
Motion parameters:
) (
1
x
) (
2
x
a
x
d(x; t, t t)
x
Time t t
Motion parameters:
Motion vector at a
pixel in the anchor
frame: ) (x d
a
x
d(x; t, t t)
Target frame
Anchor frame
Target frame
Forward motion estimation
Motion field:
Mapping function:
A e x a x d ), ; (
A e + = x a x d x a x w ), ; ( ) ; (
Target frame
A e + x a x d x a x w ), ; ( ) ; (
Yao Wang, 2003 2-D Motion Estimation, Part 1 25
Mot i on Est i mat i on Cr i t er i on
To minimize the displaced frame difference (DFD) To minimize the displaced frame difference (DFD)
MSE : 2 MAD; : 1
min ) ( )) ; ( ( ) (
1 2 DFD
= =
+ =

A e
P p
E
x
p
x a x d x a
To satisfy the optical flow equation
T i dditi l th t i t i l i ti
( ) min ) ( ) ( ) ; ( ) ( ) (
1 2 1 OF
+ V =

A e x
p
T
E x x a x d x a
To impose additional smoothness constraint using regularization
technique (Important in pixel- and block-based representation)
) ; ( ) ; ( ) (
2
=

a y d a x d a
s
E
Bayesian (MAP) criterion: to maximize the a posteriori probability
min ) ( ) (
DFD
+
A e e
a a
x y
s s DFD
N
E w E w
x
Yao Wang, 2003 2-D Motion Estimation, Part 1 26
max ) , (
1 2
= d D P
Rel at i on Among Di f f er ent Cr i t er i a g
OF criterion is good only if motion is small OF criterion is good only if motion is small.
OF criterion can often yield closed-form solution as
the objective function is quadratic in MVs. j q
When the motion is not small, can iterate the solution
based on the OF criterion to satisfy the DFD criterion.
Bayesian criterion can be reduced to the DFD
criterion plus motion smoothness constraint
More in the textbook More in the textbook
Yao Wang, 2003 2-D Motion Estimation, Part 1 27
Opt i mi zat i on Met hods p
Exhaustive search Exhaustive search
Typically used for the DFD criterion with p=1 (MAD)
Guarantees reaching the global optimal
Computation required may be unacceptable when number of p q y p
parameters to search simultaneously is large!
Fast search algorithms reach sub-optimal solution in shorter time
Typically used for the DFD or OF criterion with p=2 (MSE)
the gradient can often be calculated analytically
When used with the OF criterion, closed-form solution may be obtained
Reaches the local optimal point closest to the initial solution Reaches the local optimal point closest to the initial solution
Multi-resolution search
Search from coarse to fine resolution, faster than exhaustive search
Avoid being trapped into a local minimum
Yao Wang, 2003 2-D Motion Estimation, Part 1 28
Avoid being trapped into a local minimum
Gr adi ent Desc ent Met hod
Iteratively update the current estimate in the direction opposite the gradient
di ti direction.
Not a good initial g
A good initial
Appropriate
stepsize
Stepsize
The solution depends on the initial condition. Reaches the local minimum closest
to the initial condition
too big
Yao Wang, 2003 2-D Motion Estimation, Part 1 29
Choice of step side:
Fixed stepsize: Stepsize must be small to avoid oscillation, requires many iterations
New t ons Met hod
Newtons method Newtons method
Converges faster than 1
st
order method (I.e. requires fewer number of iterations to
reach convergence)
Requires more calculation in each iteration
More prone to noise (gradient calculation is subject to noise, more so with 2
nd
order
than with 1
st
order)
May not converge if \alpha >=1. Should choose \alpha appropriate to reach a good
Yao Wang, 2003 2-D Motion Estimation, Part 1 30
compromise between guaranteeing convergence and the convergence rate.
New t on-Raphson Met hod p
Newton-Ralphson method Newton-Ralphson method
Approximate 2
nd
order gradient with product of 1
st
Applicable when the objective function is a sum of squared errors
Only needs to calculate 1
st
order gradients, yet converge at a rate similar to y g y g
Newtons method.
Yao Wang, 2003 2-D Motion Estimation, Part 1 31
Pi x el -Based Mot i on Est i mat i on
Horn Schunck method Horn-Schunck method
OF + smoothness criterion
Multipoint neighborhood method p g
Assuming every pixel in a small block surrounding a pixel
has the same MV
Pel-recurrsive method Pel-recurrsive method
MV for a current pel is updated from those of its previous
pels, so that the MV does not need to be coded
D l d f l ti f id d Developed for early generation of video coder
Yao Wang, 2003 2-D Motion Estimation, Part 1 32
Mul t i poi nt Nei ghbor hood Met hod
Estimate the MV at each pixel independently by minimizing the Estimate the MV at each pixel independently, by minimizing the
DFD error over a neighborhood surrounding this pixel
Every pixel in the neighborhood is assumed to have the same
MV MV
Minimizing function:
i ) ( ) ( ) ( ) (
2
+

E d d
Optimization method:
Exhaustive search (feasible as one only needs to search one MV at
min ) ( ) ( ) ( ) (
) (
1 2 n DFD
+ =

e
n
B
n
w E
x x
x d x x d
Exhaustive search (feasible as one only needs to search one MV at
a time)
Need to select appropriate search range and search step-size
Yao Wang, 2003 2-D Motion Estimation, Part 1 33
Ex ampl e: Gr adi ent Desc ent
Met hod Met hod
min ) ( ) ( ) ( ) (
2
1 2 DFD
w E x d x x d + =

) ( ) ( ) (
min ) ( ) ( ) ( ) (
2
) (
n
n
) (
1 2 n DFD
B
n
B
n
n
n
n
e w
E
w E
x
d x x
d
d g
x d x x d
d x
x x
x x
+
e
e
c
c
+ =
c
c
=
+

| | ) ( ) ( ) ( ) (
2
2
2
2 2
) (
2
n
2
n n
T
B
n
n
n
n
n
e w w
E
x
d x x
x x
x
d
d H
d x
d x
x x
+
+
e
c
c
+ +
|
.
|

\
|
c
c
c
c
=
c
c
=

) (
2 2
) (
T
B
n
n
w
x x
x
d x
x x
+
e
|
.
|

\
|
c
c
c
c
~

| |
: method Raphson - Newton
) (
1
) (
n
) (
n
) 1 (
n
l l l
d g d d
+
= o
Yao Wang, 2003 2-D Motion Estimation, Part 1 34
| | ) ( ) (
) (
n
1
) (
n
) (
n
) 1 (
n
l l l l
d g d H d d

+
= o
Si mpl i f i c at i on Usi ng OF Cr i t er i on
2
( )
( ) ( ) = V + V =
c
c
+ V =

e
1 1 2 1
) (
2
1 2 1 n OF
0 ) ( ) ( ) ( ) ( ) (
min ) ( ) ( ) ( ) ( ) (
n
n
T
B
n
T
w
E
w E
x x
x x x d x x
d
x x d x x d

( )
( ) ( )
|
|
.
|

\
|
V
|
|
.
|

\
|
V V =
c

e
e
) (
1 2 1
1
) (
1 1 opt n,
) ( n
) ( ) ( ) ( ) ( ) ( ) ( ) (
n n
n
B B
T
B
w w
x x x x
x x
x x x x x x x d
d

. \ . \
) ( ) (
n n
The solution is good only if the actual MV is small. When this is not the
case one should iterate the above solution with the following update: case, one should iterate the above solution, with the following update:
) ( ) (
) 1 ( ) (
n
) 1 (
n
) (
n 2
) 1 (
2
+ +
+
A + =
+ =
l
n
l l
l l
d d
d x x
Yao Wang, 2003 2-D Motion Estimation, Part 1 35
iteration at that found MV the denote
) 1 ( +
A
l
n
where
Bl oc k -Based Mot i on Est i mat i on:
Over vi ew Over vi ew
Assume all pixels in a block undergo a coherent motion and Assume all pixels in a block undergo a coherent motion, and
search for the motion parameters for each block independently
Block matching algorithm (BMA): assume translational motion, 1
MV per block (2 parameter) MV per block (2 parameter)
Exhaustive BMA (EBMA)
Fast algorithms
Deformable block matching algorithm(DBMA): allowmore Deformable block matching algorithm (DBMA): allow more
complex motion (affine, bilinear), to be discussed later.
Yao Wang, 2003 2-D Motion Estimation, Part 1 36
Bl oc k Mat c hi ng Al gor i t hm
Overview: Overview:
Assume all pixels in a block undergo a translation, denoted by a
single MV
Estimate the MV for each block independently, by minimizing the
DFD error over this block
Minimizing function:
Optimization method:
min ) ( ) ( ) (
1 2 m DFD
+ =

e
m
B
p
m
E
x
x d x d
Exhaustive search (feasible as one only needs to search one MV at
a time), using MAD criterion (p=1)
Fast search algorithms
Integer vs fractional pel accuracy search
Yao Wang, 2003 2-D Motion Estimation, Part 1 37
Integer vs. fractional pel accuracy search
Ex haust i ve Bl oc k Mat c hi ng
Al gor i t hm (EBMA) Al gor i t hm (EBMA)
Target frame
R
x
Target frame
R
y
d
m
Anchor frame
B
Search region
B
m
Best match
B
m
Current block
Yao Wang, 2003 2-D Motion Estimation, Part 1 38
Compl ex i t y of I nt eger -Pel EBMA
Assumption Assumption
Image size: MxM
Block size: NxN
Search range: (-R,R) in each dimension g ( , )
Search stepsize: 1 pixel (assuming integer MV)
Operation counts (1 operation=1 -, 1 +, 1 *):
Each candidate position: N^2
Each block going through all candidates: (2R+1)^2 N^2
Entire frame: (M/N)^2 (2R+1)^2 N^2=M^2 (2R+1)^2
Independent of block size!
Example: M=512, N=16, R=16, 30 fps
Total operation count = 2.85x10^8/frame =8.55x10^9/second
Regular structure suitable for VLSI implementation
Yao Wang, 2003 2-D Motion Estimation, Part 1 39
Challenging for software-only implementation
Sampl e Mat l ab Sc r i pt f or
I nt eger pel EBMA I nt eger -pel EBMA
%f1: anchor frame; f2: target frame, fp: predicted image;
%mvx mvy: store the MV image %mvx,mvy: store the MV image
%widthxheight: image size; N: block size, R: search range
for i=1:N:height-N,
for j=1:N:width-N %for every block in the anchor frame
for k=-R:1:R,
for l=-R:1:R %for every search candidate
end;
end;end;
fp(i:i+N-1,j:j+N-1)=f2(i+dy:i+dy+N-1,j+dx:j+dx+N-1);
%put the best matching block in the predicted image
iblk=(floor)(i-1)/N+1; jblk=(floor)(j-1)/N+1; %block index
mvx(iblk,jblk)=dx; mvy(iblk,jblk)=dy; %record the estimated MV
end;end;
Yao Wang, 2003 2-D Motion Estimation, Part 1 40
end;end;
Note: A real working program needs to check whether a pixel in the candidate matching block falls outside the image
boundary and such pixel should not count in MAD. This program is meant to illustrate the main operations involved. Not the
actual working matlab script.
Fr ac t i onal Ac c ur ac y EBMA
Real MV may not always be multiples of pixels To allowsub- Real MV may not always be multiples of pixels. To allow sub-
pixel MV, the search stepsize must be less than 1 pixel
Half-pel EBMA: stepsize=1/2 pixel in both dimension
Difficulty: Difficulty:
Target frame only have integer pels
Solution:
Interpolate the target frame by factor of two before searching p g y g
Bilinear interpolation is typically used
Complexity:
4 times of integer-pel, plus additional operations for interpolation.
Fast algorithms:
Search in integer precisions first, then refine in a small search
region in half-pel accuracy.
Yao Wang, 2003 2-D Motion Estimation, Part 1 41
Hal f -Pel Ac c ur ac y EBMA
B
m
:
current
d
m
block
B : B
m
:
matching
block
Yao Wang, 2003 2-D Motion Estimation, Part 1 42
Bi l i near I nt er pol at i on
(x+1,y) (x,y) (2x,2y) (2x+1,2y)
(2x,2y+1) (2x+1,2y+1)
(x+1,y+1) (x,y+!)
O[2x,2y]=I[x,y]
O[2x+1,2y]=(I[x,y]+I[x+1,y])/2
O[2x,2y+1]=(I[x,y]+I[x+1,y])/2
O[2 1 2 1] (I[ ] I[ 1 ] I[ 1] I[ 1 1])/4
Yao Wang, 2003 2-D Motion Estimation, Part 1 43
O[2x+1,2y+1]=(I[x,y]+I[x+1,y]+I[x,y+1]+I[x+1,y+1])/4
e
e
n
c
h
o
r

f
r
a
m
e
a
r
g
e
t

f
r
a
m
e
a
n t
a
8
6
d
B
)
f
r
a
m
e

(
2
9
.
8
o
n

f
i
e
l
d
t
e
d

a
n
c
h
o
r

M
o
t
i
P
r
e
d
i
c
t
Example: Half-pel EBMA
Pr os and Cons w i t h EBMA
Blocking effect (discontinuity across block boundary) in the Blocking effect (discontinuity across block boundary) in the
predicted image
Because the block-wise translation model is not accurate
Fix: Deformable BMA (next lecture) Fix: Deformable BMA (next lecture)
Motion field somewhat chaotic
because MVs are estimated independently from block to block
Fi 1 M h b d ti ti ti ( t l t ) Fix 1: Mesh-based motion estimation (next lecture)
Fix 2: Imposing smoothness constraint explicitly
Wrong MV in the flat region
because motion is indeterminate when spatial gradient is near zero
Nonetheless, widely used for motion compensated prediction in
video coding
Yao Wang, 2003 2-D Motion Estimation, Part 1 45
Because its simplicity and optimality in minimizing prediction error
Fast Al gor i t hms f or BMA
Key idea to reduce the computation in EBMA: Key idea to reduce the computation in EBMA:
Reduce #of search candidates:
Only search for those that are likely to produce small errors.
Predict possible remaining candidates based on previous search result Predict possible remaining candidates, based on previous search result
Simplify the error measure (DFD) to reduce the computation
involved for each candidate
Classical fast algorithms Classical fast algorithms
Three-step
2D-log
Conjugate direction Conjugate direction
Many new fast algorithms have been developed since then
Some suitable for software implementation, others for VLSI
implementation(memory access, etc)
Yao Wang, 2003 2-D Motion Estimation, Part 1 46
implementation (memory access, etc)
Vc Demo Ex ampl e
VcDemo: Image and Video Compression Learning Tool
Developed at Delft University of Technology
http://www-ict.its.tudelft.nl/~inald/vcdemo/
Use the ME tool to show the motion estimation results with different parameter choices
Yao Wang, 2003 2-D Motion Estimation, Part 1 47
Summar y (I ) y ( )
3DMotion 3D Motion
Rigid vs. non-rigid motion
Camera model: 3D -> 2D projection
Perspective projection vs. orthographic projection
What causes 2Dmotion? What causes 2D motion?
Object motion projected to 2D
Camera motion
Models corresponding to typical camera motion and object motion
Piece wise projective mapping is a good model for projected rigid object Piece-wise projective mapping is a good model for projected rigid object
motion
Can be approximated by affine or bilinear functions
Affine functions can also characterize some global camera motions
Constraints for 2Dmotion Constraints for 2D motion
Optical flow equation
Derived from constant intensity and small motion assumption
Ambiguity in motion estimation
Yao Wang, 2003 2-D Motion Estimation, Part 1 48
Summar y (I I )
Howto represent motion: How to represent motion:
Pixel-based, block-based, region-based, global, etc.
Estimation criterion:
DFD(constant intensity) DFD (constant intensity)
OF (constant intensity+small motion)
Bayesian (MAP, DFD+motion smoothness)
Search method:
Exhaustive search, gradient-descent, multi-resolution (next lecture)
Pixel-based motion estimation
Most accurate representation, but also most costly to estimate
Block-based motion estimation
Good trade-off between accuracy and speed
EBMA and its fast but suboptimal variant is widely used in video
di f ti t dt l di ti
Yao Wang, 2003 2-D Motion Estimation, Part 1 49
coding for motion-compensated temporal prediction.
Homew or k
Chap 5: Sec. 5.1, 5.5
Chap 6: Sec. 6.1-6.4, Sec. 6.4.5,6.4.6 not required, Apx. A, B.
Written assignment
Prob. 5.3, 5.4, 5.6
Correction:
Prob. 5.3: Show that the projected 2-D motion of a 3-D object
undergoing rigid motion can be described by Eq.(5.5.13)
5.4: change aX+bY+cZ=1 to Z=aX+bY+c
Prob. 6.4,6.5,6.6
Computer assignment (Due 2 weeks from lecture date)
Prob 6 12 6 13 Prob. 6.12, 6.13