
matrix identities

sam roweis

(revised June 1999)

note that a, b, c and A, B, C do not depend on X, Y, x, y, or z.

0.1 basic formulae

A(B + C) = AB + AC   (1a)
(A + B)^T = A^T + B^T   (1b)
(AB)^T = B^T A^T   (1c)
if the individual inverses exist, (AB)^{-1} = B^{-1} A^{-1}   (1d)
(A^{-1})^T = (A^T)^{-1}   (1e)
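these are easy to confirm numerically; a minimal sanity check in numpy (an addition, not part of the original note, with arbitrary random matrices):

import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))

assert np.allclose((A @ B).T, B.T @ A.T)                    # (1c)
assert np.allclose(np.linalg.inv(A @ B),
                   np.linalg.inv(B) @ np.linalg.inv(A))     # (1d)
assert np.allclose(np.linalg.inv(A).T, np.linalg.inv(A.T))  # (1e)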

0.2 trace, determinant and rank

|AB| = |A| |B|   (2a)
|A^{-1}| = 1/|A|   (2b)
|A| = ∏ evals   (2c)
Tr[A] = ∑ evals   (2d)
if the cyclic products are well defined,
Tr[ABC...] = Tr[BC...A] = Tr[C...AB] = ...   (2e)
rank[A] = rank[A^T A] = rank[A A^T]   (2f)
condition number = γ = sqrt( biggest eval / smallest eval )   (2g)
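a quick numpy check of (2c)-(2e), again an addition rather than part of the note; the eigenvalues of a random real matrix may be complex, which allclose handles:

import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))
C = rng.standard_normal((4, 4))
evals = np.linalg.eigvals(A)

assert np.allclose(np.prod(evals), np.linalg.det(A))          # (2c)
assert np.allclose(np.sum(evals), np.trace(A))                # (2d)
assert np.allclose(np.trace(A @ B @ C), np.trace(B @ C @ A))  # (2e)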
derivatives of scalar forms with respect to scalars, vectors, or matrices are
indexed in the obvious way. similarly, the indexing for derivatives of vectors
and matrices with respect to scalars is straightforward.

0.3 derivatives of traces
∂Tr[X]/∂X = I   (3a)
∂Tr[XA]/∂X = ∂Tr[AX]/∂X = A^T   (3b)
∂Tr[X^T A]/∂X = ∂Tr[A X^T]/∂X = A   (3c)
∂Tr[X^T A X]/∂X = (A + A^T) X   (3d)
∂Tr[X^{-1} A]/∂X = -(X^{-1} A X^{-1})^T   (3e)
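these can be verified against central finite differences; the helper grad_fd below is hypothetical scaffolding (not from the note), using the (i, j) indexing convention stated above:

import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((3, 3))
X = rng.standard_normal((3, 3)) + 3 * np.eye(3)  # keep X invertible
eps = 1e-6

def grad_fd(f, X):
    # central-difference gradient G with G[i, j] = df / dX[i, j]
    G = np.zeros_like(X)
    for i in range(X.shape[0]):
        for j in range(X.shape[1]):
            E = np.zeros_like(X)
            E[i, j] = eps
            G[i, j] = (f(X + E) - f(X - E)) / (2 * eps)
    return G

Xi = np.linalg.inv(X)
assert np.allclose(grad_fd(lambda Y: np.trace(Y @ A), X),
                   A.T, atol=1e-5)                              # (3b)
assert np.allclose(grad_fd(lambda Y: np.trace(Y.T @ A @ Y), X),
                   (A + A.T) @ X, atol=1e-5)                    # (3d)
assert np.allclose(grad_fd(lambda Y: np.trace(np.linalg.inv(Y) @ A), X),
                   -(Xi @ A @ Xi).T, atol=1e-5)                 # (3e)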

0.4 derivatives of determinants


∂|AXB|/∂X = |AXB| (X^{-1})^T = |AXB| (X^T)^{-1}   (4a)
∂ ln|X| / ∂X = (X^{-1})^T = (X^T)^{-1}   (4b)
∂ ln|X(z)| / ∂z = Tr[ X^{-1} ∂X/∂z ]   (4c)
∂|X^T A X|/∂X = |X^T A X| ( A X (X^T A X)^{-1} + A^T X (X^T A^T X)^{-1} )   (4d)
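a finite-difference check of (4b) and (4c), an addition to the note; here ln|X| is taken as the log of the absolute determinant so a negative determinant does not break the check:

import numpy as np

rng = np.random.default_rng(3)
X = rng.standard_normal((3, 3)) + 3 * np.eye(3)   # keep X well conditioned
V = rng.standard_normal((3, 3))                   # dX/dz along the path X + zV
eps = 1e-6
logdet = lambda M: np.log(abs(np.linalg.det(M)))

G = np.zeros_like(X)
for i in range(3):
    for j in range(3):
        E = np.zeros_like(X)
        E[i, j] = eps
        G[i, j] = (logdet(X + E) - logdet(X - E)) / (2 * eps)
assert np.allclose(G, np.linalg.inv(X).T, atol=1e-5)               # (4b)

dz = (logdet(X + eps * V) - logdet(X - eps * V)) / (2 * eps)
assert np.allclose(dz, np.trace(np.linalg.inv(X) @ V), atol=1e-5)  # (4c)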

0.5 derivatives of scalar forms


∂(a^T x)/∂x = ∂(x^T a)/∂x = a   (5a)
∂(x^T A x)/∂x = (A + A^T) x   (5b)
∂(a^T X b)/∂X = a b^T   (5c)
∂(a^T X^T b)/∂X = b a^T   (5d)
∂(a^T X a)/∂X = ∂(a^T X^T a)/∂X = a a^T   (5e)
∂(a^T X^T C X b)/∂X = C^T X a b^T + C X b a^T   (5f)
∂( (Xa + b)^T C (Xa + b) )/∂X = (C + C^T)(Xa + b) a^T   (5g)
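two of these, (5b) and (5c), checked by central differences with arbitrary random data (an addition, not part of the note):

import numpy as np

rng = np.random.default_rng(4)
A = rng.standard_normal((3, 3))
X = rng.standard_normal((3, 3))
a, b, x = rng.standard_normal(3), rng.standard_normal(3), rng.standard_normal(3)
eps = 1e-6

# (5b): d(x^T A x)/dx = (A + A^T) x
g = np.array([((x + eps * e) @ A @ (x + eps * e)
               - (x - eps * e) @ A @ (x - eps * e)) / (2 * eps)
              for e in np.eye(3)])
assert np.allclose(g, (A + A.T) @ x, atol=1e-5)

# (5c): d(a^T X b)/dX = a b^T
G = np.zeros_like(X)
for i in range(3):
    for j in range(3):
        E = np.zeros_like(X)
        E[i, j] = eps
        G[i, j] = (a @ (X + E) @ b - a @ (X - E) @ b) / (2 * eps)
assert np.allclose(G, np.outer(a, b), atol=1e-5)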

the derivative of one vector y with respect to another vector x is a matrix
whose (i, j)th element is ∂y(j)/∂x(i). such a derivative should be written as
∂y^T/∂x, in which case it is the Jacobian matrix of y wrt x. its determinant
represents the ratio of the hypervolume dy to that of dx, so that
∫ f(y) dy = ∫ f(y(x)) |∂y^T/∂x| dx. however, the sloppy forms ∂y/∂x,
∂y^T/∂x^T and ∂y/∂x^T are often used for this Jacobian matrix.
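a small numerical illustration of this change-of-variables rule (an addition to the note) for the linear map y = Ax, whose Jacobian matrix is the constant A, so the hypervolume ratio is |det A|; both grid sums below approximate the same integral (which is π for this choice of f):

import numpy as np

A = np.array([[2.0, 1.0],
              [0.0, 3.0]])
f = lambda Y: np.exp(-np.sum(Y**2, axis=-1))     # f(y) = exp(-|y|^2)

s = np.linspace(-6.0, 6.0, 801)
dx = (s[1] - s[0]) ** 2                          # area element of the x grid
X = np.stack(np.meshgrid(s, s), axis=-1).reshape(-1, 2)

lhs = f(X).sum() * dx                                  # ∫ f(y) dy
rhs = f(X @ A.T).sum() * abs(np.linalg.det(A)) * dx    # ∫ f(y(x)) |det A| dx
assert np.isclose(lhs, rhs, atol=1e-6)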

0.6 derivatives of vector/matrix forms

∂(X^{-1})/∂z = -X^{-1} (∂X/∂z) X^{-1}   (6a)
∂(Ax)/∂z = A (∂x/∂z)   (6b)
∂(XY)/∂z = X (∂Y/∂z) + (∂X/∂z) Y   (6c)
∂(AXB)/∂z = A (∂X/∂z) B   (6d)
∂(x^T A)/∂x = A   (6e)
∂(x^T)/∂x = I   (6f)
∂(x^T A x x^T)/∂x = (A + A^T) x x^T + x^T A x I   (6g)
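a finite-difference check of (6a) along the path X(z) = X0 + zV, again an addition with arbitrary random matrices:

import numpy as np

rng = np.random.default_rng(6)
X0 = rng.standard_normal((3, 3)) + 3 * np.eye(3)   # keep X(z) invertible
V = rng.standard_normal((3, 3))                    # dX/dz
eps = 1e-6

inv = np.linalg.inv
lhs = (inv(X0 + eps * V) - inv(X0 - eps * V)) / (2 * eps)   # d(X^{-1})/dz
rhs = -inv(X0) @ V @ inv(X0)                                # (6a)
assert np.allclose(lhs, rhs, atol=1e-5)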

0.7 constrained maximization


the maximum over x of the quadratic form:
μ^T x - (1/2) x^T A^{-1} x   (7a)
subject to the J conditions c_j(x) = 0 is given by:
x = Aμ + ACΛ,   Λ = -(C^T A C)^{-1} C^T A μ   (7b)
where the jth column of C is ∂c_j(x)/∂x.
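a sanity check of (7b), under the assumption that the constraints are linear, c_j(x) = C[:, j]^T x, so that C is a constant matrix; A here is a random symmetric positive definite example, which makes the quadratic form concave:

import numpy as np

rng = np.random.default_rng(7)
n, J = 5, 2
M = rng.standard_normal((n, n))
A = M @ M.T + n * np.eye(n)          # random SPD matrix
mu = rng.standard_normal(n)
C = rng.standard_normal((n, J))

Lam = -np.linalg.solve(C.T @ A @ C, C.T @ A @ mu)   # Λ from (7b)
x = A @ mu + A @ C @ Lam                            # the claimed maximizer

assert np.allclose(C.T @ x, 0)                            # constraints hold
assert np.allclose(mu - np.linalg.solve(A, x) + C @ Lam,
                   0)                                     # Lagrangian is stationary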

0.8 symmetric matrices


have real eigenvalues, though perhaps not distinct, and can always be
diagonalized to the form:
A = C Λ C^T   (8)

where the columns of C are (orthonormal) eigenvectors (i.e. C C^T = I) and
the diagonal of Λ holds the eigenvalues.
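numpy's symmetric eigensolver returns exactly this decomposition; a minimal check (an addition, not part of the note):

import numpy as np

rng = np.random.default_rng(8)
S = rng.standard_normal((4, 4))
A = (S + S.T) / 2                       # a random symmetric matrix

evals, C = np.linalg.eigh(A)            # real eigenvalues, orthonormal C
assert np.allclose(C @ np.diag(evals) @ C.T, A)   # A = C Λ C^T  (8)
assert np.allclose(C @ C.T, np.eye(4))            # C C^T = I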

0.9 block matrices


for conformably partitioned block matrices, addition and multiplication are
performed by adding and multiplying blocks in exactly the same way as the
scalar elements of regular matrices.
however, determinants and inverses of block matrices are very tricky; for
2 blocks by 2 blocks the results are:

| A11 A12 |
| A21 A22 | = |A22| · |F11| = |A11| · |F22|   (9a)

[ A11 A12 ]^{-1}   [ F11^{-1}                 -A11^{-1} A12 F22^{-1} ]
[ A21 A22 ]      = [ -F22^{-1} A21 A11^{-1}    F22^{-1}              ]   (9b)

                   [ A11^{-1} + A11^{-1} A12 F22^{-1} A21 A11^{-1}    -F11^{-1} A12 A22^{-1}                        ]
                 = [ -A22^{-1} A21 F11^{-1}                            A22^{-1} + A22^{-1} A21 F11^{-1} A12 A22^{-1} ]   (9c)

where
F11 = A11 - A12 A22^{-1} A21        F22 = A22 - A21 A11^{-1} A12
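a numerical check of (9a) and (9b) with random blocks, using the F's defined above (an addition to the note; inv and det are local aliases):

import numpy as np

rng = np.random.default_rng(9)
n = 3
A11 = rng.standard_normal((n, n)) + 3 * np.eye(n)
A12 = rng.standard_normal((n, n))
A21 = rng.standard_normal((n, n))
A22 = rng.standard_normal((n, n)) + 3 * np.eye(n)

inv, det = np.linalg.inv, np.linalg.det
M = np.block([[A11, A12], [A21, A22]])
F11 = A11 - A12 @ inv(A22) @ A21
F22 = A22 - A21 @ inv(A11) @ A12

assert np.allclose(det(M), det(A22) * det(F11))     # (9a)
assert np.allclose(det(M), det(A11) * det(F22))     # (9a)
Minv = np.block([[inv(F11),                  -inv(A11) @ A12 @ inv(F22)],
                 [-inv(F22) @ A21 @ inv(A11), inv(F22)]])
assert np.allclose(Minv, inv(M))                    # (9b)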

for block diagonal matrices things are much easier:



| A11  0  |
|  0  A22 | = |A11| |A22|   (9d)

[ A11  0  ]^{-1}   [ A11^{-1}    0     ]
[  0  A22 ]      = [    0     A22^{-1} ]   (9e)

0.10 matrix inversion lemma (sherman-morrison-woodbury)


using the above results for block matrices we can make some substitutions
and get the following important results:

(A + XBX^T)^{-1} = A^{-1} - A^{-1} X (B^{-1} + X^T A^{-1} X)^{-1} X^T A^{-1}   (10)

|A + XBX^T| = |B| |A| |B^{-1} + X^T A^{-1} X|   (11)
where A and B are square and invertible matrices but need not be of the
same dimension. this lemma often allows a really hard inverse to be
converted into an easy inverse. the most typical example of this is when A
is large but diagonal, and X has many rows but few columns.
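a check of (10) and (11) in exactly that typical regime, with A diagonal and X tall and thin (an addition, not part of the note; B is made SPD only so it is safely invertible):

import numpy as np

rng = np.random.default_rng(10)
n, k = 6, 2                             # A is n x n, B is k x k with k << n
A = np.diag(rng.uniform(1.0, 2.0, n))   # large-but-diagonal A
X = rng.standard_normal((n, k))
W = rng.standard_normal((k, k))
B = W @ W.T + np.eye(k)                 # small invertible B

inv, det = np.linalg.inv, np.linalg.det
lhs = inv(A + X @ B @ X.T)
rhs = inv(A) - inv(A) @ X @ inv(inv(B) + X.T @ inv(A) @ X) @ X.T @ inv(A)
assert np.allclose(lhs, rhs)                                          # (10)
assert np.allclose(det(A + X @ B @ X.T),
                   det(B) * det(A) * det(inv(B) + X.T @ inv(A) @ X))  # (11)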
