Processing math: 100%

Thursday, September 3, 2015

Multivariate Normal distribution

This was forthcoming, especially, if you want to understand Kalman filter.

A k-dimensional random vector xx=(x1,...,xk)T follows a multivariate normal distribution with mean μμ=(μ1,...,μk)T and positive-definite covariance matrix ΣΣ=[σij] if its probability density function is f(x|μ,Σ)=1(2π)k/2|Σ|1/2e12(xμ)TΣ1(xμ).
This is denoted by xNk(μ,Σ). A square matrix A(m×m) is a positive-definite matrix if A is symmetric, and all eigenvalues of A are positive. Alternatively, A is a positive-definite matrix if for any nonzero m-dimensional vector b, we have bTAb>0. For a positive-definite matrix A all eigenvalues are positive and matrix can be decomposed as A=PΛPT, where λ is a diagonal matrix consisting of all eigenvalues of A and P is an m×m matrix consisting of the m right eigenvectors of A, making P an orthogonal matrix, if eigenvalues are distinct.

For a symmetric matrix A, there exists a lower triangular matrix L with diagonal elements being 1 and a diagonal matrix G such that A=LGLT. If A is positive definite, then the diagonal elements of G are positive. In this case we can write A=(LG)(LG)T, where LG again is a lower triangle matrix. Such a decomposition is called Cholesky decomposition of A. This shows that a positive-definite matrix A can be diagonalized as L1A(LT)1=L1A(L1)T=G.

Let c=[c1,...,ck]T be a nonzero vector partitioned as x=[xT1,xT2]T, with the first of size p and the second of size kp such that, [x1x2]N([μ1μ2],[Σ11Σ12Σ21Σ22]).
Some properties of x are:

  1. cTxN(cTμ,cTΣc), any nonzero linear combination of x is univariate normal and vice-versa.
  2. The marginal distribution of xi is normal, xiNk(μi,Σii).
  3. Σ12=0 if an only if x1 and x2 are independent.
  4. The variable (xμ)TΣ1(xμ) follows a chi-squared distribution with m degrees of freedom.
  5. The conditional distribution of x1 given x2=b is also normally distributed as (x1|x2=b)N(μ1+Σ12Σ122(bμ2),Σ11Σ12Σ122Σ21).
Suppose that x, y, and z are three random vectors such that their joint distribution is multivariate normal. In addition, assume that the diagonal block covariance matrix Σww is nonsingular for w=x,y,z, and Σyz=0. Then,

  1. (x|y)N(μx+ΣxyΣ1yy(yμy),ΣxxΣxxΣ1yyΣyx)
  2. (x|y,z)N(E(x|y)+ΣxzΣ1zz(zμz),Var(x|y)ΣxzΣ1zzΣzx)


No comments:

Post a Comment