Solving Systems of Linear Equations (3)

Language:中文/EN

Pseudoinverse of a Matrix

The pseudoinverse introduced here is the Moore-Penrose inverse matrix, defined as follows: Given a matrix ARmnA\in R^{m*n}, if a matrix ARnmA^{\dagger}\in R^{n*m} satisfies AAA=AAA^{\dagger}A=A, and there exist two matrices URnn,VRmmU\in R^{n*n},V\in R^{m*m} such that

A=UAT,A=ATVA^{\dagger}=UA^{T},A^{\dagger}=A^{T}V

then AA^{\dagger} is called the pseudoinverse of matrix AA. It can be proven that the pseudoinverse of a matrix is unique.

For a matrix ARmn,mnA\in R^{m*n},m\ge n and rank(A)=nrank(A)=n, according to the above definition, the pseudoinverse of AA can be verified as

A=(ATA)1ATA^{\dagger}=(A^{T}A)^{-1}A^{T}

For a matrix ARmn,mnA\in R^{m*n},m\le n and rank(A)=mrank(A)=m, similarly, according to the above definition, the pseudoinverse of AA can be verified as

A=AT(AAT)1A^{\dagger}=A^{T}(AA^{T})^{-1}

The above two cases are the pseudoinverses when the matrix is full column rank or full row rank. For a general matrix ARmn,rank(A)=r,rmin(m,n)A\in R^{m*n},rank(A)=r,r\le min(m,n), we can use the full rank decomposition method to find its pseudoinverse.

For any matrix ARmn,rank(A)=r,rmin(m,n)A\in R^{m*n},rank(A)=r,r\le min(m,n), it can be decomposed into the product of a full row rank matrix and a full column rank matrix: That is, A=BC,BRmr,cRrn,rank(A)=rank(B)=rank(C)=rA=BC,B\in R^{m*r},c\in R^{r*n},rank(A)=rank(B)=rank(C)=r

It can be proven that: A=CBA^{\dagger}=C^{\dagger}B^{\dagger}, where B=(BTB)1BT,C=CT(CCT)1B^{\dagger}=(B^{T}B)^{-1}B^{T},C^{\dagger}=C^{T}(CC^{T})^{-1}, which is the method for finding the pseudoinverse of a general matrix.

Solving Linear Equations in General Cases

For a linear equation system Ax=b,ARmn,rank(A)=rAx=b,A\in R^{m*n},rank(A)=r, the vector x=Abx^{*}=A^{\dagger}b minimizes Axb2||Ax-b||^{2} in the space RnR^{n}; among all vectors in RnR^{n} that can minimize Axb2||Ax-b||^{2}, the vector x=Abx^{*}=A^{\dagger}b has the smallest norm and is unique.

When r=mr=m, AA is a full row rank matrix, and in this case, x=Ab=AT(AAT)1bx^{*}=A^{\dagger}b=A^{T}(AA^{T})^{-1}b, which is the minimum norm solution of the equation system Ax=bAx=b.

When r=nr=n, AA is a full column rank matrix, and in this case, x=Ab=(ATA)1ATbx^{*}=A^{\dagger}b=(A^{T}A)^{-1}A^{T}b, which is the least squares solution of the equation system Ax=bAx=b.