
The Moore-Penrose Generalized Inverse

Consider equation (1) in Euclidean spaces: computing the generalised inverse when $A$ is a matrix of full rank is relatively easy. If, however, $\operatorname{Ran}(A)$ is not the full image space, then for a right-hand side $y$ of equation (1) lying outside $\operatorname{Ran}(A)$ the equation has no exact solution. In such a case it is natural to find $x$ such that $Ax$ has minimal distance to $y$. If, on the other hand, $\operatorname{Ker}(A) \neq \{0\}$, then equation (1) does not have a unique solution, and one might be interested in choosing, among the multiple solutions, the specific solution with minimal norm.
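Both difficulties can be made concrete in the finite-dimensional setting. The following sketch (the matrix and right-hand side are made up for illustration and are not taken from the text) uses NumPy's SVD-based `pinv` to pick the least-squares solution of minimal norm for a rank-deficient system:

```python
import numpy as np

# Illustrative rank-deficient matrix: row 2 = 2 * row 1, so rank(A) = 2
# and Ker(A) != {0}; the system Ax = y has no unique solution.
A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],
              [0.0, 1.0, 1.0]])
y = np.array([1.0, 2.0, 1.0])

# Minimum-norm least-squares solution via the Moore-Penrose inverse.
x_dag = np.linalg.pinv(A) @ y

# A null-space direction of A: the right singular vector for sigma = 0.
_, s, Vt = np.linalg.svd(A)
n = Vt[-1]

# Adding a Ker(A) component gives another least-squares solution ...
x_other = x_dag + 0.5 * n

# ... with the same residual but strictly larger norm.
print(np.linalg.norm(A @ x_dag - y), np.linalg.norm(A @ x_other - y))
print(np.linalg.norm(x_dag), np.linalg.norm(x_other))
```

Since `x_dag` is orthogonal to the null space, the Pythagorean theorem guarantees it is the unique solution of minimal norm.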

In the Hilbert space setting, we consider the following definition:

Definition 2.6. Let $A : H_1 \to H_2$ be a bounded linear operator. An element $x \in H_1$ is called

(i) a least-squares solution of equation (1) if
\[ \|Ax - y\| = \inf\{\|Az - y\| \mid z \in H_1\}. \tag{6} \]

(ii) a best-approximate solution (or minimum-norm solution) of equation (1) if $x$ is a least-squares solution of equation (1) and
\[ \|x\| = \inf\{\|z\| \mid z \text{ is a least-squares solution of equation (1)}\}. \tag{7} \]

It is easily seen that if a least-squares solution exists, then the minimum-norm solution is unique, because it is the minimizer of a quadratic error functional; thus the best-approximate solution can be defined as the least-squares solution of minimal norm. The notion of a best-approximate solution is related to the Moore-Penrose generalized inverse of $A$.

Given a bounded linear operator $A \in \mathcal{L}(H_1, H_2)$, let $\tilde{A} : \operatorname{Ker}(A)^\perp \to \operatorname{Ran}(A)$ denote its restriction. The Moore-Penrose generalised inverse of $A$, denoted $A^\dagger$, is the unique linear extension of $\tilde{A}^{-1}$ to the domain
\[ \mathcal{D}(A^\dagger) := \operatorname{Ran}(A) \oplus \operatorname{Ran}(A)^\perp \tag{8} \]
such that
\[ \operatorname{Ker}(A^\dagger) = \operatorname{Ran}(A)^\perp. \tag{9} \]

Thus $\tilde{A}$ is injective (one-to-one) due to the restriction to $\operatorname{Ker}(A)^\perp$, and surjective (onto) due to the restriction of the image space to $\operatorname{Ran}(A)$. Hence $\tilde{A}$ is bijective and $\tilde{A}^{-1}$ exists. For any $y \in \mathcal{D}(A^\dagger)$ there are unique $y_1 \in \operatorname{Ran}(A)$ and $y_2 \in \operatorname{Ran}(A)^\perp$ with $y = y_1 + y_2$. Since $\operatorname{Ker}(\tilde{A}) = \{0\}$ and $\operatorname{Ran}(\tilde{A}) = \operatorname{Ran}(A)$, the operator $A^\dagger$ is well-defined, and from (9) and the linearity of $A^\dagger$ we have that

\[ A^\dagger y = A^\dagger y_1 + A^\dagger y_2 = A^\dagger y_1 = \tilde{A}^{-1} y_1. \tag{10} \]

Proposition 2.7. Let $P$ and $Q$ be the orthogonal projection operators onto $\operatorname{Ker}(A)$ and $\overline{\operatorname{Ran}(A)}$, respectively. Then $A^\dagger$ is uniquely characterized by the four Moore-Penrose equations
\begin{align}
A A^\dagger A &= A, \tag{11}\\
A^\dagger A A^\dagger &= A^\dagger, \tag{12}\\
A^\dagger A &= I - P, \tag{13}\\
A A^\dagger &= Q|_{\mathcal{D}(A^\dagger)}, \tag{14}
\end{align}
where $I$ is the identity operator.
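In finite dimensions the four Moore-Penrose equations can be checked numerically. A minimal sketch with a randomly generated rank-deficient matrix (an illustrative choice, not from the text), using the fact that $A^\dagger A$ and $A A^\dagger$ are then the orthogonal projectors onto $\operatorname{Ker}(A)^\perp$ and $\operatorname{Ran}(A)$:

```python
import numpy as np

rng = np.random.default_rng(0)
# 5x4 matrix of rank at most 3 (product of 5x3 and 3x4 factors).
A = rng.standard_normal((5, 3)) @ rng.standard_normal((3, 4))
A_dag = np.linalg.pinv(A)

# (11) A A† A = A  and  (12) A† A A† = A†
print(np.allclose(A @ A_dag @ A, A))
print(np.allclose(A_dag @ A @ A_dag, A_dag))

# (13) A† A = I - P: an orthogonal projector (idempotent and symmetric)
print(np.allclose((A_dag @ A) @ (A_dag @ A), A_dag @ A))
print(np.allclose(A_dag @ A, (A_dag @ A).T))

# (14) A A† = Q: the orthogonal projector onto Ran(A)
print(np.allclose((A @ A_dag) @ (A @ A_dag), A @ A_dag))
print(np.allclose(A @ A_dag, (A @ A_dag).T))
```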

Proof. For each $y \in \mathcal{D}(A^\dagger)$, by the definition of the Moore-Penrose inverse $A^\dagger$ we have
\[ A^\dagger y = \tilde{A}^{-1} Q y = A^\dagger Q y, \tag{15} \]
so that $A^\dagger y \in \operatorname{Ran}(\tilde{A}^{-1}) = \operatorname{Ker}(A)^\perp$. For each $x \in \operatorname{Ker}(A)^\perp$ it follows that
\[ A^\dagger A x = \tilde{A}^{-1} \tilde{A} x = x. \]
The above assertions prove that $\operatorname{Ran}(A^\dagger) = \operatorname{Ker}(A)^\perp$. Now equation (15) implies that
\[ A A^\dagger y = A A^\dagger Q y = A \tilde{A}^{-1} Q y = \tilde{A} \tilde{A}^{-1} Q y = Q y, \]
since $\tilde{A}^{-1} Q y \in \operatorname{Ker}(A)^\perp$, and hence equation (14) holds. By the definition of $A^\dagger$, it holds for each $x \in H_1$ that
\[ A^\dagger A x = \tilde{A}^{-1} A (P x + (I - P) x) = \underbrace{\tilde{A}^{-1} A P x}_{=0} + \tilde{A}^{-1} \tilde{A} (I - P) x = (I - P) x. \tag{16} \]
Inserting equation (14) into equation (15) yields
\[ A^\dagger y = A^\dagger Q y = A^\dagger A A^\dagger y \]
for all $y \in \mathcal{D}(A^\dagger)$, which is equation (12). Equation (16) implies equation (13), and equation (13) in turn implies that
\[ A A^\dagger A = A (I - P) = A - A P = A, \]
since $A P = 0$; hence equation (11) holds as well.

Any operator satisfying equation (11) or equation (12) is referred to as an inner inverse or outer inverse of $A$, respectively.

The following theorem provides a connection between the least-squares solutions and how they are computed via the Moore-Penrose inverse.

Theorem 2.8. For all $y \in \mathcal{D}(A^\dagger)$, equation (1) has a unique best-approximate solution, given by
\[ x := A^\dagger y. \]
The set of all least-squares solutions is $A^\dagger y + \operatorname{Ker}(A)$.

Proof. For a fixed $y \in \mathcal{D}(A^\dagger)$, let us construct the set
\[ S = \{ z \in H_1 \mid A z = Q y \}. \]
Since $y \in \mathcal{D}(A^\dagger) = \operatorname{Ran}(A) \oplus \operatorname{Ran}(A)^\perp$, it follows that $Q y \in \operatorname{Ran}(A)$ and therefore $S \neq \emptyset$. Because $Q$ is an orthogonal projector, we have for all $z \in S$ and for all $x \in H_1$:
\[ \|A z - y\| = \|Q y - y\| \le \|A x - y\|. \]
So all elements of $S$ are least-squares solutions of $A x = y$. Conversely, let $z$ be a least-squares solution of $A x = y$. Then
\[ \|Q y - y\| \le \|A z - y\| = \inf\{\|u - y\| \mid u \in \operatorname{Ran}(A)\} = \|Q y - y\|. \]
Thus $A z$ is the closest element to $y$ in $\operatorname{Ran}(A)$, that is, $A z = Q y$, and
\[ S = \{ x \in H_1 \mid x \text{ is a least-squares solution of } A x = y \}. \]

Now, let $\bar{z}$ be the element of minimal norm in $S = A^{-1}(\{Q y\})$. Since then $S = \bar{z} + \operatorname{Ker}(A)$, it suffices to show that
\[ \bar{z} = A^\dagger y. \]
As the element of minimal norm in $S = \bar{z} + \operatorname{Ker}(A)$, $\bar{z}$ is orthogonal to $\operatorname{Ker}(A)$, that is, $\bar{z} \in \operatorname{Ker}(A)^\perp$. This implies that
\[ \bar{z} = (I - P)\bar{z} = A^\dagger A \bar{z} = A^\dagger Q y = A^\dagger A A^\dagger y = A^\dagger y, \tag{17} \]
that is, $\bar{z} = A^\dagger y$.
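Theorem 2.8 admits a finite-dimensional sanity check (with an arbitrary random underdetermined system, assumed only for illustration): NumPy's SVD-based `lstsq` returns the minimum-norm least-squares solution, which should coincide with $A^\dagger y$ and be orthogonal to $\operatorname{Ker}(A)$:

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 5))    # underdetermined: Ker(A) != {0}
y = rng.standard_normal(4)

# Minimum-norm least-squares solution from lstsq ...
x_bar, *_ = np.linalg.lstsq(A, y, rcond=None)

# ... agrees with the Moore-Penrose formula x = A† y.
print(np.allclose(x_bar, np.linalg.pinv(A) @ y))

# x_bar is orthogonal to Ker(A): test against a null-space vector.
_, _, Vt = np.linalg.svd(A)
n = Vt[-1]                         # right singular vector with sigma = 0
print(abs(n @ x_bar))              # numerically zero
```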

In linear algebra it is a well-known fact that least-squares solutions can be characterized by the normal equations; the next theorem verifies that this assertion remains true in the continuous case.

Theorem 2.9. Let $A^*$ denote the adjoint operator of $A$. For given $y \in \mathcal{D}(A^\dagger)$, $x \in H_1$ is a least-squares solution of equation (1) if and only if $x$ satisfies the normal equations
\[ A^* A x = A^* y. \tag{18} \]

Proof. An element $x \in H_1$ is a least-squares solution of equation (1) if and only if $A x$ coincides with the projection of $y$ onto $\overline{\operatorname{Ran}(A)}$. This is equivalent to $A x - y \in \operatorname{Ran}(A)^\perp = \operatorname{Ker}(A^*)$. Thus we conclude that $A^*(A x - y) = 0$, which is equivalent to equation (18).

Furthermore, a least-squares solution has minimal norm if and only if $x \in \operatorname{Ker}(A)^\perp$.
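A short numerical sketch of Theorem 2.9 (the full-column-rank matrix is an illustrative assumption, chosen so that $A^*A$ is invertible and the normal equations can be solved directly):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((6, 3))    # full column rank (with probability 1)
y = rng.standard_normal(6)

# Solve the normal equations A*A x = A*y directly.
x_normal = np.linalg.solve(A.T @ A, A.T @ y)

# Same least-squares solution as the Moore-Penrose inverse gives.
x_pinv = np.linalg.pinv(A) @ y
print(np.allclose(x_normal, x_pinv))

# The residual Ax - y lies in Ran(A)^perp = Ker(A*): A*(Ax - y) = 0.
print(np.allclose(A.T @ (A @ x_normal - y), 0))
```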

A direct consequence of Theorem 2.9 is that $A^\dagger y$ is the solution of equation (18) with minimal norm. Consequently, we have
\[ A^\dagger y = (A^* A)^\dagger A^* y, \tag{19} \]
and this means that in order to approximate $A^\dagger y$, we may instead compute an approximation via equation (18).
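Identity (19) can likewise be verified numerically; the rank-deficient matrix below is made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((5, 4))
A[:, 3] = A[:, 0] + A[:, 1]        # force rank deficiency: rank(A) = 3
y = rng.standard_normal(5)

# A† y versus (A*A)† A* y, cf. equation (19).
lhs = np.linalg.pinv(A) @ y
rhs = np.linalg.pinv(A.T @ A) @ (A.T @ y)
print(np.allclose(lhs, rhs))
```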

To analyse the domain of the generalised inverse, one can see from the definition of the Moore-Penrose inverse in equation (8) that $\mathcal{D}(A^\dagger)$ is the natural domain of definition for $A^\dagger$, in the sense that if $y \notin \mathcal{D}(A^\dagger)$, then no least-squares solution of equation (1) exists. Thus, contrary to the finite-dimensional case, the concept of a minimum-norm solution as introduced does not always provide a solution to the problem, although it does impose uniqueness. Since $A$ is a bounded linear operator, the orthogonal complements are always closed, and we conclude from equation (8) that
\[ \overline{\mathcal{D}(A^\dagger)} = \overline{\operatorname{Ran}(A)} \oplus \operatorname{Ran}(A)^\perp = \overline{\operatorname{Ran}(A)} \oplus \operatorname{Ker}(A^*) = H_2. \]

The domain $\mathcal{D}(A)$ is dense in $H_1$, whereas $\mathcal{D}(A^\dagger)$ is dense in $H_2$. It follows that $\mathcal{D}(A^\dagger) = H_2$ if $\operatorname{Ran}(A)$ is closed, and vice versa: $\mathcal{D}(A^\dagger) = H_2$ implies that $\operatorname{Ran}(A)$ is closed. Furthermore, for $y \in \operatorname{Ran}(A)^\perp = \operatorname{Ker}(A^\dagger)$, the best-approximate solution is $x = 0$. It is therefore important to check whether a given $y \in \overline{\operatorname{Ran}(A)}$ also satisfies $y \in \operatorname{Ran}(A)$; if this holds for every such $y$, then $A^\dagger$ is continuous. If, however, some $y \in \overline{\operatorname{Ran}(A)} \setminus \operatorname{Ran}(A)$ exists, this is already enough to prove that $A^\dagger$ is discontinuous. This leads us to the following theorems and the introduction of compact operators, which discuss the discontinuities of the Moore-Penrose inverse.
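In finite dimensions $\operatorname{Ran}(A)$ is always closed, so $A^\dagger$ is always bounded; the analogue of this discontinuity nevertheless shows up as instability near rank deficiency. A minimal sketch (with a made-up $2 \times 2$ matrix) showing how the pseudoinverse of a perturbed rank-deficient matrix blows up as the perturbation shrinks:

```python
import numpy as np

A = np.array([[1.0, 0.0],
              [0.0, 0.0]])                 # rank-1 matrix
for eps in (1e-2, 1e-6, 1e-10):
    A_eps = A + np.diag([0.0, eps])        # rank-2 perturbation of A
    # pinv(A_eps) = diag(1, 1/eps), so its norm grows like 1/eps,
    # even though A_eps -> A: the map A -> pinv(A) is discontinuous
    # at rank-deficient matrices.
    print(eps, np.linalg.norm(np.linalg.pinv(A_eps)))
```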

The following theorem will be used to decide when the range $\operatorname{Ran}(A)$ of a continuous operator between two Hilbert spaces is closed.

Theorem 2.10 (Bounded inverse theorem). Let $H_1$ and $H_2$ be Hilbert spaces. If $A \in \mathcal{L}(H_1, H_2)$ is bijective (a one-to-one and onto mapping), then the inverse map $A^{-1} \in \mathcal{L}(H_2, H_1)$.

Proof. The proof of Theorem 2.10 can be found in [28, Theorem 8.72].

The Closed Graph Theorem provides conditions under which a closed linear operator, as defined in Definition 2.3, is bounded.

Theorem 2.11 (Closed Graph Theorem). Let $H_1$ and $H_2$ be Hilbert spaces and let $A : \mathcal{D}(A) \to H_2$ be a linear operator from $H_1$ to $H_2$. If $A$ is a closed linear operator and its domain $\mathcal{D}(A)$ is closed in $H_1$, then the operator $A$ is bounded.

Proof. Since $H_1$ and $H_2$ are complete, so is $H_1 \times H_2$. By assumption, $\operatorname{gr}(A)$ is a closed subspace of $H_1 \times H_2$ and $\mathcal{D}(A)$ is a closed subspace of $H_1$; thus $\operatorname{gr}(A)$ and $\mathcal{D}(A)$ are complete. We now define the projection mapping
\[ P : \operatorname{gr}(A) \longrightarrow \mathcal{D}(A) \]
by
\[ P(x, Ax) := x. \]
The mapping $P$ is linear and bounded, since
\[ \|P(x, Ax)\| = \|x\| \le \|x\| + \|Ax\| = \|(x, Ax)\| \]
for all $x \in \mathcal{D}(A)$. Moreover, $P$ is bijective, and its inverse
\[ P^{-1} : \mathcal{D}(A) \to \operatorname{gr}(A) \]
is defined by
\[ P^{-1} x := (x, Ax) \]
for all $x \in \mathcal{D}(A)$. Since $\operatorname{gr}(A)$ and $\mathcal{D}(A)$ are complete, by the Bounded Inverse Theorem 2.10 the mapping $P^{-1}$ is bounded, so there is a constant $b$ such that
\[ \|(x, Ax)\| = \|P^{-1} x\| \le b \|x\| \]
for all $x \in \mathcal{D}(A)$. But this implies that $A$ is bounded, since
\[ \|A x\| \le \|A x\| + \|x\| = \|(x, Ax)\| \le b \|x\| \]
for all $x \in \mathcal{D}(A)$.

Applying the Closed Graph Theorem to the Moore-Penrose generalized inverse gives the following result.

Theorem 2.12. Let $A \in \mathcal{L}(H_1, H_2)$. Then $A^\dagger \in \mathcal{L}(\mathcal{D}(A^\dagger), H_1)$ if and only if $\operatorname{Ran}(A)$ is closed.

Proof. Before proving the result, we derive the following identity:
\[ \{(y_1, \tilde{A}^{-1} y_1) \mid y_1 \in \operatorname{Ran}(A)\} = \{(Ax, x) \mid x \in H_1\} \cap (H_2 \times \operatorname{Ker}(A)^\perp). \tag{20} \]
Let $y_1 \in \operatorname{Ran}(A)$ and $x := \tilde{A}^{-1} y_1$; by the definition of $\tilde{A}$, $x \in \operatorname{Ker}(A)^\perp$, and due to equation (14) we have
\[ A x = A A^\dagger y_1 = y_1. \]
Hence it follows that
\[ (y_1, \tilde{A}^{-1} y_1) = (Ax, x) \in H_2 \times \operatorname{Ker}(A)^\perp. \]
Conversely, if $x \in \operatorname{Ker}(A)^\perp$ and $y_1 := Ax$ (hence $y_1 \in \operatorname{Ran}(A)$), then $\tilde{A}^{-1} y_1 = A^\dagger A x = x$, so that $(y_1, \tilde{A}^{-1} y_1) = (Ax, x)$. Thus equation (20) holds.

By the definition of $A^\dagger$, we have for the graph of $A^\dagger$:
\begin{align*}
\operatorname{gr}(A^\dagger) &= \{(y, A^\dagger y) \mid y \in \mathcal{D}(A^\dagger)\}\\
&= \{(y_1 + y_2, \tilde{A}^{-1} y_1) \mid y_1 \in \operatorname{Ran}(A),\ y_2 \in \operatorname{Ran}(A)^\perp\}\\
&= \{(y_1, \tilde{A}^{-1} y_1) \mid y_1 \in \operatorname{Ran}(A)\} + (\operatorname{Ran}(A)^\perp \times \{0\}),
\end{align*}
which together with equation (20) implies that
\[ \operatorname{gr}(A^\dagger) = \big[\{(Ax, x) \mid x \in H_1\} \cap (H_2 \times \operatorname{Ker}(A)^\perp)\big] + \big[\operatorname{Ran}(A)^\perp \times \{0\}\big]. \tag{21} \]
The spaces on the right-hand side of equation (21) are closed and orthogonal to each other in $H_2 \times H_1$, so that their sum $\operatorname{gr}(A^\dagger)$ is also closed.

To prove the second part of the proposition, assume now that $\operatorname{Ran}(A)$ is closed, so that $\mathcal{D}(A^\dagger) = H_2$. Because of the Closed Graph Theorem 2.11, $A^\dagger$ is bounded. Conversely, let $A^\dagger$ be bounded; then $A^\dagger$ has a unique continuous extension $\bar{A}^\dagger$ to $H_2$. From equation (14) and the continuity of $\bar{A}^\dagger$ we conclude that
\[ A \bar{A}^\dagger = Q. \]
Hence, for $y \in \overline{\operatorname{Ran}(A)}$ we have $y = Q y = A \bar{A}^\dagger y \in \operatorname{Ran}(A)$. Consequently, we obtain
\[ \overline{\operatorname{Ran}(A)} \subseteq \operatorname{Ran}(A), \]
and therefore $\operatorname{Ran}(A)$ is closed.