The reproducing kernel (semi-)Hilbert space

Radial Basis Functions on Scattered Data

5.1 Nonsingularity of interpolation matrices

5.2.2 The reproducing kernel (semi-)Hilbert space

There is a close connection between the radial basis functions we study in this book and the notion of reproducing kernels. All of the radial basis functions we have mentioned give rise to such reproducing kernels with respect to some Hilbert space and/or semi-Hilbert spaces, and, more generally, the conditionally positive definite functions which occurred in the previous section give rise to reproducing kernels and Hilbert spaces and/or semi-Hilbert spaces. A semi- Hilbert space can also be made by considering a Hilbert space which is additionally equipped with a semi-norm and semi-inner product as defined above, where it is usual to require that the semi-norm of any element of the space is bounded above by a fixed constant multiple of its norm.

5.2 Convergence analysis 109

Deﬁnition 5.2. _{Let X be a Hilbert space or a semi-Hilbert space of real-}

valued functions onRn_{, equipped with an inner product or a semi-inner product}

(·, ·) , respectively. A reproducing kernel for ( X ,(·, ·)) is a function k₍_·_, _·₎_: Rn _×Rn _→ R _{such that for any element f} _∈ _{X we have the pointwise equality}

with respect to x

( f ,k₍_·_, _x₎₎ ₌ _f₍_x₎_, _x_∈ Rn_.

In particular, k₍_·_, _x₎ _∈ _{X for all x}_∈ Rn_.

Standard properties of the reproducing kernel are that it is Hermitian and non- negative for all arguments from X (see, for instance, Cheney and Light, 1999). There are well-known necessary and sufﬁcient conditions known for the linear space X to be a reproducing kernel (semi-)Hilbert space, i.e. for the existence of such a reproducing kernel k _{within the Hilbert space setting. A}

classical result (Saitoh, 1988, or Yosida, 1968, for instance) states that a reproducing kernel exists if and only if the point evaluation operator is a bounded operator on the (semi-)Hilbert space. If we are in a Hilbert space setting, the reproducing kernel is, incidentally, unique. In our case, a reproducing kernel exists for the subspace X ˆ of all functions g that vanish on the K -unisolvent subset ˆ because the condition |g( x )| = O(g_X) = O(g_∗) from the previous subsection gives boundedness, i.e. continuity, for the linear operator of point evaluation in that subspace ˆ X .

In our example above, the boundedness of the point evaluation functional amounts to the uniform boundedness of any function g that is in X =

D−k L2(Rn_{) and vanishes on ˆ}__{, by a ﬁxed} _g_{-independent multiple of its semi-}

norm g_∗. That this is so is a consequence of the two following observations, but there are some extra conditions on the domain .

The ﬁrst one is related to the Sobolev inequality. We have taken this particular formulation from the standard reference (Brenner and Scott, 1994, p. 32) and restricted it again to the case that interests us. We also recall, and shall use often from now on, the special notation for the semi-norm

| f |2_k_, :=





|α|=k α∈Zn + k ! α_! | D α f ( x )|2 d x

as is usual elsewhere in the – especially ﬁnite element – literature, and, as a further notational simpliﬁcation, | f |_k := | f |_k_,Rn. Accordingly, we shall some-

times use the superspace D−k L2(_{) of} _X = D−k L2(Rn_{) that contains all} _f

with ﬁnite semi-norm when restricted to the domain . If f is only deﬁned on

110 5. Radial basis functions on scattered data

Sobolev inequality. Let  be a domain in n-dimensional Euclidean space with Lipschitz-continuous boundary and let k > n/2 be an integer. Then there is a constant C such that for any f ∈ D−k L2(Rn₎ _{the pointwise inequality}

(5.5) | f ( x )| ≤ C



j =0

| f |_j,

holds on the domain  , where C only depends on the domain, the dimension of the underlying real space and on k.

The second one is the Bramble–Hilbert lemma (Bramble and Hilbert, 1971). We still assume that  is a domain in Rn _{and that it has a Lipschitz-continuous}

boundary ∂. Moreover, from now on it should additionally satisfy an interior

cone condition which means that there exists a vector ξ ( x ) ∈ Rn _{of unit length}

for each x ∈  such that for a ﬁxed positive ˆr and ϑ > 0,

 ⊃ { x + λη|η ∈ Rn_, __η_{ =} _{r , η}_ˆ _· _ξ₍_x₎ _≥ _cos_ϑ, ₀ _≤ _λ _≤ ₁_}_.

Note that the radius and angle are fixed but arbitrary otherwise, i.e. they may depend on the domain. In fact, Bezhaev and Vasilenko (2001) state that the cone condition is superfluous if we require a Lipschitz-continuous boundary of the domain. We recall the definition of H k from the end of the fourth chapter.

Bramble–Hilbert lemma. Let  ⊂ Rn _{be a domain that satisﬁes an interior}

cone condition and is contained in a ball which has diameter ρ. Let F be a linear functional on the Sobolev space H k () such that F (q) = 0 for all q ∈ Pk −1

n .

Suppose that the inequality

|F (u)| ≤ C ₁





j =0

ρ j −n/2|u|_j,



holds for u with ﬁnite semi-norm |u|_j, , j ≤ k, with a positive constant C 1 that

is independent of u and ρ. Then it is true that

|F (u)| ≤ C 2



ρk −n/2|u|k ,



where C ₂ does not depend on u or ρ.

For our application of the Bramble–Hilbert lemma we let k > n₂ be a positive integer and the operator F map a continuous function f to the value of the interpolation error by polynomial interpolation on ˆ with polynomials of total degree less than k . Thus, it is zero if f is already such a polynomial, as a result of ˆ being unisolvent. Therefore the ﬁrst assumption of the Bramble– Hilbert lemma is satisﬁed. The second assumption is a consequence of the Sobolev inequality, where it is also important that f vanishes on ˆ, since the

5.2 Convergence analysis 111

interpolating polynomial for f is zero if f ∈ X ˆ , as is the case for our setting. Therefore, the Bramble–Hilbert lemma implies

| f ( x )| ≤ C | f |_k , ≤ C  f ∗,

as required, the constant C being independent of f . To this end, we also recall that the polynomial interpolant of degreek − 1 itself vanishes when differenti- atedk times, which is why the polynomial parts of the interpolation error expression disappear in both the middle and the right-hand side of the above inequality. Now that we know about the existence of a reproducing kernel in our particular example (as a special case of the general radial basis function setting), we wish to identify it explicitly in the sequel. Once more, we begin here with the general case and, becoming progressively more speciﬁc, restrict to our examples for the purpose of illustrating the general case later on.

The reproducing kernel has an especially simple form in our setting. In order to derive this form, we may let, because of the condition of unisolvency of ˆ, the functions pξ , ξ ∈ ˆ , be polynomial Lagrange functions, i.e. such that pξ

span K and satisfy

pξ (ζ ) = δξ ζ , ξ , ζ ∈ .ˆ

Since we are looking for a reproducing kernel on a subspace, our reproducing kernel k has a particular, K -dependent form. Speciﬁcally, for our application, we can express the property of the reproducing kernel by the identity

(5.6) f ( x ) =



f , φ(· − x ) −



ξ ∈ˆ

p_ξ( x )φ(· − ξ )



∗

, x ∈ Rn_, _f _∈_X .ˆ

We have therefore the reproducing kernel of the form

k( y, x ) = φ( y − x ) −



ξ ∈ˆ

p_ξ( x )φ( y − ξ )

and φ is a function Rn _→ R _{which is, notably, not necessarily itself in} _X_{. This}

is why φ does not appear itself on the right-hand side of the expression above but in the shape of a certain linear combination involving the p_ξ. This particular shape also includes the property that it vanishes on ˆ. We will come to this in a moment. In the case that the function f does not vanish on ˆ, then we subtract from it the ﬁnite sum of multivariate polynomials



ξ ∈ˆ

f (ξ ) p_ξ,

so that the difference does, and henceforth use instead the difference as follows:

(5.7) f −



ξ ∈ˆ

112 5. Radial basis functions on scattered data

The same principle was applied to the shift of φ above to guarantee that the kernel kvanishes on ˆ. If the null-space K of the semi-inner product and of the semi-norm is trivial, no such operation is required – the sums in the preceding two displays remain empty for K = {0}. The fact that the reproducing kernel

k( y, x ) is a function of the difference of two arguments (that is, k( y, x ) =

k( y− x )) is a simple consequence of the shift-invariance of the space X (see the interesting paper by Schaback, 1997, on these issues), because any invariance properties such as shift- or rotational invariance that hold for the space X are immediately carried over to properties of the reproducing kernel k through Deﬁnition 5.2. The proofs of these facts are easy and we omit them because we do not need them any further in this text.

Most importantly, however, we require that the right-hand side expression of the inner product in (5.6) above is in X . We make the hypothesis that φ is such that, whenever we form a linear combination of shifts of φ, whose coefﬁcients are such that they give zero when summed against any element of K , then that linear combination must be in X :



ξ ∈

λ_ξφ(· − ξ ) ∈ X if



ξ ∈

λ_ξq(ξ ) = 0∀_q∈K.

We will show later that for the X which we have given as an example above, this hypothesis is fulﬁlled.

For the semi-inner product in the above display (5.6), the sum over the coefﬁcients of the linear combination above against any p ∈ K is indeed zero:

p( x ) −



ξ ∈ˆ

pξ ( x ) p(ξ ) = p( x ) − p( x ) = 0,

because of the Lagrange conditions on the basis elements pξ , ξ ∈ . Therefore,

assuming our above hypothesis on φ being in place for the rest of this section now, the right-hand side in the inner product in the display (5.6) is in the required space X . Moreover, since f ( x ) vanishes for x = ζ ∈ ˆ on the left-hand side in the above identity (5.6), so must the expression

φ(· − ζ ) −



ξ ∈ˆ

pξ (ζ )φ(· − ξ )

on the right-hand side, which it does because pξ (ζ ) = δξ ζ , and therefore we

have veriﬁed that it is, in particular, in ˆ X .

We may circumvent the above hypothesis on φ which may appear strange at this point, because, as we shall see, it is fulfilled for all our radial basis functions (4.4). More specifically, given a fixed basis function φ, we shall construct a (semi-)Hilbert space X whose reproducing kernel has the required form (5.6).

5.2 Convergence analysis 113

This can be done as follows. In the case of all the radial basis functionsφ(·) we use in this chapter, the constructed X is the same as the above X we started with as we shall see shortly in this section. We follow especially the analysis of Schaback (1999).

To begin with, we deﬁne now a seemingly new semi-inner product for any two functions f :  → R _and _g_: _ _→ R _{that are ﬁnite linear combinations of}

shifts of a given general, continuous function φ: Rn _→ R_,

f =



ξ ∈1 λξ φ(· − ξ ) and g =



ξ ∈2 µξ φ(· − ξ ),

where₁ and₂ are arbitrary ﬁnite subsets of ; those sets need not (but usually do) agree. The functionφ is required to be continuous and conditionally positive deﬁnite of order k , so that we let K = Pk −1

n . Therefore we have an additional

condition on the coefficients in order to define the semi-inner product which follows, that is we require that for the given finite-dimensional linear space K , the coefficient sequences must satisfy the conditions



_ξ_∈_

1 λξ p(ξ ) = 0 and



ξ ∈2 µξ p(ξ ) = 0 for all p ∈ K . Moreover, we let LK () denote the linear

space of all functionals of the form

λ: h →



ξ ∈1

λ_ξh(ξ ),

with the aforementioned property of the coefﬁcients and with any ﬁnite set ₁. Its Hilbert space completion will beL_K₍__{). So}_λ _{is a linear functional on}_C₍__).

Now deﬁne the semi-inner product for the above two functions, (5.8) ( f , g)∗ := (λ, µ)∗ :=



ξ ∈1



ζ ∈2 λξ µζ φ(ξ − ζ ),

where we identify functions f and g with linear functionals λ and µ in an obvious way deﬁned through their coefﬁcients. So functionalsλ and functions

f are related via λ → f = λ x φ( x − ·), where the superscript x refers to the variable with respect to which we evaluate the functional. Note that we can always come back from one such functional to a function in x by applying it in this way to φ( x − ·).

As a consequence of the conditional positive deﬁniteness of φ, this semi- inner product is positive semi-deﬁnite as required. Further, let ˆ still contain a

K -unisolvent set and deﬁne the modiﬁed point evaluation functional δ₍_x₎ by

δ₍_x₎ f = f ( x ) −



ξ ∈ˆ

114 5. Radial basis functions on scattered data

Here, pξ are Lagrange polynomials. Then the modiﬁed point evaluation func-

tional is in LK () for every x ∈ , which the ordinary Dirac function δ(· − x ),

i.e. the usual function evaluation at a point x , is not , as it does not annihilate the polynomials in the space K unless K = (0). Now letX be the completion of the range of

( LK (), δ( x ))∗, x ∈ ,

that is in short, but very useful, notation the space generated by all functions from L_K() with the semi-inner product taken with δ₍_x₎, always modulo K . This is a Hilbert space of functions deﬁned for every x ∈ .

In particular, for all µ and λ from L_K₍__{), we get} _µ₍₍_{λ, δ}₍_x₎₎_∗₎ = (λ, µ)_∗, due to our definition of the inner product in (5.8). Here it is also relevant that the polynomials which appear in the definition of the modified point evaluation functional are annihilated by the semi-inner product

(δ₍_x₎, λ)∗ = (δ₍_x₎, f )∗ = δ₍_x₎ f = f ( x ) −



ξ ∈_ˆ

p_ξ( x ) f (ξ ),

to which then µ has to be applied in the usual way. Thereby, a semi-inner product and a semi-norm are also deﬁned on the aforementioned range X _{. All}

functions in X _{vanish on our} _K_{-unisolvent point set ˆ}__{. To form} _X_{, take the}

direct sum of K and X _{. That is the so-called ‘native space’. Further, let} _f ∈ X . Thus there exists an f -dependent functional λ such that

f ( x ) = (λ, δ₍_x₎)_∗ = ( f ,(δ₍_·₎, δ₍_x₎)_∗)_∗,

the second equality in this display being a consequence of the deﬁnition of ( f , g)∗ above. That display provides the reproducing kernel property. Indeed, it

is exactly the same identity as before, because, according to our initial deﬁnition of the bilinear (·,·)_∗ as a semi-inner product on L_K(),

(δ₍·₎, δ₍_x₎)∗ = δ₍t _·₎δ z₍_x₎φ(t − z),

where the superscript ﬁxes the argument with respect to which the linear functional is applied. The above is the same as

δ₍t _·₎



φ(t − x ) −



ξ ∈_ p_ξ( x )φ(t − ξ )



= φ(· − x ) −



ξ ∈_ pξ ( x )φ(· − ξ ) −



ζ ∈_ p_ζ(· )φ(ζ − x ) +



ζ ∈_ p_ζ(·)



ξ ∈_ p_ξ( x )φ(ξ − ζ ).

5.2 Convergence analysis 115

When inserted into the semi-inner product, the last two terms from the display disappear, since pζ (· ) ∈ K and any such arguments in the kernel are annihi-

lated. Therefore the two expressions for the reproducing kernel are the same.

In document Buhmann M D - Radial Basis Functions, Theory and Implementations (CUP 2004)(271s) (Page 121-128)