5.2 Tensor Structures for Wavelet Coordinates
5.2.1 Low-Rank Tensor Formats
µ,ν∈∇d (5.8)
defines an elliptic operator on `2(∇d). We additionally assume the Ψν to be L2-orthonormal, and hence the corresponding eigenvalue problem reads
X
µ
sνhΨν, HΨµisµ uµ= λs2νuν, ν ∈ ∇d, (5.9)
where u = (uν)ν∈∇d = (s−1ν hu, Ψνi)ν∈∇d for u∈ H1(Rd), or briefly, Hu = λs2u. Note that by the Riesz basis property,kukH1(Rd) ∼ kuk`2(∇d).
5.2.1 Low-Rank Tensor Formats
Let us first consider low-rank approximation in the case of two-dimensional problems. Let u ∈
`2(Λ1× Λ2) with finite Λ1, Λ2 ⊂ ∇, then singular value decomposition yields a representation in the form
u=
r
X
k=1
(U(1)k ⊗ U(2)k ) σk, (5.10)
where σ1 ≥ · · · ≥ σr > 0 and, for i ∈ {1, 2} and k, l ∈ {1, . . . , r}, U(i)k = (Uk,ν(i))ν∈Λi with supp U(i)k ∈ Λi and hU(i)k , U(i)l i = δkl.
Here r is the rank of the matrix (uν1,ν2)ν1∈Λ1,ν2∈Λ2, and the decomposition (5.10) also provides a means of computing the best approximation by an element of `2(Λ1× Λ2) with rank ˜r < r: by the Eckart-Young theorem [47], the error
u−
˜ r
X
k=1
(U(1)k ⊗ U(2)k )σk =
Xr
k=˜r+1
|σk|212
(5.11)
is minimal among all rank-˜r approximations of u.
This can be extended to u∈ `2(∇2) without the restriction of finite support, in which case the singular value decomposition is replaced by the Hilbert-Schmidt decomposition of operators. For such a general u, the infinite matrix (uν1,ν2)ν1∈∇,ν2∈∇ defines a Hilbert-Schmidt operator
Tu: `2(∇) → `2(∇) , c 7→ X
ν∈∇
uν,ν˜ cν
˜ ν∈∇, and the spectral theorem yields a decomposition
u=
∞
X
k=1
(U(1)k ⊗ U(2)k ) σk, (5.12)
with a nonnegative nonincreasing sequence (σk)k∈N∈ `2(N), and orthonormal bases{U(i)k }k∈N of
`2(∇) for i ∈ {1, 2}. The low-rank approximation property (5.11) carries over to this case, where in general r =∞.
At first glance, a natural extension of (5.10) to the higher-dimensional case would be a repre-sentation of u∈ `2(∇d) of the form
u=
r
X
k=1
(U(1)k ⊗ · · · ⊗ U(d)k ) ak (5.13)
with kU(i)k k = 1 and ak ∈ R. This is typically referred to as canonical format, canonical polyadic
5.2 Tensor Structures for Wavelet Coordinates
decomposition, or parallel factors. The smallest r∈ N0∪{∞} for which such a representation exists is referred to as the canonical rank of u. For our purposes, the major problem with this type of representation is the lack of a sufficiently reliable recompression procedure. In fact, the problem of approximating a given u by an expansion (5.13) of specified rank is in general ill-posed [39], which additionally necessitates a suitable regularization. For such regularized problems, however, one still needs to rely on minimization procedures that generally cannot be guaranteed to converge to the global minimum.
In this regard, a representation in the form
u= has substantially more favorable properties, and in this chapter we will be dealing with this format, and remark on generalizations with similar features. Here the order-d tensor a is referred to as core tensor, the matrix U(i) with column vectors U(i)k ∈ `2(∇), k = 1, . . . , ri, as the i-th mode frame.
This is the so-called Tucker format [144, 145] or subspace representation. Note that for ˆd∈ N, we can also represent u∈ `2(∇ddˆ ) in the form (5.14) with U(i)k ∈ `2(∇dˆ). For the sake of simplicity, we consider the case ˆd = 1 in what follows, but one can proceed completely analogously for general ˆd.
Clearly, any compactly supported u ∈ `2(∇d) can be represented in the form (5.14) for some r ∈ Nd0. For general u ∈ `2(∇d), the sum in (5.14) may be infinite. We correspondingly define rank(u)∈ (N0∪ {∞})d by
rank(u)i := dim span{U(i)k : k∈ N} , i = 1, . . . , d . (5.15) This vector is referred to as the multilinear rank of u. Note that in a representation of the form (5.14), one can always orthogonalize the columns of U(i) to obtainhU(i)k , U(i)l i = δkl for all i. We shall refer to U(i) with the latter property as orthonormal mode frames.
Remark 5.1. As we have seen in Section 4.3, when approximating the wavelet coefficients of hydrogenic ground states in the format (5.14), one can achieve exponential decrease of the H1 -approximation error with respect to the multilinear ranks.
In the case of helium or hookium, fast convergence of the representation with respect to the ranks can only be expected when – assuming single-electron coordinates x, y∈ R3 – the coordinate pairs(xi, yi) for i = 1, 2, 3 are not separated. We have shown for hookium in Section 4.3 that the corresponding wavelet coefficients u ∈ `2((∇2)3) can be represented efficiently in the form (5.14) withd = 3 and U(i)k ∈ `2(∇2), where each U(i) corresponds to a coordinate pair (xi, yi).
To simplify notation for the sums in (5.14), for r∈ Nd0 we define
Kd(r) := for the corresponding vectors with entry i deleted. We shall also need the auxiliary quantities
a(i)pq :=
5 Adaptive Wavelet Schemes for Tensor Representations
There exists an analogue of the singular value decomposition of matrices, the higher-order sin-gular value decomposition [107], for the Tucker tensor format (5.14). In the following theorem, we summarize its properties in the more general case of sequence spaces, where the the singular value decomposition is replaced by the spectral theorem for compact operators.
Theorem 5.2. For any u∈ `2(∇d) there exist orthonormal mode frames{U(i)k }k∈N, i = 1, . . . , d,
Proof. The following is essentially an adaptation of the arguments for the finite-dimensional case given in [107] to the infinite-dimensional sequence space `2(∇d).
Let u = (uν)ν∈∇d ∈ `2(∇d). For each i∈ {1, . . . , d} we consider the mode-i matricization of u,
By the spectral theorem, for each i there exist a nonnegative real sequence (σn(i))n∈N, where σ(i)n are the eigenvalues of (T(i))∗T(i)1/2
The representation (5.19) converges in the Hilbert-Schmidt norm, and as a consequence we have u= X
5.2 Tensor Structures for Wavelet Coordinates
which by orthonormality of{U(i)n }n∈N yields
= σ(i)p σ(i)q X orthonormality of{Vn(i)}n∈N. Property (iii) follows with the observations
The additional properties of the decomposition for finitely supported u are clear, since in this case the spectral decomposition reduces to a finite-dimensional singular value decomposition.
Remark 5.3. The result of Theorem 5.2 holds analogously for u∈ `2(∇d ˆd) and U(i)k ∈ `2(∇dˆ) with d, ˆd∈ N. As noted in Remark 5.1, this is relevant for two-electron systems such as helium, where we use a decomposition withd = 3 and ˆd = 2.
Note that by analogy to (5.10), the σk(i) are also referred to as mode-i singular values of u.
Property (iii) in Theorem 5.2 leads to a simple procedure for truncation to lower multilinear ranks with an explicit error estimate in terms of the mode-i singular values. In this manner, one does not necessarily obtain the best approximation for prescribed rank, but the approximation is quasi-optimal in the sense that the error is at most by a factor √
d larger than the error of best approximation with the same multilinear rank.
In principle, a representation as in Theorem 5.2 can be obtained for any u ∈ `2(∇d) by a combination of standard linear algebra procedures. For our purposes, the relevant task is to obtain such a representation for finitely supported u given in the form
u= X
without further assumptions on ˜U(i) and ˜a, where in particular the columns of the ˜U(i) may be linearly dependent. From the arguments in [107], one can extract the well-known procedure described in Algorithm 5.1 that yields a representation as in Theorem 5.2 for finitely supported u.
Remark 5.4. Assuming supp u⊆ Λ1× · · · × Λd, hosvd as in Algorithm 5.1 can be performed in O(|˜r|∞Q
i˜ri+P
ir˜2i#Λi) operations, using only standard linear algebra operations.
The procedure hosvd can be interpreted as the selection of basis functions, given by the columns of U(i), which are adapted to a given u. However, the tensor a containing the coefficients for
5 Adaptive Wavelet Schemes for Tensor Representations
Algorithm 5.1 [{U(i)}, a] = hosvd({ ˜U(i)}, ˜a) input u in representation (5.21)
output mode frames U(i) and core tensor a as in Theorem 5.2
1: for all i∈ {1, . . . , d}
2: perform QR–factorization ˜U(i)= Q(i)R(i)
. Q(i) ∈ RΛ(i)×{1,...,ri}, R(i) ∈ R{1,...,ri}×{1,...,˜ri}, where r≤ ˜r.
. columns of Q(i) are orthonormal, R(i) is right upper triangular
3: end for
4: ˆak←P
l∈Kd(˜r)R(1)k
1,l1· · · Rk(d)d,ld˜al for k∈ Kd(r)
5: for all i∈ {1, . . . , d}
6: build matricization T(i) ∈ R{1,...,ri}×Ii with entries T(i)
ki,ˇki := ˆak1,...,kd
. Ii ={1, . . . , r1} × · · · × {1, . . . , ri−1} × {1, . . . , ri+1} × . . . × {1, . . . , rd}
7: perform singular value decomposition T(i) = V(i)Σ(i)(W(i))T
8: U(i)← Q(i)V(i)
9: end for
10: ak←P
l∈Kd(r)Vl(1)
1,k1· · · Vl(d)d,kdˆal for k ∈ Kd(r)
these basis functions has Q
iri components, which even for moderate multilinear ranks becomes too expensive for larger d.
A possible alternative for higher dimensions is the Hierarchical Tucker or H–Tucker format [75].
It has similar features as the Tucker format, and there exists a scheme that parallels hosvd in its properties [64], but has storage and work complexity linear in d when applied to input tensors given in the appropriateH–Tucker representation. This comes at the price of additional structural constraints on the tensors in the form of a modified notion of tensor ranks. For the model problems that we are considering in this chapter, theH–Tucker format does not give an advantage over the Tucker format, and hence what follows will be formulated for the latter. The schemes developed here can, however, be applied to the H–Tucker format without major modifications; we will come back to this point in Section 5.3.3.