arXiv:1512.09161v4 [math.DS] 11 Jan 2016
REAL LINE
MRINAL KANTI ROYCHOWDHURY
Abstract. Let P be a Borel probability measure on R generated by an infinite system of self-similar mappings associated with a probability vector. For such a probability measure P , in this paper, the optimal sets of n-means and the nth quantization error are calculated for every natural number n. Moreover, the connection between the rate of convergence of the logarithms of the quantization errors for n going to infinity and the Hausdorff dimension of the limit set of the infinite iterated function system is indicated.
1. Introduction
The history of the theory and practice of quantization dates back to 1948, although similar ideas had appeared in the literature in 1897 (see [S]). It is used in many applications such as signal processing and telecommunications, data compression, pattern recognitions and cluster analysis (for details see [GG, GN]). It is also closely connected with centroidal Voronoi tessel- lations. Let Rd denote the d-dimensional Euclidean space, k · k denote the Euclidean norm on Rd for any d ≥ 1, and P be a Borel probability measure on Rd. Given a finite subset α ⊂ Rd, the Voronoi region generated by a ∈ α is defined by
M(a|α) = {x ∈ Rd : kx − ak ≤ kx − bk for all b ∈ α}
i.e., the Voronoi region generated by a ∈ α is the set of all points in Rd which are closer to a ∈ α than to all other points in α, and the set {M(a|α) : a ∈ α} is called the Voronoi diagram or Voronoi tessellation of Rd with respect to α. A Borel measurable partition {Aa : a ∈ α} of Rd is called a Voronoi partition of Rd with respect to α (and P ) if Aa ⊂ M(a|α) (P -almost surely) for every a ∈ α. The Voronoi tessellation {M(a|α) : a ∈ α} gener- ated by the set of points α is called the centroidal Voronoi tessellation (CVT) if the points a∈ α are also the centroids of their corresponding Voronoi regions, i.e., for each a ∈ α,
a= 1
P(M(a)) Z
M(a)
xdP = R
M(a)xdP R
M(a)dP .
For details about CVT and its application one can see [DFG]. If α is a finite set, the error R mina∈αkx − ak2dP(x) is often referred to as the variance, cost, or distortion error for α with respect to the probability measure P , and is denoted by V (α) := V (P ; α). On the other hand, inf{V (P ; α) : α ⊂ Rd, card(α) ≤ n} is called the nth quantization error for the probability measure P , and is denoted by Vn:= Vn(P ). If R kxk2dP(x) < ∞ then there is some set α for which the infimum is achieved (see [GKL, GL1, GL2]). Such a set α for which the infimum occurs and contains no more than n points is called an optimal set of n-means. It is known that for a continuous probability measure an optimal set of n-means always has exactly n-elements (see [GL2]). For a Borel probability measure P on Rd, an optimal set of n-means forms a CVT with n-means (n-generators) of Rd; however, the converse is not true in general (see
2010 Mathematics Subject Classification. 60Exx, 28A80, 94A34.
Key words and phrases. Probability measure, infinite similitudes, optimal quantizers, quantization error, quantization dimension, Hausdorff dimension.
The research of the author was supported by U.S. National Security Agency (NSA) Grant H98230-14-1-0320.
1
[DFG, R2]). A CVT with n-means is called an optimal CVT with n-means if the generators of the CVT form an optimal set of n-means with respect to the probability distribution P . Let us now state the following proposition (see [GG, GL2]).
Proposition 1.1. Let α be an optimal set of n-means, a ∈ α, and M(a) be the Voronoi region generated by a ∈ α. Then for every a ∈ α, (i) P (M(a)) > 0, (ii) P (∂M(a)) = 0, (iii) a = E(X : X ∈ M(a)), and (iv) P -almost surely the set {M(a) : a ∈ α} forms a Voronoi partition of Rd.
For κ > 0, we define the κ-dimensional lower and upper quantization coefficients for P by, Qκ(P ) := lim inf
n n2κVn(P ) and Qκ(P ) := lim sup
n
n2κVn(P ).
If Qκ(P ) and Qκ(P ) coincide, it is called the quantization coefficient for P . The lower and the upper quantization dimensions of P are defined to be
D(P ) := lim inf
n→∞
2 log n
− log Vn(P ) and D(P ) := lim sup
n→∞
2 log n
− log Vn(P ).
If D(P ) and D(P ) coincide, we call the common value the quantization dimension of the prob- ability measure P . Quantization dimension measures the speed at which the specified measure of the error goes to zero as n tends to infinity. For details about quantization coefficients and quantization dimensions one is referred to [GL2, P1, P2].
A transformation f : X → X on a metric space (X, d) is called contractive or a contraction mapping if there is a constant 0 < c < 1 such that d(f (x), f (y)) ≤ cd(x, y) for all x, y ∈ X.
On the other hand, f is called a similitude or a similarity mapping if there exists a constant s >0 such that d(f (x), f (y)) = sd(x, y) for all x, y ∈ X. Here s is called the similarity ratio of the similarity mapping f . It is known that the classical Cantor set C is generated by the two contractive similarity mappings S1 and S2 given by S1(x) = 13xand S2(x) = 13x+23 for all x∈ R. Let P be a Borel probability measure on R such that P = 12P ◦ S1−1+12P ◦ S2−1, where P◦ Si−1 denotes the image measure of P with respect to Si for i = 1, 2 (see [H], Theorem 4.4(1) for a generalization of self-similar measure). Then, P has support the Cantor set C. For this probability measure Graf and Luschgy gave a closed formula to determine the optimal sets of n-means and the nth quantization error for n ≥ 2. They also showed that the quantization dimension of this probability measure equals the Hausdorff dimension of the Cantor set C, but the quantization coefficient for P does not exist (see [GL3]). Later for n ≥ 2, L. Roychowdhury gave an induction formula to determine the optimal sets of n-means and the nth quantization error for a probability distribution P on R, given by P = 12P ◦ S1−1 + 12P ◦ S2−1 which has support the Cantor set generated by S1 and S2, where S1(x) = 14x and S2(x) = 12x+ 12 for all x ∈ R (see [R1]). In [R2], the author investigated the optimal sets of n-means and the centroidal Voronoi tessellations with n generators, n ∈ N, for a Borel probability measure P = 12P ◦ S1−1+ 12P ◦ S2−1 on R supported by a dyadic Cantor set generated by a set of two contractive similarity mappings S1 and S2 such that S1(x) = rx and S2(x) = rx + (1 − r) for all x ∈ R with similarity ratios r where 0.4364590141 ≤ r ≤ 0.4512271429.
In this paper, we have considered a probability measure P on R which is generated by an infinite collection of similitudes {Sj}∞j=1 on R, where Sj(x) = 31jx+ 1 − 3j−11 for all x ∈ R and P is given by P = P∞
j=1 1
2jP ◦ Sj−1. For this probability measure we determine the optimal sets of n-means and the nth quantization error. Besides, we showed that like the classical Cantor distribution considered by Graf-Luschgy in [GL3], the quantization coefficient for the probability measure P considered in our paper does not exist, but the quantization dimension of P exists and equals the Hausdorff dimension of the limit set generated by the infinite similitudes.
2. Basic definitions, lemmas and proposition
Let N denote the set all natural numbers, i.e., N = {1, 2, · · · }. By a string or a word ω over the alphabet N, we mean a finite sequence ω := ω1ω2· · · ωkof symbols from the alphabet, where k ≥ 1, and k is called the length of the word ω. The length of a word ω is denoted by |ω|. A word of length zero is called the empty word, and is denoted by ∅. We denote the set of all words of length k by Nk. By N∗ we denote the set of all words over the alphabet N of some finite length k including the empty word ∅. For any two words ω := ω1ω2· · · ωk
and τ := τ1τ2· · · τℓ in N∗, by ωτ := ω1· · · ωkτ1· · · τℓ we mean the word obtained from the concatenation of the two words ω and τ . For n ≥ 1 and ω = ω1ω2· · · ωn ∈ N∗ we define ω− := ω1ω2· · · ωn−1, i.e., ω− is the word obtained from the word ω by deleting the last letter of ω. Note that ω− is the empty word if the length of ω is one. For ω ∈ N∗, by (ω, ∞) it is meant the set of all words ω−(ω|ω|+ j), obtained by concatenation of the word ω− with the word ω|ω| + j for j ∈ N, i.e.,
(ω, ∞) = {ω−(ω|ω|+ j) : j ∈ N}.
Let {Si}∞i=1 be an infinite collection of contractive similitudes on R such that Si(x) = 1
3ix+ 1 − 1 3i−1,
for each i ∈ N and x ∈ R. With the similitudes let us now associate a probability vector (p1, p2,· · · ) where pi = 21i for all i ∈ N. Then, there exists a unique Borel probability measure P on R such that
P =
∞
X
i=1
piP ◦ Si−1,
which has support lying in the closed interval [0, 1]. This paper deals with this this probability measure P . For ω = ω1ω2· · · ωn ∈ Nn, write
Sω := Sω1 ◦ · · · ◦ Sωn, Jω := Sω(J), sω := sω1· · · sωn, pω := pω1· · · pωn, where J := J∅ = [0, 1]. Then, for any ω ∈ N∗, we write
J(ω,∞) := ∞∪
j=1Jω−(ω|ω|+j) and p(ω,∞) := P (J(ω,∞)) =
∞
X
j=1
P(Jω−(ω|ω|+j)) =
∞
X
j=1
pω−(ω|ω|+j).
Note that for any ω ∈ N∗, p(ω,∞) =
∞
X
j=1
pω−(ω|ω|+j)= pω−
∞
X
j=1
1
2ω|ω|+j = pω−pω|ω|
∞
X
j=1
1
2j = pω−pω|ω| = pω. Let us now give the following lemmas.
Lemma 2.1. Let f : R → R be Borel measurable and k ∈ N. Then Z
f dP = X
ω∈Nk
pω
Z
f ◦ SωdP.
Proof. We know P = P∞
j=1pjP ◦ Sj−1, and so by induction P = P
ω∈NkpωP ◦ Sω−1, and thus
the lemma is yielded.
Lemma 2.2. Let X be a random variable with probability distribution P . Then, the expectation E(X) and the variance V := V (X) of the random variable X are given by
E(X) = 1
2 and V = 1 8.
Proof. Using Lemma 2.1, we have E(X) =
Z
xdP(x) =
∞
X
j=1
1 2j
Z
Sj(x)dP =
∞
X
j=1
1 2j
Z 1
3jx+ 1 − 1 3j−1
dP
=
∞
X
j=1
1
6jE(X) + 1 2j − 3
6j
= 1
5E(X) + 1 − 3 5, which implies E(X) = 12. Now,
E(X2) = Z
x2dP(x) =
∞
X
j=1
1 2j
Z 1
3jx+ 1 − 1 3j−1
2
dP
=
∞
X
j=1
1 2j
Z 1
9jx2+ 2
3j(1 − 1
3j−1)x + (1 − 1 3j−1)2
dP.
Since,
∞
X
j=1
1 18j = 1
17, and
∞
X
j=1
1 2j
Z 2
3j(1 − 1
3j−1)xdP = 2
∞
X
j=1
(1 6j − 3
18j)E(X) = 1 5 − 3
17 = 2 85,
and ∞
X
j=1
1
2j(1 − 1 3j−1)2 =
∞
X
j=1
1
2j(1 − 2
3j−1 + 1
9j−1) = 1 − 6 5+ 9
17 = 28 85, we have E(X2) = 171E(X2) + 852 +2885 which yields E(X2) = 38. Thus,
V(X) = E(X2) − (E(X))2 = 3 8 −1
4 = 1 8,
which is the lemma.
Lemma 2.3. For any k ≥ 1, we have
E(X|X ∈ Jk∪ Jk+1∪ · · · ) = 1 − 1 2
1 3k−1. Proof.
E(X|X ∈ Jk∪ Jk+1∪ · · · ) = 1 P∞
j=k 1 2j
X∞
j=k
1 2jSj(1
2)
= 2k−1
∞
X
j=k
1
2j(1 − 5 2
1 3j)
= 2k−1( 1 2k−1 − 1
2 1
6k−1) = 1 − 1 2
1 3k−1,
which is the lemma.
Now, the following notes are in order.
Note 2.4. For k ∈ N, we have Sk(12) = 31k
1
2 + 1 − 3k−11 = 1 − 5231k. Thus, by Lemma 2.3, for k ∈ N,
E(X|X ∈ Jk∪ Jk+1∪ · · · ) = Sk(1 2) + 1
3k = 1
2(Sk(1) + Sk+1(0)).
Following the standard theory of probability, for any x0 ∈ R, we have R (x − x0)2dP(x) = V(X) + (x0− E(X))2. Thus, one can deduce that the optimal set of one-mean is the expected value and the corresponding quantization error is the variance V of the random variable X.
For ω ∈ Nk, k ≥ 1, using Lemma 2.1, we have E(X : X ∈ Jω) = 1
P(Jω) Z
Jω
xdP(x) = Z
Jω
xdP ◦ Sω−1(x) = Z
Sω(x)dP (x) = E(Sω(X)).
Since Sj are similitudes, it is easy to see that E(Sj(X)) = Sj(E(X)) for j ∈ N, and so by induction, E(Sω(X)) = Sω(E(X)) for ω ∈ Nk, k ≥ 1.
Note 2.5. For words β, γ, · · · , δ in N∗, by a(β, γ, · · · , δ) we denote the conditional expectation of the random variable X given Jβ∪ Jγ∪ · · · ∪ Jδ, i.e.,
a(β, γ, · · · , δ) = E(X|X ∈ Jβ ∪ Jγ∪ · · · ∪ Jδ) = 1
P(Jβ ∪ · · · ∪ Jδ) Z
Jβ∪···∪Jδ
xdP.
Thus by Note 2.4, for ω ∈ N∗, we have
a(ω) = Sω(E(X)) = Sω(12), and
a(ω, ∞) = E(X|X ∈ Jω−(ω|ω|+1)∪ Jω−(ω|ω|+2)∪ · · · ) = Sω−(ω|ω|+1)(12) + sω−(ω|ω|+1). (1)
Moreover, for any ω ∈ N∗ and j ≥ 1, since pω−(ω|ω|+j) = pω−pω|ω|+j = pω−pω|ω|pj = pωpj, and similarly sω−(ω|ω|+j) = sωsj, for any x0 ∈ R, it is easy to see that
R
Jω(x − x0)2dP = pωR (x − x0)2dP ◦ Sω−1 = pω
s2ωV + (Sω(12) − x0)2 , and R
J(ω,∞)(x − x0)2dP =P∞ j=1pωj
s2ωjV + (Sω−(ω|ω|+j)(12) − x0)2 (2) .
The expressions (1) and (2) are useful to obtain the optimal sets and the corresponding quan- tization errors with respect to the probability distribution P .
The following lemma is easy to prove.
Lemma 2.6. Let P be the probability measure as defined before and let ω ∈ N∗. Then, Z
J(ω,∞)
(x − a(ω, ∞))2dP = Z
Jω
(x − a(ω))2dP.
Remark 2.7. By (1) and Lemma 2.6, we see that for any ω ∈ N∗, Z
J(ω,∞)
(x − a(ω, ∞))2dP = Z
Jω
(x − a(ω))2dP = pωs2ωV.
The following lemma is useful.
Lemma 2.8. For any two words ω, τ ∈ N∗, if pω = pτ then, Z
Jω
(x − a(ω))2dP = Z
Jτ
(x − a(τ ))2dP.
Proof. Let ω, τ ∈ N∗. Let ω = ω1ω2· · · ωk and τ = τ1τ2· · · τm for some k, m ∈ N. Then, pω = pτ implies ω1 + ω2+ · · · + ωk = τ1+ τ2+ · · · + τm, and so sω = sτ. Thus,
Z
Jω
(x − a(ω))2dP = pωs2ωV = pτs2τV = Z
Jτ
(x − a(τ ))2dP,
which is the lemma.
Definition 2.9. For n ∈ N with n ≥ 2 let ℓ(n) be the unique natural number with 2ℓ(n)≤ n <
2ℓ(n)+1. Write
α(ℓ(n)) := {a(ω) : ω ∈ N∗ and pω = 1
2ℓ(n)} ∪ {a(ω, ∞) : ω ∈ N∗ and pω = 1 2ℓ(n)}.
For I ⊂ α(ℓ(n)) with card(I) = n − 2ℓ(n), write
αn(I) : = (α(ℓ(n)) \ I) ∪ {a(ω1) : a(ω) ∈ I} ∪ {a(ω1, ∞) : a(ω) ∈ I}
∪ {a(ω−(ω|ω|+ 1)) : a(ω, ∞) ∈ I} ∪ {a(ω−(ω|ω|+ 1), ∞) : a(ω, ∞) ∈ I}.
Remark 2.10. In Definition 2.9, if n = 2ℓ(n), then I = ∅, and so, αn(I) = α(ℓ(n)).
Using Definition 2.9, we now give few examples.
Example 2.11. Let n = 3. Then, ℓ(n) = 1, α(1) = {a(1), a(1, ∞)} = {16,56}, card(I) = 1. If I = {a(1)}, then
α3(I) = {a(11), a(11, ∞), a(1, ∞)} = { 1 18, 5
18,5 6}.
If I = {a(1, ∞)}, then,
α3(I) = {a(1), a(2), a(2, ∞)} = {1 6,13
18,17 18}.
Example 2.12. Let n = 4. Then, ℓ(n) = 2, I = ∅, and so
α4(I) = α(2) = {a(11), a(11, ∞), a(2), a(2, ∞)} = { 1 18, 5
18,13 18,17
18}.
Example 2.13. Let n = 5. Then, ℓ(n) = 2, α(2) = {a(11), a(11, ∞), a(2), a(2, ∞)}, I ⊂ α(2) with card(I) = 1. If I = {a(11)}, then
α5(I) = {a(111), a(111, ∞), a(11, ∞), a(2), a(2, ∞)} = { 1 54, 5
54, 5 18,13
18,17 18}.
If I = {a(2)}, then
α5(I) = {a(11), a(11, ∞), a(21), a(21, ∞), a(2, ∞)} = { 1 18, 5
18,37 54,41
54,17 18}.
If I = {a(11, ∞)}, then
α5(I) = {a(11), a(12), a(12, ∞), a(2), a(2, ∞)} = { 1 18,13
54,17 54,13
18,17 18}.
If I = {a(2, ∞)}, then
α5(I) = {a(11), a(11, ∞), a(2), a(3), a(3, ∞)} = { 1 18, 5
18,13 18,49
54,53 54}.
Let us now prove the following proposition.
Proposition 2.14. Let αn(I) be the set as defined in Definition 2.9. Then Z
a∈αminn(I)kx − ak2dP = 1 18ℓ(n)
1 8
2ℓ(n)+1− n + 1
9(n − 2ℓ(n)) . Proof. Using the definition of αn(I), we have
Z
a∈αminn(I)kx − ak2dP
= X
a(ω)∈α(ℓ(n))\I
Z
Jω
(x − a(ω))2dP + X
a(ω,∞)∈α(ℓ(n))\I
Z
J(ω,∞)
(x − a(ω, ∞))2dP
+ X
a(ω)∈I
Z
Jω1
(x − a(ω1))2dP + Z
J(ω1,∞)
(x − a(ω1, ∞))2dP
+ X
a(ω,∞)∈I
Z
Jω−
(ω|ω|+1)
(x − a(ω−(ω|ω|+ 1)))2dP +
Z
J(ω−(ω|ω|+1),∞)
(x − a(ω−(ω|ω|+ 1), ∞))2dP .
Now, using Remark 2.7, we have X
a(ω)∈α(ℓ(n))\I
Z
Jω
(x − a(ω))2dP + X
a(ω,∞)∈α(ℓ(n))\I
Z
J(ω,∞)
(x − a(ω, ∞))2dP
= X
a(ω)∈α(ℓ(n))\I
pωs2ωV + X
a(ω,∞)∈α(ℓ(n))\I
pωs2ωV
= 1
18ℓ(n) 1
8 card(α(ℓ(n)) \ I) = 1 18ℓ(n)
1
8(2ℓ(n)+1− n).
Again, by Remark 2.7, we have X
a(ω)∈I
Z
Jω1
(x − a(ω1))2dP + Z
J(ω1,∞)
(x − a(ω1, ∞))2dP
= 2p1s21V X
a(ω)∈I
pωs2ω, and
X
a(ω,∞)∈I
Z
Jω−
(ω|ω|+1)
(x − a(ω−(ω|ω|+ 1)))2dP +
Z
J(ω−(ω|ω|+1),∞)
(x − a(ω−(ω|ω|+ 1), ∞))2dP
= 2p1s21V X
a(ω,∞)∈I
pωs2ω.
Combining all these, Z
a∈αminn(I)kx − ak2dP = 1 18ℓ(n)
1
8(2ℓ(n)+1− n) + 2p1s21V X
a(ω)∈I
pωs2ω+ X
a(ω,∞)∈I
pωs2ω
= 1
18ℓ(n) 1
8(2ℓ(n)+1− n) +1 9
1 8
1
18ℓ(n)card(I) = 1 18ℓ(n)
1 8
2ℓ(n)+1− n + 1
9(n − 2ℓ(n)) ,
which is the lemma.
Corollary 2.15. Let Vn be the nth quantization error for every n ≥ 1. Then, Vn≤ 1
18ℓ(n) 1 8
2ℓ(n)+1− n + 1
9(n − 2ℓ(n)) .
In the next sections first we determine the optimal sets of two- and three-means, and then, we will show that the set αn(I) is an optimal set of n-means for P and Vn is the corresponding quantization error.
3. Optimal sets of 2- and 3-means
In this section we determine the optimal sets of n-means for n = 2 and n = 3. The results and the proofs for these two cases are the key to understand the general case.
Lemma 3.1. Let α = {a1, a2} be an optimal set of two-means, a1 < a2. Then, a1 = a(1) = 16, a2 = a(1, ∞) = 56 and the corresponding quantization error is V2 = 721 = 0.0138889.
Proof. by Corollary 2.15, V2 ≤ 721. Let α = {a1, a2} be an optimal set of two-means, a1 < a2. Since a1 and a2 are the centroids of their own Voronoi regions, we have 0 ≤ a1 < a2 ≤ 1. If possible, let a1 ≥ 13. Then, using (2), we have
1
72 ≥ V2 ≥ Z
J1
(x − a1)2dP = 1 2
1
9V + (S1(1
2) − a1)2
> 1 72,
which is a contradiction, and so a1 ≤ 13. If a2 < 23, then using (2), we have 1
72 ≥ V2 ≥ Z
J(1,∞)
(x − a2)2dP >
Z
J2∪J3∪J4
(x − a2)2dP = 1 22
1
92V + (S2(1
2) − a2)2 + 1
23
1
272V + (S3(1
2) − a2)2 + 1
24
1
812V + (S4(1
2) − a2)2
> 1 22
1
92V + (S2(1 2) −2
3)2 + 1
23
1
272V + (S3(1 2) − 2
3)2 + 1
24
1
812V + (S4(1 2) −2
3)2 , which implies 721 ≥ V2 >0.0141425 > 721, a contradiction. Thus, a2 ≥ 23. Since 0 ≤ a1 ≤ 13 <
2
3 ≤ a2 ≤ 1, we have 13 ≤ a1+a2 2 ≤ 23, and so J1 ⊆ M(a1|α) and J(1,∞) ⊆ M(a2|α). Thus, Z
mina∈α kx − ak2dP = Z
J1
(x − a1)2dP + Z
J(1,∞)
(x − a2)2dP,
which is minimum when a1 = a(1) = S1(12) = 16 and a2 = a(1, ∞) = S2(12) + 312 = 56, and the corresponding quantization error is V2 = 721. Hence the lemma.
Using the technique of Lemma 3.1, the following corollary can be proved.
Corollary 3.2. For any ω ∈ N∗ with respect to the probability distribution P , the set {a(ω1), a(ω1, ∞)} forms a unique optimal CVT of Jω, and the set {a(ω−(ω|ω|+ 1)), a(ω−(ω|ω|+ 1), ∞)} forms a unique optimal CVT of J(ω,∞).
We now give the following lemma.
Lemma 3.3. Let α be an optimal set of three-means. Then, α = {a(11), a(11, ∞), a(1, ∞)} = {181,185 ,56}, or α = {a(1), a(2), a(2, ∞)} = {16,1318,1718} with quantization error V3 = 6485 = 0.00771605.
Proof. Let α be an optimal set of three-means with α = {a1, a2, a3}, where a1 < a2 < a3. Proceeding in the same way as Lemma 3.1, it can be shown that 0 ≤ a1 ≤ 13 and 23 ≤ a3 ≤ 1.
Let us now show that a2 6∈ (13,23). Consider the following cases:
Case 1: If possible let a2 ∈ [12,23).
Then, 12(a1+ a2) < 13, otherwise, quantization error could be strictly reduced by moving a2
to 23. Thus, we have
a1 < 2
3− a2 ≤ 2 3 − 1
2 = 1 6 < 2
9 = S12(0).
Since a1 is the centroid of its own Voronoi region, we have a1 = 1
P[0,16] Z
[0,16]
xdP = 1
P([0,19]) Z
[0,19]
xdP = S11(1 2) = 1
18,
and so a2 < 23 − a1 = 23 − 181 = 1118. Again, 12(a2 + a3) > 23, otherwise, quantization error could be strictly reduced by moving a2 to 13, and thus, a3 > 43 − a2 > 43 − 1118 = 1318. Now, for x∈ J12= [29,277 ], we get
x− a1 ≥ x − (2 9− 1
18) = x − 1
6, and a2− x ≥ 1 2− 7
27 − x = 13 54− x, which implies min
a∈α(x − a)2 ≥ (x −16)2. Similarly, for x ∈ J21= [23,1927], x− a2 ≥ x − (2
3 −11
18) = x − 1
18, and a3− x ≥ 13 18− 19
27− x = 1 54− x,
which yields min
a∈α(x − a)2 ≥ (x −541 )2. Thus, using (2), V3 =
Z
mina∈αkx − ak2dP ≥ Z
J11
(x − 1
18)2dP + Z
J12
(x − 1
6)2dP + Z
J21
(x − 1 54)2dP
= 661
11664 = 0.0566701.
But, by Corollary 2.15, V3 ≤ 181 18(4 − 3 + 19(3 − 2)) = 6485 = 0.00771605. Thus, a contradiction arises in this case, and so a2 6∈ [12,23).
Case 2: If possible let a2 ∈ (13,12].
This leads to a contradiction in a similar way as Case 1.
Thus, by Case 1 and Case 2, we have a2 6∈ (13,23), i.e., either a2 ∈ [0,13] or a2 ∈ [23,1]. Let us first assume a2 ∈ [0,13] = J1. Let α1 = {a1, a2} and α2 = {a3}. Since α = α1 ∪ α2, by Lemma 2.1, we deduce
V3 = Z
J1
a∈αmin1
(x − a)2dP + Z
J(1,∞)
(x − a3)2dP = 1 18
Z
a∈3αmin1
(x − a)2dP + Z
J(1,∞)
(x − a3)2dP.
We now show that S1−1(α1) is an optimal set of two-means. If S1−1(α1) := 3α1is not an optimal set of two-means, then we could find a set β ⊂ R with card(β) = 2 such thatR min
b∈β(x−b)2dP <
R mina∈α1(x − 3a)2dP. But, then (13β) ∪ α2 is a set of cardinality three withR mina∈1
3β∪α2(x − a)2dP < R mina∈α(x − a)2dP, which contradicts the optimality of α. Thus, S1−1(α1) is an optimal set of two-means, i.e., S1−1(α1) = {a(1), a(1, ∞)} which gives α1 = {a(11), a(11, ∞)}.
Again, V3 being the optimal error, we must have a3 = a(1, ∞). Thus, under the assumption a2 ∈ [0, 13] = J1, we have α = {a(11), a(11, ∞), a(1, ∞)}, and then using (2), we have V3 = 6485 . Let us now assume 23 ≤ a2. Let β = {a2, a3}. Then,
V3 = Z
J1
(x − 1
18)2dP + Z
J(1,∞)
minb∈β(x − b)2dP = 1 144 +
Z
J(1,∞)
minb∈β(x − b)2dP.
We show that a2 < 79. If a2 > 79, then a2− S2(12) > 79 −1318 = 181, which implies Z
J(1,∞)
minb∈β(x − b)2dP >
Z
J2
(x − a2)2dP = 1 4
1
81V + (S2(1
2) − a2)2
> 1 864,
which is not true since V3 = 6485 . Thus, 23 ≤ a2 ≤ 79. Similarly, one can show that 89 ≤ a3 ≤ 1.
Thus, under the assumption 23 ≤ a2, we have Z
minb∈β kx − bk2dP = Z
J2
(x − a2)2dP + Z
J(2,∞)
(x − a3)2dP,
which is minimum when a2 = a(2) and a3 = a(2, ∞). Hence, in this case we obtain α = {a(1), a(2), a(2, ∞)}. Thus, the proof of the lemma is complete.
4. Quantization error and the optimal sets of n-means in the general case In this section, we determine the optimal sets of n-means and the nth quantization error for the probability distribution P for all n ≥ 2.
Let us first state and prove the following proposition. The technique of the proof is adapted from [R1].
Proposition 4.1. For any n ≥ 2, let αn be an optimal set of n-means with respect to the probability distribution P . Write
W(αn) := {ω ∈ N∗ : a(ω) or a(ω, ∞) ∈ αn}, and
W˜(αn) := {τ ∈ W (αn) : pτs2τ ≥ pωs2ω for all ω ∈ W (αn)}.
Then, for any τ ∈ ˜W(αn) the set αn+1 := αn+1(τ ), where αn+1(τ ) = (αn\ {a(τ )}) ∪ {a(τ 1), a(τ 1, ∞)} if a(τ ) ∈ αn,
(αn\ {a(τ, ∞)}) ∪ {a(τ−(τ|τ |+ 1)), a(τ−(τ|τ |+ 1), ∞)} if a(τ, ∞) ∈ αn, is an optimal set of (n + 1)-means.
Proof. Let us first claim that for any ω, τ ∈ N∗, pτs2τ ≥ pωs2ω if and only if Z
Jτ1
(x − a(τ 1))2dP + Z
J(τ 1,∞)
(x − a(τ 1, ∞))2dP + Z
Jω
(x − a(ω))2dP
≤ Z
Jτ
(x − a(τ ))2dP + Z
Jω1
(x − a(ω1))2dP + Z
J(ω1,∞)
(x − a(ω1, ∞))2dP.
Using Remark 2.7, we simplify the above inequality and obtain LHS = 2pτ1s2τ1V + pωs2ωV = 1
9pτs2τV + pωs2ωV, RHS = pτs2τV + pω1s2ω1V = pτs2τV +1
9pωs2ωV.
Thus, LHS ≤ RHS if and only if pτs2τ ≥ pωs2ω, which is the claim.
We now prove the proposition by induction. By Lemma 3.1, we know that the optimal set of two-means is α2 = {a(1), a(1, ∞)}. Here ˜W(α2) = W (α2) = {1}. Since a(1) ∈ α2, we have α3 = {a(11), a(11, ∞), a(1, ∞)}. Again, as a(1, ∞) ∈ α2, we have α3 = {a(1), a(2), a(2, ∞)}.
Clearly by Lemma 3.3, the sets α3 are optimal sets of three-means. Thus, the proposition is true for n = 2. Let us now assume that αm is an optimal set of m-means for some m ≥ 2.
Write
W(αm) := {ω ∈ N∗ : a(ω) or a(ω, ∞) ∈ αm}, and
W˜(αm) := {τ ∈ W (αm) : pτs2τ ≥ pωs2ω for all ω ∈ W (αm)}.
If τ 6∈ ˜W(αm), i.e., if τ ∈ W (αm) \ ˜W(αm), then by the claim, if a(τ ) ∈ αm the error Z
min{(x − a)2 : a ∈ (αm\ {a(τ )}) ∪ {a(τ 1), a(τ 1, ∞)}}dP, or if a(τ, ∞) ∈ αm the error
Z
min{(x − a)2 : a ∈ (αm\ {a(τ, ∞)}) ∪ {a(τ−(τ|τ |+ 1)), a(τ−(τ|τ |+ 1), ∞)}}dP
is either equal or larger, in fact strictly larger if n is not of the form 2k for any positive integer k, than the corresponding error obtained in the case where τ ∈ ˜W(αm). Hence, for any τ ∈ ˜W(αn) the set αm+1 := αm+1(τ ), where
αm+1(τ ) = (αm\ {a(τ )}) ∪ {a(τ 1), a(τ 1, ∞)} if a(τ ) ∈ αm,
(αm\ {a(τ, ∞)}) ∪ {a(τ−(τ|τ |+ 1)), a(τ−(τ|τ |+ 1), ∞)} if a(τ, ∞) ∈ αm, is an optimal set of (m + 1)-means. Thus, by the principle of mathematical induction, the proposition is true for all positive integers n ≥ 2. Thus, the proof of the proposition is
complete.
Lemma 4.2. Let n ∈ N be such that n = 2k for some k ≥ 1. Then, α(k) := {a(ω) : pω = 1
2k} ∪ {a(ω, ∞) : pω = 1 2k}
is an optimal set of n-means. Set αj(k) = α(k) ∩ Jj for 1 ≤ j ≤ k. Then, Sj−1(αj(k)) is an optimal set of 2k−j-means for 1 ≤ j ≤ k. Moreover, n =Pk
j=12k−j+ 1 and Vn=
k
X
j=1
1
18jV2k−j + 1 18kV1.
Proof. Let us prove the lemma by induction. If n = 2, by Lemma 3.1, we have α(1) = {a(1), a(1, ∞)} = {a(ω) : pω = 12} ∪ {a(ω, ∞) : pω = 12} is an optimal set of two-means. Here α1(1) = α(1) ∩ J1 = {a(1)}. Note that card(α1(1)) = 1, and S1−1(α1) = {12} is an optimal set of one-mean. Moreover, V2 = 181 V1+ 181V1. Thus, the lemma is true for n = 2. Let the lemma be true if n = 2k for some k = m, where m ∈ N and m ≥ 2. We will show that it is also true for k = m + 1. We have
α(m) = {a(ω) : pω = 1
2m} ∪ {a(ω, ∞) : pω = 1 2m}.
List the elements of α(m) as a1, a2,· · · , a2m, i.e., α(m) = {aj : 1 ≤ j ≤ 2m}. Construct the sets Aj for 1 ≤ j ≤ 2m as follows:
Aj := {a(ω1), a(ω1, ∞)} if aj = a(ω) for some ω ∈ N∗,
{a(ω−(ω|ω|+ 1)), a(ω−(ω|ω|+ 1), ∞)} if aj = a(ω, ∞) for some ω ∈ N∗. For 1 ≤ j ≤ 2m, set α2m+j = (α(m) \ ∪j
k=1{ak}) ∪ A1∪ A2∪ · · · ∪ Aj. Since α2m is an optimal set of 2m-means, by Proposition 4.1, α2m+1 is an optimal set of (2m+ 1)-means, which implies α2m+2 is an optimal set of (2m+ 2)-means, and thus proceeding inductively, one can say that the set
α2m+1 := α2m+2m = (α(m) \ ∪2k=1m {ak}) ∪ A1∪ A2 ∪ · · · ∪ A2m = A1∪ A2∪ · · · ∪ A2m is an optimal set of 2m+1-means. Note that for any ω ∈ N∗ if a(ω) or a(ω, ∞) ∈ Aj, then pω = 2m+1, and so
α2m+1 = α(m + 1) = {a(ω) : pω = 1
2m+1} ∪ {a(ω, ∞) : pω = 1 2m+1}.
Therefore, by using the principle of mathematical induction, one can say that the set α(k) is an optimal set of n-means if n ∈ N and n = 2k for some k ≥ 1. To complete the rest of the proof, we proceed as follows: For any ω = ω1ω2· · · ω|ω| ∈ N∗, we have a(ω) := Sω(12) ∈ Jω1. Again, from the definitions of a(ω), a(ω, ∞), if a(ω) ∈ Jω1 and |ω| > 1, then a(ω, ∞) ∈ Jω1. Keeping ω1 fixed, if ω1 < k, it is easy to see that there are 2k−ω1−1 different τ ∈ N∗ such that pωτ = 21k. Thus, we see that for any ω = ω1ω2· · · ω|ω| ∈ N∗ with |ω| > 1 and pω = 21k, the optimal set α(k) contains 2k−ω1 elements from Jω1; in other words, card(α(k) ∩ Jω1) = 2k−ω1. If |ω| = 1 and pω = 21k, i.e., when ω = k, then a(k) ∈ Jk, i.e., α(k) contains only one element from Jk. Besides, α(k) contains the element a(k, ∞). Write αj(k) = α(k)∩Jj. Then, card(αj(k)) = 2k−j for 1 ≤ j ≤ k. For any 1 ≤ j ≤ k − 1, by the definition of the mappings, we have
Sj−1(αj(k)) = {a(ωj+1· · · ω|ω|) : pωj+1···ω|ω| = 1
2k−j} ∪ {a(ωj+1· · · ω|ω|,∞) : pωj+1···ω|ω| = 1 2k−j}, and Sk−1(αk(k)) = {12}. Thus, for all 1 ≤ j ≤ k, one can see that Sj−1(αj(k)) = α(k − j).
Hence, by the first part of the lemma, for each 1 ≤ j ≤ k, the set Sj−1(αj(k)) is an optimal set
of 2k−j-means. Now, Vn =
Z
a∈α(k)min kx − ak2dP =
k
X
j=1
Z
Jj
a∈αminj(k)(x − a)2dP + Z
J(k,∞)
(x − a(k, ∞))2dP
=
k
X
j=1
pj
Z
a∈αminj(k)(x − a)2dP ◦ Sj−1+ Z
Jk
(x − a(k))2dP, which yields
Vn=
k
X
j=1
1 18j
Z
min
a∈SJ−1(αj(k))
(x − a)2dP + 1 18kV1 =
k
X
j=1
1
18jV2k−j + 1 18kV1.
Thus, the proof of the lemma is complete.
Remark 4.3. The set α(k)) given by Lemma 4.2 is a unique optimal set of n-means where n= 2k for some k ∈ N.
In regard to Lemma 4.2 let us give the following example.
Example 4.4. Take n = 16 = 24. Then,
α(4) = {a(1111), a(1111, ∞), a(112), a(112, ∞), a(121), a(121, ∞), a(13),
a(13, ∞), a(211), a(211, ∞), a(22), a(22, ∞), a(31), a(31, ∞), a(4), a(5, ∞)}.
Since, αj(4) = α(4) ∩ Jj for 1 ≤ j ≤ 4, we have
α1(4) = {a(1111), a(1111, ∞), a(112), a(112, ∞), a(121), a(121, ∞), a(13), a(13, ∞)}, α2(4) = {a(211), a(211, ∞), a(22), a(22, ∞)},
α3(4) = {(31), a(31, ∞)}, α4(4) = {a(4)}.
Here, S1−1(α1(4)) = {a(111), a(111, ∞), a(12), a(12, ∞), a(21), a(21, ∞), a(3), a(3, ∞)} is an op- timal set of 23-means, S2−1(α2(4)) = {a(11), a(11, ∞), a(2), a(2, ∞)} is an optimal set of 22- means, S3−1(α3(4)) = {a(1), a(1, ∞)} is an optimal set of 2-means, and S4−1(α4(4)) = {12} is an optimal set of one-mean. Moreover, one can see that
V16= 1
18V8+ 1
182V4+ 1
183V2 + 1
184V1+ 1 184V1.
Theorem 4.5. For n ∈ N with n ≥ 2 let ℓ(n) ∈ N satisfy 2ℓ(n)≤ n < 2ℓ(n)+1. Let α(ℓ(n)) and αn(I) be the sets as defined in Definition 2.9. Then, αn(I) is an optimal set of n-means with quantization error
Vn= 1 18ℓ(n)
1 8
2ℓ(n)+1− n + 1
9(n − 2ℓ(n)) .
The number of such sets is 2ℓ(n)Cn−2ℓ(n), where uCv = uv is a binomial coefficient.
Proof. By Lemma 4.2, α(ℓ(n)) is an optimal set of 2ℓ(n)-means. Choose I ⊂ α(ℓ(n)) such that card(I) = n−2ℓ(n). List the elements of I as a1, a2,· · · , an−2ℓ(n), i.e., I = {aj : 1 ≤ j ≤ n−2ℓ(n)}.
Construct the sets Aj for 1 ≤ j ≤ n − 2ℓ(n) as follows:
Aj := {a(ω1), a(ω1, ∞)} if aj = a(ω) for some ω ∈ N∗,
{a(ω−(ω|ω|+ 1)), a(ω−(ω|ω|+ 1), ∞)} if aj = a(ω, ∞) for some ω ∈ N∗. For 1 ≤ j ≤ n − 2ℓ(n), set
α2ℓ(n)+j = (α(ℓ(n)) \ ∪j
k=1{ak}) ∪ A1∪ A2∪ · · · ∪ Aj.
As shown in Lemma 4.2, proceeding inductively, the set αn(I) := α2ℓ(n)+(n−2ℓ(n)) = (α(ℓ(n)) \ I) ∪ A1 ∪ A2 ∪ · · · ∪ An−2ℓ(n) is an optimal set of n-means. Then, using Proposition 2.14, we obtain the quantization error as
Vn = Z
a∈αminn
kx − ak2dP = 1 18ℓ(n)
1 8
2ℓ(n)+1− n + 1
9(n − 2ℓ(n)) .
Since the subset I from the set α(ℓ(n)) can be chosen in2ℓ(n)Cn−2ℓ(n) different ways, the number of αn(I) is 2ℓ(n)Cn−2ℓ(n). Thus, the proof of the theorem is complete.
Remark 4.6. Let β be the Hausdorff dimension of the limit set generated by the infinite similitudes {Sj}∞j=1. Then, we know (see [M]):
∞
X
j=1
(1
3j)β = 1,
which gives β = log 2log 3, and it is same as the Hausdorff dimension of the classical Cantor set C generated by the similitudes S1 and S2 where S1(x) = 13x and S2(x) = 13x+23 for all x ∈ R.
Recall that the quantization dimension of a probability measure P is defined to be the number
n→∞lim
2 log n
− log Vn
,
if the limit exists in R, and for any κ > 0 the number lim
n→∞n2κVn, if it exists, is called the κ-dimensional quantization coefficient for P . Since the nth quantization error Vn for the probability measure P generated by the infinite similitudes considered in this paper is same as the nth quantization error for the Cantor distribution considered by Graf-Luschgy in [GL3], the following two theorems are also true for the probability measure P in this paper.
Theorem 4.7. (see [GL3, Theorem 6.3]) The set of accumulation points of the sequence (nβ2Vn)n∈N equals
h1
8, f( 17 8 + 4β)i
,
i.e., the β-dimensional quantization coefficient for the probability measure P does not exist, where f : [1, 2] → R is defined by f (x) = 721xβ2(17 − 8x).
Theorem 4.8. (see [GL3, Theorem 6.6]) The quantization dimension of P equals the Hausdorff dimension β of the limit set generated by the infinite similitudes.
References
[DFG] Q. Du, V. Faber and M. Gunzburger, Centroidal Voronoi Tessellations: Applications and Algorithms, SIAM Review, Vol. 41, No. 4 (1999), 637-676.
[GG] A. Gersho and R.M. Gray, Vector quantization and signal compression, Kluwer Academy publishers:
Boston, 1992.
[GKL] R.M. Gray, J.C. Kieffer and Y. Linde, Locally optimal block quantizer design, Information and Control, 45 (1980), 178-198.
[GL1] A. Gy¨orgy and T. Linder, On the structure of optimal entropy-constrained scalar quantizers, IEEE transactions on information theory, vol. 48, no. 2, February 2002.
[GL2] S. Graf and H. Luschgy, Foundations of quantization for probability distributions, Lecture Notes in Mathematics 1730, Springer, Berlin, 2000.
[GL3] S. Graf and H. Luschgy, The Quantization of the Cantor Distribution, Math. Nachr., 183 (1997), 113-133.
[GN] R. Gray and D. Neuhoff, Quantization, IEEE Trans. Inform. Theory, 44 (1998), 2325-2383.
[H] J. Hutchinson, Fractals and self-similarity, Indiana Univ. J., 30 (1981), 713-747.
[M] M. Moran, Hausdorff measure of infinitely generated self-similar sets, Monatsh. Math. 122 (1996), 387- 399.
[P1] P.L. Zador, Asymptotic quantization error of continuous signals and the quantization dimension, IEEE Transactions on Information Theory, 1982, Vol. 28 Issue 2, 139-149.
[P2] K. P¨otzelberger, The quantization dimension of distributions, Math. Proc. Camb. Phil. Soc. (2001), 131, 507-519.
[R1] L. Roychowdhury, Optimal quantizers for probability distributions on nonhomogeneous Cantor sets, arXiv:1512.00379 [stat.CO].
[R2] M.K. Roychowdhury, Quantization and centroidal Voronoi tessellations for probability measures on dyadic Cantor sets, arXiv:1509.06037 [math.DS].
[S] W.F. Sheppard, On the Calculation of the most Probable Values of Frequency-Constants, for Data arranged according to Equidistant Division of a Scale, Proc. London Math. Soc. (1897) s1-29 (1): 353- 380.
School of Mathematical and Statistical Sciences, University of Texas Rio Grande Valley, 1201 West University Drive, Edinburg, TX 78539-2999, USA.
E-mail address: [email protected]