Further proofs - CHI-SQUARE APPROXIMATION BY STEIN S METHOD WITH APPLICATION TO PEARSON S STATI

PROOF OFLEMMA3.1. From Lemma I.4 in [33], ψ exists and ψ (w)= e¹²^w² ^∞

g⁽³⁾(s)e⁻¹²^s²ds

= −e¹²^w² ^w

−∞g⁽³⁾(s)e⁻¹²^s²ds.

From g⁽³⁾(s)= 3sf(s²)+ 2s³f⁽³⁾(s²), we get that (5.1) g⁽³⁾(x)≤ 3|x|f+ 2|x|³f⁽³⁾. Hence, for w > 0

ψ (w)≤ 3fe¹²^w²

_∞ w

se⁻¹²^s²ds+ 2f⁽³⁾e¹²^w²

_∞ w

s³e⁻¹²^s²ds

= 3f+ 2w²+ 2f⁽³⁾. Similarly, for w < 0

ψ (w)≤ 3fe¹²^w²

−∞(−s)e⁻¹²^s²ds+ 2f⁽³⁾e¹²^w²

−∞(−s)³e⁻¹²^s²ds

= 3f+ 2w²+ 2f⁽³⁾,

proving the first bound. The second bound uses (3.6) to obtain xψ(x)= x²ψ (x)+ xg⁽³⁾(x)

and the bound then follows directly with (3.7) and (5.1). For the last bound, differ-entiate (3.6) to get

ψ(x)= xψ(x)+ ψ(x) + g⁽⁴⁾(x) and combine (3.7), (3.8) and (3.4).

PROOF OFLEMMA4.2. The derivatives of g are

∂g

∂s_j(s)=1

2sjf(w),

∂²g

∂s_j²(s)=1

2f(w)+ sj²f(w),

∂²g

∂sj∂sk

(s)= sjskf(w), j= k.

Note that we can write the multivariate normal Stein equation (4.7) as

Putting all the above together gives that AMVN(0,_S)g(s)= wf(w)+1

where we used that 0 < θj <1 and npj ≥ 1 to obtain the first inequality. Similarly, EIj(1)ξ_j⁴≤ pj

ES_j⁽¹⁾⁴+ 4ES_j⁽¹⁾³+ 6ES_j⁽¹⁾²+ 4ES_j⁽¹⁾+ 1

< pj

4+ 4 · 4^3/4+ 6 + 4 + 1<27pj

and

EIj(1)ξ_j⁶≤ pj

ES_j⁽¹⁾⁶+ 6ES_j⁽¹⁾⁵+ 15ES_j⁽¹⁾⁴+ 20ES_j⁽¹⁾³ + 15ES_j⁽¹⁾²+ 6ES_j⁽¹⁾+ 1

< pj

42+ 6 · 42^5/6+ 15 · 4 + 20 · 4^3/4+ 15 + 6 + 1<305pj, where we used Lemma4.1to bound the absolute moments of S_j⁽¹⁾. Also,

EI_j(1)ξ_j≤ pj

ES_j⁽¹⁾+ 1<2p_j and

EIj(1)ξ_j³≤ pj

ES_j⁽¹⁾³+ 3ES_j⁽¹⁾²+ 3ES_j⁽¹⁾+ 1

< pj

4^3/4+ 3 + 3 + 1<14pj.

When j= k, it is clear that the same bounds still hold, as Ij(1)Ik(1)= 0. PROOF OFLEMMA 4.4. Since the covariance-matrix _S is nonnegative defi-nite, the solution of the Stein equation (4.15) is well defined and is given by (see [24]):

ψi(s)= − ^∞

0 Ehi

e^−us+1− e^−2uZdu.

By dominated convergence,

∂³ψ_i

∂sa∂sb∂sc

(s)= − ^∞

e^−3uE

∂³h_i

∂sa∂sb∂sc

e^−us+1− e^−2uZdu,

and so

∂³ψi

∂sa∂sb∂sc

(s)≤ ^∞

e^−3uE ∂³hi

∂sa∂sb∂sc

e^−us+1− e^−2uZdu.

We now obtain bounds for the third-order partial derivatives of h₁ and h₂. By straightforward differentiation,

∂³h1

∂sa∂sb∂sc

(s)

= 2[δj a+ δj b+ δj c]f⁽³⁾(w)+ 4s_j(s_a+ sb+ sc) + sbscδj a+ sascδj b+ sasbδj c

f⁽⁴⁾(w)+ 8sjsasbscf⁽⁵⁾(w),

∂³h2

∂s_a∂s_b∂s_c(s)

= 6δj aδj bδj cf⁽³⁾(w)

+ 12sj[saδj bδj c+ sbδj aδj c+ scδj aδj b]f⁽⁴⁾(w)

+ 4s_j³(sa+ sb+ sc)+ 3sj²(sbscδj a+ sascδj b+ sasbδj c)f⁽⁵⁾(w) + 8sj³sasbscf⁽⁶⁾(w),

where δjj = 1 and δj k= 0 if j = k. We can bound these partial derivatives by using the inequalities δ_{j k}≤ 1 andⁿ_k₌₁|ak| ≤ ¹_nⁿ_k₌₁|ak|ⁿ for n≥ 1. Doing so yields the bounds

∂³h1

∂sa∂sb∂sc

(s)≤ 6f⁽³⁾+ 6f⁽⁴⁾s_a²+ s_b²+ s_c²+ s_j² + 2f⁽⁵⁾s_a⁴+ s_b⁴+ s_c⁴+ s_j⁴,

∂³h₂

∂sa∂sb∂sc

(s)≤ 6f⁽³⁾+ 6f⁽⁴⁾s_a²+ sb²+ sc²+ 3sj² +f⁽⁵⁾7s_s⁴+ 7sb⁴+ 7sc⁴+ 27sj⁴

3f⁽⁶⁾s_a⁶+ sb⁶+ sc⁶+ 3sj⁶ .

We now use the inequality for the third-order partial derivative of h2(s)to bound the third-order partial derivatives of ψ2(s). The random vector Z∼ MVN(0, S) can be written as (Z1, . . . , Zm), where Zj ∼ N(0, 1 − pj) and Cov(Zj, Zk)=

−√pjpk for j = k. On applying the inequality |a1+ a2|^r ≤ 2^r⁻¹(|a1|^r+ |a2|^r), where r≥ 1, we have

∂³ψi

∂sa∂sb∂sc

(s)

≤ ^∞

0 e^−3uE6f⁽³⁾+ 12f⁽⁴⁾e^−2us_a²+ s_b²+ s_c²+ 3s_j² +1− e^−2uZ²_a+ Zb²+ Zc²+ 3Z²j

+ 8f⁽⁵⁾e^−4u7s_a⁴+ 7sb⁴+ 7sc⁴+ 27sj⁴

+1− e^−2u²7Z⁴_a+ 7Zb⁴+ 7Z⁴c+ 27Z⁴j

+128

3 f⁽⁶⁾e^−6us_a⁶+ sb⁶+ sc⁶+ 3sj⁶ +1− e^−2u³Z⁶_a+ Zb⁶+ Zc⁶+ 3Z⁶j

≤ f⁽³⁾ 2 +12

5 f⁽⁴⁾4+ s_a²+ s_b²+ s_c²+ 3s_j²

+ 8 This completes the proof of inequality (4.18), and inequality (4.17) follows from a similar calculation.

PROOF OF THEOREM4.3. As was the case in the proof of Theorem4.2, we require a bound for the expressionEAm−1f (W ), which is equivalent to bounding EAMVN(0,_S)g(S). By using Taylor expansions in a similar manner to that used in the proof of Theorem4.2, we have that

Eh(W) − χ(m² −1)h=EAm−1f (W )=EAMVN(0,_S)g(S)≤ |R1| + |R2|, for some θ_j ∈ (0, 1). Carrying out a calculation similar to the one used to obtain (4.11) yields

To complete the proof, we require bounds on the expectations on the right-hand side of (5.2). Now recall that g(s)=¹₄f (w). By a straightforward differentiation,

∂³g

∂s_j³(s)= 3sjf(w)+ 2sj²f⁽³⁾(w), and so, for any j , k, l,

(5.3) ∂³g

∂sj∂sk∂sl

(s)≤|sj| + |sk| + |sl|f+2 3

|sj|³+ |sk|³+ |sl|³f⁽³⁾. To bound the expectations, we also use the inequalities E|Ij(1)ξk| < 2pj and E|Ij(1)ξ_k³| < 14pj, j, k= 1, . . . , m from Lemma4.3. Using (5.3) and these in-equalities yields

EIj(1) ∂³g

∂sj∂sk∂sl

(ξ )≤ 3·2pjf+2·14pjf⁽³⁾= 2pj

3f+14f⁽³⁾. Hence, we obtain the bound

Eh(W) − χ(m² −1)h≤ 1

√n

3f+ 14f⁽³⁾

_m j=1

√1pj + 2m m j=1

√pj

+ m j=1

m k=j

√pk+ 2 m j=1

m k=j

m l=1

√pjpkpl

(5.4)

≤ 6

√n

3f+ 14f⁽³⁾

m j=1

√1pj

Using inequality (2.11) to translate bounds for the derivatives of the solution f to bounds on the derivatives of the test function h completes the proof.

PROOF OFCOROLLARY4.2. Let α > 0, and for some fixed z > 0 define

hα(x)=

⎧⎪

⎪⎪

⎪⎨

⎪⎪

⎩

1, if x≤ z,

1− 2(x − z)²/α², if z < x≤ z + α/2, 2x− (z + α)²/α², if z+ α/2 < x ≤ z + α,

0, if x≥ z + α.

Then h_αexists and is Lipshitz continuous with hα = 1, h_α = 2/α and h_α = 4/α². Let Y_d be a χ_(d)² random variable, then by (4.5),

P(W ≤ z) − P(Ym−1≤ z)

≤ Ehα(W )− Ehα(Ym−1)+ Ehα(Ym−1)− P(Ym−1≤ z) (5.5)

≤ 12

√np_∗

6 hα + 46h_α+ 84h_α+ P(z ≤ Ym−1≤ z + α)

= 12

√np_∗

6+92

α +336 α²

+ P(z ≤ Yd≤ z + α).

Now, for d= 1 (which corresponds to m = 2), P(z ≤ Y1≤ z + α) = ^z^+α

e^−x/2

√2π xdx≤ ^α

√1

2π xdx=

2α

π .

For d≥ 2, the mode of Yd is given by d− 2. The density of Y2is clearly bounded by ¹₂, and, for d≥ 3, the density of Yd can be bounded by

2^d/2(^d₂)x^d/2⁻¹e^−x/2≤ 1

2^d/2(^d₂)(d− 2)^d/2⁻¹e^−(d−2)/2≤ 1 2√

π(d− 2), where the last inequality follows from Stirling’s inequality (x + 1) ≥

√2π x^x^+1/2e^−x, which holds for all x > 0. Therefore,

(5.6) P(z ≤ Ym−1≤ z + α) ≤

⎧⎪

⎪⎪

⎨

⎪⎪

⎪⎩

2α/π , if m= 2,

α/2, if m= 3,

α 2√

π(m− 3), if m≥ 4.

Bounds for m= 2, m = 3 and m ≥ 4 now follow on substituting inequality (5.6) into (5.5) and choosing an appropriate α. For m= 2, we take α = 52.75n^−1/5; for m= 3, we choose α = 25.27n^−1/6; and α= 30.58(m − 3)^1/6n^−1/6is taken when m≥ 4. We can obtain a lower bound similarly, which is the negative of the upper bound. The proof is now complete.

Acknowledgements. The problem was first brought to our attention by Persi Diaconis, and we are thankful for that. As so many papers, this paper would not exist without him. Larry Goldstein provided many fruitful discussions. We would also like to thank an anonymous referee for their report.

REFERENCES

[1] ASYLBEKOV, Z. A., ZUBOV, V. N. and UL’YANOV, V. V. (2011). On the approximation of some statistics of goodness-of-fit tests for the case of discrete three-dimensional data.

Sibirsk. Mat. Zh. 52 728–744.MR2883210

[2] BARBOUR, A. D. (1990). Stein’s method for diffusion approximations. Probab. Theory Related Fields 84 297–322.MR1035659

[3] BENTKUS, V. (2003). On the dependence of the Berry-Esseen bound on dimension. J. Statist.

Plann. Inference 113 385–402.MR1965117

[4] CHATTERJEE, S., FULMAN, J. and RÖLLIN, A. (2011). Exponential approximation by Stein’s method and spectral graph theory. ALEA Lat. Am. J. Probab. Math. Stat. 8 197–223.

MR2802856

[5] CHEN, L. H. Y. (1975). Poisson approximation for dependent trials. Ann. Probab. 3 534–545.

MR0428387

[6] CHEN, L. H. Y., GOLDSTEIN, L. and SHAO, Q.-M. (2011). Normal Approximation by Stein’s Method. Springer, Heidelberg.MR2732624

[7] CRESSIE, N. and READ, T. R. C. (1984). Multinomial goodness-of-fit tests. J. Roy. Statist.

Soc. Ser. B 46 440–464.MR0790631

[8] DIACONIS, P. and ZABELL, S. (1991). Closed form summation for classical distributions:

Variations on a theme of de Moivre. Statist. Sci. 6 284–302.MR1144242

[9] DICKER, L. H. and ERDOGDU, M. A. (2016). Flexible results for quadratic forms with appli-cations to variance components estimation. Ann. Statist. To appear.

[10] DÖBLER, C. (2015). Stein’s method of exchangeable pairs for the beta distribution and gener-alizations. Electron. J. Probab. 20 34.MR3418541

[11] DÖBLER, C., GAUNT, R. E. and VOLLMER, S. J. (2015). An iterative technique for bounding derivatives of solutions of Stein equations. Available atarXiv:1510:02623.

[12] FAN, J., HUNG, H.-N. and WONG, W.-H. (2000). Geometric understanding of likelihood ratio statistics. J. Amer. Statist. Assoc. 95 836–841.MR1804442

[13] GAUNT, R. E. (2013). Rates of convergence of variance-gamma approximations via Stein’s method. D.Phil. thesis, Univ. Oxford.

[14] GAUNT, R. E. (2014). Variance-gamma approximation via Stein’s method. Electron. J. Probab.

19 33.MR3194737

[15] GOLDSTEIN, L. and RINOTT, Y. (1996). Multivariate normal approximations by Stein’s method and size bias couplings. J. Appl. Probab. 33 1–17.MR1371949

[16] GÖTZE, F. (1991). On the rate of convergence in the multivariate CLT. Ann. Probab. 19 724–

739.MR1106283

[17] GÖTZE, F. and ULYANOV, V. V. Asymptotic distribution of χ²-type statistics. In Research Group Spectral Analysis, Asymptotic Distributions and Stochastic Dynamics. Bielefeld Univ., Preprint 03–033.

[18] GREENWOOD, P. E. and NIKULIN, M. S. (1996). A Guide to Chi-Squared Testing. Wiley, New York.MR1379800

[19] KOEHLER, K. J. and LARNTZ, K. (1980). An empirical investigation of goodness-of-fit statis-tics for sparse multinomials. J. Amer. Statist. Assoc. 75 446–344.

[20] LOH, W.-L. (1992). Stein’s method and multinomial approximation. Ann. Appl. Probab. 2 536–554.MR1177898

[21] LUK, H. M. (1994). Stein’s method for the Gamma distribution and related statistical applica-tions. Ph.D. thesis, Univ. Southern California, Los Angeles, CA.

[22] MANN, B. (1997). Stein’s method for χ²of a multinomial. Unpublished manuscript.

[23] MANN, B. (1997). Convergence rate for χ²of a multinomial. Unpublished manuscript.

[24] MECKES, E. (2009). On Stein’s method for multivariate normal approximation. In High Di-mensional Probability V: The Luminy Volume. Inst. Math. Stat. Collect. 5 153–178. IMS, Beachwood, OH.MR2797946

[25] NOURDIN, I. and PECCATI, G. (2009). Stein’s method on Wiener chaos. Probab. Theory Re-lated Fields 145 75–118.MR2520122

[26] PEARSON, K. (1900). On the criterion that a given system of deviations is such that it can be reasonably supposed to have arisen from random sampling. Philos. Mag. 50 157–175.

[27] PEKÖZ, E. A. and RÖLLIN, A. (2011). New rates for exponential approximation and the the-orems of Rényi and Yaglom. Ann. Probab. 39 587–608.MR2789507

[28] PICKETT, A. M. (2004). Rates of convergence of χ² approximations via Stein’s method.

D.Phil. thesis, Univ. Oxford.

[29] PIKE, J. and REN, H. (2014). Stein’s method and the Laplace distribution. ALEA Lat. Am. J.

Probab. Math. Stat. 11 571–587.MR3283586

[30] RICE, J. A. (1995). Mathematical Statistics and Data Analysis, 2nd ed. Duxbury Press, North Scituate, MA.

[31] ROSCOE, J. T. and BYARS, J. A. (1971). An investigation of the restraints with respect to sample size commonly imposed on the use of the Chi-Square statistic. J. Amer. Statist.

Assoc. 66 755–759.

[32] STEIN, C. (1972). A bound for the error in the normal approximation to the distribution of a sum of dependent random variables. In Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability, Probability Theory 2 583–602. Univ. California Press, Berkeley, CA.MR0402873

[33] STEIN, C. (1986). Approximate Computation of Expectations. IMS, Hayward, CA.

MR0882007

[34] STUART, A. and ORD, J. K. (1987). Kendall’s Advanced Theory of Statistics: Volume I, 5th ed.

Charles Griffin. London.

[35] TUMANYAN, S. H. (1956). Asymptotic distribution of χ²criterion when the size of observa-tions and the number of groups simultaneously increase. Teor. Veroyatn. Primen. 1 131–

145.MR0088090

[36] ULYANOV, V. V. and ZUBOV, V. N. (2009). Refinement on the convergence of one family of goodness-of-fit statistics to chi-squared distribution. Hiroshima Math. J. 39 133–161.

MR2499200

[37] YARNOLD, J. K. (1972). Asymptotic approximations for the probability that a sum of lattice random vectors lies in a convex set. Ann. Math. Stat. 43 1566–1580.MR0372967

DEPARTMENT OFSTATISTICS UNIVERSITY OFOXFORD 24-29 ST. GILES’ OXFORDOX1 3LB UNITEDKINGDOM

E-MAIL:[email protected] [email protected] [email protected]

In document CHI-SQUARE APPROXIMATION BY STEIN S METHOD WITH APPLICATION TO PEARSON S STATISTIC (Page 29-37)