• No results found

Testing a differential condition and local normality of densities

N/A
N/A
Protected

Academic year: 2020

Share "Testing a differential condition and local normality of densities"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

Testing a differential condition and local

normality of densities

Mynbayev, Kairat and Aipenova, Aziza

Kazakh-British Technical University, Kazakh National University

after al-Farabi

2013

(2)

N E W S

OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN

PHYSICO-MATHEMATICAL SERIES

ISSN 1991-346Х

Volume 5, Number 297 (2014), 42 – 48

UDK 519.224

TESTING A DIFFERENTIAL CONDITION

AND LOCAL NORMALITY OF DENSITIES

K. T. Mynbaev1, A. S. Aipenova2

1

Kazakh British Technical University, Almaty, Kazakhstan,

RSE «Institute of Mathematics and Mathematical Modeling», Almaty, Kazakhstan;

2

Kazakh National University after al-Farabi, Almaty,

RSE «Institute of Mathematics and Mathematical Modeling», Almaty, Kazakhstan. E-mail: [email protected], [email protected]

Key words: testing, local normality test, alternative hypothesis, null hypothesis, asymptotic normality.

Abstract. In this paper, we consider testing if a density satisfies a differential equation. This result can be applied to see if a density belongs to a particular family of distributions. For example, the standard normal density

1/ 2 2

( ) (2 ) exp( / 2)

f t = π − −t satisfies the differential equation f t'( )+tf t( )=0. If a density satisfies this equation at that point t, then it is called locally standard normal at that point. Thus, there is a practical need to test whether a

density satisfies a certain differential equation. We consider the expression ( )

0

( ) ( ) ( )

L

l l l

F x g x f x

=

=

.We can test the

null hypothesis H0: f satisfies the equation F(x) = 0 against the alternative hypothesis Ha: F(x) ≠ 0. The testing

pro-cedure is accompanied by an asymptotic normality statement.

1. Introduction. In statistics, normality tests are used to determine if a data set is well-modeled by a normal distribution and to compute how likely it is for a random variable underlying the data set to be normally distributed. Nevertheless normality tests are useful in many areas of forecasting and econometric inference as complements to other diagnostic tests. The tests are a form of model selection, and can be interpreted several ways, depending on one's interpretations of probability, for example, in frequentist statistics, statistical hypothesis testing, data are tested against the null hypothesis that it is normally distributed.

In this paper, we propose a local normality test. Consider, for example, the standard normal density

1/ 2 2

( )

(2 )

exp(

/ 2)

f t

=

π

t

. It satisfies a differential equation

f t

'( )

+

tf t

( )

=

0

. The general solution of this equation is

f t

( )

=

c

exp(

t

2

/ 2)

, and if it is to be a density, one has to put

c

=

(2 )

π

−1/ 2. We say that a density f is locally standard normal at point t if it satisfies the above differential equation at that point. Thus, there is a practical need to test whether a density satisfies a certain differential equation. The testing procedure is accompanied by an asymptotic normality statement.

(3)

the highest power under all conditions for large sample size. Most existing tests are based on some global properties of normal distributions. Our test is local. Both global and local approaches have their advantages and deficiencies. The main difference between the global and local approaches consists in the amount of calculation: rejecting normality locally is enough to reject it globally.

2. Main results. Let X1, X2, … Xn be independent identically distributed observation from distribution

having unknown density f. The Rosenblat-Parzen estimator for the density f evaluated at xR is defined by

1

1 1

ˆ ( ) n j

R

j

x X

f x K

n = h h

⎛ ⎞

=

⎝ ⎠

,

where h> 0 is a bandwidth and K is a kernel on R satisfying ( ) 1

R

K t dt=

Denote j( ) j ( )

R

K x K x dx

α

=

the j-th moment of K and let K be a kernel of order q, that is

α

j

( )

K

=

0

,

j

=

1,...,

q

1

,

α

q

( )

K

0

If f and K are l times continuously differentiable, then differentiation of

f

ˆ ( )

R

x

leads to the estimator

of

f

( )l

( )

x

( ) ( )

1 1

1 1

ˆ ( )l n l j

l j

x X

f x K

n = h+ h

⎛ ⎞

=

⎝ ⎠

. (1) In asymptotic statements the sample size n tends to infinity and the bandwidth h depends on n but this dependence usually is not reflected in the notation.

Now we turn to the testing for local normality. More generally, consider the expression

( )

0

( )

( )

( )

L

l l l

F x

g x f

x

=

=

, (2)

where

{

g xl( )

}

are given functions and the senior coefficient gL is different from zero at the given point

x. We can test

a) the null hypothesis H0: f satisfies the equation F(x) = 0 against

b) the alternative hypothesis Ha: F(x) ≠ 0.

It is convenient to use the differential operator D defined by

(

Df

)( )

x

=

F x

( )

. Since the derivative

( )

ˆ ( )

l

f

x

is estimated by (1), it is natural to estimate F(x) by

( ) ( )

1

0 1 0

( ) 1

ˆ

ˆ ( ) ( ) ( , )

L n L

l l l j

h l h l

l j l

x X g x

F x g x f x K K

n h+ h

= = =

⎛ ⎞

= =

⎝ ⎠

∑∑

. (3)

As one can see from part (a) of the next theorem, under the null hypothesis it also makes sense to consider the random variable G xˆh( )=F xˆh( ) /h. Provided that

( ) L( ) 0

f x g x ≠ , (4)

let Ψ denote a normal variable distributed as ( )

2

0, ( ) L( ) L ( )

N⎛⎜ f x g x K t dt⎞⎟

⎠.

Assumption 1.

a)f is infinitely differentiable.

b)K has l continuous derivatives and maxl=0,...,L&K( )l &C R( )< ∞.

Theorem 1. Suppose that Assumption 1 holds. Then the following statements are true: a) The bias of (3) is given by

( )

( )

( )

ˆ ( )

( )

( )

( )

( )

!

q L

q

h L q

f

x

EF x

F x

g

x

h

K

O h

q

α

+

=

+

. (5)

Consequently, under H0

ˆ ( ) ( q)

h

(4)

If, however, EF xˆ ( )hconst≠0, as h→0, then

F x

( )

0

and

H

0 can be rejected.

b)If

nh

2L+1

→ ∞

and (4) holds, then under the null plimF xˆh( )=0 (this equation is preferable to (6)

because in practice EF xˆ ( )h is unknown).

c)If nh→ ∞ and (4) holds, then

d)

(

2L 1

)

1/ 2 ˆ ( ) ˆ ( ) d

h h

nh + ⎡F xEF x⎯⎯→ Ψ. (7)

If, in addition,

nh

2L+3

0

, then

(

2 1

)

1/ 2

ˆ ( )

( )

d

L

h

nh

+

F x

F x

⎯⎯

→Ψ

e) If nh→ ∞, and

nh

2L+3

0

and (4) holds, then under the null

(

nh

2L+1

)

1/ 2

G x

ˆ ( )

h

⎯⎯

d

→Ψ

.

Proof: a) Denoting

( ) 1 0 ( ) , 1,..., L l j l j l l x X g x

K j n

h h

ω

+ = − ⎛ ⎞ = = ⎝ ⎠

we rewrite (3) as

1

1

ˆ ( )h n j

j

F x

n =

ω

=

, (8)

( )

1 1

1 0 1

( )

1

1

ˆ ( )

n L l l j n

h l j

j l j

x

X

g x

EF x

E

K

E

E

n

= =

h

+

h

n

=

ω

ω

=

=

=

∑ ∑

. (9)

Assumption 2.

To justify integration by parts below, for any h > 0 and

j

=

0,...,

l

1,

l

1

( ) ( 1 ) ( ) ( 1 )

lim

j

(

)

l j

(

)

lim

j

(

)

l j

(

)

0

s

K

s f

x

sh

s

K

s f

x

sh

− − − −

→−∞

+

=

→+∞

+

=

Using (9),

( ) ( )

1 1 1

1 0 0

( )

( )

1

ˆ ( )

( )

n L L

l j l

l l

h l l

j l l R

x

X

g x

g x

x

s

E

EF x

E

K

K

f s ds

n

h

h

h

h

ω

+ +

= = =

=

=

=

=

∑ ∑

(changing variables) ( )

( )

( ) 0 0

( ) 1

( ) ( ) ( ) ( ) L L l l l l l l

l R l R

g x

K t f x ht dt g x K t f x ht dt

h h = = ⎛ ⎞ = − + = − + = ⎝ ⎠

(then Assumption 2 allows us to integrate l times by parts)

( )1 ( )1

1 0 0

1

1

( )

(

) (

)

( )

(

) '(

)

...

L L l l

l l l l

l l R

g x

K

t f x

ht

g x

K

t f x

ht dt

h

h

+∞ − − − = −∞ =

=

+

+

+

= =

( ) ( 1) 0 0

1

( )

(

)

(

)

( )

(

)

(

)

L L l l l l

l l R

g x

K

t f

x

ht

g x

K

t f

x

ht dt

h

+∞ − = −∞ =

=

+

+

+

=

( ) 0 ( ) ( ) ( ) L l l l R

g x K t f x ht dt

=

⎛ ⎞

= − +

⎝ ⎠

This integral typically is not analytically solvable, so we approximate it using Taylor expansion of

( )

(

)

l

f

x

+

ht

in the argument ht, which is valid as h → 0. For a q-th order kernel we take the expansion out to the q-th term

( ) ( )

( )

( )

( ) ( )

( )

( )

1

(

)

( )

(

)

i l q l

i q

l

f

x

l

f

x

q

f

x

ht

ht

f

x

ht

o h

+ +

+

(5)

Using

K t dt

( )

1

+∞

−∞

=

and q( ) q ( )

R

K x K x dx

α

=

, we get

( ) ( )

( )

1

1 0 0 ( ) ( ) ( ) ( ) ( )( ) ( ) ! q l L L q

l q q

l l

l l R

f x

E g x f x g x h K t t dt o h

q

ω

+ +

= = ⎛ ⎞ = + − − − + = ⎝ ⎠

( )

( )

1

( )

(

)( )

( )

( )

(

)

!

q L q q L q

f

x

Df

x

g x

h

K

o h

q

α

+

+

=

+

+

. (10)

Equation (5) follows from (3) and (10)

( )

( )

1

( )

ˆ ( )

( )

( )

( )

(

)

!

q L

q q

h L q

f

x

EF x

F x

g x

h

K

o h

q

α

+

+

=

+

.

The rest of part a) is an obvious consequence of (5).

b) We need an asymptotic expression for the variance of F xˆ ( )h . By the i.i.d. assumption

(

)

( )

2

(

)

2

1 1

2 1

1 1

ˆ

var ( ) var

n

h j

j

F x E E

n =

ω

n

ω

ω

⎡ ⎤

=

= .

We need to evaluate

E

ω

12 and

(

)

2 1

E

ω

.

( ) ( )

2 1 1

1 2 , 0 ( ) ( ) L l m l m l m l m

g x g x x X x X

E EK K

h h h

ω

+ +

= − − ⎛ ⎞ ⎛ ⎞ = = ⎝ ⎠ ⎝ ⎠

( ) ( ) ( )

( )

( )

( )

( )

( )

(

)

( )

( )

( )

( )

2 , 0 1 , 0

2 2 2

2

2 1 2 1 2

0 , 2 1 ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) L l m l m l m

l m R

L

l m

l m

l m

l m R

L l m L l m l m

L L

L L

l m L L

R R

l m L

g x g x x s x s

K K f s ds

h h h

g x g x

K t K t f x ht dt h

g x g x

g x g x

f x K t dt h f x K t K t dt

h h g x

+ + = + + = − − + + ≤ ≤ + ≤ − − − ⎛ ⎞ ⎛ ⎞ = = ⎝ ⎠ ⎝ ⎠ = − − + = ⎧ ⎫ ⎡ = + − − ⎩ ⎭ ⎣

( )

( )

(

)

2 2 2 1 ( )

( ) L ( ) .

L L

R

g x

f x K t dt O h

h + ⎤ = ⎢ ⎥ ⎦ ⎧ ⎫ = +

⎭ (11) Combining (10) and (11), we obtain the expression for the variance

(

)

{

( )

}

[

]

( )

{

}

2 2 2 0 2 1 2 0 2 1

( )

1

ˆ

var

( )

( )

( )

( )

( )

1

( )

( )

( ) .

L L h L L L L

g

x

F x

f x

K

O h

F x

O h

nh

n

f x

g x K

O h

nh

α

α

+ +

=

+

+

=

=

+

(12)

If

H

0 holds then by (5) EF xˆ ( )h →0, h→0 and by the Chebyshov inequality (12) implies

(

)

2

(

)

1

ˆ

( )

ˆ

( )

var

ˆ

( )

0,

0

h h h

P F x

EF x

ε

F x

h

ε

, for any

ε

>0.

Hence,

F x

ˆ

h

( )

=

F x

ˆ

h

( )

EF x

ˆ

h

( )

+

EF x

ˆ

h

( )

⎯⎯

P

0

.

c) Let us prove convergence in distribution of the standardized version

1/ 2

ˆ ( ) ˆ ( ) / var(ˆ ( ))

n h h h

(6)

1/ 2

1 1

1

ˆ

var(

( ))

n n j j n nj j j h

E

S

X

n

F x

ω

ω

= =

=

=

where 1/ 2 ˆ

/ var( ( ))

nj j j h

X =

ω

E

ω

F x .

It is easy to see that by the i.i.d property

0,

nj

EX

=

2

var(

)

1

var(

)

ˆ

var(

( ))

j nj

h

X

n

n

F x

ω

=

=

,

var(

S

n

) 1

=

.

Alternatively, using the notation in the Lindeberg-Feller theorem [7], we can rewrite

0,

nj

µ

=

nj

1

,

n

σ

=

σ

n

=

1

.

Let

F

nj be the distribution function of

X

nj. Since

X

nj are i.i.d., all

F

nj coincide with

F

n1 and the Lindeberg function takes the form

2 2 2 1 1 2 1 2

2 1 1

1 1 / 2

1

1

( )

( )

.

ˆ

var(

( ))

n

nj n n

j

n x x

n

h

n

x dF

n

x dF

x

x

dF

x

E

E

n

E X

n

F x

δ δ ε ε δ δ δ

δ δ δ

λ

σ

ε

ω

ω

ε

ε

+

= > >

+ + + +

=

=

=

=

∑ ∫

(13)

By Holder’s inequality

(

)

1

(

)

1

2 2 2 2

1 1

2

1

E

ω

E

ω

+δ +δ

E

ω

+δ +δ

(plugging

ω

1 in and applying Minkowski’s inequality)

( ) ( )

( )

( )

( )

( )

1 1

2 2 2 2

1 1 1 0 0 1 1 2 2 2 2 1 1 0 ( ) ( )

2 2 ( )

( ) ( )

2 ( ) 3 ( ) ( ) .

L L

l l

l l

l l

l l R

L

l L

l L

l L

l R R

g x x X g x x s

E K K f s ds

h h h h

g x g x

K t f x ht dt f x K t dt O h

h h

δ δ δ δ

δ δ δ δ + + + + + + = = + + + + + + = ⎡ ⎤ ⎡ ⎤ ≤ ⎢ ⎥ = ⎢ ⎥ = ⎝ ⎠ ⎝ ⎠ ⎢ ⎥ ⎢ ⎥ ⎣ ⎦ ⎣ ⎦ ⎧ ⎫ ⎡ ⎤ ⎡ ⎤ = + + ⎣ ⎦ ⎪⎣ ⎦ ⎪

Therefore,

(

2

)

2 ( )

( )

2

2

1 1 1 1/(2 )

( )

3 LL ( ) L ( )

R

g x

E E f x K t dt O h

h δ δ δ δ δ

ω

ω

+ + + + + − + ⎛ ⎞ − ≤ ⎜ + ⎟ ⎜ ⎟ ⎝

. (14)

Combing (12) and (14), we get

( )

( )

( )

{

}

2 2

1 / 2 (2 1)(1 / 2)

1 / 2

1 1 1/(2 ) 2 / 2

0

3 ( ) ( ) ( )

( , , ) 0, ( ) ( ) ( ) ( ) L L L R L L L

g x f x K t dt O h

n h c x

n h nh

f x g x K O h

δ δ

δ δ

δ

δ δ δ δ

ε δ

λ

ε

α

+ + + + + + + + − + ⎧ ⎫ ⎡ ⎤ + ⎣ ⎦ ⎩ ⎭ ≤ = → ⎡ ⎤ + ⎣ ⎦

as nh→ ∞.

By the Lindeberg-Feller theorem the assumption nh→ ∞ implies

S

n

⎯⎯

d

N

(0,1)

.

Now we can prove the convergence stated in (7). By (12) the limit in distribution of

(

2 1

)

1/ 2

ˆ ( ) ˆ ( )

L

h h

(7)

( )

{

}

( )

{

}

1/ 2 2

0 1/ 2

1/ 2 2

0

ˆ

( )

ˆ

( )

( )

( )

( )

ˆ

var(

( ))

( )

( )

( )

.

L h h

L

h

L d

L n

F x

EF x

f x

g

x K

O h

F x

f x

g

x K

O h

S

α

α

⎤ +

=

=

+

⎯⎯

→Ψ

In the equation

(

2 1

)

1/ 2

(

2 1

)

1/ 2

(

2 1

)

1/ 2

ˆ ( ) ( ) ˆ ( ) ˆ ( ) ˆ ( ) ( )

L L L

h h h h

nh + F xF x= nh + F xEF x+ nh + ⎡EF xF x

the first term on the right converges in distribution, as we have just proved, and the second is of order

(

)

(

2 1 1/ 2

)

(

(

2 3

)

1/ 2

)

(1)

L L

O

nh

+

h

=

O

nh

+

=

o

, according to (5) and the assumption

nh

2L+3

0

.

(d) Under the null one has the identity

(

nh2L+3

)

1/ 2G xˆh( )=

(

nh2L+1

)

1/ 2⎡F xˆh( )−F x( )⎤, the conclusion follows from part (c).

REFERENCES

[1] Razali N.M., Wah Y.B. Power comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling tests. Journal of Statistical Modeling and Analytics. 1983. Vol. 2, N 1. P. 21-33.

[2] Thadewald T., Buning H. Jarque-Bera test and its competitors for testing normality – a power comparison. Journal of Applied Statistics. 2007. Vol. 34. P. 87-105.

[3] Surucu B. A power comparison and simulation study of goodness-of-fit tests. Computers and Mathematics with Applications. 2008. Vol. 56. P. 1617-1625.

[4] Farrell P.J., Rogers-Stewart K. Comprehensive study of tests for normality and symmetry: extending the Spiegel-haltertest. Journal of Statistical Computation and Simulation. 2008. Vol. 76. P. 803-816.

[5] Szekely G.J., Rizzo M.L. A new test for multivariate normality. Journal of Multivariate Analysis. 2005. Vol. 93. P. 87-105.

[6] Saculinggan M., Balase E.A. Empirical power comparison of goodness of fit tests for normality in the presence of outliers. Journal of Physics. 2013. Vol. 435. P. 1-11.

[7] Henze N., Zirkler B. A class of invariant and consistent tests for multivariate normality. Communications in Statistics – Theory and Methods. 1990. Vol. 19. P. 3595–3617.

[8] Davidson J. Stochastic limit theory. New York Oxford University Press, 1994.

ДИФФЕРЕНЦИАЛДЫҚШАРТТАРДЫТЕСТІЛЕУ

ЖƏНЕТЫҒЫЗДЫҚТЫҢЖЕРГІЛІКТІНОРМАЛI

Қ. Т. Мынбаев1, А. С. Аипенова2

1Қазақ

cтан-Британтехникалықуниверситеті, Алматы, Қазақстан

РМК «Математикажəнематематикалықмодельдеуинституты», Алматы, Қазақстан,

2əл

-ФарабиатындағыҚазақұлттықуниверситеті, Алматы, Қазақстан,

РМК «Математикажəнематематикалықмодельдеуинституты», Алматы, Қазақстан

Тірек сөздер: тестілеу, жергіліктінормальды тест, баламалы болжау, нөлдік болжау, асимптотикалық

нормальдық.

Аннотация. Жұмыста, тығыздық функциясы дифференциалдық теңдеуді қанағаттандыратын тестілеу

əдісіұсынылады. Бұлнəтиже тығыздықфункциясынақтыүлестірімдержиынына жататынынтексеруүшін

қолданылуымүмкін. Мысалы, стандарттынормальді тығыздық f t( )=(2 )

π

−1/ 2exp(−t2/ 2) жиыныкелесі

дифференциалдықтеңдеуді f t'( )+tf t( )=0 қанағаттандырады. Егер тығыздық осы теңдеудіt нүктесінде

қанағаттандырса, ондаолосынүктедежергіліктістандарттынормальдідепаталады. Сондықтан, тексерудің

тəжірибелік қажеттілігі туындайды, яғни тығыздық функциясы қандайда бір дифференциалдық теңдеуді

қанағаттандыратынынтексерукерек. Мынадайөрнекті ( )

0

( ) ( ) ( )

L

l l l

F x g x f x

=

=

қарастырамыз. БізH0нөлдік

болжаудытексере аламыз, егер f функциясы F(x) = 0 теңдікті қанағаттандырса баламалы болжауға қарсы

Ha: F(x) ≠ 0.

(8)

ТЕСТИРОВАНИЕДИФФЕРЕНЦИАЛЬНЫХУСЛОВИЙ

ИЛОКАЛЬНАЯНОРМАЛЬНОСТЬПЛОТНОСТИ

К. Т. Мынбаев1, А. С. Аипенова2

1Казахстанско

-Британскийтехническийуниверситет, Алматы, Казахстан,

РГП «Институтматематикииматематическогомоделирования», Алматы, Казахстан;

2Казахскийнациональныйуниверситетим

. аль-Фараби, Алматы, Казахстан,

РГП «Институтматематикииматематическогомоделирования», Алматы, Казахстан

Ключевыеслова:тестирование, тестированиелокальнойнормальности, альтернативнаягипотеза, нуле

-ваягипотеза, асимптотическаянормальность.

Анотация. В этой статье предлагается процедура тестирования ли плотность удовлетворяет ли

плотность дифференциальному уравнению. Этот результат может быть применен для проверки, что

плотностьотноситсякконкретномусемействураспределений. Например, стандартнаянормальнаяплотность

1/ 2 2

( ) (2 ) exp( / 2)

f t =

π

− −t удовлетворяет дифференциальному уравнению f t'( )+tf t( )=0. Если плот

-ностьудовлетворяетэтомууравнениювэтойточкеt,тоонаназываетсялокальностандартнойнормальнойв

этойточке. Такимобразом, существуетпрактическаянеобходимостьвпроверке, удовлетворяетлиплотность

некоторому дифференциальному уравнению. Рассмотрим выражение ( )

0

( ) ( ) ( )

L

l l l

F x g x f x

=

=

. Мы можем

проверить нулевую гипотезу H0: f удовлетворяет уравнению F(x) = 0 против альтернативной гипотезы

Ha: F(x) ≠ 0.

Методтестирования сопровождается утверждениемобасимптотической нормальности.  

References

Related documents

walmart does notary service businesses offering free notaries public on staff, an original document online notarization providers connect you recommend and regulated by plea to

None of reference is invalid, drug information to opioids under the references from a group of the technologies we cannot show you already on our pdr.. Just some electronic access

Product mix is shifting towards Google and new customer sales have declined dramatically Product mix is shifting towards Google and new customer sales have declined dramatically

 Using symmetric algorithms for payment card data invites

Our discussion within each of these arenas seeks to draw together and develop three literature- based propositions: (1) that states affect unfree labour through their governance

The first step is in 2011 when Team Kshatriya will compete at the most prestigious Mini Baja event, SAE Mini Baja Asia India at NATRAX, Pithampur,Madhya Pradesh.. Our goal is to

The HistoryMakers is a national 501(c)(3) non profit educational institution founded in 1999, committed to preserving, developing and providing easy access to an

That within each program plan tactics that location, setting a great success is the messages should be improved marketing strategy down into loyal buyers.. When to influencing the