Statistical inference for the shape parameter change point estimator in negative associated gamma distribution

(1)

R E S E A R C H

Open Access

Statistical inference for the shape parameter

change-point estimator in negative

associated gamma distribution

Chang-chun Tan

1*

_{, Bai-qi Miao}

2

_{and Xing-cai Zhou}

3

*_{Correspondence:}

[email protected]

1_{School of Mathematics, Hefei}

University of Technology, Heifei, Anhui 230009, China Full list of author information is available at the end of the article

Abstract

In this paper, the change-point estimator for the shape parameter is proposed in a negative associated gamma random variable sequence. Suppose thatX1,. . .,Xnare

negative associated random variables satisfying thatX1,. . .,X[nτ0]are identically

distributed with

(x;

ν

1,

λ

), and thatX[nτ0]+1,. . .,Xnare identically distributed with

(x;

ν

2,

λ

); the change point

τ

0is unknown. The weak and strong consistency, and

the weak and strong convergence rate of the change-point estimator, are given by the CUSUM method. Furthermore, theOPconvergence rate of the change-point

estimator is presented under the local alternative hypothesis condition.

MSC: 62F12; 62G10

Keywords: change point;

-distribution; negative associated; shape parameter; convergence rate

1 Introduction

The gamma distribution occurs frequently in a variety of applications, especially in relia-bility, in survival analysis and in modeling income distributions. The density of a gamma-distributed random variableXwith a shape parameterνand a scale parameterλis given by

f(x;ν,λ) = λ ν

(ν)x

ν–_e–λx_I ₍_x_{> ),} _()

whereI(·) is the indicator function,(·) is afunction with(p) =_∞e–x_xp–_dx_.

The family of gamma distributions includes the chi-squared distribution, exponential distribution and Erlang distribution. For example, the gamma distribution is an Erlang distribution with a positive integerν. When the shape parameterν= , the gamma dis-tribution is an exponential disdis-tribution with parameterλ; whenλ=_, the gamma distri-bution is a chi-squared distridistri-bution, with ν degrees of freedom. The shape parameter is especially of interest in reliability theory because the gamma distribution is either a decreasing failure rate (DFR), a constant or an increasing failure rate (IFR) according to whether the shape parameter is negative, zero or positive. The shape parameter also plays an important role in renewal theory when modeling arrival times of events.

As for the gamma distribution parameter change-point problems, Kander and Zacks [] proposed a statistic for testing a change in the one-parameter exponential family; Hsu []

(2)

considered a change point for the scale parameter of gamma random variables, assuming that the shape parameter was constant; Diaz [] posed the Bayesian test regarding the scale parameter change point for the independent gamma variables; Gupta and Ramanayake [], Ramanayake and Gupta [] discussed a linear trend change for the exponential distribu-tion; Ramanayake [] proposed some tests for detecting a change in the shape parameter of gamma distributions assuming thatλis constant. The strong consistency and conver-gence rate of the change-point estimator have been investigated by applying the moving averages method (Tanet al.[]), assuming that there is at most one change point.

Change-point analysis is widely used in ﬁelds such as quality control, economics and ﬁnance, biostatistics and so on (see Page []; Bai and Perron []; Braunet al.[]; Chen

et al.[]). Change-point problems have also received considerable attention due to the wide variety of applications and recent developments in computational methods. There is a considerable body of literature on change-point analysis that assume that the random variables being considered are independent.

Let X,X, . . . ,Xn be a negative associated sequence that satisﬁes the conditions that

X, . . . ,X[nτ] have the common distribution(x;ν,λ), and that X[nτ]+, . . . ,Xnhave the

common distribution (x;ν,λ), whereτis an unknown parameter called the change

point;ν,νare the shape parameters before and after change, respectively. In this paper,

we assume that the scale parameter does not change, but the shape parameter is suscep-tible to change at an unknown time [nτ] in the sequence. Noticing thatλX∼(x;ν, )

and its distribution is not related to the scale parameter, logarithm transformations may be made for{Xi,i= , . . . ,n}as follows. Let

Yi=lnλXi, i= , , . . . ,n. ()

It can be shown that the mean ofYisμ=EY=(ν) and the mean ofY[nτ]+isμ= EY[nτ]+=(ν), where(ν) is the derivation ofln(ν); that is,

(ν) =d[ln((ν))]

dν = (ν)

(ν).

(ν) can be expressed, as in [, p.], by

(ν) = –γ +

+∞



e–t_–_e–νt  –e–t dt,

where γ is the Euler-Mascheroni constant, that is,γ = –_+∞e–xlnx dx. Since (ν) =

+∞ 

t

–e–te–νtdt> , hence(ν) is an increasing function in (, +∞).

Deﬁne

Uk= k

i= Yi–

k n

n

i= Yi,

ρ=ν–ν, μ=μ–μ.

()

Since

Uk= k

i=

lnλXi–

k n

n

i=

lnλXi= k

i=

lnXi–

k n

n

i=

(3)

are not related to the scale parameterλ, then if we know in advance or by test that there is a change in the shape parameter, we may deﬁne the estimator of the change pointτas

τ= 

nmin

k:|Uk|=max

≤j≤n|Uj|

. ()

For convenience, throughout this paper,c,c, . . . represent a constant which is

indepen-dent ofnand may take diﬀerent values in diﬀerent expressions.

The paper is arranged as follows. In Section , the change-point estimatorτis proposed based on the CUSUM method by an appropriate logarithm transformation for {Xi,i= , . . . ,n}, and its constancy and convergence rate are investigated. The proofs of theorems are given in Section .

2 Main results

Theorem  Assume that X,X, . . . ,Xnis a negative associated random variable sequence

satisfying the conditions that X, . . . ,X[nτ]are identically distributed with(x;ν,λ),and X[nτ]+, . . . ,Xnare identically distributed with(x;ν,λ).Let

k= [nτ], k= [nτ], k= [nτ] for some <τ< , ()

where[A]denotes the integer part of a number A.If theρ=ν–νis a non-zero constant, thenτis a consistent estimator ofτand

|τ–τ|=oP

n–_l₍_n₎_, _()

where l(n)is a slowly varying function withlimn→∞l(n) = +∞.

Theorem  Assume that the conditions of Theoremhold,thenτ is a strong consistent

estimator ofτ,and

|τ–τ|=o

n–+δ _, _{a.s. for some}_{ <}_δ_<

. ()

Next, we will study theOPconvergence rate ofτunder the local alternative hypothesis; that is,ρis not a constant independent ofn, but it depends onnand is denoted byρn. Noticing that ifρnis large, the change-point estimation is usually quite precise. In practice it may be more important to construct conﬁdence intervals forτwhenρnis small. We hence assume thatρn–→ asn–→ ∞. It can be seen that the results obtained in the above theorems cannot be applied here, and we need to establish stronger results than those obtained in the above theorems.

Notice thatμi=(νi),i= , . Then, by the mean theorem,μn(under the local alterna-tive hypothesis, denotingμasμn) can be expressed as

μn=μ–μ=(ν˜)(ν–ν) =(ν˜)ρn,

(4)

Theorem  Assume that X,X, . . . ,Xnis a negative associated random variable sequence,

and X, . . . ,X[nτ]are identically distributed by(x;ν,λ),and X[nτ]+, . . . ,Xnare identically distributed by(x;ν,λ).Ifμnsatisﬁes

μn–→,

√

nμn–→ ∞, ()

then

|τ–τ|=OP



nμ

n

. ()

Remark  Theorems  and  give the weak and strong consistency and convergence rates

for the change-point estimatorτ of the shape parameter in a gamma distribution. In The-orem , theOPconvergence rate of the change-point estimatorτ of the shape parameter is proposed under the local alternative condition, and it is one of the necessary conditions for studying the limiting distribution ofτ. Having thisOPvalue, we can study the limiting distribution ofτ. This will be the subject of a future paper.

3 Proof of the theorem

To prove the above theorems, we ﬁrst consider the following lemmas.

Lemma  Let A,A, . . . ,Ambe disjoint subsets of{, , . . . ,n},and let ai=(Ai)be the

num-ber of elements in Ai,i= , , . . . ,m.Assume that Z,Z, . . . ,Znare negative associated

vari-ables,then if

fi:Rai→R, i= , , . . . ,m ()

are the increasing (or decreasing) positive functions for every element, then f(Zj,j ∈ A), . . . ,fm(Zj,j∈Am)are the negative associated variables.

Proof See Joag-Dev and Proschan [].

Lemma  Let{Zj,j∈N} be a negative associated sequence with zero mean satisfying

βp=supj∈NE|Zj|p<∞for some p≥.Denoting Sa,k=

k–

j=Za+j,then there exist constants

Cp,Kp≥related to p,for all a,n∈N,such that

E|S,n|p≤Cpn

p –

n

j=

E|Zj|p; E

max

≤k≤n|Sa,k|

p

≤Kpβpn

p

_. _()

Proof See Su, Zhao and Wang [].

Lemma  Let{Zn,n≥}be a negative associated sequence,if{bk,k≥}is an increasing number serial,then∀ε> and m≤n,we have

P

max

≤k≤n



bk k

i=

(Xi–EXi)

≥ε

≤ 

ε

n

j=

Var(Xj)

b_j ; ()

P

max m≤k≤n



bk k

i=

(Xi–EXi)

≥ε

≤ 

ε _m

j=

Var(Xj)

b

m + 

n

j=m+

Var(Xj)

b

j

(5)

Proof See Hu and Zhang [].

Proof of Theorem Noticing thatln(λx) is an increasing positive function andX, . . . ,Xn are negative associated sequences, we have from Lemma  that{Yi,i= , . . . ,n}are negative associated sequences. Without loss of generality, assuming thatν>ν, by the increasing

character of(ν) in (,∞), we know thatμ=μ–μ> . By simple computation, it can

be shown that

EUk=

k(n–k)

n (μ–μ) =nτ( –τ)μ, ()

and

EUk=

⎧ ⎨ ⎩

k(n–k)

n μ, k≤k,

(n–k)k

n μ, k>k, =

⎧ ⎨ ⎩

nτ( –τ)μ, k≤k,

n( –τ)τμ, k>k.

()

Hence,

|EUk|–|EUk|= ⎧ ⎨ ⎩

n( –τ)(τ–τ)μ, k≤k,

nτ(τ–τ)μ, k>k,

≥nτ∗μ|τ–τ|, ()

whereτ∗=min{τ,  –τ}.

From the triangle inequality, it can easily be shown that

|Uk|–|Uk| ≤max

≤k≤n|Uk–EUk|+|EUk|–|EUk|,

namely,

|EUk|–|EUk| ≤ max

≤k≤n|Uk–Eμk|+|Uk|–|Uk|.

Noticing that|Uk| ≤ |Uk|, hence we have

|EUk|–|EUk| ≤ max

≤k≤n|Uk–EUk|. ()

LetY_i∗=Yi–EYi, then by () and (), it follows that

nτ∗μ|τ–τ| ≤max

≤k≤n|Uk–EUk|= max≤k≤n

k

i= Y_i∗–k

n

i= Y_i∗

= max

≤k≤n

n–k n

k

i= Y_i∗–k

n

i=k+ Y_i∗

≤max

≤k≤n

k

i= Y_i∗

+ max≤k≤n

n

i=k+ Y_i∗

(6)

Hence,

Pg(n)|τ–τ|>ε =P

|τ–τ|>

ε

g(n)

≤P



nτ∗μ

max

≤k≤n

k i= Y_i∗

+max≤k≤n

n

i=k+ Y_i∗

> ε

g(n)

≤P

max

≤k≤n

k i= Y_i∗

>

nτ∗μ

g(n)

ε

+P

max

≤k≤n

n

i=k+ Y_i∗

>

nτ∗μ

g(n)

ε

=A+A. ()

SinceY,Y, . . . ,Ynare the negative associated variables, by the Markov inequality and Lemma ,∀r> , we have

A≤E

max

≤k≤n

k i= Y_i∗

r

nτ∗με

g(n) r

= 

r

(ετ∗μ)r gr

(n) nr E

max

≤k≤n

k i= Y_i∗

r

≤ r

(ετ∗μ)r gr

(n) nr Krβrn

r  ≤ 

r_K rβr (ετ∗μ)r

n g(n)

–r

, ()

whereβr=max{E|Y∗|r,E|Y[∗nτ]+|

r_}_{is a constant independent of}_n_{. Similar arguments give} that

A≤

r_K rβr (ετ∗μ)r

n g(n)

–r

. ()

Hence, if we chooseg(n) =n 

l–(n), wherel(n) is a slowly varying function satisfying

limn→∞l(n) = +∞, then combining ()-() we have, asn→ ∞,

Pg(n)|τ–τ|>ε –→,

that is,τ is the weak consistent estimator ofτ, and

|τ–τ|=oP

n–_l₍_n₎_.

Proof of Theorem From () to () we have, for∀ε> ,

∞

n=

Pg(n)|τ–τ|>ε ≤ ∞

n=

r_K rβr (ετ∗μ)r

n g(n)

–r

if we chooseg(n) =n 

–δfor some  <_δ< . Letr>



δ, then we have, asn→ ∞,

∞

n=

Pg(n)|τ–τ|>ε <∞. ()

By theBorel-Cantellilemma, we obtain thatτ is the strongly consistent estimator ofτ, and

|τ–τ|=o

n–+δ _{a.s. for some  <}_δ_< 

(7)

Proof of Theorem To this end, we choose a value  <θ<_such thatτ∈(θ,  –θ). By () and ()-() (g(n) = ), it is easily found thatτ is a consistent estimator ofτ. Therefore,

for everyε> ,P(τ ∈/(θ,  –θ)) <ε. Thus, we now have only to examine the behavior ofUk over thosekfor whichnθ≤k≤n( –θ). To prove|τ–τ|=OP(_nμ

n), we shall prove that

P

|τ–τ|> M nμ_

–→, ()

whenM–→ ∞. For everyM> , deﬁne

Dn,M=

k:nθ≤k≤n( –θ),|k–k|> M

μ

n

.

Then we have

P

|τ–τ|> M nμ_

≤Pτ∈/(θ,  –θ) +P

|τ–τ|> M nμ

n

,τˆ∈(θ,  –θ)

≤ε+P

sup k∈Dn,M

|Uk| ≥ |Uk|

. ()

Since

P

sup k∈Dn,M

|Uk| ≥ |Uk|

=P

sup k∈Dn,M

|Uk| ≥Uk,Uk≥

+P

sup k∈Dn,M

|Uk| ≥–Uk,Uk< 

=P

sup k∈Dn,M

|Uk|–Uk≥,Uk≥

+P

sup k∈Dn,M

|Uk|+Uk≥,Uk< 

≤P

sup k∈Dn,M

(Uk–Uk)≥

+P

inf k∈Dn,M

(Uk+Uk)≤

=B+B. ()

Noticing that Uk+Uk≤ impliesUk–EUk+Uk–EUk≤–EUk–EUk≤–EUk,

which in turn implies that

Uk–EUk≤– 

EUk or Uk–EUk≤–



EUk. ()

SinceEUk> , we have

|Uk–EUk| ≥ 

EUk or |Uk–EUk| ≥

 EUk.

Furthermore, we obtain

B≤P

sup k∈Dn,M

|Uk–EUk| ≥  EUk

+P

|Uk–EUk| ≥

 EUk

(8)

It can be seen from the deﬁnition ofDn,Mthat

D≤P

sup nθ≤k≤n(–θ)

|Uk–EUk| ≥  EUk

=P

sup nθ≤k≤n(–θ)

k i= Y∗–k

n n i= Y∗ ≥ 

nτ( –τ)μn

≤P

max

≤k≤n

k i= Y_i∗–k

n

i= Y_i∗

≥



nτ( –τ)μn

≤P

max

≤k≤n

k i= Y_i∗

≥



nτ( –τ)μn

+P

max

≤k≤n

k n n i= Y_i∗

≥



nτ( –τ)μn

≤P

max

≤k≤n

k i= Y_i∗

≥



nτ( –τ)μn

+P n i= Y_i∗

≥



nτ( –τ)μn

=E+E. ()

BecauseY_∗, . . . ,Y_n∗are negative associated variables, by the Markov inequality, Lemma  and (),∀p≥, we obtain

E =P

max

≤k≤n

k i= Y_i∗

≥



nτ( –τ)μn

≤E max k k i= Y_i∗

p



nτ( –τ)μn

p

≤ p

[nτ( –τ)μn]p Kpβpn

p

 ≤_c_ 

(n_μ_n)p

–→ asnμ_n–→ ∞ . ()

Similar arguments give

E=P n i= Y_i∗

≥



nτ( –τ)μn

≤ p

[nτ( –τ)μn]p

Cpn

p –

n

i=

E|Y∗|p≤c



(n_μ_n)p

–→ asnμ_n–→ ∞ . ()

Combining ()-(), we see that

D–→ asnμn–→ ∞. ()

Similar arguments as those forDgive

D =P k i=

Y_i∗–k

n

i= Y_i∗

≥



nτ( –τ)μn

≤P k i= Y_i∗

≥



nτ( –τ)μn

(9)

+P n i= Y_i∗

≥



nτ( –τ)μn

≤c



(n_μ_n)p

–→ asnμ_n–→ ∞. ()

Combining (), () and (), we obtain

B–→. ()

Now considerB. Because of symmetry, we only consider the case ofk≤k. The event Uk–Uk≥ implies that

Uk–EUk– (Uk–EUk)

≥EUk–EUk

⇐⇒

_k

i= Y_i∗–k

n

i= Y_i∗

–

_k_

i=

Y_i∗–k

n

i= Y_i∗

≥nτ∗μn|τ–τ|

⇐⇒

– k

i=k+

Y_i∗–k–k

n

i= Y_i∗

≥nτ∗μn|τ–τ|

⇐⇒ –

k–k

k

i=k+ Y_i∗+ 

n

i=

Y_i∗≥τ∗μn.

Therefore,

B≤P

max k∈Dn,M



k–k

k

i=k+ Y_i∗+

n

i= Y_i∗

≥τ∗μn

≤P

max k∈Dn,M



k–k

k

i=k+ Y_i∗

≥  τ ∗_μ n +P max k∈Dn,M

n i= Y_i∗

≥

 nτ

∗_μ

n

=F+F. ()

From the Markov inequality, Lemma  and (), we obtain

F≤p CP (nτ∗μn)pn

p  ≤_c_



nμ

n

p 

–→. ()

Denotingσ_=Var(Y_∗), from Lemma  and (), we obtain

F =P

max nθ≤k≤k–M

μn



k–k

k

i=k+ Y_i∗

≥  τ ∗_μ n =P max M

μn≤k–k≤k–nθ



k–k

j= Y_k∗__+–_j

≥  τ ∗_μ n =P max M

μ_n≤t≤k–nθ

 t t j= Y_k∗__+–_j

(10)

≤ ∗

(τ∗μn) M

μn

j=

Var(Y_k∗ +–j)

(_μM n)

 + 

k–nθ

j=M

μn+

Var(Y_k∗ +–j) j

≤ σ

 

(τ∗)_μ

n

μ

n

M + 

k–nθ

j=M

μ_n+ 

j(j– )

=  σ

 

(τ∗)_μ

n

μ_n

M + 

μ_n

M –



k–nθ– 

=  σ

 

(τ∗)



M–

 (k–nθ– )μn

–→ asnμ_n→ ∞,M→ ∞ . ()

Combining ()-(), we have

B–→. ()

From (), () and (), we know that () holds; that is, Theorem  is proved.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

C-cT conceived of the study questions which can be done, participated in the proofs and drafted the manuscript. B-qM participated in the proofs and provided the related reference. X-cZ participated in the proof of Theorems and helped to draft the manuscript. All authors read and approved the ﬁnal manuscript.

Author details

1_{School of Mathematics, Hefei University of Technology, Heifei, Anhui 230009, China.}2_{Department of Statistics and}

Finance, University of Science and Technology of China, Hefei, Anhui 230026, China.3_{Department of Mathematics and}

Computer Science, Tongling University, Tongling, Anhui 244000, China.

Acknowledgements

We are grateful to the three referees for the valuable comments and advices that led to substantial improvement of an original draft of this paper.

The work of Dr. Tan was partially supported by a grant from the National Natural Science Foundation of China (No. 11201108), the Humanities and Social Sciences Project from Ministry of Education of China (No. 12YJC910007, 11YJC790311), the Anhui Provincial Natural Science Foundation (No. 1208085QA12) and the twelfth Five-Year plan National Science and Technology major Project (2012ZX10004609).

Received: 25 October 2012 Accepted: 18 March 2013 Published: 8 April 2013 References

1. Kander, Z, Zacks, S: Test procedures for possible changes in parameters of statistical distributions occurring at unknown time points. Ann. Math. Stat.37, 1196-1210 (1966)

2. Hsu, DA: Detecting shifts of parameter in gamma sequences with applications to stock price and air traﬃc ﬂow analysis. J. Am. Stat. Assoc.74, 31-40 (1979)

3. Diaz, J: Bayesian detection of a change of scale parameter in sequences of independent gamma random variables. J. Econom.19, 23-29 (1982)

4. Ramanayake, A, Gupta, AK: Change points with linear trend followed by abrupt change for the exponential distribution. J. Stat. Comput. Simul.72(4), 263-278 (2002)

5. Gupta, AK, Ramanayake, A: Change points with linear trend for the exponential distribution. J. Stat. Plan. Inference93, 181-195 (2001)

6. Ramanayake, A: Tests for a change point in the shape parameter of Gamma random variables. Commun. Stat., Theory Methods33(4), 821-833 (2004)

7. Tan, CC, Zhao, LC, Miao, BQ: Inference and application in ﬁnance of-distribution with at most one change-point. J. Syst. Sci. Math. Sci.27, 2-10 (2007) (in Chinese)

8. Page, ES: Continuous inspection schemes. Biometrika42, 100-114 (1954)

9. Bai, J, Perron, P: Estimating and testing linear models with multiple structural changes. Econometrica66, 47-78 (1998) 10. Braun, JV, et al.: Multiple changepoint ﬁtting via quasilikelihood with application to DNA sequence segmentation.

(11)

11. Chen, C, et al.: Nonparametric estimation of structural changepoints in volatility models for time series. J. Econom. 126, 79-114 (2005)

12. Magnus, W, Oberhettinger, F, Soni, RP: Formulas and Theorems for the Special Functions of Mathematical Physics. Springer, Berlin (1966)

13. Joag-Dev, K, Proschan, F: Negative association of random variables with application. Ann. Stat.11, 286-295 (1983) 14. Su, C, Zhao, LC, Wang, YB: Moment inequalities and weak convergence for negatively associated sequences. Sci.

China Ser. A40(2), 172-182 (1997)

15. Hu, SH, Zhang, LS: Hajek-Renyi-type inequality under second moment conditions and its application. Acta Math. Appl. Sin.28(2), 227-235 (2005) (in Chinese)

doi:10.1186/1029-242X-2013-161