• No results found

Statistical inference for the shape parameter change point estimator in negative associated gamma distribution

N/A
N/A
Protected

Academic year: 2020

Share "Statistical inference for the shape parameter change point estimator in negative associated gamma distribution"

Copied!
11
0
0

Loading.... (view fulltext now)

Full text

(1)

R E S E A R C H

Open Access

Statistical inference for the shape parameter

change-point estimator in negative

associated gamma distribution

Chang-chun Tan

1*

, Bai-qi Miao

2

and Xing-cai Zhou

3

*Correspondence:

[email protected]

1School of Mathematics, Hefei

University of Technology, Heifei, Anhui 230009, China Full list of author information is available at the end of the article

Abstract

In this paper, the change-point estimator for the shape parameter is proposed in a negative associated gamma random variable sequence. Suppose thatX1,. . .,Xnare

negative associated random variables satisfying thatX1,. . .,X[nτ0]are identically

distributed with

(x;

ν

1,

λ

), and thatX[nτ0]+1,. . .,Xnare identically distributed with

(x;

ν

2,

λ

); the change point

τ

0is unknown. The weak and strong consistency, and

the weak and strong convergence rate of the change-point estimator, are given by the CUSUM method. Furthermore, theOPconvergence rate of the change-point

estimator is presented under the local alternative hypothesis condition.

MSC: 62F12; 62G10

Keywords: change point;

-distribution; negative associated; shape parameter; convergence rate

1 Introduction

The gamma distribution occurs frequently in a variety of applications, especially in relia-bility, in survival analysis and in modeling income distributions. The density of a gamma-distributed random variableXwith a shape parameterνand a scale parameterλis given by

f(x;ν,λ) = λ ν

(ν)x

ν–eλxI (x> ), ()

whereI(·) is the indicator function,(·) is afunction with(p) =exxp–dx.

The family of gamma distributions includes the chi-squared distribution, exponential distribution and Erlang distribution. For example, the gamma distribution is an Erlang distribution with a positive integerν. When the shape parameterν= , the gamma dis-tribution is an exponential disdis-tribution with parameterλ; whenλ=, the gamma distri-bution is a chi-squared distridistri-bution, with ν degrees of freedom. The shape parameter is especially of interest in reliability theory because the gamma distribution is either a decreasing failure rate (DFR), a constant or an increasing failure rate (IFR) according to whether the shape parameter is negative, zero or positive. The shape parameter also plays an important role in renewal theory when modeling arrival times of events.

As for the gamma distribution parameter change-point problems, Kander and Zacks [] proposed a statistic for testing a change in the one-parameter exponential family; Hsu []

(2)

considered a change point for the scale parameter of gamma random variables, assuming that the shape parameter was constant; Diaz [] posed the Bayesian test regarding the scale parameter change point for the independent gamma variables; Gupta and Ramanayake [], Ramanayake and Gupta [] discussed a linear trend change for the exponential distribu-tion; Ramanayake [] proposed some tests for detecting a change in the shape parameter of gamma distributions assuming thatλis constant. The strong consistency and conver-gence rate of the change-point estimator have been investigated by applying the moving averages method (Tanet al.[]), assuming that there is at most one change point.

Change-point analysis is widely used in fields such as quality control, economics and finance, biostatistics and so on (see Page []; Bai and Perron []; Braunet al.[]; Chen

et al.[]). Change-point problems have also received considerable attention due to the wide variety of applications and recent developments in computational methods. There is a considerable body of literature on change-point analysis that assume that the random variables being considered are independent.

Let X,X, . . . ,Xn be a negative associated sequence that satisfies the conditions that

X, . . . ,X[] have the common distribution(x;ν,λ), and that X[]+, . . . ,Xnhave the

common distribution (x;ν,λ), whereτis an unknown parameter called the change

point;ν,νare the shape parameters before and after change, respectively. In this paper,

we assume that the scale parameter does not change, but the shape parameter is suscep-tible to change at an unknown time [] in the sequence. Noticing thatλX(x;ν, )

and its distribution is not related to the scale parameter, logarithm transformations may be made for{Xi,i= , . . . ,n}as follows. Let

Yi=lnλXi, i= , , . . . ,n. ()

It can be shown that the mean ofYisμ=EY=(ν) and the mean ofY[]+isμ= EY[]+=(ν), where(ν) is the derivation ofln(ν); that is,

(ν) =d[ln((ν))]

= (ν)

(ν).

(ν) can be expressed, as in [, p.], by

(ν) = –γ +

+∞

eteνt  –et dt,

where γ is the Euler-Mascheroni constant, that is,γ = –+∞exlnx dx. Since (ν) =

+∞ 

t

–eteνtdt> , hence(ν) is an increasing function in (, +∞).

Define

Uk= k

i= Yi

k n

n

i= Yi,

ρ=ν–ν, μ=μ–μ.

()

Since

Uk= k

i=

lnλXi

k n

n

i=

lnλXi= k

i=

lnXi

k n

n

i=

(3)

are not related to the scale parameterλ, then if we know in advance or by test that there is a change in the shape parameter, we may define the estimator of the change pointτas

τ= 

nmin

k:|Uk|=max

≤jn|Uj|

. ()

For convenience, throughout this paper,c,c, . . . represent a constant which is

indepen-dent ofnand may take different values in different expressions.

The paper is arranged as follows. In Section , the change-point estimatorτis proposed based on the CUSUM method by an appropriate logarithm transformation for {Xi,i= , . . . ,n}, and its constancy and convergence rate are investigated. The proofs of theorems are given in Section .

2 Main results

Theorem  Assume that X,X, . . . ,Xnis a negative associated random variable sequence

satisfying the conditions that X, . . . ,X[]are identically distributed with(x;ν,λ),and X[]+, . . . ,Xnare identically distributed with(x;ν,λ).Let

k= [], k= [], k= [] for some <τ< , ()

where[A]denotes the integer part of a number A.If theρ=ν–νis a non-zero constant, thenτis a consistent estimator ofτand

|ττ|=oP

n–l(n), ()

where l(n)is a slowly varying function withlimn→∞l(n) = +∞.

Theorem  Assume that the conditions of Theoremhold,thenτ is a strong consistent

estimator ofτ,and

|ττ|=o

n–+δ , a.s. for some <δ<

. ()

Next, we will study theOPconvergence rate ofτunder the local alternative hypothesis; that is,ρis not a constant independent ofn, but it depends onnand is denoted byρn. Noticing that ifρnis large, the change-point estimation is usually quite precise. In practice it may be more important to construct confidence intervals forτwhenρnis small. We hence assume thatρn–→ asn–→ ∞. It can be seen that the results obtained in the above theorems cannot be applied here, and we need to establish stronger results than those obtained in the above theorems.

Notice thatμi=(νi),i= , . Then, by the mean theorem,μn(under the local alterna-tive hypothesis, denotingμasμn) can be expressed as

μn=μ–μ=(ν˜)(ν–ν) =(ν˜)ρn,

(4)

Theorem  Assume that X,X, . . . ,Xnis a negative associated random variable sequence,

and X, . . . ,X[]are identically distributed by(x;ν,λ),and X[]+, . . . ,Xnare identically distributed by(x;ν,λ).Ifμnsatisfies

μn–→,

nμn–→ ∞, ()

then

|ττ|=OP

n

. ()

Remark  Theorems  and  give the weak and strong consistency and convergence rates

for the change-point estimatorτ of the shape parameter in a gamma distribution. In The-orem , theOPconvergence rate of the change-point estimatorτ of the shape parameter is proposed under the local alternative condition, and it is one of the necessary conditions for studying the limiting distribution ofτ. Having thisOPvalue, we can study the limiting distribution ofτ. This will be the subject of a future paper.

3 Proof of the theorem

To prove the above theorems, we first consider the following lemmas.

Lemma  Let A,A, . . . ,Ambe disjoint subsets of{, , . . . ,n},and let ai=(Ai)be the

num-ber of elements in Ai,i= , , . . . ,m.Assume that Z,Z, . . . ,Znare negative associated

vari-ables,then if

fi:RaiR, i= , , . . . ,m ()

are the increasing (or decreasing) positive functions for every element, then f(Zj,jA), . . . ,fm(Zj,jAm)are the negative associated variables.

Proof See Joag-Dev and Proschan [].

Lemma  Let{Zj,jN} be a negative associated sequence with zero mean satisfying

βp=supjNE|Zj|p<∞for some p≥.Denoting Sa,k=

k–

j=Za+j,then there exist constants

Cp,Kp≥related to p,for all a,nN,such that

E|S,n|pCpn

p –

n

j=

E|Zj|p; E

max

≤kn|Sa,k|

p

Kpβpn

p

. ()

Proof See Su, Zhao and Wang [].

Lemma  Let{Zn,n≥}be a negative associated sequence,if{bk,k≥}is an increasing number serial,thenε> and mn,we have

P

max

≤kn

bk k

i=

(XiEXi)

ε

≤ 

ε

n

j=

Var(Xj)

bj ; ()

P

max mkn

bk k

i=

(XiEXi)

ε

≤ 

εm

j=

Var(Xj)

b

m + 

n

j=m+

Var(Xj)

b

j

(5)

Proof See Hu and Zhang [].

Proof of Theorem Noticing thatln(λx) is an increasing positive function andX, . . . ,Xn are negative associated sequences, we have from Lemma  that{Yi,i= , . . . ,n}are negative associated sequences. Without loss of generality, assuming thatν>ν, by the increasing

character of(ν) in (,∞), we know thatμ=μ–μ> . By simple computation, it can

be shown that

EUk=

k(nk)

n (μ–μ) =( –τ)μ, ()

and

EUk=

⎧ ⎨ ⎩

k(nk)

n μ, kk,

(nk)k

n μ, k>k, =

⎧ ⎨ ⎩

( –τ)μ, kk,

n( –τ)τμ, k>k.

()

Hence,

|EUk|–|EUk|= ⎧ ⎨ ⎩

n( –τ)(τ–τ)μ, kk,

(ττ)μ, k>k,

μ|ττ|, ()

whereτ∗=min{τ,  –τ}.

From the triangle inequality, it can easily be shown that

|Uk|–|Uk| ≤max

≤kn|UkEUk|+|EUk|–|EUk|,

namely,

|EUk|–|EUk| ≤ max

≤kn|UkEμk|+|Uk|–|Uk|.

Noticing that|Uk| ≤ |Uk|, hence we have

|EUk|–|EUk| ≤ max

≤kn|UkEUk|. ()

LetYi∗=YiEYi, then by () and (), it follows that

μ|ττ| ≤max

≤kn|UkEUk|= max≤kn

k

i= Yi∗–k

n

n

i= Yi

= max

≤kn

nk n

k

i= Yi∗–k

n

n

i=k+ Yi

≤max

≤kn

k

i= Yi

+ max≤kn

n

i=k+ Yi

(6)

Hence,

Pg(n)|ττ|>ε =P

|ττ|>

ε

g(n)

P

μ

max

≤kn

k i= Yi

+max≤kn

n

i=k+ Yi

> ε

g(n)

P

max

≤kn

k i= Yi

>

μ

g(n)

ε

+P

max

≤kn

n

i=k+ Yi

>

μ

g(n)

ε

=A+A. ()

SinceY,Y, . . . ,Ynare the negative associated variables, by the Markov inequality and Lemma ,∀r> , we have

A≤E

max

≤kn

k i= Yi

r

με

g(n) r

= 

r

(ετμ)r gr

(n) nr E

max

≤kn

k i= Yi

r

≤ r

(ετμ)r gr

(n) nr Krβrn

r  ≤ 

rK rβr (ετμ)r

n g(n)

r

, ()

whereβr=max{E|Y∗|r,E|Y[∗]+|

r}is a constant independent ofn. Similar arguments give that

A≤

rK rβr (ετμ)r

n g(n)

r

. ()

Hence, if we chooseg(n) =n

l–(n), wherel(n) is a slowly varying function satisfying

limn→∞l(n) = +∞, then combining ()-() we have, asn→ ∞,

Pg(n)|ττ|>ε –→,

that is,τ is the weak consistent estimator ofτ, and

|ττ|=oP

n–l(n).

Proof of Theorem From () to () we have, for∀ε> ,

n=

Pg(n)|ττ|>ε ≤ ∞

n=

rK rβr (ετμ)r

n g(n)

r

if we chooseg(n) =n

–δfor some  <δ< . Letr>

δ, then we have, asn→ ∞,

n=

Pg(n)|ττ|>ε <∞. ()

By theBorel-Cantellilemma, we obtain thatτ is the strongly consistent estimator ofτ, and

|ττ|=o

n–+δ a.s. for some  <δ<

(7)

Proof of Theorem To this end, we choose a value  <θ<such thatτ∈(θ,  –θ). By () and ()-() (g(n) = ), it is easily found thatτ is a consistent estimator ofτ. Therefore,

for everyε> ,P(τ ∈/(θ,  –θ)) <ε. Thus, we now have only to examine the behavior ofUk over thosekfor whichkn( –θ). To prove|ττ|=OP(

n), we shall prove that

P

|ττ|> M

–→, ()

whenM–→ ∞. For everyM> , define

Dn,M=

k:kn( –θ),|kk|> M

μ

n

.

Then we have

P

|ττ|> M

∈/(θ,  –θ) +P

|ττ|> M

n

,τˆ∈(θ,  –θ)

ε+P

sup kDn,M

|Uk| ≥ |Uk|

. ()

Since

P

sup kDn,M

|Uk| ≥ |Uk|

=P

sup kDn,M

|Uk| ≥Uk,Uk≥

+P

sup kDn,M

|Uk| ≥–Uk,Uk< 

=P

sup kDn,M

|Uk|–Uk≥,Uk≥

+P

sup kDn,M

|Uk|+Uk≥,Uk< 

P

sup kDn,M

(UkUk)≥

+P

inf kDn,M

(Uk+Uk)≤

=B+B. ()

Noticing that Uk+Uk≤ impliesUkEUk+Uk–EUk≤–EUkEUk≤–EUk,

which in turn implies that

UkEUk≤– 

EUk or Uk–EUk≤–

EUk. ()

SinceEUk> , we have

|UkEUk| ≥ 

EUk or |Uk–EUk| ≥

 EUk.

Furthermore, we obtain

B≤P

sup kDn,M

|UkEUk| ≥  EUk

+P

|Uk–EUk| ≥

 EUk

(8)

It can be seen from the definition ofDn,Mthat

D≤P

sup kn(–θ)

|UkEUk| ≥  EUk

=P

sup kn(–θ)

k i= Y∗–k

n n i= Y∗ ≥ 

( –τ)μn

P

max

≤kn

k i= Yi∗–k

n

n

i= Yi

( –τ)μn

P

max

≤kn

k i= Yi

( –τ)μn

+P

max

≤kn

k n n i= Yi

( –τ)μn

P

max

≤kn

k i= Yi

( –τ)μn

+P n i= Yi

( –τ)μn

=E+E. ()

BecauseY∗, . . . ,Yn∗are negative associated variables, by the Markov inequality, Lemma  and (),∀p≥, we obtain

E =P

max

≤kn

k i= Yi

( –τ)μn

E max k k i= Yi

p

( –τ)μn

p

≤ p

[( –τ)μn]p Kpβpn

p

 ≤c

(nμn)p

–→ asn–→ ∞ . ()

Similar arguments give

E=P n i= Yi

( –τ)μn

≤ p

[( –τ)μn]p

Cpn

p –

n

i=

E|Y∗|pc

(nμn)p

–→ asn–→ ∞ . ()

Combining ()-(), we see that

D–→ asn–→ ∞. ()

Similar arguments as those forDgive

D =P ki=

Yi∗–k

n

n

i= Yi

( –τ)μn

P ki= Yi

( –τ)μn

(9)

+P n i= Yi

( –τ)μn

c

(nμn)p

–→ asn–→ ∞. ()

Combining (), () and (), we obtain

B–→. ()

Now considerB. Because of symmetry, we only consider the case ofkk. The event UkUk≥ implies that

UkEUk– (Uk–EUk)

EUk–EUk

⇐⇒

k

i= Yi∗–k

n

n

i= Yi

k

i=

Yi∗–k

n

n

i= Yi

μn|ττ|

⇐⇒

k

i=k+

Yi∗–kk

n

n

i= Yi

μn|ττ|

⇐⇒ –

k–k

k

i=k+ Yi∗+ 

n

n

i=

Yi∗≥τμn.

Therefore,

B≤P

max kDn,M

k–k

k

i=k+ Yi∗+

n

n

i= Yi

τμn

P

max kDn,M

k–k

k

i=k+ Yi

≥  τμ n +P max kDn,M

n i= Yi

 

μ

n

=F+F. ()

From the Markov inequality, Lemma  and (), we obtain

F≤p CP (μn)pn

p  ≤c

n

p

–→. ()

Denotingσ=Var(Y∗), from Lemma  and (), we obtain

F =P

max kk–M

μn

k–k

k

i=k+ Yi

≥  τμ n =P max M

μnk–kk–

k–k

k–k

j= Yk+–j

≥  τμ n =P max M

μntk–

t t j= Yk+–j

(10)

≤ ∗

(τμn)M

μn

j=

Var(Yk∗ +–j)

(μMn)

 + 

k–

j=M

μn+

Var(Yk∗ +–j) j

≤ σ

 

(τ∗)μ

n

μ

n

M + 

k–

j=M

μn+ 

j(j– )

=  σ

 

(τ∗)μ

n

μn

M + 

μn

M

k–– 

=  σ

 

(τ∗)

M

 (k–– )μn

–→ asn→ ∞,M→ ∞ . ()

Combining ()-(), we have

B–→. ()

From (), () and (), we know that () holds; that is, Theorem  is proved.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

C-cT conceived of the study questions which can be done, participated in the proofs and drafted the manuscript. B-qM participated in the proofs and provided the related reference. X-cZ participated in the proof of Theorems and helped to draft the manuscript. All authors read and approved the final manuscript.

Author details

1School of Mathematics, Hefei University of Technology, Heifei, Anhui 230009, China.2Department of Statistics and

Finance, University of Science and Technology of China, Hefei, Anhui 230026, China.3Department of Mathematics and

Computer Science, Tongling University, Tongling, Anhui 244000, China.

Acknowledgements

We are grateful to the three referees for the valuable comments and advices that led to substantial improvement of an original draft of this paper.

The work of Dr. Tan was partially supported by a grant from the National Natural Science Foundation of China (No. 11201108), the Humanities and Social Sciences Project from Ministry of Education of China (No. 12YJC910007, 11YJC790311), the Anhui Provincial Natural Science Foundation (No. 1208085QA12) and the twelfth Five-Year plan National Science and Technology major Project (2012ZX10004609).

Received: 25 October 2012 Accepted: 18 March 2013 Published: 8 April 2013 References

1. Kander, Z, Zacks, S: Test procedures for possible changes in parameters of statistical distributions occurring at unknown time points. Ann. Math. Stat.37, 1196-1210 (1966)

2. Hsu, DA: Detecting shifts of parameter in gamma sequences with applications to stock price and air traffic flow analysis. J. Am. Stat. Assoc.74, 31-40 (1979)

3. Diaz, J: Bayesian detection of a change of scale parameter in sequences of independent gamma random variables. J. Econom.19, 23-29 (1982)

4. Ramanayake, A, Gupta, AK: Change points with linear trend followed by abrupt change for the exponential distribution. J. Stat. Comput. Simul.72(4), 263-278 (2002)

5. Gupta, AK, Ramanayake, A: Change points with linear trend for the exponential distribution. J. Stat. Plan. Inference93, 181-195 (2001)

6. Ramanayake, A: Tests for a change point in the shape parameter of Gamma random variables. Commun. Stat., Theory Methods33(4), 821-833 (2004)

7. Tan, CC, Zhao, LC, Miao, BQ: Inference and application in finance of-distribution with at most one change-point. J. Syst. Sci. Math. Sci.27, 2-10 (2007) (in Chinese)

8. Page, ES: Continuous inspection schemes. Biometrika42, 100-114 (1954)

9. Bai, J, Perron, P: Estimating and testing linear models with multiple structural changes. Econometrica66, 47-78 (1998) 10. Braun, JV, et al.: Multiple changepoint fitting via quasilikelihood with application to DNA sequence segmentation.

(11)

11. Chen, C, et al.: Nonparametric estimation of structural changepoints in volatility models for time series. J. Econom. 126, 79-114 (2005)

12. Magnus, W, Oberhettinger, F, Soni, RP: Formulas and Theorems for the Special Functions of Mathematical Physics. Springer, Berlin (1966)

13. Joag-Dev, K, Proschan, F: Negative association of random variables with application. Ann. Stat.11, 286-295 (1983) 14. Su, C, Zhao, LC, Wang, YB: Moment inequalities and weak convergence for negatively associated sequences. Sci.

China Ser. A40(2), 172-182 (1997)

15. Hu, SH, Zhang, LS: Hajek-Renyi-type inequality under second moment conditions and its application. Acta Math. Appl. Sin.28(2), 227-235 (2005) (in Chinese)

doi:10.1186/1029-242X-2013-161

References

Related documents

Cuando Martín, Felipe, Cristóbal, me preguntaban dónde me metía y les explicaba mi recorrido con algunos detalles cercanos al lugar que elegí o me atrajo para

First step: we need to capture the entire packet from the access point so we are using the airodump-ng tool helps to capture packet by selecting particular Mac

There are several existent literature on the influence of demographic factors like gender, age, and level of study on the utilization of the online public

This project summarizes how to design turbine rotor blade geometry, and predict that geometry is safe on the basis of static structural analysis (based on elongation

Another image of the same sherd (Fig.  6 ) was obtained (using 30 keV at 2.2e − 2  Pa vacuum) on the Tescan Vega 3 XMU using the conductive annular mono-crystal scintillator-type

are: (1 ) only a constant, fixed number of phage genomes can function hetero- catalytically in a given infected bacterium; and (2) the genomes which do express are drawn

The 444 horsepower (SAE) is supplemented by 457 pound-feet (620 Nm) of torque, yet all of that awesome energy is kept under control by all-wheel drive and the amazing stopping

We do not doubt that there shall be further attacks against multivariate schemes, attacks against the TTS genre and even specific attacks tailored against TTS/4, but we are