• No results found

Note on nonparametric $M$-estimation for spatial regression

N/A
N/A
Protected

Academic year: 2020

Share "Note on nonparametric $M$-estimation for spatial regression"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

Note on nonparametric

M-estimation for spatial regression

A. Gheriballah

a

and R. Rouane

b,∗

a,bLaboratory of Statistical and stochastic processes, Univ. Djillali Liabs, BP 89, 22000 Sidi Bel Abbs, Algeria.

Abstract

In this paper, we investigate a nonparametric robust estimation for spatial regression. More precisely, given a strictly stationary random fieldZi = (Xi,Yi),iNN, we consider a family of robust nonparametric

estimators for a regression function based on the kernel method. We establish ap-mean consistency results of the kernel estimator under some conditions.

Keywords: Quadratic error,p-mean consistency, Nonparametric regression, Spatial process, Robust estimation.

2010 MSC:26A33, 34A08, 35R12, 47H10. 2012 MJM. All rights reserved.c

1

Introduction

In the last years spatial statistics has been widely applied in diverse areas such as climatology, epidemiol-ogy, agronomy, meteorolepidemiol-ogy, econometrics, image processing, etc. There is a vast literature on spatial models, see, for example, the books by Cressie (1991), Guyon (1995), Anselin and Florax (1995), Banerjee, Carlin and Gelfand (2004), Gelfand et al. (2010) and Cressie and Wikle (2011) for broad discussion and applications. However, the nonparametric treatment of such data has so for being limited. The first results were obtained by Tran (1990). For relevant works on the nonparametric modelization of spatial data, see Biau and Cadre (2004), Carbon et al. (2007), Li et al. (2009). In this paper, we consider the problem of the estimation of the regression function as the analysis tool of such kind of data. Noting that, this model is very interesting in practice. It is used as an alternative approach to classical methods, in particular when the data are affected by the presence of outliers. There is an extensive literature on robust estimation (see, for instance Huber (1964), Robinson (1984), Collomb and H¨ardle (1986), Fan et al. (1994) for previous results and Boente et al. (2009) for recent advances and references). The first results concerning the nonparametric robust estimation in func-tional statistic were obtained by Azzedine et al. (2008). They studied the almost complete convergence of robust estimators based on a kernel method, considering independent observations. Crambes et al. (2008) stated the convergence in Lp norm in both cases (i.i.d and strong mixing). While the asymptotic normality of these estimators is proved by Attouch et al. (2010). The main goal of this paper is to study the robust nonparametric, we studyLpmean consistency results of a nonparametric estimation of the spatial regression by using the robust approach.

The paper is organized as follows. We present our model and estimator in Section 2. Section 3 is devoted to assumptions. The p-mean consistency of the robust nonparametric estimators is stated in Section 4. Proofs are provided in the appendix.

2

The model

Consider Zi = (Xi,Yi),iNN be aRR-valued measurable and strictly stationary spatial process,

defined on a probability space (Ω,A,). We assume that the process under study (Zi) is observed over a

Corresponding author.

(2)

rectangular domainIn = i= (i1, ...,iN)∈NN, 1≤ik≤nk,k=1, ...,N},n= (n1, ...,nN)∈ NN. A pointi will be referred to as asite. We will writen → ∞if min{nk} → ∞and |

nj

nk

| <Cfor a constantCsuch that

0 < C < ∞for all j, ksuch that 1 ≤ j,k ≤ N. Forn = (n1, ...,nN) ∈ NN, we set ˆn = n1×...×nN. The nonparametric model studied in this paper, denoted byθx, is implicitly defined, for all vectorsx ∈Rd, as a

zero with respect to (w.r.t.)t∈Rof the equation

Ψ(x,t) = [ψx(Yi,t)|Xi=x] =0

whereψxis a real-valued integrable function satisfying some regularity conditions to be stated below. In what follows, we suppose that, for allx∈Rd,θ

xexists and is unique (see, for instance, Boente and Fraiman (1989)). For all(x,t)∈Rd+1, we propose a nonparametric estimator ofΨ(x,t)given by

ˆ

Ψ(x,t) = ∑i∈InK(h

−1(xX

i))ψx(Yi,t) ∑i∈InK(h−1(x−Xi))

,

with the convention 0

0 = 0, whereKis a kernel and h = hn is a sequence of positive real numbers. A natural estimator ˆθxofθxis a zero w.r.t.tof the equation

ˆ

Ψ(x,t) =0.

In this work, we will assume that the random filed(Zi,iNN)satisfies the following mixing condition:

       

      

There exists a functionϕ(t)↓0 quandt→∞, such that

∀E, E0subsets ofNNwith finite cardinals

α

B(E),BE0= sup B∈B(E),C∈B(E0)

|(B∩C)−(B) (C)|

≤sCard(E), CardE0ϕ

distE,E0,

(2.1)

where B(E)(resp. BE0) denotes the Borel σ-field generated by (Zi, i∈E) (resp.

Zi, i∈E

0

), Card(E)

(resp. CardE0) the cardinality of E (resp. E0), distE,E0 the Euclidean distance between E and E0 and

s:N2→R+is a symmetric positive function nondecreasing in each variable such that either

s(n,m)≤Cmin(n,m), ∀n,m∈N (2.2)

or

s(n,m)≤C(n+m+1)eβ, ∀n,mN (2.3)

for someβe≥1 and someC>0.We assume also that the process satisfies a polynomial mixing condition:

ϕ(t)≤Ct−θ, θ>0, t∈R. (2.4)

3

Assumptions

From now on, letxstand for a fixed point inRdand we assume that theZi’s have the same distribution

with(X,Y). Moreover, we setf(·)to be the density ofXandh(·|x)the conditional density ofYgivenX=x. Consider the following hypotheses.

(H1) The functions f andhsuch that:

(i) The density f(.)has continuous derivative in the neighborhood ofxwith f(x)>0

(3)

(iii) The following functions, defined for(u,t)∈Rd+1by

 

 

g(u,t) = (ψx(Y,t)/X=u)f(u)

λ(u,t) = (ψ2x(Y,t)/X=u)f(u) and

Γ(u,t) = (∂ψx

∂t (Y,t)/X=u)f(u)

(3.5)

have, also, continuous derivative w.r.t. the first component.

(H2) The functionψxis continuous, differentiable, strictly monotone bounded w.r.t. the second component

and its derivative∂ψx(y,t)

t is bounded and continuous atθxuniformly iny.

(H3) The joint probability density fi,jofXiandXjexists and satisfies

|fi,j(u,v)−f(u)f(v)| ≤C

for some constantCand for all u, v,iandj.

(H4) The mixing coefficient defined in (2.2) satisfies, for someq>2 and some integerr≥1

lim T→∞T

a

i=T

tNr−1(ϕ(t))qr−2/qr =0,

for somea≥(rq−2)Nr/(2+rq−4r)withq>(4r−2)/r.

(H5) The probability kernel functionKis a symmetric and bounded density function onRdwith compact support,CK, and finite variance such that

|K(x)−K(y)| ≤ Mkx−ykforx,y∈CKand 0<M<∞.

(H6) The individualhnsatisfy

lim

n→∞h=0 and lim infn→∞ nˆh

(2(r−1)a+N(qr−2))/(a+N)q >0.

4

Main results

Letp∈[1,+∞[. In this section we state a pointwisep-mean consistency result for the estimator ˆθx. We start

with the case wherep=2 , we give precise asymptotic evaluations of the quadratic error of this estimator.

4.1

Mean square error

Theorem 4.1. If the assumptions (H1)-(H6) are satisfied and ifΓ(x,θx)6=0then

(θˆx−θx)2=B2(x,θx)h4+A

(x,θx)

ˆnhd +O

1 ˆ nhd

where A(x,θx) =

λ(x,θx)

Γ(x,θx)

Z

K2(t)dt and B(x,θx) = g

(2)(x,

θx)

Γ(x,θx)

Z

t2K2(t)dt

Before giving the proof, let us introduce some notation. Fory∈R, let

ˆ

g(x,t) = 1

ˆnhd

i∈In

K(h−1(x−Xi))ψx(Yi,t) and fˆ(x) =

1 ˆnhd

i∈In

K(h−1(x−Xi)).

So that if ˆf(x)6=0 we have

ˆ

Ψ(x,t) = g(xˆ ,t)

ˆ

(4)

Proof.

A Taylor expansion of the function ˆg(x, .)in a neighborhood ofθxgives:

ˆ

g(x, ˆθx) =g(xˆ ,θx) + (θˆx−θx)

t(x,θ

∗ xn)

whereθxnis between ˆθxandθxsuch that

ˆ

g(x, ˆθx) =g(x,θx) =0.

Thus, we have under the case where ˆf(x)6=0

ˆ

θx−θx=

−gˆ(x,θx)

t(x,θ

∗ xn)

.

We have by lemma 6 of Gheriballah et al (2010),

t(x,θ

xn)−Γ(x,θx)→0 almost completely ( a.co.)

It follows that

(θˆx−θx)2= Γ 1

(x,θx)

[(gˆ(x,θx))2] +o([(gˆ(x,θx))2]) + (fˆ(x) =0)

Now, Theorem 4.1 is a consequence of the following intermediate results, whose proofs are given in the Appendix.

Lemma 4.1. Under Hypotheses (H1) and (H2)-(H4), we have,

Var

ˆ

g(x,θx)

= A(x,θx)

ˆnhd +O

1 ˆ nhd

.

Lemma 4.2. Under Hypotheses (H1) and (H2)-(H4), we have,

ˆ

g(x,θx)

=B(x,θx)h2+o(h2).

Lemma 4.3. Under the conditions of Theorem 4.1, we have

(fˆ(x) =0) =O

1 ˆ nhd

p

.

4.2

Convergence in

L

p

norm

Theorem 4.2. Under conditions (H1)-(H6) and ifΓ(x,θx)6=0we get

kθˆx−θxkp=O

h2+O

1 ˆ nhd

1/2

wherek.kpis the norm Lp

Proof.We prove the case wherep =2r(for allr∈N)and we use the Holder inequality for lower values

ofp. Moreover, we use the same analytical arguments as those used in previous theorem, we have

kθˆx−θxk2r≤Ckg(xˆ ,θx)k2r+ko( [g(xˆ ,θx)])k2r. (4.6)

Furthermore, we write

kgˆ(x,θx)k2r= 1 ˆnhd

 

i∈In

ξi

!2r

 

 1/2r

.

whereξi=Kiψx(Yi,θx) =Ki[ψx(Yi,θx)−[ψx(Yi,θx)/Xi=x]]withKi=K(h−1(x−Xi)).

(5)

5

Appendix

Proof of Lemma 4.1.Let∆i(x) =K(h−1(x−Xi))ψx(Yi,θx), then

Var

ˆ

g(x,θx)

= Var 1 ˆnhd

i∈In

Kiψx(Yi,θx)

!!

= 1

ˆn2h2dVar

i∈In

∆i

= 1

ˆnh2dVar(∆i) + 1 ˆn2h2d

i6=j∈In

Cov(∆i,∆j)

Concerning the variance term, we have

Var(∆i) = [∆i]2−2[∆i]

By the stationarity of the observations(Xi,Yi)we get, firstly,

[∆i] = [K(h−1(x−X1))ψx(Y1,θx)]

= [K(h−1(x−X1))(ψx(Y1,θx)|X=X1)]

= Z

RdK(h

−1(xz))(

ψx(Y1,θx)|X=X1)f(z)dz.

Next, by a classical change of variables,u=h−1(x−z)we write

[∆i] =hd Z

RdK

(u)g(x−hu,θx)du

and by the Taylor expansion, under (H1), we obtain

[∆i] =hdg(x,θx)

Z

RdK(u)du

+o(hd) =o(hd) since g(x,θx) =0

Secondly, by a similar arguments, we have

[∆i]2 = [K2(h−1(x−X1))ψ2x(Y1,θx)]

= [K2(h−1(x−X1))(ψ2x(Y1,θx)|X=X1)]

= Z

RdK

2(h−1(xz))(

ψ2x(Y1,θx)|X1=z)f(z)dz

= hdλ(x,θx)

Z

RdK

2(u)du+o(hd).

Hence,

Var(∆i) = A(x,θx)hd+o(hd). (5.7)

Now, to evaluate the second part, denoted by Rn =

i6=j∈In

Cov(∆i(x),∆j(x))

, we divide the rectangular

regionIninto two sets.

S1={i,j∈ In: 0<kijk ≤cn},S2={i,j∈ In:kijk>cn},

wherecnis a real sequence that converges to infinity and will be made precise later:

Rn =

(i,j)∈S1

Cov ∆i(x),∆j(x)

+

(i,j)∈S2

Cov ∆i(x),∆j(x)

(6)

On the one hand, onS1we have, under (H2):

Cov ∆i(x),∆j(x)

≤ C

KiKj

−[Ki]

Kj ≤ C Z Rd Z

RdK(h

−1(xv))|f

ij(u,v)−f(u,v)|dudv

≤ C Z

RdK

(h−1(x−v))du 2

≤ Ch2d.

R1n =

(i,j)∈S1

KiKj

− [Ki]

Kj

≤ CnˆcnNh2d.

On the other hand, onS2we apply Lemma 2.1(ii) of Tran(1990) and we deduce that

Cov ∆i,∆j

≤Cϕ(kijk). (5.8)

By (5.8) we have for somev>N

R2n=

(i,j)∈S2

Cov ∆i,∆j

≤ C

(i,j)∈S2

ϕ(kijk)≤Cnˆ

i:kik≥cn

ϕ(kik)

≤ Cnˆc−vn

i:kik≥cn

kikvϕ(kik).

Takingcn=h−d/v, we see

R2n ≤ Cnˆhd

i:kik≥cn

kikvϕ(kik)

≤ Cnˆhd

i:kik≥cn

kikvϕ(kik).

Employing (H4) and the fact thata−1>2 choosevpositive numbers such that

i

kikvϕ(kik)<∞ and v>N.

Which allows us to write

R2n=o(nˆhd) and R1n =o(nˆhd)

Finally

Rn=o

1 ˆnhd

.

Combining the last result together with equation (5.7) we derive

Var

ˆ

g(x,θx)

= A(x,θx)

ˆnhd +o

1 ˆnhd

Proof of Lemma 4.2.Keeping the notation of previous lemma, we write

ˆ

g(x,θx)

=

1 ˆnhd

i∈In

Kiψx(Yi,θx)

!# = 1 hd ∆i = 1 hd Z

RdK(h

−1(xz))(

(7)

Next, by the classical change of variables,u=h−1(xz)we have

1

hd[∆i] =

Z

RdK

(u)g(x−hu,θx)du

Using a Taylor expansion of order two , under (H1), we obtain

ˆ

g(x,θx)

=B(x,θx)h2+o(hd) =o(hd).

Proof of Lemma 4.3.We have

(fˆ(x) =0) = (fˆ(x)≤ f(x)−e)

≤ (|fˆ(x)−f(x)| ≥e)

The Markov’s inequality allows to get, for anyp>0,

(fˆ(x) =0)≤ (|fˆ(x)−f(x)|)

p

ep

This yields the proof.

6

Acknowledgment

The authors wish to thank the referee for helpful comments.

References

[1] Anselin, L., Florax, R.J.G.M., (1995). New Directions in Spatial Econometrics. Springer, Berlin.

[2] Attouch, M., Laksaci, A., Ould-Sad, E. (2010). Asymptotic normality of a robust estimator of the regres-sion function for functional time series data. J. Korean Stat. Soc. 39, 489-500.

[3] A. Gheriballa, A. Laksaci, R. Rouane. (2010). Robust nonparametric estimation for spatial regression, J. Stat. Plan. Infer 140, 1656 -1670.

[4] Azzeddine, N., Laksaci, A., Ould-Sad, E. (2008) On the robust nonparametric regression estimation for functional regressor. Statistics & Probability Letters, 78, 3216-3221.

[5] Banerjee, S., Carlin, B.P. and Gelfand, A.E. (2004). Hierarchical Modeling and Analysis for Spatial Data. Boca Raton, FL: Chapman and Hall/CRC Press.

[6] Biau, G. and Cadre, B. (2004). Nonparametric spatial prediction, Statist. Inference Stoch. processes, 7, 327-349.

[7] Boente, G., Gonzalez-Manteiga, W., Gonzalez, A., (2009). Robust nonparametric estimation with missing data. J. Statist. Plann. Inference 139, 571-592.

[8] Boente, G. and Fraiman, R. (1989). Nonparametric regression estimation, Journal of Multivariate Analy-sis, 29, 180-198.

[9] Carbon, M., Tran, L. T. and Wu, B. (1997). Kernel density estimation for random fields, Stat. Prob. Lett. 36, 115-125.

[10] Carbon, M., Francq, C., Tran, L.T., (2007). Kernel regression estimation for random fields. J. Statist. Plann. Inference 137, 778-798.

(8)

[12] Cressie, N.A.C., (1991). Statistics for spatial Data, Wiley Series in Probability and Mathematical Statistics. Wiley, New York.

[13] Cressie, N.A.C. and Wikle, C.K. (2011). Statistics for Spatio-Temporal Data. New York: Wiley.

[14] Crambes, C, Delsol, L., Laksaci, A.(2008). Robust nonparametric estimation for functional data. J. Non-parametr. Stat. 20, 573-598.

[15] Fan, J., Hu, T.C., Truong, Y.K., (1994). Robust nonparametric function estimation. Scand. J. Statist. 21, 433-446.

[16] Gelfand, A.E., Diggle, P., Guttorp, P. and Fuentes, M. (2010). Handbook of Spatial Statistics.Publisher: CRC/Taylor and Francis.

[17] Guyon, X., (1995). Random Fields on a Network-Modeling, Statistics, and Applications. Springer, New York.

[18] J. Gao, Z. Lu, and D. Tjøstheim, (2008). Moment Inequalities for Spatial Processes, Statist. Probab. Lett. 78, 687-697.

[19] Huber, P.J., (1964). Robust estimation of a location parameter. Ann. Math. Statist. 35, 73-101.

[20] Li, J., Tran, L.T., (2009). Nonparametric estimation of conditional expectation. J. Statist. Plann. Inference 139, 164-175.

[21] Robinson, R., (1984). Robust Nonparametric Autoregression. Lecture Notes in Statistics, vol. 26. Springer, New York, pp. 247-255.

[22] Schabenberger, O. and Gotway, C.A. (2004). Statistical Methods for Spatial Data Analysis (Texts in Statis-tical Science Series). Boca Raton: Chapman and Hall/CRC.

[23] S. A. Ould Abdi, S. Dabo-Niang, A.Diop, and A. Ould Abdi,(2010). Consistency of a Nonparametric Conditional Quantile Estimator for Random Fields. Mathematical Methods of Statistics,19, 1-21.

[24] Tran, L. T. (1990). Kernel density estimation on random fields, Journal of Multivariate Analysis, vol. 34, pp. 37-53.

Received: February 28, 2013;Accepted: November 25, 2013

UNIVERSITY PRESS

References

Related documents

The purpose of the Community Support and Assistance (CSA) Budget Control Level (formerly Transitional Living and Support) is to provide resources and services to Seattle's

When the counter value is “8” (reload value – 1), the counter value will be deleted with the next valid edge and an interrupt will be released, i.e. when the reload value is

•Courses taught: research methods for Tech/Pro; research writing, Gender, Rhetoric and the Body, Intro to Rhetoric •Research interests: writing research and administration •Two

The spatial keyword based pruning is applied in the sense that only objects within the search region are accessed dur- ing the network expansion (i.e., network distance constraint)

http://gtri.com/downloads/FinancialCalculatorWMTSWLAN.xlsx The costs of implementing patient monitoring on Wi-Fi are significantly less than on a WMTS network..

Study 2 provides initial support for our hypothesis that in ambiguous situations, observers make use of the social dissimilarity rule to make a moral judgment about the situation:

Specifically, as the FDIC has noted, enhancing the insurance determination process reduces the chance that policymakers will invoke the systemic risk exception of the Federal

After installing AI Suite II from the motherboard support DVD, launch ASUS Update by clicking Update&gt; ASUS Update on the AI Suite II main menu bar.. Using