• No results found

Regularized Poisson and logistic methods for spatial point processes intensity estimation with a diverging number of covariates

N/A
N/A
Protected

Academic year: 2021

Share "Regularized Poisson and logistic methods for spatial point processes intensity estimation with a diverging number of covariates"

Copied!
19
0
0

Loading.... (view fulltext now)

Full text

(1)

Regularized Poisson and logistic methods

for spatial point processes intensity estimation

with a diverging number of covariates

Achmad Choiruddin1

*Work with: Jean-fran¸cois Coeurjolly1,2 and Fr´ed´erique Letu´e1

1Laboratory Jean Kuntzmann, Universit´e Grenoble Alpes, France 2 Department of Mathematics, Universit´e du Qu´ebec `a Montr´eal, Canada

19th Workshop on Stochastic Geometry, Stereology and Image Analysis (SGSIA), Luminy, France

(2)

Contents

Introduction Motivation

Spatial point processes intensity estimation Proposed methods

Methodology Asymptotic results

Application to forestry dataset Conclusion and possible extensions

(3)

Introduction

Motivation: Barro Colorado Island (BCI) study

3604 locations of Beilschmiedia pendula Lauraceae (BPL) tree Intensity: function of covariates ρ(u; β) = exp(β>z(u)), β ∈ Rp, z(u) = {z1(u), · · · , zp(u)}>are

spatial covariates at u p = moderate!

(4)

Introduction

Motivation: Barro Colorado Island (BCI) study

What if ...

Parsimonius model

Improve the prediction

(5)

Spatial point processes

Notations and existing methods

X is a spatial point process, D is the observed domain, |D| is the volume of observed domain

The Poisson log-likelihood

`(β) = X u∈X∩D β>z(u) − Z D exp(β>z(u))du. Estimate β by solving `(1)(β) = 0

Estimating equation-based methods for more general point processes

(6)

Spatial point processes

Notations and existing methods

X is a spatial point process, D is the observed domain, |D| is the volume of observed domain

The weightedPoisson log-likelihood (Guan and Shen, 2010)

`(w, β) = X

u∈X∩D

w(u)β>z(u) − Z

D

w(u)exp(β>z(u))du.

Estimate β by solving `(1)(w, β) = 0

Estimating equation-based methods for more general point processes

(7)

Spatial point processes

Notations and existing methods

X is a spatial point process, D is the observed domain, |D| is the volume of observed domain

The weighted logistic regressionlog-likelihood (Baddeley et al., 2014; Choiruddin et al., 2017)

`(w; β) = X u∈X∩D w(u) log  ρ(β) g(u) + ρ(β)  − Z D

w(u)g(u) log ρ(β) + g(u) g(u)

 du.

Estimate β by solving `(1)(w, β) = 0

Estimating equation-based methods for more general point processes

(8)

Proposed methods

Methodology Regularized likelihood : Q(w, β) = `(w, β) − |D| p X j=1 pλj(|βj|) where :

`(w, β) is either the Poisson or logistic log-likelihoods pλ(θ) is a penalty function parameterized by λ ≥ 0

(9)

Proposed methods

Regularization methods Method Pp j=1pλj(|βj|) Ridge Pp j=1 1 2λβ 2 j Lasso Pp j=1λ|βj| Enet Pp j=1λ{α|βj| + 1 2(1 − α)β 2 j} SCAD Pp j=1pλ(|βj|), with pλ(θ) =        λθ if (θ ≤ λ) γλθ−12(θ2+λ2) γ−1 if (λ < θ < γλ) λ2(γ2−1) 2(γ−1) if (θ ≥ γλ) MC+ Pp j=1 n λ|βj| − β2 j 2γ  I(|βj| < γλ) +12γλ2I(|βj| ≥ γλ) o Adaptive Lasso Pp j=1λj|βj| Adaptive enet Pp j=1λj{α|βj| + 1 2(1 − α)β 2 j}

(10)

Asymptotic theory

Conditions

The pn-dimensional vector of true coefficient values:

β0= {β01, . . . , β0pn}

>

= {β01, . . . , β0s, β0(s+1), . . . , β0pn}

>

= {β>01, β>02}>= {β>01, 0>}>

X observed over D = Dn, n = 1, 2, . . . which expands to Rdas n → ∞.

We allow pn→ ∞ as n → ∞ and we have a fixed s.

We assume that p3n/|Dn| → 0 as n → ∞. We define an= max j=1,...,s{p 0 λn,j(|β0j|)}, bn= inf j=s+1,...,pn inf |θ|≤n θ6=0 p0λn,j(|θ|), for n= K p pn/|Dn|, and cn= max j=1,...,s{p 00 λn,j(|β0j|)}.

(11)

Asymptotic theory

Main results

Theorem 1

Under some regularity conditions, if an= O(|Dn|−1/2) and cn→ 0, there

exists a local maximizer ˆβ of Q(w, β) such that k ˆβ − β0k = OP √pn(|Dn|−1/2+ an).

Theorem 2

Under some regularity conditions, If an|Dn|1/2→ 0, bnp|Dn|/p2n→ ∞ and

pncn→ 0 as n → ∞, the root-(|Dn|/pn) consistent local maximizers

ˆ β = ( ˆβ>1, ˆβ > 2) > in Theorem 1 satisfy: (i) Sparsity: P( ˆβ2= 0) → 1 as n → ∞,

(ii) Asymptotic Normality: |Dn|1/2Σn(w, β0) −1/2 ( ˆβ1− β01) d −→ N (0, Is), where Σn(w, β0) = |Dn|{An,11(w, β0) + Πn}−1{Bn,11(w, β0) + Cn,11(w, β0)}{An,11(w, β0) + Πn}−1

(12)

Asymptotic theory

Discussion

Note! Need to obey |Dn|1/2an→ 0 and p|Dn|/p2nbn→ ∞ as

n → ∞ to satisfy the Theorem 2

Method an bn Satisfy? Ridge λn max j=1,...s{|β0j|} 0 No Lasso λn λn No Enet λn[(1 − α) max j=1,...s{|β0j|} + α] λnα No ALasso max j=1,...s{λn,j} j=s+1,...pinf {λn,j} Yes Aenet max j=1,...s{λn,j (1 − α)|β0j| + α} αj=s+1,...pinf {λn,j} Yes SCAD 0* λn** Yes MC+ 0* λn− K √ pn γ√|Dn| ** Yes

(13)

Application

Barro Colorado Island (BCI) study

Estimate BPL intensity: log ρ(u; β) = β1z1(u) + · · · + β93z93(u)

93 covariates: 2 topology, 13 soil nutrients, 78 interactions Regularized (un)weighted PL with LASSO, AL and SCAD

Method Unweighted Weighted

#Selected #No #Selected #No

LASSO 77 16 45 48

AL 50 43 10 83

(14)

Application

10 common selected covariates

Covariates Unweighted Weighted

LASSO ALASSO SCAD LASSO ALASSO SCAD

Elev 0.32 0.40 0.33 0.40 0.32 0 Slope 0.39 0.40 0.36 0.42 0.44 0 Cu 0.56 0.31 0.61 0.39 0.33 0 Mn 0.14 0.14 0.09 0.15 0.22 0 P -0.48 -0.43 -0.54 -0.33 -0.57 -1.07 Zn -0.75 -0.66 -0.83 -0.58 -0.40 0 Al:P -0.30 -0.29 -0.31 -0.28 -0.16 0 Mg:P 0.62 0.29 0.45 0.48 0.42 0 Zn:N 0.21 0.35 0.30 0 0 0.62 N.Min:pH 0.44 0.44 0.49 0.25 0.27 0

(15)

Application

10 common selected covariates

Covariates Unweighted Weighted

LASSO ALASSO SCAD LASSO ALASSO SCAD

Elev 0.32 0.40 0.33 0.40 0.32 0 Slope 0.39 0.40 0.36 0.42 0.44 0 Cu 0.56 0.31 0.61 0.39 0.33 0 Mn 0.14 0.14 0.09 0.15 0.22 0 P -0.48 -0.43 -0.54 -0.33 -0.57 -1.07 Zn -0.75 -0.66 -0.83 -0.58 -0.40 0 Al:P -0.30 -0.29 -0.31 -0.28 -0.16 0 Mg:P 0.62 0.29 0.45 0.48 0.42 0 Zn:N 0.21 0.35 0.30 0 0 0.62 N.Min:pH 0.44 0.44 0.49 0.25 0.27 0

(16)

Application

Estimates of BPL intensity (log scale)

3604 locations of BPL

LASSO WLASSO

AL WAL

(17)

Conclusion and possible extensions

Conclusion

Regularized versions of EE derived from Poisson and logistic regression likelihoods

Large classes of spatial point processes and many penalty functions

Nice theoretical properties and computationally preferable Satisfy our Theorems: SCAD, MC+, ALasso, Aenet Regularized WPL may be more appropriate for a clustered process

(18)

Conclusion and possible extensions

Possible extensions

Topology Soil Nutrients

Other trees

Deal with very large datasets → some troubles The Dantzig selector vs. penalization methods

(19)

Some References

Baddeley, A; Rubak, E; and Turner, R. (2015). Spatial point patterns: methodology and application with R, CRC press.

Baddeley, A; Coeurjolly, J.-F; Rubak, E; and Waagepetersen , R. (2014). Logistic regression for spatial Gibbs point processes, Biometrika, 101(2):377–392.

Choiruddin, A; Coeurjolly, J-F; and Letu´e, F. (2017). Convex and non-convex regularization methods for spatial point processes intensity estimation, ArXiv preprint arXiv:1703.02462.

Friedman, J; Hastie, T; Hofling H; Tibshirani, R; et al. (2007). Pathwise coordinate optimization. The Annals of Applied Statistics, 1(2):302–332. Guan, Y and Shen Y. (2010). A weighted estimating equation approach for inhomogeneous spatial point process, Biometrika, 97(4), 867-880. Thurman, A. L; Fu, R; Guan, Y; and Zhu, J. (2015). Regularized

estimating equations for model selection of clustered spatial point process, Statistica Sinica,25(1), 173-188.

References

Related documents

Given the importance of reducing power consumption in data centers and the fact that data centers have low server utilization, we conclude that dynamic server provisioning is

Helen Chang, Esther Chiang, Jancy Munguia, and Elizabeth Reilly Educational Technology Services, Bryn Mawr

CRT Recyclers Who Did Not Respond To Survey Company Name Total Tonnage of E-Waste Collected in 2003 Amount of CRT Glass Recycled in 2003 Recycled CRT glass as a percentage

In the light of the above statements, choose the most appropriate answer from the options given below :.. (A) Both A and R are correct and R is the correct explanation

She added that “the contact center where our customer’s work is being handled is state of the art, and the agent pool is mature, well educated and harder working than one would

Although there is a relevant body of research on Target Costing, many questions are still in need of clarification, especially its linkages to competitive

This will toward ‘proactive’ rather than ‘reactive’ stances in this arena is in fact in evidence in the mind and desires, if not yet the col- lective Association actions, of

IBM’s open approach – open standards, open source and open architecture – will be instrumental in the next generation of applications for student learning and for the