• No results found

Maximum likelihood Estimation for Stochastic Differential Equations with two Random Effects in the Diffusion Coefficient

N/A
N/A
Protected

Academic year: 2021

Share "Maximum likelihood Estimation for Stochastic Differential Equations with two Random Effects in the Diffusion Coefficient"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

Maximum likelihood Estimation for Stochastic Differential Equations with

two Random Effects in the Diffusion Coefficient

Alsukaini Mohammed Sari

1,2

, AlkreemawiWalaa khazaal

1,2

, Wang xiang-jun

1* 1

School of Mathematics and Statistics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074P.R.China.

2

Department of Mathematics, College of Science, Basra University, Basra, Iraq [email protected],2

Abstract

We study 𝑛 independent stochastic processes 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 , 𝑖 = 1, … , 𝑛 , defined by a stochastic differential equation

with diffusion coefficients depending nonlinearly on a random variables βˆ…π‘– and πœ‡π‘– (the random effects).The distributions of

the random effects βˆ…π‘– and πœ‡π‘– depends on unknown parameters which are to be estimated from the continuous

observations of the processes 𝑋𝑖 𝑑 . When the distributions of the random effects βˆ…π‘– ,πœ‡π‘– are Gaussian and exponential

respectively, we obtained an explicit formula for the likelihood function and the asymptotic properties (consistency and asymptotic normality) of the maximum likelihood estimator (MLE) are derived when 𝑛 tend to infinity.

Keywords:

stochastic differential equations, Maximum likelihood estimator; nonlinear random effects; strong

consistency; asymptotic normality

.

Council for Innovative Research

Peer Review Research Publishing System

(2)

1 INTRODUCTION

Stochastic differential equations play an important role in many areas of science fields as physics, engineering, chemistry, neuroscience, biology, finance (Gugushvili and P. Spreij (2012)[11]). Statistical estimation of parameters in the diffusion processes has been studied for a long time; Feigin [10] provided a useful historical overview of the early studies and introduced a general asymptotic theory of maximum likelihood estimation for continuous diffusion processes. In the recent years, the stochastic differential equations with random effects have been the subject of diverse applications such as pharmacokinetic/pharmacodynamics, neuronal modeling (Delattre and Lavelle, 2013[5], Donnet and Samson, 2013[9], Picchini et al. 2010[15]).Maximum likelihood estimator of the parameters of the random effect, is generally not possible, because of the likelihood function is not available in most cases, exceptin (Ditlevsen and De Gaetano (2005) [5]) and [8] which are found a special case of SDE and derived an explicit MLE. Many references proposed approximations for the unknown likelihood function, for general mixed SDEs an approximations of the likelihood have been proposed (Picchini and Ditlevsen, 2011[14]), linearization (Beal and Sheiner (1982)[4]),Laplace's approximation (Wolfinger, (1993)[17]) or approximating the conditional transition density of the diffusion process given the random effects by a Hermit expansion, (AΓ―t-Sahalia (2002)[1]).

Delattre et al. (2012) [7] and alkreemawi et al. (2015) [2] are studied the maximum likelihood estimator for random effects

in more generally for fixed𝑇 and 𝑛 tending to infinity (for non i.i.d. sample paths, see Maitra et al. (2014) [13]) and they

found an explicit expression for likelihood function and exact likelihood estimator by investigate the linear random effect in the drift (multiple and additive case respectively) together with a specific distribution for the random effect. Almost researcher studied the random effect in the drift not in diffusion except Delattre et al.(2014) [6] and alsukaini et al. (2015) [3]who used one random effect in the diffusion coefficient (linearly and nonlinearly respectively) with a specific distributions and focus on (discretely and continuously) observed SDEs respectively.

In the present work we focus on stochastic differential equation with two random effects in diffusion term and without

random effect in drift term. We consider 𝑛 real valued stochastic processes 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 , 𝑖 = 1, … , 𝑛 , with dynamics

ruled by the following SDEs:

whereπ‘Š1, … , π‘Šπ‘› are 𝑛 independent wiener processes, βˆ…1 , … , βˆ…π‘› and πœ‡1 , … , πœ‡π‘›are 𝑛𝑖. 𝑖. 𝑑. random variables taking values in

( ℝ and ℝ+) respectively, βˆ…

1 , … , βˆ…π‘› ,πœ‡1 , … , πœ‡π‘› and π‘Š1, … , π‘Šπ‘› are independent and π‘₯𝑖, 𝑖 = 1, … , 𝑛 are known real values.

The functions 𝑏 π‘₯ and 𝜍(π‘₯)are known real valued. Each process𝑋𝑖 𝑑 represents an individual, the variables βˆ…π‘– and πœ‡π‘–

represents the random effects of individual 𝑖, the random variablesβˆ…1 , … , βˆ…π‘› have a common distribution 𝑔 πœ‘, πœƒ π‘‘πœ(πœ‘)

on ℝ and the random variables πœ‡1 , … , πœ‡π‘› have a common distribution 𝑕 πœ‡, 𝛽 𝑑𝑒(πœ‡) on ℝ+where πœƒ and 𝛽 are an unknown

parameters belonging to a set 𝚯 βŠ‚ ℝ𝑝 where 𝜐 and 𝑒 are a dominating

measures.

Our aim is to estimate πœ“ = (πœƒ, 𝛽) from the continuous observations 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 , 𝑖 = 1, … , 𝑛 , we focus on a special

case of nonlinear random effect in the diffusion coefficient in the model (1), i.e. Οƒ π‘₯, βˆ…π‘–, πœ‡π‘– = 1

βˆ…π‘–+πœ‡π‘– 𝜍(π‘₯) , where 𝜍 is a

known real function and βˆ…π‘– is a Gaussian and πœ‡π‘– is an exponential, we find an explicit likelihood formula and the maximum

likelihood estimator of πœ“ .We use the following sufficient statistics π‘ˆπ‘– and 𝑉𝑖 as in [7]:

π‘ˆπ‘–= 𝑏 𝑋𝑖 𝑠 𝜍2(𝑋 𝑖 𝑠 ) 𝑇𝑖 0 𝑑𝑋𝑖 𝑠 , 𝑉𝑖= 𝑏2 𝑋 𝑖 𝑠 𝜍2(𝑋 𝑖 𝑠 ) 𝑇𝑖 0 𝑑𝑠 , 𝑖 = 1, … , 𝑛.

The consistency and asymptotic normality of the maximum likelihood estimator of πœ“ are proved without invoking the

result in the literature.

The rest of this paper is organized as follows. Section 2 contains the notation and assumptions that we will need throughout the paper. The explicit likelihood function, specific distributions for the random effects and the maximum likelihood estimators of the parameters of random effects are introduced in section 3. In section 4 we study the asymptotic properties of the maximum likelihood estimator when the random effects are Gaussian and exponential distribution respectively.

2 Notations and Assumptions

Consider 𝑛 real valued stochasticprocesses (𝑋𝑖 𝑑 , 𝑑 β‰₯ 0), 𝑖 = 1, … , 𝑛with dynamics ruled by (1). The

processesπ‘Š1, … , π‘Šπ‘›and the random variables βˆ…1 , … , βˆ…π‘›and πœ‡1 , … , πœ‡π‘›are defined on a common probability space

(Ξ©, β„±, β„™).Consider the filtration(ℱ𝑑, 𝑑 β‰₯ 0)defined byℱ𝑑= 𝜍(βˆ…π‘–, πœ‡π‘–, π‘Šπ‘– 𝑠 , 𝑠 ≀ 𝑑, 𝑖 = 1, … , 𝑛).As ℱ𝑑= 𝜍(βˆ…π‘–, πœ‡π‘–, π‘Šπ‘– 𝑠 , 𝑠 ≀ 𝑑)⋁ℱ𝑑𝑖

withℱ𝑑𝑖 = 𝜍(βˆ…π‘–, βˆ…π‘—, πœ‡π‘–, πœ‡π‘—, π‘Šπ‘— 𝑠 , 𝑠 ≀ 𝑑 , 𝑗 β‰  𝑖) independent of π‘Šπ‘– , each process π‘Šπ‘– is a (ℱ𝑑, 𝑑 β‰₯ 0)-Brownian motion.

Moreover, the random variables βˆ…π‘– , πœ‡π‘– are β„±0 – measurable. We used β–‘ referring to the end of the proofs.

We assume that:

H1the function 𝜍 belongs to 𝑐2(ℝ Γ— ℝ Γ— ℝ+) and for all π‘₯ ∈ ℝ , 0 < 𝜍

02 ≀ 𝜍2(π‘₯, πœ‘, πœ‡) ≀ 𝜍12

From H1 , the process (𝑋𝑖 𝑑 ) is well define and (βˆ…π‘–, πœ‡π‘–, 𝑋𝑖 𝑑 ) adapted to filtration (ℱ𝑑, 𝑑 β‰₯ 0) .The 𝑛 processes

(3)

π‘‘π‘‹π‘–πœ‘ ,πœ‡ 𝑑 = 𝑏 π‘‹π‘–πœ‘ ,πœ‡ 𝑑𝑑 + 𝜍 π‘‹π‘–πœ‘ ,πœ‡, πœ‘, πœ‡ π‘‘π‘Šπ‘– 𝑑 , 𝑋𝑖

πœ‘,πœ‡ 0 = π‘₯𝑖 (2).

Admits a unique strong solution process (π‘‹π‘–πœ‘ ,πœ‡ 𝑑 , 𝑑 β‰₯ 0) adapted to filtration (ℱ𝑑, 𝑑 β‰₯ 0) . We infer that the conditional

distribution of 𝑋𝑖 given βˆ…π‘–= πœ‘ and πœ‡π‘–= πœ‡ identical to the distribution of 𝑋𝑖

πœ‘ ,πœ‡

.

3 Likelihood, A specific distributions for the random effects and Maximum likelihood

estimators

3.1 Likelihood

We introduce the distribution π‘„πœ‘,πœ‡π‘₯𝑖,𝑇𝑖of (𝑋

𝑖 πœ‘ ,πœ‡

𝑑 , 𝑑 ∈ 0, 𝑇𝑖 ).

Letπ‘ƒπœ“π‘– = 𝑔 πœ‘, πœƒ 𝑑𝑣(πœ‘)⨂ 𝑕 πœ‡, 𝛽 𝑑𝑒(πœ‡)β¨‚π‘„πœ‘π‘₯

𝑖,𝑇

𝑖denote the joint distribution of (βˆ…

𝑖, πœ‡π‘–, 𝑋𝑖 𝑑 ) and let π‘„πœ“π‘– denote the

marginal distribution of (𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 ). Let us consider the following assumption:

H2For 𝑖 = 1, … , 𝑛 and for all πœ‘, πœ‡, πœ‘β€², πœ‡β€², π‘„πœ‘,πœ‡π‘₯

𝑖,𝑇 𝑖 𝑏 2 𝑋 𝑖 πœ‘ ,πœ‡ 𝑑 𝜍2 𝑋 𝑖 πœ‘ ,πœ‡ 𝑑 ,πœ‘β€²,πœ‡β€² 𝑇𝑖 0 𝑑𝑑 < +∞ = 1.

Proposition 3.1

under H1-H2 and let πœ‘ ∈ ℝ , πœ‡ ∈ ℝ+, we have, the distribution 𝑄 πœ‘,πœ‡ π‘₯𝑖,𝑇

𝑖are absolutely continuous

w.r.t. 𝑄𝑖= 𝑄 πœ‘0,πœ‡0 π‘₯𝑖,𝑇 𝑖 with density: π‘‘π‘„πœ‘,πœ‡π‘₯ 𝑖,𝑇 𝑖 𝑑𝑄𝑖 𝑋𝑖 = 𝐿𝑇𝑖 𝑋𝑖, πœ‘, πœ‡ = exp 𝑏 𝑋𝑖 𝑠 𝜍2 𝑋 𝑖 𝑠 , πœ‘, πœ‡ 𝑇𝑖 0 𝑑𝑋𝑖 𝑠 βˆ’ 1 2 𝑏2 𝑋 𝑖 𝑠 𝜍2 𝑋 𝑖 𝑠 , πœ‘, πœ‡ 𝑇𝑖 0 𝑑𝑠 3 .

(See Liptser and Shiryaev [12]), which is admits a continuous version 𝑄𝑖 a.s.

Proof: (see the proof of proposition 2) in [7].

By independent of individuals, π‘ƒπœ“= ⨂𝑖=1𝑛 π‘ƒπœ“π‘– is the distribution of (βˆ…π‘–, πœ‡π‘–, 𝑋𝑖 . ), 𝑖 = 1, … , 𝑛 and π‘„πœ“= ⨂𝑖=1𝑛 π‘„πœ“π‘– is the

distribution of the sample 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 , 𝑖 = 1, … , 𝑛 .

We can compute the density of π‘„πœ“ w.r.t. 𝑄 = ⨂𝑖=1𝑛 𝑄𝑖 as follow:

π‘‘π‘„πœ“

𝑑𝑄𝑖 𝑋𝑖 = 𝐿𝑇𝑖 𝑋𝑖, πœ‘, πœ‡ 𝑔 πœ‘, πœƒ 𝑕 πœ‡, 𝛽 𝑑𝑣(πœ‘)𝑑𝑒(πœ‡)

ℝ

ℝ+ = 𝛾𝑖

𝑋𝑖, πœ“ .

The distribution π‘„πœ“ admits a density given by:

π‘‘π‘„πœ“

𝑑𝑄 𝑋1, … , 𝑋𝑛 = 𝛾𝑖 𝑋𝑖, πœ“ ,

𝑛

𝑖=1

And the exact likelihood of whole sample 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇𝑖 , 𝑖 = 1, … , 𝑛 is

πœ‰π‘›(πœ“) = 𝛾𝑖 𝑋𝑖, πœ“ . 𝑛

𝑖=1 3.2 A Specific distributions for the random effects

Consider model (1) with nonlinear random effects in the diffusion coefficient 𝜍 π‘₯, πœ‘, πœ‡ =πœ‘ +πœ‡1 𝜍(π‘₯) where πœ‘ ∈ ℝ , πœ‡ ∈

ℝ+ and 𝑏 . , 𝜍(. ) are known functions. We assume that:

𝑏2 𝑋 𝑖 𝑠 𝜍2 𝑋 𝑖 𝑠 𝑇𝑖 0 𝑑𝑠 < ∞ , π‘„πœ‘ ,πœ‡π‘₯ 𝑖,𝑇 π‘–βˆ’ π‘Ž. 𝑠,

for all πœ‘, πœ‡ and for 𝑖 = 1, … , 𝑛 ; 𝑇𝑖= 𝑇, π‘₯𝑖= π‘₯, so that 𝑋𝑖 𝑑 , 𝑑 ∈ 0, 𝑇 , 𝑖 = 1, … , 𝑛 are 𝑖. 𝑖. 𝑑. We will use the well define

statistics as follow: π‘ˆπ‘–= 𝑏 𝑋𝑖 𝑠 𝜍2(𝑋 𝑖 𝑠 ) 𝑇 0 𝑑𝑋𝑖 𝑠 , 𝑉𝑖= 𝑏2 𝑋 𝑖 𝑠 𝜍2(𝑋 𝑖 𝑠 ) 𝑇 0 𝑑𝑠 4 . So that the density 𝛾𝑖(𝑋𝑖, πœ“) is given by:

𝛾𝑖 𝑋𝑖, πœ“ = exp πœ‘ + πœ‡ 2 π‘ˆπ‘–βˆ’ 1 2𝑉𝑖 𝑔 πœ‘, πœƒ 𝑕 πœ‡, 𝛽 𝑑𝑣 πœ‘ 𝑑𝑒 πœ‡ ℝ ℝ+ 5 .

(4)

For a general distributions, 𝑔 πœ‘, πœƒ 𝑑𝑣 πœ‘ of the random effect βˆ…π‘– and 𝑕 πœ‡, 𝛽 𝑑𝑣(πœ‘)of the random effect πœ‡, it is not

possible find an explicit expression for 𝛾𝑖(𝑋𝑖, πœ“) above, therefor we propose a specific distributions (Gaussian

(πœ†, πœ”2)for the random effects πœ‘ and exponential (𝛽) for the random effect πœ‡ ) which will give an explicit likelihood and

then find the maximum likelihood estimators of the unknown parameters. In the next proposition an evident

expression for 𝛾𝑖(𝑋𝑖, πœ“)is obtained when the above distributions of the random effects is with unknown parameter πœ“ =

(πœ†, πœ”2, 𝛽) ∈ ℝ Γ— ℝ+Γ— ℝ+.The true value is denoted byπœ“

0= (πœ†0, πœ”20, 𝛽0).

Proposition 3.2

suppose that 𝑔 πœ‘, πœƒ 𝑑𝑣 πœ‘ = 𝒩(πœ†, πœ”2), and 𝑕 πœ‡, 𝛽 𝑑𝑒 πœ‡ = 𝑒π‘₯𝑝(𝛽) then:

𝛾𝑖 𝑋𝑖, πœ“ = πœ‹π›½ 𝑀𝑖 𝑒π‘₯𝑝 βˆ’1 4 (𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘–)2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 + πœ† 2βˆ’ πœ†(1 βˆ’ 2𝑀 π‘–πœ”2) 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 , where 𝑀𝑖= π‘ˆπ‘–βˆ’ 1 2𝑉𝑖 .

Proof:

from (5) we compute the joint density of βˆ…π‘–, πœ‡π‘–, 𝑋𝑖 :

exp πœ‘ + πœ‡ 2 π‘ˆ π‘–βˆ’ 1 2𝑉𝑖 Γ— 1 2πœ‹πœ”2𝑒π‘₯𝑝 βˆ’ 1 2πœ”2 πœ‘ βˆ’ πœ† 2 Γ— 𝛽𝑒π‘₯𝑝 βˆ’π›½πœ‡ . Let 𝑀𝑖= π‘ˆπ‘–βˆ’ 1

2𝑉𝑖 , then the exponent become:

𝐷𝑖= πœ‘2π‘€π‘–βˆ’

1

2πœ”2 πœ‘ βˆ’ πœ† 2+ 2πœ‘πœ‡π‘€π‘–+ πœ‡2π‘€π‘–βˆ’ π›½πœ‡. 6 .

We will compute the first part (πœ‘2𝑀

π‘–βˆ’ 1

2πœ”2 πœ‘ βˆ’ πœ†

2+ 2πœ‘πœ‡π‘€

𝑖) of the exponent as follow:

πœ‘2𝑀 π‘–βˆ’ 1 2πœ”2 πœ‘ βˆ’ πœ† 2+ 2πœ‘πœ‡π‘€π‘–= π‘€π‘–βˆ’ 1 2πœ”2 πœ‘2+ πœ† πœ”2+ 2πœ‡π‘€π‘– πœ‘ βˆ’ πœ†2 2πœ”2 =βˆ’1 2 1 πœ”2βˆ’ 2𝑀𝑖 πœ‘2βˆ’ 2 πœ† + 2πœ”2𝑀 π‘–πœ‡ 1 βˆ’ 2πœ”2𝑀 𝑖 πœ‘ βˆ’ πœ† 2 2πœ”2 =βˆ’1 2 1 πœ”2βˆ’ 2𝑀𝑖 πœ‘2βˆ’ πœ† + 2πœ”2𝑀 π‘–πœ‡ 1 βˆ’ 2πœ”2𝑀 𝑖 2 βˆ’ πœ† + 2πœ” 2𝑀 π‘–πœ‡ 1 βˆ’ 2πœ”2𝑀 𝑖 2 βˆ’ πœ† 2 2πœ”2 =βˆ’1 2 1 βˆ’ 2πœ”2𝑀 𝑖 πœ”2 πœ‘ βˆ’ πœ† + 2πœ”2𝑀 π‘–πœ‡ 1 βˆ’ 2πœ”2𝑀 𝑖 2 + πœ† + 2πœ” 2𝑀 π‘–πœ‡ 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ πœ†2 2πœ”2 .

Now, bysplliting the result into two parts that are independent and dependent on the random effect πœ‘ respectively, the integral of the dependent part is the integral of a Gaussian density.

Then the first integral in (5) with respect to πœ‘ yields the following result : 1 1 βˆ’ 2πœ”2𝑀 𝑖 𝑒π‘₯𝑝 πœ† + 2πœ” 2𝑀 π‘–πœ‡ 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ πœ† 2 2πœ”2 .

By substituting in (5), the second part of the exponent is: 𝐸𝑖= πœ† + 2πœ”2𝑀 π‘–πœ‡ 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 + πœ‡2𝑀 π‘–βˆ’ π›½πœ‡ βˆ’ πœ†2 2πœ”2 = 2πœ” 2𝑀 𝑖2 1 βˆ’ 2πœ”2𝑀 𝑖+ 𝑀𝑖 πœ‡ 2βˆ’ 𝛽 βˆ’ 2πœ†π‘€π‘– 1 βˆ’ 2πœ”2𝑀 𝑖 πœ‡ + πœ†2βˆ’ πœ† 1 βˆ’ 2πœ”2𝑀 𝑖 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 = 𝑀𝑖 1 βˆ’ 2πœ”2𝑀 𝑖 πœ‡2βˆ’ 𝛽 1 βˆ’ 2πœ”2𝑀𝑖 βˆ’ 2πœ†π‘€π‘– 𝑀𝑖 πœ‡ + πœ† 2βˆ’ πœ† 1 βˆ’ 2πœ”2𝑀 𝑖 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 = 𝑀𝑖 1 βˆ’ 2πœ”2𝑀 𝑖 πœ‡ βˆ’1 2 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 𝑀𝑖 2 βˆ’ 1 2 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 𝑀𝑖 2 + πœ† 2βˆ’ πœ† 1 βˆ’ 2πœ”2𝑀 𝑖 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 = βˆ’1 2 𝑀𝑖 πœ”2𝑀 π‘–βˆ’ 1 2 πœ‡ βˆ’1 2 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 𝑀𝑖 2 βˆ’1 4 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 + πœ†2βˆ’ πœ† 1 βˆ’ 2πœ”2𝑀 𝑖 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 .

Now, by rearrange the second integral we see that the first part is normal depend on the random effect πœ‡ with mean is

𝛿𝑖= 1 2 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 𝑀𝑖 ,

(5)

And variance, πœŒπ‘–2= πœ”2𝑀 π‘–βˆ’ 1 2 𝑀𝑖 , Then, the conditional distribution of (βˆ…π‘–, πœ‡π‘–) given 𝑋𝑖is 𝒩(𝛿𝑖, πœŒπ‘–2).

And hence, 𝛾𝑖 𝑋𝑖, πœ“ = πœ‹π›½ 𝑀𝑖 𝑒π‘₯𝑝 βˆ’1 4 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 +πœ† 2βˆ’ πœ†(1 βˆ’ 2𝑀 π‘–πœ”2) 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 . β–‘

3.3 Maximum likelihood estimator for

πœ“ = (πœ†, πœ”2, 𝛽):

Since we find the density 𝛾𝑖 𝑋𝑖, πœ“ , a natural approach to estimateπœ“ is the maximum likelihood estimation, so in order

to find MLE πœ“ , the likelihood function is written as:

πœ‰π‘› πœ“ = 𝑛 𝑖=1 πœ‹π›½ 𝑀𝑖 𝑒π‘₯𝑝 βˆ’1 4 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 + πœ† 2βˆ’ πœ† 1 βˆ’ 2𝑀 π‘–πœ”2 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 . And hence, the logarithm of likelihood function is,

ℒ𝑛 πœ“ = π‘™π‘œπ‘” 𝑛𝑖=1 πœ‰π‘›(πœ“) (7) = π‘™π‘œπ‘”πœ‹12𝛽𝑛+1 2log 𝑀𝑖 βˆ’ 1 4 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 βˆ’ πœ† 2βˆ’ πœ†(1 βˆ’ 2𝑀 π‘–πœ”2) 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 . 𝑛 𝑖=1

We will study the following score function

𝐺𝑛 πœ“ = βˆ‚ βˆ‚πœ‡β„’π‘› πœ“ , βˆ‚ βˆ‚π›½β„’π‘ πœ“ , βˆ‚ βˆ‚πœ”2ℒ𝑛 πœ“ β€²

, where π‘₯β€²denotes the transpose of π‘₯, such that:

βˆ‚ βˆ‚πœ†β„’π‘› πœ“ = 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 1 βˆ’ 2πœ”2𝑀 𝑖 +2πœ† βˆ’ (1 βˆ’ 2π‘€π‘–πœ” 2) 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 𝑛 𝑖=1 , βˆ‚ βˆ‚πœ”2ℒ𝑛 πœ“ = 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 1 2 𝛽 1 βˆ’ 2π‘€π‘–πœ”2 βˆ’ 2πœ†π‘€π‘– 2 1 βˆ’ 2πœ”2𝑀 𝑖 2 βˆ’ πœ†2 2 βˆ’ 8𝑀 π‘–πœ”2 2πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 2+ πœ† 2 πœ”2 2 , 𝑛 𝑖=1 βˆ‚ βˆ‚π›½β„’π‘› πœ“ = 1 π›½βˆ’ 1 2𝛽 1 βˆ’ 2π‘€π‘–πœ” 2 𝑀𝑖 + πœ† . 𝑛 𝑖=1

When πœ”02, 𝛽0 are known, the explicit estimator πœ†0 is:

πœ† 𝑛= 1βˆ’2π‘€π‘–πœ”02𝛽0 2𝑀𝑖 𝑛 𝑖=1 1βˆ’2π‘€π‘–πœ”02 1βˆ’2π‘€π‘–πœ”02 𝑛 𝑖=1 ,

and when πœ”02, πœ†0 are known, the explicit estimator of 𝛽0 is:

𝛽 𝑛= βˆ’2π‘›πœ† + 4𝑛2πœ†02+ 8𝑛𝐾𝑖 2𝐾𝑖 βˆ’2π‘›πœ† βˆ’ 4𝑛2πœ† 0 2+ 8𝑛𝐾 𝑖 2𝐾𝑖 , where𝐾𝑖= 1βˆ’2π‘€π‘–πœ”02 𝑀𝑖 𝑛 𝑖=1 .

If all the parameters are unknown, the MLEs of πœ“0= (πœ†0, πœ”02, 𝛽0) are given by the system:

πœ† 𝑁= 1 βˆ’ 2π‘€π‘–πœ”π‘2𝛽 𝑁 2𝑀𝑖 𝑛 𝑖=1 1 βˆ’ 2π‘€π‘–πœ”π‘ 2 1 βˆ’ 2π‘€π‘–πœ”π‘2 𝑛 𝑖=1 βˆ’1

(6)

𝛽 𝑛= βˆ’2π‘›πœ† + 4𝑛2πœ† 𝑛 2+ 8𝑛𝐾 𝑖 2𝐾𝑖 βˆ’2π‘›πœ† βˆ’ 4𝑛2πœ† 𝑛 2+ 8𝑛𝐾 𝑖 2𝐾𝑖 ,

Such that 𝛽 𝑛 is a maximum likelihood estimator defined as any solution of

ℒ𝑛 𝛽 𝑛 = π‘ π‘’π‘πœ“βˆˆπš―β„’π‘ πœ“ . 1 2 𝛽 𝑛 1 βˆ’ 2π‘€π‘–πœ”π‘›2 βˆ’ 2πœ† 𝑛𝑀𝑖 2 πœ† 𝑛 1 βˆ’ 2πœ”π‘›2𝑀𝑖 2 βˆ’ 𝛽 𝑛 1 βˆ’ 2π‘€π‘–πœ”π‘› 2 βˆ’ 2πœ† 𝑛𝑀𝑖 𝛽 𝑛 πœ† 𝑛 1 βˆ’ 2π‘€π‘–πœ”π‘›2 βˆ’ πœ† 𝑛 2 2 βˆ’ 8𝑀 π‘–πœ”π‘›2 πœ† 𝑛 2πœ”π‘›2 1 βˆ’ 2π‘€π‘–πœ”π‘›2 2 𝑛 𝑖=1 = 1 2 πœ”π‘›2 2 𝑛 𝑖=1 .

The second derivatives of ℒ𝑁 πœ“ with respect to the parameters isas follow:

βˆ‚2 βˆ‚πœ†2ℒ𝑛 πœ“ = βˆ’2𝑀𝑖 1 βˆ’ 2πœ”2𝑀 𝑖+ 1 πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 8 , 𝑛 𝑖=1 βˆ‚2 βˆ‚πœ”2βˆ‚πœ”2ℒ𝑛 πœ“ = βˆ’4πœ†π‘€π‘–2𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 2 +2𝑀𝑖𝛽 𝛽 1 βˆ’ 2π‘€π‘–πœ” 2 βˆ’ 2πœ†π‘€ 𝑖 1 βˆ’ 2πœ”2𝑀 𝑖 2 βˆ’ πœ† πœ”2 3 𝑛 𝑖=1 βˆ’2𝑀𝑖 𝛽 1 βˆ’ 2π‘€π‘–πœ” 2 βˆ’ 2πœ†π‘€ 𝑖 2 1 βˆ’ 2πœ”2𝑀 𝑖 3 + 8𝑀𝑖 2πœ†2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 2βˆ’ 2πœ†2 2 βˆ’ 8πœ”2 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 3 9 , βˆ‚2 βˆ‚π›½2ℒ𝑛 πœ“ = βˆ’ 1 βˆ’ 2π‘€π‘–πœ”2 𝑀𝑖 𝑛 𝑖=1 10 , βˆ‚2 βˆ‚πœ† βˆ‚πœ”2ℒ𝑛 πœ“ = βˆ’4πœ†π‘€π‘–2 1 βˆ’ 2πœ”2𝑀 𝑖 2 βˆ’ πœ† 1 βˆ’ 4π‘€π‘–πœ” 2 πœ”2 1 βˆ’ 2𝑀 π‘–πœ”2 2+ 1 2 πœ”2 2 𝑛 𝑖=1 = βˆ‚ 2 βˆ‚πœ”2βˆ‚πœ†β„’π‘ πœ“ 11 , βˆ‚2 βˆ‚πœ† βˆ‚π›½β„’π‘› πœ“ = 𝑛 = βˆ‚2 βˆ‚π›½ βˆ‚πœ†β„’π‘› πœ“ , βˆ‚2 βˆ‚πœ”2βˆ‚π›½β„’π‘› πœ“ = 𝑛𝛽 = βˆ‚2 βˆ‚π›½ βˆ‚πœ”2ℒ𝑛 πœ“ 12 .

And the information matrix ,

𝐼(πœ“) = πΈπœ“ βˆ‚2 βˆ‚πœ†2ℒ𝑛 πœ“ πΈπœ“ βˆ‚2 βˆ‚πœ† βˆ‚πœ”2ℒ𝑛 πœ“ πΈπœ“ βˆ‚2 βˆ‚πœ† βˆ‚π›½β„’π‘› πœ“ πΈπœ“ βˆ‚ 2 βˆ‚πœ”2βˆ‚πœ†β„’π‘› πœ“ πΈπœ“ βˆ‚2 βˆ‚πœ”2βˆ‚πœ”2ℒ𝑛 πœ“ πΈπœ“ βˆ‚2 βˆ‚πœ”2βˆ‚π›½β„’π‘› πœ“ πΈπœ“ βˆ‚ 2 βˆ‚π›½ βˆ‚πœ†β„’π‘› πœ“ πΈπœ“ βˆ‚2 βˆ‚π›½ βˆ‚πœ”2ℒ𝑛 πœ“ πΈπœ“ βˆ‚2 βˆ‚π›½2ℒ𝑛 πœ“ (13),

is the covariance matrix of the vector

βˆ‚ βˆ‚πœ†β„’π‘› πœ“ βˆ‚ βˆ‚πœ”2ℒ𝑛 πœ“ βˆ‚ βˆ‚π›½β„’π‘› πœ“ .

The following lemma is important to investigate the consistency and asymptotic normality of the estimators.

Lemma 3.1

If 𝐻𝑖= βˆ’2𝑀𝑖, for all πœ“ = (πœ†, πœ”2, 𝛽) ∈ ℝ Γ— ℝ+Γ— ℝ+ and all π‘š ∈ ℝ,

πΈπœ“ 𝑒π‘₯𝑝 π‘š

𝑀𝑖

1 + πœ”2𝐻

𝑖 < +∞.

The proof of the above lemma is similar to proof of lemma 1 in [7] and omitted.

Remark 3.1

from lemma (3.1) and proof of proposition (7) in [7], the second derivatives of log-likelihood function

(7)

We need the following additional assumptions to prove the asymptotic properties:

H3 The parameter set 𝚯 is a compact subset of ℝ Γ— ℝ Γ— ℝ+.

H4 The true value πœ“0 belongs to 𝚯o.

H5 The matrix 𝐼 πœ“0 is invertible.

4 Asymptotic properties of maximum likelihood estimator

4.1 Consistency of MLE

We consider the theorem 7.49 and 7.54 of schervish (1995) [16] and verify the regularity conditions in this theorem for our purpose with suppose that 𝛺 is compact. The consistency here is strong consistency that is mean any solution of

the likelihood is consistent

.

Theorem 1

[16]:

Let {π‘₯𝑛}𝑛=1∞ be conditionally 𝑖. 𝑖. 𝑑 given πœƒ with density 𝑓1(π‘₯ πœƒ ) with respect to a measure 𝑣 on a space(πœ’1, ℬ1). Fix

πœƒπœŠβˆˆ 𝛺 , and define, for each 𝑀 βŠ† 𝛺 and π‘₯ ∈ πœ’1,

𝑍 𝑀, π‘₯ = 𝑖𝑛𝑓𝛼 βˆˆπ‘€π‘™π‘œπ‘”

𝑓1(π‘₯ πœƒπœŠ )

𝑓1(π‘₯ 𝛼 )

.

Assume that for each πœƒ β‰  πœƒπœŠ, there is an open set π‘πœƒ such that πœƒ ∈ π‘πœƒ and thatπΈπœƒπœŠπ‘ π‘πœƒ, 𝑋𝑖 > βˆ’βˆž .

Also assume that 𝑓1(π‘₯ . ) is continuous at πœƒ for every πœƒ, a.s. [π‘ƒπœƒπœŠ].

Then if πœƒ 𝑛 is the MLE of πœƒ corresponding to 𝑛 observations. It holds that lim𝑛 β†’βˆžπœƒ 𝑛= πœƒπœŠ a.s. [π‘ƒπœƒπœŠ].

We note that for any π‘₯ , 𝑓1 π‘₯ πœ“ = πœ†1 π‘₯, πœ“ = πœ† π‘₯, πœ“ . Which is clearly continuous in πœ“. In our case we compute for

every πœ“ β‰  πœ“πœŠ: π‘™π‘œπ‘”π‘“1 π‘₯ πœ“πœŠ 𝑓1 π‘₯ πœ“ = π‘™π‘œπ‘” 𝛽0 𝛽 βˆ’ 1 4 𝛽0 1 βˆ’ 2πœ”02𝑀𝑖 βˆ’ 2πœ†0𝑀𝑖 2 𝑀𝑖 1 βˆ’ 2πœ”02𝑀𝑖 +πœ†0βˆ’ 2 πœ† 0 1 βˆ’ 2πœ”02𝑀𝑖 2πœ”02 1 βˆ’ 2πœ” 02𝑀𝑖 +1 4 𝛽 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ 2πœ†π‘€π‘– 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 +πœ† 2βˆ’ πœ† 1 βˆ’ 2πœ”2𝑀 𝑖 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 = π‘™π‘œπ‘”π›½0 𝛽 βˆ’ 1 4 𝛽02 1 βˆ’ 2πœ”02𝑀𝑖 2βˆ’ 4𝛽0πœ†0𝑀𝑖 1 βˆ’ 2πœ”02𝑀𝑖 + 4πœ†02𝑀𝑖2 2 𝑀𝑖 1 βˆ’ 2πœ”02𝑀𝑖 + πœ†0 2 2πœ”02 1 βˆ’ 2πœ”02𝑀𝑖 βˆ’ πœ†0 2πœ”02 + 1 4 𝛽2 1 βˆ’ 2πœ”2𝑀 𝑖 2βˆ’ 4π›½πœ†π‘€π‘– 1 βˆ’ 2πœ”2𝑀𝑖 + 4πœ†2𝑀𝑖2 2 𝑀𝑖 1 βˆ’ 2πœ”2𝑀𝑖 + πœ†2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ πœ† 2πœ”2 = π‘™π‘œπ‘”π›½0 𝛽 βˆ’ 1 4 𝛽02 1 βˆ’ 2πœ”02𝑀𝑖 𝑀𝑖 + 𝛽0πœ†0βˆ’ πœ†20𝑀𝑖 1 βˆ’ 2πœ”02𝑀𝑖 + πœ†0 2 2πœ”02 1 βˆ’ 2πœ”02𝑀𝑖 βˆ’ πœ†0 2πœ”02 +1 4 𝛽2 1 βˆ’ 2πœ”2𝑀 𝑖 𝑀𝑖 βˆ’ π›½πœ† + πœ† 2𝑀 𝑖 1 βˆ’ 2πœ”2𝑀 𝑖 + πœ† 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ πœ† 2πœ”2 = π‘™π‘œπ‘”π›½0 𝛽 βˆ’ 1 4 𝛽02 1 βˆ’ 2πœ”02𝑀𝑖 𝑀𝑖 βˆ’π›½ 2 1 βˆ’ 2πœ”2𝑀 𝑖 𝑀𝑖 βˆ’ πœ†0 2𝑀 𝑖 1 βˆ’ 2πœ”02𝑀 𝑖 βˆ’ πœ† 2𝑀 𝑖 1 βˆ’ 2πœ”2𝑀 𝑖 + πœ†0 2 2πœ”02 1 βˆ’ 2πœ” 0 2𝑀 𝑖 + πœ† 2 2πœ”2 1 βˆ’ 2πœ”2𝑀 𝑖 βˆ’ π›½πœ†βˆ’π›½0πœ†0 βˆ’ πœ†0 2πœ”02βˆ’ πœ† 2πœ”2 . We note that πΈπœ“πœŠπ‘™π‘œπ‘” 𝛽0 𝛽 , πΈπœ“πœŠ π›½πœ†βˆ’π›½0πœ†0 and πΈπœ“πœŠ πœ†0 2πœ”02βˆ’ πœ†

2πœ”2 are finite and by using lemma(3.1) ,πΈπœ“πœŠ

𝛽02 1βˆ’2πœ”02𝑀𝑖

𝑀𝑖 βˆ’

𝛽21βˆ’2πœ”2𝑀𝑖𝑀𝑖,πΈπœ“πœŠπœ†02𝑀𝑖1βˆ’2πœ”02π‘€π‘–βˆ’πœ†2𝑀𝑖1βˆ’2πœ”2𝑀𝑖andπΈπœ“πœŠπœ†022πœ”021βˆ’2πœ”02𝑀𝑖+πœ†22πœ”21βˆ’2πœ”2𝑀𝑖 are also finite, and by assume that π‘πœ“= (πœ†, πœ†) Γ— (πœ”2, πœ”2) Γ— (𝛽, 𝛽) , follows thatπΈπœ“πœŠπ‘ π‘πœ“, 𝑋𝑖 > βˆ’βˆž, hence lim𝑛 β†’βˆžπœ“ 𝑁= πœ“πœŠ a.s.

[π‘ƒπœ“πœŠ]. β–‘

4.2 Asymptotic normality of MLE

To corroborate asymptotic normality of MLE πœ“ , we investigate the conditions in the next theorem provided in ([16], Theorem 7.63):

Theorem 2 [16]:

Let 𝛺 be a subset of ℝ𝑑 and let {π‘₯

𝑛}𝑛 =1∞ be conditionally 𝑖. 𝑖. 𝑑 given πœƒ with density 𝑓1(. πœƒ ) .let

πœƒ 𝑁 be an MLE .Assume that πœƒ 𝑛 𝑃

(8)

derivatives with respect to πœƒ and that differentiation can be passed under the integral sign. Assume that there exist

π»π‘Ÿ π‘₯, πœƒ such that, for each πœƒ0∈ 𝑖𝑛𝑑(𝛺) and each π‘˜, 𝑗 ,

sup πœƒβˆ’πœƒ0 β‰€π‘Ÿ βˆ‚2 βˆ‚πœƒπ‘˜βˆ‚πœƒπ‘—log 𝑓𝑋1 Θ π‘₯ πœƒ0 βˆ’ βˆ‚2 βˆ‚πœƒπ‘˜βˆ‚πœƒπ‘—log 𝑓𝑋1 Θ π‘₯ πœƒ ≀ π»π‘Ÿ π‘₯, πœƒ0 , (14) With limπ‘Ÿβ†’0πΈπœƒ0π»π‘Ÿ π‘₯, πœƒ0 = 0. (15)

Assume that the fisher information matrix 𝐼(πœƒ) is finite and non-singular. Then under π‘ƒπœƒπœŠ,

𝑛 πœƒ 𝑛→ πœƒ0 β„’

β†’ 𝒩(𝟎, πΌβˆ’1(πœƒ

0)).

The maximum likelihood estimator πœ“ is almost sure consistency (see section 4.1 above) that is mean πœ“ 𝑛

𝑃

β†’ πœ“0 under

π‘ƒπœ“, βˆ€πœ“ .The differentiation can be passed under the integral sign (see [7], proof of proposition 5) and from H4 we

fined πœ“0∈ 𝑖𝑛𝑑(𝛺) .from (8),(9),(10),(11) and (12), we deduce that βˆ‚

2

βˆ‚πœƒπ‘˜βˆ‚πœƒπ‘—log 𝑓𝑋1 Θ (π‘₯ πœƒ ) is differentiable in πœ“ =

(πœ†, πœ”2, 𝛽).From remark (3.1) , the derivatives has finite expectation , Hence (14) and (15) holds, we obtain that the

information matrix 𝐼(πœƒ) is finite and from H5 , 𝐼(πœƒ) is invertible, hence πœ“ is asymptotically normal.β–‘

References

[1] AΓ―t‐SahaliaY. (2002). Maximum Likelihood Estimation of Discretely Sampled Diffusions: A Closed‐form Approximation Approach, Econometrica 70, no.1 ,223-262.

[2] Alkreemawi W.K, Alsukaini M. S. and Wang X.J. (2015). On Parameters Estimation in Stochastic Differential Equations with Additive Random Effects, journal of advances in mathematics, 11, no.3 5018-5028. [3] Alsukaini M. S., Alkreemawi W.K., and Wang, X.J. (2015).Asymptotic Properties of MLE in Stochastic Differential

Equations with Random Effects in the Diffusion Coefficient.International Journal of Contemporary Mathematical Sciences, 10, no. 6, 275 - 286.

[4] Beal S. and Shiner L. (1982).Estimating population kinetics, Critical Reviews in Biomedical Engineering, 8 195 - 222. [5] Delattre M. andLavielle M. (2013). Coupling the SAEM algorithm and the extended Kalman filter for maximum

likelihood estimation in mixed-effects diffusion models, Statistics and Its Interface, 6 519 -

532.http://dx.doi.org/10.4310/sii.2013.v6.n4.a10.

[6] M. Delattre, V. Genon-Catalot, and A. Samson, Estimation of population parameters in stochastic differential equations with random effects in the diffusion coefficient, Preprint MAP, 5 (2014), 2014 - 07.

[7] Delattre M., Genon-Catalot V. and Samson A. (2012). Maximum likelihood estimation for stochastic differential equations with random effects, Scandinavian Journal of Statistics, 40 322 - 343.http://dx.doi.org/10.1111/j.1467-9469.2012.00813.x.

[8] DitlevsenS. and De Gaetano A. (2005), Mixed effects in stochastic differential equation models, REVSTAT Statistical Journal, 3 137 - 153.

[9] Donnet S. and Samson A. (2013). A review on estimation of stochastic differential equations for

pharmacokinetic-pharmacodynamics models, Advanced Drug Delivery Reviews, 65 929-

939.http://dx.doi.org/10.1016/j.addr.2013.03.005.

[10] Feigin P.D.(1976). Maximum likelihood estimation for continuous-time stochastic processes, Advances in Applied Probability 712-736.

[11] Gugushvili S. andSpreij P.(2012). Parametric inference for stochastic differential equations: a smooth and match approach,ALEA, Lat. Am. J. Probab. Math. Stat. 9 (2), 609–635.

[12]Liptser R. S. and Shiryaev A. N. (2001).Statistics of Random Processes I. General Theory, 2nd edition. Springer-Verlag, Berlin, Heidelberg,

[13] Maitra T. and Bhattacharya S.(2014).On asymptotic related to classical inference in stochastic differential equations with random effects, ArXiv: 1407.3968v1, 1 - 12.

[14] Picchini U.andDitlevsen S. (2011). Practical estimation of high dimensional stochastic differential mixed-effects models, Computational Statistics & Data Analysis, 55 1426 - 1444.http://dx.doi.org/10.1016/j.csda.2010.10.003. [15] Picchini U., De Gaetano A. and Ditlevsen S. (2010). Stochastic differential mixed-effects models, Scand. J. Statist., 37

,67 – 90.

[16] M. J. Schervish. (1995). Theory of Statistics, Springer-Verlag, New York.http://dx.doi.org/10.1007/978-1-4612-4250-5. [17] Wolfinger R. (1993) Laplace’s approximation for nonlinear mixed models, Biometrika, 80 791 - 795. ISSN 0006-3444.

References

Related documents