On the Optimization Problem of Stochastic Observations
of Random Walks
Alexander A. Butov
Faculty of Mathematics and Information technologies of Ulyanovsk State University, Ulyanovsk, Russian Federation *Corresponding Author: [email protected]
Copyright © 2014 Horizon Research Publishing All rights reserved.
Abstract
The optimal control problem for the intensity of observation events of the process of random walk is considered for the case of counting Poisson process in semimartingale terms. The linear function of the intensity as a cost of observations and the expected value of the quadratic form of errors of estimation as a cost of an error are reckoned in a loss function. The analogues result for the problem of the optimal intensity of stochastic approximation is presented.Keywords
Random Walk, Poisson Process, Optimal Control, Estimation, Semimartingale1. Introduction and Definitions
LetB=( ,ΩF,F=( )Ft t≥0, )P be a stochastic basis satisfying the «usual conditions» of Dellacherie, [1]. We consider the following model. OnBa process X Xt t=
( )
≥0is a random walk on the latticeZ={
...,−1,0,1,2,...}
with trajectories in the Skorokhod space and ∆ = −X X Xt t t−∈ −{
1,0,1 ,}
[2]. AlongwithX Xt t=( ) 0≥ we shall consider on B a Poisson process
( ) 0
N Nt t= ≥ with intensity
λ
>0. For all k=1,2,... define stopping times ( )τ k such, that∆N kτ
( )=1:( )
k inf :(
t t 0,N kt)
τ = > = (1) and denote (0) 0τ = for k=0 . Then random variables
{
θ
( ) 1,2,...k k}
= with θ( ) ( ) ( )
k=τ k−τ k−1 are independent andidentically distributed with the density function ρ
( )
x: ( ) 0xρ = forx<0and
{
}
( )x exp x
ρ = ⋅λ − ⋅λ for x≥0. (2)
Mathematical expectations and variances of ( )θ k and
( )
kτ
for allk≥1are equal toEθ( )
k =1 ,λ Dθ( )
k =1 λ2 andt∈[τ( ) (
k,τ k+1 ),)
Dτ( )
k =k λ2 respectively. Suppose that processesXandNare independent and hence times oftheir jumps cannot be simultaneous:
{ 0} 1
0
N s s s t
∆Χ ⋅∆ = = ≤ ≤
∑
P for all t≥0.
In the model N is supposed to be the counting process of observation events. The initial value 0X is observable:
( ) ( )
0 0
Y ω =X ω for all
ω
∈Ωand hence we (0) 0.τ = Because the jump of counting process∆Nt is equal zero on the interval t∈[0, (1)),τ the observation Yt remains equal to(0) 0
X
τ
=X for any timetfrom this interval: Y Yt=τ(0)=X0.The observationYtcan change its value only at stopping time (1)
τ (the time of the first observation): as it follows from the definition (1) Y
τ
(1)=Xτ
(1). Hence( )1 (1) (1) (1) (0) (1) (0) (0)
Y
τ
=Yτ
−+∆Yτ
=Yτ
+ Xτ
−Yτ
⋅ =Yτ
+ ( (1)Xτ
) . (1) (1)
Y
τ
Nτ
− − ⋅∆ Analogously we can describe the algorithm of observation at times ( )τ k for allk≥1 Y
τ
( )k =Xτ
( )k and( )
Y Yt=
τ
k for times [ ( ), ( 1)).t∈τ k τ k+ Hence the process of observationsY =( )
Yt t≥0 is a solution of the following stochastic equation:(
)
0 0
t
Y Xt= +
∫
X Ys s− − dNs.Note, that a process of random walkXcan be represented as a difference of two counting processes:
, 0
X X A B= + − (3) whereA At t=( ) 0≥ andB Bt t=( ) 0≥ are the counting processes of the numbers of positive and negative jumps of X
respectively:
(
1 ,)
0
At Xs
s t
= Ι ∆ =
< ≤
∑
(
)
1 , 0
Bt Xs
s t
= Ι ∆ = −
< ≤
∑
0 0
As Bs s t
∆ ⋅ ∆ =
< ≤
∑
for all t≥0trajectories from Skorokhod space on the lattice Z is the most general possible, but in this article we shall restrict the model by the assumption of very simple distributions of processes A and B.Let A and B be independent Poisson processes with intensities
α
≥0 andβ
≥0 respectively. According to the well-known Doob–Meyer decomposition of submartingales, [3],AandBcan be represented as, ,
A B
At= ⋅ +αt m Bt t= ⋅ +βt mt (4)
wheremA=(mt tA) 0≥ andmB=(mt tB) 0≥ are square integrable martingales onBwith quadratic characteristics:
, , , 0.
A B A B
m t m t m m
t= ⋅
α
t= ⋅β
t=It is clear that the processY Yt t=( ) 0≥ is a stochastic discrete time approximation of the processX Xt t=( )≥0.In this simple
case it is possible to estimate the value of Xtgiven the observations
{
Yt,0≤ ≤s t}
for allt≥0.The more is the rate of observations, the better is approximation and the less could be the error of estimation. In many applications the cost of observations is not negligible. So the problem is in finding such a compromise intensity of observationsλ, which results in minimization of a loss function reckoning in a cost of observations and a cost of an error of estimation.
In the model a cost of observations is supposed to be linearly depending on its averaged and normalized number
NTin a space of timeT>0,and therefore on the intensity λ of the Poisson processN.A cost of errors of observations is the expected value of a quadratic form of errors. The process of estimation is considered here as a continuous-time construction of a (discontinuous) process ˆX Xt t=( ) 0ˆ ≥ with random variable Xtˆ , defined as an optimal mean square estimate ofXt given
{
Ys,0≤ ≤s t}
for all t≥0(i.e. given adiscreet set of observations Y
τ
( )k =Xτ
( )k for all( )
k t k, 0,1,2,...):τ ≤ = Xˆt=E( |Xt tFY), where a σ-algebraFtY =
(
Y s ts;)
σ ≤ of a non–decreasing family FY=(Ft tY) 0≥ is completed by sets fromFofP measure zero. Thus a cost of error of estimation is a normalized and expected value of an integral of the varianceγt=Eεt2of the error of estimation
ˆ
X X t t t
ε
= − for t≥0.Along with a consideration of errors of estimations given the observationsFY it is possible to examine proper errors of approximationδt= −X Yt tand theexpected value of the quadratic form of errors withΓ =t Eδt2.
2. Results
Let us define the loss process
ϕ
=( ( )) 0ϕ
t t≥ as a linearfunction of numbers of observations (as a cost of observations) and the quadratic form of errors of estimation (as a cost of an error) with positive constantshandg:
( )
( )
2 ,0
t
t h s ds g Nt
φ = ⋅
∫
ε + ⋅ (5)The expected and normalized value ( )ψ λ of the loss function
φ
( )
t corresponding to the intensityλis( )
lim 1( )
T ,T T
ψ λ = φ
→∞ E (6) In terms of optimal control the problem is in finding such intensity *λ that
( )
*
( ) min . 0
ψ λ ψ λ
λ =
≥ (7) In order to investigate the variances of errorsγt=Eεt2we at first formulate preliminary results for auxiliary random variables γt( )u being conditional variances
( )u (( ( ) 2u ) |FY) u
t t
γ =E ε of the errorεt
( )
u = Xt u+ −Xˆt( )u of the estimate Xˆt( )u =E(Xt u u+ |FY)fort≥0andu≥0 (timestandu can be considered as arbitrary randomFY -adapted stopping times). The properties ofγt
( )
u are studied in the following lemma:Lemma 1. For all 1,2,...k= and stopping times
τ
( )
k for0
λ
> the following equality holds:( )
(
)
( )
( )
1
2 1
k
k dt t
k τ
α β τ
γ
λ τ
+ − =
−
∫
E .
Proof. According to the semimartingale presentation (4) of
components (3) of X and the equality Y kτ
( )
−1 =X kτ( )
−1 the estimate Xtˆ(τ( )
k−1 )=E(
Xt k+τ( ) ( )
−1|FτYk−1)
is equal to( )
1X kτ − +
(
α β− ⋅)
t. Therefore the conditional error can bepresented as εt( ( 1))τ k− = (mτA( 1)k− +t−mτA( 1)k− ) - (m k tτB
( )
− +1 −). ( 1)
B
m kτ − Because the martingalesmAandmB in (4) are
independent, the conditional variance
( ( 1)) {( ( ( 1)) 2) | } ( 1)
F
k k Y
tτ tτ k
γ − =E ε − τ − is equal to
( )
( 1) ( 1)
A A
m m
k t k
τ − +− τ − +(mBτ( 1)k− +t− mBτ( 1)k− )= =t⋅ +
(
α β)
. (8)distribution density ρ
( )
x is exponential, (2), then for all 1 k≥( )
(
)
( )
( )
(
)
( )
( )
1 1 1 k kk dt t dt
t
k k
τ τ
τ
γ α β
τ τ
− = ⋅ +
− −
∫
∫
E E =
(
)
( )
( ) (
)
2 .0 0
0
k x
t dt x t dt dx
θ
α β
α β ρ α β
λ
∞ +
⋅ + =∫ ⋅ ⋅ +∫ =
∫
E
Lemma 1 is proved.
The following lemma gives the way of calculation of ( )ψ λ in (6) in terms of
φ
( )
t andτ
( )
k :Lemma 2. For the loss process Φ and k=
[
λ⋅T]
thefollowing convergence takes place
( )
lim( )
/k(
( )
k)
,k
ψ λ= λ ⋅ φ τ
→∞ E
where
[ ]
⋅ is a greatest integer function.Proof. As it follows from the definition (1) the equality
( )
λ
/k⋅ ⋅
g N
τ
( )k= ⋅
g
λ
holds for all 1.k≥ The equality( )
1/T g N g⋅ ⋅ T= ⋅λE takes place for all T>0 because the
intensity of the Poisson process N is equal to λ Hence for the normalized and expected right summand in (5) at time
( )
kτ
the following equality holds:( )
1{
}
lim N k lim g NT
k T
k T
g
λ
τ
λ
⋅ = ⋅
→∞ E →∞ E
= ⋅
. (9)Consider the normalized and expected left summand in (5) at time ( )τ k at time T k= λ,k≥1.It is clear that
( )
( )
( )
( )
(
)
( )
( )
1 2 10 0 1
k k k i
i
dt dt dt
t t t
i i
τ τ τ
τ
ε γ γ
τ − = = = −
∑
∫
∫
∫
E E E
and, as it follows from Lemma 1,therefore holds
( )
( )
2 2 0 kh t dt h k h
k k
τ
λ ε λ α β α β
λ λ
+ +
⋅ ⋅E
∫
= ⋅ ⋅ ⋅ = ⋅Hence the proof of Lemma 2 is reduced to the verification of the statement (9):
( )
1 2 lim 0 T dt t T k α β ε λ + ⋅ =→∞ E
∫
. (10)Let us consider the auxiliary random variable
ζ
( )
T with values in [0, ]T for T >0 defined as( )
T sup :{
s s T Ns, 1}
ζ = ≤ ∆ = =
=inf : 0, ,
{
s s∈[ ]
T N Ns T=}
.It is clear that
ζ
( )
t is not a stopping time on basis B (because in general case the set{
ω ζ:( )
t u u≤ ∉}
F for u t< ).Nevertheless it is possible to investigate the set
{
Θ( )T T}
>0 of random variables( )
( )
( ) 0
T
T t dt
ζ γ
Θ =
∫
for T>0in terms of inverse time. It is clear, that from the convergence
( ) ( )
k/ k 1 a s. .τ λ⋅ → P− (ask→∞)it follows that (fork→ ∞ and T= ⋅ →∞λk )P −a s.
( )
{
( ) ( )
}
( 1 ( ) )1 0
i
T i T dt
t
T i T
τ
τ γ
λ λ
∞ Θ
= Ι =Θ ⋅ ⋅ =
⋅ =
∑
∫
( ) ( )
{
}
2 ,1
i
i T
T i
α β α β τ
λ λ λ
∞ + +
Ι =Θ ⋅ ⋅ →
=
∑
(11)where {}Ι ⋅ is an indicator function (Ι
{ }
true=1,Ι{
false}
=0). So the proof of the lemma would follow from the convergence( )
1 ( ) 0. 0
T
dt T t
T⋅
∫
γ −Θ →E (12)
Note, that
( )
( )
0
T T
dt T dt
t t
T
γ γ
ζ −Θ =
∫
∫
(13)and in inverse time presentation the process
( ) [ ]
0,R= Ru u∈ T for u T t= − with Ru=NT t− is R F -adapted, where FR=(FuR)0≤ ≤u T and FuR =
{
Rv;0 v u}
σ ≤ ≤ = σ
{
N T u t Tt, − ≤ ≤}
. The process R is asupermartingale and therefore admits the following Doob – Meyer decomposition
r Ru=NT − =ru NT − +ru mu,
where r =
( )
ru 0≤ ≤u T is a compensator and mr =( )
mur 0 u t≤ ≤ is a square – integrable martingale. The compensator ru is equal to
0
u ru
ru du
T u =
−
∫
(14)
(note that formula (14) is similar to that of semimartingale presentation of a Brownian bridge and results from the infinitesimal semimartingale presentation of Poisson process in inverse time). According to (14) and from the well-known formula of Dellacherie d r d F xx= ζ
( )
/ 1(
−F xζ( )
−)
(see, e.g. [1]) it follows that for the conditional distribution function( )
the first jump of the process r given the random value NT hold
( )
F xζ =1 1 /− −
(
x T)
NT ,( )
x N T xT(
)
NT /TNTρζ = ⋅ − (15)
Because R coincides with N in inverse time, then ζ ζ= . From the formula (8) and independence of the processes
( )
0X= Xt t≥ andN=
( )
Nt t≥0it follows that( )
( )
1 1
0
T T
dt du
t u
T T T
ζ
γ γ
ζ
⋅
∫
= ⋅∫
E E =
( )
21
2 0
T x
x dx T⋅
∫
ρζ ⋅ ,where γu =γT t− for u T t= − . From (15) we receive
( )
2{
}
1 1 1
2 0
T x
x dx NT
T⋅
∫
ρζ ⋅ = NT ⋅Ι ≥ . Because P{
1≤NT ≤T/ 2}
→0 for T → ∞ and{
}
{
Ι NT ≥1 /NT}
≤ E{
}
1{
}
1 1 / 2 / 2
/ 2
NT T T NT
T
≤ ⋅Ι ≤ ≤ + ⋅Ι < ≤
E
{
1 / 2}
1/ 2
NT T T
≤P ≤ ≤ + ,
then
( )
1 T dt 0
t T ⋅ζ
∫
T γ →E . (16)
The convergences (11) and (16) along with the equality (13) result in (12) and in the statement of Lemma 2. Lemma is proved.
The value of ( )ψ λ in (6) is obtained in the following lemma.
Lemma 3. Under assumptions of the Theorem the function
( )
ψ λ
is equal to( )
h ( ) / gψ λ
= ⋅α β λ
+ + ⋅λ
.Proof of thelemma follows from the statements (9) and (10).
Here is the solution of the problem (6)-(7):
Theorem 1. Leth≥0,g>0.Then the valueλ*in the problem
(7) for the loss function (6) is equal to
* h g/ ,
λ = ⋅ α β+ (17) and the value of loss function is
( )
* 2 h gψ λ = ⋅ ⋅ α β+ . (18)
Proof of thetheoremfollows from the statement of Lemma 3
and the equationdψ λ λ
( )
/d =0, resulting in (17) and (18). Now we study an intensity of observations λ as a rate of stochastic approximation of X with Y . In order to investigate the rates of approximation we define functions( )
tΦ andΨ
( )
λ
by analogy withφ
( )
t andψ
( )
t substituting the estimates Xsˆ by the observations Ys (and hence substituting εsbyδs) in (5)-(6):( )
( )
20
t
t h δs ds g Nt Φ = ⋅
∫
+ ⋅ ,( )
lim 1( )
T T T λΨ = Φ
→∞ E , (19) and we consider the optimal control problem of finding such intensity of stochastic approximation Λ that
( )
( ) min 0
λ
λ
Ψ Λ = Φ
≥ (20) By analogy withγt( )u we defineΓ( )tu =E((δt( ) 2u ) |FuY)with
( )u t
δ =Xt u u+ −Y .
Then the following result forΓ( )tu is true:
Lemma 4. For all k=1,2,...and stopping times
τ
( )
k for0
λ
> there holds( )
(
)
( ) 2
2( )
1
2 3
( 1)
k
k dt t
k
τ α β α β
τ
λ λ
τ
+ ⋅ −
−
Γ = +
−
∫
EProof of the lemma follows from the obvious equality
( )
( ( 1)) ( 1)
k
k dt t
k τ
τ τ
− Γ −
∫
E =
= ( )
( ( 1)) ( 1)
k
k dt t
k τ
τ γ τ
−
−
∫
E + ( )( ( ) )2 , 0 0
x
x tdt dx
ρ α β
∞
⋅ −
∫
∫
from (2) and from the statement of Lemma 1.
The next result gives a way for finding ofΨ
( )
λ
in terms of ( )tΦ and stopping times
τ
( )
k :Lemma 5. For the loss processΦandk= ⋅[λT]the following
convergence takes place:
( )
lim( )
/k(
( ) .k)
k
λ λ τ
Ψ = ⋅ Φ
→∞ E
Proof is similar to that of Lemma 2.
Lemma 6. Leth≥0, g>0.Then for the functionΨ
( )
λ
holds( )
λ 2 (h α β) /2 2λ h(
α β λ)
/ gλ.Ψ = ⋅ − + ⋅ + + ⋅ (21)
Proof is similar to the proof of Lemma 3.
The next result gives a way for solving the problem (20):
Theorem 2. Leth≥0, g>0.Then the valueΛin the problem
(20) for the loss function (19) is a solution of the following equation:
3 ( )( / ) 4(h g )( / ) 0.h g λ λ α β− ⋅ + ⋅ − ⋅ − ⋅α β = The value of loss functionΨ( )Λ is defined by (21).
Proof is analogous to that of Theorem 1 and follows from the requirementdΨ
( )
λ λ/d =0.3. Conclusion
The method discussed in the paper is based on the semimartingale approach developed for the random walks of a general type in [2]. This approach was shown to be useful for the limit theorems and for the problems of estimation. The problems of the optimal intensity of observations of the processes are developed mostly for the Gaussian or for the stationary systems (see, e.g. [4-6]) but not for random walks yet. Nevertheless it is possible to receive new results by means of this (martingale) approach. The investigated in the article simple case of the random walk is interesting because it demonstrates the method developed for a non-stationary (and non-ergodic) case. Thus it can be easily applied to a variety of processes (including the case of random walks in the conditionally-Markov random environments, considered as an example of the martingale approach in [2]) and point counting processes for the numbers of observation. The main result of the paper, stated in the Theorem 1, permits to consider the optimal control problems for the rate of instant observations of nonstationary systems (e.g. streams of data in queueing systems similar to [7]). The problems of comparison of optimal intensities of observations (for estimation) and optimal rates of approximation are of especial interest, and
the statement of the Theorem 2 (along with the results of
Theorem 1) can be considered as a simple approach to the problems of such type.
Acknowledgements
The author is grateful to the referee for helpful comments that led to an improved manuscript. This research was partially supported by the Ministry of Education and Science of the Russian Federation (Research Projects of Ulyanovsk State University).
REFERENCES
[1] C. Dellacherie. Capacites et processus stochastiques. Springer-Verlag, Berlin, Heidelberg, New York, 1972. [2] A. A. Butov. Random walks in random environments of a
general type. Stochastics and Stochastics Reports, Gordon and Breach Science Publishers S.A., Vol. 48, pp 145-160, 1994.
[3] R. Sh. Liptser, A. N. Shiryaev. Theory of martingales. Dordrecht, Kluwer Academic Publishers, 1989.
[4] M. Ades, P. E. Caines, R. P. Malhame. Stochastic optimal control under Poisson-distributed observations. Automatic Control, IEEE Transactions on, Vol. 45 , Issue 1, pp 3-13, 2000.
[5] R. A. Davis, W. T. M. Dunsmuir, S. B. Streett. Observation-driven models for Poisson counts. Biometrika, Vol. 90, No. 4, pp. 777-790, 2003.
[6] Tang Shanjian, Hou Shui-hung. Optimal Control of Point Processes with Noisy Observations: The Maximum Principle. Applied Mathematics & Optimization, Vol. 45 Issue 2, p185, 2002.