State and parameter estimation from exact partial state observation in stochastic reaction networks

(1)

State and parameter estimation from exact

partial state observation in stochastic reaction

networks

Mingkai Yu

Joint work with Dr. Muruhan Rathinam

(2)

Motivation

Fluorescent technologies make it possible to measure the dynamics of a small number of key molecular species within a living cell.

0 20 40 60 80 100 120 140 160 180 200 time -10 0 10 20 30 40 50 60 70 molecule counts filtered S1 actual S1 observed S2

Goal: Estimating the unobserved molecule counts of the species in the chemical reaction system given the measured data of few

(3)

Stochastic reaction networks

We consider a stochastic reaction network with n molecular species and m reaction channels. It could be modelled as stochastic ODE

Z (t) = Z (0) + m X j =1 νjRj Z t 0 aj(Z (s))ds , (1)

where Rj(·) are independent unit rate Poisson processes.

Z (t) ∈ Zn

+ : the copy number vector of the species at time t.

Stoichiometric vector: νj ∈ Zn, j = 1, ...m. Propensity functions aj(z). Example: A + B c1 −→ C C c2 −→ A + B Z (t) =   #A(t) #B(t)  , ν1=   −1 −1  , ν2=   1 1  , a1(z) = c1z1z2, a2(z) = c2z3

(4)

Evolution of a system

Recall that Z (t) = Z (0) + m X j =1 νjRj Z t 0 aj(Z (s))ds , where Rj(·) are independent unit rate Poisson processes.

aj(z)h + o(h) = Probability that reaction j will occur in infinitesimal time interval [t, t + h), givenZ (t) = z.

The time evolution of the probability mass function

p(t, z) = P{Z (t) = z} is given by the Kolmogorov’s forward equations also known as the chemical master equations (CME):

p0(t, z) = m X j =1 aj(z − νj) p(t, z − νj) − m X j =1 aj(z) p(t, z), ∀t > 0, z ∈ Zn+ (2)

(5)

Stochastic Simulation Algorithm [Gillespie,1977]

1: Input: Initial state z0, terminal time T 2: Initialize z = z0, t = 0

3: while t < T do

4: Set λi = aj(z) for j = 1, ..., m, and λ0=Pm_{j =1}λj

5: Generate τ ∼ Exponential(λ0) 6: Generate u ∼ Uniform[0, 1] 7: if t + τ < T then 8: Find j ∈ {1, . . . , m} s.t. Pj −1 l =1λl < uλ0 ≤ Pj l =1λl 9: Set t = t + τ , z = z + ν_j0 10: else 11: Set t = T 12: end if 13: end while

(6)

Exact partial state observation

Assumption: We make exact (noiseless) observations of the copy

number of the last n2 species in continuous time.

Z (t) = (X (t), Y (t)), where X (t) ∈ Zn1

+ is the unobserved

state and Y (t) ∈ Zn2

+ is the observed state.

π(t, x ) = P{X (t) = x | Y (s) = y (s), 0 ≤ s ≤ t} ∀x ∈ Zn1 +. (3) νj = (νj0, νj00), where νj0 ∈ Zn1, νj00∈ Zn2. Observable reactions: O = {j : ν_j006= 0} Unobservable reactions: U = {j : ν_j00= 0}. aO(x , y ) =X j ∈O aj(x , y ), aU(x , y ) = X j ∈U aj(x , y ),

(7)

Filtering equations

We shall denote by tk, k = 1, 2, . . . the successive jump times of y (t).

On the interval [tk, tk+1), the unnormalized conditional distribution

ρ(x , t) satisfies ρ0(t, x ) =X j ∈U ρ(t, x − ν_j0) aj(x − νj0, y (tk)) − X j ∈U ρ(t, x ) aj(x , y (tk)) − ρ(t, x) aO(x , y (tk)) ∀x ∈ Zn+1. (4)

At jump times tk, ρ(t, x ) jumps according to

ρ(tk, x ) = 1 |Ok| X j ∈Ok aj(x − νj0, y (tk−1)) ρ(tk−, x − νj0), x ∈ Z n1 +, (5) where Ok = {j ∈ O | νj00= y (tk) − y (tk−)}. We have π(t, x ) = ρ(t, x )/P ˜ xρ(t, ˜x ).

(8)

Particle filter

Our Monte Carlo algorithm involves generating a trajectory V (t) with same state space Zn1

+ as X (t), along with a weight trajectory

w (t), such that at any time t ≥ 0 and for x ∈ Zn1

+

π(t, x ) = E[w (t)1{x}(V (t))]

E[w (t)] . (6)

In practice, from a sample of Ns identically distributed trajectories V(i ) along with weights w(i ) for i = 1, . . . , Ns, π(t, x ) is estimated by ˆ π(t, x ) = PNs i =11{x}(V(i )(t)) w(i )(t) PNs i =1w(i )(t) (7)

(9)

Updating (V (t), w (t))

In between jump times, tk ≤ t < tk+1.

The process V is evolved according to the underlying reaction network with all observable reactions removed.

The weight process w (t) is evolved according to w (t) = w (tk) exp − Z t tk aO(V (s), y (tk)) ds . (8)

(10)

Continuous Evolution [t

k

, t

k+1

)

Assume unobservable reactions are numbered 1, 2, . . . , mu, and observable reactions are numbered mu+ 1, . . . , m

1: while t < tk+1 do 2: Set λj = aj(V , y ) for j = 1, ..., mu 3: Set λO =Pm j =mu+1λj and λ U ₌Pmu j =1λj 4: Generate τ ∼ Exponential(λU) 5: if t + τ < tf then 6: Generate u ∼ Uniform[0, 1] 7: Find j ∈ {1, . . . , mu} s.t. Pj −1_{l =1}λl < uλU ≤Pj_{l =1}λl 8: Set V = V + ν_j0 9: end if 10: Set tn= min{t + τ, tk+1} 11: Set w = w × exp{−(tn− t) λO} 12: Set t = tn 13: end while

(11)

At jump times t

k

The process V (t), w (t) jumps as follows. Randomly pick one j ∈ Ok with uniform probability 1/|Ok|, and set

V (tk) = V (tk−) + νj0, (9)

w (tk) = w (tk−) aj(V (tk−), y (tk−1)), (10)

where Ok = {j ∈ O | ν_j00= y (tk) − y (tk−)}.

Main Result [Rathinam and Yu, 2020]

The process (V (t), w (t)) described by the above procedure satisfies

E[1{x}(V (t)) w (t)] = ρ(t, x ), for all t ≥ 0 and x , where ρ(t, x ) is the solution to the unnormalized filtering equations (4) and (5).

(12)

Parameter estimation

In general, the propensity functions aj(x , y ) depend on a vector c of parameters, thus aj = aj(x , y , c) where c ∈ Rp. Here we consider a Bayesian framework for inferring the parameters c from the observation Y (s) = y (s) for 0 ≤ s ≤ t.

Treat c as a random variable, or rather a stochastic process C (t) which remains constant in time t ≥ 0, and “absorb” C (t) into X (t)

Start with a prior distribution ¯µ on the parameter space Rp Compute the posterior distribution probability mass/density function π(t, x , c) which is characterized by

P{X (t) = x , C (t) ∈ A | Y (s) = y (s), 0 ≤ s ≤ t} = Z

c∈A

π(t, x , c) dc. (11)

(13)

A linear propensity example

We consider the simple reaction network S −→ S + A,c1 ∅−→ S,c2

S −→ ∅,c3

(12)

with mass action form of propensities a1(z) = c1z2, a2(z) = c2, a3(z) = c3z2. Species S is observed exactly while the species A is unobserved. Thus, Z (t) = (X (t), Y (t)) = (#A(t), #S (t)).

π(t, x ) = λ

x_e−λ

x ! where λ =R₀tc1y (t)dt.

(14)

80 90 100 110 120 130 140 150 160 170

the copy number of species A

0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04

Conditional probability mass function

Filter Theory

Figure 1:Estimated conditional probability mass function of the copy

number of species A compared with the exact theoretical function at time

T = 20. The experiment is run under initial condition z0= (0, 5),

(15)

A point estimate of conditional expectation E[X (t) | Yt] E(t) = PNs i =1V(i )(t) w(i )(t) PNs i =1w(i )(t) . 60 80 100 120 140 160

actual number of species A

60 70 80 90 100 110 120 130 140 150 160

estimated number of species A

Figure 2:Scatter plot of the point estimates E (T ) of the copy number of

(16)

We performed a Bayesian estimation of parameter c2 with a uniform prior distribution on [4, 6].

4 4.5 5 5.5 6 actual c 2 4 4.2 4.4 4.6 4.8 5 5.2 5.4 5.6 5.8 6 estimated c 2 0 5 10 15 20 25 30 35 40 T -0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 bias or error bias L2 error

Figure 3: A scatter plot of ¯C2(T ), the estimated value of C2 (based on

the observation of species A) against the actual value of C2. On the

right, the L2

error E[( ¯C2(T ) − C2)2]1/2 as well as the bias E[ ¯C2(T ) − C2]

are plotted against T .

(17)

References

Alan Bain and Dan Crisan, Fundamentals of stochastic filtering, vol. 60, Springer Science & Business Media, 2008. Fulvia Confortola and Marco Fuhrman, Filtering of

continuous-time markov chains with noise-free observation and applications, Stochastics An International Journal of

Probability and Stochastic Processes 85 (2013), no. 2, 216–251.

D. T. Gillespie, Exact stochastic simulation of coupled chemical reactions, J. Phys. Chem. 81 (1977), 2340–2361. Muruhan Rathinam and Mingkai Yu, State and parameter estimation from exact partial state observation in stochastic reaction networks, arXiv preprint arXiv:2010.04346 (2020).