• No results found

Supplementary information for Case study 3

2.8 Supplementary Information

2.8.5 Supplementary information for Case study 3

GH3 rat pituitary cells stably transfected with 5kb human prolactin promoter desta- bilized EGFP reporter construct (hPRL-d2EGFP) were seeded onto 35 mm glass

coverslip-based dishes (IWAKI, Japan) and cultured in 10 % FCS for 24 h prior to imaging. Cells were transferred to the stage of a Zeiss Axiovert 200 equipped with an XL incubator (maintained at 37C, 5 % CO2, in humid conditions) and images were obtained using a Fluar x20, 0.75 numerical aperture (Zeiss), air objective. Excitation of d2EGFP was performed using an Argon ion laser at 488nm. Emitted light was captured through a 505-550 nm bandpass filter from a 545 nm dichroic mirror. Images were captured every 15 min. 5 M forskolin and 0.5 M BayK 8644 were added directly to the dish at the start of the experiment. Data was captured and analysed using LSM510 software with consecutive autofocus. Analysis was performed using Kinetic Imaging software AQM6. Regions of interest were drawn around each single cell and mean intensity data was measured 108 times in 15 minutes intervals giving a total of 27 hours of data (see Figure 2.3).

Transformation of parameters

To reduce correlation and improve convergence of the chain we re-parameterized the model in case study 3 as follows

h(Θ, M, P) = ( ˆΘ,M , P˜ ) (2.23)

= (ˆθ1,θˆ2,θˆ3,θˆ4,θˆ5,θˆ6,θˆ7,θˆ8,θˆ9,M , P˜ ) (2.24)

= (log(δM), log(δP), log(sPαb0), log(sPα), log(α),

log(α sPb4), log(b3), log(b1), log(b2),M , P˜ ),

whereM˜ =sM αM −δPP.

Algorithm for inference

1. Set iteration counter i= 0. Initialise all parameters Θˆ(i), hidden process M˜

and bridges P∗.

2. Seti=i+ 1

3. Update (ˆθ(i)1 ,θˆ3(i),θˆ(i)6 ) in one block using multivariate normal proposals and either all are accepted or all are rejected.

4. Separately update each remaining component of Θ(i) using random walk proposals.

5. Update P∗ bridges (using the method in [Elerian et al., 2001])

6. Similarly, update the latent process M˜.

7. Repeat from Step 2 until a sufficient sample from the converged chains has been obtained.

value prior Simulation Experiment

δM 0.44 Γ(0.44,0.02) 0.56 ( 0.36 - 0.92 ) 0.45(0.26 - 0.82 ) δP 0.52 Γ(0.52,0.02) 0.59 (0.38 - 0.89) 0.71 ( 0.45 - 1.09 ) α 20 Exp(100) 16.97 ( 6.54 - 78.98 ) 0.46 ( 0.14 - 1.51 ) sP 0.2 Exp(1) 0.17 ( 0.09 - 0.3 ) 2.11 ( 1.24 - 3.56 ) b0 23 Exp(100) 40.48 ( 9.11 - 136.3 ) 112.7 ( 29.52 - 364.8 ) b1 10 Exp(50) 31.52 ( 7.08 - 83.64 ) 54.84 ( 21.67 - 97.14 ) b2 30 Exp(50) 22.83 ( 5.82 - 60.13 ) 59.43 ( 19.08 - 130.6 ) b3 5 Exp(7) 7.22 ( 4.35 - 9.55 ) 13.78 ( 11.08 - 16.05 ) b4 5 Exp(100) 6.4 ( 1.55 - 23.68 ) 7.39 ( 0.25 - 30.75 ) M0 15 Exp(70) 23.11 ( 4.8 - 66.92 ) 31.79 ( 7.81 - 97.87 )

Table 2.6: Posterior inference results for case study 3. Parameter values used in simu- lation study. Priors, posterior medians and 95% credibility intervals inferred from both

simulated and experimental data. Rates are per hour. Γ(µ, σ2) denotes gamma distri-

bution with mean µand varianceσ2.

Updating Bridges

Suppose data Y = (Y(t1), ..., Y(tT)) are provided at sampling intervals that are

too coarse to allow parameter estimation in the SDE approach without bridging. For example, LUC protein imaging data may be available every 30 minutes while for artificial stochastic process data from simulated clock models we find that a small enough time interval for the normal approximation in (*) in section 2.4 to produce reasonably accurate parameter estimates is 0.1 hour. The methods applied in this study make use of established strategies developed for nonlinear stochastic differential equations [Durham G. B, 2002, Elerian et al., 2001, Eraker, 2001, Kim et al., 1998, Pedersen, 1995]. The basic idea is to augment the observed data

by introducing a number of latent data points (called bridges) Y∗ in-between the measurements. The bridges are constructed so that the data together with the bridges (augmented data) give a time series with interval length∆ti =0.1 h which

we know from simulation studies allows for accurate parameter estimation.

To provide an estimate of the parameters θ from sparsely sampled data, we use MCMC to sample from the joint posteriorf(θ, Y∗|Y)of the parametersθ and the auxiliary variablesY∗ given the data Y, using the fact that, by Bayes’ theorem,

f(θ, Y∗|Y)∝LSDE(Y∗, Y|θ)π(θ) (2.25)

where, as before, π(θ) denotes the prior distribution on θ and LSDE(Y∗, Y|θ) is the approximated augmented likelihood. This is achieved by sampling in turn from the full conditional densities ofθ|Y∗, Y and Y∗|θ, Y ([Tanner and Wong, 1987]). The general structure of the algorithm that we employ is thus as follows:

1. Initialise Y∗ by constructing linear bridges between each of the given data points. The parametersθ are initialised as usual.

2. Sample Yi∗ from Yi∗|Y(ti), Y(ti+1), θ for i = 1,2, . . . , T − 1. The two

samples constitute a full set of imputed data Y∗.

3. Sample θ from θ|Y, Y∗, i.e. use the fully augmented data to update the parameter vector.

4. Repeat steps 2 and 3 until the required sample is obtained after the chain has converged.

Updating the parameter vector in step 3 is quite straightforward as for a given fully augmented time path the constant rate approximation for (*) in section 2.4 is valid and the inference problem is the same as for a finely sampled time path. To sample Yi∗ in step 2 we use the bridging methodology suggested by [Elerian et al., 2001] which has proved very satisfactory but it should be noted that there exist various other available methods for bridging (see [Durham G. B, 2002] for a survey) that may also be used for this kind of problem. We now briefly describe

the bridge building part (step 2) of the algorithm using the [Elerian et al., 2001] sampler. Consider a general SDE of the form:

dy(ti) =µ(y(ti), ti, θ)dt+σ(y(ti), ti, θ)dW, (2.26)

whereycould be, for example, mRNA (M) or protein (P). We denote y(ti)byyi

and y∗(τi,j) by yi,j∗ . Consider any two consecutive observations(yi, yi+1), the ob-

served time series being given byy= (y1, y2, . . . , yT). We want to impute a bridge

of F auxiliary data points between the pair yi and yi+1 at times (τi,1, . . . , τi,F),

where τi,j+1 −τi,j = ∆ (= 0.1 h for example) for all j = 1, . . . , F − 1. Let

yi∗ = (yi,1∗ , . . . , y∗i,F) denote the auxiliary bridge and let y∗ = (y∗1, . . . , yT1) de-

note all the auxiliary bridges.

We know thaty∗i is conditionally independent of the other bridges, given(yi, yi+1, θ).

Thus f(y∗|y1, y, θ) = T−1 Y i=1 f(yi∗|yi, yi+1, θ), and f(yi∗|yi, yi+1, θ) ∝ F Y j=0 f(yi,j+1∗ |yi,j∗ , θ), ∝ F Y j=0

Φ(yi,j+1∗ −yi,j∗ ;µ(yi,j∗ , τi,j, θ)∆, σ2(Yi,j∗, τi,j, θ)∆),

whereΦ is the Normal density.

Given two data points we construct a bridge of length F between them, but sampling a bridge of lengthF is not recommended because it is difficult to sample a high-dimensionalyi∗ in one block. Instead we construct sub-bridges of length m. A sub-bridge of m auxiliary data points starts at yi,k∗ and ends at yi,k+m1.

yi(k,m)∗ = (y∗i,k, y∗i,k+1, . . . , yi,k+m1), k = 1, m−1,2m−1, . . . (2.27)

The density conditioned on the two points at either end of this sub-bridge,y∗i,k1, yi,k+m∗ , is given by

f(y∗i(k,m)|yi,k1, yi,k+m∗ , θ)∝

k+m

Y

j=k−1

We sample each of the sub-bridges of length m in sequence and accept or reject each of them using the Metropolis-Hastings algorithm [Chib and Greenberg, 1995].

Let q(y∗i(k,m)|yi,k1, yi,k+m∗ , θ) denote the proposal density. Suppose that at each

iterationn of our MCMC algorithm, the sub-bridge y∗i(k,m) is given byy∗i(k,m)(n) . We propose a new sub-bridge w ∼ q(yi(k,m)∗ |y∗i,k1, yi,k+m∗ , θ). The new sub-bridge is then accepted with probability:

α(y∗i(k,m)(n) , w|y∗i,k1, yi,k+m∗ , θ) =

min 1,f(w|y

i,k−1, yi,k+m∗ , θ)q(y

∗(n)

i(k,m)|y

i,k−1, yi,k+m∗ , θ)

f(yi(k,m)∗(n) |y∗

i,k−1, yi,k+m∗ , θ)q(w|yi,k∗ −1, yi,k+m∗ , θ)

!

The proposal density q(.|.) is chosen to be a multivariate Normal approximation of the target density at the mode. The location of q(.|.) is given by the mode of the target density obtained by a few Newton-Raphson steps and the dispersion is given by the negative of the inverse Hessian evaluated at the mode. This is a multi-dimensional independence sampler, as the proposal distribution q(.|.) does not depend on the current value of the chain. [Elerian et al., 2001] give analytic functions for both the gradient and negative Hessian for the type of stochastic differential equation we are considering, removing the need to approximate these functions.

Step 3, i.e. updating the parameters, is carried out as usual with the augmented