• No results found

Parameter Expanded, Non-centered Parameterization (PX-NC)

CHAPTER 3. REPARAMETERIZATIONS OF BAYESIAN FOURIER-

3.3 Alternative Parameterizations for Fourier-form DLMs

3.3.3 Parameter Expanded, Non-centered Parameterization (PX-NC)

Parameter expansion (PX) is an additional strategy aimed at improving convergence of posterior sampling algorithms by reducing dependence between highly correlated parameters (Meng and Van Dyk, 1997; Meng and Dyk, 1999; Liu and Wu, 1999). The method builds upon data augmentation through the introduction of redundant (or working) parameters.

These parameters are only partially identifiable, as they are unidentified given only the observed data, but fully identified given the observed and augmented data (Gelman et al., 2008). Parameter expansion alters the joint distribution of observed data, y, and augmented data, θ, by introducing a one-to-one transformation on θ dependent on introduced working parameters α (e.g. φ = f (θ, α)) such that the marginal distribution of the observed data does not depend on the working parameters. That is, p(y|ψ, α) = p(y|ψ), where ψ

are unknown model parameters. We refer the reader to Gelman et al. (2008) for a very nice outline of parameter expansion for hierarchical linear and generalized linear models, as well as discussion of convergence rates of algorithms for these particular models. We give here a brief example of parameter expansion on the hierarchical random effects model following the outline of Gelman et al. (2008) to illustrate the strategy, and then present the implementation for Fourier-form DLMs.

Consider the following non-centered hierarchical group means model

Yij = µ + θj+ i, i ∼ N (0, σe2) (3.28a)

θj ∼ N (0, σθ2). (3.28b)

where Yij are the observed data and θ = (θ1, . . . , θJ)T represent J group means and are considered the augmented data. Let ψ = (µ, σ2θ, σe2). Then a standard DA MCMC sampling scheme alters between sampling θ given ψ, and ψ given θ in a Gibbs sampler. However, as discussed earlier, these standard MCMC schemes can be very inefficient, particularly when σθ2 is small. Parameter expansion introduces the transformation ˜θ = θ/α, where α is a working parameter. Then, the parameter expanded model is

Yij = µ + α˜θj+ i, i ∼ N (0, σe2 (3.29a)

θ˜j ∼ N (0, σ2θ˜). (3.29b)

where σ2θ = α2σθ2˜. An independent, conditionally conjugate prior for α is s the normal distri-bution (treating α like a regression coefficient) and for σ2˜

θ is the inverse-gamma distribution.

The corresponding prior on σθ = |α|σθ˜ is half-cauchy (noting that this is the distribution of the absolute value of a normal random variable divided by the square root of an inde-pendent gamma random variable), and thus an MCMC scheme for sampling from the PX model is equivalent to sampling from the original model with a half-cauchy prior on σθ. It is important to note, however, that when priors on the working parameter and transformed parameters are independent, the parameters are not separately identifiable (e.g. α and σθ2˜

in 3.29), and thus the backtransformations to the original model should be performed and inferences should only be made about original parameters. If inference about the parame-ters in the parameter expanded model are preferred (i.e. if the reparameterization is done not just for computational improvements), then careful specification of the priors on the working and transformed parameters can introduce identifiability (Gelman et al., 2008).

The motivation for implementing parameter expanded approach for the Fourier-form DLM is to develop a single MCMC sampling scheme that is at least as efficient as the standard and SC-NC sampling schemes for a larger region of the parameter space of (V , W ), as the standard and SC-NC schemes are efficient in separate areas of the parameter space (discussed further in Section 3.4.2). In the following, we present the PX-NC model for Fourier-form DLMs and show empirically through simulations that the parameter expanded approach performs at least as well as the standard MCMC sampler, and better than the SC-NC sampler, when evolution variances are non-negligible. This suggests the PX-NC model can be more universally applied to time series with varying degrees of evolution in seasonality than the SC-NC and standard parameterizations.

To derive the PX-NC model, we perform parameter expansion on the non-centered states θ˜t = θt− Gtθ0, introducing the working parameter vector . Under Model 1,  is a scalar, and the transformed states are θpxt = ˜θt/ and ση2 = σw2/2. Under Model 2, define L = blockdiag (1I2, . . . , qI2), and the transformation θtpx= L−1 θ˜t. The transformed variances are σ2ηj = σw2j/2j for j = 1, . . . , q such that W = Lblockdiag(ση21I2, . . . , σ2η1I2)L. Then the

PX-NC Fourier-form models are as follows. For Model 1,

Yt= Xtθ0+ F θtpx+ vt vt ∼ N (0, σ2e) (3.30a) θtpx= Gθpxt−1+ ηt ηt ∼ N (0, σ2ηI) (3.30b)

θ0 ∼ N (0, σ2w0I) (3.30c)

σe ∼ Cauchy+(0, c) (3.30d)

 ∼ N (0, 1) (3.30e)

ση2 ∼ IG(1/2, d2/2) (3.30f)

σw2

0 ∼ IG(a, b) (3.30g)

and for Model 2.

Yt= Xtθ0+ F Lθtpx+ vt vt ∼ N (0, σ2e) (3.31a) θtpx= Gθpxt−1+ ηt ηt ∼ N (0, blockdiag(ση2

1I2, . . . , ση2qI2)) (3.31b)

θ0 ∼ N (0, σ2w0I) (3.31c)

σe ∼ Cauchy+(0, c) (3.31d)

j ∼ N (0, 1) (3.31e)

ση2j ∼ IG(1/2, d2/2) (3.31f)

σw20 ∼ IG(a, b) (3.31g)

The specific choices of priors on  and σ2ηcorrespond to Cauchy+(0, d) priors on σwj = |jηj. We also specify the non-zero prior, θpx0 ∼ N (0, blockdiag(ση21I2, . . . , ση2qI2) for Model 2, and θ0px∼ N (0, ση2I) under Model 1.

3.3.3.1 MCMC sampler for the PX-NC model

We implement a two-block DA Gibbs sampling scheme for the PX-NC Fourier-form models by choosing θ0:Tpx as the augmented data, and ψ1 = (θ0, , σe2, σ2η, σw20) and ψ2 = (θ0, , σ2e, ση2, σw20) as the unknown model parameters for Models 1 and 2, respectively. Then a DA MCMC scheme for Models 1 and 2 is implemented as follows (for ψl, l = 1, 2)

1. Draw θ0:Tpx from the conditional distribution π(θ0:Tpxl, y1:T) using FFBS.

2. Sample ψl jointly from the conditional distribution π(ψlpx0:T, y1:T) in one block (con-ditioning on θpx0:T, y1:T is implicit in the following):

(a) Sample (, θ0), given σe, from the multivariate normal posterior N(m, C) with C = 1

(b) Sample σe, given ση,  and θ0, from its full conditional using Metropolis-Hastings.

(c) Sample ση2j from an IG(cη, Cηj) where, for Model 1 that this transformation does not affect the sampling scheme, and can also be done as a post processing step.

Complete derivations of the PX-NC MCMC sampler are found in Appendix B.3. Under the PX-NC model, , σw and θ1:Tpx are not identifiable, and therefore the untransformed parameters should be used for inference.  and θpx1:T are unidentifiable up to a sign change (similarly to the SC-NC models), which only affects inference about σw. Again, this can be dealt with by implementing a random sign switch (with probability 0.5, multiply  and θ1:Tpx by -1) at each iteration of the sampler.