The Bayesian Structural Equation Model - Essays on Quantitative Methods for Consequences of Pol

In this section, a longitudinal structural equation model is proposed to deal with the interdependence between party support and party policy orientations as well as with measurement issues of party policy positions. The path diagram of the model is presented in Figure 3.1. Consider a set of observations for party policy orientations and party support, respectively: for party j = 1, …, J measured at election time point t = 1, …, T_j, we have a p₁ × 1 random vector y_jt for the former and a p₂ × 1 random vector x_jt for the latter. The number of elections in which a party participates, T_j, may differ from party to party.

Figure 3.1: Path Diagram of the Structural Equation Model for Dynamics of Party Competition.

The framework of SEM consists of two components: a measurement equation and a structural (or simultaneous) equation. The measurement equation, which relates the latent variables to their corresponding indicators and takes measurement errors into account, is essentially a confirmatory factor analysis (CFA) model given by

u_jt = α + Λη_jt + ε_jt, (3.1)

where u_jt = (y′_jt, x′_jt)′ is a p × 1 vector of observations with p = p₁ + p₂, α is a p × 1 vector of intercepts, Λ is a p × q factor loading matrix, η_jt is a q × 1 vector of latent common factors representing party policy orientations and party support, and ε_jt is a p × 1 vector of unique factors. The vector of unique factors ε_jt is assumed to be independent of the latent factors η_jt and independently and identically distributed (i.i.d.) as normal with mean vector zero and covariance matrix Ψ, that is, ε_jt ∼ N_p(0, Ψ), where Ψ is a diagonal matrix with elements ψ²_h for h = 1, …, p.
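To make Equation (3.1) concrete, the following is a minimal simulation sketch in Python with NumPy. The dimensions, the loading pattern (each indicator loading on a single factor), and the variance values are illustrative assumptions rather than values from the analysis, and the function name simulate_measurement is hypothetical.

```python
import numpy as np

def simulate_measurement(eta, alpha, Lam, psi2, rng):
    """Draw u = alpha + Lam @ eta + eps for each row of eta, with eps ~ N_p(0, diag(psi2))."""
    n = eta.shape[0]
    p = alpha.shape[0]
    eps = rng.normal(size=(n, p)) * np.sqrt(psi2)  # independent unique factors
    return alpha + eta @ Lam.T + eps

# Hypothetical sizes: p1 = 3 policy indicators, p2 = 2 support indicators, q = 2 factors.
p1, p2, q = 3, 2, 2
Lam = np.zeros((p1 + p2, q))
Lam[:p1, 0] = [1.0, 0.8, 0.6]  # assumed: policy indicators load on the first factor only
Lam[p1:, 1] = [1.0, 0.7]       # assumed: support indicators load on the second factor only
alpha = np.zeros(p1 + p2)
psi2 = np.full(p1 + p2, 0.25)  # diagonal of Psi

rng = np.random.default_rng(0)
eta = rng.normal(size=(500, q))  # stand-in latent scores, one row per (j, t) pair
u = simulate_measurement(eta, alpha, Lam, psi2, rng)
```

Each row of u stacks the indicators of policy orientation and party support for one party-election pair, mirroring u_jt = (y′_jt, x′_jt)′.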

Simultaneous equation models are widely used in the field of econometrics to deal with the problem of endogeneity (e.g., Hamilton, 1994; Hsiao, 2003). The simultaneous equation representing relationships among a set of endogenous latent variables is specified in Equation (3.2):

η_jt = β_j + Γ_t F(η_j1, …, η_j(t−1)) + δ_jt, (3.2)

where β_j is a q × 1 vector of unit-specific effects, Γ_t is a matrix of coefficients that captures election-to-election effects, F(η_j1, …, η_j(t−1)) is a vector-valued function that allows flexible autoregressive structures and accounts for dynamics, and δ_jt is a q × 1 vector of residuals. The vector of residuals δ_jt is assumed to be independent of η_j1, …, η_j(t−1) and ε_jt, and to be i.i.d. normal with mean zero and covariance matrix Σ, that is, δ_jt ∼ N_q(0, Σ), where Σ is a diagonal matrix with elements σ²_l for l = 1, …, q.

The longitudinal structural equation model proposed here is not identified without identification restrictions. Two common approaches impose identifiability constraints on either the factor loadings or the factor variances.⁵ I discuss the identification strategies for data analysis in the next section.
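The dynamics in Equation (3.2) can be sketched with a short simulation. The example below assumes the simplest choice F(η_j1, …, η_j(t−1)) = η_j(t−1), assumes that no lag term enters at a party's first election, and uses made-up coefficient values; none of these choices is taken from the analysis, and the function name is hypothetical.

```python
import numpy as np

def simulate_latent_paths(beta, Gamma, sigma2, T, rng):
    """Simulate eta_jt = beta_j + Gamma_t @ eta_j(t-1) + delta_jt for each party j.

    beta   : (J, q) unit-specific effects
    Gamma  : (T_max, q, q) election-specific coefficient matrices
    sigma2 : (q,) diagonal of Sigma
    T      : length-J sequence with the number of elections T_j per party
    """
    q = beta.shape[1]
    paths = []
    for j, T_j in enumerate(T):
        eta = np.empty((T_j, q))
        for t in range(T_j):
            delta = rng.normal(size=q) * np.sqrt(sigma2)
            # Assumption: the lag term is dropped at a party's first election.
            lag = Gamma[t] @ eta[t - 1] if t > 0 else np.zeros(q)
            eta[t] = beta[j] + lag + delta
        paths.append(eta)
    return paths

rng = np.random.default_rng(1)
J, q, T_max = 3, 2, 4
beta = rng.normal(scale=0.5, size=(J, q))
# Same illustrative matrix at every election; off-diagonal entries link the two factors.
Gamma = np.stack([np.array([[0.5, 0.1], [0.2, 0.4]])] * T_max)
sigma2 = np.array([0.1, 0.1])
paths = simulate_latent_paths(beta, Gamma, sigma2, T=[4, 3, 2], rng=rng)
```

Returning one array per party keeps the unbalanced panel explicit, since T_j differs across parties.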

This is a Bayesian specification, so I complete the model specification by defining prior distributions. Let θ_u = (α, Λ, Ψ) denote the model parameters in Equation (3.1) and θ_η = (β, Γ, Σ) denote the model parameters in Equation (3.2), where β = {β_j; j = 1, …, J} and Γ = {Γ_t; t = 1, …, T_j}. Following the conventional approach (Carlin and Louis, 2000; Gelman et al., 2004; Robert, 2001), I use conjugate prior distributions for these parameters. The parameters in the measurement equation given in Equation (3.1) have the following prior distributions:

α ∼ N_p(α₀, A₀), (3.3)

ψ²_h ∼ IG(a₀/2, b₀/2), (3.4)

Λ_h | ψ²_h ∼ N_q(Λ₀, ψ²_h H₀), (3.5)

where Λ′_h is the hth row of Λ for h = 1, …, p. The parameters in the simultaneous equation given in Equation (3.2) have the following prior distributions:

⁵ The former involves setting each factor's scale to the scale of one of its indicators, also called the unit loading identification (ULI) constraint; the latter standardizes the latent factors to have zero means and unit variances, also called the unit variance identification (UVI) constraint (Kline, 2010). Moreover, in a Bayesian analysis, informative priors that limit parameters to an appropriate range can also achieve an identified model (Congdon, 2006; Lee, 2007).

β_j ∼ N_q(β₀, B₀), (3.6)

σ²_l ∼ IG(c₀/2, d₀/2), (3.7)

Γ_tl | σ²_l ∼ N(Γ₀, σ²_l G₀), (3.8)

where Γ′_tl is the lth row of Γ_t for l = 1, …, q.⁶ The hyperparameters α₀, A₀, a₀, b₀, Λ₀, H₀, β₀, B₀, c₀, d₀, Γ₀, and G₀ can be given actual values that reflect prior information about the corresponding parameters, or values that yield diffuse priors. Furthermore, I assume prior independence of θ_u and θ_η, so that the joint prior distribution factorizes as π(θ_u, θ_η) = π(θ_u)π(θ_η).

By Bayes' theorem, the joint posterior distribution of interest, π(θ_u, θ_η | u, η), is as follows:

π(θ_u, θ_η | u, η) ∝ p(u, η | θ_u, θ_η) π(θ_u, θ_η)

= p(u | η, θ_u) p(η | θ_η) π(θ_u) π(θ_η). (3.11)
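Taking logarithms of Equation (3.11) and substituting the normal densities implied by Equations (3.1) and (3.2) gives the expansion below. It is a sketch that follows from the stated independence assumptions; the treatment of a party's first election, where no lagged factors exist, is left implicit.

```latex
\begin{aligned}
\log \pi(\theta_u, \theta_\eta \mid u, \eta)
  &= \text{const}
   + \sum_{j=1}^{J} \sum_{t=1}^{T_j}
     \log N_p\!\left(u_{jt} \mid \alpha + \Lambda \eta_{jt},\, \Psi\right) \\
  &\quad + \sum_{j=1}^{J} \sum_{t=1}^{T_j}
     \log N_q\!\left(\eta_{jt} \mid \beta_j + \Gamma_t F(\eta_{j1}, \dots, \eta_{j(t-1)}),\, \Sigma\right) \\
  &\quad + \log \pi(\theta_u) + \log \pi(\theta_\eta).
\end{aligned}
```

Because Ψ and Σ are diagonal, each normal term separates further into a sum over the p indicators or the q factors, which is what makes row-by-row conjugate updates possible.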

Utilizing the idea of data augmentation (Albert and Chib, 1993; Tanner and Wong, 1987), I augment the data by including the unobserved latent variables η and operate with the conditional densities p(u | η, θ_u) and p(η | θ_η) that appear in Equation (3.11).

The Bayesian estimates of the parameters and latent variables can be obtained by generating a sufficiently large number of draws from the joint posterior distribution through MCMC methods (see Casella and George, 1992; Chib and Greenberg, 1995; Gelfand and Smith, 1990), which can be implemented in a variety of existing programs such as WinBUGS (Lunn et al., 2000) and JAGS (Plummer, 2003).⁷
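As an illustration of a single MCMC step, the sketch below draws one loading row Λ_h and its unique variance ψ²_h jointly, conditional on the latent factors, using the standard conjugate normal-inverse-gamma update implied by priors (3.4) and (3.5). It assumes that IG(a, b) is parameterized by shape a and scale b and that the intercept has already been subtracted from the indicator. The function name and the simulated data are hypothetical, and this is one building block rather than the full sampler.

```python
import numpy as np

def draw_loading_row(y, eta, Lam0, H0, a0, b0, rng):
    """One conjugate draw of (Lambda_h, psi2_h) given centered indicator y and factors eta.

    Prior: psi2 ~ IG(a0/2, b0/2) and Lambda_h | psi2 ~ N_q(Lam0, psi2 * H0).
    """
    n, q = eta.shape
    H0_inv = np.linalg.inv(H0)
    prec = H0_inv + eta.T @ eta            # posterior precision, up to the factor 1/psi2
    Hn = np.linalg.inv(prec)
    m = Hn @ (H0_inv @ Lam0 + eta.T @ y)   # posterior mean of the loading row
    shape = 0.5 * (a0 + n)
    scale = 0.5 * (b0 + y @ y + Lam0 @ H0_inv @ Lam0 - m @ prec @ m)
    psi2 = 1.0 / rng.gamma(shape, 1.0 / scale)  # inverse-gamma draw via a gamma draw
    chol = np.linalg.cholesky(psi2 * Hn)
    lam = m + chol @ rng.normal(size=q)         # Lambda_h | psi2 ~ N_q(m, psi2 * Hn)
    return lam, psi2

# Simulated check with assumed true values.
rng = np.random.default_rng(2)
n, q = 2000, 2
eta = rng.normal(size=(n, q))
true_lam, true_psi2 = np.array([1.0, -0.5]), 0.25
y = eta @ true_lam + rng.normal(scale=np.sqrt(true_psi2), size=n)
lam, psi2 = draw_loading_row(y, eta, Lam0=np.zeros(q), H0=10.0 * np.eye(q),
                             a0=2.0, b0=2.0, rng=rng)
```

In a complete sampler this step would alternate with draws of the intercepts, the latent factors, and the structural parameters; with a diffuse prior and many observations, the draw concentrates near the values used to simulate the data.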

The Bayesian structural equation model offers several advantages for the study of dynamic party competition. First, the measurement equation of SEM captures the relationships between unobserved latent variables and observed indicators as well as measurement errors. Party policy orientations cannot be directly observed and can only be measured from information such as party electoral manifestos and survey data. The estimation of unobserved party orientations is modeled by the CFA model.⁸

⁷ Under the assumed conjugate prior distributions, the full conditional distributions required for implementing the Gibbs sampler can be found in the Appendix.

Second, the simultaneous equation of SEM assesses the relationships among latent variables. In the case of dynamic multiparty competition, I treat both party support and party policy orientations as endogenous and I study the interdependence between party support and party policy orientations through a function of lagged endogenous variables. This function allows flexible autoregressive structures and accounts for dynamics, as I show in the analysis that follows.

Third, a multilevel framework is incorporated into the structural equation model to examine the possibility that the changing sets of competing parties influence the relationships between party policy orientations and party support. The proposed model allows the coefficient matrix Γ in the structural equation to vary over elections. Along with the inclusion of unit effects, the varying coefficients explicitly account for past behavior and time-invariant party-specific effects, which enables us to better understand the determinants of both party support and party positions over time.

Finally, a Bayesian approach with MCMC methods offers flexibility for complex model specifications of SEM (Congdon, 2006; Lee, 2007; Song et al., 2011) and resolves the inferential problems that arise in non-Bayesian approaches (Carlin and Louis, 2000; Gelman and Hill, 2007; Gill, 2008a). What makes SEM different from the common regression model is the consideration of measurement errors and the existence of latent random variables. Bayesian approaches to SEM directly model the relationship between the latent factors and their indicators rather than the sample covariance matrix (Lee, 2007). Inferences on latent factors given observed indicators are appropriately reflected by Bayesian inference, which is based on the posterior distribution given the data and the prior distribution rather than on large-sample theory.

⁸ Factor analytic techniques are also used in several methods of manifesto-based left-right placement of political parties (see Gabel and Huber, 2000, for an evaluation of different methods). However, these methods do not provide any measure of uncertainty for the estimated party positions (Benoit, Laver and Mikhaylov, 2009).

In document Essays on Quantitative Methods for Consequences of Political Institutions (Page 66-73)