• No results found

The Bayesian Structural Equation Model

In this section, a longitudinal structural equation model is proposed to deal with the interdependence between party support and party policy orientations as well as measurement issues of party policy positions. The path diagram of the model is presented in Figure 3.1. Consider a set of observations for party policy orientations and party support, respectively: for party j = 1, · · · , J measured at election time point t = 1, · · · , Tj, we have a p1 × 1 random vector yjt for the former and a p2× 1

policy

Figure 3.1.: Path Diagram of the Structural Equation Model for Dynamics of Party Competition.

random vector xjt for the latter. The number of elections parties participate, Tj, may differ from party to party.

The framework of SEM consists of two components: a measurement equation and a structural (or simultaneous) equation. The measurement equation, which relates

the latent variables to corresponding indicators and takes the measurement errors into account, is basically equivalent to a confirmatory factor analysis (CFA) given by

ujt = α + Ληjt + εjt, (3.1)

where ujt = (y0jt, x0jt)0 is a p × 1 vector of observations and p = p1 + p2, α is a p × 1 vector of intercepts, Λ is a p × q factor loading matrix, ηjt is a q × 1 vector of latent common factors representing party policy orientations and party support, and εjt is a p × 1 vector of unique factors. The vector of unique factors εjt is assumed to be independent of latent factors ηjt and identically and independently distributed (i.i.d.) with a normal distribution having mean zero vector and a covariance matrix Ψ, specifically, εjt ∼ Np(0, Ψ), where Ψ is a diagonal matrix with elements ψh2 for h = 1, · · · , p.

Simultaneous equation models are widely used in the field of econometrics to deal with the problem of endogeneity (e.g., Hamilton, 1994; Hsiao, 2003). The simultane-ous equation representing relationships between a set of endogensimultane-ous latent variables is specified in Equation (3.2):

ηjt = βj + ΓtF(ηj1, · · · , ηj(t−1)) + δjt, (3.2)

where βj is a q ×1 vector of unit-specific effects, Γtis a matrix of coefficients that cap-tures election-to-election effects, F(ηj1, · · · , ηj(t−1)) is a vector-valued function which allows flexible autoregressive structures and accounts for dynamics, and δjt is a q × 1 vector of residuals. It is assumed that the vector of residuals δjt is independent of ηj1, · · · , ηj(t−1) and εjt, and identically and independently normally distributed with

means zero and covariance matrix Σ, that is, δjt ∼ Nq(0, Σ), where Σ is a diago-nal matrix with elements σ2l for l = 1, · · · , q. The longitudinal structural equation model proposed here is not identified without imposing the identification restrictions.

Identifiability constraints on either factor loadings or factor variances are two com-mon approaches.5 I discuss the identification strategies for data analysis in the next section.

This is a Bayesian specification, so I complete the model specification by defin-ing prior distributions. Let θu = (α, Λ, Ψ) denote the model parameters in Equa-tion (3.1) and θη = (β, Γ, Σ) denote the model parameters in Equation (3.2) where β = {βj; j = 1, · · · , J } and Γ = {Γt; t = 1, · · · , Tj}. Following the conventional ap-proach (Carlin and Louis, 2000; Gelman et al., 2004; Robert, 2001), I use conjugate prior distributions for these parameters. The parameters that are involved in the measurement equation given in Equation (3.1) have the following prior distributions:

α ∼ Np0, A0), (3.3)

ψ2h ∼ IG(a0/2, b0/2), (3.4) Λh2h ∼ Nq0, ψ2hH0), (3.5)

where Λ0his the hth row of Λ for h = 1, · · · , p. The parameters that are involved in the simultaneous equation given in Equation (3.2) have the following prior distributions:

5The former involves setting each factor’s scale to the scale of one of its indicators, also called unit loading identification (ULI) constraint; the latter is to standardize the latent factors with zero means and variances of unity, also called unit variance identification (UVI) constraint (Kline, 2010).

Moreover, in a Bayesian analysis, informative priors that limit parameters to an appropriate range can also achieve an identified model (Congdon, 2006; Lee, 2007).

βj ∼ Nq0, B0), (3.6) σ2l ∼ IG(c0/2, d0/2), (3.7) Γtl2l ∼ N (Γ0, σl2G0), (3.8)

where Γ0tl is the lth row of Γt for l = 1, · · · , q.6 We can give hyperparameters, α0, A0, a0, b0, Λ0, H0, β0, B0, c0, d0, Γ0, and G0, actual values to reflect prior infor-mation about the corresponding parameters or to give diffuse forms. Furthermore, I assume priori independence of θu, and θη and, therefore, the prior distribution of is given by reflect prior information about the corresponding parameters.

From the Bayes theorem, the joint posterior distribution of interest, π(θu, θη|u, η), is as follows:

π(θu, θη|u, η) ∝ p(u, η|θy, θη)π(θy, θη),

= p(u|η, θu)p(η|θη)π(θu)π(θη). (3.11)

Utilizing the idea of data augmentation (Albert and Chib, 1993; Tanner and Wong, 1987), I augment the data by including the unobserved latent variables η and operate with the conditional densities

π(η|u, θη) ∝ p(η|θη)p(u|η, θu), (3.12) π(θu, θη|u, η) ∝ p(u|η, θu)p(η|θη)π(θu)π(θη). (3.13)

The Bayesian estimates of the parameters and latent variables can be obtained by generating a sufficiently large number of observations from the joint posterior distri-bution through MCMC methods (see Casella and George, 1992; Chib and Greenberg, 1995; Gelfand and Smith, 1990), which can be implemented in a variety of existing programs such as WinBUGS (Lunn et al., 2000) and JAGS (Plummer, 2003).7

The Bayesian structural equation model has several advantages to the application of dynamic party competition. First, the measurement equation of SEM captures the relationships between unobserved latent variables and observed indicators as well as measurement errors. Party policy orientations cannot be directly observed and can only be measured based on information such as party electoral manifestos and

7Under the assumed conjugate prior distributions, the full conditional distributions required for implementing the Gibbs sampler can be found in the Appendix.

survey data. The estimation of unobserved party orientations is modeled by the CFA model.8

Second, the simultaneous equation of SEM assesses the relationships among latent variables. In the case of dynamic multiparty competition, I treat both party support and party policy orientations as endogenous and I study the interdependence between party support and party policy orientations through a function of lagged endogenous variables. This function allows flexible autoregressive structures and accounts for dynamics, as I show in the analysis that follows.

Third, a multilevel framework is incorporated into the structural equation model to examine the possibility that the changing sets of competing parties influence the relationships between party policy orientations and party support. The proposed model allows the coefficients matrix Γ in the structural equation to vary over elections.

Along with the inclusion of unit effects, the setting of varying coefficients explicitly account for past behavior and time-invariant party-specific effects, which enables us to understand better the determinants of both party support and party positions over time.

Finally, a Bayesian approach with MCMC methods offers flexibility for complex model specifications of SEM (Congdon, 2006; Lee, 2007; Song et al., 2011) and re-solves the inferential problems that arise in non-Bayesian approaches (Carlin and Louis, 2000; Gelman and Hill, 2007; Gill, 2008a). What makes SEM different from

8The factor analytic techniques are also used in several methods of manifesto-based left-right place-ment of political parties (see Gabel and Huber, 2000, for an evaluation for different methods).

However, these methods do not provide any uncertainty of estimated party positions (Benoit, Laver and Mikhaylov, 2009).

the common regression model is the consideration of measurement errors and the existence of latent random variables. Bayesian approaches to SEM directly model the relationship between the latent factors and their indicators rather than the sam-ple covariance matrix (Lee, 2007). The inferences on latent factors given observed indicators are appropriately reflected by Bayesian inferences, which are based on the posterior distribution given the data and the prior distribution rather than relying on large-sample theory.

Related documents