Continuous latent variables: Structural Equation Model (SEM)

CHAPTER 1 Literature Review

1.2 Generalized latent variable models

1.2.1 Classic latent variable models

1.2.1.1 Continuous latent variables: Structural Equation Model (SEM)

In the social sciences, there has been a long tradition in modeling the associations between latent and observed variables using structural equations model (SEMs). In mainstream statistics these models are also known as latent variable models or covariance structure models because the fundamental hypothesis is that the covariance matrix of the observed variables is a function of a set of unknown parameters

(

i e H. . ₀: Σ=Σ θ

( ))

, although the mean can also be modeled. SEMs allow us, through a system of linear equations, to model jointly the measurement model of the observed data conditional on the latent variables, and the latent variable model (aka structural model) which summarizes the relationships between latent variables (Bollen, 1989). Originally, SEMs assumed the population was homogeneous (i.e. a single covariance matrix could be used to summarize the associations among variables), and only allowed observed variables (aka indicators) to be continuous. However, conventional SEM was extended to multiple-groups SEM (Joreskog, 1971), and to allow the indicators to have other scales (Muthén, 2002). A recent review of SEM literature with applications to epidemiology for a statistical audience is given

by Sánchez et al (2005). In the Gaussian linear SEM and using the notation and parameterization by Muthén (2002), the measurement model for the i-th subject i=1, 2,K ,n

is defined in terms of the p-dimensional vector of continuous outcome random variables

Y

and is given by

= +

+

Y

ν

Λη

Kx

ε

(2)

where ν_p_×₁,Λ_{p m}_× and K_{p q}_× are parameter matrices,

η

_i is an m-dimensional vector of latent variables,

x

_i is a q-dimensional vector of covariates, and

ε

_i is a p-dimensional vector of residuals with multivariate normal distribution with mean zero-vector and covariance matrix

p p×

Θ . The continuous outcome is typically a multivariate outcome but it can also be repeated measures or a combination. The structural model defines the linear relationship among latent variables and covariates

= +

+

η

α

Bη

Γx

ζ

(3)

where

α

_m_×₁

,B

_{m m}_× and Γ_{m q}_× are parameter matrices, and

ζ

_i is an m-dimensional vector of residuals with multivariate normal distribution with mean zero-vector and covariance matrix

m m×

Ψ

that is assumed to be independent of

ε

_i. The parameter matrix

B

_{m m}_× has zeros in the diagonal because latent variables can only be influenced by other latent variables and not themselves, and I B− is assumed to be nonsingular. This parameterization of the model is equivalent to the well known LISREL model (Joreskog and Sorbom, 1989). Also, note that the m-factor model (section 1.1.1) is a special case of SEM.

Substituting the structural part of the model in the measurement part gives

(

)

[

]

i i i i i

−

= +

−

+

from which it is immediate that

Y x

|

_i has also a normal distribution because the residuals

ζ

and

ε

_iare assumed normal. The full log-likelihood is given by

(

)

( )

* * 1

(

)(

)

2 2 2 ; , n p q log 2 nlog n T FULL

π

tr +  −  = − − − _ + − − _   θ Y x Σ Σ S V µ V µ l

where

θ=vech{ν Λ, ,K, , , , ,α

B

Γ Θ Ψ},V

_iT

=(Y x

_iT

,

T_i

),

µ θ

( )≡E[ ]V

,

Σ θ

( )≡var[ ]V

_i and 1 1 ( )( ) n T i i n i= =

∑

− −

S V V V V is the sample covariance matrix. Maximizing the conditional (on

the covariates) log-likelihood over θ is equivalent to maximizing the full log-likelihood

(

; ,

)

FULL

θ

Y x

l

because the marginal distribution of the covariates is constant with respect to

θ and can be factorized out. In the social sciences literature and SEM software instead of maximizing the likelihood they minimize the fitting function

F

_ML

( )θ

which is mathematically equivalent. The fitting function is a real valued function that measures the discrepancy between the sample moment structure and the one implied by the model, and it is zero when there is perfect fit. In fact, the fitting function is the deviance.

(

)(

)

(

)

{

1 0

}

* * 1 * * 2 ( ) log | | T log | | ( ) ML H H F tr p q n −   = + _ + − − _− − + = −   θ Σ Σ S V µ V µ S l l

Extension to SEM: non-normal responses

In SEM, non-normal responses can be modeled using a generalized linear model approach or a latent response approach. The latent response formulation treats the observed response variable Y as a partial observation of the continuous latent response variable

Y

* using threshold parameters. For example, an observed three-category ordinal response is defined using two thresholds

τ

₁ and

τ

₂ as

* 1 * 1 2 * 2

0 if

1 if

2 if

Y

τ



≤



=_

<

≤



_>



and

Y

_i is replaced by Y_i* in the measurement model given in (2) with the assumption of

conditional normality for Y_i* given

x

_i. For many response types the generalized linear model and the one using the latent response formulation specify equivalent models. However, the latent response formulation allows not only responses with different scales (binary, ordinal, nominal, count, censored, semicontinuous and continuous), but to model responses with different scales simultaneously. Further, conventional generalized linear mixed effects models can be formulated in the SEM framework, and then extended. For example the linear mixed effects model for longitudinal data is the latent curve model4 (Bollen and Curran, 2006) in the SEM framework. However, some extensions that the latent curve model allows are: to regress random coefficients on each other, to estimate factor loadings (i.e. time scores) for models that are not linear in time, and to be combined with Markov models as in the autoregressive latent trajectory (ALT) model (Bollen and Curran, 2004).

Another example of an extended model in the SEM framework is the two-part growth mixture model (Muthén, 2001) inspired from the two-part random-effects model for nonnegative continuous longitudinal data with excess of zeros (Olsen and Schafer, 2001; Berk and Lachenbruch, 2002) and a zero class in a 2-class mixture model (Carlin et al, 2001). The excess of zeros can be due to structural zeros (“true zeros”) yielding a semicontinuous variable, and/or due to left-censored values (i.e. unobserved values from the continuous distribution). Two-part models estimate, separately or jointly, two models: one for the probability of nonzero values and one conditional model for the nonzero

continuous values. Usually the continuous part is skewed so it is previously transformed for the distribution to be approximately normal. Both Olsen and Schafer (2001) and Berk and Lachenbruch (2002) estimated two generalized linear mixed models (logistic and conditional mean log-response models) jointly by having the random intercepts correlated, but the latter integrated the random intercepts to obtain a marginalized model. Another difference is that the Olsen-Schafer model assumed zero values were only true zeros, whereas the Berk- Lachenbruch model distinguished true zeros from censored zeros by adding a parameter for the probability of a left-censored value in the conditional mean model. The Berk- Lachenbruch model could be useful to study food intake because often the intake’s distribution is a nonstandard mixture, a combination of a skewed continuous distribution and a one point mass at zero. These zeros can either be truly non-consumers or very-low- consumers who did not consume the food-item during the reference period.

Software

Gaussian linear SEM can be estimated using specialized commands in general purpose statistical software such as SAS, STATA and R. SEM extensions can be estimated using specialized SEM software like Mplus (Muthén and Muthén , 1998), LISREL (Jöreskog and Sörbom, 1989), and AMOS (Arbuckle, 2006).

In document Index Catalog // Carolina Digital Repository (Page 32-36)