• No results found

Lies, damned lies and latent classes: Can factor mixture models allow us to identify the latent structure of common mental disorders?

N/A
N/A
Protected

Academic year: 2021

Share "Lies, damned lies and latent classes: Can factor mixture models allow us to identify the latent structure of common mental disorders?"

Copied!
28
0
0

Loading.... (view fulltext now)

Full text

(1)

Lies, damned lies and latent classes:

Can factor mixture models allow us to identify

the latent structure of common mental

disorders?

Rachel McCrea

Mental Health Sciences Unit, University College London [email protected]

(2)

Overview

I t d ti t f t i t d l

• Introduction to factor mixture models

• How are they being used in mental health research?

M h li ti

• My research application

• Difficulties with interpreting factor mixture models • Trying to understand my results

(3)

Extension of latent class analysis

• Factor mixture models = extension of LCA

• Cornerstone of LCA is the assumption of conditional independence

– Conditional on class membership, all variables should be uncorrelated

– Observed correlations in the sample should be entirely accounted for by the latent classes

• This rules out severity variation within a class

– e.g. mild and severe depression severity

(4)

Factor mixture models

• Factor mixture models relax the assumption of conditional independence within latent classes

– Allow for severity variations within a class

Include one or more factors to model correlation structure • Include one or more factors to model correlation structure

for the variables in each class

– Combines LCA and CFA/IRT modellingCombines LCA and CFA/IRT modelling

• Specification similar to multi-group factor analysis in Mplus

– Grouping variable unmeasured = latent classes – Specify a factor model within each class

C t i i t t d f t l di t b l

– Can constrain intercepts and factor loadings to be equal – Many different specifications possible

(5)

FMMs popular for mental health research

• Identifying disorder subtypes

• Exploring diagnostic boundaries (my focus) • Exploring diagnostic boundaries (my focus)

– e.g. anxiety and depressive disorders

– One multi-faceted distress disorder or several distinct disorders?One multi faceted distress disorder or several distinct disorders?

• Resolving the ‘continuity controversy’g y y

– Do symptoms vary along continuum with normal functioning? – Or do we have a distinct disorder category with objective

b d i ? ‘t ’

boundaries? – a ‘taxon’

(may still be severity variation within a taxon)

S i t t f h i t d t t t

(6)

Can we identify the ‘true’ latent structure?

• Simulation studies suggest it may be possible (Lubke and Neale, 2006)

– Generated data to fit FA, FMM and LC models (continuous items)

– Datasets all analysed by each model structure

– AIC & saBIC usually allowed correct structure to be identified

• Less clear for ordinal data (Lubke and Neale, 2008)

– BIC best for identifying correct structure

– Fit indices tended to favour models with too few classes

(many category intercepts needed per extra class) (many category intercepts needed per extra class)

(7)

Example – Autism Spectrum Disorder

“Validation of Proposed DSM-5 Criteria for Autism Spectrum Disorder” (Frazier et al., 2012)

• Children with diagnosed ASD and undiagnosed siblings • Compared LCA, EFA and FMMs

• Chose FMM with 2 classes and 2 dimensions

• Authors’ conclusions:

– Validates DSM-5 proposal for categorical ASD diagnosis with 2 dimensions within it

“The presence of an ASD versus non ASD distinction coheres with – The presence of an ASD versus non-ASD distinction coheres with

(8)

Example 2 – Health anxiety

Should health anxiety be carved at the joint?”

(Asmundson et al 2012) (Asmundson et al., 2012)

• Used large samples of undergraduate studentsUsed large samples of undergraduate students • Selected model: FMM with 2 classes

– ‘anxious’ and ‘nonanxious’anxious and nonanxious

• Authors’ conclusion (from the abstract):( )

– “Contrary to current conceptualizations […], the FMM results

indicate the latent structure of health anxiety to be taxonic rather than continuous ”

(9)

My application of FMMs

• Aim: to investigate the latent structure of symptoms of common mental disorders

• Data: 3 household surveys of psychiatric morbidity in UK – repeated cross-sectional surveys

(1993, 2000, 2007)

• Combined dataset ~ 22,000 individuals aged 16-64

• Symptoms of CMD measured by standardised interview

(10)

Structure of CIS-R interview

• 13 sections covering different symptom areas

– Ordinal score (0-4) for each symptom

– Based on symptoms from past 7 days only – Symptoms not necessarily signs of illness

• Symptoms covered: • Somatic symptoms Fatigue • Worry Anxiety • Fatigue • Concentration/forgetfulness • Sleep • Irritability • Anxiety • Phobias • Panic • Compulsions • Irritability

• Worry about physical health • Depression

• Compulsions • Obsessions

(11)

Model comparison from CIS-R data: n=11,230

Model # p BIC

Pairs of variables with ‘poor fit’ (out of 78)

p p ( ) Factor 1f 65 182,531 35 Based on bivariate Factor mixture 1f 2c 119 182,015 8 Factor mixture 1f 3c 173 181 895 4 bivariate chi-squares in Tech10 Factor mixture 1f 3c 173 181,895 4 Latent class 4c 211 182,961, 8

(12)

Factor mixture model 1 factor 3 classes Factor mixture model – 1 factor, 3 classes

1

Red class 12%

0.8

re >

=

2 Green class Blue class 29%59%

0.6 a vi n g sco r 0.4 b ilit y of h a 0.2 Pro b a b at ic g ue F or g e ep ility o rry s io n o rry iety o bi a a ni c o ns o ns 0 So m Fa ti g Co n c /F Sle Irri ta b H e a lth w o D epr es s W o An x Ph o P a C o m pul si o O b se ssi o

(13)

What do the latent classes mean?

• Do these three latent classes really represent distinct clinical groups in the population?

clinical groups in the population?

– I need to be sure before making claims

– Class membership probs. – no obvious clinical interpretation

• FMMs allow for a severity dimension within class

– Can’t be simple severity classes

– If FMM fits better than factor model without classes, surely classes must be real groups?

must be real groups?

(14)

Two roles of latent classes

• Direct role: represent true groups

• Indirect role: approximating a continuous distribution

• Situations where a factor mixture model may appear to describe data better in the absence of true groups

(Bauer and Curran, 2004)

• Non-normality of the factor(s)

• Miss-specification of measurement model

• Non-linearity (for logistic models: on logit scale)

M t l t lt ti t l d l

(15)

Are classes just modelling non-normality?

• Standard factor model assumes normally distributed factor

40

c

e (

%

)

– Inappropriate for mental health – Classes accommodating this?

20 30 s s pr eval en c

• Test: Fit ‘latent class factor model’ 01 0 ti m at ed cl a s

– Approximates continuous factor distribution with ‘located’ classes

– ‘Non-parametric’ factor analysis LCFA class factor scores

Es

t

-8 -6 -4 -2 0 2

– Non-parametric factor analysis

• Result: no improvement on fit of standard factor model

- 2 and 3 class FMMs still much better

(16)

What about non-linearity?

• Factor model for ordinal data related to ordinal logistic regression:g

– Cumulative probability model (as in ‘ologit’ in Stata)

– Assumes linear relationship between probabilities of responses to each variable and the factor on logit scale responses to each variable and the factor on logit scale

– Assumes proportional odds for each ordered responseAssumes proportional odds for each ordered response category (= equal slopes = parallel lines)

(17)

Linearity and proportional odds assumptions

10 1 2 3 s cal e 1. 0 1 2 3 e r 5 3 4 on t h e l o gi t s 6 0. 8 3 4 c or e or hi gh e 0 o re or hi gher 0 .4 0 .6 f o b ta in in g s c -5 a bi li ty of s c o 0. 2 0 P ro babi li ty o f 5 0 5 10 15 -1 0 Pr o b a 5 0 5 10 15 0. 0 P -5 0 5 10 15 Factor score -5 0 5 10 15 Factor score

(18)

Investigating nonlinearity

• Maybe the FMM is relaxing the linearity and proportional odds assumptions

• How to investigate this for ordinal data?

– No straightforward way to assess true shape of relationship – Used a whole suite of approaches

All h d li i i b f i l i i

– All had some limitations, but fairly consistent picture

Cl t t t

• Clearest to present:

– Lowess curves (descriptive: form of non-parametric regression) Used summed item scores to ‘represent’ factor scores

(19)

Lowess curves

Describing cumulative probabilities, as in:

Fatigue Health worry Compulsions

0. 8 e or hi ghe r 0. 4 it y of sco re 0. 0 P robabi l 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40

(20)

Conclusions

• Factor mixture models appear to describe the CIS-R data better than models without classes

better than models without classes

• However, evidence of non-linearity (on logit scale) andHowever, evidence of non linearity (on logit scale) and violations of proportional odds

• Careful examination suggests latent classes are accommodating these violations

– Class allocations consistent with patterns of non-linearity – Implies classes unlikely to represent real groups

– BUT can’t prove this either way – BUT can t prove this either way

(21)

Conclusions (continued)

• No clear evidence for any ‘disorder classes’

• BUT doesn’t prove that there are no discrete disorders

‘Signal’ drowned out by ‘noise’ from factor model misfit? – Signal drowned out by noise from factor model misfit? – May be impossible to distinguish dimensions from discrete

categories empirically

– Key discriminating characteristics not measured? – Lack of power?

• My view: disappointingly ambiguous conclusions for a very time consuming exercise

(22)

Interpretation of FMMs in the literature

• Some papers mention alternative roles of classes

– Usually simulation studies or illustration papers – Tend to avoid drawing substantive conclusions

• ‘Taxonicity’ of classes often unquestioned in applied • Taxonicity of classes often unquestioned in applied

psychiatric research papers

– Papers frequently don’t mention that classes may reflect non-p q y y normality or other factor model violations

– Authors may not be aware

• BUT model violations may be common in mental health

– Measures often designed as screening toolsMeasures often designed as screening tools – Items not selected for psychometric properties

(23)

Lies, damned lies and latent classes?

• These hybrid mixture models are very complex

– Huge effort required to develop real understanding – Many readers will have to take findings ‘on trust’

– Reviewers may lack sufficient expertise to spot problems

• FMMs present severe risk of over-interpretation

– Not a magic bullet for identifying true latent structureg y g – Could lead to research blind alleys

R h ti FMM t hi hli ht d l

• Researchers reporting FMMs must highlight and explore alternative interpretations

If not we should be sceptical of any claims – If not, we should be sceptical of any claims

(24)

References

• Asmundson, G.J.G. et al., 2012. Should health anxiety be carved at the joint? A look at

the health anxiety construct using factor mixture modeling in a non-clinical sample.

Journal of Anxiety Disorders, 26(1), pp.246–251.

• Bauer, D.J. & Curran, P.J., 2004. The integration of continuous and discrete latent

variable models: Potential problems and promising opportunities. Psychological

Methods, 9(1), pp.3-29.

• Frazier, T.W. et al., 2012. Validation of Proposed DSM-5 Criteria for Autism Spectrum

Disorder. Journal of the American Academy of Child & Adolescent Psychiatry, 51(1),

pp.28–40.e3. pp.28 40.e3.

• Lubke, G.H. & Neale, M.C., 2006. Distinguishing between latent classes and continuous

factors: Resolution by maximum likelihood? Multivariate Behavioral Research, 41(4), y , ( ),

pp.499–532.

• Lubke, G.H. & Neale, M.C., 2008. Distinguishing between latent classes and continuous

factors with categorical outcomes: Class invariance of parameters of factor mixture models. Multivariate Behavioral Research, 43(4), pp.592–620.

(25)

Three main families of factor mixture model

Measurement

invariance?

(intercepts/loadings)

Factor variance

within class? Example

( g )

‘Semi-parametric

factor model’ Yes

Yes

a.k.a. mixture factor model factor*;

‘Latent classLatent class N

factor model’

a.k.a. non-parametric factor model Yes No factor@0; ‘Factor mixture No / weak Yes model’ No / weak factor*;

(26)

Mplus code: ‘Latent class factor model’ 1f 4c

f $ * ( ) !This code is for ordinal data

Variable: … Categorical are ; [ c#1*]; [ c#2*]; [ c#3*]; [ fatigue$3*] (22); [ fatigue$4*] (23); ... Categorical are …;

Classes = c(4); ! num. classes

Analysis:

%C#1% !One section for each class factor BY somatic@1; factor BY fatigue* (1); %C#2% factor BY somatic@1; factor BY fatigue* (1); factor BY concforg* (2); Estimator = MLR; Algorithm = Integration; Type = Mixture; St t 100 50 factor BY concforg* (2); factor BY sleep* (3); ... f t @0 !Fi d @0 i ll l factor BY concforg* (2); factor BY sleep* (3); ... Starts = 100 50; Model: %OVERALL%

factor@0; !Fixed @0 in all classes [ factor@0 ]; !Fixed in 1 class only

[ somatic$1* ] (16);

factor@0;

[ factor* ]; !Free in other classes

[ somatic$1* ] (16); %OVERALL% factor BY somatic; factor BY fatigue; factor BY concforg; [ somatic$1 ] (16); [ somatic$2* ] (17); [ somatic$3* ] (18); [ somatic$4* ] (19); [ somatic$1 ] (16); [ somatic$2* ] (17); [ somatic$3* ] (18); [ somatic$4* ] (19); factor BY sleep; … [ fatigue$1* ] (20); [ fatigue$2* ] (21); [ fatigue$1* ] (20); [ fatigue$2* ] (21); ...

(27)

Mplus code: a ‘Factor mixture model’ 1f 3c

f $ * ( ) !This code is for ordinal data

Variable:

Categorical are ;

[ c#1*]; [ c#2*];

%C#1% !One section for each class

[ fatigue$3*] (22); [ fatigue$4*] (23); ... Categorical are …; Classes = c(3); Analysis:

%C#1% !One section for each class factor BY somatic@1;

factor BY fatigue* (1); factor BY concforg* (2);

%C#2%

factor BY somatic@1;

factor BY fatigue* (1); !Loadings factor BY concforg* (2);!still Estimator = MLR; Algorithm = Integration; Type = Mixture; St t 2000 500 !N d l t factor BY sleep* (3); ... f t * !F i ll l

factor BY concforg* (2);!still factor BY sleep* (3); !equal. ...

Starts = 2000 500; !Need lots

Model:

%OVERALL%

factor* ; !Free in all classes [ factor@0 ]; !Fixed in all classes

[ somatic$1* ] ; !Intercepts can now

factor* ; !Free [ factor@0 ]; !Fixed [ somatic$1* ] ; %OVERALL% factor BY somatic; factor BY fatigue; factor BY concforg;

[ somatic$1 ] ; !Intercepts can now [ somatic$2* ] ; !differ between [ somatic$3* ] ; !classes. [ somatic$4* ] ; [ somatic$1 ] ; [ somatic$2* ] ; [ somatic$3* ] ; [ somatic$4* ] ; factor BY sleep; … [ fatigue$1* ] ; [ fatigue$2* ] ; [ fatigue$1* ] ; [ fatigue$2* ] ; ...

(28)

Example code for lowess curves in R

## This code was written for ordinal data with five categories per item (coded as 0-4)

## Curves estimated separately – may cross inappropriately in regions where data are sparse

library(Hmisc) ## Package containing the plsmo() function responses < read table("C:/Data/mplusexport dat" sep=" ") responses <- read.table("C:/Data/mplusexport.dat", sep=",")

item <- 1 ## This number should be the column number of the item you wish to plot score4 <- as.numeric(responses[,item]==4)( p [, ] )

score3 <- as.numeric(responses[,item]==4|responses[,item]==3)

score2 <- as.numeric(responses[,item]==4|responses[,item]==3|responses[,item]==2) score1 <- as.numeric(responses[,item]==4|responses[,item]==3|responses[,item]==2|

responses[,item]==1)

totscores <- rowSums(responses) ## Assumes there are no other variables in the dataset restscores <- totscores - responses[,item]

plsmo(restscores, score1, ylab="Probability of score or higher", xlab="Restscore",

ylim=c(0,1), trim=0, f=0.1) ## "f=0.1" controls the spikiness of the curve - it can range from 0 to 1. plsmo(restscores, score2, trim=0, add=T, f=0.1)

p ( , , , , )

plsmo(restscores, score3, trim=0, add=T, f=0.1) plsmo(restscores, score4, trim=0, add=T, f=0.1)

References

Related documents

Starting with a population of 1,000 patients, Figure 3 (left) represents the number of patients who relapse after the end of follow-up and who could have been

Nortriptyline Hydrochloride Valerian 12.5 MG/QD Unknown Unknown 12.5 mg, Daily UNK Mental disorder Product used for unknown indication Akathisia Delirium Drug interaction

It is important that you select your school worded as Northwest Florida State College - Welding - Drug Screen Only - Welding. Complete all required fields

We argue that FDI contributes to economic development by improving institutional quality in the host country and we attempt to test this proposition using a large panel data set of

In the case of string theory, the natural energy scale is much higher, and so it is more difficult to make contact with experiment. To date, the interactions with mathematicians

This chapter extends the existing literature by contextualising health expenditure within a broader framework of health seeking behaviour. After brief consideration of the way in

Partially as a result of the broadening of the defi- nition of bank in the BHCA through time, both nonfi- nancial (commercial) firms that wished to own a bank and were prohibited

patentfreedom.com/about-npes/industry/ (last visited Jan. Executive Office of the President, supra note 89, at 8. “[T]he sheer volume of patents makes searching for relevant