• No results found

Bayesian hypothesis testing for hierarchical models using transdimensional Markov chain Monte Carlo methods

N/A
N/A
Protected

Academic year: 2021

Share "Bayesian hypothesis testing for hierarchical models using transdimensional Markov chain Monte Carlo methods"

Copied!
43
0
0

Loading.... (view fulltext now)

Full text

(1)

Bayesian hypothesis testing for hierarchical models

using transdimensional Markov chain Monte Carlo

methods

Tom Lodewyckx (University of Amsterdam)

Michael Lee (University of California, Irvine)

Eric-Jan Wagenmakers (University of Amsterdam)

July 28, 2008

July 28, 2008

(2)

OUTLINE

1

Bayesian hypothesis testing

2

Hypothesis testing with transdimensional MCMC

3

Applications

(3)

OUTLINE

1

Bayesian hypothesis testing

2

Hypothesis testing with transdimensional MCMC

3

Applications

4

Conclusion

(4)

Question

When tossing a coin frequently, the

true probabilities of heads and tails

are equal

(5)

Hypothesis testing

N: Number of tosses

K

: Frequency of tails

(6)

N: Number of tosses

K

: Frequency of tails

Null model (M

0

)

K

Binomial

(

θ,

N) and

θ

=

.

5

Equal probabilities

Full model (

M

1

)

K

Binomial

(

θ,

N) and

θ

6

=

.

5

Heads or tails more probable

(7)

Hypothesis testing

N: Number of tosses

K

: Frequency of tails

Null model (M

0

)

K

Binomial

(

θ,

N) and

θ

=

.

5

Equal probabilities

Full model (

M

1

)

K

Binomial

(

θ,

N) and

θ

6

=

.

5

Heads or tails more probable

Which is the most plausible assumption for

θ?

Bayes factor

(8)

What is a Bayes factor?

Model selection tool in Bayesian framework

Compares the “evidences” of both models

Model with highest evidence is supported

Quantification of how strong that support is

(9)

The Bayes factor

What is a Bayes factor?

Model selection tool in Bayesian framework

Compares the “evidences” of both models

Model with highest evidence is supported

Quantification of how strong that support is

Notation

B

10

is the Bayes factor in favor of

M

1

(full model)

(10)

Formal definition

B

10

=

Marginal LL (M

1

)

Marginal LL (M

0

)

=

f

(y

|

M

1

)

f

(y

|

M

0

)

(11)

The Bayes factor

Formal definition

B

10

=

Marginal LL (M

1

)

Marginal LL (M

0

)

=

f

(y

|

M

1

)

f

(y

|

M

0

)

=

Posterior model odds

Prior model odds

=

P(M

1

|

y)

/

P(M

0

|

y)

P

(M

1

)

/

P(M

0

)

(12)

Interpretation scheme Raftery (1995)

log(B

10

)

Evidence?

<

-5

Very strong evidence for

M

0

-5 to -3

Strong evidence for

M

0

-3 to -1

Positive evidence for

M

0

-1 to 0

Weak evidence for

M

0

0

No evidence

0 to 1

Weak evidence for

M

1

1 to 3

Positive evidence for

M

1

3 to 5

Strong evidence for

M

1

(13)

The Bayes factor

Advantages

Intuitive

Model averaging

Model complexity

(14)

Advantages

Intuitive

Model averaging

Model complexity

Problems

Depends on prior distribution

Computational

(15)

Research goal

Research goal:

Estimating Bayes factors should be..

1

easy to implement

2

precise

3

flexible

(16)

OUTLINE

1

Bayesian hypothesis testing

2

Hypothesis testing with transdimensional MCMC

3

Applications

(17)

Transdimensional MCMC methods

Markov chain Monte Carlo (MCMC) methods

What?

Simulation techniques to simulate values from

posterior distribution

Why?

Facilitate Bayesian parameter estimation

Where?

Parameter space Ω = [Θ] = [

{

α, β, γ, . . .

}

]

(18)

Markov chain Monte Carlo (MCMC) methods

What?

Simulation techniques to simulate values from

posterior distribution

Why?

Facilitate Bayesian parameter estimation

Where?

Parameter space Ω = [Θ] = [

{

α, β, γ, . . .

}

]

Transdimensional MCMC methods

What?

MCMC methods that operates on at least 2 models

Why?

Simultaneous estimation of Bayesian models,

hypothesis testing, model selection

(19)

Transdimensional MCMC methods

Transdimensional MCMC methods of interest

What?

MCMC methods that operates on

M

0

and

M

1

Why?

hypothesis testing

Where?

Parameter space Ω = [M

,

Θ

0

,

Θ

1

]

(20)
(21)

Hypothesis testing

Specify prior distribution:

Ω = [M

,

Θ

0

,

Θ

1

]

Transdimensional MCMC sampling:

Simulate values from

posterior distribution of

M

M

= 0

Simulate values posterior Θ

0

M

= 1

Simulate values posterior Θ

1

(22)

Specify prior distribution:

Ω = [M

,

Θ

0

,

Θ

1

]

Transdimensional MCMC sampling:

Simulate values from

posterior distribution of

M

M

= 0

Simulate values posterior Θ

0

M

= 1

Simulate values posterior Θ

1

Estimate Bayes factor:

Use prior and posterior chances of

M

B

10

=

P

(M

= 1

|

y

)

/

P

(M

= 0

|

y)

(23)

Hypothesis testing: Prior distribution

0

1

M

Prior density

0

.5

1

θ

Prior density

M

1

0

.5

1

0

1

2

θ

Prior density

M

0

0

.5

1

0

(24)

M 0 2500 5000 0 1 Iteration θ

M

1

0 2500 5000 0 .5 1 θ

M

0

0 2500 5000 0 .5 1
(25)

Hypothesis testing: Bayes factor

log(BF

10

) =

1

.

81

0

1

M

Prior density

0

.5

1

0

1

M

Prior density

0

.5

1

(26)

θ

M

1

0

.5

1

θ

M

0

0

.5

1

(27)

C

3

method

Combined Carlin & Chib (C

3

) method

Pseudopriors are used for sampling from the parameter vector

when the model is deactivated

Recommended choice: posterior distribution

Combination of three sampling paths:

1

Ω = [

M

,

Θ

0

,

Θ

1

]

2

Ω = [Θ

0

]

Pseudoprior Θ

0

3

Ω = [Θ

1

]

Pseudoprior Θ

1

(28)

θ

M

1

0

2500

5000

0

.5

1

θ

M

0

0

2500

5000

0

.5

1

(29)

OUTLINE

1

Bayesian hypothesis testing

2

Hypothesis testing with transdimensional MCMC

3

Applications

4

Conclusion

(30)

Question

Subliminal priming studies

assume that the prime

stimulus is perceived on a

subliminal level

(31)

Is priming truly subliminal?

Study by Rouder, Morey, Speckman & Pratte (2007)

Visual stimuli [2

,

3

,

4

,

6

,

7

,

8]

In each trial, participant was presented a 22 ms prime

stimulus, followed by a 200 ms target stimulus

Indicate whether prime stimulus was higher than 5

(“Yes”/“No”)

Results in

K

correct identifications out of

N

trials

(32)

Null model (M

0

)

K

Binomial(

θ,

N) and

θ

=

.

5 (at chance)

(33)

Is priming truly subliminal?

Null model (M

0

)

K

Binomial(

θ,

N) and

θ

=

.

5 (at chance)

Subliminal perception of prime stimulus

Full model (

M

1

)

K

Binomial(

θ,

N) and

θ > .

5 (above chance)

Supraliminal perception of prime stimulus

(34)

Null model (M

0

)

K

Binomial(

θ,

N) and

θ

=

.

5 (at chance)

Subliminal perception of prime stimulus

Full model (

M

1

)

K

Binomial(

θ,

N) and

θ > .

5 (above chance)

Supraliminal perception of prime stimulus

Estimate log Bayes factor with Combined Carlin & Chib

method for each subject (non-hierarchical) and for the group

(35)

Non-hierarchical application: Graphical model

φ

θ

K

N

K

Binomial(

θ, N

)

θ

= Φ(

φ

)

M

0

:

φ

= 0

M

1

:

φ

Normal

(0

,

+

)

(0

,

1)

(36)

Log Bayes factor C³ method

−3

−1

0

1

3

5

Very strong evidence for M1

Strong evidence for M1

Positive evidence for M1

Weak evidence for M1

Weak evidence for M0

Positive evidence for M0

(37)

Non-hierarchical application: Validation

Log Bayes factor IS method

Log Bayes factor C³ method

−4

−2

0

2

4

6

8

−4

−2

0

2

4

6

8

(38)

µ

φ

σ

φ

φ

i

θ

i

K

i

N

i

K

i

Binomial(

θ

i

, N

i

)

θ

i

= Φ(

φ

i

)

φ

i

Normal

(0

,

+

)

(

µ

φ

, σ

φ

)

σ

φ

Uniform(0

,

1

.

5)

M

0

:

µ

φ

= 0

M

1

:

µ

φ

Normal

(0

,

+

)

(0

,

1)

i

= 1

, . . . ,

27

(39)

Hierarchical application: Results & Validation

Log Bayes factor?

C

3

method:

log

(BF

10

)

≈ −

3

.

6

Strong evidence

M

0

Consistent with importance sampling method

(40)

OUTLINE

1

Bayesian hypothesis testing

2

Hypothesis testing with transdimensional MCMC

3

Applications

(41)

Conclusion

Estimating Bayes factors with C

3

method is..

1

easy to implement

2

precise

3

flexible

(42)

Estimating Bayes factors with C

3

method is..

1

easy to implement

2

precise

3

flexible

C

3

method seems a good candidate for Bayesian

hypothesis testing in experimental psychology

(43)

Questions

References

Related documents

7-9 show the corrosion potential of carbon steel, austenitic (SS-304) and duplex (SS-23-04) stainless steel bars of the multi-reinforcement probes embedded in three segments ( A ,

It initially illustrates the relationship between PPP and sustainable development and then uses a case study of one of the largest PPP hospital projects in the UK,

developments in areas such as: the guidance, navigation, and control of aircraft, spacecraft, missiles, robotics, and other aerospace systems; general aviation; in-flight

In conclusion, carbon dioxide functions as an accurate analog of nitrous oxide for self- pressurizing propellant tank testing at low to medium mass flow rates, but once the flow rates

In Figure 9, the acceleration response spectra for horizontal (RSHA) and vertical (RSVA) components (at 5% damping), observed at two different stations (AQG – class B and GSA –

1) The host team(s) will always have their choice of dugout throughout the tournament. 2) The host team(s) will be guaranteed the evening games on the first two nights only. 5)

Unit costs at 2013/2014 prices 10 – 12 were applied to the resource use in the model to estimate the total NHS cost of patient management from the time a patient entered the data

Survey respondents with rented housing problems most often chose to handle their problems alone (or obtain some informal advice) (73 per cent of problems) with just over 6 per