• No results found

(Semi)Parametric Models vs Nonparametric Models

N/A
N/A
Protected

Academic year: 2021

Share "(Semi)Parametric Models vs Nonparametric Models"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 1

(Semi)Parametric Models

vs

Nonparametric Models

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 2

Probability Models

I define parametric, semiparametric, and nonparametric models in the two sample setting

My definition of semiparametric models is a little stronger than some statisticians

™The distinction is to isolate models with

assumptions that I think too strong Notation for two sample probability model

G Y Y F X X iid m iid n ~ , , : Control ~ , , : Treatment 1 1 K K

Probability Models

Parametric models

F, Gare known up to some finite dimensional parameter vectors

( )

(

)

( )

(

)

( )

unknown

and

l

dimensiona

finite

is

form

known

has

,

:

where

,

,

Φ

Ψ

Φ

Ψ

=

Φ

Ψ

=

r

r

r

Y X

t

t

G

t

t

F

Probability Models

Parametric models: Examples

( )

( )

( )

( )

( )

µ

( )

ν

ν

µ

τ

ν

σ

µ

E

Y

E

X

B

Y

B

X

N

Y

N

X

j i j i j i

~

~

:

l

Exponentia

,

1

~

,

1

~

:

Bernoulli

,

~

,

~

:

Normal

2 2

(2)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 5

Probability Models

Semiparametric models

Forms of F, Gare unknown, but related to each other by some finite dimensional parameter vector

™Gcan be determined from Fand a finite

dimensional parameter

™(Most often: under the null hypothesis, F = G)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 6

Probability Models

Semiparametric models

Forms of F, Gare unknown, but related to each other by some finite dimensional parameter vector

™Gcan be determined from Fand a finite

dimensional parameter

( )

( )

( )

( )

( )

unknown and l dimensiona finite is bility) (identifia known and l dimensiona finite is ) (in form unknown has , : where , , Y X Y X t t t G t t F Φ Φ ⋅ ⋅ Ψ Φ Ψ = Φ Ψ = r r r r February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 7

Probability Models

Semiparametric models: Examples

( )

(

)

( )

( )

( )

( )

[

( )

]

γ

γ

σ

µ

µ

t

F

t

G

t

F

t

G

t

F

t

G

t

F

t

G

=

=

=

=

1

-1

:

hzd

Prop

:

failure

Accel

:

scale

-Shift

:

Shift

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 8

Probability Models

Nonparametric models

Forms of F, Gare completely arbitrary and unknown

™An infinite dimensional parameter is needed to

derive the form of Gfrom F

™(I demand that the above hold under all

hypotheses, unless the test is consistent when F ≠G)

Examples of truly nonparametric analyses:

™Kolmogorov-Smirnov test

(3)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 9

The Problem:

A Logical Disconnect

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 10

“Because the light is so

much better

here under the streetlamp”

- a drunk looking for the keys

he lost half a block away

The Problem

In the development of statistical models, and even moreso in the teaching of statistics, parametric probability models have received undue emphasis

Examples:

™t test is typically presented in the context of the

normal probability model

™theory of linear models stresses small sample

properties

™random effects specified parametrically ™Bayesian (and especially hierarchical Bayes)

models are replete with parametric distributions

The Problem

ASSERTION: Such emphasis is not typically in keeping with the state of knowledge as an experiment is being conducted

The parametric assumptions are more detailed than the hypothesis being tested, e.g.,:

™Question: How does the intervention affect the

first moment of the probability distribution?

™Assumption: We know how the intervention

affects the 2nd, 3rd, …,

central moments of the probability distribution.

(4)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 13

The Problem

Conditions under which an intervention might be expected to affect many aspects of a probability distribution

Example 1: Cell proliferation in cancer prevention

™Within subject distribution of outcome is

skewed (cancer is a focal disease)

™Such skewed measurements are only

observed in a subset of the subjects

™The intervention affects only hyperproliferation

(our ideal)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 14

The Problem

Conditions under which an intervention might be expected to affect many aspects of a probability distribution (cont.)

Example 2: Treatment of hypertension

™Hypertension has multiple causes ™Any given intervention might treat only

subgroups of subjects (and subgroup membership is a latent variable)

™The treated population has a mixture

distribution

− (and note that we might expect greater

variance in the group with the lower mean)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 15

The Problem

Conditions under which an intervention might be expected to affect many aspects of a probability distribution (cont.)

Example 3: Effects on rates

™The intervention affects rates

™The outcome measures a cumulative state ™Arbitrarily complex mean-variance

relationships can result

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 16

The Problem

These and other mechanisms would seem to make it likely that the problems in which a fully parametric model or even a semiparametric model is correct constitute a set of measure zero

Exception: independent binary data must be binomially distributed in the population from which they were sampled randomly (exchangeably?)

(5)

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 17

The Problem

Impact on what we teach about optimality of statistical models

Clearly, parametric theory may be irrelevant in an exact sense (though as guidelines it is still useful) Much of what we teach about the optimality of nonparametric tests is based on semiparametric models

™e.g., Lehmann, 1975: location-shift models

February, 2003

© 2001, 2003 Scott S. Emerson M.D., Ph.D. Nonparametric Models: 18

The Problem

Example: the Wilcoxon rank sum test

Common teaching:

™Not too bad against normal data

™Better than t test when data have heavy tails

More accurate guidelines:

™Above holds when a shift model holds for

some monotonic transformation of the data

™If propensity to outliers (mixture distributions)

is different between groups, the t test may be better even in presence of heavy tails

™In the general case, the t test and the

Wilcoxon are not testing the same summary measure

References

Related documents

The “second pillar” instruments of the European Union’s (EU) Common Agricultural Policy (CAP), such as investment aid, agri-environmental payments, and a broad range of

To investigate whether these differences could explain the differences in improvement after the training, we included the variables for illness duration and baseline SSL scores

We are sympathetic with Bentley et al s attempt to encompass the wisdom of crowds in a generative model, but posit that a successful at using Big Data will include more

(For the simple hypothesis of nor- mality (mean and variance known) the test statistic is asymptotically chi-square distributed with n.classes - 1 degrees of freedom.) This is,

Vehicle maintenance decisions are fully delegated to Municipal Fleet Maintenance Division whereas Ottawa Police Service Fleet and Technical Services concentrate mostly on

Studying a sample of 18 countries over a period of 30 years revealed that social security transfers are not affected by social and political openness and that the generosity

• Capul de imprimare nu poate fi aliniat prin intermediul butoanelor de control ale aparatului digital. Pute Ġ i alinia capul de imprimare prin intermediul butoanelor de control

The curriculum for the 4-year accounting degree equips graduates with a knowledge base in accounting, auditing, economics, marketing, production and operations management,