• No results found

ANOVA Notes Page 1. Analysis of Variance for a One-Way Classification of Data

N/A
N/A
Protected

Academic year: 2021

Share "ANOVA Notes Page 1. Analysis of Variance for a One-Way Classification of Data"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

ANOVA Notes Page 1

Al Lehnen Madison Area Technical College 4/19/2010

Analysis of Variance for a One-Way Classification of Data

Consider a single factor or treatment done at k levels (i.e., there are 1, 2, 3, … k different

variations on the prescribed treatment). Within a given treatment level there are n measurements i

or scores. The subscript or index i is between 1 and k and labels the different factor levels or

treatment variations. The j’th score in the i’th level is designated as

,

i j y

i is the the treatment factor index; 1 i≤ ≤ k

j labels the score the within the i’th treatment: 1≤ ≤ . j ni

Note: It is not required and it is typically not the case that you have an equal number of measurements from each treatment group.

The null hypothesis asserts that no treatment population differs from any other. Thus, the k

populations of scores of the treatment levels should have the same mean and variance. For theoretical convenience, we will assume the scores are normally distributed with a common variance within each treatment level, but the tests which follow are fairly robust for departures from normality.

Null Hypothesis: H : 0 μ1= μ2= μ3=… μk

This is equivalent to saying that the k different treatments are really just k different samples all

taken from a single population. Thus, all of the variation seen in the measured scores is due to sample variations, i.e., randomness.

The total number of scores is n1+n2+n3+…nk =n. In R. Johnson’s text, this is designated as upper case N, however, to be consistent with our earlier notation that lower case n is associated

with a sample and upper case with a population, these notes will use n.

1 k i i n n = =

(1)

The average of all scores is often called the “grand mean” and is given by

1 2 1,1 1,2 1,3 1,n 2,1 2,2 2,3 2,n k,1 k,2 k,3 k n, k y y y y y y y y y y y y y n + + + + + + + + + + + = … … … … .

For notational convenience, this is written as a double summation.

, 1 1 i n k i j i j y y n = = =

∑∑

(2)

If the null hypothesis is true and all of the treatments give the same mean result, then y should be

a fair estimate of every score. Formally, we can write the following.

(

)

(

)

, ,

i j i i j i

(2)

Al Lehnen Madison Area Technical College 4/19/2010 Here y is the mean of treatment variation i . i

, 1 i n i j j i i y y n = =

(4)

In equation (3), yi− can be looked upon as a correction term to the grand mean to take account y

of how treatment group i differs from the grand mean, while yi j, − is an additional correction yi

term to take care of variations within treatment group i. If the null hypothesis is true, both of

these terms should be of the same magnitude and both reflect random deviations from the grand mean. From equation (4) within each treatment the sum of the deviations about the treatment mean vanishes.

(

,

)

1 0 i n i j i j y y = − =

(5) Furthermore,

(

)

, 1 , 1 1 1 1 1 1 0 i i i n i j n n k k k k j i i i i i j i i j i i i j y y y n y y n n y n y y n n = = = = = = = − = − ⋅ = − ⋅ = − ⋅ =

∑∑

∑∑

,

where the zero is a consequence of equation (2).

(

)

1 1 1 0 i n k k i i i i j i y y n y y n = = = − = − ⋅ =

∑∑

(6)

So, the sum of the deviations of the treatment means about the grand mean vanishes. This means that the sample means have k – 1 degrees of freedom since only k – 1 of them can be arbitrarily specified given the value of the grand mean.

Consider the total variation of y, the sum of squared deviations of each score about the grand mean. The deviation of any score is given by the following.

(

)

(

)

, ,

i j i j i i

y − =y yy + yy (7)

So the sum of squared deviations or total variation in y is

(

)

2

(

)

2

(

)

(

)

(

)

2 , , , 1 1 1 1 2 i i n n k k i j i j i i i j i i i j i j y y y y y y y y y y = = = = ⎡ ⎤ − = − + − − + − ⎣ ⎦

∑∑

∑∑

(8)

Now from equation (5),

(

)

(

,

)

(

)

(

,

)

1 1 1 1 2 2 0 i i n n k k i i j i i i j i i j i j y y y y y y y y = = = = − − = − − =

∑∑

.

So the total variation in y is made up of two contributions.

(

)

2

(

)

2

(

)

2 , , 1 1 1 1 i i n n k k y i j i j i i Tr i j i j SS y y y y y y SSE SS = = = = ⎡ ⎤ = − = − + − = + ⎣ ⎦

∑∑

∑∑

(9)

(3)

ANOVA Notes Page 3

Al Lehnen Madison Area Technical College 4/19/2010

(

)

2 1 k Tr i i i SS n y y = =

(10)

(

,

)

2 1 1 i n k i j i i j SSE y y = = =

∑∑

(11)

The term SSTris the Treatment sum of squares and represents that part of the total variation in y that is due to the differences between the treatments (factor levels). This variation has k – 1 degrees of freedom. The second term, SSE , is the residual (left over) variation due to differences in scores within the different treatment groups. The i’th treatment level hasni− degrees of 1 freedom due to the constraint imposed by equation (5). The total degrees of freedom of the residual or Error sum of squared deviations is given by

(

)

1 1 1 k k i i i i n n k n k = = − = − = −

(12)

The sum of the degrees of freedom of the treatment sum of squares and the error sum of squares is k− + − = − . This is the degrees of freedom of the total variation1 n k n 1 SSy. This is n – 1 since the grand mean “uses up” a degree of freedom. R. Johnson’s text uses the notation SST for

y

SS and SS(Tr) for SSTr.

If the null hypothesis is indeed true, then the mean squares of treatment and error both estimate the same common population variance of each treatment population. The mean squares are computed as a variation divided by associated degrees of freedom.

The Treatment or Between Mean Square:

(

)

2 1 1 1 k i i i Tr Tr n y y SS MS k k = − = = − −

(13)

The Error or Within Mean Square:

(

)

2 , 1 1 i n k i j i i j y y SSE MSE n k n k = = − = = − −

∑∑

(14)

To test the null hypothesis that all treatment levels have the same mean, we compute the observed Fisher F score. Tr MS F MSE = (15)

For a given level of significance,α, this value is compared against a critical score calculated from an F distribution used to compare two variances obtained from sampling variances from two normally distributed populations. The numerator degrees of freedom is k – 1 and the denominator degrees of freedom is n – k. If F >Fα

(

k−1,n− , the null hypothesis is rejected, while if k

)

(

1,

)

F<Fα kn− we fail to rejectk H . 0

(4)

Al Lehnen Madison Area Technical College 4/19/2010 , 1 i n i i j j T y = =

(16) 2 1 k i i i T A n = =

(18)

( )

2 , 1 i n i i j j B y = =

(17) 1 k i i T T = =

(19) 1 k i i B B = =

(20)

From equations (2) and (4),

i i i T y n = (21) y T n = (22)

Also the i’th sample variance is given by

2 2 1 i i i i i T B n s n − = − . (23) Now,

(

)

2 2 2 2 2 2 2 1 1 1 1 1 2 2 k k k k k i i Tr i i i i i i i i i i i i i i T T T T T SS n y y y y n n y n n n n n n = = = = = ⎛ ⎞ = − + = ⎜ ⎟ − + = − ⎝ ⎠

∑ ∑

, so 2 Tr T SS A n = − . (24) Similarly,

(

)

2 2 2 2 2 2 , , , , 1 1 1 1 1 1 1 1 1 2 2 i i i n n n k k k k k k i i i j i i j i i j i i i i j i i i j i j i i i j i T T SSE y y y y y y T n y n n = = = = = = = = = ⎛ ⎞ = − + = − + ⎜ ⎟ = − ⎝ ⎠

∑∑

∑∑

∑∑

, so SSE= − . (25) B A

Finally, as a check, the total variation must be the sum of treatment and error variations.

(

)

(

)

2 2 2 2 , , , 1 1 1 1 1 1 2 2 i i i n n n k k k y i j i j i j i j i j i j Tr T SS y y y y y y n B n T A B A SS SSE n = = = = = = = − = − + = − ⎛ ⎞ =⎜ − ⎟+ − = + ⎝ ⎠

∑∑

∑∑

∑∑

(5)

ANOVA Notes Page 5

Al Lehnen Madison Area Technical College 4/19/2010

The Treatment or Between Mean Square:

2 1 1 Tr Tr T A SS n MS k k − = = − − (26)

The Error or Within Mean Square: MSE SSE B A

n k n k

= =

− − (27)

So a scheme to calculate a one-way classification analysis of variance is to lay out the data in columns as in a spreadsheet, with each column representing a different treatment or factor level. Leave room between treatments for a second column which is the square of the first column. Thus, each treatment is associated with a pair of columns. Sum the column of scores to obtainT i

and sum the column of squares of scores to obtainBi. Then for each treatment, calculate the

sample mean,yi,

2

i i T

n and the sample variance

2 2 1 i i i i i T B n s n − =

− . Then sum over the treatment groups to obtain T, B, A, and y . Note: The table below should not be interpreted as implying that the number of rows each column is the same. In general, it is not true that n1=n2=n3= nk.

Treatment 1 Treatment 2 Treatment 3 Treatment k

1,1 y 2 1,1 y y2,1 y2,12 y3,1 y3,12 yk,1 yk2,1 1,2 y 2 1,2 y y2,2 y2,22 y3,2 y3,22 yk,2 yk2,2 1,3 y 2 1,3 y y2,3 y2,32 y3,3 y3,32 yk,3 yk2,3 1 1,n y 1 2 1,n y y2,n2 2 2 2,n y y3,n3 3 2 3,n y yk n, k 2 , k k n y --- --- 1 T B1 T2 B2 T3 B3 Tk Bk 1 y 2 1 s 2 y 2 2 s 3 y 2 3 s k y 2 k s

For each pair of columns: ,

1 i n i i j j T y = =

( )

, 2 1 i n i i j j B y = =

Summing over the columns:

2 1 k i i i T A n = =

1 k i i B B = =

1 k i i T T = =

y T n =

At this point we can construct an ANOVA table.

Source Sum of Squares Degrees of Freedom Mean Square Treatment SSTr

=

A – T 2

/

n k - 1 MSTr

=

SSTr

/

( k – 1)

Error SSE = B - A n - k MSE = SSE /( n – k)

Total SSy = B – T 2

/

n n - 1 2 1 y y SS s n = −

(6)

Al Lehnen Madison Area Technical College 4/19/2010

Compute F MSTr MSE

= and compare its value against Fα

(

ν1= −k 1,ν2= − for a stated level of n k

)

significance,α.

If F<Fα

(

k−1,n− , we fail to reject k

)

H : 0 μ1= μ2= μ3=… μk.

If F>Fα

(

k−1,n− , we reject k

)

H and conclude that at least two of the population means are 0 different.

To determine which treatment means are different more testing needs to be done. Some

procedures which are used are Scheffe’s test and Tukey’s test. However, the method we will use is multiple – t testing, also known as the Bonferroni procedure. There are

(

1

)

2 2 k k k − ⎛ ⎞ = ⎜ ⎟ ⎝ ⎠ distinct

pairs of treatments which can be picked from the k different treatment groups. For each such pair an observed t - score is calculated. If this observed score is larger in absolute value than a critical t – score, then the null hypothesis that asserts the equality of the two population means associated with the two treatments is rejected. Specifically, let and m stand for the treatment group indices of the two treatments being examined. Then,

obs( , ) 1 1 m p m y y t m s n n − = + (28)

is the by now familiar observed t - score for comparing two independent sample means. The absolute value is used since this is a two-sided test for a difference between two populations. Since we have already assumed that the k treatment populations have measurements which are normally distributed with a common variance, we use the pooled variance which best estimates this common within treatment group variance, namely MSE.

2 p B A s MSE n k − = = − . (29) Hence, our observed t – score is computed from

obs( , ) 1 1 m m y y t m MSE n n − = ⎛ ⎞ + ⎜ ⎟ ⎝ ⎠ . (30)

Now, one would expect that thistobs( , )m would be compared against a critical t – score /2

c

t =tα , where αis the level of significance. However, we wantαto be the maximum probability of a Type I error for the entire sequence of

(

1

)

2 2

k k k

m=⎛ ⎞⎜ ⎟= −

⎝ ⎠ comparisons. If each comparison were made at a level of significance of α, then the probability of a Type I error “somewhere” in the m comparisons made would be1− −

(

1 α

)

m. This probability is larger than

α and considerably larger if m is large. To have an actual level of significance of α, we will therefore use tc=tα/(2 )m , so that the probability of a Type I error in the m comparisons becomes

(7)

ANOVA Notes Page 7

Al Lehnen Madison Area Technical College 4/19/2010

1 1 m m α α ⎛ ⎞ − −

⎝ ⎠ . Finally, when we find t , we need to know the degrees of freedom. Since we c are using MSE fors , the degrees of freedom is2p ν2= − . n k

Thus, in summary we have /(2 )

c m

t =tα with n – k degrees of freedom (31)

Consider the following example. Suppose there are five different treatments, i.e., k = 5, and suppose we wish to work at a level of significance of 10% , i,e,.α=0.10. Then the number of pair wise comparisons is 5 5 4

( )

10

2 2

m=⎛ ⎞⎜ ⎟= =

⎝ ⎠ . Suppose that the total number of measurements over the five treatments is 26, i.e., n = 26. Thus, from equation (31), tc =tα/(2 )m =t.10/20=t.005

with 26 – 5 = 21 degrees of freedom. Hence, tc=2.831. Now to facilitate the 10 different comparisons, it helps to summarize the results in tabular form. Each open cell in the following table represents one of the 10 possible comparisons.

Treatment 1 2 3 4

2 tobs(1, 2)

3 tobs(1,3) tobs(2,3)

4 tobs(1, 4) tobs(2, 4) tobs(3, 4)

5 tobs(1,5) tobs(2,5) tobs(3,5) tobs(4,5)

In these cells the computed value of tobsfor the two treatments being compared is displayed. Whenever this value of tobsexceeds 2.831 one can conclude at a 10% level of significance that the two population means are different.

References

Related documents

Biological control is the use of living organisms, such as predators, parasitoids, and pathogens, to control pest insects, weeds, or diseases.. Other items addressed

Written in the old testament break bread represented his body which we bless you, although not know his son.. Direction to the lost: did so he was made

Visual symptoms analysis showed correlation of double vision with total coma and with horizontal coma for the 5-mm and 7-mm pupil size; correlation between starburst and total coma

Field experiments were conducted at Ebonyi State University Research Farm during 2009 and 2010 farming seasons to evaluate the effect of intercropping maize with

The vaccination coverage among age groups in Table 1 underlines that the elderly are the most vaccinated group with a coverage of 17.4% for the 65-84-year-olds and 30.0% for the

OCCURRENCE: Campanian of Madagascar; Lower Campanian of Spain; the specimens from Hungary are Early Campanian in age (they were collected in Siimeg area,

 Inclusion of J Elgin Family Trust - Accepted husband's evidence he played little role in the trust, and as such this was excluded as property of the husband and an order made

Source: Enterprise Architecture as Strategy: Creating a Foundation for Business Execution , J. Example Management and Governance Activities.. Stage Name Business Silos Standardised