• No results found

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

N/A
N/A
Protected

Academic year: 2021

Share "TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2"

Copied!
9
0
0

Loading.... (view fulltext now)

Full text

(1)

About Chi Squares

(2)

TABLE OF CONTENTS

About Chi Squares ... 1

What is a CHI SQUARE? ... 1

Chi Squares... 1

Goodness of fit test (One-way χ2 ) ... 1

Test of Independence (Two-way χ2 ) ... 2

Hypothesis Testing with Chi Squares ... 2

STEP 1: Establish the hypotheses ... 2

STEP 2: Calculate the Chi-square ... 2

STEP 3: Assess the significance level ... 3

STEP 4: Make a decision ... 4

Post hoc Testing ... 4

METHOD 1 ... 4

METHOD 2 ... 5

Glossary ... 6

References ... 7

(3)

1

About Chi Squares

What is a CHI SQUARE?

● A chi square is a non-parametric test used to explore the relationships between categorical variables.

● Parametric statistics are concerned with interval or ratio data. Sometimes, however, the researcher wishes to use nominal or ordinal to examine the frequency of certain categories. As you might expect, these types of analyses are called non-parametric statistics.

Chi Squares

● Sometimes researchers are interested in determining differences in the frequency of events; however, we are limited in what we can do statistically with categorical data because we are unable to calculate the mean (which is a foundation of many statistical tests).

● The Chi square statistic allows us to test hypotheses about categorical variables by examining if there is a relationship between 2 or more categorical variables.

● Chi squares are most often used to test the relationship between 2 variables.

They can also be used to make a multivariate assessment of the relationship between 3 or more variables.

Goodness of fit test (One-way χ2)

● In this case, only 1 variable is being examined.

Goodness of fit tests are often used for examining fairness in games of chance, such as cards, roulette and dice games.

(4)

2

Test of Independence (Two-way χ2 )

● Two variables with at least two levels each are being examined.

● A level is a subcategory of the variable.

● The goal of this test is to learn if the variables are independent of each other.

Hypothesis Testing with Chi Squares

STEP 1: Establish the hypotheses

● The hypotheses help state the purpose of the study, by indicating what it is we are trying to conclude from conducting the particular study.

● The null hypothesis is that the two variables are independent: H0 = 0

● The alternative hypothesis is that the two variables are related: H1 ≠ 0

STEP 2: Calculate the Chi-square

● There are 3 things that need to be known before the Chi square can be calculated:

The first is the observed count. This is the observed frequency of each variable.

The second is the expected count. This is the value that we would expect to find if there was no error in our study.

It is calculated with:

E = (row total) x (column total) table total

(5)

3

Finally, we calculate the Chi square (χ ).

It is calculated with:

χ2 = Σ (O – E)2 E

● It is easier to conceptualize these 3 steps in a chart :

The cell in the bottom right is the one that gives us the value of χ2.

O E (O-E)2 (O-E)2/E

Σ(O - E)2 / E = χ2

STEP 3: Assess the significance level

● In order to conclude if the relationship we find between the two variables is statistically significant, we have to set the alpha level.

If the value that we calculated is higher than the alpha level then we can conclude that the relationship is statistically significant.

● Usually the alpha level is set at α = .05, which means that there is less than a 5% possibility that the result occurred by chance.

The alpha level and the degrees of freedom allow us to find the critical value from a statistical table, and we compare this number to the observed value in order to determine statistical significance.

● Degrees of freedom for goodness of fit tests are calculated with:

df = k – 1, where k = number of levels

Degrees of freedom for tests of independence are calculated with:

df = (number of rows – 1)(number of columns – 1)

(6)

4

STEP 4: Make a decision

● If the obtained χ2 value is larger than the critical χ2 value, then we can conclude that there is a significant relationship between the variables.

● If the obtained χ2 value is smaller than the critical χ2 value, then we can conclude that there is no relationship between the variables.

Post hoc Testing

● When you conduct a Chi-square test of independence with variables that have more than 2 levels and find a significant result, post hoc tests need to be performed in order to determine where the significance lies.

Post hoc means “after the fact.” A post hoc test is performed after a significant result is found.

The post hoc test compares each of the levels and allows you to determine which pairs of cells (levels) are significantly different.

METHOD 1

● This involves simply looking at the table.

By examining the residual levels (the observed level – the expected level), you can make a judgment on which pairs of cells are most significant.

The percentage of contribution can be calculated and compared to see which pairs contribute the most to the χ2 value.

The percentage is calculated with:

% = [(O - E)2 / χ2] x 100 E

(7)

5

● This is not the most accurate method of post hoc testing because it is based mostly on visual interpretation.

METHOD 2

● This involves extracting a 2x2 table of interest from the original table.

It is easy to collapse ordinal variables into a dichotomy because the data is rank ordered. This is much more difficult for nominal variables because they are distinct and unordered categories.

● NOTE: you are not removing data, but rather combining the cells in a meaningful way.

● This will produce a new χ2 value that might be a significant result. We need to recognize that this value is part of the larger table and that it doesn’t relate directly to the original analysis. This inflates Type I error.

● In order to counter the inflated Type I error we need to adopt a conservative alpha level by performing a Bonferroni adjustment (k):

k = r! x c!

2!(r-1)! 2!(c-1)!

Where r is the number of rows, c is the number of columns and ! refers to a factorial, which is a series of multiplications. For example, 2! = (2)(1), 3! = (3)(2)(1), 4! = (4)(3)(2)(1), etc.

Next, we divide our original alpha level by k to determine the conservative alpha level. We compare our new χ2 value to the critical value at this alpha level to determine if the result is significant.

(8)

6

Glossary

Bonferroni adjustment:

a statistical procedure that corrects for Type I error inflation when the cells in a chi-square table have been collapsed.

Categorical variable:

one that we can place into categories, but these categories may not have any logical ordering (Nominal or ordinal).

Expected count:

the expected frequency.

Non-parametric test:

a type of test that does not assume that the variables follow a specific distribution, like a normal distribution.

Observed count:

the recorded frequency.

Post hoc test:

a test that is performed after a significant result is found. When there are more than two levels, this test shows which level is significant.

Type I error:

occurs when you reject the null hypothesis even though it is true. In other words, it is concluding that there is a significant relationship when there actually isn’t one.

(9)

7

References

Cohen, R. J., & Swerdlik, M. E. (2005). Psychological testing and assessment: An

introduction to tests and measurements. (6th ed.) NY: McGraw-Hill.

Gardner, R. C. (2001). Psychological statistics using SPSS for Windows. New

Jersey: Prentice Hall.

Hamilton, L. C. (1996). Data analysis for social scientists. Scarborough: Duxbury

Press.

Quantitative Methods in Social Sciences e-lessons. (n.d.). The chi square.

Retrieved on May 30, 2007 from

http://ccnmtl.columbia.edu/projects/qmss/chi_test.htmlm

Utts, J. (2005). Seeing through the statistics. (3rd ed.). Toronto: Thomson

Brooks/Cole.

Wilson, J. H. (2005). Essential Statistics. New Jersey: Pearson Prentice Hall.

References

Related documents

Our geological investigations on the islands of Sulawesi, Seram, Ambon, Timor, Flores, Sumba, Kur, Kai and Irian Jaya and coupled with studies in the adjacent Celebes Sea,

D Data processing control unit, 114 Dead volume, 58 Detrimental volume, 59 Diesel aircraft engines, 9 Diesel-and-water emulsions, 33 Dimensions of diesel fuel-injection technology,

The results of this research are expected to be used by the government in improving inclusive policies and practices which determine that adequate education services are received

Confederation, a National Federation, or any other Signatory, Signatory’s member organisation, or a club or other member organisation of any Continental Confederation or

It takes a minute to write a safety rule It takes a hour to hold a safety meeting It takes a week to plan a safety program It takes month to put it into operation It takes a year

Metronome device-guided breathing (part II, and part III of protocol 1, Table 1 above) produced increases in the amplitude of the oscillations of heart rate, also known as heart

straightforward estimation strategy is to directly map the coefficients of a model into the objective function values, and then to evolve the coeffi- cients according to a