• No results found

PEARSON R CORRELATION COEFFICIENT

N/A
N/A
Protected

Academic year: 2021

Share "PEARSON R CORRELATION COEFFICIENT"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

PE

ARSON R

C

ORRELATION

COEFFICIENT

Intr

oduction:

Sometimes in scientific data, it appears that two variables are

connected in such a way that when one variable changes, the other variable changes also. This connection is called a correlation.

Examples of this type of correlation include: (1) in deer populations, large males seem to have more successful matings; and (2) larger numbers of birds seem to nest in areas with dense vegetation.

Scientists measure the strength of a relationship between two variables by calculating a correlation coefficient. The value of the correlation coefficient indicates to what extent the change found in one variable relates to change in another. There are several types of correlation coefficients, but the one that is most widely used is called the Pearson Product-Moment Correlation Coefficient, or simply, the Pearson r.

Stu

dent Procedure

Example: Your students have done some classroom research on

amphibian species found in your area and have discovered that the red-backed salamander uses fallen logs and debris on the forest floor for their home. During their earlier census of their Biodiversity Plot, they have noticed that some quadrats have many fallen logs whereas other quadrats have few or none. They expect that they would find more red-backed salamanders in those quadrats with many fallen logs and design an experiment to test this hypothesis. This experiment measures: (1) the number of fallen logs in each quadrat; and (2) the number of red-backed salamanders in each quadrat. This is a table of the data your class has collected:

(2)

Q u a d r a t N u m b e r # F a l l e n L o g s # S a l a m a n d e r s

1 4 3

2 6 2

3 2 0

4 3 1

5 7 3

6 1 0

7 0 0

8 2 0

9 0 0

1 0 3 1

1 1 2 1

1 2 4 2

1 3 0 0

1 4 2 1

1 5 2 0

1 6 1 0

1 7 3 1

1 8 5 3

1 9 3 1

2 0 2 2

2 1 2 1

2 2 0 0

2 3 0 0

2 4 2 0

2 5 1 0

Step 1:

Graphing a the data

Graph your data by computer, or by hand, by assigning the number of fallen logs (the second column) as the X-axis and the number of salamanders (the last column) as the Y-axis. For example, in

Quadrat 1, the X-value would be 4 and the Y-value would be 3. The results, when we plot all 25 points on the graph, look like this:

RELATIONSHIP OF FALLEN LOGS AND SALAMANDERS

0 0.5 1 1.5 2 2.5 3 3.5

0 1 2 3 4 5 6 7 8

(3)

Looking at this graph, there seems to be a positive relationship between the number of fallen logs and the number of salamanders. In other words, it appears that when the number of fallen logs

increases, the number of Red-Backed Salamanders also increases.

• The lowest value that the Pearson r can have is r = 0.00. This means there is ZERO correlation, and would indicate that X and Y are not related to one another.

• The highest value that the Pearson r can have is r = 1.00. This indicates a PERFECT correlation and would indicate that X and Y are completely related to one another in the sample.

• Pearson r values can be either positive or negative. A positive value indicates that increases in X correspond to increases in Y. A negative value indicates that increases in one variable are

associated with decreases in the other variable.

Some things to remember about the Pearson r correlation:

The following graphs illustrate some of the various types of correlations possible:

a. This is an example of a perfect, positive correlation, in that the data shows no deviation from a straight line.

b. This is an example of a perfect, negative correlation, in that the data shows no

deviation from a straight line.

c. This is an example of a high, positive correlation. Since the data shows some variability, a perfect prediction cannot be made.

d. This is an example of a high, positive correlation. Since the data shows some variability, a perfect prediction cannot be made.

e. This shows a low correlation. Although predictions could be made, and those predictions would be slightly better than chance, estimates would still be imprecise. f. This figure shows a zero correlation. Prediction would be no better than chance.

(4)

The graph below was produced by Microsoft Excel (charting

function) which calculated a correlation coefficient from the data in our example. The graph shows a trend indicating an increase in salamanders where there are more fallen logs present. Note, however, that the value calculated by this program is the Pearson r value squared. You must take the square root of this figure to give the Pearson r value. From the graph: R2 = 0.72; Pearson r = 0.85.

Because 0.85 is close to 1.0 (the maximum value for the Pearson r), this demonstrates a strong, positive correlation.

RELATIONSHIP OF FALLEN LOGS AND SALAMANDERS

R2 = 0.7175 Pearson r = 0.85

-0.5 0 0.5 1 1.5 2 2.5 3 3.5

0 1 2 3 4 5 6 7 8

Number of Fallen Logs

Number of Salamanders

If not using the Excel Software, or other graphing program, you can calculate the Pearson r by using the following formula:

FORMULA FOR CALCULATING THE PEARSON R

CORRELATION COEFFICIENT

Pearson r = N

(

XY

)

( )

X

( )

Y

N

(

X2

)

X

( )

2

[

]

N

(

Y2

)

Y

( )

2

[

]

Step 3:

Calculating the Pearson r Correlation

(5)

This formula looks complicated, but can be simplified by breaking it into its separate components. Using your original data, create the following table:

Q uadrat Num ber

(N)

# Fallen Logs

(X)

# Salam anders

(Y) X

2

Y2 XY

1 4 3 16 9 12 2 6 2 36 4 12 3 2 0 4 0 0 4 3 1 9 1 3 5 7 3 49 9 21 6 1 0 1 0 0 7 0 0 0 0 0 8 2 0 4 0 0 9 0 0 0 0 0 10 3 1 9 1 3 11 2 1 4 1 2 12 4 2 16 4 8 13 0 0 0 0 0 14 2 1 4 1 2 15 2 0 4 0 0 16 1 0 1 0 0 17 3 1 9 1 3 18 5 3 25 9 15 19 3 1 9 1 3 20 2 2 4 4 4 21 2 1 4 1 2 22 0 0 0 0 0 23 0 0 0 0 0 24 2 0 4 0 0 25 1 0 1 0 0

ΣN = 25 ΣX = 25 ΣY = 25 ΣX2 = 25

ΣY2 = 25

ΣX Y = 25

Using the values from the new table, complete the Pearson r formula:

Pearson r =

N

(

XY

)

( )

X

( )

Y

N

(

X2

)

( )

X 2

(6)

The numerator, or top of the formula, looks like this once we plug in all the numbers:

Pearson r =

( )

25

( )

90 −

( )

57

( )

22

25 213

( )

( )

57 2

[

]

[

25 46

( )

( )

22 2

]

Pearson r = 2250−1254

5325−3249

[

]

[

1150−484

]

Pearson r = 996

2076

[

]

[ ]

666

Pearson r = 996

1382616

Pearson r = 996

1175.85

Pearson r =0.8471

Again, this Pearson r correlation coefficient (being extremely close to the maximum value 1.0) demonstrates a strong positive correlation between the number of fallen logs and the number of salamanders.

(7)

St

ep 4:

Determine if your calculations have

statistical significance

You must determine whether or not your calculations have statistical significance. To do this you must determine the ‘critical value’ for your Pearson r correllation coefficient by using the following table:

So for our example:

1. Calculate the degrees of freedom (DF) by subtracting the 2 from the number of comparisons you are making (DF = N - 2)

In our case, we are sampling fallen logs and salamanders in 25 quadrats (N = 25)

DF = 25 - 2 = 23

2. Find your DF on the table below and find the critical value allowed.

In our case, the nearest DF listed is 25 with a critical value of 0.3233. Our calculated Pearson r correlation coefficient is 0.8471.

3. The calculated figure is greater than the critical value from the table; our findings have statistical significance. Therefore, we can assume that our hypothesis is true and that there is a strong positive correlation betwen the number of fallen logs and the number of salamanders and this correlation is not due to chance.

(8)

CRITICAL VALUES FOR THE PEARSON R

CORRELLATION COEFFICIENT

DF

(N - 2)

Critical

Value

(5% certainty)

1 .98769

2 .90000

3 .8054

4 .7293

5 .6694

6 .6215

7 .5822

8 .5494

9 .5214

10 .4973

11 .4762

12 .4575

13 .4409

14 .4259

15 .4124

16 .4000

17 .3887

18 .3783

19 .3687

20 .3598

25 .3233

30 .2960

35 .2746

40 .2573

45 .2428

50 .2306

60 .2108

70 .1954

80 .1829

90 .1726

References

Related documents

By providing user interfaces that are designed around tools with which your staff is already familiar it is possible to improve the productivity of staff, reduce the cost of

“The NIJ Ballistic Resistance of Body Armor Standard and NIJ Ballistic Body Armor Compliance Testing Program (“NIJ Body Armor CTP”) documents (including forms, applications and

This document is SS-AGR-105, one of a series of the Agronomy Department, Florida Cooperative Extension Service, Institute of Food and Agricultural Sciences, University of

Frequencies and age-adjusted percentages (with standard errors) of ever having been told of having a learning disability or attention deficit hyperactivity disorder

27th January, Purchasing Information Systems (Sourcing Software Suites to Support the Procurement Function).. • Date: 10th

not only has this ensured a breadth of perspective, but these organisations were also more likely to have the type of organised talent development activity we wanted to focus on:

President of the Cataract Power and Conduit Company and served as an organizer and president of the Thomson-Houston Electric Light and Power Company before it was absorbed by

To generate a complex-valued stationary Gaussian sequence with given complex- valued covariance function, one can obviously simulate the corresponding real-valued bivariate