• No results found

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

N/A
N/A
Protected

Academic year: 2021

Share "Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers"

Copied!
8
0
0

Loading.... (view fulltext now)

Full text

(1)

1

Correlation

Greg C Elvers

_______________________________________

_______________________________________

_______________________________________

_______________________________________

2

What Is Correlation?

Correlation is a descriptive statistic that tells you if two variables are related to each other

E.g. Is your GPA related to how much you study?

When two variables are correlated, knowing the value of one variable allows you to predict the value of the other variable

_______________________________________

_______________________________________

_______________________________________

_______________________________________

3

Perfect Correlation

When two variables are perfectly correlated, knowing the value of one variable allows you to exactly predict the value of the other variable

_______________________________________

_______________________________________

_______________________________________

_______________________________________

4

Perfect Correlation

For example, if the only thing that determined your GPA was the amount of time that you studied, then the two would be perfectly correlated

0 1 2 3 4

0 5 10

Study Hours per Day

GP A

If you know the value of one variable, you can exactly determine the value of the other variable

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(2)

5

Perfect Correlation

That is, all the variability in one variable is explained by the variability in the other variable

0 1 2 3 4

0 5 10

Study Hours per Day

GP A

_______________________________________

_______________________________________

_______________________________________

_______________________________________

6

Perfect Correlations

Few, if any, psychological variables are perfectly correlated with each other Many non-psychological variables do have a perfect correlation

E.g. Time since the beginning of class and the time remaining in the class are perfectly correlated

What are other examples of perfectly correlated variables?

_______________________________________

_______________________________________

_______________________________________

_______________________________________

7

Less Than Perfect Correlations

Even if two variables are correlated, most of the time you cannot perfectly predict the value of one variable given the other

E.g., other variables besides amount of time spent studying influence your GPA

Some of the variability is people’s GPA is due to the amount of time spent studying, but not all the variability is due to it

_______________________________________

_______________________________________

_______________________________________

_______________________________________

8

Less Than Perfect Correlations

Study

Time IQ GPA

3 80 2.0

3 100 2.5

3 120 3.0

5 80 2.5

5 100 3.0

5 120 3.5 0

1 2 3 4

0 5 10

Study Hours per Day

GP A

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(3)

9

Less Than Perfect Correlations

With a less than perfect correlation, we can no longer perfectly predict the value of one variable given the other variable

We cannot explain all the variability in one variable with the variability in the other variable

_______________________________________

_______________________________________

_______________________________________

_______________________________________

10

The Correlation Coefficient

Correlation coefficients tell us how

perfectly two (or more) variables are related to each other

They can also be used to determine how much variability in one variable is explainable by variation in the other variable.

_______________________________________

_______________________________________

_______________________________________

_______________________________________

11

Pearson’s Product Moment Correlation Coefficient

Pearson’s product moment correlation coefficient, or Pearson’s r, for short is a very common measure of how strongly two variables are related to each other

Pearson’s r must lie in the range of -1 to +1 inclusive

_______________________________________

_______________________________________

_______________________________________

_______________________________________

12

Interpretation of Pearson’s r

To interpret Pearson’s r, you must consider two parts of it:

The sign of r

The magnitude, or absolute value of r

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(4)

13

The Sign of r

When r is greater than 0 (I.e. its sign is positive) the variables are said to have a direct relation

0 1 2 3 4

0 2 4 6

Study Hours per Night

GP A

In a direct relation, as the value of one variable increases, the value of the other variable also tends to increase

_______________________________________

_______________________________________

_______________________________________

_______________________________________

14

The Sign of r

When r is less than 0 (i.e., its sign is negative) the variables are said to have an indirect relation

0 1 2 3 4

0 2 4 6

Hours of TV per Night

GP A

In an indirect relation, as the value of one variable increases, the value of the other variable tends to decrease

_______________________________________

_______________________________________

_______________________________________

_______________________________________

15

Is the Sign of r + or -?

As the number of cigarettes smoked per day increases, GPA tends to decrease

As the number of cats in a farm yard increases, the number of mice tends to decrease

As the weight of a cat increases, the length of its whiskers tends to increase

_______________________________________

_______________________________________

_______________________________________

_______________________________________

16

Is the Sign of r + or -?

Create two examples of correlations and determine if the sign of r is positive or negative

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(5)

17

The Magnitude of r

The magnitude refers to the size of the correlation coefficient ignoring the sign of r The magnitude is equivalent to taking the absolute value of r

The larger the magnitude of r is, the more perfectly the two variables are related to each other

The smaller the magnitude of r is, the less perfectly the two variables are related to each other

_______________________________________

_______________________________________

_______________________________________

_______________________________________

18

r = 1

When r equals 1.0, there is a perfect correlation between the variables

0.0 0.2 0.4 0.6 0.8 1.0

0 10 20 30 40 50 60 Minutes

Ho urs

Knowing the value of one variable exactly predicts the value of the other variable

_______________________________________

_______________________________________

_______________________________________

_______________________________________

19

r = 0

When r equals 0, either the assumptions of correlation have been violated or there is no relation between the two variables

-10 -5 0 5 10

-10 -5 0 5 10

The points in a scatter plot with r = 0 will tend to form a circular cluster

_______________________________________

_______________________________________

_______________________________________

_______________________________________

20

0 < | r | < 1

The larger the magnitude or r is, the more the scatter plot’s points will tend to cluster tightly about a line

0 < | r | < 1

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(6)

21

Magnitude of r

Cohen (1988) recommends the following values of r for “small”,

“medium”, and “large”

effects

.50 to 1.00 -1.00 to -.50

Large

.30 to .49 -.49 to -.30

Medium

.10 to .29 -.29 to -.10

Small

Positive Negative

Correlation

_______________________________________

_______________________________________

_______________________________________

_______________________________________

22

Magnitude of r

List a couple of pairs of variables and guess whether the magnitude of r is closer to 0 or closer to 1

_______________________________________

_______________________________________

_______________________________________

_______________________________________

23

Pearson’s r

Pearson’s r makes several assumptions about the data

When these assumptions are violated, r must be interpreted with extreme caution Assumptions:

Linear relation Non-truncated range Sufficiently large sample size

_______________________________________

_______________________________________

_______________________________________

_______________________________________

24

Linear Relation

Pearson’s r, in its simplest form, only works for variables that are linearly related

That is, the equation that allows us to predict the value of one variable from the value of the other is a line:

Y = slope * X + intercept

Always look at the scatter plot to determine if the two variables are approximately linearly related

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(7)

25

Linear Relation

If the variables are not linearly related, Pearson’s r will indicate a smaller relation than actually exists

Often, non-linear relations can be transformed into linear ones by taking the appropriate mathematical transformation

_______________________________________

_______________________________________

_______________________________________

_______________________________________

26

Square Root of Y Transformation

0 10 20 30 40 50 60

0 2 4 6 8

0 1 2 3 4 5 6 7 8

0 2 4 6 8

_______________________________________

_______________________________________

_______________________________________

_______________________________________

27

Non-Truncated Range

A truncated range occurs when the range of one of the variables is very small

When the range is truncated, Pearson’s r will indicate a smaller relation between the variables than what actually exists

Once a range truncation occurs, there is little that you can do; be careful not to design studies that will lead to a truncated range

_______________________________________

_______________________________________

_______________________________________

_______________________________________

28

Truncated Range

A linear relation clearly exists in this data

1.0 1.5 2.0 2.5 3.0 3.5 4.0

1.0 1.5 2.0 2.5 3.0 3.5 4.0

High School GPA

CollegeGPA

Consider only the data in the square (thereby truncating the range) Is the linear relation as clear as it was?

No

_______________________________________

_______________________________________

_______________________________________

_______________________________________

(8)

29

Sample Size

If the size of the sample is too small, relations can appear due to chance

These relations disappear when a larger sample is considered

Too large of a sample can make near 0 correlations statistically significant, even though they have very little explanatory power

_______________________________________

_______________________________________

_______________________________________

_______________________________________

30

Sample Size

The magnitude of r does not depend on sample size

The likelihood of finding a statistically significant r does depend on sample size The sample should be large enough to generalize to the population of interest

_______________________________________

_______________________________________

_______________________________________

_______________________________________

References

Related documents

The escalation of the Eurozone crisis in 2010 fixed attention on the design failures of the eurozone and the practicalities of having a monetary union without

Overall, as in Chang and McAleer (2014) for the top 500 journals chosen on the basis of 2YIF in both the Sciences and Social Sciences, the low correlations of the 2YIF Ranks and

Beginning at 15th level, you may force a humanoid damaged by your Dark Strike to make a Constitution saving throw (DC 8 + your proficiency bonus + your Charisma bonus) or be

NOTE: Regardless of the child’s age, depending on the location, severity and multiplicity of the injuries (cuts, bruises and/or welts), the case may be a Priority I. The

Keeping in view the demand for promotion of Urdu, the Government of India appointed a Committee for Promotion of Urdu in the year 1972 under the Chairmanship of

Additional file 6: Data Set (Team Emergency Assessment Measure scores of novices and experts rating 7 teams rotating through 6 cases each).. (XLSX

While these methods can be useful in certain limited comparisons or in generating groupings among trees, we find that they either do not scale well to comparisons on large data

Since the number of units produced decreased in the second year, while the fixed manufacturing overhead remained the same, the absorption unit cost is now higher... Absorption