Correlation What Is Correlation? Perfect Correlation
[Pages:8]Correlation
Greg C Elvers
1
What Is Correlation?
Correlation is a descriptive statistic that tells you if two variables are related to each other
E.g. Is your GPA related to how much you study?
When two variables are correlated, knowing the value of one variable allows you to predict the value of the other variable
2
Perfect Correlation
When two variables are perfectly correlated, knowing the value of one variable allows you to exactly predict the value of the other variable
3
Perfect Correlation
For example, if the only thing that determined your GPA was the amount of time that you studied, then the two would be perfectly correlated
If you know the value of one variable, you can exactly determine the value of the other variable
GPA
4
3
2
1
0
0
5
10
Study Hours per Day
4
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
Perfect Correlation
That is, all the variability in one variable is explained by the variability in the other variable
GPA
4
3
2
1
0
0
5
10
Study Hours per Day
5
Perfect Correlations
Few, if any, psychological variables are perfectly correlated with each other
Many non-psychological variables do have a perfect correlation
E.g. Time since the beginning of class and the time remaining in the class are perfectly correlated
What are other examples of perfectly
correlated variables?
6
Less Than Perfect Correlations
Even if two variables are correlated, most of the time you cannot perfectly predict the value of one variable given the other
E.g., other variables besides amount of time spent studying influence your GPA Some of the variability is people's GPA is due to the amount of time spent studying, but not all the variability is due to it
7
Less Than Perfect Correlations
Study Time IQ
3 80 3 100 3 120 5 80 5 100 5 120
GPA 2.0 2.5 3.0 2.5 3.0 3.5
GPA
4
3
2
1
0
0
5
10
Study Hours per Day
8
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
Less Than Perfect Correlations
With a less than perfect correlation, we can no longer perfectly predict the value of one variable given the other variable We cannot explain all the variability in one variable with the variability in the other variable
9
The Correlation Coefficient
Correlation coefficients tell us how perfectly two (or more) variables are related to each other They can also be used to determine how much variability in one variable is explainable by variation in the other variable.
10
Pearson's Product Moment Correlation Coefficient
Pearson's product moment correlation coefficient, or Pearson's r, for short is a very common measure of how strongly two variables are related to each other Pearson's r must lie in the range of -1 to +1 inclusive
11
Interpretation of Pearson's r
To interpret Pearson's r, you must consider two parts of it:
The sign of r The magnitude, or absolute value of r
12
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
The Sign of r
When r is greater than 0 (I.e. its sign is positive) the variables are said to have a direct relation In a direct relation, as the value of one variable increases, the value of the other variable also tends to increase
GPA
4
3
2
1
0
0
2
4
6
Study Hours per Night
13
The Sign of r
When r is less than 0 (i.e., its sign is negative) the variables are said to have an indirect relation
In an indirect relation, as the value of one variable increases, the value of the other variable tends to decrease
GPA
4
3
2
1
0
0
2
4
6
Hours of TV per Night
14
Is the Sign of r + or -?
As the number of cigarettes smoked per day increases, GPA tends to decrease As the number of cats in a farm yard increases, the number of mice tends to decrease As the weight of a cat increases, the length of its whiskers tends to increase
15
Is the Sign of r + or -?
Create two examples of correlations and determine if the sign of r is positive or negative
16
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
The Magnitude of r
The magnitude refers to the size of the correlation coefficient ignoring the sign of r The magnitude is equivalent to taking the absolute value of r The larger the magnitude of r is, the more perfectly the two variables are related to each other The smaller the magnitude of r is, the less perfectly the two variables are related to
17
each other
r = 1
When r equals 1.0, there is a perfect correlation between the variables Knowing the value of one variable exactly predicts the value of the other variable
Hours
1.0
0.8
0.6
0.4
0.2
0.0 0
10 20 30 40 50 60 Minutes
18
r = 0
When r equals 0, either the assumptions of correlation have been violated or there is no relation between the two variables
-10
The points in a scatter plot with r = 0 will tend to form a circular cluster
10
5
0
-5
0
-5
-10
5
10
19
0 < | r | < 1
The larger the magnitude or r is, the more the scatter plot's points will tend to cluster tightly about a line
0 < | r | < 1
20
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
Magnitude of r
Cohen (1988) recommends the
Correlation Negative
following values of r
for "small",
Small
-.29 to -.10
"medium", and "large"
effects
Medium -.49 to -.30
Positive .10 to .29 .30 to .49
Large
-1.00 to -.50 .50 to 1.00
21
Magnitude of r
List a couple of pairs of variables and guess whether the magnitude of r is closer to 0 or closer to 1
22
Pearson's r
Pearson's r makes several assumptions about the data When these assumptions are violated, r must be interpreted with extreme caution Assumptions:
Linear relation Non-truncated range Sufficiently large sample size
23
Linear Relation
Pearson's r, in its simplest form, only works for variables that are linearly related
That is, the equation that allows us to predict the value of one variable from the value of the other is a line: Y = slope * X + intercept Always look at the scatter plot to determine if the two variables are approximately linearly related
24
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
Linear Relation
If the variables are not linearly related, Pearson's r will indicate a smaller relation than actually exists Often, non-linear relations can be transformed into linear ones by taking the appropriate mathematical transformation
25
Square Root of Y Transformation
60
8
50
7
6
40
5
30
4
3 20
2 10
1
0
0
0
2
4
6
8
0
2
4
6
8
26
Non-Truncated Range
A truncated range occurs when the range of one of the variables is very small
When the range is truncated, Pearson's r will indicate a smaller relation between the variables than what actually exists
Once a range truncation occurs, there is
little that you can do; be careful not to
design studies that will lead to a truncated
range
27
Truncated Range
A linear relation clearly exists in this data Consider only the data in the square (thereby truncating the range) Is the linear relation as clear as it was? No
CollegeGPA
4.0
3.5
3.0
2.5
2.0
1.5
1.0
1.0
1.5
2.0
2.5
3.0
3.5
4.0
High School GPA
28
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
Sample Size
If the size of the sample is too small, relations can appear due to chance
These relations disappear when a larger sample is considered
Too large of a sample can make near 0 correlations statistically significant, even though they have very little explanatory power
29
Sample Size
The magnitude of r does not depend on sample size The likelihood of finding a statistically significant r does depend on sample size The sample should be large enough to generalize to the population of interest
30
_______________________________________ _______________________________________ _______________________________________ _______________________________________
_______________________________________ _______________________________________ _______________________________________ _______________________________________
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- 9 next of kin general order of distribution
- what is a clearinghouse for medical claims and what do
- correlation what is correlation perfect correlation
- chapter 5 military customs courtesies general
- frequently asked questions miscellaneous
- permutations and combinations
- satellite accumulation of hazardous waste by generators
- multivariate analysis of variance manova
- storage and handling of gas cylinders guidelines
- defining the humanities
Related searches
- what is correlation coefficient
- what is a good correlation coefficient
- what is correlation coefficient pdf
- what is correlation coefficient r2
- what is a medium correlation coefficient
- what is a negative correlation in psychology
- what is correlation in stats
- what is a positive correlation in psychology
- what is a perfect society
- what is correlation and regression
- what is correlation analysis
- what is past perfect tense