ANOVA Examples STAT 314
ANOVA Examples
1.
STAT 314
If we define s = MSE , then of which parameter is s an estimate?
If we define s = MSE , then s is an estimate of the common population standard deviation,
¦Ò, of the populations under consideration. (This presumes, of course, that the equalstandard-deviations assumption holds.)
2.
Explain the reason for the word variance in the phrase analysis of variance.
The reason for the word variance in the phrase analysis of variance is because the analysis
of variance procedure for comparing means involves analyzing the variation in the sample
data.
3.
The null and alternative hypotheses for a one-way ANOVA test are¡
H 0 : ?1 = ?2 = L = ?k
H a : Not all means are equal.
Suppose in reality that the null hypothesis is false. Does this mean that no two of the populations
have the same mean? If not, what does it mean?
If, in reality, the null hypothesis is false, this translates into ¡°Not all of the means are the
same¡± or equivalently ¡°At least two of the means are not the same.¡± (Notice that ¡°at least
two¡± applies to 2 or 3 or ¡ or k means.) This last statement in quotes is not equivalent to
saying ¡°No two of the populations have the same mean¡± since this is equivalent to saying,
¡°All of the population means are different.¡±
4.
In a one-way ANOVA, identify the statistic used¡
a. as a measure of variation among the sample means.
MSTr (or SSTr) is a statistic that measures the variation among the sample means for a
one-way ANOVA.
b. as a measure of variation within the samples.
MSE (or SSE) is a statistic that measures the variation within the samples for a one-way
ANOVA.
c. to compare the variation among the sample means to the variation within the samples.
The statistic that compares the variation among the sample means to the variation within
MSTr
the samples is F =
.
MSE
5.
The times required by three workers to perform an assembly-line task were recorded on five
randomly selected occasions. Here are the times, to the nearest minute.
Hank
8
10
9
11
10
Joseph
8
9
9
8
10
Susan
10
9
10
11
9
(Note: y1 = 9.6, y 2 = 8.8 , y 3 = 9.8 , s12 = 1.3, s22 = 0.7, s32 = 0.7, and y = 9.4 .) Construct the oneway ANOVA table for the data. Compute SSTr and SSE using the defining formulas.
(
)
2
(
)
2
(
)
2
SSTr = n1 y1 ? y + n2 y 2 ? y + n3 y3 ? y = 5(9.6 ? 9.4) + 5(8.8 ? 9.4) + 5(9.8 ? 9.4 ) = 2.8
2
2
2
2
2
2
SSE = (n1 ? 1)s1 + ( n2 ? 1) s2 + (n 3 ? 1) s3 = 4(1.3) + 4 (0.7) + 4(0.7) = 10.8
SSTo = SSTr + SSE = 2.8 + 10.8 = 13.6
SSTr 2.8
SSE
10.8
MSTr 1.4
MSTr =
=
= 1.4 ; MSE =
=
= 0.9; F =
=
= 1.5556
k ?1 3 ?1
N ? k 15 ? 3
MSE 0.9
Source
Treatments
Error
Total
6.
df
2
12
14
SS
2.8
10.8
13.6
MS = SS/df
1.4
0.9
p-value
p-value > 0.10
F-statistic
1.5556
Fill in the missing entries of the partially completed one-way ANOVA table.
Source
Treatments
Error
Total
df
_____
20
_____
SS
2.124
_____
_____
MS = SS/df
0.708
_____
F-statistic
0.75
SSTr
SSTr 2.124
? df Tr =
=
=3
df Tr
MSTr 0.708
MSTr
MSTr 0.708
F=
? MSE =
=
= 0.944
MSE
F
0.75
SSE
MSE =
? SSE = MSE ? df E = 0.944(20) = 18.880
df E
SSTo = SSTr + SSE = 2.124 + 18.880 = 21.004
dfTo = dfTr + df E = 3 + 20 = 23
MSTr =
Source
Treatments
Error
Total
df
3
20
23
SS
2.124
18.880
21.004
MS = SS/df
0.708
0.944
F-statistic
0.75
7.
Data on Scholastic Aptitude Test (SAT) scores are published by the College Entrance Examination
Board in National College-Bound Senior. SAT scores for randomly selected students from each of
four high-school rank categories are displayed in the following table.
Top Tenth
528
586
680
718
Second Tenth
514
457
521
370
532
Second Fifth
649
506
556
413
470
Third Fifth
372
440
495
321
424
330
( N o t e : y1 = 628.0, y 2 = 478.8, y 3 = 518.8 , y 4 = 397.0 , s12 = 7522.667, s22 = 4540.700 ,
s32 = 8018.700, s42 = 4614.400 , and y ?? = 494.1.) Construct the one-way ANOVA table for the
data. Compute SSTr and SSE using the defining formulas.
(
)
2
(
)
2
(
)
2
(
SSTr = n1 y1 ? y + n2 y 2 ? y + n3 y3 ? y + n4 y 4 ? y
)
2
= 4(628.0 ? 494.1) 2 + 5(478.8 ? 494.1)2 + 5(518.8 ? 494.1)2 + 6(397.0 ? 494.1)2 = 132,508.2
SSE = (n1 ? 1)s12 + ( n2 ? 1) s22 + (n 3 ? 1) s32 + (n4 ? 1) s42
= 3(7522.667) + 4 (4540.700) + 4 (8018.700) + 5(4614.400) = 95,877.6
SSTo = SSTr + SSE = 132,508.2 + 95,877.6 = 228,385.8
SSTr 132,508.2
SSE
95,877.6
MSTr =
=
= 44,169.40; MSE =
=
= 5,992.35;
k ?1
4 ?1
N?k
20 ? 4
MSTr 44,169.40
F=
=
= 7.37
MSE
5,992.35
Source
Treatments
Error
Total
df
3
16
19
SS
132,508.2
95,877.6
228,385.8
MS = SS/df
44,169.40
5,992.35
F-statistic
7.37
p-value
0.001 < p-value < 0.01
8.
Four brands of flashlight batteries are to be compared by testing each brand in five flashlights.
Twenty flashlights are randomly selected and divided randomly into four groups of five flashlights
each. Then each group of flashlights uses a different brand of battery. The lifetimes of the
batteries, to the nearest hour, are as follows.
Brand A
42
30
39
28
29
Brand B
28
36
31
32
27
Brand C
24
36
28
28
33
Brand D
20
32
38
28
25
Preliminary data analyses indicate that the independent samples come from normal populations with
equal standard deviations. At the 5% significance level, does there appear to be a difference in mean
lifetime among the four brands of batteries?
Let the subscripts 1, 2, 3, and 4 refer to Brand A, Brand B, Brand C, and Brand D, respectively. Each
sample size is 5, and the total number of pieces of data is 20.
Step 0 :
Step 1 :
Check Assumptions
Hypotheses
H0 : ?1 = ?2 = ?3 = ? 4 (The mean lifetimes are equal.)
Ha : Not all of the means are equal.
Step 2 :
Significance Level
Step 3 :
Critical Value and Rejection Region
Step 4 :
Reject the null hypothesis if F ¡Ý 3.24 (P-value ¡Ü 0.05).
Construct the One-way ANOVA Table
T1 = 168 ; T2 = 154 ; T3 = 149 ; T4 = 143
¦Á = 0.05
F¦Á ,( df 1 =k ?1,df 2 =N ? k) = F0.05,( df 1 =4 ?1,df 2 =20 ?4 ) = F0.05,( df 1 =3,df 2 =16) = 3.24
k
T = 614 ;
k
ni
ni
ni
ni
j =1
2
j =1
j =1
ni
¡Æ ¡Æ y ij2 = ¡Æ y12j + ¡Æ y 22j + ¡Æ y 23 j + ¡Æ y 24 j = 19,410
i=1 j =1
ni
SSTo = ¡Æ ¡Æ y ij ?
2
i=1 j =1
2
(T )
N
= 19,410 ?
(614)
20
j =1
2
= 560.2
2
2
2
2
? (T )
T2 )
T3 )
T4 ) ? (T )
(
(
(
1
SSTr = ?
+
+
+
??
n2
n3
n4 ?
N
? n1
? (168) 2 (154) 2 (149)2 (143)2 ? (614) 2
??
=?
+
+
+
= 68.2
5
5
5 ?
20
? 5
SSE = SSTo ? SSTr = 560.2 ? 68.2 = 492.0
Source
Treatments
Error
Total
Step 5 :
Step 6 :
df
3
16
19
SS
68.2
492.0
560.2
MS = SS/df
22.7333
30.75
F-statistic
0.7393
p-value
p-value > 0.10
Decision
Since 0.7393 < 3.24 (p-value > 0.05), fail to reject the null hypothesis.
State conclusion in words
At the ¦Á = 0.05 level of significance, there is not enough evidence to conclude that the
mean lifetimes of the brands of batteries differ.
9.
Manufacturers of golf balls always seem to be claiming that their ball goes the farthest. A writer for
a sports magazine decided to conduct an impartial test. She randomly selected 20 golf
professionals and then randomly assigned four golfers to each of five brands. Each golfer drove
the assigned brand of ball. The driving distances, in yards, are displayed in the following table.
Brand 1
286
276
281
274
Brand 2
279
277
284
288
Brand 3
270
262
277
280
Brand 4
284
271
269
275
Brand 5
281
293
276
292
Preliminary data analyses indicate that the independent samples come from normal populations with
equal standard deviations. Do the data provide sufficient evidence to conclude that a difference
exists in mean weekly earnings among nonsupervisory workers in the five industries? Perform the
required hypothesis test using ¦Á = 0.05. (Note: T1 = 1,117, T2 = 1,128 , T3 = 1,089 , T4 = 1,099,
5
T5 = 1,142 , and
4
¡Æ¡Æ y
i=1 j =1
2
ij
= 1,555,185 .)
Let the subscripts 1, 2, 3, 4, and 5 refer to Brand 1, Brand 2, Brand 3, Brand 4, and Brand 5 respectively.
Each sample size is 4, and the total number of data pieces is 20.
Step 0 :
Step 1 :
Check Assumptions
Hypotheses
H0 : ?1 = ?2 = ?3 = ? 4 = ?5 (The mean driving distances are equal.)
Ha : Not all of the means are equal.
Step 2 :
Significance Level
Step 3 :
Critical Value and Rejection Region
Step 4 :
Reject the null hypothesis if F ¡Ý 3.06 (P-value ¡Ü 0.05).
Construct the One-way ANOVA Table
¦Á = 0.05
F¦Á ,( df 1 =k ?1,df 2 =N ? k) = F0.05,( df 1 =5?1,df 2 = 20? 5) = F0.05,( df 1 =4,df 2 =15) = 3.06
5
T = ¡Æ Ti = 5,575
i=1
5
4
SSTo = ¡Æ ¡Æ y ?
i=1 j =1
2
2
ij
(T )
2
N
= 1,555,185 ?
(5,575)
20
2
= 1,153.75
2
2
2
2
2
? (T )
T2 )
T3 )
T4 )
T5 ) ? (T )
(
(
(
(
1
SSTr = ?
+
+
+
+
??
n2
n3
n4
n5 ?
N
? n1
? (1,117) 2 (1,128)2 (1,089) 2 (1,099) 2 (1,142)2 ? (5,575)2
??
=?
+
+
+
+
= 458.5
4
4
4
4 ?
20
? 4
SSE = SSTo ? SSTr = 1,153.75 ? 485.5 = 695.25
Source
Treatments
Error
Total
Step 5 :
Step 6 :
df
4
15
19
SS
458.50
695.25
1,153.75
MS = SS/df
114.625
46.35
F-statistic
2.4730
p-value
0.05 < p-value < 0.10
Decision
Since 2.4730 < 3.06 (p-value > 0.05), fail to reject the null hypothesis.
State conclusion in words
At the ¦Á = 0.05 level of significance, there is not enough evidence to conclude that the
mean driving distances of the brands of golf balls differ.
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- anova examples stat 314
- solutions tosome exercises from bayesian data analysis
- hypothesis testing for proportions
- cognitive behavioral therapy strategies
- telecommunications network standards and guidelines for
- econ 116 problem set 3 answer key
- practice final exam questions 2 answers
- eliminating the confusion from seismic codes and standards
- nondestructive examination nde technology and codes
- clinical dementia rating questionnaire