SIMPLE LINEAR REGRESSION Determining the Regression Equation
SIMPLE LINEAR REGRESSION
In linear regression, we consider the frequency distribution of one variable (Y) at each of several levels of a second variable (X).
Y is known as the dependent variable. The variable for which you collect data. X is known as the independent variable. The variable for the treatments.
Determining the Regression Equation
One goal of regression is to draw the "best" line through the data points. The best line usually is obtained using means instead of individual observations.
Example Effect of hours of mixing on temperature of wood pulp
Hours of mixing (X) 2 4 6 8 10 12 X=42 X2=364
Temperature of wood pulp (Y) 21 27 29 64 86 92 Y=319 Y2=21,967
XY 42 108 174 512 860 1104 XY=2800 n=6
Effect of hours of mixing on temperature of w ood pulp
Temperature
100 80 60 40 20 0 2
4
6
8
10
12
Hours of m ixing
The equation for any straight line can be written as: Y^ = b0 + b1X
where: bo = Y intercept, and b1 = regression coefficient = slope of the line
The linear model can be written as: Yi = 0 + 1X + i where: ei=residual = Yi - Y^ i
With the data provided, our first goal is to determine the regression equation
Step 1. Solve for b1
b1
=
(X - X)(Y - Y) (X - X)2
=
XY
-
(
X
n
Y)
X2
-
( X )2
n
=
SS Cross Products SS X
=
SSCP SS X
for the data in this example
X = 42
Y = 319 XY = 2,800 X2 = 364 Y2 = 21,967
b1
=
XY - X2
( X Y)
n
- (X)2 n
=
(42x319) 2800 -
6 422 364 - 6
=
567 70
=
8.1
The number calculated for b1, the regression coefficient, indicates that for each unit increase in X (i.e., hours of mixing), Y (i.e., wood pulp temperature) will increase 8.1 units (i.e., degrees).
The regression coefficient can be a positive or negative number.
To complete the regression equation, we need to calculate bo.
b0 = Y - b1X
=
319 6
-
8.1
42 6
=
- 3.533
Temperature (oF)
Therefore, the regression equation is: Y^ i = -3.533 + 8.1X
120
100
80
60
40
20
0
--4
--2
0
2
4
6
8
10
12
--20
--40
--60
Hours of mixing
Assumptions of Regression
1. There is a linear relationship between X and Y
2. The values of X are known constants and presumably are measure without error.
3. For each value of X, Y is independent and normally distributed: Y~N(0, 2).
4.
( ) Sum of deviations from the regression line equals zero: Yi - Y^i = 0.
5. Sum of squares for error are a minimum.
Temperature
Effect of hours of mixing on temperature of wood pulp
120 100
80 60 40 20
0 -20 2
4
6
8
10
12
Hours of mixing
If you square the deviations and sum across all observations, you obtain the definition formulas for the following sums of squares:
( ) Y^ i - Y 2 = Sum Squares Due to Regression ( ) Yi - Y^ i 2 = Sum Squares Due to Deviation from Regression (Residual)
( ) Yi - Y 2 = Sum Squares Total
Testing the hypothesis that a linear relationship between X and Y exists
The hypotheses to test that a linear relationship between X and Y exists are:
Ho: ?1 = 0 HA: ?1 0
These hypotheses can be tested using three different methods: 1. F-test 2. t-test 3. Confidence interval
Method 1. F-test
The ANOVA to test Ho: $1 = 0 can be done using the following sources of variation, degrees of freedom, and sums of squares:
SOV
df
Sum of Square
Due to regression 1
2
XY
-
(
X
n
Y)
X2
-
( X )2
n
SSCP2 =
SS X
Residual
n-2 Determined by subtraction
Total
n-1
Y2
-
( Y )2
n
= SS
Y
Using data from the example:
X = 42
Y = 319 XY = 2,800 X2 = 364
Y2 = 21,967
Step 1. Calculate Total SS =
( ) Y2 -
Y2
3192
= 21,967 -
= 5,006.833
n
6
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- assignment 7 solution university of california berkeley
- math 1330 precalculus university of houston
- properties of variance university of washington
- simple linear regression determining the regression equation
- lecture 4 calculating limits using limit laws
- converting quadratics vertex form to standard form
- graphing functions using transformations
- discrete mathematics problems
- x 2 university of michigan
- 7 2 finding volume using cross sections
Related searches
- simple linear regression test statistic
- simple linear regression hypothesis testing
- simple linear regression null hypothesis
- simple linear regression model calculator
- simple linear regression uses
- simple linear regression model pdf
- simple linear regression practice problems
- simple linear regression least squares
- simple linear regression excel
- simple linear regression example pdf
- simple linear regression example questions
- simple linear regression business example