Backward elimination and stepwise regression

(a) Backward elimination:

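Assume the model with all possible covariates is

$Y = \beta_0 + \beta_1 X_1 + \cdots + \beta_{r-1} X_{r-1} + \epsilon$

Backward elimination procedure:

Step 1:

At the beginning, the original model is set to be

$Y = \beta_0 + \beta_1 X_1 + \cdots + \beta_{r-1} X_{r-1} + \epsilon$

Then the following r-1 tests are carried out: $H_0: \beta_j = 0$, $j = 1, \ldots, r-1$. The lowest partial F-test value $F_l$, corresponding to $H_0: \beta_l = 0$, or the t-test value $t_l$, is compared with the preselected critical values $F_0$ and $t_0$. The two comparisons agree when both critical values come from the same significance level, because the partial F statistic for a single coefficient is the square of its t statistic. One of two possible steps (step 2a and step 2b) can be taken.

Step 2a:

If $F_l < F_0$ or $|t_l| < t_0$, then $X_l$ can be deleted and the new original model is

$Y = \beta_0 + \beta_1 X_1 + \cdots + \beta_{l-1} X_{l-1} + \beta_{l+1} X_{l+1} + \cdots + \beta_{r-1} X_{r-1} + \epsilon$

Go back to step 1.

Step 2b:

If $F_l \ge F_0$ or $|t_l| \ge t_0$, the original model is the model we should choose.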
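
The loop formed by step 1, step 2a, and step 2b can be sketched in Python. This is a minimal illustration, not code from these notes: the function names `solve`, `sse`, and `backward_elimination`, the toy data, and the threshold `f0 = 4.0` are all assumptions made for the example. For simplicity the critical value is passed in as one fixed number, whereas in practice it depends on the residual degrees of freedom and would be looked up again after each deletion. The partial F-value of a covariate is computed as the increase in residual sum of squares when that covariate is dropped, divided by the residual mean square of the current model.

```python
def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(M[i][c]))
        M[c], M[p] = M[p], M[c]
        for i in range(c + 1, n):
            f = M[i][c] / M[c][c]
            for j in range(c, n + 1):
                M[i][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def sse(X, y, cols):
    """Residual sum of squares of the least-squares fit of y on an intercept plus columns `cols`."""
    rows = [[1.0] + [r[c] for c in cols] for r in X]
    p = len(cols) + 1
    xtx = [[sum(r[a] * r[b] for r in rows) for b in range(p)] for a in range(p)]
    xty = [sum(r[a] * yi for r, yi in zip(rows, y)) for a in range(p)]
    beta = solve(xtx, xty)
    return sum((yi - sum(bj * rj for bj, rj in zip(beta, r))) ** 2 for r, yi in zip(rows, y))


def backward_elimination(X, y, f0):
    """Return the column indices kept; f0 is a fixed critical value (a simplifying assumption)."""
    cols = list(range(len(X[0])))
    n = len(y)
    while cols:
        full = sse(X, y, cols)
        df = n - len(cols) - 1  # residual degrees of freedom of the current model
        # Step 1: partial F-value of every covariate still in the model.
        partial_f = {
            c: (sse(X, y, [k for k in cols if k != c]) - full) / (full / df)
            for c in cols
        }
        worst = min(partial_f, key=partial_f.get)
        if partial_f[worst] >= f0:
            break               # step 2b: keep the current model
        cols.remove(worst)      # step 2a: delete and go back to step 1
    return cols


# Toy data (hypothetical): y depends on the first column only.
X = [[1, 5], [2, 3], [3, 8], [4, 1], [5, 9], [6, 2], [7, 7], [8, 4]]
y = [3.1, 4.9, 6.9, 9.1, 11.1, 12.9, 14.9, 17.1]
print(backward_elimination(X, y, f0=4.0))  # [0]
```

On this toy data the second covariate is unrelated to the response, so it is deleted in the first pass and the loop then stops with only the first covariate left.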

Example (continued):

Suppose the preselected significance level is [pic]. Thus, [pic].

Step 1:

The original model is


[pic] corresponding to [pic] is the smallest partial F value.

Step 2a:


Thus, [pic] can be deleted. Go back to step 1.

Step 1:

The new original model is


[pic] corresponding to [pic] is the smallest partial F value.

Step 2a:


Thus, [pic] can be deleted. Go back to step 1.

Step 1:

The new original model is


[pic] corresponding to [pic] is the smallest partial F value.

Step 2b:




is the selected model.

(b) Stepwise regression:

The stepwise regression procedure uses a statistical quantity, the partial correlation, to decide which covariate to add next. We introduce the partial correlation first.

Partial correlation:

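Assume the model is

$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \epsilon$

The partial correlation of $Y$ and $X_2$ with $X_1$ held fixed, denoted by

$r_{Y X_2 \cdot X_1}$

can be obtained as follows:

1. Fit the model

$Y = \beta_0 + \beta_1 X_1 + \epsilon$

and obtain the residuals

$e^{Y}_1, \ldots, e^{Y}_n$

Also, fit the model

$X_2 = \theta_0 + \theta_1 X_1 + \delta$

and obtain the residuals

$e^{X_2}_1, \ldots, e^{X_2}_n$

2. $r_{Y X_2 \cdot X_1}$ is the sample correlation coefficient between $e^{Y}_1, \ldots, e^{Y}_n$ and $e^{X_2}_1, \ldots, e^{X_2}_n$.

When more than one covariate is held fixed, both regressions in step 1 include all of them.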
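
The residual-based recipe for a partial correlation can be sketched in Python for the simplest case, where a single covariate is held fixed. This is an illustration under stated assumptions: the function names `residuals`, `corr`, and `partial_corr` and the toy data are hypothetical, and the two regressions are simple straight-line fits with an intercept.

```python
import math


def residuals(z, x):
    """Residuals from the least-squares fit of z on an intercept and x."""
    n = len(z)
    zbar, xbar = sum(z) / n, sum(x) / n
    sxz = sum((xi - xbar) * (zi - zbar) for xi, zi in zip(x, z))
    sxx = sum((xi - xbar) ** 2 for xi in x)
    slope = sxz / sxx
    return [(zi - zbar) - slope * (xi - xbar) for xi, zi in zip(x, z)]


def corr(a, b):
    """Sample correlation coefficient of two equal-length sequences."""
    n = len(a)
    abar, bbar = sum(a) / n, sum(b) / n
    sab = sum((ai - abar) * (bi - bbar) for ai, bi in zip(a, b))
    saa = sum((ai - abar) ** 2 for ai in a)
    sbb = sum((bi - bbar) ** 2 for bi in b)
    return sab / math.sqrt(saa * sbb)


def partial_corr(y, x2, x1):
    """Partial correlation of y and x2 with x1 held fixed."""
    # Remove the linear effect of x1 from both y and x2, then correlate what is left.
    return corr(residuals(y, x1), residuals(x2, x1))


# Toy data (hypothetical values, for illustration only).
x1 = [1, 2, 3, 4, 5, 6]
x2 = [2, 1, 4, 3, 6, 5]
y = [1.0, 3.0, 2.0, 5.0, 4.0, 7.0]
print(partial_corr(y, x2, x1))
```

For one conditioning covariate the result can be checked against the usual closed form written in terms of the three pairwise correlations, (r_y2 - r_y1 r_21) / sqrt((1 - r_y1^2)(1 - r_21^2)).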

Stepwise regression procedure:

The original model is $Y = \beta_0 + \epsilon$. There are r-1 covariates, $X_1, \ldots, X_{r-1}$.

Step 1:

Select the variable most correlated with Y, say $X_i$, based on the correlation coefficient. Fit the model

$Y = \beta_0 + \beta_i X_i + \epsilon$

and check if $X_i$ is significant. If not, then

$Y = \beta_0 + \epsilon$

is the best model. Otherwise, the new original model is

$Y = \beta_0 + \beta_i X_i + \epsilon$

and go to step 2.

Step 2:

Examine the partial correlation $r_{Y X_j \cdot X_i}$ of Y and each remaining covariate $X_j$, $j \neq i$, given $X_i$. Find the covariate $X_k$ with the largest value of partial correlation $r_{Y X_k \cdot X_i}$. Then, fit

$Y = \beta_0 + \beta_i X_i + \beta_k X_k + \epsilon$

and obtain the partial F-value $F_i$ corresponding to $X_i$ and

$F_k$ corresponding to $X_k$. Go to step 3.

Step 3:

The smallest partial F-value $F_l$ (one of $F_i$ and $F_k$) is compared with the preselected critical value $F_0$. There are two possibilities:

If $F_l < F_0$, then delete the covariate corresponding to $F_l$ and go back to step 2. Note that if $F_l = F_i$, so that $X_i$ is deleted and $X_k$ remains, then examine the partial correlation

$r_{Y X_j \cdot X_k}, \quad j \neq k$

If $F_l \ge F_0$, then

$Y = \beta_0 + \beta_i X_i + \beta_k X_k + \epsilon$

is the new original model. Then, go back to step 2, but now examine the partial correlation $r_{Y X_j \cdot X_i X_k}$, $j \neq i, k$.

The procedure stops automatically when no variable in the new original model can be removed and the next best candidate cannot be retained in the new original model. The new original model is then our selected model.
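
The stepwise procedure can be sketched in Python as a loop that alternates between adding the candidate with the largest partial correlation and checking the partial F-values of the enlarged model. This is a minimal sketch, not a definitive implementation: the function names, the toy data, the fixed threshold `f0`, the use of the absolute value of the partial correlation, and the `max_steps` guard against cycling are all assumptions made for the example. It also assumes no exact collinearity and no perfect fit.

```python
import math


def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda i: abs(M[i][c]))
        M[c], M[p] = M[p], M[c]
        for i in range(c + 1, n):
            f = M[i][c] / M[c][c]
            for j in range(c, n + 1):
                M[i][j] -= f * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def resid(X, z, cols):
    """Residuals of the least-squares fit of z on an intercept plus columns `cols` of X."""
    rows = [[1.0] + [r[c] for c in cols] for r in X]
    p = len(cols) + 1
    xtx = [[sum(r[a] * r[b] for r in rows) for b in range(p)] for a in range(p)]
    xtz = [sum(r[a] * zi for r, zi in zip(rows, z)) for a in range(p)]
    beta = solve(xtx, xtz)
    return [zi - sum(bj * rj for bj, rj in zip(beta, r)) for r, zi in zip(rows, z)]


def corr(a, b):
    """Correlation of two residual vectors (each already has mean zero)."""
    return sum(ai * bi for ai, bi in zip(a, b)) / math.sqrt(
        sum(ai * ai for ai in a) * sum(bi * bi for bi in b))


def partial_f(X, y, cols):
    """Partial F-value of each covariate in the model with columns `cols`."""
    full = sum(e * e for e in resid(X, y, cols))
    df = len(y) - len(cols) - 1
    out = {}
    for c in cols:
        reduced = sum(e * e for e in resid(X, y, [k for k in cols if k != c]))
        out[c] = (reduced - full) / (full / df)
    return out


def stepwise(X, y, f0, max_steps=50):
    """Return the column indices selected; f0 is a fixed critical value (assumption)."""
    k = len(X[0])
    model = []
    for _ in range(max_steps):
        candidates = [j for j in range(k) if j not in model]
        if not candidates:
            break
        # Step 2: candidate with the largest partial correlation given the current model.
        ry = resid(X, y, model)
        best = max(candidates,
                   key=lambda j: abs(corr(ry, resid(X, [r[j] for r in X], model))))
        trial = model + [best]
        # Step 3: compare the smallest partial F-value with the threshold.
        f_values = partial_f(X, y, trial)
        weakest = min(f_values, key=f_values.get)
        if f_values[weakest] >= f0:
            model = trial                       # every covariate is retained
        elif weakest == best:
            break                               # the candidate cannot be retained: stop
        else:
            model = [c for c in trial if c != weakest]  # drop an older covariate
    return model


# Toy orthogonal design (hypothetical): y depends on columns 0 and 1 only.
X = [[-1, -1, -1], [-1, -1, 1], [-1, 1, -1], [-1, 1, 1],
     [1, -1, -1], [1, -1, 1], [1, 1, -1], [1, 1, 1]]
y = [0.9, 1.1, 3.1, 2.9, 7.1, 6.9, 8.9, 9.1]
print(stepwise(X, y, f0=4.0))  # [0, 1]
```

With an empty model the partial correlation reduces to the ordinary correlation, so the first pass of the loop plays the role of step 1. On the toy data the first two columns enter in turn and the third is rejected, which ends the procedure.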

