Second order derivatives for network pruning: Optimal Brain Surgeon

weights. The superiority of the method described here - Optimal Brain Surgeon - lies in great pan to the fact that it makes no restrictive assumptions about the form of the network's Hessian, and thereby eliminates the correct weights. Moreover, unlike other methods, OBS does not demand (typically slow) ................
................