Multiple optimal learning factors for the multi-layer perceptron

Malalur Sanjeev S; Manry Michael T<sup>*</sup>; Jesudhas Praveen

doi:10.1016/j.neucom.2014.08.043

摘要

A batch training algorithm is developed for a fully connected multi-layer perceptron, with a single hidden layer, which uses two-stages per iteration. In the first stage, Newton's method is used to find a vector of optimal learning factors (OLFs), one for each hidden unit, which is used to update the input weights. Linear equations are solved for output weights in the second stage. Elements of the new method's Hessian matrix are shown to be weighted sums of elements from the Hessian of the whole network. The effects of linearly dependent inputs and hidden units on training are analyzed and an improved version of the batch training algorithm is developed. In several examples, the improved method performs better than first order training methods like backpropagation and scaled conjugate gradient, with minimal computational overhead and performs almost as well as Levenberg-Marquardt, a second order training method, with several orders of magnitude fewer multiplications.

出版日期2015-2-3

全文

访问全文

收藏分享被引(9) 浏览

更新时间：2021-04-14 16:40

Multiple optimal learning factors for the multi-layer perceptron

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友