A New Conjugate Gradient Method with Smoothing L-1/2 Regularization Based on a Modified Secant Equation for Training Neural Networks

Li, Wenyu; Liu, Yan; Yang, Jie; Wu, Wei<sup>*</sup>

doi:10.1007/s11063-017-9737-9

Abstract

This paper proposes a new conjugate gradient method with smoothing L-1/2 regularization, based on a modified secant equation, for training neural networks. A descent search direction is generated by selecting an adaptive learning rate based on the strong Wolfe conditions. Two adaptive parameters are introduced so that the new training method possesses both the quasi-Newton property and the sufficient descent property. In numerical experiments on five benchmark classification problems from the UCI repository, the new training algorithm shows roughly the same or even better learning capacity than the other conjugate gradient training algorithms, and significantly better generalization capacity and network sparsity. Under mild assumptions, a global convergence result for the proposed training method is also proved.
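For readers unfamiliar with the two conditions named in the abstract, the following is a minimal sketch of their standard textbook forms, not the paper's algorithm. The function names `satisfies_strong_wolfe` and `is_sufficient_descent`, the constants `c1`, `c2`, and `c`, and the quadratic test objective are all illustrative assumptions; the paper's smoothing regularizer and modified secant equation are not reproduced here.

```python
import numpy as np


def satisfies_strong_wolfe(f, grad, x, d, alpha, c1=1e-4, c2=0.1):
    """Check the standard strong Wolfe conditions for step size alpha.

    c1 and c2 (0 < c1 < c2 < 1) are assumed, commonly used values;
    they are not taken from the paper.
    """
    g0_d = float(np.dot(grad(x), d))  # directional derivative at x along d
    x_new = x + alpha * d
    # Sufficient decrease (Armijo) condition.
    sufficient_decrease = f(x_new) <= f(x) + c1 * alpha * g0_d
    # Strong curvature condition on the new directional derivative.
    curvature = abs(float(np.dot(grad(x_new), d))) <= c2 * abs(g0_d)
    return bool(sufficient_decrease and curvature)


def is_sufficient_descent(g, d, c=1e-8):
    """Check g^T d <= -c * ||g||^2 for an assumed constant c > 0."""
    return float(np.dot(g, d)) <= -c * float(np.dot(g, g))


# Hypothetical test objective: f(x) = 0.5 * ||x||^2, whose gradient is x.
def quadratic(x):
    return 0.5 * float(np.dot(x, x))


def quadratic_grad(x):
    return x


x0 = np.array([1.0, 1.0])
d0 = -quadratic_grad(x0)  # steepest-descent direction
print(satisfies_strong_wolfe(quadratic, quadratic_grad, x0, d0, alpha=1.0))
print(satisfies_strong_wolfe(quadratic, quadratic_grad, x0, d0, alpha=0.01))
print(is_sufficient_descent(quadratic_grad(x0), d0))
```

On this quadratic, the exact step `alpha=1.0` passes both conditions, while a very small step such as `alpha=0.01` decreases the objective but fails the curvature condition, which is what lets a strong Wolfe line search reject overly short steps.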

Publication date: 2018-10
Affiliations: Beihua University; Dalian Polytechnic University; Dalian University of Technology


